Skip to main content

Cloud-Based Massive Electricity Data Mining and Consumption Pattern Discovery

  • Conference paper
Web Information Systems Engineering – WISE 2013 Workshops (WISE 2013)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8182))

Included in the following conference series:

  • 1496 Accesses

Abstract

With the development of the power systems in China, there is large volume of basic electricity consumption data accumulated. Mining these data to discover possible consumption patterns and group the users in a more fine-grained way can help the State Grid Corporation to understand users’ personalized and differentiated requirements. In this work, an algorithm called TMeans is proposed to mine the electricity consumption patterns. TMeans improves the classical K-Means algorithm by presenting a set of static and dynamical rules which can dynamically adjust the clustering process according to the statistical features of the clusters, making the process more flexible and practical. Then a MapReduce-based implementation of TMeans is proposed to make itself capable of handling large volume of data efficiently. Through experiment, we first demonstrate that the consumption patterns can be effectively discovered and can be refined to very small granularity through TMeans, and then we show that the MapReduce-based implementation of TMeans can efficiently speed up the clustering process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Smart Grid (July 20, 2013), http://en.wikipedia.org/wiki/Smart_grid

  2. Firth, S., Lomas, K., Wright, A., et al.: Identifying trends in the use of domestic appliances from household electricity consumption measurements. Energy and Buildings 40(5), 926–936 (2008)

    Article  Google Scholar 

  3. Ball, G.H., Hall, D.J.: ISODATA, a novel method of data analysis and pattern classification. Stanford Research Inst., Menlo Park (1965)

    Google Scholar 

  4. Arthur, D., Vassilvitskii, S.: k-means++: The advantages of careful seeding. In: Proceedings of the Eighteenth ACM-SIAM Symposium on Discrete Algorithms, pp. 1027–1035. Society for Industrial and Applied Mathematics (2007)

    Google Scholar 

  5. Keogh, E., Pazzani, M.: An enhanced representation of time series which allows fast and accurate classification, clustering and relevance feedback. In: Proceedings of the 3rd International Conference of Knowledge Discovery and Data Mining, pp. 239–241. The Association for the Advancement of Artificial Intelligence, New York (1998)

    Google Scholar 

  6. Zhao, G., Qu, G.: Analysis and implementation of CLARA algorithm on clustering. Journal of Shandong University of Technology: Sci. & Tech. (2), 45–48 (2006) (in Chinese)

    Google Scholar 

  7. Li, B., Zhao, H., Lv, Z.: Parallel ISODATA Clustering of Remote Sensing Images Based on MapReduce. In: 2010 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery, pp. 380–383 (2010)

    Google Scholar 

  8. Zhang, S., Liu, J., Zhao, B., et al.: Cloud Computing-Based Analysis on Residential Electricity Consumption Behavior. Power System Technology 37(6), 1542–1546 (2013) (in Chinese)

    Google Scholar 

  9. Borthakur, D.: The hadoop distributed file system: Architecture and design. Hadoop Project Website (2013), http://hadoop.apache.org/common/docs/

  10. Beckel, C., Sadamori, L., Santini, S.: Towards automatic classification of private households using electricity consumption data. In: Proceedings of the Fourth ACM Workshop on Embedded Sensing Systems for Energy-Efficiency in Buildings, pp. 169–176. ACM (2012)

    Google Scholar 

  11. Chicco, G., Ilie, I.S.: Support vector clustering of electrical load pattern data. IEEE Transactions on Power Systems 24(3), 1619–1628 (2009)

    Article  Google Scholar 

  12. De Silva, D., Yu, X., Alahakoon, D., et al.: A data mining framework for electricity consumption analysis from meter data. IEEE Transactions on Industrial Informatics 7(3), 399–407 (2011)

    Article  Google Scholar 

  13. Zhu, Z., Gu, Z., Wu, J., et al.: Application of cloud computing in electric power system data recovery. Power System Technology 36(9), 44–50 (2012) (in Chinese)

    Google Scholar 

  14. Mu, L., Cui, L., An, N.: Research and practice of cloud computing center for power system. Power System Technology 35(6), 171–175 (2011) (In Chinese)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ming, C., Maoyong, C., Yan, W. (2014). Cloud-Based Massive Electricity Data Mining and Consumption Pattern Discovery. In: Huang, Z., Liu, C., He, J., Huang, G. (eds) Web Information Systems Engineering – WISE 2013 Workshops. WISE 2013. Lecture Notes in Computer Science, vol 8182. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54370-8_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-54370-8_18

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-54369-2

  • Online ISBN: 978-3-642-54370-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics