Skip to main content

Research and Design of the Clustering System Based on the Column-Store

  • Conference paper
  • First Online:
Proceedings of the 2012 International Conference on Cybernetics and Informatics

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 163))

  • 154 Accesses

Abstract

With the rapid development of the network and the increase of the information on the web, rapid access to the database and data mining become very important. Column-store has the advantage of quick read speed, saving the disk I/O, and can be read by uncompressed, which is helpful to acquire knowledge in the massive data. So based on the traditional data mining module, introduce the column store technology. Information base and knowledge base all adopt column store to store and access, and also provide the access interface between the store module and its upper layer module. Then, compute the Minkowski distance and use k-medoids methods in the data clustering module. On the base of the access advantage of the column store and k-medoids methods, this system can improve the speed and the quality of the clustering. The innovation is the application of column store in the clustering system, and provide completed data access interface, using k-medoids methods can detect clusters of arbitrary shape. Computing the Minkowski distance can improve the efficiency of the dissimilarity of objects and clustering speed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 429.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 549.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Mike Stonebraker, Daniel J. Abadi, Adam Batkin, Xuedong Chen (2005) Mitch Cherniack, Miguel Ferreira, Edmond Lau, Amerson Lin, Sam Madden, Elizabeth O’Neil, Pat O’Neil, Alex Rasin, Nga Tran, Stan Zdonik. C-Store: A Column-Oriented DBMS[C]. In VLDB, Trondheim, 21: 57–63

    Google Scholar 

  2. CoPeland GP, Koshafian SF (1985) A decomposition storage model. In: Proceedings of the ACM SIGMOD international conference on management of data 5:268–279

    Google Scholar 

  3. http://wenku.baidu.com/view/ebc8a9c04028915f804dc2f1.html

  4. Harry KT Wong, Hsiu-Fen Liu (1985) Frank Olken, Doron Rotem, Linda Wong. Bit transPosed files. In VLDB

    Google Scholar 

  5. Anastassia Ailamaki (2001) A storage model to bridge the proeessor/memory speed gap. In HPTS

    Google Scholar 

  6. Ravishankar Ramamurthy, David J Dewitt, Qi Su (2002) A case for fractured mirrors. In VLDB

    Google Scholar 

  7. Han J, Kamber M (2010) Data mining concepts and techniques. Morgan Kaufmann, San Francisco, pp 338–353

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lijun Shen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer Science+Business Media New York

About this paper

Cite this paper

Shen, L., Zhang, T., Song, J., Chen, P., Wang, J. (2014). Research and Design of the Clustering System Based on the Column-Store. In: Zhong, S. (eds) Proceedings of the 2012 International Conference on Cybernetics and Informatics. Lecture Notes in Electrical Engineering, vol 163. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-3872-4_223

Download citation

  • DOI: https://doi.org/10.1007/978-1-4614-3872-4_223

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4614-3871-7

  • Online ISBN: 978-1-4614-3872-4

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics