Abstract
In this paper, we focus on the problem of unsupervised clustering of a data-set. We introduce the traditional K-Means (K-means) cluster analysis and fuzzy C-means (FCM) cluster analysis of the principles and algorithms process at first, then a novel method to initialize the cluster centers is proposed. The idea is that the cluster centers’ distribution should be as evenly as possible within the input field. A “Two-step method” is used in our evolutionary models, with evolutionary algorithms to get the initialized centers, and traditional methods to get the final results. Experiment results show our initialization method can speed up the convergence, and in some cases, make the algorithm performs better.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Gose, E., Johnsonbaugh, R., Jost, S.: Pattern Recognition and Image Analysis. Prentice-Hall, Englewood Cliffs (1996)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data Clustering: a Review. ACM Computing Surveys 31(3), 264–323 (1999)
Boley, D.L.: Principal Direction Divisive Partitioning. Technical Report TR-97-056, Dept. of Computer Science, University of Minnesota, Minneapolis (1997)
Boley, D.L.: Principal Direction Divisive Partitioning. Data Mining and Knowledge Discovery 2(4), 325–344 (1998)
Boley, D.L., Gini, M., Gross, R., Han, S., Hastings, K., Karypis, G., Kumar, V., Mobasher, B., Moore, J.: Partitioning-Based Clustering for Web Document Categorization. Decision Support Systems 27(3), 329–341 (2000)
Boley, D.L., Gini, M., Gross, R., Han, S., Hastings, K., Karypis, G., Kumar, V., Mobasher, B., Moore, J.: Document Categorization and Query Generation on the World Wide Web Using WebACE. AI Review 13(5-6), 365–391 (2000)
Jim, Z.C.L., Tsung, J.H.: Fast global k-means clustering using cluster membership and inequality. Elsevier Science Inc., 655 Avenue of the Americas New York (2010)
Forgy, E.: Cluster Analysis of Multivariate Data: Efficiency versus Interpretability of Classification. Biometrics, 768–780 (1965)
Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice-Hall advance reference series. Prentice-Hall, Upper Saddle River (1988)
Selim, S.Z., Ismail, M.A.: K-means-type Algorithms: a Generalized Convergence Theorem and Characterization of Local Optimality. IEEE Trans. on Pattern Analysis and Machine Intelligence 6(1), 81–86 (1984)
Liu, X.Y., Liu, X., Wang, M.: Fuzzy Clustering Method and Application. Computer Knowledge and Technology (2011)
Ling, W.: Intelligent Optimization Algorithm and its Application. Tsing hua University Press, Beijing (2001)
Ling, W., Bo, L.: Particle Swarm Optimization and Scheduling. Tsing hua University Press, Beijing (2008)
Krishna, K., Narasimha Murty, M.: Genetic K-means Algorithm. IEEE Trans. Syst. Man Cybern B Cybern 29(3), 433–439 (1999)
Niknama, T., Amirib, B.: An Efficient Hybrid Approach Based on PSO, ACO and k-means for cluster analysis. Applied Soft Computing 10(1), 183–197 (2010)
Ye, J.X., Lin, Q.: Fuzzy K-means Document Clustering Analysis Based on PSO Algorithm. Computer Engineering and Design (2009)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liang, X., Ren, S., Yang, L. (2011). Succinct Initialization Methods for Clustering Algorithms. In: Huang, DS., Gan, Y., Bevilacqua, V., Figueroa, J.C. (eds) Advanced Intelligent Computing. ICIC 2011. Lecture Notes in Computer Science, vol 6838. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24728-6_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-24728-6_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24727-9
Online ISBN: 978-3-642-24728-6
eBook Packages: Computer ScienceComputer Science (R0)