Abstract
An overview of a variety of methods of agglomerative hierarchical clustering as well as non-hierarchical clustering for semi-supervised classification is given. Two different formulations for semi-supervised classification are introduced: one is with pairwise constraints, while the other does not use constraints. Two methods of the mixture of densities and fuzzy c-means are contrasted and their theoretical properties are discussed. A number of agglomerative hierarchical algorithms are then discussed. It will be shown that the single linkage has different characteristics when compared with the complete linkage and average linkage. Moreover the centroid method and the Ward method are discussed. It will also be shown that the must-link constraints and the cannot-link constraints are handled in different ways in these methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)
Basu, S., Bilenko, M., Mooney, R.J.: A Probabilistic Framework for Semi-Supervised Clustering. In: Proc. of the Tenth ACM SIGKDD (KDD 2004), pp. 59–68 (2004)
Basu, S., Banerjee, A., Mooney, R.J.: Active Semi-Supervision for Pairwise Constrained Clustering. In: Proc. of the SIAM International Conference on Data Mining (SDM 2004), pp. 333–344 (2004)
Basu, S., Davidson, I., Wagstaff, K.L.: Constrained Clustering. CRC Press, Boca Raton (2009)
Chapelle, O., Schölkopf, B., Zien, A. (eds.): Semi-Supervised Learning. MIT Press, Cambridge (2006)
Davidson, I., Ravi, S.S.: Agglomerative Hierarchical Clustering with Constraints: Theoretical and Empirical Results. In: Jorge, A.M., Torgo, L., Brazdil, P.B., Camacho, R., Gama, J. (eds.) PKDD 2005. LNCS (LNAI), vol. 3721, pp. 59–70. Springer, Heidelberg (2005)
Davidson, I., Ravi, S.S.: Using instance-level constraints in agglomerative hierarchical clustering: theoretical and empirical results. Data Min., Knowl., Disc. 18, 257–282 (2009)
Endo, Y., Haruyama, H., Okubo, T.: On some hierarchical clustering algorithms using kernel functions. In: Proc. of FUZZ-IEEE 2004, CD-ROM Proc., Budapest, Hungary, July 25-29, pp. 1–6 (2004)
Everitt, B.S.: Cluster Analysis, 3rd edn. Arnold, London (1993)
Klein, D., Kamvar, S.D., Manning, C.: From Instance-level Constraints to Space-level Constraints: Making the Most of Prior Knowledge in Data Clustering. In: Proc. of the Intern. Conf. on Machine Learning, Sydney, Australia, pp. 307–314 (2002)
Kulis, B., Basu, S., Dhillon, I., Mooney, R.: Semi-supervised graph clustering: a kernel approach. Mach. Learn. 74, 1–22 (2009)
Ichihashi, H., Honda, K., Tani, N.: Gaussian mixture PDF approximation and fuzzy c-means clustering with entropy regularization. In: Proc. of Fourth Asian Fuzzy Systems Symposium, vol. 1, pp. 217–221 (2000)
Ichihashi, H., Miyagishi, K., Honda, K.: Fuzzy c-means clustering with regularization by K-L information. In: Proc. of 10th IEEE International Conference on Fuzzy Systems, vol. 2, pp. 924–927 (2001)
McLachlan, G.J., Krishnan, T.: The EM algorithms and Extensions. Wiley, New York (1997)
Miyamoto, S.: Fuzzy Sets in Information Retrieval and Cluster Analysis. Kluwer, Dordrecht (1990)
Miyamoto, S., Ichihashi, H., Honda, K.: Algorithms for Fuzzy Clustering. Springer (2008)
Miyamoto, S., Terami, A.: Semi-Supervised Agglomerative Hierarchical Clustering Algorithms with Pairwise Constraints. In: Proc. of WCCI 2010 IEEE World Congress on Computational Intelligence, CCIB, Barcelona, Spain, July, 18-23, pp. 2796–2801 (2010)
Miyamoto, S., Terami, A.: Constrained Agglomerative Hierarchical Clustering Algorithms with Penalties. In: Proc. of 2011 IEEE International Conference on Fuzzy Systems, Taipei, Taiwan, June 27-30, pp. 422–427 (2011)
Miyamoto, S., Terami, A.: Inductive vs. Transductive Clustering Using Kernel Functions and Pairwise Constraints. In: Proc. of 11th Intern. Conf. on Intelligent Systems Design and Applications (ISDA 2011), Cordoba, Spain, November 22-24, pp. 1258–1264 (2011)
Schölkopf, B., Smola, A.J.: Learning with Kernels. MIT Press, Cambridge (2002)
Shental, N., Bar-Hillel, A., Hertz, T., Weinshall, D.: Computing Gaussian Mixture Models with EM using Equivalence Constraints. In: Thrun, S., Saul, L.K., Schölkopf, B. (eds.) Advances In Neural Information Processing Systems 16, pp. 465–472 (2004)
Wagstaff, K., Cardie, C., Rogers, S., Schroedl, S.: Constrained K-means Clustering with Background Knowledge. In: Proc. of the 9th ICML, pp. 577–584 (2001)
Wang, N., Li, X., Luo, X.: Semi-supervised Kernel-based Fuzzy c-Means with Pairwise Constraints. In: Proc. of WCCI 2008, pp.1099-1103 (2008)
Zhu, X., Goldberg, A.B.: Introduction to Semi-Supervised Learning. Morgan and Claypool (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Miyamoto, S. (2012). An Overview of Hierarchical and Non-hierarchical Algorithms of Clustering for Semi-supervised Classification. In: Torra, V., Narukawa, Y., López, B., Villaret, M. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2012. Lecture Notes in Computer Science(), vol 7647. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34620-0_1
Download citation
DOI: https://doi.org/10.1007/978-3-642-34620-0_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34619-4
Online ISBN: 978-3-642-34620-0
eBook Packages: Computer ScienceComputer Science (R0)