Abstract
Spectral clustering has been applied in various applications. But there still exist some important issues to be resolved, among which the two major ones are to (1) specify the scale parameter in calculating the similarity between data objects, and (2) select propoer eigenvectors to reduce data dimensionality. Though these topics have been studied extensively, the existing methods cannot work well in some complicated scenarios, which limits the wide deployment of the spectral clustering method. In this work, we revisit the above two problems and propose three contributions to the field: 1) a unified framework is designed to study the impact of the scale parameter on similarity between data objects. This framework can easily accommodate various state of art spectral clustering methods in determining the scale parameter; 2) a novel approach based on local connectivity analysis is proposed to specify the scale parameter; 3) propose a new method for eigenvector selection. Compared with existing techniques, the proposed approach has a rigorous theoretical basis and is efficient from practical perspective. Experimental results show the efficacy of our approach to clustering data of different scenarios.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bach, F.R., Jordan, M.I.: Learning spectral clustering. In: Advances in Neural Information Processing Systems, vol. 16. MIT Press, Cambridge (2003)
Chung, F.R.K.: Spectral Graph Theory. CBMS Regional Conference Series in Mathematics, vol. 92. American Mathematical Society, Providence (February 1997)
Dhillon, I.S., Guan, Y., Kulis, B.: Kernel k-means: spectral clustering and normalized cuts. In: KDD 2004: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 551–556. ACM, New York (2004)
Filippone, M., Camastra, F., Masulli, F., Rovetta, S.: A survey of kernel and spectral methods for clustering. Pattern Recogn. 41(1), 176–190 (2008)
Higham, D.J., Kalna, G., Kibble, M.: Spectral clustering and its use in bioinformatics. J. Comput. Appl. Math. 204(1), 25–37 (2007)
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323 (1999)
Jain, A.K.: Data Clustering: 50 Years Beyond K-means. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008, Part I. LNCS (LNAI), vol. 5211, pp. 3–4. Springer, Heidelberg (2008)
Kamvar, S.D., Klein, D., Manning, C.D.: Spectral learning. In: IJCAI 2003, pp. 561–566 (2003)
Kleinberg, J.: An impossibility theorem for clustering. In: Becker, S., Thrun, S., Obermayer, K. (eds.) Neural Information Processing Systems, vol. 14, pp. 446–453. MIT Press, Cambridge (2002)
Kurucz, M., Benczúr, A.A., Csalogány, K., Lukács, L.: Spectral clustering in social networks, pp. 1–20 (2009)
Luxburg, U.: A tutorial on spectral clustering. Statistics and Computing 17(4), 395–416 (2007)
Luxburg, U.V., Bousquet, O., Belkin, M.: Limits of spectral clustering. In: Advances in Neural Information Processing Systems. MIT Press, Cambridge (2005)
Meila, M., Shi, J.: A random walks view of spectral segmentation (2001)
Meila, M., Shi, J.: Learning segmentation by random walks. In: NIPS, pp. 873–879 (2000)
Ng, A.Y., Jordan, M.I., Weiss, Y.: On spectral clustering: Analysis and an algorithm. In: Advances in Neural Information Processing Systems, vol. 14 (2001)
Pavel, B.: Survey of clustering data mining techniques. Technical report, Accrue Software, San Jose, CA (2002), http://www.accrue.com/products/rp_cluster_review.pdf
Pentney, W., Meila, M.: Spectral clustering of biological sequence data. In: AAAI 2005: Proceedings of the 20th National Conference on Artificial Intelligence, pp. 845–850. AAAI Press, Menlo Park (2005)
Qin, G., Gao, L.: Spectral clustering for detecting protein complexes in protein-protein interaction (ppi) networks. Mathematical and Computer Modelling 52(11-12), 2066–2074 (2010); The BIC-TA 2009 Special Issue, International Conference on Bio-Inspired Computing: Theory and Applications
Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) (2000)
Tolliver, D.A., Miller, G.L.: Graph partitioning by spectral rounding: Applications in image segmentation and clustering. In: CVPR 2006: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 1053–1060. IEEE Computer Society Press, Washington, DC, USA (2006)
Tung, F., Wong, A., Clausi, D.A.: Enabling scalable spectral clustering for image segmentation. Pattern Recogn. 43(12), 4069–4076 (2010)
Verma, D., Meila, M.: A comparison of spectral clustering algorithms. Technical report, Department of CSE University of Washington Seattle, WA 98195-2350 (2005)
von Luxburg, U., Belkin, M., Bousquet, O.: Consistency of spectral clustering (April 2008)
White, S., Smyth, P.: A spectral clustering approach to finding communities in graphs (2005)
Xiang, T., Gong, S.: Spectral clustering with eigenvector selection. Pattern Recogn. 41(3), 1012–1029 (2008)
Xu, K.S., Kliger, M., Chen, Y., Woolf, P.J., Hero III, A.O.: Revealing social networks of spammers through spectral clustering. In: ICC 2009: Proceedings of the IEEE International Conference on Communications, Piscataway, NJ, USA, 2009, pp. 735–740. IEEE Press, Los Alamitos (2009)
Yu, S.X., Shi, J.: Multiclass spectral clustering. In: ICCV 2003: Proceedings of the Ninth IEEE International Conference on Computer Vision, p. 313. IEEE Computer Society, Washington, DC, USA (2003)
Zelnik-Manor, L., Perona, P.: Self-tuning spectral clustering. In: Advances in Neural Information Processing Systems (2004)
Zhao, F., Jiao, L., Liu, H., Gao, X., Gong, M.: Spectral clustering with eigenvector selection based on entropy ranking. Neurocomput. 73(10-12), 1704–1717 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, H., Qu, G., Zhu, X. (2011). Self-adjust Local Connectivity Analysis for Spectral Clustering. In: Huang, J.Z., Cao, L., Srivastava, J. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2011. Lecture Notes in Computer Science(), vol 6634. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20841-6_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-20841-6_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20840-9
Online ISBN: 978-3-642-20841-6
eBook Packages: Computer ScienceComputer Science (R0)