Abstract
Aimed at the problems of Chinese experts’ name repetition and representation diversity, a Chinese expert name disambiguation approach based on spectral clustering with the expert page-associated relationships is proposed. Firstly, the TF-IDF algorithm is used to calculate the word-based feature weights, and then the cosine similarity algorithm is employed to compute the similarity between the evidence-pages to obtain the initial similarity matrix of expert evidence-pages. Secondly, the expert page-associated relationship features are taken as the semi-supervised constraint information to correct the initial similarity matrix, and next the spectral clustering-based method is used to build expert disambiguation model. Finally, taking the contrast experiments on Chinese expert evidence-page corpus of manually labeled, the result shows that the semi-supervised spectral clustering on Chinese experts’ name disambiguation method with the expert page-associated relationships than that without the associated constraint information, the F-value has an average increase of 9.02 %.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Wang H, Mei Z (2005) Chinese multi-document person name disambiguation. High Technol Lett 11(3):280–283
Danushka B, Yutaka M, Mitsuru I (2006) Disambiguation person names on the web using automatically extracted key phrases. In: Brewka G, Coradeschi S, Perini A, Traverso P (eds) Proceedings of the 17th European conference on artificial intelligence. IOS Press, Riva del Garda, Italy, pp 553–557
Zhang S, You L (2010) Chinese people name disambiguation by hierarchical clustering. New Technol Libr Inf Serv 2010(11):64–68 (in Chinese)
Lang J, Qin B (2009) Person name disambiguation of searching results using social network. Chin J Comput 32(7):1365–1375
Quattoni A, Wang S, Morency LP et al (2007) Hidden conditional random fields. IEEE Trans Pattern Anal Mach Intell 29(10):1848–1852
Tang J, Yao L, Zhang D et al (2010) A combination approach to web user profiling. ACM Trans Knowl Discov Data 5(1):2-2
Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22(8):888–905
Wagstaff K, Cardie C (2000) Clustering with instance-level constraints. In: Langley P (ed) Proceedings of the 17th international conference on machine learning. Morgan Kaufmann Publishers, San Francisco, pp 1103–1110
Acknowledgments
This paper is supported by National Nature Science Foundation (No. 61175068), and the Open Fund of Software Engineering Key Laboratory of Yunnan Province (No. 2011SE14), and the Ministry of Education of Returned Overseas Students to Start Research and Fund Projects.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tian, W., Shen, T., Yu, Z., Guo, J., Xian, Y. (2013). A Chinese Expert Name Disambiguation Approach Based on Spectral Clustering with the Expert Page-Associated Relationships. In: Sun, Z., Deng, Z. (eds) Proceedings of 2013 Chinese Intelligent Automation Conference. Lecture Notes in Electrical Engineering, vol 256. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38466-0_28
Download citation
DOI: https://doi.org/10.1007/978-3-642-38466-0_28
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38465-3
Online ISBN: 978-3-642-38466-0
eBook Packages: EngineeringEngineering (R0)