Abstract
The analysis of topic evolution mainly refers to the mining of topic content which evolves as the time goes on. With the assumption that topic content may be embodied by key words, this article adopted word2vec for the training of 750,000 pieces of news and micro-blog texts and thus established the model of word vector. Then, the text information flow was applied into the model and all word vectors by time series were acquired. Finally, the word vectors were clustered by K-means before the key words were drawn and the analysis of topic evolution was visualized. By comparing the effect of the model of word vector on drawing topic with those of LDA or PLSA topic models, the results showed that the former is superior to the latter two models. Besides, to collect abundant and varied data will facilitate the training of the model of word vector with better generalization ability and the investigation on real-time analysis of topic evolution.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Chen, H., Wang, F., Zeng, D.: Intelligence and security informatics for homeland security: information, communication, and transportation. IEEE Trans. Intell. Transp. Syst. 5(4), 329–341 (2004)
Wang, F.: Decision service and academic analytics for development of S&T based on open source intelligence and big data. Bull. Chin. Acad. Sci. 27(5), 527–537 (2012)
Allan, J., Papka, R., Lavrenko, V.: On-line new event detection and tracking. In: Proceedings of the 21st Annual International Conference on Research and Development in Information Retrieval (SIGIR), pp. 37–45. ACM (1998)
Han, Z., Chen, N., Le, J.J., Duan, D., Sun, J.: An efficient and effective clustering algorithm for time series of hot topics. Chin. J. Comput. 35(11), 2337–2347 (2012)
Zhang, S., Liu, Z.: Study on clustering method for internet public opinion hotspot topic. J. Chin. Comput. Syst. 34(3), 471–474 (2013)
Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake shakes twitter users: real-time event detection by social sensors. In: Proceedings of the Nineteenth International World Wide Web Conference Series (WWW), pp. 851–860. ACM (2010)
Zhou, H., Liu, J., Wang, X.: Retrospective topic identification model for short text information flow. J. Chin. Inf. Process. 29(1), 111–117 (2015a)
Zhou, Y., Liu, X., Youtian, D., Guan, X., Liu, J.: A method for Identifying the evolutionary focuses of online social topics. Chin. J. Comput. 38(2), 261–271 (2015b)
Xing, E.P., Yan, R., Hauptmann, A.G.: Mining associated text and images with dual-wing harmoniums. In: Proceedings of the 21st Conference on Uncertainty in Artificial Intelligence (UAI), pp. 633–641 (2005)
Lin, J., Zhou, Y., Yang, A., Chen, Y., Chen, X.: Analysis of public emotion evolution based on probabilistic latent semantic analysis. J. Comput. Appl. 35(10), 2747–2751 (2015). 2756
He, J., Chen, X., Min, D., Jiang, H.: Topic evolution analysis based on improved online LDA model. J. Cent. S. Univ. (Sci. Technol.) 2, 547–553 (2015)
Lin, P., Huang, W.: Topic evolution analysis of internet emergency based on LDA model. Inf. Sci. 32(10), 20–23 (2014)
Cao, J., Wang, H., Xia, Y., Qiao, F., Zhang, X.: Bi-path evolution model for online topic model based on LDA. Acta Automatica Sin. 40(12), 2877–2886 (2014)
Alsumait, L., Barbará, D., Domeniconi, C.: On-line LDA: adaptive topic models for mining text streams with applications to topic detection and tracking. In: Proceedings of the 8th IEEE International Conference on Data Mining (ICDM), pp. 3–12. IEEE (2008)
Keane, N., Yee, C., Zhou, L.: Using topic modeling and similarity thresholds to detect events. In: Proceedings of the 3rd Workshop on EVENTS at the NAACL-HLT, pp. 34–42 (2015)
Chu, K., Li, F.: LDA model-based news topic evolution. Comput. Appl. Softw. 28(4), 4–7 (2011)
Zhangjie, F., Xingming, S., Qi, L., Lu, Z., Jiangang, S.: Achieving efficient cloud search services: multi-keyword ranked search over encrypted cloud data supporting parallel computing. IEICE Trans. Commun. E98-B(1), 190–200 (2015)
Bin, G., Sheng, V.S., Tay, K.Y., Romano, W., Li, S.: Incremental support vector learning for ordinal regression. IEEE Trans. Neural Netw. Learn. Syst. 26(7), 1403–1416 (2015)
Xia, Z., Wang, X., Sun, X., Wang, Q.: A secure and dynamic multi-keyword ranked search scheme over encrypted cloud data. IEEE Trans. Parallel Distrib. Syst. 27(2), 340–352 (2015)
Zhang, D., Xua, H., Sua, Z., Xua, Y.: Chinese comments sentiment classification based on word2vec and SVMperf. Expert Syst. Appl. 42(4), 1857–1863 (2015)
Acknowledgment
Supported by The National Social Science Funding Project of China (12BYY045); The Twelfth Five-Year Plan for Philosophy and Social Sciences Project of GuangZhou (15Q16); The Philosophy and Social Sciences Project of GuangDong (GD15YTS01).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Jianghao, L., Yongmei, Z., Aimin, Y., Jin, C. (2016). Analysis of Topic Evolution on News Comments Based on Word Vectors. In: Sun, X., Liu, A., Chao, HC., Bertino, E. (eds) Cloud Computing and Security. ICCCS 2016. Lecture Notes in Computer Science(), vol 10040. Springer, Cham. https://doi.org/10.1007/978-3-319-48674-1_41
Download citation
DOI: https://doi.org/10.1007/978-3-319-48674-1_41
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48673-4
Online ISBN: 978-3-319-48674-1
eBook Packages: Computer ScienceComputer Science (R0)