Detecting Keyphrases in Micro-blogging with Graph Modeling of Information Diffusion

Song, Shuangyong; Meng, Yao; Sun, Jun

doi:10.1007/978-3-319-13560-1_3

Shuangyong Song²¹,
Yao Meng²¹ &
Jun Sun²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8862))

Included in the following conference series:

Pacific Rim International Conference on Artificial Intelligence

6365 Accesses
2 Citations

Abstract

The rapid increasing popularity of micro-blogging has made it an important information seeking channel. Keyphrase extraction is an effective way for summarizing and analyzing micro-blogging content, which can help users gain insights into internet hotspots. Existing methods for keyphrase extraction usually unilaterally consider phrase frequency or user retweet count as key factors. However, those methods may neglect the relationships between different phrases and the importance of user influence to further information diffusion. Generally, phrases shown in the influential users’ micro-blogs are more likely to attract other users’ interest, making them more likely to be diffused in the near future. Besides, phrases may have relations with each other, and some phrases usually have similar diffusion paths and attract the attention of the same population. In this paper, by comprehensively considering all the above mentioned factors to detect micro-blogging keyphrases, we proposed a novel model. The proposed model first detect high frequency term from abundant micro-blogs as candidate keyphrases, then construct a relation graph about them with user interest and user following web. Finally, we rank those candidates with graph models for realizing keyphrases detection. Experiments show this model is very effective for micro-blogging keyphrase extraction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Aral, S., Brynjolfsson, E., Alstyne, M.V.: Productivity Effects of Information Diffusion in Networks, The MIT Center for Digital Business, paper 234 (2007)
Google Scholar
Barker, K., Cornacchia, N.: Using Noun Phrase Heads to Extract Document Keyphrases. In: Hamilton, H.J. (ed.) Canadian AI 2000. LNCS (LNAI), vol. 1822, pp. 40–52. Springer, Heidelberg (2000)
Chapter Google Scholar
Bellaachia, A., Al-Dhelaan, M.: NE-Rank: A Novel Graph-Based Keyphrase Extraction in Twitter. In: Proceedings of the 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology, pp. 372–379 (2012)
Google Scholar
Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. In: Proceedings of the Seventh International Conference on World Wide Web, pp. 107–117 (1998)
Google Scholar
Celli, F., Di Lascio, F.M.L., Magnani, M., Pacelli, B., Rossi, L.: Social network data and practices: The case of FriendFeed. In: Chai, S.-K., Salerno, J.J., Mabry, P.L. (eds.) SBP 2010. LNCS, vol. 6007, pp. 346–353. Springer, Heidelberg (2010)
Chapter Google Scholar
Cha, M., Haddadi, H., Benevenuto, F., Gummadi, K.P.: Measuring user influence in Twitter: the million follower fallacy. In: Proceedings of the 4th International AAAI Conference on Weblogs and Social Media, pp. 10–17 (2010)
Google Scholar
Cheng, J., Sun, A., Hu, D., Zeng, D.: An information diffusion based recommendation framework for micro-blogging. Journal of the Association for Information Systems 12(7), 463–486 (2011)
Google Scholar
Choudhury, M.D., Lin, Y.-R., Sundaram, H.: How Does the Data Sampling Strategy Impact the Discovery of Information Diffusion in Social Media? In: Proceedings of the 4th International AAAI Conference on Weblogs and Social Media, pp. 34–41 (2010)
Google Scholar
Ding, Z., Zhang, Q., Huang, X.: Keyphrase Extraction from Online News Using Binary Integer Programming. In: Proceedings of the 5th International Joint Conference on Natural Language Processing, pp. 165–173 (2011)
Google Scholar
Haveliwala, T.: Topic Sensitive PageRank. In: Proceedings of the 11th International World Wide Web Conference, pp. 517–526 (2002)
Google Scholar
Hussey, R., Williams, S., Mitchell, R., Field, I.: A Comparison of Automated Keyphrase Extraction Techniques and of Automatic Evaluation vs. Human Evaluation. International Journal on Advances in Life Sciences 4(3&4), 136–153 (2012)
Google Scholar
Java, A., Song, X., Finin, T., Tseng, B.: Why we twitter: understanding microblogging usage and communities. In: Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 Workshop on Web Mining and Social Network Analysis, pp. 56–65 (2007)
Google Scholar
Nakagawa, H., Mori, T.: A simple but powerful automatic term extraction method. In: Proceedings of COLING 2002 on COMPUTERM 2002: Second International Workshop on Computational Terminology, pp. 1–7 (2002)
Google Scholar
Li, X., Liu, B., Yu, P.: Time sensitive ranking with application to publication search. In: Proceedings of the 2008 Eighth IEEE International Conference on Data Mining, pp. 893–898 (2008)
Google Scholar
Liu, Z., Huang, W., Zheng, Y., Sun, M.: Automatic keyphrase extraction via topic decomposition. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 366–376 (2010)
Google Scholar
Lui, M., Baldwin, T.: Cross-domain Feature Selection for Language Identification. In: Proceedings of the Fifth International Joint Conference on Natural Language Processing, pp. 553–561 (2011)
Google Scholar
Yang, M., Lee, J., Lee, S., Rim, H.: Finding interesting posts in Twitter based on retweet graph analysis. In: Proceedings of the 35th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1073–1074 (2012)
Google Scholar
Mei, Q., Shen, X., Zhai, C.: Automatic labeling of multinomial topic model. In: Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 490–499 (2007)
Google Scholar
Mori, J., Ishizuka, M., Matsuo, Y.: Extracting Keyphrases to Represent Relations in Social Networks from Web. In: Proceedings of the Twentieth International Joint Conference on Artificial Intelligence, pp. 2820–2827 (2007)
Google Scholar
Paukkeri, M., Nieminen, I., Pöllä, M., Honkela, T.: A Language-independent Approach to Keyphrase Extraction and Evaluation. In: Proceedings of the 22nd International Conference on Computational Linguistics, pp. 237–252 (2008)
Google Scholar
Song, S., Li, Q., Zheng, N.: A spatio-temporal framework for related topic search in micro-blogging. In: An, A., Lingras, P., Petty, S., Huang, R. (eds.) AMT 2010. LNCS, vol. 6335, pp. 63–73. Springer, Heidelberg (2010)
Chapter Google Scholar
Song, S., Li, Q., Zheng, X.: Detecting popular topics in micro-blogging based on a user interest-based model. In: Proceedings of the 2012 International Joint Conference on Neural Networks, pp. 1–8 (2012)
Google Scholar
Wan, X., Xiao, J.: Single Document Keyphrase Extraction Using Neighborhood Knowledge. In: Proceedings of the 23rd AAAI Conference on Artificial Intelligence, pp. 855–860 (2008)
Google Scholar
Weng, J., Lim, E.-P., Jiang, J., He, Q.: TwitterRank: finding topic-sensitive influential twitterers. In: Proceedings of the 3rd ACM International Conference on Web Search and Data Mining, pp. 261–270 (2010)
Google Scholar
Wu, W., Zhang, B., Ostendorf, M.: Automatic generation of personalized annotation tags for twitter users. In: Proceedings of the 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 689–692 (2010)
Google Scholar
Zhao, W., Jiang, J., He, J., Song, Y., Achananuparp, P., Lim, E.P., Li, X.: Topical Keyphrase Extraction from Twitter. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, pp. 379–388 (2011)
Google Scholar
Cho, V., Esfahbod, B., Mansouri, M.: City of New York on Twitter:@ NYCGov. In: Proceedings of the 13th Annual International Conference on Digital Government Research, pp. 274–275 (2012)
Google Scholar
Macdonald, C., Ounis, I.: Voting Techniques for Expert Search. J. Knowledge and Information Systems, 259–280 (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

Fujitsu R&D Center CO., LTD, 100025, Beijing, China
Shuangyong Song, Yao Meng & Jun Sun

Authors

Shuangyong Song
View author publications
You can also search for this author in PubMed Google Scholar
Yao Meng
View author publications
You can also search for this author in PubMed Google Scholar
Jun Sun
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

MIMOS Berhad Technology Park Malaysia, 57000, Bukit Jalil, KL, Malaysia
Duc-Nghia Pham
Kyungpook National University, Sankyuk-Dong, Buk-Gu, 702-701, Daegu, Korea
Seong-Bae Park

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Song, S., Meng, Y., Sun, J. (2014). Detecting Keyphrases in Micro-blogging with Graph Modeling of Information Diffusion. In: Pham, DN., Park, SB. (eds) PRICAI 2014: Trends in Artificial Intelligence. PRICAI 2014. Lecture Notes in Computer Science(), vol 8862. Springer, Cham. https://doi.org/10.1007/978-3-319-13560-1_3

Download citation

DOI: https://doi.org/10.1007/978-3-319-13560-1_3
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13559-5
Online ISBN: 978-3-319-13560-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics