Frequent Patterns Based Word Network: What Can We Obtain from the Tourism Blogs?

Yuan, Hua; Guo, Lei; Xu, Hualin; Xiang, Yong

doi:10.1007/978-3-642-39787-5_2

Hua Yuan²⁰,
Lei Guo²⁰,
Hualin Xu²⁰ &
…
Yong Xiang²¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8041))

Included in the following conference series:

International Conference on Knowledge Science, Engineering and Management

2251 Accesses
4 Citations

Abstract

In this work, we present a method to extract interesting information for a specific reader from massive tourism blog data. To this end, we first introduce the web crawler tool to obtain blog contents from the web and divide them into semantic word segments. Then, we use the frequent pattern mining method to discover the useful frequent 1- and 2-itemset between words after necessary data cleaning. Third, we visualize all the word correlations with a word network. Finally, we propose a local information search method based on the max-confidence measurement that enables the blog readers to specify an interesting topic word to find the relevant contents. We illustrate the benefits of this approach by applying it to a Chinese online tourism blog dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Zheng, Y., Zhang, L., Xie, X., Ma, W.-Y.: Mining interesting locations and travel sequences from GPS trajectories. In: Proceedings of WWW 2009, pp. 791–800 (2009)
Google Scholar
Pan, B., MacLaurin, T., Crotts, J.: Travel blogs and the implications for destination marketing. Journal of Travel Research 46, 35–45 (2007)
Article Google Scholar
Sigurbj\(\ddot{o}\)rnsson, B., van Zwol, R.: Flickr tag recommendation based on collective knowledge. In: Proceedings of the WWW 2008, pp. 327–336 (2008)
Google Scholar
Asbagh, M.J., Sayyadi, M., Abolhassani, H.: Blog Summarization for Blog Mining. In: Lee, R., Ishii, N. (eds.) Software Engineering, Artificial Intelligence. SCI, vol. 209, pp. 157–167. Springer, Heidelberg (2009)
Google Scholar
Provart, N.: Correlation networks visualization. Frontiers in Plant Science 3(artical 240), 1–6 (2012)
Google Scholar
Cao, Q., Duan, W., Gan, Q.: Exploring determinants of voting for the “helpfulness” of online user reviews: A text mining approach. Decision Support Systems 50(2), 511–521 (2011)
Article Google Scholar
Ghose, A., Ipeirotis, P.G.: Estimating the helpfulness and economic impact of product reviews: Mining text and reviewer characteristics. IEEE Transactions on Knowledge and Data Engineering 23, 1498–1512 (2011)
Article Google Scholar
Hu, M., Liu, B.: Mining and summarizing customer reviews. In: Proceedings of the KDD 2004, pp. 168–177 (2004)
Google Scholar
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval 2(1-2), 1–135 (2008)
Article Google Scholar
Dave, K., Lawrence, S., Pennock, D.M.: Mining the peanut gallery: Opinion extraction and semantic classification of product reviews. In: Proceedings of the WWW 2003, pp. 519–528 (2003)
Google Scholar
Cui, H., Mittal, V., Datar, M.: Comparative experiments on sentiment classification for online product reviews. In: Proceedings of the AAAI 2006, pp. 1265–1270 (2006)
Google Scholar
Qamra, A., Tseng, B., Chang, E.Y.: Mining blog stories using community-based and temporal clustering. In: Proceedings of the CIKM 2006, pp. 58–67 (2006)
Google Scholar
Attardi, G., Simi, M.: Blog Mining Through Opinionated Words. In: Proceedings of the Fifteenth Text REtrieval Conference (TREC 2006), pp. 14–17 (2006)
Google Scholar
Bai, X., Sun, J., Che, H., Wang, J.: Towards Knowledge Extraction from Weblogs and Rule-Based Semantic Querying. In: Paschke, A., Biletskiy, Y. (eds.) RuleML 2007. LNCS, vol. 4824, pp. 215–223. Springer, Heidelberg (2007)
Chapter Google Scholar
Liu, B.: Sentiment analysis and subjectivity, 2nd edn. Handbook of Natural Language Processing (2010)
Google Scholar
O’Leary, D.E.: Blog mining-review and extensions: “From each according to his opinion”. Decision Support Systems 51(4), 821–830 (2011)
Article Google Scholar
Liu, Y., Yu, X., Huang, X., An, A.: Blog Data Mining: The Predictive Power of Sentiments. In: Data Mining for Business Applications, pp. 183–195 (2009)
Google Scholar
Wang, F., Wu, Y.: Mining Market Trend from Blog Titles Based on Lexical Semantic Similarity. In: Gelbukh, A. (ed.) CICLing 2012, Part II. LNCS, vol. 7182, pp. 261–273. Springer, Heidelberg (2012)
Chapter Google Scholar
Kobayashi, M., Takeda, K.: Information retrieval on the web. ACM Computing Surveys 32(2), 144–173 (2000)
Article Google Scholar
Raghavan, S., Garcia-Molina, H.: Crawling the Hidden Web. In: Proceedings of the VLDB 2001, pp. 129–138 (2001)
Google Scholar
Han, J., Kamber, M.: Data Mining: Concepts and Techniques, 2nd edn. Morgan Kaufmann (2005)
Google Scholar
Xiong, H., Tan, P.-N., Kumar, V.: Hyperclique pattern discovery. Data Mining and Knowledge Discovery Journal 13(2), 219–242 (2006)
Article MathSciNet Google Scholar
Jo, Y., Oh, A.H.: Aspect and sentiment unification model for online review analysis. In: Proceedings of the WSDM 2011, pp. 815–824 (2011)
Google Scholar
Wu, T., Chen, Y., Han, J.: Re-examination of interestingness measures in pattern mining: a unified framework. Data Min. Knowl. Discov. 21(3), 371–397 (2010)
Article MathSciNet Google Scholar
Java, A., Song, X., Finin, T., Tseng, B.: Why we twitter: understanding microblogging usage and communities. In: Proceedings of the WebKDD/SNA-KDD 2007, pp. 56–65. ACM, New York (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Management and Economics, University of Electronic Science and Technology of China, 610054, Chengdu, China
Hua Yuan, Lei Guo & Hualin Xu
Public Security Department of Sichuan Province, 610041, Chengdu, China
Yong Xiang

Authors

Hua Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Lei Guo
View author publications
You can also search for this author in PubMed Google Scholar
Hualin Xu
View author publications
You can also search for this author in PubMed Google Scholar
Yong Xiang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Management and Economics, Dalian University of Technology, 116023, Dalian, China
Mingzheng Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yuan, H., Guo, L., Xu, H., Xiang, Y. (2013). Frequent Patterns Based Word Network: What Can We Obtain from the Tourism Blogs?. In: Wang, M. (eds) Knowledge Science, Engineering and Management. KSEM 2013. Lecture Notes in Computer Science(), vol 8041. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39787-5_2

Download citation

DOI: https://doi.org/10.1007/978-3-642-39787-5_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39786-8
Online ISBN: 978-3-642-39787-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics