Skip to main content

Ranking Location-Dependent Keywords to Extract Geographical Characteristics from Microblogs

  • Conference paper
Web Information Systems and Technologies (WEBIST 2012)

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 140))

Included in the following conference series:

  • 2392 Accesses

Abstract

The spread of microblogging services, such as Twitter, has made it possible to extract geographical characteristics such as keywords specific to a geographical region, with fine granularity. The results of content analysis of microblogging services are easily affected by users who post excessive messages. In addition, because geographical granularity of users’ interests differs, it is preferable to support multiple levels of granularity for usability. Thus, we propose a ranking method of location-dependent keywords based on a term frequency-inverse document frequency method to extract geographical characteristics. In our method, ranking scores are weighted by diversity of information sources so that the effect of loud users is mitigated. Multiple zoom levels of geographical areas are supported by approximation while databases at only several zoom levels are maintained. We evaluated our ranking method with a real dataset from Twitter and showed its effectiveness. We also describe a prototype implementation of a system using our ranking method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Arakawa, Y., Tagashira, S., Fukuda, A.: Extraction of Location Dependent Words from Twitter Logs. IPSJ SIG Technical Reports 2010-MBL-55(10), 1–6 (2010) (in Japanese)

    Google Scholar 

  2. Bernstein, M., Suh, B., Hong, L., Chen, J., Kairam, S., Chi, E.: Eddi: Interactive Topic-based Browsing of Social Status Streams. In: UIST 2010: 23rd Annual ACM Symposium on User Interface Software and Technology, pp. 303–312 (2010)

    Google Scholar 

  3. Chen, J., Nairn, R., Nelson, L., Bernstein, M., Chi, E.: Short and Tweet: Experiments on Recommending Content from Information Streams. In: CHI 2010: 28th International Conference on Human Factors in Computing Systems, pp. 1185–1194 (2010)

    Google Scholar 

  4. Cranshaw, J., Toch, E., Hong, J., Kittur, A., Sadeh, N.: Bridging the Gap between Physical Location and Online Social Networks. In: UbiComp 2010: 12th ACM International Conference on Ubiquitous Computing, pp. 119–128 (2010)

    Google Scholar 

  5. Google Inc.: Google MAPs JavaScript API V3 (2009), http://code.google.com/intl/en/apis/maps/documentation/javascript/ (retrieved October 2011)

  6. Järvelin, K., Kekäläinen, J.: Cumulated Gain-based Evaluation of IR Techniques. ACM Transactions on Information Systems 20(4), 422–446 (2002)

    Article  Google Scholar 

  7. Mei, Q., Liu, C., Su, H., Zhai, C.: A Probabilistic Approach to Spatiotemporal Theme Pattern Mining on Weblogs. In: WWW 2006: 15th International Conference on World Wide Web, pp. 533–542 (2006)

    Google Scholar 

  8. Sahami, M., Dumais, S., Heckerman, D., Horvitz, E.: A Bayesian Approach to Filtering Junk E-mail. In: AAAI 1998 Workshop on Learning for Text Categorization (1998)

    Google Scholar 

  9. Sakaki, T., Okazaki, M., Matsuo, Y.: Earthquake Shakes Twitter Users: Real-time Event Detection by Social Sensors. In: WWW 2010: 19th International Conference on World Wide Web, pp. 851–860 (2010)

    Google Scholar 

  10. Salton, G., Buckley, C.: Term-Weighting Approaches in Automatic Text Retrieval. Information Processing & Management 24(5), 513–523 (1988)

    Article  Google Scholar 

  11. Sankaranarayanan, J., Samet, H., Teitler, B., Lieberman, M., Sperling, J.: Twitterstand: News in Tweets. In: ACM SIGSPATIAL GIS 2009: 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, pp. 42–51 (2009)

    Google Scholar 

  12. Toch, E., Cranshaw, J., Drielsma, P., Tsai, J., Kelley, P., Springfield, J., Cranor, L., Hong, J., Sadeh, N.: Empirical Models of Privacy in Location Sharing. In: UbiComp 2010: 12th ACM International Conference on Ubiquitous Computing, pp. 129–138 (2010)

    Google Scholar 

  13. Tumasjan, A., Sprenger, T., Sandner, P., Welpe, I.: Predicting Elections with Twitter: What 140 Characters Reveal about Political Sentiment. In: ICWSM 2010: 4th International AAAI Conference on Weblogs and Social Media, pp. 178–185 (2010)

    Google Scholar 

  14. Twitter Inc.: Streaming API | Twitter Developers (2010), https://dev.twitter.com/docs/streaming-api (retrieved October 2011)

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ikeda, S., Kami, N., Yoshikawa, T. (2013). Ranking Location-Dependent Keywords to Extract Geographical Characteristics from Microblogs. In: Cordeiro, J., Krempels, KH. (eds) Web Information Systems and Technologies. WEBIST 2012. Lecture Notes in Business Information Processing, vol 140. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36608-6_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-36608-6_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-36607-9

  • Online ISBN: 978-3-642-36608-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics