Skip to main content

SEM: Mining Spatial Events from the Web

  • Conference paper
Advances in Knowledge Discovery and Data Mining (PAKDD 2008)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5012))

Included in the following conference series:

  • 2482 Accesses

Abstract

This paper is concerned with the problem of mining spatial events from the general Web. General search engine is inconvenient when searching vertical information (e.g., locations, experts) since it is designed for general purpose. For example, when finding the battlefields of World War II, listing the Web pages by relevance is not enough to tell users the spatial information clearly. A categorized result along with a map indicating these battlefields would be much easier to read. To present such a result, we propose a novel algorithm called Spatial Event Miner (SEM) to mine spatial event information from the general Web. Given a simple keyword query, SEM first collects and ranks a set of relevant locations from the Web. Then, to describe the events happened in the collected locations, SEM detects and sums up salient phrases as event topics from the context of these locations. For each specific location, the hottest event topics are also listed for quick understanding. Finally, a clear spatial distribution on the events of a given query is presented to the users. A prototype system based on SEM is also implemented. Preliminary experimental results on a set of 40 queries show that the proposed approach can capture the spatial event information effectively.

This work is supported by National Natural Science Foundation of China (Grant Number: 60473122).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Allan, J., Papka, R., Lavrenko, V.: On-Line New Event Detection and Tracking. In: Proc. of SIGIR 1998, pp. 37–45 (1998)

    Google Scholar 

  2. Bilhaut, F., Charnois, T., Enjalbert, P., Mathet, Y.: Geographic Reference Analysis for Geographic Document Querying. In: Workshop on the Analysis of Geographic References, Edmonton, Alberta, Canada (2003)

    Google Scholar 

  3. Chen, Y.F., Fabbrizio, G.D., Gibbon, D., Jana, R., Jora, S., Renger, B., Wei, B.: GeoTracker. Geospatial and Temporal RSS Navigation. In: Proc. of WWW 2007, pp. 41–50 (2007)

    Google Scholar 

  4. Chien, L.F.: Pat-tree Based Adapative Keyphrase Extraction for Intelligent Chinese information retrieval. In: SIGIR 1997, pp. 50–58 (1997)

    Google Scholar 

  5. Li, H., Srihari, R.K., Niu, C., Li, W.: Location Normalization for Information Extraction. In: Proc. of the 19th Conference on COLING 2002, Taipei, Taiwan (2002)

    Google Scholar 

  6. McCurley, K.S.: Geospatial Mapping and Navigation of the Web. In: Proc. of the 10th int. conference on World Wide Web, pp. 221–229. ACM Press, New York (2001)

    Chapter  Google Scholar 

  7. Mei, Q., Liu, C., Su, H., Zhai, C.X.: A Probabilistic Approach to Spatiotemporal Theme Pattern Mining on Weblogs. In: Proc. of WWW 2006, pp. 533–542 (2006)

    Google Scholar 

  8. Smith, D.A.: Detecting and Browsing Events in Unstructured Text. In: Proc. of SIGIR 2002, pp. 73–80 (2002)

    Google Scholar 

  9. Smith, D.A.: Detecting Events with Date and Place Information in Unstructured Text. In: Proc. of JCDL 2002, pp. 191–196 (2002)

    Google Scholar 

  10. Smith, D.A., Crane, G.: Disambiguating Geographic Names in a Historical Digital Library. In: Constantopoulos, P., Sølvberg, I.T. (eds.) ECDL 2001. LNCS, vol. 2163, pp. 127–136. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  11. Ted, D.: Accurate Methods for the Statistics of Surprise and Coincidence. Computational Linguistics 19(1), 61–74 (1993)

    Google Scholar 

  12. Tye, R., Nathaniel, G., Mor, N.: Towards Automatic Extraction of Event and Place Semantics from Flickr Tags. In: Proc. of SIGIR 2007, pp. 103–110 (2007)

    Google Scholar 

  13. Yang, Y., Pierce, T., Carbonell, J.: A Study of Retrospective and On-Line Event Detection. In: Proc. of SIGIR 1998 (1998)

    Google Scholar 

  14. Yang, Y., Pierce, T., Carbonell, J., Jin, C.: Topic-Conditioned Novelty Detection. In: Proc. of SIGKDD 2002, pp. 688–693 (2002)

    Google Scholar 

  15. Zeng, H.J., He, Q.C., Chen, Z., Ma, W.Y., Ma, J.: Learning to Cluster Web Search Results. In: Proc. of SIGIR 2004, pp. 210–217 (2004)

    Google Scholar 

  16. Zhao, Q., Liu, T.Y., Bhowmick, S.S., Ma, W.Y.: Event Detection from Evolution of Click-through Data. In: Proc. of SIGKDD 2006, pp. 484–493 (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Takashi Washio Einoshin Suzuki Kai Ming Ting Akihiro Inokuchi

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Xu, K., Li, R., Bao, S., Han, D., Yu, Y. (2008). SEM: Mining Spatial Events from the Web. In: Washio, T., Suzuki, E., Ting, K.M., Inokuchi, A. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2008. Lecture Notes in Computer Science(), vol 5012. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68125-0_35

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-68125-0_35

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68124-3

  • Online ISBN: 978-3-540-68125-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics