Trend and Behavior Detection from Web Queries

  • Chapter
Survey of Text Mining

Abstract

In this chapter, we demonstrate the type and nature of query characteristics that can be mined from web server logs. Based on a study of over half a million queries (spanning four academic years) to a university’s website, we show that the vocabulary (terms) generated from these queries does not follow a well-defined Zipf distribution. However, some regularities in term frequency and ranking correlations suggest that piecewise polynomial data fits are reasonable for trend representations.
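As a rough illustration of what testing query vocabulary against a Zipf distribution can involve, the sketch below ranks terms by frequency and fits a straight line to log(frequency) versus log(rank). It is not the chapter's method, which is not visible in this preview: the function names `rank_frequency` and `loglog_fit`, the whitespace tokenization, and the toy queries are all assumptions made for the example. A slope near -1 with a high R² would suggest Zipf-like behavior; a poor single-line fit is the kind of result that could motivate piecewise fits instead.

```python
import math
from collections import Counter


def rank_frequency(queries):
    """Return term frequencies sorted from most to least frequent.

    Assumption: terms are obtained by lowercasing and splitting on whitespace.
    """
    counts = Counter(term for q in queries for term in q.lower().split())
    return sorted(counts.values(), reverse=True)


def loglog_fit(freqs):
    """Least-squares fit of log(freq) = intercept + slope * log(rank).

    Expects at least two positive frequencies, ordered by rank.
    Returns (slope, intercept, r_squared).
    """
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(f) for f in freqs]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    syy = sum((y - mean_y) ** 2 for y in ys)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # If all frequencies are equal, the flat line fits exactly.
    r_squared = (sxy * sxy) / (sxx * syy) if syy > 0 else 1.0
    return slope, intercept, r_squared


if __name__ == "__main__":
    # Hypothetical queries, made up for the example (not data from the study).
    toy_queries = ["campus map", "campus parking", "library hours", "map"]
    freqs = rank_frequency(toy_queries)
    slope, intercept, r2 = loglog_fit(freqs)
    print(f"frequencies={freqs} slope={slope:.3f} R^2={r2:.3f}")
```

On a real log, the fit would be run on the full ranked vocabulary, and the residuals inspected across rank ranges to see where a single power law breaks down.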




Copyright information

© 2004 Springer Science+Business Media New York

About this chapter

Cite this chapter

Wang, P., Bownas, J., Berry, M.W. (2004). Trend and Behavior Detection from Web Queries. In: Berry, M.W. (eds) Survey of Text Mining. Springer, New York, NY. https://doi.org/10.1007/978-1-4757-4305-0_8

  • DOI: https://doi.org/10.1007/978-1-4757-4305-0_8

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4419-3057-6

  • Online ISBN: 978-1-4757-4305-0

  • eBook Packages: Springer Book Archive
