Abstract
In this chapter, we demonstrate the type and nature of query characteristics that can be mined from web server logs. Based on a study of over half a million queries (spanning four academic years) to a university’s website, it is shown that the vocabulary (terms) generated from these queries do not have a well-defined Zipf distribution. However, some regularities in term frequency and ranking correlations suggest that piecewise polynomial data fits are reasonable for trend representations.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
R. Baeza-Yates and B. Ribeiro-Neto.Modern Information Retrieval.AddisonWesley, Boston, 1999.
B.J. Jansen and U. Pooch.A review of Web searching studies and a framework for future research. Journal of the American Society for Information Science and Technology, 52 (3): 235–246, 2001.
B.J. Jansen, A. Spink, and T. Saracevic.Real life, real users, and real needs: A study and analysis of user queries on the Web.Information Processing and Management, 36 (2): 207–227, 2000.
R.R. Korfhage.Information Storage and Retrieval.Wiley,New York, 1977.
N. Ross and D. Wolfram.End user searching on the Internet: An analysis of term pair topics submitted to the Excite Search Engine.Journal of the American Society for Information Science and Technology, 51 (10): 949–958, 2000.
B. Shneiderman, D. Byrd, and W.B. Croft.Clarifying search: A user-interface framework for text searches.D-Lib Magazine, 1:1–18, 1997.
C. Silverstein, M. Henzinger, H. Marais, and M. Moricz.Analysis of a very large Web search engine query log.SIGIR Forum, 33 (1): 6–12, 1999.
A. Spink, D. Wolfram, B. Jansen, and T. Saracevic.Searching the Web: The public and their queries. Journal of the American Society for Information Science and Technology, 52 (3): 226–234, 2001.
D. Wolfram.Term co-occurrence in Internet search engine queries: An analysis of the Excite data set.Canadian Journal of Information and Library Science, 24 (2/3): 12–33, 1999.
P. Wand and L. Pouchard.End-user searching of Web resources: Problems and implications.In Proceedings of the Eighth ASIS SIG/CR Workshop, Washington DC, pages 73–85, 1997.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer Science+Business Media New York
About this chapter
Cite this chapter
Wang, P., Bownas, J., Berry, M.W. (2004). Trend and Behavior Detection from Web Queries. In: Berry, M.W. (eds) Survey of Text Mining. Springer, New York, NY. https://doi.org/10.1007/978-1-4757-4305-0_8
Download citation
DOI: https://doi.org/10.1007/978-1-4757-4305-0_8
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-3057-6
Online ISBN: 978-1-4757-4305-0
eBook Packages: Springer Book Archive