Abstract
User intent is defined as a user’s information need. Detecting intent in Web search helps users to obtain relevant content, thus improving their satisfaction. We propose a novel approach to instantiating intent by using adaptive categorization producing predicted intent probabilities. For this, we attempt to detect factors by which intent is formed, called intent features, by using a Web Q&A corpus. Our approach was motivated by the observation that questions related to queries are effective for finding intent features. We extract set of categories and their intent features automatically by analyzing questions within Web Q&A corpus, and categorize search results using these features. The advantages of our intent-based categorization are twofold, (1) presenting the most probable intent categories to help users clarify and choose starting points for Web searches, and (2) adapting sets of intent categories for each query. Experimental results show that distilled intent features can efficiently describe intent categories, and search results can be efficiently categorized without any human supervision.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Broder, A.Z.: A taxonomy of web search. SIGIR Forum 36(2), 3–10 (2002)
Chaker, J., Ounelli, H.: Genre Categorization of Web Pages. In: Proceedings of the 7th IEEE International Conference on Data Mining Workshops, pp. 455–464 (2007)
Gabrilovich, E., Markovitch, S.: Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis. In: Proceedings of the 20th International Joint Conferences on Artificial Intelligence, pp. 1606–1611 (2007)
Gabrilovich, E., Markovitch, S.: Feature generation for text categorization using world knowledge. In: Proceedings of the 18th International Joint Conferences on Artificial Intelligence, pp. 1048–1053 (2005)
Gyöngyi, Z., Koutrika, G., Pedersen, J., Garcia-Molina, H.: Questioning Yahoo! Answers. In: Proceedings of QAWeb 2008 (2008)
Guo, Q., Agichtein, E.: Exploring mouse movements for inferring query intent. In: Proceedings of the 31st International SIGIR Conference on Research and Development in Information Retrieval, pp. 707–708 (2008)
Hu, J., Wang, G., Lochovsky, F., Chen, Z.: Understanding User’s Query Intent with Wikipedia. In: Proceedings of the 18th International Conference on World Wide Web, pp. 471–480 (2009)
Jansen, B.J., Booth, D.L., Spink, A.: Determining the informational, navigational, and transactional intent of Web queries. Information Process and Management 44(3), 1251–1266 (2008)
Kules, B., Kustanowitz, J., Shneiderman, B.: Categorizing Web Search Results into Meaningful and Stable Categories Using Fast-Feature Techniques. In: Proceedings of the 6th ACM/IEEE Joint Conference on Digital Libraries, pp. 210–219 (2006)
Lee, U., Liu, Z., Cho, J.: Automatic identification of user goals in Web search. In: Proceedings of the 14th International Conference on World Wide Web, pp. 391–400 (2005)
Li, Y., Krishnamurthy, R., Vaithyanathan, S., Jagadish, H.V.: Getting work done on the Web: Supporting transactional queries. In: Proceedings of the 29st International SIGIR Conference on Research and Development in Information Retrieval, pp. 557–564 (2006)
Metzler, D., Croft, W.B.: Analysis of Statistical Question Classification for Fact-based Questions. Information Retrieval 8(3), 481–504 (2004)
Nastase, V., Sayyad-Shirabad, J., Sokolova, M., Szpakowicz, S.: Learning Noun-Modifier Semantic Relations with Corpus-based and WordNet-based Features. In: Proceedings of American Association for Artificial Intelligence (2006)
Pandit, S., Olson, C.: Navigation-Aided Retrieval. In: Proceedings of the 16th International Conference on World Wide Web, pp. 391–400 (2007)
Porter, M.F.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yoon, S., Jatowt, A., Tanaka, K. (2009). Intent-Based Categorization of Search Results Using Questions from Web Q&A Corpus. In: Vossen, G., Long, D.D.E., Yu, J.X. (eds) Web Information Systems Engineering - WISE 2009. WISE 2009. Lecture Notes in Computer Science, vol 5802. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04409-0_19
Download citation
DOI: https://doi.org/10.1007/978-3-642-04409-0_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04408-3
Online ISBN: 978-3-642-04409-0
eBook Packages: Computer ScienceComputer Science (R0)