Abstract
In this paper we introduce a high-precision query classification method to identify the intent of a user query given that it has been seen in the past based on informational, navigational, and transactional categorization. We propose using three vector representations of queries which, using support vector machines, allow past queries to be classified by user’s intents. The queries have been represented as vectors using two factors drawn from click-through data: the time users take to review the documents they select and the popularity (quantity of preferences) of the selected documents. Experimental results show that time is the factor that yields higher precision in classification. The experiments shown in this work illustrate that the proposed classifiers can effectively identify the intent of past queries with high-precision.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baeza-Yates, R., Calderón-Benavides, L., González-Caro, C.: The intention behind web queries. In: Crestani, F., Ferragina, P., Sanderson, M. (eds.) SPIRE 2006. LNCS, vol. 4209, pp. 98–109. Springer, Heidelberg (2006)
Basu, A., Watters, C.R., Shepherd, M.A.: Support vector machines for text categorization. In: HICSS 2003: Proceedings of the 36th Hawaii International Conference on System Sciences, Hawaii, USA, January 6- 9, p. 7. IEEE Computer Press, Los Alamitos (2003)
Broder, A.: A taxonomy of web search. SIGIR Forum 36(2), 3–10 (2002)
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm
Claypool, M., Brown, D., Le, P., Waseda, M.: Inferring user interest. IEEE Internet Computing 5(6), 32–39 (2001)
Cortes, C., Vapnik, V.: Support-vector networks. Machine Learning 20(3), 273–297 (1995)
Fawcett, T.: An introduction to ROC analysis. Pattern Recognition Letters 27(8), 861–874 (2006)
Fox, S., Karnawat, K., Mydland, M., Dumais, S., White, T.: Evaluating implicit measures to improve web search. ACM Transactions on Information Systems 23(2), 147–168 (2005)
Hsu, C.W., Chang, C.C., Lin, C.J.: A practical guide to support vector classification. Department of Computer Science and Information Engineering, National Taiwan University (2003)
Jansen, B., Booth, D., Spink, A.: Determining the informational, navigational and transactional intent of Web queries. Information Processing and Management 44(3), 1251–1266 (2008)
Kang, I.-H., Kim, G.: Query type classification for web document retrieval. In: SIGIR 2003: Proceedings of the 26th International ACM SIGIR Conference, Toronto, Canada, July 28 - August 1, pp. 64–71. ACM, New York (2003)
Lee, U., Liu, Z., Cho, J.: Automatic identification of user goals in web search. In: WWW 2005: Proceedings of the 14th International Conference on World Wide Web, Chiba, Japan, May 10-14, 2005, pp. 391–400. ACM Press, New York (2005)
Leopold, E., Kindermann, J.: Text categorization with support vector machines. how to represent texts in input space? Machine Learning 46(1-3), 423–444 (2002)
Liu, Y., Zhang, M., Ru, L., Ma, S.: Automatic query type identification based on click through information. In: Ng, H.T., Leong, M.-K., Kan, M.-Y., Ji, D. (eds.) AIRS 2006. LNCS, vol. 4182, pp. 593–600. Springer, Heidelberg (2006)
Rose, D.E., Levinson, D.: Understanding user goals in web search. In: WWW 2004: Proceedings of the 13th International Conference on World Wide Web, May 17-20, pp. 13–19. ACM Press, New York (2004)
Salton, G., Buckley, C.: Term-weighting approaches in automatic retrieval. Information Processing and Management 24(5), 513–523 (1988)
Wen, J., Nie, J., Zhang, H.: Clustering user queries of a search engine. In: WWW 2001: Proc. of the 10th Int. Conf. on World Wide Web, Hong Kong, May 1-5, 2001, pp. 162–168. ACM Press, New York (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mendoza, M., Zamora, J. (2009). Identifying the Intent of a User Query Using Support Vector Machines. In: Karlgren, J., Tarhio, J., Hyyrö, H. (eds) String Processing and Information Retrieval. SPIRE 2009. Lecture Notes in Computer Science, vol 5721. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03784-9_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-03784-9_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03783-2
Online ISBN: 978-3-642-03784-9
eBook Packages: Computer ScienceComputer Science (R0)