Skip to main content

Evolving Weighting Functions for Query Expansion Based on Relevance Feedback

  • Conference paper
Progress in WWW Research and Development (APWeb 2008)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4976))

Included in the following conference series:

Abstract

A new method for query expansion using genetic programming (GP) is proposed in this paper to enhance the retrieval performance of text information retrieval systems. Using a set of queries and retrieved relevant and non-relevant documents corresponding to each query, GP tries to evolve a criteria for selecting terms which when added to the original query improve the next retrieved set of documents. Two experiments are conducted to evaluate the proposed method over three standard datasets: Cranfield, Lisa and Medline. In first experiment a formula is evolved using GP over a training set and is then evaluated over a test query set of the same dataset. In the second experiment, evolved expansion formula over a dataset is evaluated over a different dataset. We compared our method against the base probabilistic method in literature. Results show a higher performance in comparison with original and probabilistically expanded method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Attar, A., Franenckel, S.: Local Feedback in Full Text Retrieval Systems. Journal of the ACM, 397–417 (1980)

    Google Scholar 

  2. Xu, J., Croft, W.B.: Query Expansion Using Local and Global Document Analysis. In: Proc. ACM SIGIR Annual Conference on Research and Development, Zurich (1996)

    Google Scholar 

  3. Qiu, Y., Frei, H.P.: Concept Based Query Expansion. In: Proceedings of the 16th ACM SIGIR Conference on Research and Development in Information Retrieval, Pittsburg, USA, pp. 160–169 (1993)

    Google Scholar 

  4. Schutze, H., Pedersen, J.: A Cooccurrence- Based Thesaurus and two Applications to Information Retrieval. Information Processing & Management 33(3), 307–318 (1997)

    Article  Google Scholar 

  5. Mandala, R., Tokunaga, T., Takana, H.: Combining multiple evidence from different types of thesaurus for query expansion. In: Proceedings of the 22 the Annual International ACM SIGIR. Conference on research and development in information retrieval, August, Buckley, USA (1999)

    Google Scholar 

  6. Rocchio Jr., J.J.: The SMART Retrieval System Experiments in Automatic Document Processing. In: Relevance Feedback in Information Retrieval, pp. 313–323. Prentice Hall, Englewood Cliffs (1971)

    Google Scholar 

  7. Salton, G., Buckley, C.: Improving Retrieval Performance By Relevance Feedback, Journal of The American Society for Information Science 41(4), 288–297 (1990)

    Article  Google Scholar 

  8. Robertson, S., Walker, S., Hanckock, M.M.: Large test collection experiments on an operational interactive system: Okapi at TREC. Information Processing and Management (IPM) journal, 260–345 (1995)

    Google Scholar 

  9. Gordon, M.: Probabilistic and genetic algorithms for document retrieval. Communications of the ACM, 1208–1218 (1988)

    Google Scholar 

  10. Yang, J.J., Korfhage, R.R.: Query optimisation in information retrieval using genetic Algorithms. In: Proceedings of the fifth International Conference on Genetic Algorithms (ICGA), pp. 603–611. The University of Illinois Press, Urbana, IL (1993)

    Google Scholar 

  11. Horng, J.T., Yeh, C.C.: Applying genetic algorithms to query optimisation in document retrieval. Information Processing and Management, 737–759 (2000)

    Google Scholar 

  12. Koza, J.R.: Genetic Programming: On the Programming of Computers by Means of Natural Selection. MIT Press, Cambridge, MA (1992)

    MATH  Google Scholar 

  13. Crestani, F.: Comparing Neural and Probabilistic Relevance Feedback in an Interactive Information Retrieval System. In: Proceedings of the IEEE International Conference on Neural Networks, Orlando, Florida, USA, June 1994, pp. 426–430 (1994)

    Google Scholar 

  14. Salton, G.: The SMART Retrieval System: Experiments in Automatic Document Processing. Prentice Hall, New Jersey (1971)

    Google Scholar 

  15. Porter, M.F.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)

    Google Scholar 

  16. Savoy, J., Vrajitoru, D.: Evaluation of learning schemes used in information retrieval. Technical Report CR-I-95-02, Université de Neuchâtel, Faculté de droit et des Sciences Ećonomiques (1996)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Yanchun Zhang Ge Yu Elisa Bertino Guandong Xu

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Borji, A., Jahromi, M.Z. (2008). Evolving Weighting Functions for Query Expansion Based on Relevance Feedback. In: Zhang, Y., Yu, G., Bertino, E., Xu, G. (eds) Progress in WWW Research and Development. APWeb 2008. Lecture Notes in Computer Science, vol 4976. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78849-2_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-78849-2_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-78848-5

  • Online ISBN: 978-3-540-78849-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics