Skip to main content

Using COTS Search Engines and Custom Query Strategies at CLEF

  • Conference paper
Multilingual Information Access for Text, Speech and Images (CLEF 2004)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3491))

Included in the following conference series:

Abstract

This paper presents a system for bilingual information retrieval using commercial off-the-shelf search engines (COTS). Several custom query construction, expansion and translation strategies are compared. We present the experiments and the corresponding results for the CLEF 2004 event.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Babel Fish, Babel Fish Translation (2004), http://babelfish.altavista.com/ (Source checked August 2004)

  • Brown, P.F., Della Pietra, S.A., Della Pietra, V.J., Mercer, R.L.: The Mathematics of Statistical Machine Translation: Parameter Estimation. Computational Linguistics 19(2), 263–311 (1993)

    Google Scholar 

  • Buckley, C., Salton, G.: Optimization of relevance feedback weights. In: Proceedings of the 18th annual international ACM SIGIR conference on research and development in information retrieval, pp. 351–357 (1995)

    Google Scholar 

  • Chen, A.: Cross-Language Retrieval Experiments at CLEF 2002. In: Peters, C., Braschler, M., Gonzalo, J. (eds.) CLEF 2002. LNCS, vol. 2785, pp. 28–48. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  • Cöster, R., Sahlgren, M., Karlgren, J.: Selective compound splitting of Swedish queries for Boolean. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds.) CLEF 2003. LNCS, vol. 3237, pp. 337–344. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  • Craswell, N., Hawking, D., Wilkinson, R., Wu, M.: Overview of the TREC 2003 Web Track. In: The Twelfth Text Retrieval Conference, TREC 2003, Washington, D. C. (2003)

    Google Scholar 

  • Jarmasz, M., Barrière, C.: A Terminological Resource and a Terabyte-Sized Corpus for Automatic Keyphrase in Context Translation. Technical Report, National Research Council of Canada (2004)

    Google Scholar 

  • Klir, G.J., Yuan, B.: Fuzzy Sets and Fuzzy Logic. Prentice Hall, Upper Saddle River (1995)

    MATH  Google Scholar 

  • Lam-Adesina, A.M., Jones, G.J.H.: Exeter at CLEF 2001: Experiments with Machine Translation for bilingual retrieval. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, p. 59. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  • Leonhardt, C.: Termium ® History (2004), http://www.termium.gc.ca/site/histo_e.html (Source checked August 2004)

  • Miyamoto, S.: Fuzzy Sets in Information Retrieval and Cluster Analysis. Kluwer Academic Publishers, Dordrecht (1990)

    MATH  Google Scholar 

  • Porter, M.F.: An Algorithm for Suffix Stripping. Program 14(3), 130–137 (1980)

    Google Scholar 

  • Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Information Processing and Management: an International Journal 24(5), 513–523 (1988)

    Article  Google Scholar 

  • Termium, The Government of Canada’s Terminology and Linguistic Database (2004), http://www.termium.com/ (Source checked August 2004)

  • Terra, E., Clarke, C.L.A.: Frequencyestimates for statistical word similarity measures. In: Proceedings of the Human Language Technology and North American Chapter of Association of Computational Linguistics Conference 2003 (HLT/NAACL 2003), Edmonton, Canada, pp. 244–251 (2003)

    Google Scholar 

  • Turney, P.D.: Learning Algorithms for Keyphrase Extraction. Information Retrieval 2(4), 303–336 (2000)

    Article  Google Scholar 

  • Verlinde, S., Selva, T.: GRELEP (Groupe de Recherche en Lexicographie Pédagogique) (2003), Dafles

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nadeau, D., Jarmasz, M., Barrière, C., Foster, G., St-Jacques, C. (2005). Using COTS Search Engines and Custom Query Strategies at CLEF. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds) Multilingual Information Access for Text, Speech and Images. CLEF 2004. Lecture Notes in Computer Science, vol 3491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11519645_10

Download citation

  • DOI: https://doi.org/10.1007/11519645_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-27420-9

  • Online ISBN: 978-3-540-32051-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics