Abstract
This paper presents a system for bilingual information retrieval using commercial off-the-shelf search engines (COTS). Several custom query construction, expansion and translation strategies are compared. We present the experiments and the corresponding results for the CLEF 2004 event.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Babel Fish, Babel Fish Translation (2004), http://babelfish.altavista.com/ (Source checked August 2004)
Brown, P.F., Della Pietra, S.A., Della Pietra, V.J., Mercer, R.L.: The Mathematics of Statistical Machine Translation: Parameter Estimation. Computational Linguistics 19(2), 263–311 (1993)
Buckley, C., Salton, G.: Optimization of relevance feedback weights. In: Proceedings of the 18th annual international ACM SIGIR conference on research and development in information retrieval, pp. 351–357 (1995)
Chen, A.: Cross-Language Retrieval Experiments at CLEF 2002. In: Peters, C., Braschler, M., Gonzalo, J. (eds.) CLEF 2002. LNCS, vol. 2785, pp. 28–48. Springer, Heidelberg (2003)
Cöster, R., Sahlgren, M., Karlgren, J.: Selective compound splitting of Swedish queries for Boolean. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds.) CLEF 2003. LNCS, vol. 3237, pp. 337–344. Springer, Heidelberg (2003)
Craswell, N., Hawking, D., Wilkinson, R., Wu, M.: Overview of the TREC 2003 Web Track. In: The Twelfth Text Retrieval Conference, TREC 2003, Washington, D. C. (2003)
Jarmasz, M., Barrière, C.: A Terminological Resource and a Terabyte-Sized Corpus for Automatic Keyphrase in Context Translation. Technical Report, National Research Council of Canada (2004)
Klir, G.J., Yuan, B.: Fuzzy Sets and Fuzzy Logic. Prentice Hall, Upper Saddle River (1995)
Lam-Adesina, A.M., Jones, G.J.H.: Exeter at CLEF 2001: Experiments with Machine Translation for bilingual retrieval. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, p. 59. Springer, Heidelberg (2002)
Leonhardt, C.: Termium ® History (2004), http://www.termium.gc.ca/site/histo_e.html (Source checked August 2004)
Miyamoto, S.: Fuzzy Sets in Information Retrieval and Cluster Analysis. Kluwer Academic Publishers, Dordrecht (1990)
Porter, M.F.: An Algorithm for Suffix Stripping. Program 14(3), 130–137 (1980)
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Information Processing and Management: an International Journal 24(5), 513–523 (1988)
Termium, The Government of Canada’s Terminology and Linguistic Database (2004), http://www.termium.com/ (Source checked August 2004)
Terra, E., Clarke, C.L.A.: Frequencyestimates for statistical word similarity measures. In: Proceedings of the Human Language Technology and North American Chapter of Association of Computational Linguistics Conference 2003 (HLT/NAACL 2003), Edmonton, Canada, pp. 244–251 (2003)
Turney, P.D.: Learning Algorithms for Keyphrase Extraction. Information Retrieval 2(4), 303–336 (2000)
Verlinde, S., Selva, T.: GRELEP (Groupe de Recherche en Lexicographie Pédagogique) (2003), Dafles
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nadeau, D., Jarmasz, M., Barrière, C., Foster, G., St-Jacques, C. (2005). Using COTS Search Engines and Custom Query Strategies at CLEF. In: Peters, C., Clough, P., Gonzalo, J., Jones, G.J.F., Kluck, M., Magnini, B. (eds) Multilingual Information Access for Text, Speech and Images. CLEF 2004. Lecture Notes in Computer Science, vol 3491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11519645_10
Download citation
DOI: https://doi.org/10.1007/11519645_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27420-9
Online ISBN: 978-3-540-32051-7
eBook Packages: Computer ScienceComputer Science (R0)