iCLEF 2003 at Maryland: Translation Selection and Document Selection

Dorr, Bonnie; He, Daqing; Luo, Jun; Oard, Douglas W.; Schwartz, Richard; Wang, Jianqiang; Zajic, David

doi:10.1007/978-3-540-30222-3_42

Bonnie Dorr¹⁹,
Daqing He¹⁹,
Jun Luo¹⁹,
Douglas W. Oard¹⁹,
Richard Schwartz¹⁹,
Jianqiang Wang¹⁹ &
…
David Zajic¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3237))

Included in the following conference series:

Workshop of the Cross-Language Evaluation Forum for European Languages

389 Accesses
4 Citations

Abstract

Maryland performed two sets of experiments for the 2003 Cross-Language Evaluation Forum’s interactive track, one focused on interactive selection of appropriate translations for query terms, the second focused on interactive selection of relevant documents. Translation selection was supported using possible synonyms discovered through back translation and two techniques for generating KeyWord In Context (KWIC) examples of usage. The results indicate that searchers typically achieved a similar search effectiveness using fewer query iterations when interactive translation selection was available. For document selection, a complete extract of the first 40 words of each news story was compared to a compressed extract generated using an automated parse-and-trim approach that approximates one way in which people can produce headlines. The results indicate that compressed “headlines” result in faster assessment, but with a 20% relative reduction in the F _α= 0.8 search effectiveness measure.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Cohen, J.: A coefficient of agreement for nominal scales. Educational and Psychological Measures 20, 37–46 (1960)
Article Google Scholar
Collins, M.: Three generative lexicalised models for statistical parsing. In: Proceedings of the 35th ACL (1997)
Google Scholar
Habash, N.Y.: Generation-heavy Hybrid Machine Translation. PhD thesis, Department of Computer Science, University of Maryland at College Park (2003)
Google Scholar
He, D., Wang, J., Oard, D.W., Nossal, M.: Comparing user-assisted and Automatic Query Translation. In: Peters, C., Braschler, M., Gonzalo, J. (eds.) CLEF 2002. LNCS, vol. 2785. Springer, Heidelberg (2003)
Chapter Google Scholar
Knight, K., Marcu, D.: Statistics-based summarization step one; sentence compression. In: Proceedings of AAAI 2001 (2001)
Google Scholar
Miller, S., Crystal, M., Fox, H., Ramshaw, L., Schwartz, R., Stone, R., Weischedel, R.: Algorithms that Learn to Extract Information; BBN: Description of the SIFT System as Used for MUC-7. In: Proceedings of the MUC-7 (1998)
Google Scholar
Miller, S., Ramshaw, L., Fox, H., Weischedel, R.: A novel use of statistical parsing to extract information from text. In: Proceedings of the 1st Meeting of the North Amererican Chapter of the ACL, Seattle, WA, pp. 226–233 (2000)
Google Scholar
Oard, D.W., Levow, G., Cabezas, C.: CLEF Experiments at Maryland: Statistical Stemming and Backoff Translation. In: Peters, C. (ed.) CLEF 2000. LNCS, vol. 2069, pp. 176–187. Springer, Heidelberg (2001)
Chapter Google Scholar
Pirkola, A.: The Effects of Query Structure and Dictionary Setups in Dictionary-Based Cross-Language Information Retrieval. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia (1998)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute for Advanced Computer Studies, University of Maryland, College Park, MD, 20742, USA
Bonnie Dorr, Daqing He, Jun Luo, Douglas W. Oard, Richard Schwartz, Jianqiang Wang & David Zajic

Authors

Bonnie Dorr
View author publications
You can also search for this author in PubMed Google Scholar
Daqing He
View author publications
You can also search for this author in PubMed Google Scholar
Jun Luo
View author publications
You can also search for this author in PubMed Google Scholar
Douglas W. Oard
View author publications
You can also search for this author in PubMed Google Scholar
Richard Schwartz
View author publications
You can also search for this author in PubMed Google Scholar
Jianqiang Wang
View author publications
You can also search for this author in PubMed Google Scholar
David Zajic
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

ISTI-CNR, Area di Ricerca, Pisa, Italy
Carol Peters
No Affiliations,
Julio Gonzalo & Martin Braschler &
German Institute for International and Security Affairs, Stiftung Wissenschaft und Politik (SWP), Ludwigkirchplatz 3-4, 10719, Berlin, Germany
Michael Kluck

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dorr, B. et al. (2004). iCLEF 2003 at Maryland: Translation Selection and Document Selection. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds) Comparative Evaluation of Multilingual Information Access Systems. CLEF 2003. Lecture Notes in Computer Science, vol 3237. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30222-3_42

Download citation

DOI: https://doi.org/10.1007/978-3-540-30222-3_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24017-4
Online ISBN: 978-3-540-30222-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics