Abstract
Maryland performed two sets of experiments for the 2003 Cross-Language Evaluation Forum’s interactive track, one focused on interactive selection of appropriate translations for query terms, the second focused on interactive selection of relevant documents. Translation selection was supported using possible synonyms discovered through back translation and two techniques for generating KeyWord In Context (KWIC) examples of usage. The results indicate that searchers typically achieved a similar search effectiveness using fewer query iterations when interactive translation selection was available. For document selection, a complete extract of the first 40 words of each news story was compared to a compressed extract generated using an automated parse-and-trim approach that approximates one way in which people can produce headlines. The results indicate that compressed “headlines” result in faster assessment, but with a 20% relative reduction in the F α= 0.8 search effectiveness measure.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cohen, J.: A coefficient of agreement for nominal scales. Educational and Psychological Measures 20, 37–46 (1960)
Collins, M.: Three generative lexicalised models for statistical parsing. In: Proceedings of the 35th ACL (1997)
Habash, N.Y.: Generation-heavy Hybrid Machine Translation. PhD thesis, Department of Computer Science, University of Maryland at College Park (2003)
He, D., Wang, J., Oard, D.W., Nossal, M.: Comparing user-assisted and Automatic Query Translation. In: Peters, C., Braschler, M., Gonzalo, J. (eds.) CLEF 2002. LNCS, vol. 2785. Springer, Heidelberg (2003)
Knight, K., Marcu, D.: Statistics-based summarization step one; sentence compression. In: Proceedings of AAAI 2001 (2001)
Miller, S., Crystal, M., Fox, H., Ramshaw, L., Schwartz, R., Stone, R., Weischedel, R.: Algorithms that Learn to Extract Information; BBN: Description of the SIFT System as Used for MUC-7. In: Proceedings of the MUC-7 (1998)
Miller, S., Ramshaw, L., Fox, H., Weischedel, R.: A novel use of statistical parsing to extract information from text. In: Proceedings of the 1st Meeting of the North Amererican Chapter of the ACL, Seattle, WA, pp. 226–233 (2000)
Oard, D.W., Levow, G., Cabezas, C.: CLEF Experiments at Maryland: Statistical Stemming and Backoff Translation. In: Peters, C. (ed.) CLEF 2000. LNCS, vol. 2069, pp. 176–187. Springer, Heidelberg (2001)
Pirkola, A.: The Effects of Query Structure and Dictionary Setups in Dictionary-Based Cross-Language Information Retrieval. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dorr, B. et al. (2004). iCLEF 2003 at Maryland: Translation Selection and Document Selection. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds) Comparative Evaluation of Multilingual Information Access Systems. CLEF 2003. Lecture Notes in Computer Science, vol 3237. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30222-3_42
Download citation
DOI: https://doi.org/10.1007/978-3-540-30222-3_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24017-4
Online ISBN: 978-3-540-30222-3
eBook Packages: Springer Book Archive