Abstract
Query translation is the mainstream in cross-language information retrieval, but ambiguity must be resolved by methods based on dictionary translation. In this paper, we propose a progressive algorithm for disambiguation which is derived from another algorithm we propose called the max-sum model. The new algorithm take a strategy called weighted-average probability distribution to redistribute the probabilities. Moreover, the new algorithm can be computed in a more direct way by solving an equation system. All the resource our method requires is a bilingual dictionary and a monolingual corpus. Experiments show it outperforms four other methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Federico, M., Bertoldi, N.: Statistical cross-language information retrieval using N-best query translations. In: SIGIR 2002: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 167–174. ACM Press, New York (2002)
Xu, J., Weischedel, R.: TREC-9 cross-lingual retrieval at BBN. In: Proceedings of the TREC-9 Conference (2001)
Lavrenko, V., Choquette, M., Croft, W.B.: Cross-lingual relevance models. In: SIGIR 2002: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 175–182. ACM Press, New York (2002)
Nie, J.-Y., Simard, M.: Using statistical translation models for bilingual ir. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, pp. 137–150. Springer, Heidelberg (2002)
Gao, J., Nie, J.-Y., Xun, E., Zhang, J., Zhou, M., Huang, C.: Improving query translation for cross-language information retrieval using statistical models. In: SIGIR 2001: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 96–104. ACM Press, New York (2001)
Kraaij, W., Pohlmann, R., Hiemstra, D.: Twenty-one at TREC-8: using language technology for information retrieval. In: Voorhees, E.M., Harman, D.K. (eds.) The Eigth Text REtrieval Conference (TREC-8). National Institute of Standards and Technology, NIST, vol. 8, pp. 285–300. NIST Special Publication 500-246 (2000)
Maeda, A., Sadat, F., Yoshikawa, M., Uemura, S.: Query term disambiguation for web cross-language information retrieval using a search engine. In: IRAL 2000: Proceedings of the fifth international workshop on Information retrieval with Asian languages, pp. 25–32. ACM Press, New York (2000)
Liu, Y., Jin, R., Chai, J.Y.: A maximum coherence model for dictionary-based cross-language information retrieval. In: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, Salvador, Brazil, August 15–19, 2005, pp. 536–543 (2005)
Adriani, M.: Dictionary-based CLIR for the CLEF multilingual track. In: Working Notes of the Workshop in Cross-Language Evaluation Forum (CLEF), September 2000, Lisbon (2000)
Mathews, J.H., Fink, K.D.: Numerical Methods Using MATLAB, 3rd edn. Prentice-Hall, Englewood Cliffs (2002)
Xu, S.: Computer Algorithms for Common Use, 2nd edn. Tsinghua University Press (1995)
Daelemans, W., Sima’an, K., Veenstra, J., Zavrel, J. (eds.): Different Approaches to Cross Language Information Retrieval, Language and Computers: Studies in Practical Linguistics, vol. 37. Rodopi, Amsterdam (2001)
Li, X., Yan, H., Wang, J.: Search Engine: Principle, Technology and Systems. Science Press (2004)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yuan, S.A., Yu, S.N. (2008). A Progressive Algorithm for Cross-Language Information Retrieval Based on Dictionary Translation. In: Li, H., Liu, T., Ma, WY., Sakai, T., Wong, KF., Zhou, G. (eds) Information Retrieval Technology. AIRS 2008. Lecture Notes in Computer Science, vol 4993. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68636-1_24
Download citation
DOI: https://doi.org/10.1007/978-3-540-68636-1_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68633-0
Online ISBN: 978-3-540-68636-1
eBook Packages: Computer ScienceComputer Science (R0)