Abstract
We describe the participation of the MIRACLE group in the QA track at CLEF 2006. We participated in three subtasks and presented two systems that work in Spanish. The first system is a traditional QA system and was evaluated in the main task and the Real-Time QA (RT-QA) pilot. It features improved Named Entity recognition and shallow linguistic analysis, and achieves moderate performance. In contrast, the results obtained in RT-QA show that this approach is promising for providing answers under time constraints. The second system focuses on the WiQA pilot task, which aims at retrieving important snippets to complete a Wikipedia article. The system uses the link structure of the collection, cosine similarity and Named Entities to retrieve new and important snippets. Although the experiments have not been exhaustive, performance seems to depend on the type of concept.
This work has been partially supported by the Regional Government of Madrid under the Research Network MAVIR (S-0505/TIC-0267) and by two projects of the Spanish Ministry of Education and Science (TIN2004/07083 and TIN2004-07588-C03-02).
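To make the idea of retrieving "new" snippets with cosine similarity more concrete, the following is a minimal sketch and not the MIRACLE implementation. It assumes plain bag-of-words term counts, a hypothetical `threshold` of 0.5, and hypothetical function names (`cosine`, `novel_snippets`); the link structure and Named Entity components mentioned in the abstract are not modelled here.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercased word tokens; \w is Unicode-aware, so accented Spanish letters are kept.
    return re.findall(r"\w+", text.lower())


def cosine(a, b):
    # Cosine similarity between the bag-of-words count vectors of two strings.
    va, vb = Counter(tokenize(a)), Counter(tokenize(b))
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm = math.sqrt(sum(c * c for c in va.values())) * math.sqrt(
        sum(c * c for c in vb.values())
    )
    return dot / norm if norm else 0.0


def novel_snippets(article_sentences, candidates, threshold=0.5):
    # Keep a candidate only if it is dissimilar (below the assumed threshold)
    # to every sentence already in the article and to every snippet kept so far.
    kept = []
    for cand in candidates:
        seen = article_sentences + kept
        if all(cosine(cand, s) < threshold for s in seen):
            kept.append(cand)
    return kept


article = ["Madrid is the capital of Spain."]
candidates = [
    "Madrid is the capital of Spain.",
    "The city hosts the Prado museum.",
    "The city hosts the Prado museum.",
]
print(novel_snippets(article, candidates))
```

In this toy run the first candidate repeats the article and the third repeats the second, so only one snippet survives. A real system would likely weight terms (for example with tf-idf) and tune the threshold on held-out data.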
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
de Pablo-Sánchez, C., González-Ledesma, A., Moreno-Sandoval, A., Vicente-Díez, M.T. (2007). MIRACLE Experiments in QA@CLEF 2006 in Spanish: Main Task, Real-Time QA and Exploratory QA Using Wikipedia (WiQA). In: Peters, C., et al. Evaluation of Multilingual and Multi-modal Information Retrieval. CLEF 2006. Lecture Notes in Computer Science, vol 4730. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74999-8_55
DOI: https://doi.org/10.1007/978-3-540-74999-8_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74998-1
Online ISBN: 978-3-540-74999-8
eBook Packages: Computer Science (R0)