MIRACLE Experiments in QA@CLEF 2006 in Spanish: Main Task, Real-Time QA and Exploratory QA Using Wikipedia (WiQA)

  • Conference paper
Evaluation of Multilingual and Multi-modal Information Retrieval (CLEF 2006)

Abstract

We describe the participation of the MIRACLE group in the QA track at CLEF 2006. We participated in three subtasks and presented two systems that work in Spanish. The first system is a traditional QA system and was evaluated in the main task and the Real-Time QA (RT-QA) pilot. The system features improved Named Entity recognition and shallow linguistic analysis and achieves moderate performance. In contrast, the results obtained in RT-QA show that this approach is promising for providing answers under time constraints. The second system focuses on the WiQA pilot task, which aims at retrieving important snippets to complete a Wikipedia article. The system uses the link structure of the collection, cosine similarity and Named Entities to retrieve new and important snippets. Although the experiments have not been exhaustive, it seems that performance depends on the type of concept.
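
As an illustration of how cosine similarity can serve as a novelty filter for candidate snippets, the following is a minimal sketch and not the MIRACLE system itself. It assumes plain term-frequency vectors with no stemming, link analysis or Named Entity features. The function names and the threshold of 0.5 are hypothetical choices made for the example.

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercase the text and count word tokens (no stemming or stopword removal)."""
    return Counter(re.findall(r"\w+", text.lower()))


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-frequency vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector is treated as similar to nothing
    return dot / (norm_a * norm_b)


def novel_snippets(article_sentences, candidates, threshold=0.5):
    """Keep candidates that are dissimilar to every sentence already known.

    The threshold is an assumed value for illustration, not one from the paper.
    """
    known = [bag_of_words(s) for s in article_sentences]
    selected = []
    for snippet in candidates:
        vec = bag_of_words(snippet)
        if all(cosine_similarity(vec, k) < threshold for k in known):
            selected.append(snippet)
            known.append(vec)  # later candidates must also differ from this one
    return selected


article = ["Madrid is the capital of Spain."]
candidates = [
    "Madrid is the capital of Spain.",
    "The city hosts the Prado museum.",
]
result = novel_snippets(article, candidates)
```

In this sketch a candidate identical to an existing sentence is rejected, while one that shares only a function word passes the filter. A fuller system would also need a separate importance signal, which the abstract attributes to link structure and Named Entities.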

This work has been partially supported by the Regional Government of Madrid under the Research Network MAVIR (S-0505/TIC-0267) and by two projects of the Spanish Ministry of Education and Science (TIN2004/07083 and TIN2004-07588-C03-02).

Editor information

Carol Peters Paul Clough Fredric C. Gey Jussi Karlgren Bernardo Magnini Douglas W. Oard Maarten de Rijke Maximilian Stempfhuber

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

de Pablo-Sánchez, C., González-Ledesma, A., Moreno-Sandoval, A., Vicente-Díez, M.T. (2007). MIRACLE Experiments in QA@CLEF 2006 in Spanish: Main Task, Real-Time QA and Exploratory QA Using Wikipedia (WiQA). In: Peters, C., et al. Evaluation of Multilingual and Multi-modal Information Retrieval. CLEF 2006. Lecture Notes in Computer Science, vol 4730. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74999-8_55

  • DOI: https://doi.org/10.1007/978-3-540-74999-8_55

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74998-1

  • Online ISBN: 978-3-540-74999-8

  • eBook Packages: Computer Science, Computer Science (R0)
