Skip to main content

Monolingual Retrieval Experiments with a Domain-Specific Document Corpus at the Chemnitz University of Technology

  • Conference paper
Evaluation of Multilingual and Multi-modal Information Retrieval (CLEF 2006)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4730))

Included in the following conference series:

Abstract

This article describes the first participation of the Chair Media Informatics of the Chemnitz University of Technology in the Cross Language Evaluation Forum. An experimental prototype is introduced which implements several methods of optimizing search results. The configuration of the prototype is tested with the CLEF training data. The results of the Domain-Specific Monolingual German task suggest that combining the suffix stripping stemming and the decompounding approach is very useful. Also, a local document clustering (LDC) approach used to improve the query expansion (QE) based on pseudo-relevance feedback (PRF) seems to be quite beneficial. Nevertheless, the evaluation of the English task using the same configuration suggests that the qualities of the results are highly speech dependent.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. The Apache Software Foundation: Lucene. Retrieved August 10, 2006 from the World Wide Web (1998-2006), http://lucene.apache.org

  2. CLEF: Guidelines for Participation in CLEF 2006 Ad-Hoc and Domain-Specific Tracks. Retrieved August 10, 2006 from the World Wide Web (restricted access, 2006), http://www.clef-campaign.org/delos/clef/protect/guidelines06.htm

  3. Porter, M.: The Snowball Project. Retrieved August 10, 2006 from the World Wide Web (2001), www.snowball.tartarus.org

  4. Wagner, S.: A German Decompounder Retrieved August 10, 2006 from the World Wide Web (2005), http://www-user.tu-chemnitz.de/~wags/cv/clr.pdf

  5. Steinbach, M., Karypis, G., Kumar, V.: A Comparison of Document Clustering Techniques, University of Minnesota, Technical Report # 00034 (2000)

    Google Scholar 

  6. Rasmussen, E.: Clustering Algorithms. In: Frakes, W.B., Baeza-Yates, R. (eds.) Information Retrieval -Data Structures and Algorithms, Prentice Hall, Englewood Cliffs New Jersey (1992)

    Google Scholar 

  7. Willett, P.: Recent Trends in Hierarchic Document Clustering. Information Processing & Management 24(5), 577–597 (1988)

    Article  Google Scholar 

  8. Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Pearson Addison-Wesley, Harlow Munich (2005)

    Google Scholar 

  9. Fox, E.A., Shaw, J.A.: Combination of Multiple Searches. In: Proceedings of the 2nd Text Retrieval Conference (TREC2), NIST Special Publication, pp. 215–500 (1994)

    Google Scholar 

  10. Savoy, J.: Data Fusion for Effective European Monolingual Information Retrieval. In: Working Notes for the CLEF 2004 Workshop (2004)

    Google Scholar 

  11. Lin, W.-C., Chen, H.-H.: Merging Mechanisms in Multilingual Information Retrieval. In: Working Notes for the CLEF 2002 Workshop (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Carol Peters Paul Clough Fredric C. Gey Jussi Karlgren Bernardo Magnini Douglas W. Oard Maarten de Rijke Maximilian Stempfhuber

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kürsten, J., Eibl, M. (2007). Monolingual Retrieval Experiments with a Domain-Specific Document Corpus at the Chemnitz University of Technology. In: Peters, C., et al. Evaluation of Multilingual and Multi-modal Information Retrieval. CLEF 2006. Lecture Notes in Computer Science, vol 4730. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74999-8_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-74999-8_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74998-1

  • Online ISBN: 978-3-540-74999-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics