Skip to main content

Practical Evaluation of Textual Fuzzy Similarity as a Tool for Information Retrieval

  • Conference paper
  • First Online:
Advances in Web Intelligence (AWIC 2003)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2663))

Included in the following conference series:

Abstract

This paper presents a practical evaluation of a document retrieval method based on a certain textual fuzzy similarity measure. The similarity measure was originally introduced in [1] — cf. also [2], and later used in Internet-related applications [3,4]. Three textual databases of diverse level of freedom in the content of documents are used for experiments in the search. In other words, the relation of the documents within each group to the chosen topic is (according to the evaluating person) strong, average, and random. The results of the search coincide with intuition and confirm the expectation that methods based on similarity are advantageous as long as the database contains documents of a relatively well-defined topic.

This work has partly been supported by the NATO Scientific Committee via the Spanish Ministry for Science and Technology; grantholder — P.S.Szczepaniak; host institution — Politechnical University, Madrid, Spain, 2002/2003.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Niewiadomski A. (2000): Appliance of fuzzy relations for text document comparing. Proceedings of the 5th Conference NNSC, Zakopane, Poland, pp. 347–352.

    Google Scholar 

  2. Niewiadomski A., Szczepaniak P.S. (2001): Intutionistic Fuzzy Relations in Approximate Text Comparison. Published in Polish: Intuicjonistyczne relacje rozmyte w przybliżonym porównywaniu tekstów. In: Chojcan J. Łeski J. (Eds.): Zbiory rozmyte i ich zastosowania. Silesian Technical University Press, Gliwice, Poland, pp. 271–282; ISBN 83-88000-64-0.

    Google Scholar 

  3. Niewiadomski A., Szczepaniak P.S., (2002). Fuzzy Similarity in E-Commerce Domains. In: Segovia J., Szczepaniak P.S., Niedzwiedzinski M. (Eds.) E-Commerce and Intelligent Methods. Physica-Verlag, A Springer-Verlag Company, Heidelberg, New York.

    Google Scholar 

  4. Szczepaniak P.S., Niewiadomski A. (2003). Internet Search Based on Text Intuitionistic Fuzzy Similarity. In: Szczepaniak P.S., Segovia J., Kacprzyk J., Zadeh L. (Eds.) Intelligent Exploration of the Web. Physica-Verlag, A Springer-Verlag Company, Heidelberg, New York.

    Google Scholar 

  5. Lebart L., Salem A., Berry L. (1998). Exploring Textual Data. Kluwer Academic Publisher.

    Google Scholar 

  6. Baeza-Yates R., Ribeiro-Neto B. (1999). Modern Information Retrieval. Addison Wesley, New York.

    Google Scholar 

  7. Ho T.B., Kawasaki S., Nguyen N.B. (2003). Documents Clustering using Tolerance Rough Set Model and Its Application to Information Retrieval. In: Szczepaniak P.S., Segovia J., Kacprzyk J., Zadeh L. (Eds.) Intelligent Exploration of the Web. Physica-Verlag, A Springer-Verlag Company, Heidelberg, New York.

    Google Scholar 

  8. Zadeh L. (1965). Fuzzy Sets. Information and Control, 8, pp. 338–353.

    Article  MATH  MathSciNet  Google Scholar 

  9. Pedrycz W., Gomide F. (1998): An Introduction to Fuzzy Sets; Analysis and Design. A Bradford Book, The MIT Press, Cambridge, Massachusetts and London, England.

    Google Scholar 

  10. SleepyCat Software, Inc. BerkeleyDB Documentation; http://www.sleepycat.com

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Szczepaniak, P.S., Gil, M. (2003). Practical Evaluation of Textual Fuzzy Similarity as a Tool for Information Retrieval. In: Menasalvas, E., Segovia, J., Szczepaniak, P.S. (eds) Advances in Web Intelligence. AWIC 2003. Lecture Notes in Computer Science, vol 2663. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44831-4_26

Download citation

  • DOI: https://doi.org/10.1007/3-540-44831-4_26

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40124-7

  • Online ISBN: 978-3-540-44831-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics