Skip to main content

Linguistic processing of text for a large-scale conceptual Information Retrieval system

  • Conference paper
  • First Online:
Conceptual Structures: Current Practices (ICCS 1994)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 835))

Included in the following conference series:

Abstract

This paper describes our large-scale effort to build a conceptual Information Retrieval system that converts a large volume of natural language text into Conceptual Graph representation by means of knowledge-based processing. In order to automatically extract concepts and conceptual relations between concepts from texts, we constructed a knowledge base consisting of over 12,000 case frames for verbs and a large number of other linguistic patterns that reveal conceptual relations. They were used to process a Wall Street Journal database covering a period of three years. We describe our methods for constructing the knowledge base, how the linguistic knowledge is used to process the text, and how the retrieval system makes use of the rich representation of documents and information needs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Myaeng, S. H. & Liddy, E. (1993) Information Retrieval with Semantic Representation of Texts, in Proc. of Symposium on Document Analysis and Information Retrieval, in April, Las Vegas.

    Google Scholar 

  2. Myaeng, S. H. (1992). Using conceptual graphs for information retrieval:A framework for representation and flexible inferencing. Proceedings of Symposium on Document Analysis and Information Retrieval, Las Vegas, March 16–18.

    Google Scholar 

  3. Sowa, J. (1984). Conceptual Structures: Information Processing in Mind and Machine. Reading, MA: Addison-Wesley.

    Google Scholar 

  4. Fox, E. (1980). Lexical relations: Enhancing effectiveness of information retrieval systems. SIGIR Forum, 14, 6–35.

    Google Scholar 

  5. Wang, Y. et al. (1985). Relational thesauri in information retrieval. Journal of American Society for Information Science, 36, 15–27.

    Google Scholar 

  6. Spark Jones, K. & Kay, M. (1973). Linguistics and Information Science. New York: Academic Press.

    Google Scholar 

  7. Farradane, J. (1980). Relation indexing: Part I and part II. Journal of Information Science, 1, 267–276 & 313-24.

    Google Scholar 

  8. Lu, X. (1990). Document retrieval:A structural approach. Information Processing & Management, 26 (2), 209–218.

    Google Scholar 

  9. Fillmore, C.J. (1968). The case for case. In: Universals in Linguistic Theory, ed. Bach & Harms, 1–88. New York: Holt, Rinehart, and Winston.

    Google Scholar 

  10. Cook, W. (1989). Case Grammar Theory. Washington, D.C.: Georgetown University Press.

    Google Scholar 

  11. Somers, H. L. (1987) Valency and Case in Computational Linguistics. Edinburgh: Edinburgh University Press.

    Google Scholar 

  12. Dick, J. (1992). A conceptual, case-relation representation of text for intelligent retrieval. Technical Report CSRI-265, Computer Systems Research Institute, Univ. of Toronto.

    Google Scholar 

  13. Wendlandt, E. & Driscoll, J. (1991). Incorporating a semantic analysis into a document retrieval strategy. Proc. 14th International ACM/SIGIR Conference on Research and Development in Information Retrieval, Chicago, October.

    Google Scholar 

  14. Rosner, M. & Somers, H.L. (1980). Case in linguistics and cognitive science. UEA Papers in Linguistics, 13, 1–29.

    Google Scholar 

  15. Meteer, M., Schwarte, R. & Weischedel, R. (1991). POST: Using probabilities in language processing. Proceedings of the Twelfth International Conference on Artificial Intelligence, Sydney, Australia.

    Google Scholar 

  16. Hobbs, J. et al. (1992). FASTUS: System summary. Unpublished manuscript.

    Google Scholar 

  17. Myaeng, S. H. & Lopez-lopes, Aurelio (1992). A conceptual graph matching: a flexible algorithm and experiments. Journal of Experimental and Theoretical Artificial Intelligence, 4, 107–126.

    Google Scholar 

  18. Liddy, E., Paik, W. & Woelfel, J. (1992). Use of subject field codes from a machine-readable dictionary for automatic classification of documents. Proc. of 3rd ASIS Classification Research Workshop.

    Google Scholar 

  19. Myaeng, S. H. & Khoo, C. (1992). On uncertainty handling in plausible reasoning with conceptual graphs. Proc. of 7th Workshop on Conceptual Graphs, Las Craces, NM, July, 1992.

    Google Scholar 

  20. Shafer, G. (1976). A Mathematical Theory of Evidence. Princeton, N.J.: Princeton University Press.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

William M. Tepfenhart Judith P. Dick John F. Sowa

Rights and permissions

Reprints and permissions

Copyright information

© 1994 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Myaeng, S.H., Khoo, C., Li, M. (1994). Linguistic processing of text for a large-scale conceptual Information Retrieval system. In: Tepfenhart, W.M., Dick, J.P., Sowa, J.F. (eds) Conceptual Structures: Current Practices. ICCS 1994. Lecture Notes in Computer Science, vol 835. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-58328-9_5

Download citation

  • DOI: https://doi.org/10.1007/3-540-58328-9_5

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-58328-8

  • Online ISBN: 978-3-540-38675-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics