Linguistic processing of text for a large-scale conceptual Information Retrieval system

Myaeng, Sung H.; Khoo, Christopher; Li, Ming

doi:10.1007/3-540-58328-9_5

Sung H. Myaeng¹,
Christopher Khoo¹ &
Ming Li²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 835))

Included in the following conference series:

International Conference on Conceptual Structures

157 Accesses
4 Citations

Abstract

This paper describes our large-scale effort to build a conceptual Information Retrieval system that converts a large volume of natural language text into Conceptual Graph representation by means of knowledge-based processing. In order to automatically extract concepts and conceptual relations between concepts from texts, we constructed a knowledge base consisting of over 12,000 case frames for verbs and a large number of other linguistic patterns that reveal conceptual relations. They were used to process a Wall Street Journal database covering a period of three years. We describe our methods for constructing the knowledge base, how the linguistic knowledge is used to process the text, and how the retrieval system makes use of the rich representation of documents and information needs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Myaeng, S. H. & Liddy, E. (1993) Information Retrieval with Semantic Representation of Texts, in Proc. of Symposium on Document Analysis and Information Retrieval, in April, Las Vegas.
Google Scholar
Myaeng, S. H. (1992). Using conceptual graphs for information retrieval:A framework for representation and flexible inferencing. Proceedings of Symposium on Document Analysis and Information Retrieval, Las Vegas, March 16–18.
Google Scholar
Sowa, J. (1984). Conceptual Structures: Information Processing in Mind and Machine. Reading, MA: Addison-Wesley.
Google Scholar
Fox, E. (1980). Lexical relations: Enhancing effectiveness of information retrieval systems. SIGIR Forum, 14, 6–35.
Google Scholar
Wang, Y. et al. (1985). Relational thesauri in information retrieval. Journal of American Society for Information Science, 36, 15–27.
Google Scholar
Spark Jones, K. & Kay, M. (1973). Linguistics and Information Science. New York: Academic Press.
Google Scholar
Farradane, J. (1980). Relation indexing: Part I and part II. Journal of Information Science, 1, 267–276 & 313-24.
Google Scholar
Lu, X. (1990). Document retrieval:A structural approach. Information Processing & Management, 26 (2), 209–218.
Google Scholar
Fillmore, C.J. (1968). The case for case. In: Universals in Linguistic Theory, ed. Bach & Harms, 1–88. New York: Holt, Rinehart, and Winston.
Google Scholar
Cook, W. (1989). Case Grammar Theory. Washington, D.C.: Georgetown University Press.
Google Scholar
Somers, H. L. (1987) Valency and Case in Computational Linguistics. Edinburgh: Edinburgh University Press.
Google Scholar
Dick, J. (1992). A conceptual, case-relation representation of text for intelligent retrieval. Technical Report CSRI-265, Computer Systems Research Institute, Univ. of Toronto.
Google Scholar
Wendlandt, E. & Driscoll, J. (1991). Incorporating a semantic analysis into a document retrieval strategy. Proc. 14th International ACM/SIGIR Conference on Research and Development in Information Retrieval, Chicago, October.
Google Scholar
Rosner, M. & Somers, H.L. (1980). Case in linguistics and cognitive science. UEA Papers in Linguistics, 13, 1–29.
Google Scholar
Meteer, M., Schwarte, R. & Weischedel, R. (1991). POST: Using probabilities in language processing. Proceedings of the Twelfth International Conference on Artificial Intelligence, Sydney, Australia.
Google Scholar
Hobbs, J. et al. (1992). FASTUS: System summary. Unpublished manuscript.
Google Scholar
Myaeng, S. H. & Lopez-lopes, Aurelio (1992). A conceptual graph matching: a flexible algorithm and experiments. Journal of Experimental and Theoretical Artificial Intelligence, 4, 107–126.
Google Scholar
Liddy, E., Paik, W. & Woelfel, J. (1992). Use of subject field codes from a machine-readable dictionary for automatic classification of documents. Proc. of 3rd ASIS Classification Research Workshop.
Google Scholar
Myaeng, S. H. & Khoo, C. (1992). On uncertainty handling in plausible reasoning with conceptual graphs. Proc. of 7th Workshop on Conceptual Graphs, Las Craces, NM, July, 1992.
Google Scholar
Shafer, G. (1976). A Mathematical Theory of Evidence. Princeton, N.J.: Princeton University Press.
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information Studies, Syracuse University, USA
Sung H. Myaeng & Christopher Khoo
Department of Computer & Information Science, Syracuse University, USA
Ming Li

Authors

Sung H. Myaeng
View author publications
You can also search for this author in PubMed Google Scholar
Christopher Khoo
View author publications
You can also search for this author in PubMed Google Scholar
Ming Li
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

William M. Tepfenhart Judith P. Dick John F. Sowa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Myaeng, S.H., Khoo, C., Li, M. (1994). Linguistic processing of text for a large-scale conceptual Information Retrieval system. In: Tepfenhart, W.M., Dick, J.P., Sowa, J.F. (eds) Conceptual Structures: Current Practices. ICCS 1994. Lecture Notes in Computer Science, vol 835. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-58328-9_5

Download citation

DOI: https://doi.org/10.1007/3-540-58328-9_5
Published: 29 May 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-58328-8
Online ISBN: 978-3-540-38675-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics