Definite Descriptions in an Information Extraction System

Palomar, Manuel; Muñoz, Rafael

doi:10.1007/3-540-44399-1_33

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 1952)

Abstract

This paper presents an algorithm based on heuristic rules for resolving the references made by Spanish definite descriptions. The algorithm is applied in an information extraction system for Spanish. The heuristic rules were derived from the study of an unrestricted corpus. The algorithm resolves identity co-reference produced by a definite description whose relation to its antecedent can be resolved with syntactic or semantic information. The module achieves a precision of 95.3% in the classification task (anaphoric or non-anaphoric) and an average precision of 78% in …

Conference topics: Natural Language Processing

Author information

Authors and Affiliations

Grupo de investigación en Procesamiento del Lenguaje y Sistemas de Información, Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante, Apartado 99, 03080, Alicante, Spain
Manuel Palomar & Rafael Muñoz

Editor information

Editors and Affiliations

Department of Computer Science and Statistics, Computational Intelligence Laboratory, University of São Paulo, Avenida Trabalhador Sãocarlense 400, 13566-590, São Carlos, Brazil
Maria Carolina Monard
Computer Engineering Department, Intelligent Techniques Laboratory, University of São Paulo, Av. Prof. Luciano Gualberto, 158, tv. 3, 05508-900, São Paulo, Brazil
Jaime Simão Sichman

Cite this paper

Palomar, M., Muñoz, R. (2000). Definite Descriptions in an Information Extraction System. In: Monard, M.C., Sichman, J.S. (eds) Advances in Artificial Intelligence. IBERAMIA-SBIA 2000. Lecture Notes in Computer Science, vol 1952. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44399-1_33

DOI: https://doi.org/10.1007/3-540-44399-1_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41276-2
Online ISBN: 978-3-540-44399-5
eBook Packages: Springer Book Archive
