Definite Descriptions in an Information Extraction System

  • Conference paper
Advances in Artificial Intelligence (IBERAMIA 2000, SBIA 2000)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 1952)

Abstract

This paper presents an algorithm, based on heuristic rules, for resolving Spanish definite description references. The algorithm is applied to an information extraction system for Spanish. The heuristic rules are derived from the study of an unrestricted corpus. The algorithm resolves identity co-reference produced by a definite description whose relation to its antecedents can be resolved with syntactic or semantic information. The module achieves a precision of 95.3% in the classification task (anaphoric or non-anaphoric) and an average precision of 78% in …

Conference topics: Natural Language Processing

Copyright information

© 2000 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Palomar, M., Muñoz, R. (2000). Definite Descriptions in an Information Extraction System. In: Monard, M.C., Sichman, J.S. (eds) Advances in Artificial Intelligence. IBERAMIA 2000, SBIA 2000. Lecture Notes in Computer Science, vol 1952. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44399-1_33

  • DOI: https://doi.org/10.1007/3-540-44399-1_33

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41276-2

  • Online ISBN: 978-3-540-44399-5

  • eBook Packages: Springer Book Archive
