Skip to main content

A New Approach in Building a Corpus for Natural Language Generation Systems

  • Conference paper
  • First Online:
Computational Linguistics and Intelligent Text Processing (CICLing 2001)

Abstract

One of the main difficulties in building NLG systems is to produce a good requirement specification. A way to face this problem is by using a corpus to show the client the features of the system to be developed. In this paper we describe a method to elaborate that corpus and how can be used for a particular system. This method consists of five steps: text collection, input determination, text and input analysis, corpus construction, and pattern extraction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aguado, G., Bañón, A., Bateman, J., Bernardos, S., Fernández, M., Gómez, A., Nieto, E., Olalla, A., Plaza, R., Sánchez. A.: Ontogeneration: Reusing Domain and Linguistic Ontologies for Spanish Text Generation. Workshop on Applications of Ontologies and Problem Solving Methods, ECAI.98, Brighton (1998)

    Google Scholar 

  2. Bañón, A.: Modelo de generación multisentencial EPRS. Trabajo Fin de Carrera, Facultad de Informática, Universidad Politécnica de Madrid, Madrid (1999)

    Google Scholar 

  3. Bateman, J. A.: Enabling Technology for multilingual natural language generation: the KPML development environment. In Natural Language Engineering, Vol. 1. Cambridge University Press, Cambridge (1997) 1–42

    Google Scholar 

  4. Bernardos, S.: GUME: Extensión de la Ontología GUM para el Español. Trabajo Fin de Carrera, Facultad de Informática, Universidad Politécnica de Madrid. Madrid (1997)

    Google Scholar 

  5. Dale, R. and Reiter, E.: Tutorial on Building applied Natural Language Generation Systems. ANLP-97 (1997)

    Google Scholar 

  6. Francis, W.N.: Language Corpora B.C. in J. Svartvik (ed) Directions in Corpus Linguistics. Proceedings of the Nobel Symposium 82. Berlin. Mouton de Gruyter (1992)

    Google Scholar 

  7. Fernández, M.: Chemicals: Una Ontología de Elementos Químicos, Trabajo Fin de Carrera, Facultad de Informática, Universidad Politécnica de Madrid (1996)

    Google Scholar 

  8. Fernández M. A. et al: Ciencias Naturales GAIA. Vincens-Vives. Barcelona. España (1995)

    Google Scholar 

  9. Halliday, M. A. M.: An Introduction to Functional Grammar. Edward Arnold, London (1985)

    Google Scholar 

  10. Lasheras, A. L. and Carretero, M.P. Física y Química. POSITRON Vincens-Vives. Barcelona. España (1987)

    Google Scholar 

  11. Nieto, E.: Metodología para adaptar una gramítica sistémica-funcional para la generación en castellano. TFC, Facultad de Informática, Universidad Politécnica de Madrid (1999)

    Google Scholar 

  12. Olalla. A.: Sistema de GLN basado en ontologás: Ontogeneration. Trabajo Fin de Carrera, Facultad de Informática, Universidad Politŋica de Madrid (1999)

    Google Scholar 

  13. Pressman, R.: Software Engineering: A Practitioner’s Approach. McGraw-Hill (1994)

    Google Scholar 

  14. Reiter E., and Dale, R.: Building Applied Natural language Generation Systems. Journal of Natural Language Engineering, 3(1). (1997). 57–87

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

del Socorro Bernardos Galindo, M., de Cea, G.A. (2001). A New Approach in Building a Corpus for Natural Language Generation Systems. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2001. Lecture Notes in Computer Science, vol 2004. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44686-9_24

Download citation

  • DOI: https://doi.org/10.1007/3-540-44686-9_24

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41687-6

  • Online ISBN: 978-3-540-44686-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics