Abstract
One of the main difficulties in building NLG systems is to produce a good requirement specification. A way to face this problem is by using a corpus to show the client the features of the system to be developed. In this paper we describe a method to elaborate that corpus and how can be used for a particular system. This method consists of five steps: text collection, input determination, text and input analysis, corpus construction, and pattern extraction.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aguado, G., Bañón, A., Bateman, J., Bernardos, S., Fernández, M., Gómez, A., Nieto, E., Olalla, A., Plaza, R., Sánchez. A.: Ontogeneration: Reusing Domain and Linguistic Ontologies for Spanish Text Generation. Workshop on Applications of Ontologies and Problem Solving Methods, ECAI.98, Brighton (1998)
Bañón, A.: Modelo de generación multisentencial EPRS. Trabajo Fin de Carrera, Facultad de Informática, Universidad Politécnica de Madrid, Madrid (1999)
Bateman, J. A.: Enabling Technology for multilingual natural language generation: the KPML development environment. In Natural Language Engineering, Vol. 1. Cambridge University Press, Cambridge (1997) 1–42
Bernardos, S.: GUME: Extensión de la Ontología GUM para el Español. Trabajo Fin de Carrera, Facultad de Informática, Universidad Politécnica de Madrid. Madrid (1997)
Dale, R. and Reiter, E.: Tutorial on Building applied Natural Language Generation Systems. ANLP-97 (1997)
Francis, W.N.: Language Corpora B.C. in J. Svartvik (ed) Directions in Corpus Linguistics. Proceedings of the Nobel Symposium 82. Berlin. Mouton de Gruyter (1992)
Fernández, M.: Chemicals: Una Ontología de Elementos Químicos, Trabajo Fin de Carrera, Facultad de Informática, Universidad Politécnica de Madrid (1996)
Fernández M. A. et al: Ciencias Naturales GAIA. Vincens-Vives. Barcelona. España (1995)
Halliday, M. A. M.: An Introduction to Functional Grammar. Edward Arnold, London (1985)
Lasheras, A. L. and Carretero, M.P. Física y Química. POSITRON Vincens-Vives. Barcelona. España (1987)
Nieto, E.: Metodología para adaptar una gramítica sistémica-funcional para la generación en castellano. TFC, Facultad de Informática, Universidad Politécnica de Madrid (1999)
Olalla. A.: Sistema de GLN basado en ontologás: Ontogeneration. Trabajo Fin de Carrera, Facultad de Informática, Universidad Politŋica de Madrid (1999)
Pressman, R.: Software Engineering: A Practitioner’s Approach. McGraw-Hill (1994)
Reiter E., and Dale, R.: Building Applied Natural language Generation Systems. Journal of Natural Language Engineering, 3(1). (1997). 57–87
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
del Socorro Bernardos Galindo, M., de Cea, G.A. (2001). A New Approach in Building a Corpus for Natural Language Generation Systems. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2001. Lecture Notes in Computer Science, vol 2004. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44686-9_24
Download citation
DOI: https://doi.org/10.1007/3-540-44686-9_24
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41687-6
Online ISBN: 978-3-540-44686-6
eBook Packages: Springer Book Archive