Skip to main content

Review and Evaluation of DiZer – An Automatic Discourse Analyzer for Brazilian Portuguese

  • Conference paper
Computational Processing of the Portuguese Language (PROPOR 2006)

Abstract

This paper presents the review and evaluation of DiZer – an automatic discourse analyzer for Brazilian Portuguese. Based on Rhetorical Structure Theory, DiZer is a symbolic analyzer that makes use of linguistic patterns learned from a corpus of scientific texts to identify and build the discourse structure of texts. DiZer evaluation shows satisfactory results for scientific texts. In order to test its portability, DiZer is also evaluated with news texts and presents acceptable performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Aires, R.V.X., Aluísio, S.M., Kuhn, D.C.S., Andreeta, M.L.B., Oliveira Jr., O.N.: Combining Multiple Classifiers to Improve Part of Speech Tagging: A Case Study for Brazilian Portuguese. In: The Proceedings of the Brazilian AI Symposium – SBIA, pp. 20–22 (2000)

    Google Scholar 

  • Carlson, L., Marcu, D.: Discourse Tagging Reference Manual. ISI Technical Report ISI-TR-545 (2001)

    Google Scholar 

  • Corston-Oliver, S.: Computing Representations of the Structure of Written Discourse. PhD Thesis, University of California, Santa Barbara, CA, USA (1998)

    Google Scholar 

  • Cristea, D., Ide, N., Romary, L.: Veins Theory, An Approach to Global Cohesion and Coherence. In: The Proceedings of Coling/ACL (1998)

    Google Scholar 

  • Grosz, B., Sidner, C.: Attention, Intentions, and the Structure of Discourse. Computational Linguistics 12(3) (1986)

    Google Scholar 

  • Jordan, M.P.: An Integrated Three-Pronged Analysis of a Fund-Raising Letter. In: Mann, W.C., Thompson, S.A. (eds.) Discourse Description: Diverse Linguistic Analyses of a Fund-Raising Text, pp. 171–226 (1992)

    Google Scholar 

  • Kehler, A.: Coherence, Reference and the Theory of Grammar. CSLI Publications (2002)

    Google Scholar 

  • Mann, W.C. and Thompson, S.A, Rhetorical Structure Theory: A Theory of Text Organization. Technical Report ISI/RS-87-190 (1987)

    Google Scholar 

  • Marcu, D.: The Rhetorical Parsing, Summarization, and Generation of Natural Language Texts. PhD Thesis, Department of Computer Science, University of Toronto (1997)

    Google Scholar 

  • Marcu, D.: The Theory and Practice of Discourse Parsing and Summarization. The MIT Press, Cambridge (2000)

    Google Scholar 

  • O’Donnell, M.: Variable-Length On-Line Document Generation. In: The Proceedings of the 6th European Workshop on Natural Language Generation. Gerhard-Mercator University, Duisburg (1997)

    Google Scholar 

  • Pardo, T.A.S.: Métodos para Análise Discursiva Automática. PhD Thesis. Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo. São Carlos-SP, June 2005, 211p. (2005)

    Google Scholar 

  • Pardo, T.A.S., Nunes, M.G.V.: A Construção de um Corpus de Textos Científicos em Português do Brasil e sua Marcação Retórica. Technical Report N. 212. Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo. São Carlos-SP, September 2003, 26p. (2003)

    Google Scholar 

  • Pardo, T.A.S., Nunes, M.G.V.: Relações Retóricas e seus Marcadores Superficiais: Análise de um Corpus de Textos Científicos em Português do Brasil. Technical Report N. 231. Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo. São Carlos-SP, April 2004, 73p. (2004)

    Google Scholar 

  • Pardo, T.A.S., Nunes, M.G.V., Rino, L.H.M.: DiZer: An Automatic Discourse Analyzer for Brazilian Portuguese. In: Bazzan, A.L.C., Labidi, S. (eds.) SBIA 2004. LNCS (LNAI), vol. 3171, pp. 224–234. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  • Pardo, T.A.S., Seno, E.M.R.: Rhetalho: um corpus de referência anotado retoricamente. In: Anais do V Encontro de Corpora. São Carlos-SP, November 24-25 (2005)

    Google Scholar 

  • Pereira, F.C.N., Warren, D.H.D.: Definite Clause Grammars for Language Analysis – A Survey of the Formalism and Comparison with Augmented Transition Networks. In: Artificial Intelligence, vol. 13, pp. 231–278 (1980)

    Google Scholar 

  • Schauer, H.: Referential Structure and Coherence Structure. In: The Proceedings of TALN. Lausanne, Switzerland (2000)

    Google Scholar 

  • Soricut, R., Marcu, D.: Sentence Level Discourse Parsing using Syntactic and Lexical Information. In: The Proceedings of HLT/NAACL (2003)

    Google Scholar 

  • Sumita, K., Ono, K., Chino, T., Ukita, T., Amano, S.: A discourse structure analyzer for Japonese text. In: The Proceedings of the International Conference on Fifth Generation Computer Systems, Tokyo, Japan, vol. 2, pp. 1133–1140 (1992)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pardo, T.A.S., Nunes, M.d.G.V. (2006). Review and Evaluation of DiZer – An Automatic Discourse Analyzer for Brazilian Portuguese. In: Vieira, R., Quaresma, P., Nunes, M.d.G.V., Mamede, N.J., Oliveira, C., Dias, M.C. (eds) Computational Processing of the Portuguese Language. PROPOR 2006. Lecture Notes in Computer Science(), vol 3960. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11751984_19

Download citation

  • DOI: https://doi.org/10.1007/11751984_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-34045-4

  • Online ISBN: 978-3-540-34046-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics