An Annotated Corpus and a Grammar Model of Theorem Description

Baba, Yusuke; Suzuki, Masakazu

doi:10.1007/3-540-36469-2_8

Yusuke Baba⁴ &
Masakazu Suzuki⁵

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2594))

Included in the following conference series:

International Conference on Mathematical Knowledge Management

351 Accesses
2 Citations
1 Altmetric

Abstract

Digitizing documents is becoming increasingly popular in various fields, and training computers to understand the contents of digitized documents is of growing interest. Since the early 90’s, research of natural language processing using large annotated corpora such as the Penn TreeBank has developed. Applying the methods of corpus-based research, we built a syntactically annotated corpus of theorem descriptions, using a book of set theory, and extracted a grammar model of theorems from the obtained corpus, as the first step to understanding mathematical documents by computer.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Inoue, K., Miyazaki, R., Suzuki, M.: Optical Recognition of Printed Mathematical Documents. Proceedings of the Third Asian Technology Conference in Mathematics. Springer-Verlag. (1998) 280–289
Google Scholar
Eto, Y., Suzuki, M.: Mathematical Formula Recognition Using Virtual Link Network. Proceedings of the Sixth International Conference on Document Analysis and Recognition. Seattle. IEEE Computer Society Press. (2001) 430–437
Google Scholar
Michler, G.: A prototype of a combined digital and retrodigitaized searchable mathematical journal. Lecture Notes in Control and Infomation Sciences. 249 (1999) 219–235
Google Scholar
Michler, G.: Report on the retrodigitiization project “Archiv der Mathemark”. Archiv der Mathemark. 77 (2001) 116–128
MATH Google Scholar
Marcus, M., et al.: Building a large annotated corpus of English:the Penn Tree-Bank. In the distributed Penn TreeBank Project CD-ROM. Linguistic Data Consortium. University of Pennsylvania
Google Scholar
Sekine, S., Grishman, R.: A Corpus-based Probabilistic Grammar with Only Two Non-terminals. Fourth International Workshop on Parsing Technology. (1995)
Google Scholar
J Cameron, P.: Sets, Logic and Categories. Springer. (1999)
Google Scholar
Rotman, J.: Galois theory(2nd ed.). Springer. (1998)
Google Scholar
Hodges, W.: A shorter model theory. Cambridge University Press. (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

Graduate School of Mathematics, Kyushu University, Japan
Yusuke Baba
Faculty of Mathematics, Kyushu University, 6-10-1 Hakozaki, 812-8581, Higashi-ku, Fukuoka, Japan
Masakazu Suzuki

Authors

Yusuke Baba
View author publications
You can also search for this author in PubMed Google Scholar
Masakazu Suzuki
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of Bologna, Mura Anteo Zamboni, 7, 40127, Bologna, Italy
Andrea Asperti
Research Institute for Symbolic Computation, Johannes Kepler University, 4232 Castle of Hagenberg, Austria
Bruno Buchberger
Departments of Mathematical Sciences and Computer Science, University of Bath, BA2 7AY, Bath, England
James Harold Davenport

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Baba, Y., Suzuki, M. (2003). An Annotated Corpus and a Grammar Model of Theorem Description. In: Asperti, A., Buchberger, B., Davenport, J.H. (eds) Mathematical Knowledge Management. MKM 2003. Lecture Notes in Computer Science, vol 2594. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36469-2_8

Download citation

DOI: https://doi.org/10.1007/3-540-36469-2_8
Published: 30 January 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00568-1
Online ISBN: 978-3-540-36469-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics