Abstract
We describe a streamlined knowledge acquisition method for semi-automatically constructing knowledge bases for a Knowledge Based Machine Translation (KBMT) system. This method forms the basis of a very simple Java-based user interface that enables a language expert to build lexical and syntactic transfer knowledge bases without extensive specialized training as an MT system builder. Following [Wu 1997], we assume that the permutation of binary-branching structures is a sufficient reordering mechanism for MT. Our syntactic knowledge is based on a novel, highly constrained grammar construction environment in which the only re-ordering mechanism is the permutation of binary-branching structures (Twisted Pair Grammar). We describe preliminary results for several fully implemented components of a Hindi/Urdu to English MT prototype being built with this interface.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bains, Gurprit. 1989. Complex Structures in Hindi-Urdu: Explorations in Government and Binding Theory. Doctoral Dissertation, New York University.
Barker, Muhamed ABD-AL-Rahman, Hasan Jahangir Hamdani, Khwaja Muhammad Shafi Dihlari, and Shafiqur Rahman. 1993. Spoken Urdu: A Course in Urdu. Volume 1. Spoken Language Service, Inc.
Brill, Eric. 1995. Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part of Speech Tagging. (ACL)
Chomsky, Noam. 1995. The Minimalist Program. Current Studies in Linguistics. MIT Press.
Michael Collins. 1997. Three Generative, Lexicalised Models for Statistical Parsing. Proceedings of the 35th Annual Meeting of the ACL (jointly with the 8th Conference of the Linguistics. EACL), Madrid.
Davison, Alice. 1988. “Constituent Structures and the Realization of Agreement Features.” Proceedings of the 24th Meeting of the Chicago Linguistic Society, Parasession of Agreement in Grammatical Theory.
Dorr, Bonnie. 1993. Machine Translation: A View from the Lexicon. MIT Press.
Fung, Pascal and Dekai Wu. 1994. Statistical Augmentation of a Chinese Machine-Readable Dictionary. Available as cmp-lg/9406015.
Hatzivassiloglou, Vasileios and Kevin Knight. “Unification-Based Glossing”. 1995. Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI).
Kayne, Richard. 1994. The Antisymmetry of Syntax. Linguistic Inquiry Monograph 25. MIT Press.
Kayne, Richard. 1983. Connectedness and Binary Branching. Studies in Generative Grammar 16. Foris Publications.
Knight, Kevin and Steve Luk. “Building a Large-Scale Knowledge Base For Machine Translation. 1994. Proceedings of American Association for Artificial Intelligence.
Knight, Kevin, Ishwar Chandler, Matthew Haines, Vasileios Hatzivassigloglou, Eduard Hovy, Masayo Ida, Steve Luk, Richard Whitney, and Kenji Yamada. 1994. Integrating Knowledge Bases and Statistics in MT (AMTA-94)
Knight, Kevin, Ishwar Chandler, Matthew Haines, Vasileios Hatzivassigloglou, Eduard Hovy, Masayo Ida, Steve Luk, Akitoshi Okumura, Richard Whitney, and Kenji Yamada 1995. “Filing Knowledge Gaps in a Broad-Coverage Machine Translation System”. (IJCAI-95).
Nirenburg, Sergei and Victor Raskin. 1998. “Universal Grammar and Lexis for Quick Ramp-Up of MT Systems. (manuscript)
Nirenburg, Sergei, Jaime Carbonnel, M. Tomita, and K. Goodman. 1992. Machine Translation: A Knowledge-Based Approach. Morgan Kaufmann.
Shieber, Stuart. 1985. An Introduction to Unification-Based Approaches to Grammar. Center for the Study of Language and Information, University of Chicago Press.
Wu, Dekai and Xuanyin Xia. 1995. Large-Scale Automatic Extraction of an English-Chinese Translation Lexicon. Machine Translation. 9:3, 1–28.
Wu, Dekai. 1994. Aligning a Parallel English-Chinese Corpus Statistically with Lexical Criteria. Available as cmp-lg/9406007.
Wu, Dekai. 1997. Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora. Computational Linguistics. 21(4):377–404.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Jones, D., Havrilla, R. (1998). Twisted Pair Grammar: Support for Rapid Development of Machine Translation for Low Density Languages. In: Farwell, D., Gerber, L., Hovy, E. (eds) Machine Translation and the Information Soup. AMTA 1998. Lecture Notes in Computer Science(), vol 1529. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49478-2_29
Download citation
DOI: https://doi.org/10.1007/3-540-49478-2_29
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65259-5
Online ISBN: 978-3-540-49478-2
eBook Packages: Springer Book Archive