Abstract
This research investigates the Statistical Machine Translation approaches to translate speech in real time automatically. Such systems can be used in a pipeline with speech recognition and synthesis software in order to produce a real-time voice communication system between foreigners. We obtained three main data sets from spoken proceedings that represent three different types of human speech. TED, Europarl, and OPUS parallel text corpora were used as the basis for training of language models, for developmental tuning and testing of the translation system. We also conducted experiments involving part of speech tagging, compound splitting, linear language model interpolation, TrueCasing and morphosyntactic analysis. We evaluated the effects of variety of data preparations on the translation results using the BLEU, NIST, METEOR and TER metrics and tried to give answer which metric is most suitable for PL-EN language pair.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Koehn, P., Hoang, H.: Moses: Open Source Toolkit for Statistical Machine Translation, Prague (2007)
Marasek, K.: TED Polish-to-English translation system for the IWSLT 2012. In: IWSLT 2012, Hong Kong (2012)
Costa-Jussa, M., Fonollosa, J.: Using linear interpolation and weighted reordering hypotheses in the Moses system, Barcelona, Spain (2010)
Stolcke, A.: SRILM – An Extensible Language Modeling Toolkit. In: INTERSPEECH (2002)
Hsu, P., Glass, J.: Iterative Language Model Estimation: Efficient Data Structure & Algorithms, Cambridge, USA (2008)
Bojar, O.: Rich Morphology and What Can We Expect from Hybrid Approaches to MT. In: LIHMT 2011 (2011)
Radziszewski, A.: A tiered CRF tagger for Polish. In: Bembenik, R., Skonieczny, Ł., Rybiński, H., Kryszkiewicz, M., Niezgódka, M. (eds.) Intell. Tools for Building a Scientific Information. SCI, vol. 467, pp. 215–230. Springer, Heidelberg (2013)
Koehn, P., Hoang, H.: Factored Translation Models, Scotland, United Kingdom (2007)
Ratnaparkhi, A.: A Maximum Entropy Part-Of-Speech Tagger, Pennsylvania (1996)
Holz, F., Biemann, C.: Unsupervised and knowledge-free learning of compound splits and periphrases. In: Gelbukh, A. (ed.) CICLing 2008. LNCS, vol. 4919, pp. 117–127. Springer, Heidelberg (2008)
Cer, D., Manning, C., Jurafsky, D.: The Best Lexical Metric for Phrase-Based Statistical MT System Optimization. Stanford, USA (2010)
Gao, Q., Vogel, S.: Parallel Implementations of Word Alignment Tool (2008)
Heafield, K.: KenLM: Faster and smaller language model queries. Association for Computational Linguistics (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Wołk, K., Marasek, K. (2014). Real-Time Statistical Speech Translation. In: Rocha, Á., Correia, A., Tan, F., Stroetmann, K. (eds) New Perspectives in Information Systems and Technologies, Volume 1. Advances in Intelligent Systems and Computing, vol 275. Springer, Cham. https://doi.org/10.1007/978-3-319-05951-8_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-05951-8_11
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05950-1
Online ISBN: 978-3-319-05951-8
eBook Packages: EngineeringEngineering (R0)