Real-Time Statistical Speech Translation

Wołk, Krzysztof; Marasek, Krzysztof

doi:10.1007/978-3-319-05951-8_11

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 275)

Abstract

This research investigates statistical machine translation approaches to translating speech automatically in real time. Such systems can be used in a pipeline with speech recognition and speech synthesis software to produce a real-time voice communication system between speakers of different languages. We obtained three main data sets from spoken proceedings that represent three different types of human speech. The TED, Europarl, and OPUS parallel text corpora were used as the basis for training language models and for development tuning and testing of the translation system. We also conducted experiments involving part-of-speech tagging, compound splitting, linear language model interpolation, truecasing, and morphosyntactic analysis. We evaluated the effects of a variety of data preparations on the translation results using the BLEU, NIST, METEOR, and TER metrics, and tried to determine which metric is most suitable for the PL-EN (Polish-English) language pair.
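The abstract names linear language model interpolation without giving details, so the following is only a minimal sketch of the general idea, not the authors' setup. It assumes toy add-one-smoothed unigram models and made-up sentences. The function names (`unigram_model`, `interpolate`, `perplexity`) and the grid search over the weight are illustrative choices, not taken from the chapter.

```python
import math
from collections import Counter


def unigram_model(tokens, vocab):
    """Add-one smoothed unigram probabilities over a fixed vocabulary."""
    counts = Counter(tokens)
    total = len(tokens) + len(vocab)
    return {w: (counts[w] + 1) / total for w in vocab}


def interpolate(p_a, p_b, lam):
    """Linear interpolation: p(w) = lam * p_a(w) + (1 - lam) * p_b(w)."""
    return {w: lam * p_a[w] + (1.0 - lam) * p_b[w] for w in p_a}


def perplexity(model, tokens):
    """Perplexity of a unigram model on a token sequence."""
    log_sum = sum(math.log(model[w]) for w in tokens)
    return math.exp(-log_sum / len(tokens))


# Hypothetical toy data standing in for an in-domain corpus, an
# out-of-domain corpus, and a held-out tuning set.
in_domain = "thank you very much for this talk".split()
out_domain = "the parliament adopted the resolution".split()
held_out = "thank you for the talk".split()
vocab = set(in_domain) | set(out_domain) | set(held_out)

p_in = unigram_model(in_domain, vocab)
p_out = unigram_model(out_domain, vocab)

# Pick the interpolation weight that minimises held-out perplexity,
# using a coarse grid from 0.0 to 1.0 in steps of 0.1.
candidates = [i / 10 for i in range(11)]
best_lam = min(
    candidates,
    key=lambda lam: perplexity(interpolate(p_in, p_out, lam), held_out),
)
mixed = interpolate(p_in, p_out, best_lam)
```

In practice the component models would be higher-order n-gram models trained on full corpora, and the weight would be tuned on a development set. The structure stays the same: a convex combination of probabilities, chosen to minimise perplexity on held-out data.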

Author information

Authors and Affiliations

Department of Multimedia, Polish Japanese Institute of Information Technology, Koszykowa 86, 02-008, Warsaw, Poland
Krzysztof Wołk & Krzysztof Marasek

Corresponding author

Correspondence to Krzysztof Wołk.

Editor information

Editors and Affiliations

Universidade de Coimbra & LIACC, Rio Tinto, Portugal
Álvaro Rocha
Instituto Superior de Estatística e Gestão de Informação, Universidade Nova de Lisboa, Lisboa, Portugal
Ana Maria Correia
Department of Business Information Systems, Auckland University of Technology, Auckland, New Zealand
Felix B. Tan
Empirica GmbH, Bonn, Germany
Karl A. Stroetmann

About this paper

Cite this paper

Wołk, K., Marasek, K. (2014). Real-Time Statistical Speech Translation. In: Rocha, Á., Correia, A., Tan, F., Stroetmann, K. (eds) New Perspectives in Information Systems and Technologies, Volume 1. Advances in Intelligent Systems and Computing, vol 275. Springer, Cham. https://doi.org/10.1007/978-3-319-05951-8_11

DOI: https://doi.org/10.1007/978-3-319-05951-8_11
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05950-1
Online ISBN: 978-3-319-05951-8
eBook Packages: Engineering, Engineering (R0)
