Towards a Speech Recognizer for Multiple Languages Using Arabic Acoustic Model Application to Amazigh Language

Sadiqui, Ali; Zinedine, Ahmed

doi:10.1007/978-3-319-73500-9_5

Ali Sadiqui^14,15 &
Ahmed Zinedine^14,15

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 782))

Included in the following conference series:

International Conference on Arabic Language Processing

953 Accesses

Abstract

The construction of acoustic models of a language, used in automatic speech recognition (ASR) systems, is a developed technology achievable without great difficulty when a large amount of speech and written corpus is available. However, these technological resources are not available in a large part of languages called “Less Resourced Languages”. An alternative solution is to take advantage of the phonetic structures shared between the different languages to build an acoustic model for the target language.

In this paper, we will return to an experiment in this direction. Indeed, we used an acoustic model of the Arabic language to create one for the Amazigh language. The originality of our work comes from the will to address this language which has become an official language in Morocco, and which has not enough resources for the automatic speech recognition. In addition, both languages share several phonemes and certain characteristics. The realized system has reached a recognition rate of about 73% by word. The potential and the effectiveness of the proposed approach is demonstrated by experiments and comparison with other approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Lê, V.B.: Reconnaissance automatique de la parole pour des langues peu dotées. thèse de doctorat, Joseph Fourier - Grenoble1 (2006)
Google Scholar
Rabiner, L.-R., Schafer, R.-W.: Digital Processing of Speech Signals. Prentice-Hall, Englewood Cliffs (1978)
Google Scholar
Jurafsky, D., Martin, J.H.: Speech and Language Processing, 2nd edn. Prentice Hall Inc, Englewood Cliffs (2008). Chapter 9 to end of Sect. 9.3
Google Scholar
Boite, R., Bourlard, H., Dutoit, T., Hancq, J., Leich, H.: Traitement de la parole. Presses Polytechniques et Universitaires Romandes, Collection Electricité, Lausanne, Switzerland (2000)
Google Scholar
Schultz, T., Waibel, A.: Language independent and language adaptive acoustic modeling for speech recognition. Speech Commun. 35, 31–51 (2001)
Article MATH Google Scholar
Lin, H., Deng, L., Droppo, J., Yu, D., Acero, A.: Learning methods in multilingual speech recognition. In: Proceedings of the NIPS, Vancouver, BC, Canada (2008)
Google Scholar
Byrne, W., et al.: Towards language independent acoustic modeling. In: Proceedings of the ICASSP (2000)
Google Scholar
Van Doremalen, J., Cucchiarini, C., Strik, H.: Optimizing automatic speech recognition for low-proficient non native speakers. EURASIP J. Audio Speech Music Process. 2010, 1–13 (2010)
Article Google Scholar
Heigold, G., Vanhoucke, V., Senior, A.W., Nguyen, P., Ranzato, M., Devin, M., Dean, J.: Multilingual acoustic models using distributed deep neural networks. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 8619–8623 (2013)
Google Scholar
Garcia, E., Mengusoglu, E., Janke, E.: Multilingual acoustic models for speech recognition in low-resource devices. In: Proceedings of the ICASSP (2007)
Google Scholar
De Wachter, M., Demuynck, K., van Compernolle, D., Wambaq, P.: Data driven example based continuous speech recognition. In: Proceedings of Eurospeech, Geneva, Switzerland, pp. 1133–1136 (2003)
Google Scholar
Schultz, T., Kirchhoff, K. (eds.): Multilingual Speech Processing. Academic Press, Amsterdam (2006)
Google Scholar
International Phonetic Association: Handbook of the International Phonetic Association: A Guide to the Use of the International Phonetic Alphabet, pp. 1–204 (1999)
Google Scholar
Open source speech recognition toolkit CMU sphinx. https://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/
Schultz, T.: GlobalPhone: A multilingual speech and text database developed at karlsruhe university. In: ICSLP 2002, Denver, CO, USA, Septembre 2002
Google Scholar
The GlobalPhone Project: http://www.cs.cmu.edu/~tanja/GlobalPhone
Ali Sadiqui, Nouredine Chenfour, Réalisation d’un système de reconnaissance automatique de la parole arabe basée sur CMU Sphinx, article publié sur «Annals. Computer Science Series» Tome 8, Avril 2010
Google Scholar
Arabic Speech Corpus. http://en.arabicspeechcorpus.com/
Greenberg J.: The Languages of Africa. The Hague (1966)
Google Scholar
Ouakrim, O.: Fonética y fonología del Bereber, Survey at the University of Autònoma de Barcelona (1995)
Google Scholar
El Barkani, B.: Le choix de la graphie tifinaghe pour enseigner, apprendre l’amazighe au Maroc: conditions, reprrésentation et pratiques. Linguistique. Université Jean Monnet -Saint-Etienne. Français (2010)
Google Scholar
The Royal Institute of Amazigh Culture. http://www.ircam.ma
Leggetter, C.-J., Woodland, P.-C.: Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Comput. Speech Lang. 9(2), 171–185 (1995)
Article Google Scholar
Gauvain, J.-L., Lee, C.-H.: Maximum a posteriori estimation for multi-variate gaussian mixture observations of markov chains. IEEE Trans. Speech Audio Process. 2(2), 291–298 (1994)
Article Google Scholar
Wang, Z., Schultz, T.: Non-native spontaneous speech recognition through polyphone decision tress specialization. In: Eurospeech 2003, pp. 1449–1452, Geneva, Switzerland, September 2003
Google Scholar
Open source speech recognition toolkit CMU sphinx. http://cmusphinx.sourceforge.net/wiki/tutorialadapt

Download references

Author information

Authors and Affiliations

OFPPT, ISTA Meknès, Meknès, Morocco
Ali Sadiqui & Ahmed Zinedine
Faculté des Sciences Dhar El Mehraz, Atlas, B.P. 1796, Fès, Morocco
Ali Sadiqui & Ahmed Zinedine

Authors

Ali Sadiqui
View author publications
You can also search for this author in PubMed Google Scholar
Ahmed Zinedine
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Ali Sadiqui or Ahmed Zinedine .

Editor information

Editors and Affiliations

Ex ENSA-USMBA, Fez, Morocco
Abdelmonaime Lachkar
EMI, UM5, Rabat, Morocco
Karim Bouzoubaa
FS, UMP, Oujda, Morocco
Azzedine Mazroui
IERA, UM5, Rabat, Morocco
Abdelfettah Hamdani
FS, UMP, Oujda, Morocco
Abdelhak Lekhouaja

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sadiqui, A., Zinedine, A. (2018). Towards a Speech Recognizer for Multiple Languages Using Arabic Acoustic Model Application to Amazigh Language. In: Lachkar, A., Bouzoubaa, K., Mazroui, A., Hamdani, A., Lekhouaja, A. (eds) Arabic Language Processing: From Theory to Practice. ICALP 2017. Communications in Computer and Information Science, vol 782. Springer, Cham. https://doi.org/10.1007/978-3-319-73500-9_5

Download citation

DOI: https://doi.org/10.1007/978-3-319-73500-9_5
Published: 05 January 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73499-6
Online ISBN: 978-3-319-73500-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics