Abstract
Continuous-density hidden Markov models (HMMs) are a popular approach to modeling sequential data, e.g. in automatic speech recognition (ASR), off-line handwritten text recognition, and bioinformatics. HMMs rely on strong statistical assumptions, notably the arbitrary parametric assumption on the form of the emission probability density functions (pdfs). This chapter proposes a nonparametric HMM based on connectionist estimates of the emission pdfs, featuring a global gradient-ascent training algorithm over the maximum-likelihood criterion. Robustness to noise may be further increased by a soft parameter-grouping technique, namely the introduction of adaptive amplitudes for the activation functions. Applications to ASR tasks are presented and analyzed, evaluating the behavior of the proposed paradigm and allowing for a comparison with standard HMMs with Gaussian mixtures, as well as with other state-of-the-art neural net/HMM hybrids.
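The core idea — neural estimates of the emission pdfs trained by global gradient ascent on the sequence likelihood — can be sketched in miniature. The toy below is an assumption-laden illustration, not the chapter's method: it uses a fixed 2-state HMM, reduces the "connectionist" emission model to one log-linear unit per state (whose exponential output is a positive score, not a properly normalized pdf as in the chapter), and replaces backpropagation through the forward recursion with finite-difference gradients.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy 2-state HMM with fixed transition matrix A and initial distribution pi
# (illustrative values, not from the chapter).
A = np.array([[0.9, 0.1],
              [0.2, 0.8]])
pi = np.array([0.5, 0.5])

# "Connectionist" emission model, reduced to one log-linear unit per state:
# b_i(x) = exp(w_i * x + c_i). The exponential keeps the score positive;
# unlike the chapter's networks, this sketch does not enforce that b_i
# is a normalized density.
params = rng.normal(scale=0.1, size=(2, 2))  # rows hold (w_i, c_i)

def emissions(x, p):
    """Emission scores b_i(x) for both states at scalar observation x."""
    return np.exp(p[:, 0] * x + p[:, 1])

def log_likelihood(obs, p):
    """Sequence log-likelihood via the forward algorithm, rescaling the
    forward variables alpha at each step for numerical stability."""
    alpha = pi * emissions(obs[0], p)
    ll = np.log(alpha.sum())
    alpha = alpha / alpha.sum()
    for x in obs[1:]:
        alpha = (alpha @ A) * emissions(x, p)
        ll += np.log(alpha.sum())
        alpha = alpha / alpha.sum()
    return ll

def ml_gradient_step(obs, p, lr=1e-3, eps=1e-5):
    """One global gradient-ascent step on the maximum-likelihood criterion,
    using finite differences in place of backpropagation."""
    grad = np.zeros_like(p)
    for idx in np.ndindex(*p.shape):
        bumped = p.copy()
        bumped[idx] += eps
        grad[idx] = (log_likelihood(obs, bumped) - log_likelihood(obs, p)) / eps
    return p + lr * grad

obs = rng.normal(size=20)             # synthetic observation sequence
before = log_likelihood(obs, params)  # likelihood under initial weights
params = ml_gradient_step(obs, params)
after = log_likelihood(obs, params)   # likelihood after one ascent step
```

A single small step along the likelihood gradient should increase the sequence log-likelihood. In the actual hybrid, the gradient of the forward likelihood is propagated analytically back through the emission networks, so all network weights and the HMM are optimized jointly under the same criterion.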
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
Cite this paper
Trentin, E. (2003). Nonparametric Hidden Markov Models: Principles and Applications to Speech Recognition. In: Apolloni, B., Marinaro, M., Tagliaferri, R. (eds) Neural Nets. WIRN 2003. Lecture Notes in Computer Science, vol 2859. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45216-4_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20227-1
Online ISBN: 978-3-540-45216-4