HMM/ANN System for Vietnamese Continuous Digit Recognition

  • Conference paper
Developments in Applied Artificial Intelligence (IEA/AIE 2003)

Abstract

This paper describes a system for Vietnamese continuous digit recognition. Hybrid HMM/ANN recognizers were developed and implemented with the CSLU Toolkit. Experiments used a corpus of 442 sentences (2340 words) extracted from two telephone-speech corpora: “22 Language v1.2” and “Multi-Language Telephone Speech v1.2”. In these experiments, a context-dependent phoneme recognizer outperformed both a context-dependent demi-syllable recognizer and a context-independent phoneme recognizer. Among the feature sets applied to the context-dependent phoneme recognizer, 12 PLP features with cepstral mean subtraction (CMS), energy, and the corresponding delta values gave the best result: 96.83% word accuracy and 87.67% sentence correct.
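
The abstract reports two scores, word accuracy and sentence correct, without defining them. The sketch below shows one common convention, which is an assumption here and not necessarily the scoring used by the authors or the CSLU Toolkit: word accuracy is (N - S - D - I) / N over a minimum-edit alignment of reference and recognized word strings, and sentence correct is the share of sentences recognized with no errors. The function names `edit_counts` and `score` and the toy transcripts are hypothetical.

```python
from typing import List, Sequence, Tuple


def edit_counts(ref: Sequence[str], hyp: Sequence[str]) -> Tuple[int, int, int]:
    """Return (substitutions, deletions, insertions) for a minimum-edit alignment."""
    rows, cols = len(ref) + 1, len(hyp) + 1
    # table[i][j] = (total_errors, subs, dels, ins) aligning ref[:i] with hyp[:j]
    table = [[(0, 0, 0, 0)] * cols for _ in range(rows)]
    for i in range(1, rows):
        table[i][0] = (i, 0, i, 0)          # only deletions
    for j in range(1, cols):
        table[0][j] = (j, 0, 0, j)          # only insertions
    for i in range(1, rows):
        for j in range(1, cols):
            diag = table[i - 1][j - 1]
            if ref[i - 1] == hyp[j - 1]:
                match = diag                 # correct word, no new error
            else:
                match = (diag[0] + 1, diag[1] + 1, diag[2], diag[3])
            up = table[i - 1][j]
            deletion = (up[0] + 1, up[1], up[2] + 1, up[3])
            left = table[i][j - 1]
            insertion = (left[0] + 1, left[1], left[2], left[3] + 1)
            # Tuples compare by total error count first, so min() picks
            # a lowest-cost alignment.
            table[i][j] = min(match, deletion, insertion)
    _, subs, dels, ins = table[-1][-1]
    return subs, dels, ins


def score(refs: List[str], hyps: List[str]) -> Tuple[float, float]:
    """Return (word accuracy %, sentence correct %) under the assumed definitions."""
    total_words = errors = correct_sentences = 0
    for ref, hyp in zip(refs, hyps):
        ref_words, hyp_words = ref.split(), hyp.split()
        subs, dels, ins = edit_counts(ref_words, hyp_words)
        total_words += len(ref_words)
        errors += subs + dels + ins
        correct_sentences += int(ref_words == hyp_words)
    word_accuracy = 100.0 * (total_words - errors) / total_words
    sentence_correct = 100.0 * correct_sentences / len(refs)
    return word_accuracy, sentence_correct


if __name__ == "__main__":
    # Toy digit strings (ASCII spellings), not data from the paper.
    references = ["mot hai ba", "bon nam sau bay"]
    recognized = ["mot hai ba", "bon nam bay"]
    wa, sc = score(references, recognized)
    print(f"word accuracy: {wa:.2f}%  sentence correct: {sc:.2f}%")
```

Under these definitions a single word error makes the whole sentence incorrect, which is consistent with sentence correct (87.67%) being lower than word accuracy (96.83%) in the reported result.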

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Duc, D.N., Hosom, J.P., Mai, L.C. (2003). HMM/ANN System for Vietnamese Continuous Digit Recognition. In: Chung, P.W.H., Hinde, C., Ali, M. (eds) Developments in Applied Artificial Intelligence. IEA/AIE 2003. Lecture Notes in Computer Science, vol 2718. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45034-3_48

  • DOI: https://doi.org/10.1007/3-540-45034-3_48

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40455-2

  • Online ISBN: 978-3-540-45034-4

  • eBook Packages: Springer Book Archive
