Skip to main content

Maximizing Edit Distance Accuracy with Hidden Conditional Random Fields

  • Conference paper
Computer Analysis of Images and Patterns (CAIP 2013)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8047))

Included in the following conference series:

Abstract

Handwriting recognition aims at predicting a sequence of characters from an image of a handwritten text. Main approaches rely on learning statistical models such as Hidden Markov Models or Conditional Random Fields, whose quality is measured through character and word error rates while they are usually not trained to optimize such criterion. We propose an efficient method for learning Hidden Conditional Random Fields to optimize the error rate within the large margin framework.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Woodland, P.C., Povey, D.: Large scale discriminative training of hidden markov models for speech recognition. Computer Speech & Language (1) (2002)

    Google Scholar 

  2. Juang, B.H., Katagiri, S.: Discriminative learning for minimum error classification. IEEE Transactions on Signal Processing (12) (1992)

    Google Scholar 

  3. Fu, Q., He, X., Deng, L.: Phone-discriminating minimum classification error (p-mce) training for phonetic recognition. In: Interspeech (2007)

    Google Scholar 

  4. He, X., Deng, L., Chou, W.: A novel learning method for hidden markov models in speech and audio processing. In: Multimedia Signal Processing. IEEE (2006)

    Google Scholar 

  5. Povey, D., Woodland, P.C.: Minimum phone error and i-smoothing for improved discriminative training. In: ICASSP, vol. 1, p. I–105. IEEE (2002)

    Google Scholar 

  6. Deng, L., Wu, J., Droppo, J., Acero, A.: Analysis and comparison of two speech feature extraction/compensation algorithms. In: SPL (2005)

    Google Scholar 

  7. Cheng, C.-C., Sha, F., Saul, L.K.: Online learning and acoustic feature adaptation in large-margin hidden markov models. JSP (6) (December 2010)

    Google Scholar 

  8. Sha, F., Saul, L.K.: Large margin hidden markov models for automatic speech recognition. In: NIPS (2007)

    Google Scholar 

  9. Cheng, C.C., Sha, F., Saul, L.K.: A fast online algorithm for large margin training of continuous density hidden markov models. In: Interspeech (2009)

    Google Scholar 

  10. Do, T.M.T., Artieres, T.: Maximum margin training of gaussian hmms for handwriting recognition. In: ICDAR, pp. 976–980. IEEE Computer Society (2009)

    Google Scholar 

  11. Yu, D., Deng, L., He, X., Acero, A.: Large-margin minimum classification error training for large-scale speech recognition tasks. In: ICASSP (2007)

    Google Scholar 

  12. Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: ICML Workshop (2001)

    Google Scholar 

  13. Gunawardana, A., Mahajan, M., Acero, A., Platt, J.C.: Hidden conditional random fields for phone classification. In: Interspeech (2005)

    Google Scholar 

  14. Do, T.-M.-T., Artieres, T.: Conditional random fields for online handwriting recognition. In: ICFHR (2006)

    Google Scholar 

  15. Morency, L.P., Quattoni, A., Darrell, T.: Latent-dynamic discriminative models for continuous gesture recognition. In: CPVR, pp. 1–8. IEEE (2007)

    Google Scholar 

  16. Wang, Y., Mori, G.: Max-margin hidden conditional random fields for human action recognition. In: CVPR, pp. 872–879. IEEE (2009)

    Google Scholar 

  17. Vinel, A., Do, T.M.T., Artières, T.: Joint optimization of hidden conditional random fields and non linear feature extraction. In: ICDAR (2011)

    Google Scholar 

  18. Soullard, Y., Artieres, T.: Hybrid hmm and hcrf model for sequence classification. In: ESANN (2011)

    Google Scholar 

  19. Reiter, S., Schuller, B., Rigoll, G.: Hidden conditional random fields for meeting segmentation. In: Multimedia and Expo. IEEE (2007)

    Google Scholar 

  20. Taskar, B., Guestrin, C., Koller, D.: Max-margin markov networks. In: NIPS (2003)

    Google Scholar 

  21. Do, T.M.T., Artières, T.: Large margin training for hidden markov models with partially observed states. In: ICML (2009)

    Google Scholar 

  22. Keshet, J., Cheng, C.-C., Stoehr, M., McAllester, D.A.: Direct error rate minimization of hidden markov models. In: Interspeech (2011)

    Google Scholar 

  23. Tsochantaridis, I., Joachims, T., Hofmann, T., Altun, Y.: Large margin methods for structured and interdependent output variables. JMLR (2) (2006)

    Google Scholar 

  24. Crammer, K., Dekel, O., Keshet, J., Shalev-Shwartz, S., Singer, Y.: Online passive-aggressive algorithms. Journal of Machine Learning Research (2006)

    Google Scholar 

  25. Tran, B.H., Seide, F., Steinbiss, T.: A word graph based n-best search in continuous speech recognition. In: ICSLP (1996)

    Google Scholar 

  26. http://YAWDa.lip6.fr/

  27. Marti, U.V., Bunke, H.: A full english sentence database for off-line handwriting recognition. In: ICDAR (2002)

    Google Scholar 

  28. Marti, U.V., Bunke, H.: Handwritten sentence recognition. In: ICPR (2000)

    Google Scholar 

  29. Keshet, J., Shalev-Shwartz, S., Bengio, S., Singer, Y., Chazan, D.: Discriminative kernel-based phoneme sequence recognition. In: Interspeech (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Vinel, A., Artières, T. (2013). Maximizing Edit Distance Accuracy with Hidden Conditional Random Fields. In: Wilson, R., Hancock, E., Bors, A., Smith, W. (eds) Computer Analysis of Images and Patterns. CAIP 2013. Lecture Notes in Computer Science, vol 8047. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40261-6_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-40261-6_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-40260-9

  • Online ISBN: 978-3-642-40261-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics