Maximizing Edit Distance Accuracy with Hidden Conditional Random Fields

Vinel, Antoine; Artières, Thierry

doi:10.1007/978-3-642-40261-6_5

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 8047)

Included in the following conference series:

International Conference on Computer Analysis of Images and Patterns

Abstract

Handwriting recognition aims to predict a sequence of characters from an image of handwritten text. The main approaches rely on learning statistical models such as Hidden Markov Models or Conditional Random Fields. The quality of these models is measured by character and word error rates, yet they are usually not trained to optimize these criteria. We propose an efficient method for learning Hidden Conditional Random Fields that optimizes the error rate within the large-margin framework.
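
To make the evaluation criterion concrete, the following is a minimal sketch of how a character or word error rate is commonly computed from the edit (Levenshtein) distance between a reference and a predicted sequence. It illustrates the metric only, not the training method proposed in the paper; unit costs for insertion, deletion, and substitution are an assumption, and the function names `edit_distance` and `error_rate` are hypothetical.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences, assuming unit costs."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance between ref[:i] and the empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of r
                curr[j - 1] + 1,     # insertion of h
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def error_rate(ref, hyp):
    """Edit distance normalized by the reference length."""
    if len(ref) == 0:
        raise ValueError("reference sequence must not be empty")
    return edit_distance(ref, hyp) / len(ref)


# Character error rate: compare strings character by character.
cer = error_rate("hello", "hallo")
# Word error rate: compare lists of words.
wer = error_rate("the cat sat".split(), "the cat sit down".split())
```

Passing strings gives a character-level rate, and passing lists of words gives a word-level rate, since the same dynamic program works on any sequence of comparable items.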

Author information

Authors and Affiliations

Université Pierre et Marie Curie (LIP6), Paris, France
Antoine Vinel & Thierry Artières

Editor information

Editors and Affiliations

Department of Computer Science, University of York, Deramore Lane, YO10 5GH, York, UK
Richard Wilson, Edwin Hancock, Adrian Bors & William Smith

Cite this paper

Vinel, A., Artières, T. (2013). Maximizing Edit Distance Accuracy with Hidden Conditional Random Fields. In: Wilson, R., Hancock, E., Bors, A., Smith, W. (eds) Computer Analysis of Images and Patterns. CAIP 2013. Lecture Notes in Computer Science, vol 8047. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40261-6_5

DOI: https://doi.org/10.1007/978-3-642-40261-6_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40260-9
Online ISBN: 978-3-642-40261-6
eBook Packages: Computer Science, Computer Science (R0)
