Chinese Image Character Recognition Using DNN and Machine Simulated Training Samples

  • Conference paper
Artificial Neural Networks and Machine Learning – ICANN 2014 (ICANN 2014)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8681))

Abstract

Inspired by the success of deep neural network (DNN) models on challenging visual problems, this paper studies Chinese Image Character Recognition (ChnICR) by combining a DNN model with a very large set of machine-simulated training samples. To generate the samples, clean machine-born Chinese characters are extracted and combined with common variations of image characters, such as changes in size, font, boldness, and shift, as well as complex backgrounds. In total this produces over 28 million character images, covering the vast majority of occurrences of Chinese characters in real-life images. Based on these samples, a DNN training procedure is employed to learn a Chinese character recognizer, and the effects of the width and depth of the DNN and of the volume of samples are discussed empirically. In parallel, a holistic Chinese image text recognition system is developed. Encouraging experimental results on text from 13 TV channels demonstrate the effectiveness of the learned recognizer, with significant performance gains over the baseline system.
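The sample-generation idea in the abstract can be illustrated with a small sketch. This is not the authors' pipeline: the function names, the 16-pixel canvas, the size choices, the shift range, and the random gray background are all assumptions made for illustration, and font variation is omitted because it would require rendering from real font files. The sketch takes a clean binary glyph and applies size, boldness, and shift changes before compositing it onto a noisy background.

```python
import random

def resize(glyph, size):
    """Nearest-neighbour resize of a binary glyph to size x size."""
    h, w = len(glyph), len(glyph[0])
    return [[glyph[y * h // size][x * w // size] for x in range(size)]
            for y in range(size)]

def embolden(glyph):
    """Thicken strokes by one pixel (4-neighbour dilation)."""
    h, w = len(glyph), len(glyph[0])
    out = [row[:] for row in glyph]
    for y in range(h):
        for x in range(w):
            if glyph[y][x]:
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if 0 <= ny < h and 0 <= nx < w:
                        out[ny][nx] = 1
    return out

def shift(glyph, dx, dy):
    """Translate a glyph by (dx, dy) pixels, filling vacated pixels with 0."""
    h, w = len(glyph), len(glyph[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            sy, sx = y - dy, x - dx
            if 0 <= sy < h and 0 <= sx < w:
                out[y][x] = glyph[sy][sx]
    return out

def composite(glyph, rng, ink=255):
    """Draw ink pixels over random gray noise.

    The noise is an assumed stand-in for the complex backgrounds
    mentioned in the abstract, not the backgrounds used in the paper.
    """
    return [[ink if px else rng.randrange(0, 128) for px in row]
            for row in glyph]

def simulate(glyph, rng, canvas=16):
    """Produce one simulated grayscale sample from a clean binary glyph."""
    size = rng.choice([8, 12, 16])           # size variation (assumed values)
    small = resize(glyph, size)
    off = (canvas - size) // 2               # centre the glyph on the canvas
    img = [[0] * canvas for _ in range(canvas)]
    for y in range(size):
        for x in range(size):
            img[y + off][x + off] = small[y][x]
    if rng.random() < 0.5:                   # boldness variation
        img = embolden(img)
    img = shift(img, rng.randint(-2, 2), rng.randint(-2, 2))  # shift variation
    return composite(img, rng)
```

Calling `simulate(glyph, random.Random(seed))` repeatedly with different seeds yields many distinct variants of one clean character, which is the general mechanism by which a modest set of clean glyphs can grow into a very large training set.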

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Bai, J., Chen, Z., Feng, B., Xu, B. (2014). Chinese Image Character Recognition Using DNN and Machine Simulated Training Samples. In: Wermter, S., et al. Artificial Neural Networks and Machine Learning – ICANN 2014. ICANN 2014. Lecture Notes in Computer Science, vol 8681. Springer, Cham. https://doi.org/10.1007/978-3-319-11179-7_27

  • DOI: https://doi.org/10.1007/978-3-319-11179-7_27

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-11178-0

  • Online ISBN: 978-3-319-11179-7

  • eBook Packages: Computer Science, Computer Science (R0)
