Skip to main content

Techniques for Solving the Large-Scale Classification Problem in Chinese Handwriting Recognition

  • Conference paper
Arabic and Chinese Handwriting Recognition (SACH 2006)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4768))

Included in the following conference series:

Abstract

Given the large number of categories, or class types, in the Chinese language, the challenge offered by character recognition involves dealing with such a large-scale problem in both training and testing phases. This paper addresses three techniques, the combination of which has been found to be effective in solving the problem. The techniques are: 1) a prototype learning/matching method that determines the number and location of prototypes in the learning phase, and chooses the candidates for each character in the testing phase; 2) support vector machines (SVM) that post-process the top-ranked candidates obtained during the prototype learning or matching process; and 3) fast feature-vector matching techniques to accelerate prototype matching via decision trees and sub-vector matching. The techniques are applied to Chinese handwritten characters, expressed as feature vectors derived by extraction operations, such as nonlinear normalization, directional feature extraction, and feature blurring.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cortes, C., Vapnik, V.: Support-vector network. In: Machine Learning, vol. 20, pp. 273–297 (1995)

    Google Scholar 

  2. Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995)

    MATH  Google Scholar 

  3. Bottou, L., Cortes, C., Denker, J., Drucker, H., Guyon, I., Jackel, L., LeCun, L., Muller, U., Sackinger, E., Simard, P., Vapnik, V.: Comparison of classifier methods: A case study in handwriting digit recognition. In: Proc. Int. Conf. Pattern Recognition., pp. 77–87 (1994)

    Google Scholar 

  4. Knerr, S., Personnaz, L., Dreyfus, G.: Single-layer learning revisited: A stepwise procedure for building and training a neural network. In: Fogelman, J. (ed.) Neurocomputing: Algorithms, Architectures and Applications, Springer, New York (1990)

    Google Scholar 

  5. Platt, Cristianini, N., Shawe-Taylor, J.: Large margin DAG’s for multiclass classification. Advances in Neural Information Processing Systems 12, 547–553 (2000)

    Google Scholar 

  6. Chang, F., Lin, C.-C., Chen, C.J.: Applying a Hybrid Method to Handwritten Char-acter Recognition. In: Intern. Conf. Pattern Recognition 2004, Cambridge, vol. 2, pp. 529–532 (2004)

    Google Scholar 

  7. Chou, C.H., Lin, C.C., Liu., Y.H., Chang, F.: A Prototype Classification Method and Its Use in A Hybrid Solution for Multiclass Pattern Recognition. Pattern Recogni-tion 39(4), 624–634 (2006)

    Article  MATH  Google Scholar 

  8. Liu, Y.-H., Lin, C.-C., Lin, W.-H., Chang, F.: Accelerating Feature-Vector Matching Using Multiple-Tree and Sub-Vector Methods. Pattern Recognition 40(9), 2392–2399 (2007)

    Article  MATH  Google Scholar 

  9. Chang, F., Lin, C.-C., Lu, C.-J.: Adaptive Prototype Learning Algorithms: Theo-retical and Experimental Studies. Journal of Machine Learning Research 7, 2125–2148 (2006)

    MathSciNet  Google Scholar 

  10. Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Chapman and Hall, New York (1984)

    MATH  Google Scholar 

  11. Quinlan, J.R.: Induction of Decision Tree. Machine Learning 1(1), 81–106 (1986)

    Google Scholar 

  12. Chang, F., Lin, C.-C., Chen, C.-J.: A hybrid method for multiclass classification and its application to handwritten character recognition. Institute of Information Science, Academia Sinica, Taipei, Taiwan, Tech. Rep. TR-IIS-04-016 (2004)

    Google Scholar 

  13. Chang, F., Chou, C.-H., Lin, C.-C., Chen, C.-J.: A prototype classification method and its application to handwritten character recognition. In: IEEE SMC, Hague (2004)

    Google Scholar 

  14. Chang, F., Lin, C.-C., Chen, C.-J.: Applying a hybrid method to handwritten character recognition. In: Proc.17th Intern. Conf. Pattern Recognition, pp. 529–532 (2004)

    Google Scholar 

  15. Lee, S.-W., Park, J.-S.: Nonlinear shape normalization methods for the recognition of large-set handwritten characters. Pattern Recognition 27(7), 895–902 (1994)

    Article  MathSciNet  Google Scholar 

  16. Yamada, H., Yamamoto, K., Saito, T.: A nonlinear normalization method for hand-printed Kanji character recognition – line density equalization. Pattern Recognition 23(9), 1023–1029 (1990)

    Article  Google Scholar 

  17. Liu, C.-L., Kim, I.-J., Kim, J.H.: High accuracy handwritten Chinese character recognition by improved feature matching method. In: Proc. 4th Intern. Conf. Document Analysis and Recognition, pp. 1033–1037 (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

David Doermann Stefan Jaeger

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chang, F. (2008). Techniques for Solving the Large-Scale Classification Problem in Chinese Handwriting Recognition. In: Doermann, D., Jaeger, S. (eds) Arabic and Chinese Handwriting Recognition. SACH 2006. Lecture Notes in Computer Science, vol 4768. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-78199-8_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-78199-8_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-78198-1

  • Online ISBN: 978-3-540-78199-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics