Recognition of Simple and Conjunct Handwritten Malayalam Characters Using LCPA Algorithm

Rahiman, M. Abdul; Rajasree, M. S.

doi:10.1007/978-3-642-22720-2_31

M. Abdul Rahiman⁶ &
M. S. Rajasree⁷

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 192))

Included in the following conference series:

International Conference on Advances in Computing and Communications

1594 Accesses
1 Citations

Abstract

This paper mainly focuses on the recognition of both simple and conjunct handwritten characters in Malayalam, a South Indian language. The algorithm proposed recognizes these characters mainly based on the strokes and lines contained in them. Here the input is an image of handwritten Malayalam characters, which undergoes different phases of processing to produce an editable document of Malayalam characters in a predefined format as output. In this paper, detailed description of the methods for character identification is given. The whole OCR process is presented in three different modules: Pre-processing, Skeletonization and Recognition. In Pre-processing, the input image is scanned and subjected to line and character separation. In Skeletonization, the digital image is transformed into a set of original components. In Recognition, the characters are classified based on their features. The feature extraction of the characters is done by the analyzing the position and count of the horizontal and vertical lines. A classification of the simple and conjunct characters is also devised based on the count and position of the horizontal and vertical lines which make up those characters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Trier, D., Jain, A.K., Taxt, T.: Feature Extraction methods for Character Recognition – A Survey. Pattern Recognition 29, 641–662 (1996)
Article Google Scholar
Srihari, S.N., Yang, X., Ball, G.R.: Offline Chinese Handwriting Recognition: an assessment of current Technology. Front. Computer Science 1(2), 137–155 (2007)
Article Google Scholar
Amin, A.: Recognition of Printed Arabic Text based on global features and Decision Tree Learning Techniques. Pattern Trcognition 33(8), 1309–1323 (2000)
Article Google Scholar
Pal, U., Chaudhuri, B.B.: Printed Devanagari script OCR System. Vivek 10 (1997)
Google Scholar
Chaudhuri, B., Pal, U., Mitra, U.: Automatic recognition of printed Oriya script. Sadhana 27, Part 1, 23–34 (2002)
Article Google Scholar
Seethalakshmi, R., Sreeranjani, T.R., Balachandar, T., Singh, A., Singh, M., Ratan, R., Kumar, S.: Optical Character Recognition for printed Tamil text using Unicode. Journal of Zhejiang University SCI 6A(11), 1297–1305 (2005)
Article Google Scholar
Lakshmi, C.V., Patvardhan, C.: A multi-font OCR system for printed Telugu text. In: Proc. of Language Engineering Conference LEC, Hyderabad, pp. 7–17 (2002)
Google Scholar
Ashwin, T.V., Sastry, P.S.: A font and size independent OCR system for printed Kannada documents using support vector machines. Saadhana 27, Part 1, 35–58 (2002)
Article Google Scholar
Abdul Rahiman, M., Rajasree, M.S.: Printed Malayalam Character Recognition Using Back propagation Neural Networks. In: Proc. of IEEE International Advance Computing Conference (IACC 2009), Patiala, pp. 1140–1144 (March 2009)
Google Scholar
Journal of Language Technology, Viswabharat@tdil (July 2003)
Google Scholar
Anuradha, Koteswarra, B.: An efficient Binarization technique for old documents. In: Proc. of International Conference on Systemics, Cybernetics and Inforrmatics, Hyderabad, pp. 771–775 (2006)
Google Scholar
Chaudhuri, B.B., Pal, U.: Skew Angle Detection of Digitized Indian Script Document. IEEE Transactions on Pattern Analysis and Machine Intelligence 19(2) (February 1997)
Google Scholar

Download references

Author information

Authors and Affiliations

Karpagam University, Coimbatore, Kerala, India
M. Abdul Rahiman
Govt. College of Engg, Trivandrum, Kerala, India
M. S. Rajasree

Authors

M. Abdul Rahiman
View author publications
You can also search for this author in PubMed Google Scholar
M. S. Rajasree
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Machine Intelligence Research Labs (MIR Labs), Auburn, 98071-2259, Washington, USA
Ajith Abraham
Departamento de Comunicaciones, Universidad Politcnica de Valencia, 46071, Valencia, Spain
Jaime Lloret Mauri
Avaya Labs Research, Basking Ridge, NJ, USA
John F. Buford
University of Massachusetts, 100 Morrissey Blvd., 02125-3393, Boston, MA, USA
Junichi Suzuki
Rajagiri School of Engineering and Technology, Rajagiri Valley, Kakkanad, 682 039, Kochi, India
Sabu M. Thampi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rahiman, M.A., Rajasree, M.S. (2011). Recognition of Simple and Conjunct Handwritten Malayalam Characters Using LCPA Algorithm. In: Abraham, A., Mauri, J.L., Buford, J.F., Suzuki, J., Thampi, S.M. (eds) Advances in Computing and Communications. ACC 2011. Communications in Computer and Information Science, vol 192. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22720-2_31

Download citation

DOI: https://doi.org/10.1007/978-3-642-22720-2_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22719-6
Online ISBN: 978-3-642-22720-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics