A Benchmark System for Indian Language Text Recognition

Tulsyan, Krishna; Srivastava, Nimisha; Mondal, Ajoy; Jawahar, C. V.

doi:10.1007/978-3-030-57058-3_6

Krishna Tulsyan¹¹,
Nimisha Srivastava¹¹,
Ajoy Mondal¹¹ &
…
C. V. Jawahar¹¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12116))

Included in the following conference series:

International Workshop on Document Analysis Systems

1218 Accesses
2 Citations

Abstract

The performance various academic and commercial text recognition solutions for many languages world-wide has been satisfactory. Many projects now use the ocr as a reliable module. As of now, Indian languages are far away from this state, which is unfortunate. Beyond many challenges due to script and language, this space is adversely affected by the scattered nature of research, lack of systematic evaluation, and poor resource dissemination. In this work, we aim to design and implement a web-based system that could indirectly address some of these aspects that hinder the development of ocr for Indian languages. We hope that such an attempt will help in (i) providing and establishing a consolidated view of state-of-the-art performances for character and word recognition at one place (ii) sharing resources and practices (iii) establishing standard benchmarks that clearly explain the capabilities and limitations of the recognition methods (iv) bringing research attempts from a wide variety of languages, scripts, and modalities into a common forum. We believe the proposed system will play a critical role in further promoting the research in the Indian language text recognition domain.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
https://www.openaire.eu/faqs.
2.
https://github.com/.
3.
https://www.kaggle.com/.
4.
http://host.robots.ox.ac.uk:8080/.
5.
https://www.cityscapes-dataset.com/.
6.
Overfitting is a modeling error that occurs when a function is too closely fit a limited set of data points.
7.
https://www.djangoproject.com/.

References

Achanta, R., Hastie, T.J.: Telugu OCR framework using deep learning. ArXiv (2015)
Google Scholar
Ashwin, T.V., Sastry, P.S.: A font and size-independent OCR system for printed Kannada documents using support vector machines. Sadhana 27, 35–38 (2002)
Article Google Scholar
Bansal, V., Sinha, R.: A complete OCR for printed Hindi text in Devanagari script. In: ICDAR (2001)
Google Scholar
Bansal, V., Sinha, R.M.K.: A complete OCR for printed Hindi text in Devanagari script. In: ICDAR (2001)
Google Scholar
Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: Handwritten Bangla alphabet recognition using an MLP based classifier. CoRR (2012)
Google Scholar
Breuel, T.M.: High performance text recognition using a hybrid convolutional-LSTM implementation. In: ICDAR (2017)
Google Scholar
Chandramouli, C., General, R.: Census of India 2011. Government of India, Provisional Population Totals, New Delhi (2011)
Google Scholar
Chaudhuri, B.B.: A complete handwritten numeral database of Bangla – a major Indic script. In: IWFHR (2006)
Google Scholar
Cordts, M., et al.: The Cityscapes dataset for semantic urban scene understanding. In: CVPR (2016)
Google Scholar
Das, N., Das, B., Sarkar, R., Basu, S., Kundu, M., Nasipuri, M.: Handwritten Bangla basic and compound character recognition using MLP and SVM classifier. ArXiv (2010)
Google Scholar
Datta, A.K.: A generalized formal approach for description and analysis of major Indian scripts. IETE J. Res. (1984)
Google Scholar
Dutta, K., Mathew, M., Krishnan, P., Jawahar, C.V.: Localizing and recognizing text in lecture videos. In: ICFHR (2018)
Google Scholar
Everingham, M., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The Pascal visual object classes (VOC) challenge. IJCV 88, 303–338 (2010)
Article Google Scholar
Gaur, A., Yadav, S.: Handwritten Hindi character recognition using k-means clustering and SVM. ISETTLIS (2015)
Google Scholar
Gupta, V., Rathna, G.N., Ramakrishnan, K.: Automatic Kannada text extraction from camera captured images. In: MCDES, IISc Centenary Conference (2008)
Google Scholar
Jain, M., Mathew, M., Jawahar, C.V.: Unconstrained OCR for Urdu using deep CNN-RNN hybrid networks. In: ACPR (2017)
Google Scholar
Jomy, J., Pramod, K.V., Kannan, B.: Handwritten character recognition of south Indian scripts: a review. CoRR (2011)
Google Scholar
Jordan, M.I., Mitchell, T.M.: Machine learning: trends, perspectives, and prospects. Science 349, 255–260 (2015)
Article MathSciNet Google Scholar
Karatzas, D., Gómez, L., Nicolaou, A., Rusiñol, M.: The robust reading competition annotation and evaluation platform. In: DAS (2018)
Google Scholar
Karatzas, D., et al.: ICDAR 2015 competition on robust reading. In: ICDAR (2015)
Google Scholar
Karatzas, D., et al.: ICDAR 2013 robust reading competition. In: ICDAR (2013)
Google Scholar
Klakow, D., Peters, J.: Testing the correlation of word error rate and perplexity. Speech Commun. 38, 19–28 (2002)
Article Google Scholar
Krishnan, P., Sankaran, N., Singh, A.K., Jawahar, C.: Towards a robust OCR system for Indic scripts. In: DAS (2014)
Google Scholar
Kumar, A., Jawahar, C.V.: Content-level annotation of large collection of printed document images. In: ICDAR (2007)
Google Scholar
Levenshtein, V.: Binary codes capable of correcting deletions, insertions and reversals. Soviet Physics Doklady 10, 707–710 (1966)
MathSciNet Google Scholar
Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 robust reading competitions. In: ICDAR (2003)
Google Scholar
Mathew, M., Jain, M., Jawahar, C.V.: Benchmarking scene text recognition in Devanagari, Telugu and Malayalam (2017)
Google Scholar
Mathew, M., Singh, A.K., Jawahar, C.V.: Multilingual OCR for Indic scripts. In: DAS (2016)
Google Scholar
Nag, S., et al.: Offline extraction of Indic regional language from natural scene image using text segmentation and deep convolutional sequence. ArXiv (2018)
Google Scholar
Negi, A., Bhagvati, C., Krishna, B.: An OCR system for Telugu. In: ICDAR (2001)
Google Scholar
Omee, F.Y., Himel, S.S., Bikas, M.A.N.: A complete workflow for development of Bangla OCR. CoRR (2012)
Google Scholar
Pal, U., Chaudhuri, B.: Indian script character recognition: a survey. Pattern Recogn. 37, 1887–1899 (2004)
Article Google Scholar
Sankaran, N., Jawahar, C.V.: Recognition of printed Devanagari text using BLSTM neural network (2012)
Google Scholar
Sarkar, R., Das, N., Basu, S., Kundu, M., Nasipuri, M., Basu, D.K.: Word level script identification from Bangla and Devanagri handwritten texts mixed with Roman script. CoRR (2010)
Google Scholar
Setlur, S., Kompalli, S., Ramanaprasad, V., Govindaraju, V.: Creation of data resources and design of an evaluation test bed for Devanagari script recognition. In: WPDS (2003)
Google Scholar
Shahab, A., Shafait, F., Dengel, A.: ICDAR 2011 robust reading competition challenge 2: reading text in scene images. In: ICDAR (2011)
Google Scholar
Sheshadri, K., Ambekar, P.K.T., Prasad, D.P., Kumar, R.P.: An OCR system for printed Kannada using k-means clustering. In: ICIT (2010)
Google Scholar
Sinha, R.M.K.: A journey from Indian scripts processing to Indian language processing. IEEE Ann. Hist. Comput. 31, 8–31 (2009)
Article MathSciNet Google Scholar
Smith, R.: An overview of the Tesseract OCR engine. In: ICDAR (2007)
Google Scholar
Stiehl, U.: Sanskrit-kompendium. Economica Verlag (2002)
Google Scholar
Ye, Q., Doermann, D.S.: Text detection and recognition in imagery: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 37, 1480–1500 (2015)
Article Google Scholar
Zhu, Y., Yao, C., Bai, X.: Scene text detection and recognition: recent advances and future trends. Front. Comput. Sci. (2015)
Google Scholar

Download references

Author information

Authors and Affiliations

Centre for Visual Information Technology, International Institute of Information Technology, Hyderabad, India
Krishna Tulsyan, Nimisha Srivastava, Ajoy Mondal & C. V. Jawahar

Authors

Krishna Tulsyan
View author publications
You can also search for this author in PubMed Google Scholar
Nimisha Srivastava
View author publications
You can also search for this author in PubMed Google Scholar
Ajoy Mondal
View author publications
You can also search for this author in PubMed Google Scholar
C. V. Jawahar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ajoy Mondal .

Editor information

Editors and Affiliations

Huazhong University of Science and Technology, Wuhan, China
Xiang Bai
Autonomous University of Barcelona, Barcelona, Spain
Dimosthenis Karatzas
Lehigh University, Bethlehem, PA, USA
Daniel Lopresti

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tulsyan, K., Srivastava, N., Mondal, A., Jawahar, C.V. (2020). A Benchmark System for Indian Language Text Recognition. In: Bai, X., Karatzas, D., Lopresti, D. (eds) Document Analysis Systems. DAS 2020. Lecture Notes in Computer Science(), vol 12116. Springer, Cham. https://doi.org/10.1007/978-3-030-57058-3_6

Download citation

DOI: https://doi.org/10.1007/978-3-030-57058-3_6
Published: 14 August 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-57057-6
Online ISBN: 978-3-030-57058-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)