Skip to main content

A Benchmark System for Indian Language Text Recognition

  • Conference paper
  • First Online:
Document Analysis Systems (DAS 2020)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 12116))

Included in the following conference series:

Abstract

The performance various academic and commercial text recognition solutions for many languages world-wide has been satisfactory. Many projects now use the ocr as a reliable module. As of now, Indian languages are far away from this state, which is unfortunate. Beyond many challenges due to script and language, this space is adversely affected by the scattered nature of research, lack of systematic evaluation, and poor resource dissemination. In this work, we aim to design and implement a web-based system that could indirectly address some of these aspects that hinder the development of ocr for Indian languages. We hope that such an attempt will help in (i) providing and establishing a consolidated view of state-of-the-art performances for character and word recognition at one place (ii) sharing resources and practices (iii) establishing standard benchmarks that clearly explain the capabilities and limitations of the recognition methods (iv) bringing research attempts from a wide variety of languages, scripts, and modalities into a common forum. We believe the proposed system will play a critical role in further promoting the research in the Indian language text recognition domain.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.openaire.eu/faqs.

  2. 2.

    https://github.com/.

  3. 3.

    https://www.kaggle.com/.

  4. 4.

    http://host.robots.ox.ac.uk:8080/.

  5. 5.

    https://www.cityscapes-dataset.com/.

  6. 6.

    Overfitting is a modeling error that occurs when a function is too closely fit a limited set of data points.

  7. 7.

    https://www.djangoproject.com/.

References

  1. Achanta, R., Hastie, T.J.: Telugu OCR framework using deep learning. ArXiv (2015)

    Google Scholar 

  2. Ashwin, T.V., Sastry, P.S.: A font and size-independent OCR system for printed Kannada documents using support vector machines. Sadhana 27, 35–38 (2002)

    Article  Google Scholar 

  3. Bansal, V., Sinha, R.: A complete OCR for printed Hindi text in Devanagari script. In: ICDAR (2001)

    Google Scholar 

  4. Bansal, V., Sinha, R.M.K.: A complete OCR for printed Hindi text in Devanagari script. In: ICDAR (2001)

    Google Scholar 

  5. Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: Handwritten Bangla alphabet recognition using an MLP based classifier. CoRR (2012)

    Google Scholar 

  6. Breuel, T.M.: High performance text recognition using a hybrid convolutional-LSTM implementation. In: ICDAR (2017)

    Google Scholar 

  7. Chandramouli, C., General, R.: Census of India 2011. Government of India, Provisional Population Totals, New Delhi (2011)

    Google Scholar 

  8. Chaudhuri, B.B.: A complete handwritten numeral database of Bangla – a major Indic script. In: IWFHR (2006)

    Google Scholar 

  9. Cordts, M., et al.: The Cityscapes dataset for semantic urban scene understanding. In: CVPR (2016)

    Google Scholar 

  10. Das, N., Das, B., Sarkar, R., Basu, S., Kundu, M., Nasipuri, M.: Handwritten Bangla basic and compound character recognition using MLP and SVM classifier. ArXiv (2010)

    Google Scholar 

  11. Datta, A.K.: A generalized formal approach for description and analysis of major Indian scripts. IETE J. Res. (1984)

    Google Scholar 

  12. Dutta, K., Mathew, M., Krishnan, P., Jawahar, C.V.: Localizing and recognizing text in lecture videos. In: ICFHR (2018)

    Google Scholar 

  13. Everingham, M., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The Pascal visual object classes (VOC) challenge. IJCV 88, 303–338 (2010)

    Article  Google Scholar 

  14. Gaur, A., Yadav, S.: Handwritten Hindi character recognition using k-means clustering and SVM. ISETTLIS (2015)

    Google Scholar 

  15. Gupta, V., Rathna, G.N., Ramakrishnan, K.: Automatic Kannada text extraction from camera captured images. In: MCDES, IISc Centenary Conference (2008)

    Google Scholar 

  16. Jain, M., Mathew, M., Jawahar, C.V.: Unconstrained OCR for Urdu using deep CNN-RNN hybrid networks. In: ACPR (2017)

    Google Scholar 

  17. Jomy, J., Pramod, K.V., Kannan, B.: Handwritten character recognition of south Indian scripts: a review. CoRR (2011)

    Google Scholar 

  18. Jordan, M.I., Mitchell, T.M.: Machine learning: trends, perspectives, and prospects. Science 349, 255–260 (2015)

    Article  MathSciNet  Google Scholar 

  19. Karatzas, D., Gómez, L., Nicolaou, A., Rusiñol, M.: The robust reading competition annotation and evaluation platform. In: DAS (2018)

    Google Scholar 

  20. Karatzas, D., et al.: ICDAR 2015 competition on robust reading. In: ICDAR (2015)

    Google Scholar 

  21. Karatzas, D., et al.: ICDAR 2013 robust reading competition. In: ICDAR (2013)

    Google Scholar 

  22. Klakow, D., Peters, J.: Testing the correlation of word error rate and perplexity. Speech Commun. 38, 19–28 (2002)

    Article  Google Scholar 

  23. Krishnan, P., Sankaran, N., Singh, A.K., Jawahar, C.: Towards a robust OCR system for Indic scripts. In: DAS (2014)

    Google Scholar 

  24. Kumar, A., Jawahar, C.V.: Content-level annotation of large collection of printed document images. In: ICDAR (2007)

    Google Scholar 

  25. Levenshtein, V.: Binary codes capable of correcting deletions, insertions and reversals. Soviet Physics Doklady 10, 707–710 (1966)

    MathSciNet  Google Scholar 

  26. Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 robust reading competitions. In: ICDAR (2003)

    Google Scholar 

  27. Mathew, M., Jain, M., Jawahar, C.V.: Benchmarking scene text recognition in Devanagari, Telugu and Malayalam (2017)

    Google Scholar 

  28. Mathew, M., Singh, A.K., Jawahar, C.V.: Multilingual OCR for Indic scripts. In: DAS (2016)

    Google Scholar 

  29. Nag, S., et al.: Offline extraction of Indic regional language from natural scene image using text segmentation and deep convolutional sequence. ArXiv (2018)

    Google Scholar 

  30. Negi, A., Bhagvati, C., Krishna, B.: An OCR system for Telugu. In: ICDAR (2001)

    Google Scholar 

  31. Omee, F.Y., Himel, S.S., Bikas, M.A.N.: A complete workflow for development of Bangla OCR. CoRR (2012)

    Google Scholar 

  32. Pal, U., Chaudhuri, B.: Indian script character recognition: a survey. Pattern Recogn. 37, 1887–1899 (2004)

    Article  Google Scholar 

  33. Sankaran, N., Jawahar, C.V.: Recognition of printed Devanagari text using BLSTM neural network (2012)

    Google Scholar 

  34. Sarkar, R., Das, N., Basu, S., Kundu, M., Nasipuri, M., Basu, D.K.: Word level script identification from Bangla and Devanagri handwritten texts mixed with Roman script. CoRR (2010)

    Google Scholar 

  35. Setlur, S., Kompalli, S., Ramanaprasad, V., Govindaraju, V.: Creation of data resources and design of an evaluation test bed for Devanagari script recognition. In: WPDS (2003)

    Google Scholar 

  36. Shahab, A., Shafait, F., Dengel, A.: ICDAR 2011 robust reading competition challenge 2: reading text in scene images. In: ICDAR (2011)

    Google Scholar 

  37. Sheshadri, K., Ambekar, P.K.T., Prasad, D.P., Kumar, R.P.: An OCR system for printed Kannada using k-means clustering. In: ICIT (2010)

    Google Scholar 

  38. Sinha, R.M.K.: A journey from Indian scripts processing to Indian language processing. IEEE Ann. Hist. Comput. 31, 8–31 (2009)

    Article  MathSciNet  Google Scholar 

  39. Smith, R.: An overview of the Tesseract OCR engine. In: ICDAR (2007)

    Google Scholar 

  40. Stiehl, U.: Sanskrit-kompendium. Economica Verlag (2002)

    Google Scholar 

  41. Ye, Q., Doermann, D.S.: Text detection and recognition in imagery: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 37, 1480–1500 (2015)

    Article  Google Scholar 

  42. Zhu, Y., Yao, C., Bai, X.: Scene text detection and recognition: recent advances and future trends. Front. Comput. Sci. (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ajoy Mondal .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Tulsyan, K., Srivastava, N., Mondal, A., Jawahar, C.V. (2020). A Benchmark System for Indian Language Text Recognition. In: Bai, X., Karatzas, D., Lopresti, D. (eds) Document Analysis Systems. DAS 2020. Lecture Notes in Computer Science(), vol 12116. Springer, Cham. https://doi.org/10.1007/978-3-030-57058-3_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-57058-3_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-57057-6

  • Online ISBN: 978-3-030-57058-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics