Skip to main content

Abstract

It is estimated that 285 million people globally are visually impaired. A majority of these people live in developing countries and are among the elderly population. Reading is essential in daily life for everyone. Visually impaired persons can read only by use of special scripts specially designed for them such as Braille language. Further, only trained people can read and understand. Since every product does not provide the product information on product cover in Braille, the present work proposes an assistive text reading framework to help visually impaired persons to read texts from various products/objects in their daily lives. The first step in implementation captures the image of the required by extracting frames from real-time video input from the camera. This is followed by preprocessing steps which includes conversion to grey scale and filtering. The text regions are further extracted using MSER followed by canny edge detection. The text regions from the captured image are then extracted and recognized by using Optical Character Recognition software (OCR). The OCR engine Tesseract is used here. This extracts the text of various fonts and then sizes can be recognized individually and then combined to form a word. Further, producing audio output by using Text to Speech module. The result obtained is very much comparable with other existing methods with better time efficiency. The real-time input is taken and passed through the algorithm which applies filters and removes noise then later image is passed through MSER, OCR, Canny edge detection to get the final audio output.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 44.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 59.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Strotthe T et al (1997) Mobility of blind and elderly people interacting with computers. National Institute for the Blind, report on the MOBIC project. http://www.tiresias.org/reports/mobicf.htm

  2. Real Time Text Detection and Recognition on Hand Held Objects to Assist Blind People. In: 2016 international conference on automatic control and dynamic optimization techniques (ICACDOT), International Institute of Information Technology (I2IT), Pune

    Google Scholar 

  3. Venkateswarlu K, Velaga SM. Text detection on scene images using MSER

    Google Scholar 

  4. Islam MR, Mondal C, Azam MK, Syed A, Islam MJ Text detection and recognition using enhanced MSER detection and a novel OCR technique

    Google Scholar 

  5. Gómez L, Karatzas D. MSER-based real-time text detection and tracking

    Google Scholar 

  6. Kim KI, Jung K, Kim JH (2003) Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans Pattern Anal Mach Intelligence 25(12):1631–1639

    Article  Google Scholar 

  7. Koo HI, Kim DH (2013) Scene text detection via connected component clustering and nontext filtering. IEEE Trans Image Process 22(6):2296–2305

    Article  MathSciNet  MATH  Google Scholar 

  8. Srivastav A, Kumar J (2008) Text detection in scene images using stroke width and nearest-neighbor constraints. In: TENCON IEEE region 10 conference, pp 1–5

    Google Scholar 

  9. Zhou G, Liu Y, Tian Z, Su Y (2011) A new hybrid method to detect text in natural scene. In: 18th IEEE international conference on image processing (ICIP), pp 2605–2608

    Google Scholar 

  10. Gómez L, Karatzas D (2014) MSER-based real-time text detection and tracking. In: 22nd international conference on pattern recognition (ICPR), pp 3110–3115

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to B. K. RajithKumar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

RajithKumar, B.K., Mohana, H.S., Jamakhandi, D.A., Akshatha, K.V., Hegde, D.B., Singh, A. (2019). Real-Time Input Text Recognition System for the Aid of Visually Impaired. In: Pandian, D., Fernando, X., Baig, Z., Shi, F. (eds) Proceedings of the International Conference on ISMAC in Computational Vision and Bio-Engineering 2018 (ISMAC-CVB). ISMAC 2018. Lecture Notes in Computational Vision and Biomechanics, vol 30. Springer, Cham. https://doi.org/10.1007/978-3-030-00665-5_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-00665-5_16

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-00664-8

  • Online ISBN: 978-3-030-00665-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics