Skip to main content

Text Detection in Low Resolution Scene Images Using Convolutional Neural Network

  • Conference paper
  • First Online:
Recent Advances on Soft Computing and Data Mining (SCDM 2016)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 549))

Included in the following conference series:

Abstract

Text detection on scene images has increasingly gained a lot of interests, especially due to the increase of wearable devices. However, the devices often acquire low resolution images, thus making it difficult to detect text due to noise. Notable method for detection in low resolution images generally utilizes many features which are cleverly integrated and cascaded classifiers to form better discriminative system. Those methods however require a lot of hand-crafted features and manually tuned, which are difficult to achieve in practice. In this paper, we show that the notable cascaded method is equivalent to a Convolutional Neural Network (CNN) framework to deal with text detection in low resolution scene images. The CNN framework however has interesting mutual interaction between layers from which the parameters are jointly learned without requiring manual design, thus its parameters can be better optimized from training data. Experiment results show the efficiency of the method for detecting text in low resolution scene images.

The original version of this chapter was revised: Co-author name has been deleted. The erratum to this chapter is available at DOI: 10.1007/978-3-319-51281-5_65

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Change history

Notes

  1. 1.

    http://www.google.com/mobile/goggles/#text.

  2. 2.

    http://www.artificialvision.com/android.htm.

  3. 3.

    http://tcts.fpms.ac.be/projects/sypole/index.php?lang=en.

References

  1. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR, vol. 1, pp. 886–893. IEEE (2005)

    Google Scholar 

  2. Jung, K., Kim, K.I., Jain, A.K.: Text information extraction in images and video: a survey. Pattern Recogn. 37(5), 977–997 (2004)

    Article  Google Scholar 

  3. LeCun, Y., Boser, B., Denker, J.S., Henderson, D., Howard, R.E., Hubbard, W., Jackel, L.D.: Backpropagation applied to handwritten zip code recognition. Neural Comput. 1(4), 541–551 (1989)

    Article  Google Scholar 

  4. Liang, J., Doermann, D., Li, H.: Camera-based analysis of text and documents: a survey. Intl. J. Doc. Anal. Recogn. (IJDAR) 7(2–3), 84–104 (2005)

    Article  Google Scholar 

  5. Mählisch, M., Oberländer, M., Löhlein, O., Gavrila, D., Ritter, W.: A multiple detector approach to low-resolution fir pedestrian recognition. In: Proceedings of the IEEE Intelligent Vehicles Symposium (IV2005), Las Vegas, NV, USA (2005)

    Google Scholar 

  6. Maji, S., Berg, A.C., Malik, J.: Classification using intersection kernel support vector machines is efficient. In: CVPR, pp. 1–8. IEEE (2008)

    Google Scholar 

  7. Mirmehdi, M., Clark, P., Lam, J.: Extracting low resolution text with an active camera for OCR. In: Spanish Symposium on Pattern Recognition and Image Processing IX, pp. 43–48 (2001)

    Google Scholar 

  8. Neumann, L., Matas, J.: Real-time scene text localization and recognition. In: CVPR, pp. 3538–3545. IEEE (2012)

    Google Scholar 

  9. Neumann, L., Matas, J.: On combining multiple segmentations in scene text recognition. In: ICDAR (2013)

    Google Scholar 

  10. Nguyen, M.H., Kim, S.-H., Lee, G.: Recognizing text in low resolution born-digital images. In: Jeong, Y.-S., Park, Y.-H., Hsu, C.-H.R., Park, J.J.J.H. (eds.) Ubiquitous Information Technologies and Applications. LNEE, vol. 280, pp. 85–92. Springer, Heidelberg (2014). doi:10.1007/978-3-642-41671-2_12

    Chapter  Google Scholar 

  11. Risnumawan, A., Chan, C.S.: Text detection via edgeless stroke width transform. In: ISPACS, pp. 336–340. IEEE (2014)

    Google Scholar 

  12. Risnumawan, A., Shivakumara, P., Chan, C.S., Tan, C.L.: A robust arbitrary text detection system for natural scene images. Expert Syst. Appl. 41(18), 8027–8048 (2014)

    Article  Google Scholar 

  13. Sahli, S., Ouyang, Y., Sheng, Y., Lavigne, D.A.: Robust vehicle detection in low-resolution aerial imagery. In: SPIE Defense, Security, and Sensing, p. 76680G. International Society for Optics and Photonics (2010)

    Google Scholar 

  14. Sanketi, P., Shen, H., Coughlan, J.M.: Localizing blurry and low-resolution text in natural images. In: 2011 IEEE Workshop on Applications of Computer Vision (WACV), pp. 503–510. IEEE (2011)

    Google Scholar 

  15. Wang, K., Babenko, B., Belongie, S.: End-to-end scene text recognition. In: ICCV, pp. 1457–1464. IEEE (2011)

    Google Scholar 

  16. Wang, T., Wu, D.J., Coates, A., Ng, A.Y.: End-to-end text recognition with convolutional neural networks. In: ICPR, pp. 3304–3308. IEEE (2012)

    Google Scholar 

  17. Yin, X.-C., Yin, X., Huang, K., Hao, H.-W.: Robust text detection in natural scene images. IEEE Trans. Pattern Anal. Mach. Intell. 36(5), 970–983 (2014)

    Article  Google Scholar 

  18. Zhang, J., Gong, S.: People detection in low-resolution video with non-stationary background. Image Vis. Comput. 27(4), 437–443 (2009)

    Article  Google Scholar 

  19. Zhao, T., Nevatia, R.: Car detection in low resolution aerial images. Image Vis. Comput. 21(8), 693–703 (2003)

    Article  Google Scholar 

  20. Zhu, J., Javed, O., Liu, J., Yu, Q., Cheng, H., Sawhney, H.: Pedestrian detection in low-resolution imagery by learning multi-scale intrinsic motion structures (mims). In: CVPR, pp. 3510–3517 (2014)

    Google Scholar 

Download references

Acknowledgements

The authors would like to thank Pusat Penelitian dan Pengabdian Masyarakat (P3M) of Politeknik Elektronika Negeri Surabaya (PENS) for supporting this research by Local Research Funding FY 2016.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Anhar Risnumawan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Risnumawan, A., Sulistijono, I.A., Abawajy, J. (2017). Text Detection in Low Resolution Scene Images Using Convolutional Neural Network. In: Herawan, T., Ghazali, R., Nawi, N.M., Deris, M.M. (eds) Recent Advances on Soft Computing and Data Mining. SCDM 2016. Advances in Intelligent Systems and Computing, vol 549. Springer, Cham. https://doi.org/10.1007/978-3-319-51281-5_37

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-51281-5_37

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-51279-2

  • Online ISBN: 978-3-319-51281-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics