Skip to main content

Robust Text Segmentation in Low Quality Images via Adaptive Stroke Width Estimation and Stroke Based Superpixel Grouping

  • Conference paper
  • First Online:
Computer Vision - ACCV 2014 Workshops (ACCV 2014)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 9009))

Included in the following conference series:

Abstract

Text segmentation is an important step in the process of character recognition. In literature, there have been numerous methods that work very well in practical applications. However, when an image includes strong noise or surface reflection distraction, accurate text segmentation still faces many challenges. Observing that the stroke width of text is stable and significantly different from that of reflective regions generally, we present a novel method for text segmentation using adaptive stroke width estimation and simple linear iterative clustering superpixel (SLIC-superpixel) region growing in this paper. It consists of four following steps: The first is to normalize image intensity to overcome the influence of gray changes. The second utilizes the intensity consistency to compute normalized stroke width (NSW) map. The third is to estimate the optimal stroke width through searching for the peak value of the histogram of normalized stroke width, the text polarity is also determined. Finally, we propose a local region growing method for text extraction using SLIC-superpixel. Unlike current existing methods of computing stroke width, such as gray level jump on a horizontal scan line and gradient-based SWT methods, the proposed method is based on the statistics of stroke width in the whole image. Hence the stroke width estimation is not only invariant in scale and rotation, but also more robust to surface reflection and noise than that of those methods based only on the pairs of sudden changes of intensity or gradient maps. Experiments with many real images, such as laser marking detonator codes, notice signatures and vehicle license plates, etc., have shown that the proposed algorithm can work well in noised images and also achieve comparable performance with current state-of-the-art method on text segmentation from low quality images.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Otsu, N.: A threshold selection method from gray-level histograms. Automatica 11, 23–27 (1975)

    Google Scholar 

  2. Trier, O.D., Jain, A.K.: Goal-directed evaluation of binarization methods. IEEE Trans. Pattern Anal. Mach. Intell. 17, 1191–1201 (1995)

    Article  Google Scholar 

  3. Wolf, C., Jolion, J.M.: Extraction and recognition of artificial text in multimedia documents. Formal Pattern Anal. Appl. 6, 309–326 (2004)

    MathSciNet  Google Scholar 

  4. Chen, X., Yuille, A.L.: Detecting and reading text in natural scenes. In: Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2004, vol. 2, pp. II–366. IEEE (2004)

    Google Scholar 

  5. Satish, M., Lajish, V., Kopparapu, S.K.: Edge assisted fast binarization scheme for improved vehicle license plate recognition. In: 2011 National Conference on Communications (NCC), pp. 1–5. IEEE (2011)

    Google Scholar 

  6. Yin, X., Huang, K., Hao, H.: Robust text detection in natural scene images (2013)

    Google Scholar 

  7. Mancas-Thillou, C., Gosselin, B.: Color text extraction with selective metric-based clustering. Comput. Vis. Image Underst. 107, 97–107 (2007)

    Article  Google Scholar 

  8. Li, J., Tian, Y., Huang, T., Gao, W.: Multi-polarity text segmentation using graph theory. In: 15th IEEE International Conference on Image Processing, ICIP 2008, pp. 3008–3011. IEEE (2008)

    Google Scholar 

  9. Mishra, A., Alahari, K., Jawahar, C.: An MRF model for binarization of natural scene text. In: 2011 International Conference on Document Analysis and Recognition (ICDAR), pp. 11–16. IEEE (2011)

    Google Scholar 

  10. Subramanian, K., Natarajan, P., Decerbo, M., Castañòn, D.: Character-stroke detection for text-localization and extraction. In: Ninth International Conference on Document Analysis and Recognition, ICDAR 2007, vol. 1, pp. 33–37. IEEE (2007)

    Google Scholar 

  11. Jung, C., Liu, Q., Kim, J.: A stroke filter and its application to text localization. Pattern Recogn. Lett. 30, 114–122 (2009)

    Article  Google Scholar 

  12. Epshtein, B., Ofek, E., Wexler, Y.: Detecting text in natural scenes with stroke width transform. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2963–2970. IEEE (2010)

    Google Scholar 

  13. Fairchild, M.D.: Color Appearance Models. John Wiley & Sons, New York (2013)

    Book  Google Scholar 

  14. Liu, L., Zhang, D., You, J.: Detecting wide lines using isotropic nonlinear filtering. IEEE Trans. Image Process. 16, 1584–1595 (2007)

    Article  MathSciNet  Google Scholar 

  15. Achanta, R., Shaji, A., Smith, K., Lucchi, A., Fua, P., Susstrunk, S.: Slic superpixels compared to state-of-the-art superpixel methods. IEEE Trans. Pattern Anal. Mach. Intell. 34, 2274–2282 (2012)

    Article  Google Scholar 

  16. Liu, Q., Jung, C., Moon, Y.: Text segmentation based on stroke filter. In: Proceedings of the 14th Annual ACM international Conference on Multimedia, pp. 129–132. ACM (2006)

    Google Scholar 

  17. Shen, J.: On the foundations of vision modeling: I. Weber’s law and weberized tv restoration. Physica D: Nonlinear Phenom. 175, 241–251 (2003)

    Article  MATH  Google Scholar 

  18. Shi, C., Xiao, B., Wang, C., Zhang, Y.: Adaptive graph cut based binarization of video text images. In: 2012 10th IAPR International Workshop on Document Analysis Systems (DAS), pp. 58–62. IEEE (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Guoyou Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Zhu, A., Wang, G., Dong, Y. (2015). Robust Text Segmentation in Low Quality Images via Adaptive Stroke Width Estimation and Stroke Based Superpixel Grouping. In: Jawahar, C., Shan, S. (eds) Computer Vision - ACCV 2014 Workshops. ACCV 2014. Lecture Notes in Computer Science(), vol 9009. Springer, Cham. https://doi.org/10.1007/978-3-319-16631-5_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-16631-5_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-16630-8

  • Online ISBN: 978-3-319-16631-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics