Abstract
Text segmentation is an important step in character recognition. Numerous methods in the literature work very well in practical applications. However, when an image contains strong noise or distracting surface reflections, accurate text segmentation remains challenging. Observing that the stroke width of text is stable and generally differs significantly from that of reflective regions, we present a novel text segmentation method that uses adaptive stroke width estimation and region growing on simple linear iterative clustering superpixels (SLIC superpixels). The method consists of four steps. First, image intensity is normalized to overcome the influence of gray-level changes. Second, intensity consistency is used to compute a normalized stroke width (NSW) map. Third, the optimal stroke width is estimated by searching for the peak of the NSW histogram; the text polarity is also determined. Finally, text is extracted by a local region growing method based on SLIC superpixels. Unlike existing methods of computing stroke width, such as gray-level jumps on a horizontal scan line and gradient-based stroke width transform (SWT) methods, the proposed method is based on the statistics of stroke width over the whole image. Hence the stroke width estimation is not only invariant to scale and rotation, but also more robust to surface reflection and noise than methods based only on pairs of sudden changes in intensity or gradient maps. Experiments on many real images, such as laser-marked detonator codes, notice signatures, and vehicle license plates, show that the proposed algorithm works well on noisy images and achieves performance comparable to current state-of-the-art methods for text segmentation from low-quality images.
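The third step, taking the peak of the stroke width histogram over the whole image, can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the NSW map is computed or how polarity is decided, so the sketch assumes precomputed per-pixel width maps (zero where no stroke was measured) and assumes, purely for illustration, that the polarity with the stronger histogram peak is the text polarity. The names `estimate_stroke_width`, `choose_polarity`, and `max_width` are hypothetical.

```python
import numpy as np


def estimate_stroke_width(width_map, max_width=64):
    """Return (peak_width, peak_count) from the histogram of a width map.

    Assumption: width_map holds one stroke width estimate per pixel,
    with 0 meaning "no stroke measured here".
    """
    widths = np.rint(np.asarray(width_map, dtype=float)).astype(int).ravel()
    # Keep only valid widths; zeros and outliers carry no stroke evidence.
    widths = widths[(widths >= 1) & (widths <= max_width)]
    if widths.size == 0:
        return 0, 0
    counts = np.bincount(widths, minlength=max_width + 1)
    peak = int(np.argmax(counts))  # most frequent width in the whole image
    return peak, int(counts[peak])


def choose_polarity(dark_map, bright_map):
    """Pick the polarity whose histogram peak is stronger (an assumed rule)."""
    dark_width, dark_count = estimate_stroke_width(dark_map)
    bright_width, bright_count = estimate_stroke_width(bright_map)
    if dark_count >= bright_count:
        return "dark", dark_width
    return "bright", bright_width


# Toy maps: dark strokes are consistently 3 pixels wide,
# while bright regions have scattered, inconsistent widths.
dark_map = np.array([[3, 3, 3, 0], [3, 5, 5, 0]])
bright_map = np.array([[7, 0], [0, 9]])
polarity, width = choose_polarity(dark_map, bright_map)
```

Because the estimate is a statistic over all pixels, a few spurious widths from noise or reflections shift the peak only if they outnumber the consistent text strokes, which is the robustness argument the abstract makes.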
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Zhu, A., Wang, G., Dong, Y. (2015). Robust Text Segmentation in Low Quality Images via Adaptive Stroke Width Estimation and Stroke Based Superpixel Grouping. In: Jawahar, C., Shan, S. (eds) Computer Vision - ACCV 2014 Workshops. ACCV 2014. Lecture Notes in Computer Science(), vol 9009. Springer, Cham. https://doi.org/10.1007/978-3-319-16631-5_9
DOI: https://doi.org/10.1007/978-3-319-16631-5_9
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16630-8
Online ISBN: 978-3-319-16631-5
eBook Packages: Computer Science, Computer Science (R0)