Distance Transform-Based Stroke Feature Descriptor for Text Non-text Classification

Khan, Tauseef; Mollah, Ayatullah Faruk

doi:10.1007/978-981-13-1280-9_19

Tauseef Khan¹⁸ &
Ayatullah Faruk Mollah¹⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 740))

1029 Accesses
7 Citations

Abstract

Natural scene or document images captured from camera devices containing text are the most informative region for communication. Extraction of text regions from such images is the primary and fundamental task of obtaining textual content present in images. Classifying foreground objects as text/non-text elements is one of the significant modules in scene text localization. Stroke width is an important discriminating feature of text blocks. In this paper, a distance transform-based stroke feature descriptor is reported for component level classification of foreground components obtained from input images. Potential stroke pixels are identified from distance map of a component using strict staircase method, and distribution of distance values of such pixels is used for designing the feature descriptors. Finally, we classify the components using a neural network-based classifier. Experimental result shows that component classification accuracy is more than 88%, which is much impressive in practical scenario.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Zhang, Z., Zhang, C., Shen, W., Yao, C., Liu, W., Bai, X.: Multi-oriented text detection with fully convolutional networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 4159—4167, IEEE (2016)
Google Scholar
Chen, X., Yuille, A.L.: Detecting and reading text in natural scenes. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. II–II (2004)
Google Scholar
Yao, C., Bai, X., Liu, W., Ma, Y., Tu, Z.: Detecting texts of arbitrary orientations in natural images. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1083–1090, IEEE (2012)
Google Scholar
Yi, C., Tian, Y.: Text string detection from natural scenes by structure-based partition and grouping. In: IEEE Transactions on Image Processing, pp. 2594–2605, IEEE (2011)
Google Scholar
Neumann, L., Matas, J.: Real-time scene text localization and recognition. In. IEEE Conference on Computer Vision and Pattern Recognition, pp. 3538–3545, IEEE (2012)
Google Scholar
Huang, W., Lin, Z., Yang, J., Wang, J.: Text localization in natural images using stroke feature transform and text covariance descriptors. In: IEEE International Conference on Computer Vision, pp. 1241–1248, IEEE (2013)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 886–893, IEEE (2005)
Google Scholar
Minetto, R., Thome, N., Cord, M., Leite, N.J., Stolfi, J.: T-HOG: an effective gradient-based descriptor for single line text regions. Pattern Recognit., 1078–1090 (2013). Elsevier
Google Scholar
Tian, S., Bhattacharya, U., Lu, S., Su, B., Wang, Q., Wei, X., Lu, Y., Tan, C.L.: Multilingual scene character recognition with co-occurrence of histogram of oriented gradients. Pattern Recognit. 51, 125–134 (2016). Elsevier
Google Scholar
Ojala, T., Pietikäinen, M., Harwood, D.: A comparative study of texture measures with classification based on featured distributions. Pattern Recognit., 51–59 (1996). Elsevier
Google Scholar
Mäenpää, T., Pietikäinen, M.: Multi-scale binary patterns for texture analysis. Image Anal., 267–275 (2003). Springer
Google Scholar
Goto, H., Tanaka, M.: Text-tracking wearable camera system for the blind. In: 10th International Conference on Document Analysis and Recognition, pp. 141–145, IEEE (2009)
Google Scholar
Ye, Q., Huang, Q., Gao, W., Zhao, D.: Fast and robust text detection in images and video frames. Imag. Vision Comput. 23(6), 565–576 (2005). Elsevier
Google Scholar
Epshtein, B., Ofek, E., Wexler, Y: Detecting text in natural scenes with stroke width transform. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2963–2970, IEEE (2010)
Google Scholar
Neumann, L., Matas, J.: Efficient scene text localization and recognition with local character refinement. In: 13th International Conference on Document Analysis and Recognition, pp. 746–750, IEEE (2015)
Google Scholar
Subramanian, K., Natarajan, P., Decerbo, M., Castanon, D.: Character-stroke detection for text-localization and extraction. In: 9th International Conference on Document Analysis and Recognition, ICDAR, pp. 33–37, IEEE (2007)
Google Scholar
Mollah, A.F., Basu, S., Nasipuri, M.: Text detection from camera captured images using a novel fuzzy-based technique. In: 3rd International Conference on Emerging Applications of Information Technology (EAIT), pp. 291–294, IEEE (2012)
Google Scholar
Khan, T., Mollah, A.F.: A novel text localization scheme for camera captured document images. In: 2nd International Conference on Computer Vision & Image Processing (CVIP), pp. 253–264, Springer Nature (2017)
Google Scholar

Download references

Acknowledgements

This work is carried out in the research lab of Computer Science & Engineering Department of Aliah University. The first author is grateful to Maulana Azad National Fellowship (MANF) for the financial support.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Aliah University, Kolkata, 700160, India
Tauseef Khan & Ayatullah Faruk Mollah

Authors

Tauseef Khan
View author publications
You can also search for this author in PubMed Google Scholar
Ayatullah Faruk Mollah
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tauseef Khan .

Editor information

Editors and Affiliations

College of Engineering and Applied Science, University of Colorado Colorado Springs, Colorado Springs, CO, USA
Jugal Kalita
Automation and Applied Informatics, Aurel Vlaicu University of Arad, Arad, Romania
Valentina Emilia Balas
Department of Computer Applications, Sikkim Manipal University, Sikkim, India
Samarjeet Borah
Department of Computer Applications, Sikkim Manipal University, Sikkim, India
Ratika Pradhan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Khan, T., Mollah, A.F. (2019). Distance Transform-Based Stroke Feature Descriptor for Text Non-text Classification. In: Kalita, J., Balas, V., Borah, S., Pradhan, R. (eds) Recent Developments in Machine Learning and Data Analytics. Advances in Intelligent Systems and Computing, vol 740. Springer, Singapore. https://doi.org/10.1007/978-981-13-1280-9_19

Download citation

DOI: https://doi.org/10.1007/978-981-13-1280-9_19
Published: 12 September 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1279-3
Online ISBN: 978-981-13-1280-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics