Abstract
Extracting texts from video always faces variations in font style, size, color, orientation, and brightness; thus, video preprocessing techniques are required to reduce the complexity of the succeeding steps consisting of video text detection, localization, segmentation, recognition, and script identification. This chapter gives a brief overview of the preprocessing techniques that are often used in video text detection. After introducing image preprocessing operators, we discuss several color-based and texture-based preprocessing techniques, respectively. Since image segmentation plays an important role in video text detection, we then introduce several image segmentation approaches. Next, the motion analysis technique which is helpful to improve the efficiency or the accuracy of video text detection by tracing text from temporal frames is introduced. Most of the introduced preprocessing operators and methods have been realized by MATLAB or OpenCV (Open Source Computer Vision Library), and readers can make use of these open sources for practice.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Jung K, In Kim K, Jain AK (2004) Text information extraction in images and video: a survey. Pattern Recog 37(5):977–997
Lienhart RW, Stuber F (1996) Automatic text recognition in digital videos. Proc SPIE 2666(3):180–188
Crane R (1996) Simplified approach to image processing: classical and modern techniques in C. Prentice Hall PTR. 317
Szeliski R (2010) Computer vision: algorithms and applications. Springer, New York
Kopf J et al (2007) Capturing and viewing gigapixel images. ACM Trans Graph 26(3):93
Roberts LG (1963) Machine perception of three-dimensional solids, DTIC Document
Engel K et al (2006) Real-time volume graphics: AK Peters, Limited
Ritter GX, Wilson JN (1996) Handbook of computer vision algorithms in image algebra, vol 1. Citeseer
Hasan YMY, Karam LJ (2000) Morphological text extraction from images. IEEE Trans Image Process 9(11):1978–1983
Jae-Chang S, Dorai C, Bolle R (1998) Automatic text extraction from video for content-based annotation and retrieval. In: Proceedings of the fourteenth international conference on pattern recognition, 1998
Kim H-K (1996) Efficient automatic text location method and content-based indexing and structuring of video database. J Vis Commun Image Represent 7(4):336–344
Jain AK, Yu BIN (1998) Automatic text location in images and video frames. Pattern Recogn 31(12):2055–2076
Zhong Y, Karu K, Jain AK (1995) Locating text in complex color images. Pattern Recogn 28(10):1523–1535
Wong EK, Chen M (2003) A new robust algorithm for video text extraction. Pattern Recogn 36(6):1397–1406
Shivakumara P, Trung Quy P, Tan CL (2011) A laplacian approach to multi-oriented text detection in video. IEEE Trans Pattern Anal Mach Intell 33(2):412–419
Kim KI, Jung K, Kim JH (2003) Texture-based approach for text detection in images using support vector machines and continuously adaptive mean shift algorithm. IEEE Trans Pattern Anal Mach Intell 25(12):1631–1639
Ye Q et al (2007) Text detection and restoration in natural scene images. J Vis Commun Image Represent 18(6):504–513
Shivakumara P, Trung Quy P, Tan CL (2009) A robust wavelet transform based technique for video text detection. In: ICDAR ‘09. 10th international conference on document analysis and recognition, 2009
Zhong J, Jian W, Yu-Ting S (2009) Text detection in video frames using hybrid features. In: International conference on machine learning and cybernetics, 2009
Zhao M, Li S, Kwok J (2010) Text detection in images using sparse representation with discriminative dictionaries. Image Vis Comput 28(12):1590–1599
Shivakumara P, Trung Quy P, Tan CL (2010) New Fourier-statistical features in RGB space for video text detection. Circ Syst Video Technol IEEE Trans 20(11):1520–1532
Rongrong J et al (2008) Directional correlation analysis of local Haar binary pattern for text detection. In: IEEE international conference on multimedia and expo, 2008
Hanif SM, Prevost L (2007) Text detection in natural scene images using spatial histograms. In: 2nd workshop on camera based document analysis and recognition, Curitiba
Chucai Y, YingLi T (2011) Text detection in natural scene images by Stroke Gabor Words. In: International conference on document analysis and recognition (ICDAR), 2011
Qian X et al (2007) Text detection, localization, and tracking in compressed video. Signal Process Image Commun 22(9):752–768
ZHANG YJ (2002) Image engineering and related publications. Int J Image Graph 02(03):441–452
Haralick RM, Shapiro LG (1985) Image segmentation techniques. Comp Vis Graph Image Process 29(1):100–132
Shapiro LG, Stockman GC (2001) Computer vision. Prentice-Hall, New Jersey, pp 279–325
Otsu N (1975) A threshold selection method from gray-level histograms. Automatica 11(285–296):23–27
Saraf Y (2006) Algorithms for image segmentation, Birla Institute of Technology and Science
Comaniciu D, Meer P (2002) Mean shift: a robust approach toward feature space analysis. Pattern Anal Mach Intell IEEE Trans on 24(5):603–619
Pantofaru C, Hebert M (2005) A comparison of image segmentation algorithms. Robotics Institute, p 336
Felzenszwalb P, Huttenlocher D (2004) Efficient graph-based image segmentation. Int J Comp Vis 59(2):167–181
Shi J, Malik J (2000) Normalized cuts and image segmentation. Pattern Anal Mach Intell IEEE Trans 22(8):888–905
Palma D, Ascenso J, Pereira F (2004) Automatic text extraction in digital video based on motion analysis. In: Campilho A, Kamel M (eds) Image analysis and recognition. Springer, Berlin, pp 588–596
Brox T et al (2004) High accuracy optical flow estimation based on a theory for warping. In: Pajdla T, Matas J (eds) Computer vision – ECCV 2004. Springer, Berlin, pp 25–36
Beauchemin SS, Barron JL (1995) The computation of optical flow. ACM Comput Surv 27(3):433–466
Anandan P (1989) A computational framework and an algorithm for the measurement of visual motion. Int J Comp Vis 2(3):283–310
Weickert J, Schnörr C (2001) A theoretical framework for convex regularizers in PDE-based computation of image motion. Int J Comp Vis 45(3):245–264
Mémin E, Pérez P (2002) Hierarchical estimation and segmentation of dense motion fields. Int J Comp Vis 46(2):129–155
Sun D et al (2008) Learning optical flow. In: Forsyth D, Torr P, Zisserman A (eds) Computer vision – ECCV 2008. Springer, Berlin, pp 83–97
Black MJ, Anandan P (1996) The robust estimation of multiple motions: parametric and piecewise-smooth flow fields. Comp Vis Image Underst 63(1):75–104
Głowacz A, Mikrut Z, Pawlik P (2012) Video detection algorithm using an optical flow calculation method. In: Dziech A, Czyżewski A (eds) Multimedia communications, services and security. Springer, Berlin, pp 118–129
Horn BKP, Schunck BG (1981) Determining optical flow. Artif Intell 17(1–3):185–203
Lucas BD, Kanade T (1981) An iterative image registration technique with an application to stereo vision. In: IJCAI
Kui L et al (2010) Optical flow and principal component analysis-based motion detection in outdoor videos. EURASIP J Adv Signal Proc
Zhao Y et al (2011) Real-time video caption detection. In: Proceedings of the ninth IAPR international workshop on graphics recognition
Gonzalez RC, Woods RE, Eddins SL Digital image processing using matlab. Prentice Hall
Nixon MS, Aguado AS Feature extraction & image processing for computer vision, 3rd edn. Academic
Forsyth DA, Ponce J Computer vision: a modern approach, 2nd edn. Prentice Hall Press
Petrou M, Petrou C Image processing: the fundamentals. Wiley
Haralick RM, Shanmugam K (1973) Its’Hak Dinstein. Textual features for image classification. IEEE Trans Syst Man Cybern 3(6):610–621
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag London
About this chapter
Cite this chapter
Lu, T., Palaiahnakote, S., Tan, C.L., Liu, W. (2014). Video Preprocessing. In: Video Text Detection. Advances in Computer Vision and Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-4471-6515-6_2
Download citation
DOI: https://doi.org/10.1007/978-1-4471-6515-6_2
Published:
Publisher Name: Springer, London
Print ISBN: 978-1-4471-6514-9
Online ISBN: 978-1-4471-6515-6
eBook Packages: Computer ScienceComputer Science (R0)