Abstract
This paper presents a method to automatically detect slide changes in lecture videos. For accurate detection, the regions capturing slide images are first identified from video frames. Then, SIFT features are extracted from the regions, which are invariant to image scaling and rotation. These features are used to compare similarity between frames. If the similarity is smaller than a threshold, slide transition is detected. The threshold is estimated based on the mean and standard deviation of sample frames’ similarities. Using this method, high detection accuracy can be obtained without any supplementary slide images. The proposed method also supports detection of backward slide transitions that occur when a speaker returns to a previous slide to emphasize its contents. In experiments conducted on our test collection, the proposed method showed 87 % accuracy in forward transition detection and 86 % accuracy in backward transition detection.
Similar content being viewed by others
Notes
1 We only consider f i,j satisfying 1≤i < j≤k, because f i,j is equal to f j,i , and f i,i is meaningless.
References
Cernekova Z, Pitas I, Nikow C (2006) Information theory-based shot cut/fade detection and video summarization. IEEE Trans Circ Syst Video Tech 16(1): 82–91
Che X, Yang H, Meinel C (2013) Lecture video segmentation by automatically analyzing the synchronized slides. In: Proceedings of the 21st ACM international conference on multimedia, ACMMM’13. ACM, pp 345–348
Cooper M, Liu T, Rieffel E (2007) Video segmentation via temporal pattern classification. IEEE Trans Multimed 9(3): 610–618
Fan Q, Barnard K, Amir A, Efrat A, Lin M (2006) Matching slides to presentation videos using SIFT and scene background matching. In: Proceedings of the 8th ACM international workshop on multimedia information retrieval, MIR’06. ACM, pp 239–248
Fan Q, Amir A, Barnard K, Swaminathan R, Efrat A (2007) Temporal modeling of slide change in presentation videos. In: Proceedings of the IEEE internation conference on acoustics, speech and signal processing, ICASSP’. IEEE, pp 989–992
Hu W et al. (2001) A survey on visual content-based Video indexing and retrieval. IEEE Trans Syst Man Cybern C Appl Rev 41(6): 797–819
Lowe D G (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2): 91–110
Li D, Lu H (2000) Model based video segmentation. In: IEEE workshop on signal processing systems, SiPS 2000. IEEE, pp 120–129
Ma D, Agam G (2012) Lecture video segmentation and indexing. In: SPIE 8297, document recognition and retrieval XIX, 82970V
Mukhopadhyay S, Smith B (1999) Passive capture and structuring of lectures. In: Proceedings of the 7th ACM international conference on multimedia, ACMMM’99. ACM, New York, pp 477–487
Ngo C W, Pong T C, Chin R T (1999) Detection of gradual transitions through temporal slice analysis. In: IEEE computer society conference on computer vision and pattern recognition (vol 1)
Ngo C W, Pong T C, Huang TS (2002) Detection of slide transition for topic indexing. In: Proceedings of the IEEE international conference on multimedia and expo, ICME’02. IEEE, pp 533-536
Ngo C W, Wang F, Pong T C (2003) Structuring lecture videos for distance learning applications. In: Proceedings of the IEEE 5th international symposium on multimedia software engineering, MMSE’03. IEEE, pp 215–222
Otzu N (1979) A threshold selection method from gray-level histograms. IEEE Trans Syst Man Cybern 9(1): 62–66
Porter S (2004) Video segmentation and indexing using motion estimation. University of Bristol, Dissertation
Powers D M W (2011) Evaluation: from precision, recall and f-measure to roc., informedness, markedness & correlation. J Mach Lear Tech 2(1): 37–63
Shafait F, Keysers D, Breuel T M (2008) Efficient implementation of local adaptive thresholding techniques using integral images SPIE 6815, document recognition and retrieval XV, 681510
Smeaton A F, Over P, Doherty A R (2010) Video shot boundary detection: sevent years of TRECVid activity. Comput Vis Image Und 114(4): 411–418
Wang X, Kankanhalli M (2009) Robust alignment of presentation videos with slides. In: Proceedings of the 10th pacific rim conference on multimedia, PCM’09. Springer pp 311–322
Wu X et al. (2008) Shot boundary detection: an information saliency approach. In: Congress on the image and signal processing, CISP’08. pp 808-812
Xia D, Deng X, Zeng Q (2007) Shot boundary detection based on difference sequences of mutual information. In: Proceedings of the image and graphics, ICIG’07. pp 389-394
Zhao ZC, Cai AN, 2006Shot boundary detection algorithm in compressed domain based on ababoost and fuzzy theory. In: Proceedings of the international conference on advances in natural computation, ICNC’06. pp 617-626
Acknowledgments
This research was supported by the MSIP(Ministry of Science, ICT and Future Planning) of Korea under the ITRC support program(NIPA-2013-H0301-13-4009), and the National Research Foundation of Korea grant funded by the Korea government(MEST) (No. 2012R1A2A2A01046694).
This paper was supported by the Sahmyook University Research Fund in 2013.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Jeong, H.J., Kim, TE., Kim, H.G. et al. Automatic detection of slide transitions in lecture videos. Multimed Tools Appl 74, 7537–7554 (2015). https://doi.org/10.1007/s11042-014-1990-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-014-1990-6