Combining Audio-Based and Video-Based Shot Classification Systems for News Videos Segmentation

De Santo, M.; Percannella, G.; Sansone, C.; Vento, M.

doi:10.1007/11494683_40

M. De Santo²⁰,
G. Percannella²⁰,
C. Sansone²¹ &
…
M. Vento²⁰

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3541))

Included in the following conference series:

International Workshop on Multiple Classifier Systems

1871 Accesses
2 Citations

Abstract

In this paper we propose an innovative combination strategy for a system using video and audio stream of a news video to automatically segment it into stories. In our approach, the segmentation is performed in two steps: first, shots are classified by combining three different anchor shot detection algorithms using video information only. Then, the shot classification is improved by using a novel anchor shot detection method based on features extracted from the audio track.

Experimental results demonstrate that the combined use of audio and video allows our system to perform better than approaches based only on video information in terms of both shot classification and news story segmentation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Hanjalic, A., Lagendijk, R.L., Biemond, J.: Semi-Automatic News Analysis, Indexing, and Classification System Based on Topics Preselection. In: Proc. of SPIE: Electronic Imaging: Storage and Retrieval of Image and Video Databases, San Jose, CA (1999)
Google Scholar
Gao, X., Tang, X.: Unsupervised Video-Shot Segmentation and Model-Free Anchorperson Detection for News Video Story Parsing. IEEE Transactions on Circuits and Systems for Video Technology 12(9), 765–776 (2002)
Article Google Scholar
Bertini, M., Del Bimbo, A., Pala, P.: Content-based indexing and retrieval of TV News. Pattern Recognition Letters 22, 503–516 (2001)
Article MATH Google Scholar
Wei, W., Gao, W.: Automatic Segmentation of News Items Based on Video and Audio Features. Journal of Computer Science and Technology 17(2), 189–195 (2002)
Article Google Scholar
De Santo, M., Percannella, G., Sansone, C., Vento, M.: Combining experts for anchorperson shot detection in news videos. Pattern Analysis and Applications 7(4) (2004) (in press – online first: DOI: 10.1007/s10044-004-0227-0)
Google Scholar
De Santo, M., Percannella, G., Sansone, C., Vento, M.: A Multi-Expert Approach for Shot Classification in News Videos. In: Campilho, A.C., Kamel, M.S. (eds.) ICIAR 2004. LNCS, vol. 3211, pp. 564–571. Springer, Heidelberg (2004)
Chapter Google Scholar
Huang, Y.S., Suen, C.Y.: A Method of Combining Multiple Experts for the Recognition of Unconstrained Handwritten Numerals. IEEE Transactions on Pattern Analysis and Machine Intelligence 17(1), 90–94 (1995)
Article Google Scholar
Sansone, C., Tortorella, F., Vento, M.: A Classification Reliability Driven Reject Rule for Multi-Expert Systems. International Journal of Pattern Recognition and Artificial Intelligence 15(6), 885–904 (2001)
Article Google Scholar
Cordella, L.P., Foggia, P., Sansone, C., Vento, M.: A Real-Time Text-Independent Speaker Identification System. In: Proceedings of the 12th International Conference on Image Analysis and Processing, Mantova, September 17-19, pp. 632–637 (2003)
Google Scholar
Xu, L., Krzyzak, A., Suen, C.Y.: Methods of Combining Multiple Classifiers and Their Application to Handwritten Numeral Recognition. IEEE Trans. on Systems, Man and Cybernetics 22(3), 418–435 (1992)
Article Google Scholar
Xu, L., Krzyzak, A., Oja, E.: Rival Penalized Competitive Learning for Clustering Analysis, RBF net and Curve Detection. IEEE Trans. on Neural Networks 4, 636–649 (1993)
Article Google Scholar
Gargi, U., Kasturi, R., Strayer, S.H.: Performance Characterization of Video-Shot-Change Detection Methods. IEEE Trans. on Circuits and Systems for Video Technology 10(1), 1–13 (2000)
Article Google Scholar
Chaisorn, L., Chua, T.-S., Lee, C.-H.: A Multi-Modal Approach to Story Segmentation for News Video. World Wide Web 6, 187–208 (2003)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Dip. di Ingegn. dell’Informazione ed Ingegn. Elettrica, Università degli Studi di Salerno, Via Ponte Don Melillo, I, I-84084, Fisciano, (SA), Italy
M. De Santo, G. Percannella & M. Vento
Dipartimento di Informatica e Sistemistica, Università degli Studi di Napoli “Federico II”, Via Claudio 21, I-80125, Napoli, Italy
C. Sansone

Authors

M. De Santo
View author publications
You can also search for this author in PubMed Google Scholar
G. Percannella
View author publications
You can also search for this author in PubMed Google Scholar
C. Sansone
View author publications
You can also search for this author in PubMed Google Scholar
M. Vento
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

NASA Ames Research Center, Mail Stop 269-1, 94035-1000, Moffett Field, CA, USA
Nikunj C. Oza
Signal Processing and Pattern Recognition Laboratory, Electrical and Computer Engineering, Rowan University, 08028, Glassboro, NJ, USA
Robi Polikar
Centre for Vision, Speech and Signal Processing, University of Surrey, GU2 7XH, Guildford, UK
Josef Kittler
University of Cagliari, Department of Electrical and Electronic Engineering, Piazza d’Armi, 09123, Cagliari, Italy
Fabio Roli

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

De Santo, M., Percannella, G., Sansone, C., Vento, M. (2005). Combining Audio-Based and Video-Based Shot Classification Systems for News Videos Segmentation. In: Oza, N.C., Polikar, R., Kittler, J., Roli, F. (eds) Multiple Classifier Systems. MCS 2005. Lecture Notes in Computer Science, vol 3541. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11494683_40

Download citation

DOI: https://doi.org/10.1007/11494683_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26306-7
Online ISBN: 978-3-540-31578-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics