Fast and Robust Short Video Clip Search for Copy Detection

Yuan, Junsong; Duan, Ling-Yu; Tian, Qi; Ranganath, Surendra; Xu, Changsheng

doi:10.1007/978-3-540-30542-2_59

Junsong Yuan^19,20,
Ling-Yu Duan¹⁹,
Qi Tian¹⁹,
Surendra Ranganath²⁰ &
…
Changsheng Xu¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3332))

Included in the following conference series:

Pacific-Rim Conference on Multimedia

837 Accesses
4 Citations
3 Altmetric

Abstract

Query by video clip (QVC) has attracted wide research interests in multimedia information retrieval. In general, QVC may include feature extraction, similarity measure, database organization, and search or query scheme. Towards an effective and efficient solution, diverse applications have different considerations and challenges on the abovementioned phases. In this paper, we firstly attempt to broadly categorize most existing QVC work into 3 levels: concept based video retrieval, video title identification, and video copy detection. This 3-level categorization is expected to explicitly identify typical applications, robust requirements, likely features, and main challenges existing between mature techniques and hard performance requirements. A brief survey is presented to concretize the QVC categorization. Under this categorization, in this paper we focus on the copy detection task, wherein the challenges are mainly due to the design of compact and robust low level features (i.e. an effective signature) and a kind of fast searching mechanism. In order to effectively and robustly characterize the video segments of variable lengths, we design a novel global visual feature (a fixed-size 144-d signature) combining the spatial-temporal and the color range information. Different from previous key frame based shot representation, the ambiguity of key frame selection and the difficulty of detecting gradual shot transition could be avoided. Experiments have shown the signature is also insensitive to color shifting and variations from video compression. As our feature can be extracted directly from MPEG compressed domain, lower computational cost is required. In terms of fast searching, we employ the active search algorithm. Combining the proposed signature and the active search, we have achieved an efficient and robust solution for video copy detection. For example, we can search for a short video clip among the 10.5 hours MPEG-1 video database in merely 2 seconds in the case of unknown query length, and in 0.011 second when fixing the query length as 10 seconds.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

http://www-nlpir.nist.gov/projects/trecvid/ Web site (2004)
Sebe, N., et al.: The state of the art in image and video retrieval. In: Proc. of CIVR 2003 (2003)
Google Scholar
Jain, A.K., et al.: Query by video clip. Multimedia System 7, 369–384 (1999)
Article Google Scholar
DeMenthon, D., et al.: Video retrieval using spatio-temporal descriptors. In: Proc. of ACM Multimedia 2003, pp. 508–517 (2003)
Google Scholar
Cho, C.-Y., et al.: Efficient motion-vector-based video search using query by clip. In: Proc. of ICME 2004, Taiwan (2004)
Google Scholar
Duan, L.-Y., et al.: A unified framework for semantic shot classification in sports video. In: IEEE Transaction on Multimedia (2004) (to appear)
Google Scholar
Duan, L.-Y., et al.: Mean shift based video segment representation and applications to replay detection. In: Proc. of ICASSP 2004, pp. 709–712 (2004)
Google Scholar
Duan, L.-Y., et al.: A Mid-level Representation Framework for Semantic Sports Video Analysis. In: Proc. of ACM Multimedia 2003, pp. 33–44 (2003)
Google Scholar
Zhang, D.-Q., et al.: Detection image near-duplicate by stochastic attribute relational graph matching with learning. In: Proc. of ACM Multimedia 2004, NewYork (2004)
Google Scholar
Jaimes, A., Chang, S.-F., Loui, A.C.: Detection of non-identical duplicate consumer photographs. In: Proc. of PCM 2003, Singapore (2003)
Google Scholar
Cheung, S., Zakhor, A.: Efficient video similarity measurement with video signature. IEEE Trans. on Circuits and System for Video Technology 13, 59–74 (2003)
Article Google Scholar
Cheung, S.-C., Zakhor, A.: Fast similarity search and clustering of video sequences on the world-wide-web. In: IEEE Transactions on Multimedia (2004) (to appear)
Google Scholar
Chen, L., Chua, T.S.: A match and tiling approach to content-based video retrieval. In: Proc. of ICME 2001, pp. 301–304 (2001)
Google Scholar
Kulesh, V., et al.: Video clip recognition using joint audio-visual processing model. In: Proc. of ICPR 2002, vol. 1, pp. 500–503 (2002)
Google Scholar
Naphade, M.R., et al.: A Novel Scheme for Fast and Efficient Video Sequence Matching Using Compact Signatures. In: Proc. SPIE, Storage and Retrieval for Media Databases 2000, vol. 3972, pp. 564–572 (2000)
Google Scholar
Hampapur, A., Hyun, K., Bolle, R.: Comparison of Sequence Matching Techniques for Video Copy Detection. In: SPIE. Storage and Retrieval for Media Databases 2002, San Jose, CA, USA, vol. 4676, pp. 194–201 (January 2002)
Google Scholar
Kashino, K., et al.: A Quick Search Method forAudio andVideo Signals Based on Histogram Pruning. IEEE Trans. on Multimedia 5(3), 348–357 (2003)
Article Google Scholar
Kashino, K., et al.: Aquick video search method based on local and global feature clustering. In: Proc. of ICPR 2004, Cambridge, UK (August 2004)
Google Scholar
Ferman, A.M., et al.: Robust color histogram descriptors for video segment retrieval and identification. IEEE Trans. on Image Processing 1(5) (May 2002)
Google Scholar
Joly, A., Frelicot, C., Buisson, O.: Robust content-based video copy identification in a large reference database. In: Bakker, E.M., Lew, M., Huang, T.S., Sebe, N., Zhou, X.S. (eds.) CIVR 2003. LNCS, vol. 2728, pp. 414–424. Springer, Heidelberg (2003)
Chapter Google Scholar
Pua, K.M., et al.: Real time repeated video sequence identification. Journal of Computer Vision and Image Understanding 93, 310–327 (2004)
Article Google Scholar
Hoad, T.C., et al.: Fast video matching with signature alignment. In: SIGIR Multimedia Information Retrieval Workshop 2003 (MIR 2003), Toronto, pp. 263–269 (2003)
Google Scholar
Kasutani, E., et al.: An adaptive feature comparison method for real-time video identification. In: Proc. of ICIP 2003 (2003)
Google Scholar
Diakopoulos, N., et al.: Temporally Tolerant Video Matching. In: SIGIR Multimedia Information Retrieval Workshop 2003 (MIR 2003), Toronto, Canada (August 2003)
Google Scholar
Yuan, J., et al.: Fast and Robust Short Video Clip Search Using an Index Structure. In: ACM Multimedia Workshop on Multimedia Information Retrieval, MIR 2004 (2004)
Google Scholar
Yuan, J., et al.: Fast and Robust Search Method for Short Video Clips from Large Video Collection. In: Proc. of ICPR 2004, Cambridge, UK (August 2004)
Google Scholar
Kim, S.H., Park, R.-H.: An efficient algorithm for video sequence matching using the modified Hausdorff distance and the directed divergence. IEEE Trans. On Circuits and Systems for Video Technology 12, 592–596 (2002)
Article Google Scholar
Lienhart, R., et al.: VisualGREP: A Systematic method to compare and retrieve video sequences. In: SPIE. storage and Retrieval fro Image and Video Database VI, vol. 3312 (1998)
Google Scholar
Oostveen, J., et al.: Feature extraction and a database strategy for video fingerprinting. In: Chang, S.-K., Chen, Z., Lee, S.-Y. (eds.) VISUAL 2002. LNCS, vol. 2314, pp. 117–128. Springer, Heidelberg (2002)
Chapter Google Scholar
Fan, J., et al.: Classview: hierarchical video shot classification, indexing and accessing. IEEE Trans. on Multimedia 6(1) (February 2004)
Google Scholar
Hoi, C.-H., et al.: A novel scheme for video similarity detection. In: Bakker, E.M., Lew, M., Huang, T.S., Sebe, N., Zhou, X.S. (eds.) CIVR 2003. LNCS, vol. 2728, pp. 373–382. Springer, Heidelberg (2003)
Chapter Google Scholar
Kimura, A., et al.: A Quick Search Method for Multimedia Signals Using Feature Compression Based on Piecewise Linear Maps. In: Proc. of ICASSP 2002 (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute for Infocomm Research, 21 Heng Mui Keng Terrace, Singapore, 119613
Junsong Yuan, Ling-Yu Duan, Qi Tian & Changsheng Xu
Department of Electrical and Computer Engineering, National University of Singapore, Singapore
Junsong Yuan & Surendra Ranganath

Authors

Junsong Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Ling-Yu Duan
View author publications
You can also search for this author in PubMed Google Scholar
Qi Tian
View author publications
You can also search for this author in PubMed Google Scholar
Surendra Ranganath
View author publications
You can also search for this author in PubMed Google Scholar
Changsheng Xu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Information and Communication Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, 113-8656, Tokyo, Japan
Kiyoharu Aizawa
Tokyo Research Laboratory, IBM Research, 1623-14 Shimo-tsuruma, 242-0001, Yamato, Kanagawa, Japan
Yuichi Nakamura
National Institute of Informatics, Tokyo, Japan
Shin’ichi Satoh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yuan, J., Duan, LY., Tian, Q., Ranganath, S., Xu, C. (2004). Fast and Robust Short Video Clip Search for Copy Detection. In: Aizawa, K., Nakamura, Y., Satoh, S. (eds) Advances in Multimedia Information Processing - PCM 2004. PCM 2004. Lecture Notes in Computer Science, vol 3332. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30542-2_59

Download citation

DOI: https://doi.org/10.1007/978-3-540-30542-2_59
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23977-2
Online ISBN: 978-3-540-30542-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics