Skip to main content

Fast and Robust Short Video Clip Search for Copy Detection

  • Conference paper
Advances in Multimedia Information Processing - PCM 2004 (PCM 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3332))

Included in the following conference series:

Abstract

Query by video clip (QVC) has attracted wide research interests in multimedia information retrieval. In general, QVC may include feature extraction, similarity measure, database organization, and search or query scheme. Towards an effective and efficient solution, diverse applications have different considerations and challenges on the abovementioned phases. In this paper, we firstly attempt to broadly categorize most existing QVC work into 3 levels: concept based video retrieval, video title identification, and video copy detection. This 3-level categorization is expected to explicitly identify typical applications, robust requirements, likely features, and main challenges existing between mature techniques and hard performance requirements. A brief survey is presented to concretize the QVC categorization. Under this categorization, in this paper we focus on the copy detection task, wherein the challenges are mainly due to the design of compact and robust low level features (i.e. an effective signature) and a kind of fast searching mechanism. In order to effectively and robustly characterize the video segments of variable lengths, we design a novel global visual feature (a fixed-size 144-d signature) combining the spatial-temporal and the color range information. Different from previous key frame based shot representation, the ambiguity of key frame selection and the difficulty of detecting gradual shot transition could be avoided. Experiments have shown the signature is also insensitive to color shifting and variations from video compression. As our feature can be extracted directly from MPEG compressed domain, lower computational cost is required. In terms of fast searching, we employ the active search algorithm. Combining the proposed signature and the active search, we have achieved an efficient and robust solution for video copy detection. For example, we can search for a short video clip among the 10.5 hours MPEG-1 video database in merely 2 seconds in the case of unknown query length, and in 0.011 second when fixing the query length as 10 seconds.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. http://www-nlpir.nist.gov/projects/trecvid/ Web site (2004)

  2. Sebe, N., et al.: The state of the art in image and video retrieval. In: Proc. of CIVR 2003 (2003)

    Google Scholar 

  3. Jain, A.K., et al.: Query by video clip. Multimedia System 7, 369–384 (1999)

    Article  Google Scholar 

  4. DeMenthon, D., et al.: Video retrieval using spatio-temporal descriptors. In: Proc. of ACM Multimedia 2003, pp. 508–517 (2003)

    Google Scholar 

  5. Cho, C.-Y., et al.: Efficient motion-vector-based video search using query by clip. In: Proc. of ICME 2004, Taiwan (2004)

    Google Scholar 

  6. Duan, L.-Y., et al.: A unified framework for semantic shot classification in sports video. In: IEEE Transaction on Multimedia (2004) (to appear)

    Google Scholar 

  7. Duan, L.-Y., et al.: Mean shift based video segment representation and applications to replay detection. In: Proc. of ICASSP 2004, pp. 709–712 (2004)

    Google Scholar 

  8. Duan, L.-Y., et al.: A Mid-level Representation Framework for Semantic Sports Video Analysis. In: Proc. of ACM Multimedia 2003, pp. 33–44 (2003)

    Google Scholar 

  9. Zhang, D.-Q., et al.: Detection image near-duplicate by stochastic attribute relational graph matching with learning. In: Proc. of ACM Multimedia 2004, NewYork (2004)

    Google Scholar 

  10. Jaimes, A., Chang, S.-F., Loui, A.C.: Detection of non-identical duplicate consumer photographs. In: Proc. of PCM 2003, Singapore (2003)

    Google Scholar 

  11. Cheung, S., Zakhor, A.: Efficient video similarity measurement with video signature. IEEE Trans. on Circuits and System for Video Technology 13, 59–74 (2003)

    Article  Google Scholar 

  12. Cheung, S.-C., Zakhor, A.: Fast similarity search and clustering of video sequences on the world-wide-web. In: IEEE Transactions on Multimedia (2004) (to appear)

    Google Scholar 

  13. Chen, L., Chua, T.S.: A match and tiling approach to content-based video retrieval. In: Proc. of ICME 2001, pp. 301–304 (2001)

    Google Scholar 

  14. Kulesh, V., et al.: Video clip recognition using joint audio-visual processing model. In: Proc. of ICPR 2002, vol. 1, pp. 500–503 (2002)

    Google Scholar 

  15. Naphade, M.R., et al.: A Novel Scheme for Fast and Efficient Video Sequence Matching Using Compact Signatures. In: Proc. SPIE, Storage and Retrieval for Media Databases 2000, vol. 3972, pp. 564–572 (2000)

    Google Scholar 

  16. Hampapur, A., Hyun, K., Bolle, R.: Comparison of Sequence Matching Techniques for Video Copy Detection. In: SPIE. Storage and Retrieval for Media Databases 2002, San Jose, CA, USA, vol. 4676, pp. 194–201 (January 2002)

    Google Scholar 

  17. Kashino, K., et al.: A Quick Search Method forAudio andVideo Signals Based on Histogram Pruning. IEEE Trans. on Multimedia 5(3), 348–357 (2003)

    Article  Google Scholar 

  18. Kashino, K., et al.: Aquick video search method based on local and global feature clustering. In: Proc. of ICPR 2004, Cambridge, UK (August 2004)

    Google Scholar 

  19. Ferman, A.M., et al.: Robust color histogram descriptors for video segment retrieval and identification. IEEE Trans. on Image Processing 1(5) (May 2002)

    Google Scholar 

  20. Joly, A., Frelicot, C., Buisson, O.: Robust content-based video copy identification in a large reference database. In: Bakker, E.M., Lew, M., Huang, T.S., Sebe, N., Zhou, X.S. (eds.) CIVR 2003. LNCS, vol. 2728, pp. 414–424. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  21. Pua, K.M., et al.: Real time repeated video sequence identification. Journal of Computer Vision and Image Understanding 93, 310–327 (2004)

    Article  Google Scholar 

  22. Hoad, T.C., et al.: Fast video matching with signature alignment. In: SIGIR Multimedia Information Retrieval Workshop 2003 (MIR 2003), Toronto, pp. 263–269 (2003)

    Google Scholar 

  23. Kasutani, E., et al.: An adaptive feature comparison method for real-time video identification. In: Proc. of ICIP 2003 (2003)

    Google Scholar 

  24. Diakopoulos, N., et al.: Temporally Tolerant Video Matching. In: SIGIR Multimedia Information Retrieval Workshop 2003 (MIR 2003), Toronto, Canada (August 2003)

    Google Scholar 

  25. Yuan, J., et al.: Fast and Robust Short Video Clip Search Using an Index Structure. In: ACM Multimedia Workshop on Multimedia Information Retrieval, MIR 2004 (2004)

    Google Scholar 

  26. Yuan, J., et al.: Fast and Robust Search Method for Short Video Clips from Large Video Collection. In: Proc. of ICPR 2004, Cambridge, UK (August 2004)

    Google Scholar 

  27. Kim, S.H., Park, R.-H.: An efficient algorithm for video sequence matching using the modified Hausdorff distance and the directed divergence. IEEE Trans. On Circuits and Systems for Video Technology 12, 592–596 (2002)

    Article  Google Scholar 

  28. Lienhart, R., et al.: VisualGREP: A Systematic method to compare and retrieve video sequences. In: SPIE. storage and Retrieval fro Image and Video Database VI, vol. 3312 (1998)

    Google Scholar 

  29. Oostveen, J., et al.: Feature extraction and a database strategy for video fingerprinting. In: Chang, S.-K., Chen, Z., Lee, S.-Y. (eds.) VISUAL 2002. LNCS, vol. 2314, pp. 117–128. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  30. Fan, J., et al.: Classview: hierarchical video shot classification, indexing and accessing. IEEE Trans. on Multimedia 6(1) (February 2004)

    Google Scholar 

  31. Hoi, C.-H., et al.: A novel scheme for video similarity detection. In: Bakker, E.M., Lew, M., Huang, T.S., Sebe, N., Zhou, X.S. (eds.) CIVR 2003. LNCS, vol. 2728, pp. 373–382. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  32. Kimura, A., et al.: A Quick Search Method for Multimedia Signals Using Feature Compression Based on Piecewise Linear Maps. In: Proc. of ICASSP 2002 (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yuan, J., Duan, LY., Tian, Q., Ranganath, S., Xu, C. (2004). Fast and Robust Short Video Clip Search for Copy Detection. In: Aizawa, K., Nakamura, Y., Satoh, S. (eds) Advances in Multimedia Information Processing - PCM 2004. PCM 2004. Lecture Notes in Computer Science, vol 3332. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30542-2_59

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30542-2_59

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-23977-2

  • Online ISBN: 978-3-540-30542-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics