Skip to main content

Video Automatic Annotation

  • Reference work entry
Encyclopedia of Multimedia

Synonyms

Automatic extraction of the information from video

Definition

Automatic annotation of video refers to the extraction of the information about video automatically, which can serve as the first step for different data access modalities such as browsing, searching, comparison, and categorization.

Introduction

Advances in digital video technology and the ever increasing availability of computing resources have resulted, in the last few years, in an explosion of digital video data. Moreover, the increased availability of Internet bandwidth has defined new means of video distribution, other than physical media. The major web search engines have already started to provide specific services to index, search and retrieve videos on the Internet.

Improving of video accessibility is the true challenge. In fact, access to video data requires that video content is appropriately indexed but manually annotating or tagging video is at best a laborious and economically infeasible process....

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 449.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. N. Dimitrova, H.-J. Zhang, B. Shahraray, I. Sezan, T. Huang, and A. Zakhor. “Applications of Videocontent Analysis and Retrieval,” IEEE Multimedia Magazine, Vol. 12, No. 3, July 2002.

    Google Scholar 

  2. T. Lin and H.J. Zhang, “Automatic Video Scene Extraction by Shot Grouping,” Proceedings of he 15th International Conference on Pattern Recognition. Vol. 4, September 2000, pp. 39–42.

    Google Scholar 

  3. J.S. Boreczky and L.A. Rowe, “Comparison of Video Shot Boundary Detection Techniques,” Proceedings of the IS&T/SPIE Conference Storage and Retrieval for Image and Video Databases IV, Vol. SPIE 2670, 1996, pp. 170–179.

    Google Scholar 

  4. A. Dailianas, R.B. Allen, and P. England, “Comparison of Automatic Video Segmentation Algorithms,” Proceedings of the Integration Issues in Large Commercial Media Delivery Systems, Vol. SPIE 2615, October 1995, pp. 2–16.

    Google Scholar 

  5. U. Gargi, R. Kasturi, and S.H. Strayer. “Performance Characterization of Video-Shot-Change Detection Methods,” IEEE Transactions on Circuits and Systems for Video Technology, Vol. 10, No. 3, February 2000.

    Google Scholar 

  6. S.S. Intille, J.W. Davis, and A.F. Bobick, “Real Time Closed World Tracking,” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1997, pp. 697–703.

    Google Scholar 

  7. A. Elgammal, D. Harwood, and L.S. Davis, “Non Parametric Model for Background Subtraction,” Proceedings of the Seventh IEEE International Conference on Computer Vision, Kerkyra, Greece, September 1999.

    Google Scholar 

  8. S. Pfeiffer, S. Fischer, and W. Effelsberg, “Automatic Audio Content Analysis,” Proceedings of the ACM Multimedia 96, 1996, pp. 21–30.

    Google Scholar 

  9. C.G.M. Snoek and M. Worring. “Multimodal Video Indexing: A Review of the State-of-the-Art,” Multimedia Tools and Applications, Vol. 25, No. 1, January 2005, pp. 5–35.

    Google Scholar 

  10. P. Viola and M. Jones, “Rapid Object Detection Using a Boosted Cascade of Simple Features,” Proceedings of the Computer Vision and Pattern Recognition (CVPR'01), 2001.

    Google Scholar 

  11. M.H. Yang, D.J. Kriegman, and N. Ahuja, “Detecting Faces in Images: A Survey,” IEEE Transactions on Pattern Analysis and Machine, Vol. 24, No. 1, January 2002, pp. 34–58.

    Google Scholar 

  12. W. Zhao, R. Chellappa, P.J. Phillips, and A. Rosenfeld, “Face Recognition: A Literature Survey,” ACM Computing Surveys, Vol. 35, No. 4, December 2003, pp. 309–459.

    Google Scholar 

  13. T. Sato, T. Kanade, E. Hughes, and M. Smith. “Video OCR for Digital News Archives,” Proceedings of the IEEE Workshop on Content-Based Access of Image and Video Databases (CAIVD'98), Bombay, India, January 1998.

    Google Scholar 

  14. R. Lienhart, “Video OCR: A Survey and Practitioner's Guide,” in A. Rosenfeld, D. Doermann, and D. DeMenthon (Eds.), “Video Mining,” Kluwer Academic, 2003, pp. 155–183.

    Google Scholar 

  15. L. Agnihotri, K.V. Devara, T. McGee, and N. Dimitrova, “Summarization of Video Programs Based on Closed Captions,” Proceedings of the SPIE, Vol. 4315, Storage and Retrieval for Media Databases, 2001, pp. 599–607.

    Google Scholar 

  16. S. Eickeler and S. Muller, “Content-based Video Indexing of TV Broadcast News Using Hidden Markov Models,” Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP' 99, Vol. 6, March 1999, pp. 2997–3000.

    Google Scholar 

  17. A. Hauptmann, D. Ng, R. Baron, M-Y Chen, M. Christel, S. Duygulu, C. Huang, W-H. Lin, H. Wactlar, N. Moraveji, N. Papernick, C.G.M. Snoek, G. Tzanetakis, J. Yang, R. Yan, and R. Jin, “Informedia at TRECVID 2003: Analyzing and Searching Broadcast News Video,” Proceedings of TREC 2003, Gaithersburg, MD, November 2003.

    Google Scholar 

  18. J. Assfalg, M. Bertini, C. Colombo, and A. Del Bimbo, “Semantic Annotation of Sports Videos,” IEEE Multimedia, Vol. 9 No. 2, April/June 2002, pp. 52–60.

    Google Scholar 

  19. J. Assfalg, M. Bertini, C. Colombo, A. Del Bimbo, and W. Nunziati, “Semantic Annotation of Soccer Videos: Automatic Highlights Identification,” Computer Vision and Image Understanding, Vol. 92, No. 2–3, November/December 2003, pp. 285–305.

    Google Scholar 

  20. A. Ekin, A.M. Tekalp, and R. Mehrotra, “Automatic Soccer Video Analysis and Summarization,” IEEE Transactions on Image Processing, Vol. 12, No. 7, July 2003, pp. 796–807.

    Google Scholar 

  21. Z. Rasheed, Y. Sheikh, and M. Shah, “On the Use of Computable Features for Film Classification,” IEEE Transactions on Circuits and Systems for Video, Vol. 15, No. 1, January 2005, pp. 52–64.

    Google Scholar 

  22. B. Lehane, N. O'Connor, and N. Murphy, “Action Sequence Detection in Motion Pictures,” Proceedings of the European Workshop on the Integration of Knowledge, Semantics and Digital Media Technology, London, UK, November 2004.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag

About this entry

Cite this entry

Bimbo, A.D., Bertini, M. (2008). Video Automatic Annotation. In: Furht, B. (eds) Encyclopedia of Multimedia. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-78414-4_238

Download citation

Publish with us

Policies and ethics