Automatic Interactive Video Authoring Method via Object Recognition

  • Conference paper
Intelligent Information and Database Systems (ACIIDS 2017)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 10191)

Abstract

Interactive video is a type of video that lets viewers interact with it to obtain video-related information or to participate in the video content. However, authoring interactive video content is time-consuming. Many researchers have proposed methods and features to reduce this effort, but these methods are still too complicated to use and need to be automated. In this paper, we propose an automatic interactive video authoring method via object recognition. The proposed method uses deep-learning-based object recognition and an NLP-based keyword extraction method to annotate objects. To evaluate the method, we manually annotated the objects in selected video clips and compared the output of the proposed method with the manual annotations. The method achieved an accuracy rate of 43.16% for the whole process. This method allows authors to create interactive videos easily.
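As a rough illustration of the evaluation described in the abstract, the following sketch computes an accuracy rate by comparing automatically produced object annotations against manual ones. The matching criterion is not given in the abstract, so the case-insensitive exact label match used here is an assumption, as are the function name `annotation_accuracy`, the object identifiers, and the sample labels. None of them come from the paper.

```python
from typing import Dict


def annotation_accuracy(manual: Dict[str, str], automatic: Dict[str, str]) -> float:
    """Return the fraction of manually annotated objects whose automatic
    label matches the manual label (case-insensitive, whitespace-trimmed).

    Assumption: an object missing from `automatic` counts as incorrect.
    """
    if not manual:
        raise ValueError("manual annotations must not be empty")
    correct = sum(
        1
        for obj_id, label in manual.items()
        if automatic.get(obj_id, "").strip().lower() == label.strip().lower()
    )
    return correct / len(manual)


# Hypothetical ground truth: object id -> manually assigned label.
manual = {
    "clip1_obj1": "Laptop",
    "clip1_obj2": "Coffee cup",
    "clip2_obj1": "Guitar",
    "clip2_obj2": "Sofa",
}

# Hypothetical output of an automatic pipeline for the same clips.
automatic = {
    "clip1_obj1": "laptop",   # matches after case folding
    "clip1_obj2": "mug",      # wrong label
    "clip2_obj1": "Guitar",   # matches
    # clip2_obj2 was not detected, so it counts as incorrect
}

accuracy = annotation_accuracy(manual, automatic)
print(f"Accuracy: {accuracy:.2%}")
```

With these made-up inputs, two of the four manually annotated objects are matched. A whole-process figure such as the one reported in the abstract would be obtained by applying the same kind of ratio to every annotated object in the evaluation clips, under whatever matching rule the authors actually used.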

Author information

Corresponding author

Correspondence to Geun-Sik Jo.

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Yoon, UN., Hong, MD., Jo, GS. (2017). Automatic Interactive Video Authoring Method via Object Recognition. In: Nguyen, N., Tojo, S., Nguyen, L., Trawiński, B. (eds) Intelligent Information and Database Systems. ACIIDS 2017. Lecture Notes in Computer Science, vol 10191. Springer, Cham. https://doi.org/10.1007/978-3-319-54472-4_55

  • DOI: https://doi.org/10.1007/978-3-319-54472-4_55

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-54471-7

  • Online ISBN: 978-3-319-54472-4

  • eBook Packages: Computer Science, Computer Science (R0)
