Abstract
Exploring the content of a video is typically inefficient because of the linear, streamed nature of the medium. A video can be seen as a combination of features: the visual track, the audio track, a transcription of the spoken words, and so on. These features may be viewed as a set of temporally bounded, parallel modalities. It is our contention that these modalities and the features derived from them can be presented individually, or in discrete combinations, to allow deeper and more effective content exploration within different parts of a video. This paper presents a novel system for video exploration and reports a recent user study conducted to learn usage patterns when video content is offered in alternative representations. The learned usage patterns may be utilized to build a template-driven representation engine that draws on these features to offer a multimodal synopsis of a video, which may lead to more efficient exploration of video content.
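To make the idea of "temporally bounded parallel modalities" concrete, the following is a minimal illustrative sketch (not from the paper; all names such as `Segment`, `ModalityTrack`, and `between` are hypothetical) of a data model in which each modality is a track of time-stamped segments, and a synopsis is assembled by selecting segments from chosen modalities over a time interval:

```python
from dataclasses import dataclass, field

@dataclass
class Segment:
    start: float   # seconds from the start of the video
    end: float     # seconds from the start of the video
    content: str   # e.g. a transcript snippet or a shot/slide label

@dataclass
class ModalityTrack:
    name: str                              # e.g. "visual", "audio", "transcript"
    segments: list = field(default_factory=list)

    def between(self, t0: float, t1: float) -> list:
        """Return segments overlapping the interval [t0, t1]."""
        return [s for s in self.segments if s.start < t1 and s.end > t0]

# Two parallel modality tracks for the same video.
transcript = ModalityTrack("transcript", [
    Segment(0.0, 4.5, "Welcome to the talk"),
    Segment(4.5, 9.0, "Today we discuss video exploration"),
])
visual = ModalityTrack("visual", [Segment(0.0, 9.0, "slide: title")])

# A multimodal synopsis of one part of the video: segments from
# selected modalities restricted to the interval [4.0, 9.0].
synopsis = {t.name: t.between(4.0, 9.0) for t in (transcript, visual)}
```

In a template-driven engine of the kind the abstract envisions, a template would decide which tracks to query and how to render the selected segments (e.g. transcript text alongside keyframes) for a given part of the video.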
Acknowledgement
This research is supported by Science Foundation Ireland through the CNGL Programme (Grant 12/CE/I2267) in the ADAPT Centre at School of Computer Science and Statistics, Trinity College Dublin, Ireland.
Copyright information
© 2017 Springer International Publishing AG
Cite this paper
Salim, F.A., Haider, F., Conlan, O., Luz, S. (2017). An Alternative Approach to Exploring a Video. In: Karpov, A., Potapova, R., Mporas, I. (eds) Speech and Computer. SPECOM 2017. Lecture Notes in Computer Science(), vol 10458. Springer, Cham. https://doi.org/10.1007/978-3-319-66429-3_10
Print ISBN: 978-3-319-66428-6
Online ISBN: 978-3-319-66429-3