Using Structure for Video Object Retrieval

Hohl, Lukas; Souvannavong, Fabrice; Merialdo, Bernard; Huet, Benoit

doi:10.1007/978-3-540-27814-6_66

Lukas Hohl²⁰,
Fabrice Souvannavong²⁰,
Bernard Merialdo²⁰ &
…
Benoit Huet²⁰

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3115))

Included in the following conference series:

International Conference on Image and Video Retrieval

Abstract

The work presented in this paper aims at reducing the semantic gap between low level video features and semantic video objects. The proposed method for finding associations between segmented frame region characteristics relies on the strength of Latent Semantic Analysis (LSA). Our previous experiments [1], using color histograms and Gabor features, have rapidly shown the potential of this approach but also uncovered some of its limitation. The use of structural information is necessary, yet rarely employed for such a task. In this paper we address two important issues. The first is to verify that using structural information does indeed improve performance, while the second concerns the manner in which this additional information is integrated within the framework. Here, we propose two methods using the structural information. The first adds structural constraints indirectly to the LSA during the preprocessing of the video, while the other includes the structure directly within the LSA. Moreover, we will demonstrate that when the structure is added directly to the LSA the performance gain of combining visual (low level) and structural information is convincing.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Souvannavong, F., Merialdo, B., Huet, B.: Video content modeling with latent semantic analysis. In: Third InternationalWorkshop on Content-Based Multimedia Indexing (2003)
Google Scholar
TREC Video Retrieval Workshop (TRECVID), http://www-nlpir.nist.gov/projects/trecvid/
Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by latent semantic analysis. American Soc. of Information Science Journal 41, 391–407 (1990)
Article Google Scholar
Zhao, R., Grosky, W.I.: Video Shot Detection Using Color Anglogram and Latent Semantic Indexing: From Contents to Semantics. CRC Press, Boca Raton (2003)
Google Scholar
Monay, F., Gatica-Perez, D.: On image auto-annotation with latent space models. In: ACM Int. Conf. on Multimedia (2003)
Google Scholar
Wiemer-Hastings, P.: Adding syntactic information to lsa. In: Proceedings of the Twentysecond Annual Conference of the Cognitive Science Society, pp. 989–993 (2000)
Google Scholar
Landauer, T., Laham, D., Rehder, B., Schreiner, M.: How well can passage meaning be derived without using word order. Cognitive Science Society, 412–417 (1997)
Google Scholar
Swain, M., Ballard, D.: Indexing via colour histograms. In: ICCV, pp. 390–393 (1990)
Google Scholar
Flickner, M., Sawhney, H., et al.: Query by image and video content: the qbic system. IEEE Computer 28, 23–32 (1995)
Google Scholar
Pentland, A., Picard, R., Sclaroff, S.: Photobook: Content-based manipulation of image databases. International Journal of Computer Vision 18, 233–254 (1996)
Article Google Scholar
Gimelfarb, G., Jain, A.: On retrieving textured images from an image database. Pattern Recognition 29, 1461–1483 (1996)
Article Google Scholar
Shearer, K., Venkatesh, S., Bunke, H.: An efficient least common subgraph algorithm for video indexing. In: International Conference on Pattern Recognition, vol. 2, pp. 1241–1243 (1998)
Google Scholar
Huet, B., Hancock, E.: Line pattern retrieval using relational histograms. IEEE Transactions on Pattern Analysis and Machine Intelligence 21, 1363–1370 (1999)
Article Google Scholar
Sengupta, K., Boyer, K.: Organizing large structural modelbases. IEEE Transactions on Pattern Analysis and Machine Intelligence (1995)
Google Scholar
Messmer, B., Bunke, H.: A new algorithm for error-tolerant subgraph isomorphism detection. IEEE Transactions on Pattern Analysis and Machine Intelligence (1998)
Google Scholar
Felzenszwalb, P., Huttenlocher, D.: Efficiently computing a good segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 98–104 (1998)
Google Scholar
Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice Hall, Englewood Cliffs (1988)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Multimedia Department, Institute Eurecom, 2229 routes des Cretes, 06904, Sophia-Antipolis, France
Lukas Hohl, Fabrice Souvannavong, Bernard Merialdo & Benoit Huet

Authors

Lukas Hohl
View author publications
You can also search for this author in PubMed Google Scholar
Fabrice Souvannavong
View author publications
You can also search for this author in PubMed Google Scholar
Bernard Merialdo
View author publications
You can also search for this author in PubMed Google Scholar
Benoit Huet
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computing, Mathematical and Information Sciences, University of Brighton, UK
Peter Enser
Informatics and Telematics Institute, Centre for Research and Technology-Hellas, 57001, Thessaloniki, Greece
Yiannis Kompatsiaris
Centre for Digital Video Processing, Adaptive Information Cluster, Dublin City University, Ireland
Noel E. O’Connor
Dublin City University, Dublin, Ireland
Alan F. Smeaton
ISLA lab, Informatics Institute, University of Amsterdam, The Netherlands
Arnold W. M. Smeulders

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hohl, L., Souvannavong, F., Merialdo, B., Huet, B. (2004). Using Structure for Video Object Retrieval. In: Enser, P., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds) Image and Video Retrieval. CIVR 2004. Lecture Notes in Computer Science, vol 3115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27814-6_66

Download citation

DOI: https://doi.org/10.1007/978-3-540-27814-6_66
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22539-3
Online ISBN: 978-3-540-27814-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics