Combining Visual and Textual Modalities for Multimedia Ontology Matching

  • Conference paper
Semantic Multimedia (SAMT 2010)

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6725)

Abstract

Multimedia search and retrieval improve considerably when ontologies are used to give explicit meaning to visual content. Several multimedia ontologies have recently been proposed as knowledge models suited to narrowing the well-known semantic gap and enabling the semantic interpretation of images. Because these ontologies were created in different application contexts, establishing links between them, a task known as ontology matching, promises to unlock their full potential in support of multimedia search and retrieval. This paper proposes and empirically compares two extensional ontology matching techniques, applied to an important issue in semantic image retrieval: automatically associating common-sense knowledge with multimedia concepts. First, we extend a previously introduced matching approach to use both textual and visual knowledge. Second, we propose a novel matching technique based on a multimodal graph. We argue that the textual and visual modalities should be seen as complementary rather than mutually exclusive means of improving the efficiency of ontology matching in the multimedia domain. An experimental evaluation is included.

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

James, N., Todorov, K., Hudelot, C. (2011). Combining Visual and Textual Modalities for Multimedia Ontology Matching. In: Declerck, T., Granitzer, M., Grzegorzek, M., Romanelli, M., Rüger, S., Sintek, M. (eds) Semantic Multimedia. SAMT 2010. Lecture Notes in Computer Science, vol 6725. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23017-2_7

  • DOI: https://doi.org/10.1007/978-3-642-23017-2_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-23016-5

  • Online ISBN: 978-3-642-23017-2

  • eBook Packages: Computer Science, Computer Science (R0)
