Image Annotations Based on Semi-supervised Clustering with Semantic Soft Constraints

Xiaoguang, Rui; Pingbo, Yuan; Nenghai, Yu

doi:10.1007/11922162_72

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4261)

Included in the following conference series:

Pacific-Rim Conference on Multimedia

Abstract

An efficient image annotation and retrieval system is increasingly needed as the volume of image data grows. Clustering algorithms make it possible to represent images with a finite set of symbols. Building on this, many statistical models that analyze the correspondence between visual features and words have been published for image annotation. However, most of these models cluster using visual features only and ignore the semantics of the images. In this paper, we propose a novel model based on semi-supervised clustering with semantic soft constraints, which uses both visual features and semantic meaning. Our method first uses generic knowledge (e.g., WordNet) to measure the semantic distance between regions of the manually annotated training images. A semi-supervised clustering algorithm then clusters the regions under soft constraints derived from these semantic distances. Experimental results show that our model improves the performance of image annotation and retrieval.
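The two steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it is a generic penalized k-means in which violating a soft constraint adds a cost instead of being forbidden. Everything named here is an assumption made for illustration: the functions `constraint_weight` and `soft_constrained_kmeans`, the thresholds `near` and `far`, the `penalty` factor, and the hand-picked semantic distances, which stand in for values that would come from a WordNet-based measure.

```python
def constraint_weight(sem_dist, near=0.3, far=0.7):
    """Map a semantic distance in [0, 1] to a soft constraint weight.

    Hypothetical mapping: small distances give a positive weight
    (prefer the same cluster), large distances give a negative weight
    (prefer different clusters), and the middle range gives no constraint.
    """
    if sem_dist <= near:
        return 1.0 - sem_dist / near
    if sem_dist >= far:
        return -(sem_dist - far) / (1.0 - far)
    return 0.0


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def farthest_first_centroids(points, k):
    """Deterministic initialization: start at points[0], then repeatedly
    add the point farthest from the centroids chosen so far."""
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(squared_distance(p, c) for c in centroids))
        )
    return centroids


def soft_constrained_kmeans(points, weights, k, penalty=1.0, max_iters=50):
    """k-means whose assignment cost adds a penalty for each violated
    soft constraint. `weights` maps index pairs (i, j) to a signed weight."""
    neighbors = {i: [] for i in range(len(points))}
    for (i, j), w in weights.items():
        if w != 0.0:
            neighbors[i].append((j, w))
            neighbors[j].append((i, w))

    centroids = farthest_first_centroids(points, k)
    # Initial labels use visual (feature) distance only.
    labels = [
        min(range(k), key=lambda c: squared_distance(p, centroids[c])) for p in points
    ]

    for _ in range(max_iters):
        # Update step: recompute each non-empty cluster's centroid.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(sum(col) / len(members) for col in zip(*members))

        # Assignment step: reassign points one at a time, given the
        # current labels of their constrained neighbors.
        changed = False
        for i, p in enumerate(points):
            def cost(c):
                violation = 0.0
                for j, w in neighbors[i]:
                    if w > 0 and labels[j] != c:
                        violation += w       # semantically close, but split
                    elif w < 0 and labels[j] == c:
                        violation += -w      # semantically far, but merged
                return squared_distance(p, centroids[c]) + penalty * violation

            best = min(range(k), key=cost)
            if best != labels[i]:
                labels[i] = best
                changed = True
        if not changed:
            break
    return labels


# Toy region features: region 4 looks like regions 0 and 1 ...
points = [(0.0, 0.0), (0.2, 0.0), (4.0, 0.0), (4.2, 0.0), (1.8, 0.0)]
# ... but its (made-up) annotation is semantically identical to region 2's.
semantic_distance = {(2, 4): 0.0, (1, 4): 0.5}
weights = {pair: constraint_weight(d) for pair, d in semantic_distance.items()}

plain = soft_constrained_kmeans(points, {}, k=2)
soft = soft_constrained_kmeans(points, weights, k=2, penalty=5.0)
print(plain)  # [0, 0, 1, 1, 0]
print(soft)   # [0, 0, 1, 1, 1]
```

Without constraints, region 4 is grouped with the visually similar regions 0 and 1. With the soft constraint, it moves to the cluster of region 2. Because the constraint is soft, a smaller `penalty` (for example 1.0) leaves region 4 in its visual cluster.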

Author information

Authors and Affiliations

MOE-Microsoft Key Laboratory of Multimedia Computing and Communication, University of Science and Technology of China, Hefei, Anhui, China
Rui Xiaoguang, Yuan Pingbo & Yu Nenghai

Editor information

Editors and Affiliations

College of Computer Science, Zhejiang University, China
Yueting Zhuang
Department of Computer Science and Technology, Tsinghua University, P.R. China
Shi-Qiang Yang
Microsoft Corporation, Microsoft China R&D Group, 49 Zhichun Road, 100080, Beijing, China
Yong Rui
College of Computer Science and Technology, Zhejiang University, 310027, Hangzhou, Zhejiang Province, China
Qinming He

About this paper

Cite this paper

Xiaoguang, R., Pingbo, Y., Nenghai, Y. (2006). Image Annotations Based on Semi-supervised Clustering with Semantic Soft Constraints. In: Zhuang, Y., Yang, SQ., Rui, Y., He, Q. (eds) Advances in Multimedia Information Processing - PCM 2006. PCM 2006. Lecture Notes in Computer Science, vol 4261. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11922162_72

DOI: https://doi.org/10.1007/11922162_72
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-48766-1
Online ISBN: 978-3-540-48769-2
eBook Packages: Computer Science, Computer Science (R0)
