High-Performance Image Annotation and Retrieval for Weakly Labeled Images Using Latent Space Learning

Nakayama, Hideki; Harada, Tatsuya; Kuniyoshi, Yasuo; Otsu, Nobuyuki

doi:10.1007/978-3-540-89796-5_62

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 5353)

Included in the following conference series:

Pacific-Rim Conference on Multimedia

Abstract

Image annotation and retrieval are among the most promising new internet search technologies and have widespread applications. However, the task is very difficult because of the generic nature of the target images. In this paper, we propose a high-speed, high-accuracy image annotation and retrieval method for miscellaneous objects and scenes. The method combines higher-order local auto-correlation (HLAC) features with the probabilistic canonical correlation analysis framework. Latent space learning between images and labels defines a distance between images in the intrinsic feature space, and this distance is used for annotation. HLAC features are additive and position-invariant, which makes them well suited to images in which the positions and number of objects are arbitrary. The proposed method is shown to be faster and more accurate than previously published methods.
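
The additivity and position invariance attributed to HLAC features above can be illustrated with a small sketch. This is not the authors' implementation: it is a simplified version, assuming a 3x3 neighbourhood and only the zeroth- and first-order correlations (5 values), whereas the paper's feature set is not specified in the abstract. The names `hlac_order01`, `place`, `patch_a`, and `patch_b` are hypothetical and introduced only for illustration.

```python
import numpy as np

# Displacements (dy, dx) for the first-order patterns in a 3x3 window.
# Only 4 of the 8 neighbours are needed: the other 4 give the same sums
# up to translation (e.g. "left" duplicates "right").
FIRST_ORDER_OFFSETS = [(0, 1), (1, -1), (1, 0), (1, 1)]


def hlac_order01(img):
    """Zeroth- and first-order auto-correlation sums of a 2-D array.

    Simplified sketch (assumption): orders 0 and 1 only, 3x3 window.
    """
    f = np.asarray(img, dtype=float)
    h, w = f.shape
    # Reference pixels are restricted to the interior so that every
    # displaced pixel stays inside the image.
    centre = f[1:-1, 1:-1]
    feats = [centre.sum()]  # order 0: sum of f(r)
    for dy, dx in FIRST_ORDER_OFFSETS:
        shifted = f[1 + dy:h - 1 + dy, 1 + dx:w - 1 + dx]
        feats.append((centre * shifted).sum())  # order 1: sum of f(r) f(r + a)
    return np.array(feats)


def place(patch, top, left, shape=(16, 16)):
    """Paste a small patch onto an empty canvas at (top, left)."""
    canvas = np.zeros(shape)
    ph, pw = patch.shape
    canvas[top:top + ph, left:left + pw] = patch
    return canvas


# Two toy binary "objects" (made-up data for the demonstration).
patch_a = np.array([[1, 1, 0],
                    [0, 1, 0],
                    [0, 1, 1]])
patch_b = np.array([[1, 0],
                    [1, 1]])

# Position invariance: the same object at two locations (away from the
# border) yields identical features.
a1 = hlac_order01(place(patch_a, 2, 2))
a2 = hlac_order01(place(patch_a, 9, 6))
position_invariant = np.allclose(a1, a2)

# Additivity: two well-separated objects in one image yield the sum of
# their individual feature vectors.
b = hlac_order01(place(patch_b, 10, 10))
both = place(patch_a, 2, 2) + place(patch_b, 10, 10)
additive = np.allclose(hlac_order01(both), a1 + b)

print(position_invariant, additive)
```

Both properties hold exactly here only because the toy objects sit away from the image border and are separated by more than one pixel, so no correlation window straddles two objects. In real images these are approximate properties.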

Author information

Authors and Affiliations

Grad. School of Information Science and Technology, Dept. of Mechano-Informatics, The University of Tokyo, 7-3-1, Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan
Hideki Nakayama, Tatsuya Harada & Yasuo Kuniyoshi
National Institute of Advanced Industrial Science and Technology, 1-1-1, Umezono, Tsukuba-shi, Ibaraki, 305-8568, Japan
Nobuyuki Otsu

Editor information

Editors and Affiliations

Department of Engineering Science, National Cheng Kung University, No.1, University Road, 701, Tainan City, Taiwan
Yueh-Min Ray Huang
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, 95, Zhongguancun East Road, 100190, Beijing, China
Changsheng Xu
Institute of Biomedical Engineering, National Cheng Kung University, No. 1, University Road, 701, Tainan City, Taiwan
Kuo-Sheng Cheng
Department of Electrical Engineering, National Cheng Kung University, No. 1, University Road, 701, Tainan City, Taiwan
Jar-Ferr Kevin Yang
Department of Electrical and Computer Engineering, Concordia University, S-EV005.139, 1515 St. Catherine West, Montreal, H4G 2W1, Quebec, Canada
M. N. S. Swamy
Microsoft Research Asia, 5/F, Beijing Sigma Center, No. 49, Zhichun Road, Hai Dian District, 100080, Beijing, China
Shipeng Li
Department of Information Management, National Kaohsiung University of Applied Sciences, No. 415, Jiangong Road, Sanmin District, 80778, Kaohsiung, Taiwan
Jen-Wen Ding

Cite this paper

Nakayama, H., Harada, T., Kuniyoshi, Y., Otsu, N. (2008). High-Performance Image Annotation and Retrieval for Weakly Labeled Images Using Latent Space Learning. In: Huang, YM.R., et al. Advances in Multimedia Information Processing - PCM 2008. PCM 2008. Lecture Notes in Computer Science, vol 5353. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89796-5_62

DOI: https://doi.org/10.1007/978-3-540-89796-5_62
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89795-8
Online ISBN: 978-3-540-89796-5
eBook Packages: Computer Science, Computer Science (R0)
