Nearest Neighbor with Multi-feature Metric for Image Annotation

Wu, Wei; Gao, Guanglai

doi:10.1007/978-3-319-26561-2_57

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 9492)

Included in the following conference series:

International Conference on Neural Information Processing

Abstract

Most Nearest Neighbor (NN) based image annotation (or classification) methods do not achieve satisfactory performance. In this paper, we propose a novel NN method based on a multi-feature distance metric that exploits different, complementary features. We first establish a metric for each feature and assign a weight to each metric, and then linearly combine them into a single distance metric, the multi-feature metric. We then construct an NN model based on “image-to-cluster” distances, that is, the distances between an image and the clusters within an image category, measured with our multi-feature metric rather than with the Euclidean distance between two images. With this multi-feature distance metric, our NN-based model mitigates the semantic issues caused by intra-class variations and inter-class similarities and improves image annotation performance. Experiments confirm that our model outperforms both traditional classifiers and state-of-the-art learning-based models.
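The pipeline described in the abstract (per-feature metrics, a weighted linear combination, and image-to-cluster nearest-neighbor assignment) can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not specify the per-feature metrics, how the weights are obtained, how clusters are built, or how distances to several clusters in a category are aggregated. The Euclidean and L1 metrics, the fixed weights, the precomputed cluster centers, the minimum over clusters, and all names below are assumptions made for illustration only.

```python
import math

def l2(a, b):
    """Euclidean distance between two equal-length vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def l1(a, b):
    """City-block distance between two equal-length vectors (assumed metric)."""
    return sum(abs(x - y) for x, y in zip(a, b))

def multi_feature_distance(x, y, metrics, weights):
    """Weighted linear combination of per-feature distances.

    x and y map a feature name to that feature's vector.
    """
    return sum(weights[name] * metrics[name](x[name], y[name]) for name in metrics)

def image_to_category_distance(image, clusters, metrics, weights):
    """Distance from an image to a category, taken here as the distance to the
    closest cluster center of that category (an assumed aggregation rule)."""
    return min(multi_feature_distance(image, c, metrics, weights) for c in clusters)

def annotate(image, category_clusters, metrics, weights):
    """Return the label of the category with the smallest image-to-cluster distance."""
    return min(
        category_clusters,
        key=lambda label: image_to_category_distance(
            image, category_clusters[label], metrics, weights
        ),
    )

# Hypothetical toy data: two features per image, hand-picked weights and centers.
metrics = {"color": l2, "texture": l1}
weights = {"color": 0.7, "texture": 0.3}
category_clusters = {
    "beach": [
        {"color": [0.9, 0.8], "texture": [0.1, 0.2]},
        {"color": [0.8, 0.9], "texture": [0.2, 0.1]},
    ],
    "forest": [
        {"color": [0.1, 0.6], "texture": [0.7, 0.8]},
    ],
}
query = {"color": [0.85, 0.8], "texture": [0.1, 0.25]}

print(annotate(query, category_clusters, metrics, weights))  # prints "beach"
```

In this toy setup the query lies close to the first "beach" center under both features, so its combined distance to that category is far smaller than its distance to "forest". In practice the weights would presumably be learned or tuned rather than fixed by hand.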

Author information

Authors and Affiliations

Computer Science Department, Inner Mongolia University, Hohhot, China
Wei Wu & Guanglai Gao

Corresponding author

Correspondence to Wei Wu.

Editor information

Editors and Affiliations

University of Istanbul, Istanbul, Turkey
Sabri Arik
University at Qatar, Doha, Qatar
Tingwen Huang
Tunku Abdul Rahman University College, Kuala Lumpur, Malaysia
Weng Kin Lai
University of Science Technology, Wuhan, China
Qingshan Liu

About this paper

Cite this paper

Wu, W., Gao, G. (2015). Nearest Neighbor with Multi-feature Metric for Image Annotation. In: Arik, S., Huang, T., Lai, W., Liu, Q. (eds) Neural Information Processing. ICONIP 2015. Lecture Notes in Computer Science, vol 9492. Springer, Cham. https://doi.org/10.1007/978-3-319-26561-2_57

DOI: https://doi.org/10.1007/978-3-319-26561-2_57
Published: 18 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26560-5
Online ISBN: 978-3-319-26561-2
eBook Packages: Computer Science, Computer Science (R0)
