Abstract
Most of the Nearest Neighbor (NN) based image annotation (or classification) methods cannot achieve satisfactory performance. In this paper, we propose a novel Nearest Neighbor method based on a multi-feature distance metric, which takes full advantage of different and complementary features. We first establish a metric for each feature and assign a weight for every metric, and then linearly combine all of them together to form one distance metric, namely the multi-feature metric. After that, we construct an NN model based on “image-to-cluster” distances, which equals to the distances between an image and the clusters within an image category using our multi-feature based metric, and which is different from calculating Euclidean distances between two images. By introducing this multi-feature based distance metric, our NN based model can mitigate the semantic issues due to intra-class variations and inter-class similarities, and improve the image annotation performance. Experiments confirm the superiority of our model in comparison with both the traditional classifiers and the state of the art learning-based models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Thomee, B., Popescu, A.: Overview of the ImageCLEF2012 flickr photo annotation and retrieval task. In: CLEF 2012 working notes, Rome, Italy (2012)
Yang, J., Yu, K., Gong, Y.: Linear spatial pyramid matching using sparse coding for image classification. In: Proceedings of CVPR, pp. 1794–1801. IEEE, Anchorage (2009)
Li, L.J., Su, H., Xing, E.P., et al.: Object bank: a high-level image representation for scene classification and semantic feature sparsification. Int. J. Comput. Vis. 107(1), 20–39 (2014)
Wang, X., Du, J., Wu, S., et al.: High-level semantic image annotation based on hot Internet topics. Multimedia Tools Appl. 74(6), 2055–2084 (2015)
Moran, S, Lavrenko, V.: Sparse kernel learning for image annotation. In: Proceedings of International Conference on Multimedia Retrieval. ACM, Glasgow (2014)
Boiman, O., Shechtman, E., Irani, M.: In defense of nearest-neighbor based image classification. In: Proceedings of CVPR, pp. 1–8. IEEE, Anchorage (2008)
Wang, S., Jiang, S., Huang, Q., Tian, Q.: Multi-feature metric learning with knowledge transfer among semantics and social tagging. In: Proceedings of CVPR, pp. 2240–2247. IEEE, Rhodes Island (2012)
Verma, Y., Jawahar, C.V.: Image annotation using metric learning in semantic neighbourhoods. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part III. LNCS, vol. 7574, pp. 836–849. Springer, Heidelberg (2012)
Makadia, A., Pavlovic, V., Kumar, S.: A new baseline for image annotation. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part III. LNCS, vol. 5304, pp. 316–329. Springer, Heidelberg (2008)
Gehler, P., Nowozin, S.: On feature combination for multiclass object classification. In: Proceedings of CVPR, pp. 221–228. IEEE, Anchorage (2009)
Jia, Y, Huang, C., Darrell, T.: Beyond spatial pyramids: receptive field learning for pooled image features. In: Proceedings of CVPR, pp. 3370–3377. IEEE, Rhodes Island (2012)
Zhang, L., Zhou, W.D.: Sparse ensembles using weighted combination methods based on linear programming. Pattern Recogn. 44(1), 97–106 (2011)
Wu, J., Rehg, J.M.: Beyond the euclidean distance: creating effective visual codebooks using the histogram intersection kernel. In: Proceedings of ICCV, pp. 630–637. IEEE, Kyoto (2009)
Zeng, Z., et al.: A survey of affect recognition methods: audio, visual and spontaneous expressions. IEEE Trans. Pattern Anal. Mach. Intell. 31(1), 39–58 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Wu, W., Gao, G. (2015). Nearest Neighbor with Multi-feature Metric for Image Annotation. In: Arik, S., Huang, T., Lai, W., Liu, Q. (eds) Neural Information Processing. ICONIP 2015. Lecture Notes in Computer Science(), vol 9492. Springer, Cham. https://doi.org/10.1007/978-3-319-26561-2_57
Download citation
DOI: https://doi.org/10.1007/978-3-319-26561-2_57
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26560-5
Online ISBN: 978-3-319-26561-2
eBook Packages: Computer ScienceComputer Science (R0)