Skip to main content

Nearest Neighbor with Multi-feature Metric for Image Annotation

  • Conference paper
  • First Online:
Neural Information Processing (ICONIP 2015)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9492))

Included in the following conference series:

Abstract

Most of the Nearest Neighbor (NN) based image annotation (or classification) methods cannot achieve satisfactory performance. In this paper, we propose a novel Nearest Neighbor method based on a multi-feature distance metric, which takes full advantage of different and complementary features. We first establish a metric for each feature and assign a weight for every metric, and then linearly combine all of them together to form one distance metric, namely the multi-feature metric. After that, we construct an NN model based on “image-to-cluster” distances, which equals to the distances between an image and the clusters within an image category using our multi-feature based metric, and which is different from calculating Euclidean distances between two images. By introducing this multi-feature based distance metric, our NN based model can mitigate the semantic issues due to intra-class variations and inter-class similarities, and improve the image annotation performance. Experiments confirm the superiority of our model in comparison with both the traditional classifiers and the state of the art learning-based models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Thomee, B., Popescu, A.: Overview of the ImageCLEF2012 flickr photo annotation and retrieval task. In: CLEF 2012 working notes, Rome, Italy (2012)

    Google Scholar 

  2. Yang, J., Yu, K., Gong, Y.: Linear spatial pyramid matching using sparse coding for image classification. In: Proceedings of CVPR, pp. 1794–1801. IEEE, Anchorage (2009)

    Google Scholar 

  3. Li, L.J., Su, H., Xing, E.P., et al.: Object bank: a high-level image representation for scene classification and semantic feature sparsification. Int. J. Comput. Vis. 107(1), 20–39 (2014)

    Article  Google Scholar 

  4. Wang, X., Du, J., Wu, S., et al.: High-level semantic image annotation based on hot Internet topics. Multimedia Tools Appl. 74(6), 2055–2084 (2015)

    Article  Google Scholar 

  5. Moran, S, Lavrenko, V.: Sparse kernel learning for image annotation. In: Proceedings of International Conference on Multimedia Retrieval. ACM, Glasgow (2014)

    Google Scholar 

  6. Boiman, O., Shechtman, E., Irani, M.: In defense of nearest-neighbor based image classification. In: Proceedings of CVPR, pp. 1–8. IEEE, Anchorage (2008)

    Google Scholar 

  7. Wang, S., Jiang, S., Huang, Q., Tian, Q.: Multi-feature metric learning with knowledge transfer among semantics and social tagging. In: Proceedings of CVPR, pp. 2240–2247. IEEE, Rhodes Island (2012)

    Google Scholar 

  8. Verma, Y., Jawahar, C.V.: Image annotation using metric learning in semantic neighbourhoods. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part III. LNCS, vol. 7574, pp. 836–849. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  9. Makadia, A., Pavlovic, V., Kumar, S.: A new baseline for image annotation. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part III. LNCS, vol. 5304, pp. 316–329. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  10. Gehler, P., Nowozin, S.: On feature combination for multiclass object classification. In: Proceedings of CVPR, pp. 221–228. IEEE, Anchorage (2009)

    Google Scholar 

  11. Jia, Y, Huang, C., Darrell, T.: Beyond spatial pyramids: receptive field learning for pooled image features. In: Proceedings of CVPR, pp. 3370–3377. IEEE, Rhodes Island (2012)

    Google Scholar 

  12. Zhang, L., Zhou, W.D.: Sparse ensembles using weighted combination methods based on linear programming. Pattern Recogn. 44(1), 97–106 (2011)

    Article  MATH  Google Scholar 

  13. Wu, J., Rehg, J.M.: Beyond the euclidean distance: creating effective visual codebooks using the histogram intersection kernel. In: Proceedings of ICCV, pp. 630–637. IEEE, Kyoto (2009)

    Google Scholar 

  14. Zeng, Z., et al.: A survey of affect recognition methods: audio, visual and spontaneous expressions. IEEE Trans. Pattern Anal. Mach. Intell. 31(1), 39–58 (2009)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Wei Wu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Wu, W., Gao, G. (2015). Nearest Neighbor with Multi-feature Metric for Image Annotation. In: Arik, S., Huang, T., Lai, W., Liu, Q. (eds) Neural Information Processing. ICONIP 2015. Lecture Notes in Computer Science(), vol 9492. Springer, Cham. https://doi.org/10.1007/978-3-319-26561-2_57

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-26561-2_57

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-26560-5

  • Online ISBN: 978-3-319-26561-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics