Bionic Vision Descriptor for Image Retrieval

Li, Guangzhe; Liu, Shenglan; Wang, Feilong; Feng, Lin

doi:10.1007/978-3-030-63830-6_14

Guangzhe Li¹⁴,
Shenglan Liu¹⁴,
Feilong Wang¹⁴ &
…
Lin Feng¹⁴

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12532))

Included in the following conference series:

International Conference on Neural Information Processing

2254 Accesses

Abstract

Human visual system gets remarkable performance by processing low-level features. In the last decade, many descriptors have been proposed for feature extraction. However, fewer of them get satisfying performance with low-level features. Compared to high-level ones, low-level features make use of natural underlying elements like texture and they are extracted directly, which makes low-level features more efficient in image retrieval domains. In this paper, a new descriptor named Bionic Vision Descriptor (BVD), which is based on the principle of human visual system, is proposed. The descriptor fuses uniform low-level features extracted from color, texture and gradient elements. Moreover, matrix calculation and feature selection are utilized to accelerate the calculation of BVD. Experimental results show that our method outperforms other state-of-the-art traditional descriptors with less runtime and fewer initial dimensions on benchmark datasets.

This study was funded by National Natural Science Foundation of Peoples Republic of China(61672130, 61972064), The Fundamental Research Funds for the Central Universities(DUT19RC(3)012, DUT20RC(5)010) and LiaoNing Revitalization Talents Program(XLYC1806006).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Babenko, A., Slesarev, A., Chigorin, A., Lempitsky, V.: Neural codes for image retrieval. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8689, pp. 584–599. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10590-1_38
Chapter Google Scholar
Bay, H., Tuytelaars, T., Van Gool, L.: SURF: speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006). https://doi.org/10.1007/11744023_32
Chapter Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 886–893. IEEE (2005)
Google Scholar
Di Zenzo, S.: A note on the gradient of a multi-image. Comput. Vis. Graph. Image Process. 33(1), 116–125 (1986)
Article Google Scholar
Ferman, A.M., Tekalp, A.M., Mehrotra, R.: Robust color histogram descriptors for video segment retrieval and identification. IEEE Trans. Image Process. 11(5), 497–508 (2002)
Article Google Scholar
Ge, T., Ke, Q., Sun, J.: Sparse-coded features for image retrieval. In: BMVC, pp. 132.1–132.11 (2013)
Google Scholar
Gordoa, A., Rodríguez-Serrano, J.A., Perronnin, F., Valveny, E.: Leveraging category-level labels for instance-level image retrieval. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp. 3045–3052. IEEE (2012)
Google Scholar
He, X., Cai, D., Niyogi, P.: Laplacian score for feature selection. In: Advances in Neural Information Processing Systems, pp. 507–514 (2006)
Google Scholar
Jégou, H., Douze, M., Schmid, C.: Improving bag-of-features for large scale image search. Int. J. Comput. Vis. 87(3), 316–336 (2010)
Article Google Scholar
Jégou, H., Douze, M., Schmid, C., Pérez, P.: Aggregating local descriptors into a compact image representation. In: CVPR 2010–23rd IEEE Conference on Computer Vision and Pattern Recognition, pp. 3304–3311. IEEE Computer Society (2010)
Google Scholar
Jolliffe, I.: Principal component analysis. Springer (2011). https://doi.org/10.1007/b98835
Koffka, K.: Principles of Gestalt Psychology. Routledge, London (2013)
Book Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Google Scholar
Lewis, D.E., Pearson, J., Khuu, S.K.: The color “fruit”: object memories defined by color. PloS One 8(5), e64960 (2013)
Article Google Scholar
Liu, C., Yuen, J., Torralba, A.: Sift flow: Dense correspondence across scenes and its applications. IEEE Trans. Pattern Anal. Mach. Intell. 33(5), 978–994 (2010)
Article Google Scholar
Liu, G.H., Li, Z.Y., Zhang, L., Xu, Y.: Image retrieval based on micro-structure descriptor. Pattern Recogn. 44(9), 2123–2133 (2011)
Article Google Scholar
Liu, G.H., Yang, J.Y.: Content-based image retrieval using color difference histogram. Pattern Recogn. 46(1), 188–198 (2013)
Article Google Scholar
Liu, H., Zhao, Q., Zhang, C., Mbelwa, J.T., Tang, S., Zhang, J.: Boosting vlad with weighted fusion of local descriptors for image retrieval. Multimedia Tools Appl. 78(9), 11835–11855 (2019)
Article Google Scholar
Liu, S., et al.: Color recognition for rubik’s cube robot. arXiv preprint arXiv:1901.03470 (2019)
Liu, S., et al.: Perceptual uniform descriptor and ranking on manifold for image retrieval. Inf. Sci. 424, 235–249 (2018)
Article MathSciNet Google Scholar
Liu, Y., Zhang, D., Lu, G., Ma, W.Y.: A survey of content-based image retrieval with high-level semantics. Pattern Recogn. 40(1), 262–282 (2007)
Article Google Scholar
Liu, Z., Li, H., Zhou, W., Rui, T., Tian, Q.: Making residual vector distribution uniform for distinctive image representation. IEEE Trans. Circuits Syst. Video Technol. 26(2), 375–384 (2015)
Article Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Article Google Scholar
Maaten, L.V.D., Hinton, G.: Visualizing data using t-sne. J. Mach. Learn. Res. 9, 2579–2605 (2008)
MATH Google Scholar
Ojala, T., Pietikäinen, M., Mäenpää, T.: Gray scale and rotation invariant texture classification with local binary patterns. In: Vernon, D. (ed.) ECCV 2000. LNCS, vol. 1842, pp. 404–420. Springer, Heidelberg (2000). https://doi.org/10.1007/3-540-45054-8_27
Chapter Google Scholar
Rocha, A., Goldenstein, S.K.: Multiclass from binary: expanding one-versus-all, one-versus-one and ecoc-based approaches. IEEE Trans. Neural Networks Learn. Syst. 25(2), 289–302 (2013)
Article Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Sivic, J., Zisserman, A.: Video google: A text retrieval approach to object matching in videos. In: IEEE International Conference on Computer Vision, pp. 1470–1477. IEEE (2003)
Google Scholar
Ullman, S., Assif, L., Fetaya, E., Harari, D.: Atoms of recognition in human and computer vision. Proc. Nat. Acad. Sci. 113(10), 2744–2749 (2016)
Article Google Scholar
Wengert, C., Douze, M., Jégou, H.: Bag-of-colors for improved image search. In: Proceedings of the 19th ACM international conference on Multimedia, pp. 1437–1440. ACM (2011)
Google Scholar
Zheng, L., Wang, S., Liu, Z., Tian, Q.: Packing and padding: Coupled multi-index for accurate image retrieval. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1939–1946 (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian, Liaoning, China
Guangzhe Li, Shenglan Liu, Feilong Wang & Lin Feng

Authors

Guangzhe Li
View author publications
You can also search for this author in PubMed Google Scholar
Shenglan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Feilong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Lin Feng
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shenglan Liu .

Editor information

Editors and Affiliations

Department of AI, Ping An Life, Shenzhen, China
Haiqin Yang
Faculty of Information Technology, King Mongkut's Institute of Technology Ladkrabang, Bangkok, Thailand
Kitsuchart Pasupa
City University of Hong Kong, Kowloon, Hong Kong
Andrew Chi-Sing Leung
Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong, Hong Kong
James T. Kwok
School of Information Technology, King Mongkut’s University of Technology Thonburi, Bangkok, Thailand
Jonathan H. Chan
The Chinese University of Hong Kong, New Territories, Hong Kong
Irwin King

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, G., Liu, S., Wang, F., Feng, L. (2020). Bionic Vision Descriptor for Image Retrieval. In: Yang, H., Pasupa, K., Leung, A.CS., Kwok, J.T., Chan, J.H., King, I. (eds) Neural Information Processing. ICONIP 2020. Lecture Notes in Computer Science(), vol 12532. Springer, Cham. https://doi.org/10.1007/978-3-030-63830-6_14

Download citation

DOI: https://doi.org/10.1007/978-3-030-63830-6_14
Published: 19 November 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-63829-0
Online ISBN: 978-3-030-63830-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics