Skip to main content
Log in

Non-full multi-layer feature representations for person re-identification

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Person re-identification(Re-ID) has attracted increasing attention in the field of computer vision due to its great significance for the potential real-world applications. Profited from the success of convolutional neural networks(CNNs), existing multi-layer approaches leverage different scales of convolutional layers to learn more discriminative features, improving the Re-ID performance to some extent. However, these methods do not further explore whether all the scales of convolutional layers are positive for person re-identification. In this work, we propose a novel non-full multi-layer(NFML) network, which can jointly learn discriminative feature embeddings from positive multiple layers with the manner of combining global and local cues. Moreover, considering few works focus on how to effectively handle the feature maps, a simple yet effective feature progressing module named Pooling Batch Normalization(PBN), consisting of pooling, reduction and batch normalization operations, is introduced to optimize the model structure and further improve the Re-ID performance. Results on three mainstream benchmark datasets Market-1501, DukeMTMC-reID and CUHK03 demonstrate that our method can significantly boost the performances, outperforming the state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Bromley J, Guyon I, LeCun Y, Säckinger E, Shah R (1994) Signature verification using a siamese time delay neural network. In: Advances in neural information processing systems, pp 737–744

  2. Cai H, Wang Z, Cheng J (2019) Multi-scale body-part mask guided attention for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops

  3. Chang X, Hospedales TM, Xiang T (2018) Multi-level factorisation net for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2109–2118

  4. Chen D, Xu D, Li H, Sebe N, Wang X (2018) Group consistent similarity learning via deep crf for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 8649–8658

  5. Chen W, Chen X, Zhang J, Huang K (2017) Beyond triplet loss: a deep quadruplet network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 403–412

  6. Chen X, Fang H, Lin TY, Vedantam R, Gupta S, Dollár P, Zitnick CL (2015) Microsoft coco captions:, Data collection and evaluation server. arXiv:1504.00325

  7. Chen Y, Zhu X, Gong S (2017) Person re-identification by deep learning multi-scale representations. In: Proceedings of the IEEE international conference on computer vision, pp 2590–2600

  8. Cheng D, Gong Y, Zhou S, Wang J, Zheng N (2016) Person re-identification by multi-channel parts-based cnn with improved triplet loss function. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1335–1344

  9. Dai Z, Chen M, Zhu S, Tan P (2019) Batch dropblock network for person re-identification and beyond. In: Proceedings of the IEEE international conference on computer vision

  10. Deng J, Dong W, Socher R, Li LJ, Li K, Fei-Fei L (2009) Imagenet: a large-scale hierarchical image database. In: 2009 IEEE Conference on computer vision and pattern recognition, pp 248–255

  11. Fan X, Luo H, Zhang X, He L, Zhang C, Jiang W (2018) Scpnet: Spatial-channel parallelism network for joint holistic and partial person re-identification. In: Asian conference on computer vision, pp 19–34

  12. Fu Y, Wei Y, Zhou Y, Shi H, Huang G, Wang X, Yao Z, Huang T (2019) Horizontal pyramid matching for person re-identification. In: Proceedings of the AAAI conference on artificial intelligence, vol 33, pp 8295–8302

  13. Hadsell R, Chopra S, LeCun Y (2006) Dimensionality reduction by learning an invariant mapping. In: 2006 IEEE Computer society conference on computer vision and pattern recognition, vol 2, pp 1735–1742

  14. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778

  15. Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv:1703.07737

  16. Hoffer E, Ailon N (2015) Deep metric learning using triplet network. In: International workshop on similarity-based pattern recognition, pp 84–92

  17. Huang H, Yang W, Chen X, Zhao X, Huang K, Lin J, Huang G, Du D (2018) Eanet:, Enhancing alignment for cross-domain person re-identification. arXiv:1812.11369

  18. Kalayeh MM, Basaran E, Gökmen M, Kamasak ME, Shah M (2018) Human semantic parsing for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1062–1071

  19. Li W, Zhao R, Xiao T, Wang X (2014) Deepreid: Deep filter pairing neural network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 152–159

  20. Li W, Zhu X, Gong S (2018) Harmonious attention network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2285–2294

  21. Lin Y, Zheng L, Zheng Z, Wu Y, Hu Z, Yan C, Yang Y (2017) Improving person re-identification by attribute and identity learning. arXiv:1703.7220

  22. Liu J, Zha ZJ, Tian Q, Liu D, Yao T, Ling Q, Mei T (2016) Multi-scale triplet cnn for person re-identification. In: Proceedings of the 24th ACM international conference on multimedia, pp 192–196

  23. Luo H, Gu Y, Liao X, Lai S, Jiang W (2019) Bag of tricks and a strong baseline for deep person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops

  24. Ristani E, Solera F, Zou R, Cucchiara R, Tomasi C (2016) Performance measures and a data set for multi-target, multi-camera tracking. In: European conference on computer vision, pp 17–35

  25. Saquib Sarfraz M, Schumann A, Eberle A, Stiefelhagen R (2018) A pose-sensitive embedding for person re-identification with expanded cross neighborhood re-ranking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 420–429

  26. Schumann A, Stiefelhagen R (2017) Person re-identification by deep learning attribute-complementary information. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 20–28

  27. Selvaraju RR, Das A, Vedantam R, Cogswell M, Parikh D, Batra D (2017) Grad-cam: Visual explanations from deep networks via gradient-based localization. In: Proceedings of the IEEE international conference on computer vision, pp 618–626

  28. Shen Y, Xiao T, Li H, Yi S, Wang X (2018) End-to-end deep kronecker-product matching for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6886–6895

  29. Suh Y, Wang J, Tang S, Mei T, Mu Lee K (2018) Part-aligned bilinear representations for person re-identification. In: Proceedings of the european conference on computer vision, pp 402–419

  30. Sun Y, Xu Q, Li Y, Zhang C, Li Y, Wang S, Sun J (2019) Perceive where to focus: Learning visibility-aware part-level features for partial person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 393–402

  31. Sun Y, Zheng L, Deng W, Wang S (2017) Svdnet for pedestrian retrieval. In: Proceedings of the IEEE international conference on computer vision, pp 3800–3808

  32. Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2018) Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In: Proceedings of the european conference on computer vision, pp 480–496

  33. Tay CP, Roy S, Yap KH (2019) Aanet: Attribute attention network for person re-identifications. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7134–7143

  34. Wang C, Zhang Q, Huang C, Liu W, Wang X (2018) Mancs: a multi-task attentional network with curriculum sampling for person re-identification. In: Proceedings of the european conference on computer vision, pp 365–381

  35. Wang G, Lai J, Huang P, Xie X (2019) Spatial-temporal person re-identification. In: Proceedings of the AAAI conference on artificial intelligence, pp 8933–8940

  36. Wang G, Yuan Y, Chen X, Li J, Zhou X (2018) Learning discriminative features with multiple granularities for person re-identification. In: 2018 ACM multimedia conference on multimedia conference, pp 274–282

  37. Wang Y, Wang L, You Y, Zou X, Chen V, Li S, Huang G, Hariharan B, Weinberger KQ (2018) Resource aware person re-identification across multiple resolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 8042–8051

  38. Wei L, Zhang S, Yao H, Gao W, Tian Q (2017) Glad: Global-local-alignment descriptor for pedestrian retrieval. In: Proceedings of the 25th ACM international conference on multimedia, pp 420–428

  39. Xiao T, Li H, Ouyang W, Wang X (2016) Learning deep feature representations with domain guided dropout for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1249–1258

  40. Xiong F, Xiao Y, Cao Z, Gong K, Fang Z, Zhou JT (2018) Towards good practices on building effective cnn baseline model for person re-identification. arXiv:1807.11042

  41. Xu J, Zhao R, Zhu F, Wang H, Ouyang W (2018) Attention-aware compositional network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2119–2128

  42. Zhang J, Jiang F (2019) Multi-level supervised network for person re-identification. In: ICASSP 2019-2019 IEEE international conference on acoustics, speech and signal processing, pp 2072–2076

  43. Zhao L, Li X, Zhuang Y, Wang J (2017) Deeply-learned part-aligned representations for person re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 3219–3228

  44. Zheng F, Deng C, Sun X, Jiang X, Guo X, Yu Z, Huang F, Ji R (2019) Pyramidal person re-identification via multi-loss dynamic training. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 8514–8522

  45. Zheng L, Huang Y, Lu H, Yang Y (2019) Pose invariant embedding for deep person re-identification IEEE Transactions on Image Processing

  46. Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: a benchmark. In: Proceedings of the IEEE international conference on computer vision, pp 1116–1124

  47. Zheng L, Yang Y, Hauptmann AG (2016) Person re-identification:, Past, present and future. arXiv:1610.02984

  48. Zheng Z, Zheng L, Yang Y (2017) Unlabeled samples generated by gan improve the person re-identification baseline in vitro. In: Proceedings of the IEEE international conference on computer vision, pp 3754–3762

  49. Zheng Z, Zheng L, Yang Y (2018) A discriminatively learned cnn embedding for person re-identification. ACM Trans Multimed Comput Commun Appl 14(1):13

    Article  MathSciNet  Google Scholar 

  50. Zheng Z, Zheng L, Yang Y (2018) Pedestrian alignment network for large-scale person re-identification IEEE Transactions on Circuits and Systems for Video Technology

  51. Zhong Z, Zheng L, Cao D, Li S (2017) Re-ranking person re-identification with k-reciprocal encoding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp 1318–1327

  52. Zhong Z, Zheng L, Zheng Z, Li S, Yang Y (2018) Camera style adaptation for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5157–5166

  53. Zhou K, Yang Y, Cavallaro A, Xiang T (2019) Omni-scale feature learning for person re-identification. In: Proceedings of the IEEE international conference on computer vision

Download references

Funding

This work was supported in part by the Chinese Natural Science Foundation (CNSF) (under Grant 61472278, Grant 61702165). This work was supported in part by the Major Project of Tianjin (under Grant 18ZXZNGX00150). This work was supported in part by the Hebei Provincial Natural Science Foundation, China (under Grant No. F2020111001). This work was supported in part by the Foundation for Talents Program Fostering of Hebei Province (No.A201803025).

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Jianguang Zhang or Xianbin Wen.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, J., Zhang, J. & Wen, X. Non-full multi-layer feature representations for person re-identification. Multimed Tools Appl 80, 17205–17221 (2021). https://doi.org/10.1007/s11042-020-09410-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-020-09410-7

Keywords

Navigation