Movatterモバイル変換


[0]ホーム

URL:


Skip to main content

Advertisement

Springer Nature Link
Log in

Short range correlation transformer for occluded person re-identification

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Occluded person re-identification is one of the challenging areas of computer vision, which faces problems such as inefficient feature representation and low recognition accuracy. Recently, vision transformer is introduced into the field of re-identification and achieved state-of-the-art results by constructing global feature relationships between patch sequences. However, vision transformer is not good at capturing short-range correlations of patch sequence and exploiting spatial correlation in patch sequence, which leads to a decrease in the accuracy and robustness of the network in the face of occluded person re-identification. Therefore, to address the above problems, we design a partial feature transformer-based occluded person re-identification framework named PFT. The proposed PFT utilizes three modules to enhance the efficiency of vision transformer. (1) Patch full dimension enhancement module. We design a learnable tensor with the same size as patch sequences, which is full-dimensional and deeply embedded in patch sequences to enrich the diversity of training samples. (2) Fusion and reconstruction module. We extract the less important part of obtained patch sequences, and fuse them with original patch sequence to reconstruct the original patch sequences. (3) Spatial Slicing Module. We slice and group patch sequences from spatial direction, which can effectively improve the short-range correlation of patch sequences. Experimental results over occluded and holistic re-identification datasets demonstrate that the proposed PFT network achieves superior performance consistently and outperforms the state-of-the-art methods.

This is a preview of subscription content,log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  1. Lakhan A, Mohammed MA, Kadry S, Abdulkareem KH, Al-Dhief FT, Hsu C-H (2021) Federated learning enables intelligent reflecting surface in fog-cloud enabled cellular network. PeerJ Comput Sci 7:e758

    Article  Google Scholar 

  2. Awan MJ, Masood OA, Mohammed MA, Yasin A, Zain AM, Damaševičius R, Abdulkareem KH (2021) Image-based malware classification using vgg19 network and spatial convolutional attention. Electronics 10(19):2444

    Article  Google Scholar 

  3. Poongodi M, Malviya M, Hamdi M, Vijayakumar V, Mohammed MA, Rauf HT, Al-Dhlan KA (2022) 5G based Blockchain network for authentic and ethical keyword search engine. IET Commun 16(5):442–448

    Article  Google Scholar 

  4. Mohammed MA, Ibrahim DA, Salman AO (2021) Adaptive intelligent learning approach based on visual anti-spam email model for multi-natural language. J Intell Syst 30(1):774–792

    Article  Google Scholar 

  5. Mujahid A, Awan MJ, Yasin A, Mohammed MA, Damaševičius R, Maskeliūnas R, Abdulkareem KH (2021) Real-time hand gesture recognition based on deep learning yolov3 model. Appl Sci 11(9):4164

    Article  Google Scholar 

  6. Zheng L, Yang Y, Hauptmann AG (2016) Person re-identification: past, present and future. arXiv preprintarXiv:1610.02984

  7. Wang Z, Jiang J, Wu Y, Ye M, Bai X, Satoh S (2020) Learning sparse and identity-preserved hidden attributes for person re-identification. IEEE Trans Image Process 29:2013–2025

    Article  Google Scholar 

  8. Liao S, Yang H, Zhu X, Li SZ (2015) Person re-identification by local maximal occurrence representation and metric learning. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR)

  9. Liao S, Li SZ (2015) Efficient PSD constrained asymmetric metric learning for person re-identification. In: 2015 IEEE international conference on computer vision (ICCV)

  10. Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv preprintarXiv:1703.07737

  11. Wang G, Lai JH, Liang W, Wang G (2020) Smoothing adversarial domain attack and p-memory reconsolidation for cross-domain person re-identification. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)

  12. Luo W, Li Y, Urtasun R, Zemel R (2016) Understanding the effective receptive field in deep convolutional neural networks. In: Advances in neural information processing systems, vol 29

  13. Zheng WS, Li X, Xiang T, Liao S, Lai J, Gong S (2015) Partial person re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 4678–4686

  14. Zhuo J, Chen Z, Lai J, Wang G (2018) Occluded person reidentification. In: 2018 IEEE International Conference on Multimedia and Expo, ICME 2018, San Diego, CA, USA, July 23-27, IEEE Computer Society, pp 1–6

  15. Zhang Z, Lan C, Zeng W, Jin X, Chen Z (2020) Relation-aware global attention for person re-identification. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)

  16. Chen X, Fu C, Zhao Y, Zheng F, Yang Y (2020) Salience-guided cascaded suppression network for person re-identification. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)

  17. Gong BXY, Zhang Y, Poellabauer C (2019) Second-order non-local attention networks for person re-identification. In: 2019 IEEE/CVF international conference on computer vision (ICCV)

  18. Li W, Zhu X, Gong S (2018) Harmonious attention network for person re-identification. In: 2018 IEEE/CVF conference on computer vision and pattern recognition

  19. Miao J, Wu Y, Liu P, Ding Y, Yang Y (2019) Pose-guided feature alignment for occluded person re-identification. In: 2019 IEEE/CVF international conference on computer vision (ICCV)

  20. Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, Dehghani M, Minderer M, Heigold G Gelly S et al (2020) An image is worth\(16\times 16\) words: transformers for image recognition at scale. arXiv preprintarXiv:2010.11929

  21. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. In: Guyon I, von Luxburg U, Bengio S, Wallach HM, Fergus R, Vishwanathan SVN, Garnett R (eds) Advances in neural information processing systems 30: annual conference on neural information processing systems 2017, December 4-9, 2017, Long Beach, CA, USA, pp 5998–6008

  22. Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2017) Beyond part models: person retrieval with refined part pooling (and a strong convolutional baseline). Springer, Cham

    Google Scholar 

  23. Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D (2020) Grad-cam: visual explanations from deep networks via gradient-based localization. Int J Comput Vis 128(2):336–359

    Article  Google Scholar 

  24. Wang G, Yang S, Liu H, Wang Z, Yang Y, Wang S, Yu G, Zhou E, Sun J (2020) High-order information matters: learning relation and topology for occluded person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 6449–6458

  25. Gao S, Wang J, Lu H, Liu Z (2020) Pose-guided visible part matching for occluded person reid. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)

  26. Devlin J, Chang MW, Lee K, Toutanova K (2018) Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprintarXiv:1810.04805

  27. Oord A, Li Y, Babuschkin I, Simonyan K, Vinyals O, Kavukcuoglu K, Driessche G, Lockhart E, Cobo L Stimberg F et al (2018) Parallel wavenet: fast high-fidelity speech synthesis. In: International conference on machine learning. PMLR, pp 3918–3926

  28. Gu J, Bradbury J, Xiong C, Li VO, Socher R (2017) Non-autoregressive neural machine translation. arXiv preprintarXiv:1711.02281

  29. Ghazvininejad M, Levy O, Liu Y, Zettlemoyer L (2019) Mask-predict: parallel decoding of conditional masked language models. arXiv preprintarXiv:1904.09324

  30. Touvron H, Cord M, Douze M, Massa F, Sablayrolles A, Jégou H (2021) Training data-efficient image transformers & distillation through attention. In: International conference on machine learning. PMLR, pp 10347–10357

  31. He S, Luo H, Wang P, Wang F, Li H, Jiang W (2021) Transreid: transformer-based object re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 15013–15022

  32. Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: a benchmark. In: 2015 IEEE international conference on computer vision, ICCV 2015, Santiago, Chile, December 7-13, 2015, IEEE Computer Society, pp 1116–1124

  33. Ristani E, Solera F, Zou RS, Cucchiara R, Tomasi C (2016) Performance measures and a data set for multi-target, multi-camera tracking. In: European conference on computer vision

  34. Zheng WS, Xiang L, Tao X, Liao S, Lai J, Gong S (2016) Partial person re-identification. In: IEEE international conference on computer vision

  35. He L, Liang J, Li H, Sun Z (2018) Deep spatial feature reconstruction for partial person re-identification: alignment-free approach. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7073–7082

  36. Zhong Z, Zheng L, Kang G, Li S, Yang Y (2017) Random erasing data augmentation. In: Proceedings of the AAAI conference on artificial intelligence, vol 34, no 7

  37. Zhao L, Xi L, Zhuang Y,Wang J (2017) Deeply-learned part-aligned representations for person re-identification. In: 2017 IEEE international conference on computer vision (ICCV)

  38. Huang H, Li D, Zhang Z, Chen X, Huang K (2018) Adversarially occluded samples for person re-identification. In: 2018 IEEE/CVF conference on computer vision and pattern recognition (CVPR)

  39. Suh Y, Wang J, Tang S, Mei T, Lee KM (2018) Part-aligned bilinear representations for person re-identification . In: European conference on computer vision

  40. Ge Y, Li Z, Zhao H, Yin G, Yi S, Wang X et al (2018) Fd-gan: Pose-guided feature distilling gan for robust person re-identification. In: Advances in neural information processing systems, vol 31

  41. He L, Liang J, Li H, Sun Z (2018) Deep spatial feature reconstruction for partial person re-identification: alignment-free approach. In: 2018 IEEE/CVF conference on computer vision and pattern recognition

  42. He L, Sun Z, Zhu Y, Wang Y (2018) Recognizing partial biometric patterns.https://doi.org/10.48550/arXiv.1810.07399

  43. Jia M, Cheng X, Zhai Y, Lu S, Ma S, Tian Y, Zhang J (2021) Matching on sets: conquer occluded person re-identification without alignment. Proc AAAI Conf Artif Intell 35:1673–1681

    Google Scholar 

  44. Jia M, Cheng X, Lu S, Zhang J (2021) Learning disentangled representation implicitly via transformer for occluded person re-identification. arXiv preprintarXiv:2107.02380

  45. Tan H, Liu X, Tian S, Yin B, Li X (2020) Mhsa-net: multi-head self-attention network for occluded person re-identification. IEEE Trans Neural Netw Learn Syst 1-15.https://doi.org/10.1109/TNNLS.2022.3144163

  46. Wang P, Ding C, Shao Z, Hong Z, Zhang S, Tao D (2022) Quality-aware part models for occluded person re-identification. arXiv preprintarXiv:2201.00107

  47. He L, Wang Y, Liu W, Zhao H, Sun Z, Feng J (2019) oreground-aware pyramid reconstruction for alignment-free occluded person re-identification. In: 2019 IEEE/CVF international conference on computer vision, ICCV 2019, Seoul, Korea (South), October 27-November 2, 2019, IEEE, pp 8449–8458

  48. Sun Y, Xu Q, Li Y, Zhang C, Li Y, Wang S, Sun J (2019) Perceive where to focus: Learning visibility-aware part-level features for partial person re-identification. In: IEEE conference on computer vision and pattern recognition, CVPR 2019, Long Beach, CA, USA, June 16-20, 2019, Computer Vision Foundation/IEEE, pp 393–402

  49. Song C, Yan H, Ouyang W, Liang W (2018) Mask-guided contrastive attention model for person re-identification. In: 2018 IEEE/CVF conference on computer vision and pattern recognition (CVPR)

  50. Kalayeh MM, Basaran E, Gokmen E, Kamasak ME, Shah M (2018) Human semantic parsing for person re-identification. In: 2018 IEEE/CVF conference on computer vision and pattern recognition

  51. Zhou K, Yang Y, Cavallaro A, Xiang T (2019) Omni-scale feature learning for person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 3702–3712

  52. Zhu K, Guo H, Liu Z, Tang M, Wang J (2020) Identity-guided human semantic parsing for person re-identification. In: Computer vision—ECCV 2020: 16th European conference, Glasgow, UK, August 23–28, 2020, Proceedings, part III 16. Springer, pp 346–363

  53. Liao S, Jain AK, Li SZ (2012) Partial face recognition: alignment-free approach. IEEE Trans Pattern Anal Mach Intell 35(5):1193–1205

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by Natural Science Foundation of Nanjing University of Posts and Telecommunications under No. NY221077, and National Natural Science Foundation of China under No. 52170001.

Author information

Authors and Affiliations

  1. College of Automation and Artificial Intelligence, Nanjing University of Posts and Telecommunications, Nanjing, China

    Yunbin Zhao, Songhao Zhu, Dongsheng Wang & Zhiwei Liang

Authors
  1. Yunbin Zhao

    You can also search for this author inPubMed Google Scholar

  2. Songhao Zhu

    You can also search for this author inPubMed Google Scholar

  3. Dongsheng Wang

    You can also search for this author inPubMed Google Scholar

  4. Zhiwei Liang

    You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence toSonghao Zhu.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhao, Y., Zhu, S., Wang, D.et al. Short range correlation transformer for occluded person re-identification.Neural Comput & Applic34, 17633–17645 (2022). https://doi.org/10.1007/s00521-022-07400-4

Download citation

Access this article

Subscribe and save

Springer+ Basic
¥17,985 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Advertisement


[8]ページ先頭

©2009-2025 Movatter.jp