Movatterモバイル変換

Yunbin Zhao¹,
Songhao Zhu ORCID:orcid.org/0000-0002-9891-5692¹,
Dongsheng Wang¹ &
…
Zhiwei Liang¹

631Accesses
4Altmetric
Explore all metrics

Abstract

Occluded person re-identification is one of the challenging areas of computer vision, which faces problems such as inefficient feature representation and low recognition accuracy. Recently, vision transformer is introduced into the field of re-identification and achieved state-of-the-art results by constructing global feature relationships between patch sequences. However, vision transformer is not good at capturing short-range correlations of patch sequence and exploiting spatial correlation in patch sequence, which leads to a decrease in the accuracy and robustness of the network in the face of occluded person re-identification. Therefore, to address the above problems, we design a partial feature transformer-based occluded person re-identification framework named PFT. The proposed PFT utilizes three modules to enhance the efficiency of vision transformer. (1) Patch full dimension enhancement module. We design a learnable tensor with the same size as patch sequences, which is full-dimensional and deeply embedded in patch sequences to enrich the diversity of training samples. (2) Fusion and reconstruction module. We extract the less important part of obtained patch sequences, and fuse them with original patch sequence to reconstruct the original patch sequences. (3) Spatial Slicing Module. We slice and group patch sequences from spatial direction, which can effectively improve the short-range correlation of patch sequences. Experimental results over occluded and holistic re-identification datasets demonstrate that the proposed PFT network achieves superior performance consistently and outperforms the state-of-the-art methods.

This is a preview of subscription content,log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

¥17,985 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Subscribe now

Buy Now

Price includes VAT (Japan)

Instant access to the full article PDF.

Institutional subscriptions

Parallel Dense Vision Transformer and Augmentation Network for Occluded Person Re-identification

Single-scale robust feature representation for occluded person re-identification

Article14 August 2023

Swin transformer with part-level tokenization for occluded person re-identification

Article30 November 2024

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

References

Lakhan A, Mohammed MA, Kadry S, Abdulkareem KH, Al-Dhief FT, Hsu C-H (2021) Federated learning enables intelligent reflecting surface in fog-cloud enabled cellular network. PeerJ Comput Sci 7:e758
Article Google Scholar
Awan MJ, Masood OA, Mohammed MA, Yasin A, Zain AM, Damaševičius R, Abdulkareem KH (2021) Image-based malware classification using vgg19 network and spatial convolutional attention. Electronics 10(19):2444
Article Google Scholar
Poongodi M, Malviya M, Hamdi M, Vijayakumar V, Mohammed MA, Rauf HT, Al-Dhlan KA (2022) 5G based Blockchain network for authentic and ethical keyword search engine. IET Commun 16(5):442–448
Article Google Scholar
Mohammed MA, Ibrahim DA, Salman AO (2021) Adaptive intelligent learning approach based on visual anti-spam email model for multi-natural language. J Intell Syst 30(1):774–792
Article Google Scholar
Mujahid A, Awan MJ, Yasin A, Mohammed MA, Damaševičius R, Maskeliūnas R, Abdulkareem KH (2021) Real-time hand gesture recognition based on deep learning yolov3 model. Appl Sci 11(9):4164
Article Google Scholar
Zheng L, Yang Y, Hauptmann AG (2016) Person re-identification: past, present and future. arXiv preprintarXiv:1610.02984
Wang Z, Jiang J, Wu Y, Ye M, Bai X, Satoh S (2020) Learning sparse and identity-preserved hidden attributes for person re-identification. IEEE Trans Image Process 29:2013–2025
Article Google Scholar
Liao S, Yang H, Zhu X, Li SZ (2015) Person re-identification by local maximal occurrence representation and metric learning. In: 2015 IEEE conference on computer vision and pattern recognition (CVPR)
Liao S, Li SZ (2015) Efficient PSD constrained asymmetric metric learning for person re-identification. In: 2015 IEEE international conference on computer vision (ICCV)
Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv preprintarXiv:1703.07737
Wang G, Lai JH, Liang W, Wang G (2020) Smoothing adversarial domain attack and p-memory reconsolidation for cross-domain person re-identification. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
Luo W, Li Y, Urtasun R, Zemel R (2016) Understanding the effective receptive field in deep convolutional neural networks. In: Advances in neural information processing systems, vol 29
Zheng WS, Li X, Xiang T, Liao S, Lai J, Gong S (2015) Partial person re-identification. In: Proceedings of the IEEE international conference on computer vision, pp 4678–4686
Zhuo J, Chen Z, Lai J, Wang G (2018) Occluded person reidentification. In: 2018 IEEE International Conference on Multimedia and Expo, ICME 2018, San Diego, CA, USA, July 23-27, IEEE Computer Society, pp 1–6
Zhang Z, Lan C, Zeng W, Jin X, Chen Z (2020) Relation-aware global attention for person re-identification. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
Chen X, Fu C, Zhao Y, Zheng F, Yang Y (2020) Salience-guided cascaded suppression network for person re-identification. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
Gong BXY, Zhang Y, Poellabauer C (2019) Second-order non-local attention networks for person re-identification. In: 2019 IEEE/CVF international conference on computer vision (ICCV)
Li W, Zhu X, Gong S (2018) Harmonious attention network for person re-identification. In: 2018 IEEE/CVF conference on computer vision and pattern recognition
Miao J, Wu Y, Liu P, Ding Y, Yang Y (2019) Pose-guided feature alignment for occluded person re-identification. In: 2019 IEEE/CVF international conference on computer vision (ICCV)
Dosovitskiy A, Beyer L, Kolesnikov A, Weissenborn D, Zhai X, Unterthiner T, Dehghani M, Minderer M, Heigold G Gelly S et al (2020) An image is worth\(16\times 16\) words: transformers for image recognition at scale. arXiv preprintarXiv:2010.11929
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. In: Guyon I, von Luxburg U, Bengio S, Wallach HM, Fergus R, Vishwanathan SVN, Garnett R (eds) Advances in neural information processing systems 30: annual conference on neural information processing systems 2017, December 4-9, 2017, Long Beach, CA, USA, pp 5998–6008
Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2017) Beyond part models: person retrieval with refined part pooling (and a strong convolutional baseline). Springer, Cham
Google Scholar
Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D (2020) Grad-cam: visual explanations from deep networks via gradient-based localization. Int J Comput Vis 128(2):336–359
Article Google Scholar
Wang G, Yang S, Liu H, Wang Z, Yang Y, Wang S, Yu G, Zhou E, Sun J (2020) High-order information matters: learning relation and topology for occluded person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 6449–6458
Gao S, Wang J, Lu H, Liu Z (2020) Pose-guided visible part matching for occluded person reid. In: 2020 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
Devlin J, Chang MW, Lee K, Toutanova K (2018) Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprintarXiv:1810.04805
Oord A, Li Y, Babuschkin I, Simonyan K, Vinyals O, Kavukcuoglu K, Driessche G, Lockhart E, Cobo L Stimberg F et al (2018) Parallel wavenet: fast high-fidelity speech synthesis. In: International conference on machine learning. PMLR, pp 3918–3926
Gu J, Bradbury J, Xiong C, Li VO, Socher R (2017) Non-autoregressive neural machine translation. arXiv preprintarXiv:1711.02281
Ghazvininejad M, Levy O, Liu Y, Zettlemoyer L (2019) Mask-predict: parallel decoding of conditional masked language models. arXiv preprintarXiv:1904.09324
Touvron H, Cord M, Douze M, Massa F, Sablayrolles A, Jégou H (2021) Training data-efficient image transformers & distillation through attention. In: International conference on machine learning. PMLR, pp 10347–10357
He S, Luo H, Wang P, Wang F, Li H, Jiang W (2021) Transreid: transformer-based object re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 15013–15022
Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: a benchmark. In: 2015 IEEE international conference on computer vision, ICCV 2015, Santiago, Chile, December 7-13, 2015, IEEE Computer Society, pp 1116–1124
Ristani E, Solera F, Zou RS, Cucchiara R, Tomasi C (2016) Performance measures and a data set for multi-target, multi-camera tracking. In: European conference on computer vision
Zheng WS, Xiang L, Tao X, Liao S, Lai J, Gong S (2016) Partial person re-identification. In: IEEE international conference on computer vision
He L, Liang J, Li H, Sun Z (2018) Deep spatial feature reconstruction for partial person re-identification: alignment-free approach. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7073–7082
Zhong Z, Zheng L, Kang G, Li S, Yang Y (2017) Random erasing data augmentation. In: Proceedings of the AAAI conference on artificial intelligence, vol 34, no 7
Zhao L, Xi L, Zhuang Y,Wang J (2017) Deeply-learned part-aligned representations for person re-identification. In: 2017 IEEE international conference on computer vision (ICCV)
Huang H, Li D, Zhang Z, Chen X, Huang K (2018) Adversarially occluded samples for person re-identification. In: 2018 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
Suh Y, Wang J, Tang S, Mei T, Lee KM (2018) Part-aligned bilinear representations for person re-identification . In: European conference on computer vision
Ge Y, Li Z, Zhao H, Yin G, Yi S, Wang X et al (2018) Fd-gan: Pose-guided feature distilling gan for robust person re-identification. In: Advances in neural information processing systems, vol 31
He L, Liang J, Li H, Sun Z (2018) Deep spatial feature reconstruction for partial person re-identification: alignment-free approach. In: 2018 IEEE/CVF conference on computer vision and pattern recognition
He L, Sun Z, Zhu Y, Wang Y (2018) Recognizing partial biometric patterns.https://doi.org/10.48550/arXiv.1810.07399
Jia M, Cheng X, Zhai Y, Lu S, Ma S, Tian Y, Zhang J (2021) Matching on sets: conquer occluded person re-identification without alignment. Proc AAAI Conf Artif Intell 35:1673–1681
Google Scholar
Jia M, Cheng X, Lu S, Zhang J (2021) Learning disentangled representation implicitly via transformer for occluded person re-identification. arXiv preprintarXiv:2107.02380
Tan H, Liu X, Tian S, Yin B, Li X (2020) Mhsa-net: multi-head self-attention network for occluded person re-identification. IEEE Trans Neural Netw Learn Syst 1-15.https://doi.org/10.1109/TNNLS.2022.3144163
Wang P, Ding C, Shao Z, Hong Z, Zhang S, Tao D (2022) Quality-aware part models for occluded person re-identification. arXiv preprintarXiv:2201.00107
He L, Wang Y, Liu W, Zhao H, Sun Z, Feng J (2019) oreground-aware pyramid reconstruction for alignment-free occluded person re-identification. In: 2019 IEEE/CVF international conference on computer vision, ICCV 2019, Seoul, Korea (South), October 27-November 2, 2019, IEEE, pp 8449–8458
Sun Y, Xu Q, Li Y, Zhang C, Li Y, Wang S, Sun J (2019) Perceive where to focus: Learning visibility-aware part-level features for partial person re-identification. In: IEEE conference on computer vision and pattern recognition, CVPR 2019, Long Beach, CA, USA, June 16-20, 2019, Computer Vision Foundation/IEEE, pp 393–402
Song C, Yan H, Ouyang W, Liang W (2018) Mask-guided contrastive attention model for person re-identification. In: 2018 IEEE/CVF conference on computer vision and pattern recognition (CVPR)
Kalayeh MM, Basaran E, Gokmen E, Kamasak ME, Shah M (2018) Human semantic parsing for person re-identification. In: 2018 IEEE/CVF conference on computer vision and pattern recognition
Zhou K, Yang Y, Cavallaro A, Xiang T (2019) Omni-scale feature learning for person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 3702–3712
Zhu K, Guo H, Liu Z, Tang M, Wang J (2020) Identity-guided human semantic parsing for person re-identification. In: Computer vision—ECCV 2020: 16th European conference, Glasgow, UK, August 23–28, 2020, Proceedings, part III 16. Springer, pp 346–363
Liao S, Jain AK, Li SZ (2012) Partial face recognition: alignment-free approach. IEEE Trans Pattern Anal Mach Intell 35(5):1193–1205
Article Google Scholar

Download references

Acknowledgements

This work is supported by Natural Science Foundation of Nanjing University of Posts and Telecommunications under No. NY221077, and National Natural Science Foundation of China under No. 52170001.

Author information

Authors and Affiliations

College of Automation and Artificial Intelligence, Nanjing University of Posts and Telecommunications, Nanjing, China
Yunbin Zhao, Songhao Zhu, Dongsheng Wang & Zhiwei Liang

Authors

Yunbin Zhao
View author publications
You can also search for this author inPubMed Google Scholar
Songhao Zhu
View author publications
You can also search for this author inPubMed Google Scholar
Dongsheng Wang
View author publications
You can also search for this author inPubMed Google Scholar
Zhiwei Liang
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence toSonghao Zhu.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhao, Y., Zhu, S., Wang, D.et al. Short range correlation transformer for occluded person re-identification.Neural Comput & Applic34, 17633–17645 (2022). https://doi.org/10.1007/s00521-022-07400-4

Download citation

Received:28 October 2021
Accepted:04 May 2022
Published:01 June 2022
Issue Date:October 2022
DOI:https://doi.org/10.1007/s00521-022-07400-4

Movatterモバイル変換

Short range correlation transformer for occluded person re-identification

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Parallel Dense Vision Transformer and Augmentation Network for Occluded Person Re-identification

Single-scale robust feature representation for occluded person re-identification

Swin transformer with part-level tokenization for occluded person re-identification

Explore related subjects

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Access this article

Subscribe and save

Buy Now