1Henan Univ. (China)
ARTICLE - 1 Introduction
- 2 Related Works
- 2.1 Local-Based Person Re-Identification
- 2.2 Attention-Based Person Re-Identification
- 3 Local Information Interaction Enhancement Network
- 3.1 Overall Framework
- 3.2 Dual Attention Module
- 3.2.1 Channel attention module
- 3.2.2 Spatial attention module
- 3.3 Local Information Interaction Module
- 3.4 Loss Function
- 4 Experiment
- 4.1 Experimental Details
- 4.2 Datasets
- 4.3 Comparison with State-of-the-Art Methods
- 4.4 Ablation Study
- 4.4.1 Dual attention module
- 4.4.2 Local information interaction module
- 4.5 Hyper-Parameter Analysis
- 4.6 Visualization of Experimental Results
- 4.6.1 Visualization of retrieval results
- 4.6.2 Visualization of feature maps
- 5 Conclusions
FIGURES & TABLES REFERENCES CITED BY
Most of existing local-based person re-identification (re-ID) methods extract powerful feature representations from multiple body regions of a pedestrian. However, local-based person re-ID methods employ horizontal or grid slicing to divide pedestrian images, which can easily cause network redundancy and misalignment of pedestrian body parts. Attention mechanisms have been recently introduced to computer vision tasks to strengthen local salient features and suppress irrelevant features simultaneously. However, attention mechanisms overemphasize local salient features, which may lead to potential significant information being missed. To address the above issues, we proposed a local information interaction enhancement network (LIEN) for person re-ID. The network integrates attention mechanism with local-based strategy to achieve mutually facilitating and complementary effects. Specifically, LIEN was composed of three parts: a backbone network, a dual attention module (DAM), and a local information interaction module (LIM). DAM was used to guide the backbone network to extract the local salient information, and LIM was used to mine the potential significant information that DAM ignored by exploring the correlations between different channel-level local features. With the combination of DAM and LIM, LIEN mined the salient features while obtaining more potential effective features. In particular, the way that it extracts local features, LIEN effectively avoided pedestrian misalignment caused by uniformly dividing pedestrian images. We evaluated the performance of LIEN on public person datasets, and experimental results indicated that LIEN achieved a more advanced performance. |

Proceedings of SPIE (October 12 2022)
Proceedings of SPIE (September 07 2022)
Proceedings of SPIE (October 06 1997)