PPBI: Pose-Guided Partial-Attention Network with Batch Information for Occluded Person Re-Identification
Abstract
:1. Introduction
2. Related Work
2.1. Feature Representation Learning
2.2. Pose-Guided Feature Alignment Methods
2.3. Mutual Information
3. Method
3.1. Motivation
3.2. Pose-Guided Partial-Attention Network with Batch Information
3.2.1. Node Optimization Network
3.2.2. Learning Key-Point Batch Attention
3.2.3. Learning BE
3.3. Training Loss
Learning Joint Hard Cases and Topological Information
4. Experiment
4.1. Dataset and Experimental Setting
4.2. Implementation
4.3. Model Analysis
4.3.1. Comparison of Different Modules
4.3.2. Parameter Analysis
4.4. Experimental Results
4.4.1. Results on Occluded Datasets
4.4.2. Results on Holistic Datasets
4.4.3. Visual Comparison Experiment
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Zheng, L.; Yang, Y.; Hauptmann, A.G. Person re-identification: Past, present and future. arXiv 2016, arXiv:1610.02984. [Google Scholar]
- Suh, Y.; Wang, J.; Tang, S.; Mei, T.; Lee, K.M. Part-aligned bilinear representations for person re-identification. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 402–419. [Google Scholar]
- Li, Y.; He, J.; Zhang, T.; Liu, X.; Zhang, Y.; Wu, F. Diverse part discovery: Occluded person re-identification with part-aware transformer. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 10–25 June 2021; pp. 2898–2907. [Google Scholar]
- Wang, C.; Zhang, Q.; Huang, C.; Liu, W.; Wang, X. Mancs: A multi-task attentional network with curriculum sampling for person re-identification. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 365–381. [Google Scholar]
- Huang, H.; Li, D.; Zhang, Z.; Chen, X.; Huang, K. Adversarially occluded samples for person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018; pp. 5098–5107. [Google Scholar]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. In Advances in Neural Information Processing Systems; The MIT Press: Cambridge, MA, USA, 2017; Volume 30. [Google Scholar]
- Hou, R.; Ma, B.; Chang, H.; Gu, X.; Shan, S.; Chen, X. Interaction-and-aggregation network for person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 9317–9326. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; et al. An image is worth 16 × 16 words: Transformers for image recognition at scale. arXiv 2020, arXiv:2010.11929. [Google Scholar]
- Ristani, E.; Solera, F.; Zou, R.; Cucchiara, R.; Tomasi, C. Performance measures and a data set for multi-target, multi-camera tracking. In Proceedings of the European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands, 11–14 October 2016; pp. 17–35. [Google Scholar]
- Li, W.; Zhu, X.; Gong, S. Harmonious attention network for person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018; pp. 2285–2294. [Google Scholar]
- Wang, Y.; Chen, Z.; Wu, F.; Wang, G. Person re-identification with cascaded pairwise convolutions. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018; pp. 1470–1478. [Google Scholar]
- Cheng, D.; Gong, Y.; Zhou, S.; Wang, J.; Zheng, N. Person re-identification by multi-channel parts-based cnn with improved triplet loss function. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 1335–1344. [Google Scholar]
- Li, D.; Chen, X.; Zhang, Z.; Huang, K. Learning deep context-aware features over body and latent parts for person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 384–393. [Google Scholar]
- Zhao, L.; Li, X.; Zhuang, Y.; Wang, J. Deeply-learned part-aligned representations for person re-identification. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 3219–3228. [Google Scholar]
- Liu, J.; Ni, B.; Yan, Y.; Zhou, P.; Cheng, S.; Hu, J. Pose transferrable person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018; pp. 4099–4108. [Google Scholar]
- Qian, X.; Fu, Y.; Xiang, T.; Wang, W.; Qiu, J.; Wu, Y.; Jiang, Y.G.; Xue, X. Pose-normalized image generation for person re-identification. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 650–667. [Google Scholar]
- Zheng, Z.; Zheng, L.; Yang, Y. Unlabeled samples generated by gan improve the person re-identification baseline in vitro. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 3754–3762. [Google Scholar]
- Song, C.; Huang, Y.; Ouyang, W.; Wang, L. Mask-guided contrastive attention model for person re-identification. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 1179–1188. [Google Scholar]
- Zhong, Z.; Zheng, L.; Kang, G.; Li, S.; Yang, Y. Random erasing data augmentation. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; Volume 34, pp. 13001–13008. [Google Scholar]
- Sun, Y.; Zheng, L.; Yang, Y.; Tian, Q.; Wang, S. Beyond part models: Person retrieval with refined part pooling (and a strong convolutional baseline). In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 480–496. [Google Scholar]
- Zheng, W.S.; Li, X.; Xiang, T.; Liao, S.; Lai, J.; Gong, S. Partial person re-identification. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015; pp. 4678–4686. [Google Scholar]
- He, S.; Luo, H.; Wang, P.; Wang, F.; Li, H.; Jiang, W. TransReID: Transformer-Based Object Re-Identification. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Montreal, Canada, 10–17 October 2021; pp. 15013–15022. [Google Scholar]
- Jin, X.; Lan, C.; Zeng, W.; Wei, G.; Chen, Z. Semantics-Aligned Representation Learning for Person Re-identification. arXiv 2019, arXiv:1905.13143. [Google Scholar] [CrossRef]
- Sun, K.; Xiao, B.; Liu, D.; Wang, J. Deep high-resolution representation learning for human pose estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 5693–5703. [Google Scholar]
- He, L.; Wang, Y.; Liu, W.; Zhao, H.; Sun, Z.; Feng, J. Foreground-aware pyramid reconstruction for alignment-free occluded person re-identification. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 October–2 November 2019; pp. 8450–8459. [Google Scholar]
- Gao, S.; Wang, J.; Lu, H.; Liu, Z. Pose-guided visible part matching for occluded person reid. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 11744–11752. [Google Scholar]
- Wang, G.; Yang, S.; Liu, H.; Wang, Z.; Yang, Y.; Wang, S.; Yu, G.; Zhou, E.; Sun, J. High-order information matters: Learning relation and topology for occluded person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 6449–6458. [Google Scholar]
- Tang, S.; Chen, C.; Xie, Q.; Chen, M.; Wang, Y.; Ci, Y.; Bai, L.; Zhu, F.; Yang, H.; Yi, L.; et al. HumanBench: Towards General Human-Centric Perception With Projector Assisted Pretraining. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 18–22 June 2023; pp. 21970–21982. [Google Scholar]
- Huang, W.; Jia, X.; Zhong, X.; Wang, X.; Jiang, K.; Wang, Z. Beyond the Parts: Learning Coarse-to-Fine Adaptive Alignment Representation for Person Search. ACM Trans. Multimed. Comput. Commun. Appl. 2022, 19, 1–19. [Google Scholar] [CrossRef]
- Ge, Y.; Li, Z.; Zhao, H.; Yin, G.; Yi, S.; Wang, X. Fd-gan: Pose-guided feature distilling gan for robust person re-identification. In Proceedings of the Advances in Neural Information Processing Systems (NIPS), Montreal, QC, Canada, 3–8 December 2018; Volume 31. [Google Scholar]
- Su, C.; Li, J.; Zhang, S.; Xing, J.; Gao, W.; Tian, Q. Pose-Driven Deep Convolutional Model for Person Re-identification. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 3980–3989. [Google Scholar]
- Sarfraz, M.S.; Schumann, A.; Eberle, A.; Stiefelhagen, R. A Pose-Sensitive Embedding for Person Re-identification with Expanded Cross Neighborhood Re-ranking. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 420–429. [Google Scholar]
- Chen, D.; Xu, D.; Li, H.; Sebe, N.; Wang, X. Group consistent similarity learning via deep crf for person re-identification. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018; pp. 8649–8658. [Google Scholar]
- Luo, C.; Chen, Y.; Wang, N.; Zhang, Z. Spectral feature transformation for person re-identification. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 October–2 November 2019; pp. 4976–4985. [Google Scholar]
- He, L.; Sun, Z.; Zhu, Y.; Wang, Y. Recognizing partial biometric patterns. arXiv 2018, arXiv:1810.07399. [Google Scholar]
- He, L.; Liang, J.; Li, H.; Sun, Z. Deep spatial feature reconstruction for partial person re-identification: Alignment-free approach. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018; pp. 7073–7082. [Google Scholar]
- Luo, H.; Gu, Y.; Liao, X.; Lai, S.; Jiang, W. Bag of Tricks and a Strong Baseline for Deep Person Re-Identification. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Long Beach, CA, USA, 16–20 June 2019; pp. 1487–1495. [Google Scholar]
- Zhuo, J.; Lai, J.; Chen, P. A novel teacher-student learning framework for occluded person re-identification. arXiv 2019, arXiv:1907.03253. [Google Scholar]
- Qi, L.; Huo, J.; Wang, L.; Shi, Y.; Gao, Y. MaskReID: A Mask Based Deep Ranking Neural Network for Person Re-identification. arXiv 2018, arXiv:1804.03864. [Google Scholar]
- Zhu, K.; Guo, H.; Yan, T.; Zhu, Y.; Wang, J.; Tang, M. PASS: Part-Aware Self-Supervised Pre-Training for Person Re-Identification. In Proceedings of the European Conference on Computer Vision (ECCV), Tel Aviv, Israel, 23–27 October 2022; pp. 198–214. [Google Scholar]
- He, W.; Tang, S.; Deng, Y.; Chen, Q.; Xie, Q.; Wang, Y.; Bai, L.; Zhu, F.; Zhao, R.; Ouyang, W.; et al. Instruct-ReID: A Multi-Purpose Person Re-Identification Task with Instructions. In Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 18–22 June 2023; pp. 17521–17531. [Google Scholar]
- Datta, A.; Swamidass, S.J. Fair-Net: A Network Architecture For Reducing Performance Disparity Between Identifiable Sub-Populations. arXiv 2021, arXiv:2106.00720. [Google Scholar]
- Buolamwini, J.; Gebru, T. Gender Shades: Intersectional Accuracy Disparities in Commercial Gender Classification. In Proceedings of the Machine Learning Research (PMLR), Stockholm, Sweden, 10–15 July 2018; pp. 77–91. [Google Scholar]
- Ni, H.; Li, Y.; Shen, H.; Song, J. Part-Aware Transformer for Generalizable Person Re-identification. In Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 2–6 October 2023; pp. 11246–11255. [Google Scholar]
- Miao, J.; Wu, Y.; Liu, P.; Ding, Y.; Yang, Y. Pose-guided feature alignment for occluded person re-identification. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 October–2 November 2019; pp. 542–551. [Google Scholar]
- Zhuo, J.; Chen, Z.; Lai, J.; Wang, G. Occluded person re-identification. In Proceedings of the 2018 IEEE International Conference on Multimedia and Expo (ICME), San Diego, CA, USA, 23–27 July 2018; pp. 1–6. [Google Scholar]
- Kalayeh, M.M.; Basaran, E.; Gökmen, M.; Kamasak, M.E.; Shah, M. Human semantic parsing for person re-identification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 1062–1071. [Google Scholar]
Dataset | Train | Gallery | Query |
---|---|---|---|
Market-1501 | 751/12,936 | 750/19,732 | 750/3368 |
DukeMTMC | 702/16,522 | 1110/17,661 | 702/2668 |
Occluded-Duke | 702/15,618 | 1110/17,661 | 519/2210 |
Occluded-ReID | - | 200/1000 | 200/1000 |
Index | Stage 1 | Stage 2 | Stage 3 | Stage 4 | Rank-1 | mAP |
---|---|---|---|---|---|---|
1 | ✓ | ✕ | ✕ | ✕ | 48.5 | 41.0 |
2 | ✓ | ✓ | ✕ | ✕ | 49.2 | 42.3 |
3 | ✓ | ✕ | ✓ | ✕ | 51.8 | 43.6 |
4 | ✓ | ✕ | ✕ | ✓ | 49.8 | 41.5 |
5 | ✓ | ✓ | ✓ | ✕ | 52.7 | 44.4 |
6 | ✓ | ✓ | ✕ | ✓ | 49.9 | 42.7 |
7 | ✓ | ✕ | ✓ | ✓ | 51.4 | 43.2 |
8 | ✓ | ✓ | ✓ | ✓ | 52.8 | 44.9 |
Methods | Occ-Duke | Occ-ReID | ||
---|---|---|---|---|
Rank-1 (%) | mAP (%) | Rank-1 (%) | mAP (%) | |
Part-Aligne [15] | 28.8 | 20.2 | - | - |
PCB [21] | 42.6 | 33.7 | 41.3 | 38.9 |
Part Bilinear [2] | 36.9 | - | - | - |
FD-GAN [31] | 40.8 | - | - | - |
AMC + SWM [22] | - | - | 31.2 | 27.3 |
DSR [37] | 40.8 | 30.4 | 72.8 | 62.8 |
SFR [36] | 42.3 | 32 | - | - |
Ad-Occluded [5] | 44.5 | 32.2 | - | - |
TCSDO [39] | - | - | 73.7 | 77.9 |
FPR [26] | - | - | 78.3 | 68.0 |
PGFA [46] | 51.4 | 37.3 | - | - |
HOReID [25] | 55.1 | 43.8 | 80.3 | 70.2 |
PPBI (Ours) | 52.8 | 44.9 | 92 | 76.8 |
PPBI + BE | 54.9 | 46.5 | - | - |
Methods | Market-1501 | DukeMTMC | ||
---|---|---|---|---|
Rank-1 (%) | mAP (%) | Rank-1 (%) | mAP (%) | |
PCB [21] | 92.3 | 77.4 | 81.8 | 66.1 |
VPM [21] | 93.0 | 80.8 | 83.6 | 72.6 |
BOT [38] | 94.1 | 85.7 | 86.4 | 76.4 |
SPReID [48] | 92.5 | 81.3 | - | - |
MGCAM [19] | 83.8 | 74.3 | 46.7 | 46.0 |
MaskReID [40] | 90.0 | 75.3 | - | - |
FPR [26] | 95.4 | 86.6 | 88.6 | 78.4 |
PDC [32] | 84.2 | 63.4 | - | - |
Pose-transfer [16] | 87.7 | 68.9 | 30.1 | 28.2 |
PSE [33] | 87.7 | 68.9 | 30.1 | 28.2 |
PGFA [46] | 91.2 | 76.8 | 82.6 | 65.5 |
HOReId [25] | 94.2 | 84.9 | 86.9 | 75.6 |
PPBI (Ours) | 93.0 | 83.1 | 85.8 | 73.1 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cui, J.; Chen, Y.; Deng, B.; Liu, G.; Wang, Z.; Li, Y. PPBI: Pose-Guided Partial-Attention Network with Batch Information for Occluded Person Re-Identification. Sensors 2025, 25, 757. https://doi.org/10.3390/s25030757
Cui J, Chen Y, Deng B, Liu G, Wang Z, Li Y. PPBI: Pose-Guided Partial-Attention Network with Batch Information for Occluded Person Re-Identification. Sensors. 2025; 25(3):757. https://doi.org/10.3390/s25030757
Chicago/Turabian StyleCui, Jianhai, Yiping Chen, Binbin Deng, Guisong Liu, Zhiguo Wang, and Ye Li. 2025. "PPBI: Pose-Guided Partial-Attention Network with Batch Information for Occluded Person Re-Identification" Sensors 25, no. 3: 757. https://doi.org/10.3390/s25030757
APA StyleCui, J., Chen, Y., Deng, B., Liu, G., Wang, Z., & Li, Y. (2025). PPBI: Pose-Guided Partial-Attention Network with Batch Information for Occluded Person Re-Identification. Sensors, 25(3), 757. https://doi.org/10.3390/s25030757