Unsupervised Domain Adaptive Person Re-Identification via Intermediate Domains
Abstract
:1. Introduction
- •
- •
- Intermediate domains were utilized as a bridge to narrow the gap between source and target domains. We first adopted the model pre-trained on the source domain to unlabeled intermediate domains and then fine-tuned it on the unlabeled target domain on the basis of the UDA strong baseline in [6].
- •
- We conducted extensive experiments on Market1501, DukeMTMC-reID and MSMT17 datasets. The results demonstrate the effectiveness of our framework. Our scheme achieves a state-of-the-art performance on all of the benchmark datasets.
2. Related Works
2.1. Unsupervised Domain Adaptive ReID
2.1.1. Feature Alignment
2.1.2. Image Translation
2.1.3. Semi-Supervised Learning
2.2. Domain Adaptation and Generalization
3. Proposed Method
3.1. Intermediate Domains
3.1.1. SPGAN Model
3.1.2. Generating Intermediate Domains
3.2. UDA Baseline via Intermediate Domains
3.2.1. Fine-Tuning with Source Data
3.2.2. Mean Teacher Method
3.2.3. Contrastive Loss
3.2.4. Triplet Loss
4. Experimental Results and Discussion
4.1. Datasets
4.2. Implementation Details
4.2.1. Training of the GAN Model
4.2.2. Pre-Training on Source Domain
4.2.3. Fine-Tuning on Intermediate and Target Domains
4.3. Analysis of Intermediate Domains
4.3.1. Qualitative Evaluation
4.3.2. Quantitative Evaluation
4.4. Comparison with Other Methods
4.5. Further Analysis
4.5.1. Differences between GAN-Based Approaches and our Proposed Method
4.5.2. Number of Intermediate Domains
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
ReID | Person Re-identification |
UDA | Unsupervised Domain Adaptive |
GAN | Generative Adversarial Network |
SPGAN | Similarity Preserving Generative Adversarial Network |
CNN | Convolutional Neural Network |
DLOW | Domain Flow for Adaptation and Generalization |
SOTA | State-Of-The-Art |
MMD | Maximum Mean Discrepancy |
PTGAN | Person Transfer Generative Adversarial Network |
PDA-Net | Pose Disentanglement and Adaptation Network |
HHL | Hetero and Homogeneously Learning |
UNRN | Uncertainty-Guided Noise Resilient Network |
CycleGAN | Cycle-Consistent Adversarial Networks |
MMT | Mutual Mean-Teaching |
DBSCAN | Density-Based Spatial Clustering of Applications with Noise |
mAP | mean Average Precision |
MEB-Net | Multiple Expert Brainstorming Network |
References
- Wang, G.; Yang, S.; Liu, H.; Wang, Z.; Yang, Y.; Wang, S.; Yu, G.; Zhou, E.; Sun, J. High-Order Information Matters: Learning Relation and Topology for Occluded Person Re-Identification. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 6449–6458. [Google Scholar]
- Luo, H.; Jiang, W.; Gu, Y.; Liu, F.; Liao, X.; Lai, S.; Gu, J. A Strong Baseline and Batch Normalization Neck for Deep Person Re-Identification. IEEE Trans. Multimed. 2020, 22, 2597–2609. [Google Scholar] [CrossRef] [Green Version]
- Zheng, L.; Shen, L.; Tian, L.; Wang, S.; Wang, J.; Tian, Q. Scalable Person Re-identification: A Benchmark. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015; pp. 1116–1124. [Google Scholar]
- Ristani, E.; Solera, F.; Zou, R.S.; Cucchiara, R.; Tomasi, C. Performance Measures and a Data Set for Multi-target, Multi-camera Tracking. In European Conference on Computer Vision (ECCV); Springer: Cham, Switzerland, 2016; pp. 17–35. [Google Scholar]
- Ge, Y.; Chen, D.; Li, H. Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification. In Proceedings of the ICLR 2020: The Eighth International Conference on Learning Representations, Addis Ababa, Ethiopia, 26–30 April 2020. [Google Scholar]
- Zheng, K.; Lan, C.; Zeng, W.; Zhang, Z.; Zha, Z.J. Exploiting Sample Uncertainty for Domain Adaptive Person Re-Identification. Proc. AAAI Conf. Artif. Intell. 2021, 35, 3538–3546. [Google Scholar]
- Cui, Z.; Li, W.; Xu, D.; Shan, S.; Chen, X.; Li, X. Flowing on Riemannian manifold: Domain adaptation by shifting covariance. IEEE Trans. Syst. Man Cybern. 2014, 44, 2264–2273. [Google Scholar]
- Gong, R.; Li, W.; Chen, Y.; Gool, L.V. DLOW: Domain Flow for Adaptation and Generalization. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 2477–2486. [Google Scholar]
- Deng, W.; Zheng, L.; Ye, Q.; Kang, G.; Yang, Y.; Jiao, J. Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018; pp. 994–1003. [Google Scholar]
- Lin, S.; Li, H.; Li, C.T.; Kot, A.C. Multi-task mid-level feature alignment network for unsupervised cross-dataset person re-identification. In Proceedings of the BMVC 2018: Proceedings of the 29th British Machine Vision Conference, Newcastle, UK, 3–6 September 2018; p. 9. [Google Scholar]
- Wang, J.; Zhu, X.; Gong, S.; Li, W. Transferable Joint Attribute-Identity Deep Learning for Unsupervised Person Re-identification. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018; pp. 2275–2284. [Google Scholar]
- Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative Adversarial Nets. Adv. Neural Inf. Process. Syst. 2014, 27, 2672–2680. [Google Scholar]
- Wei, L.; Zhang, S.; Gao, W.; Tian, Q. Person Transfer GAN to Bridge Domain Gap for Person Re-identification. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018; pp. 79–88. [Google Scholar]
- Lv, J.; Wang, X. Cross-Dataset Person Re-identification Using Similarity Preserved Generative Adversarial Networks. In International Conference on Knowledge Science, Engineering and Management; Springer: Cham, Switzerland, 2018; pp. 171–183. [Google Scholar]
- Zhong, Z.; Zheng, L.; Li, S.; Yang, Y. Generalizing A Person Retrieval Model Hetero- and Homogeneously. In European Conference on Computer Vision (ECCV); Springer: Cham, Switzerland, 2018; pp. 172–188. [Google Scholar]
- Ge, Y.; Zhu, F.; Chen, D.; Zhao, R.; Li, H. Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID. Adv. Neural Inf. Process. Syst. 2020, 33, 11309–11321. [Google Scholar]
- Zhao, F.; Liao, S.; Xie, G.S.; Zhao, J.; Zhang, K.; Shao, L. Unsupervised Domain Adaptation with Noise Resistible Mutual-Training for Person Re-identification. In European Conference on Computer Vision (ECCV); Springer: Cham, Switzerland, 2020; pp. 526–544. [Google Scholar]
- Zhai, Y.; Ye, Q.; Lu, S.; Jia, M.; Ji, R.; Tian, Y. Multiple Expert Brainstorming for Domain Adaptive Person Re-Identification. In European Conference on Computer Vision (ECCV); Springer: Cham, Switzerland, 2020; pp. 594–611. [Google Scholar]
- Li, R.; Jiao, Q.; Cao, W.; Wong, H.S.; Wu, S. Model Adaptation: Unsupervised Domain Adaptation Without Source Data. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 9641–9650. [Google Scholar]
- Zhou, B.; Kalra, N.; Krahenbuhl, P. Domain Adaptation Through Task Distillation. In European Conference on Computer Vision (ECCV); Springer: Cham, Switzerland, 2020; pp. 664–680. [Google Scholar]
- Huang, F.; Zhang, L.; Yang, Y.; Zhou, X. Probability Weighted Compact Feature for Domain Adaptive Retrieval. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 9582–9591. [Google Scholar]
- Zhou, K.; Yang, Y.; Hospedales, T.M.; Xiang, T. Learning to Generate Novel Domains for Domain Generalization. In European Conference on Computer Vision (ECCV); Springer: Cham, Switzerland, 2020; pp. 561–578. [Google Scholar]
- Zhou, K.; Yang, Y.; Qiao, Y.; Xiang, T. Domain Generalization with MixStyle. In Proceedings of the ICLR 2021: The Ninth International Conference on Learning Representations, Vienna, Austria, 3–7 May 2021. [Google Scholar]
- Wang, H.; He, H.; Katabi, D. Continuously Indexed Domain Adaptation. In Proceedings of the International Conference on Machine Learning, Virtual, 13–18 July 2020; Volume 1, pp. 9898–9907. [Google Scholar]
- Zhu, J.Y.; Park, T.; Isola, P.; Efros, A.A. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 2242–2251. [Google Scholar]
- Taigman, Y.; Polyak, A.; Wolf, L. Unsupervised Cross-Domain Image Generation. In Proceedings of the ICLR 2016: The Fourth International Conference on Learning Representations, San Juan, Puerto Rico, 2–4 May 2016. [Google Scholar]
- Hadsell, R.; Chopra, S.; LeCun, Y. Dimensionality Reduction by Learning an Invariant Mapping. In Proceedings of the 2006 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New York, NY, USA, 17–22 June 2006; Volume 2, pp. 1735–1742. [Google Scholar]
- Szegedy, C.; Vanhoucke, V.; Ioffe, S.; Shlens, J.; Wojna, Z. Rethinking the Inception Architecture for Computer Vision. In Proceedings of the 2016 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 2818–2826. [Google Scholar]
- Kingma, D.P.; Ba, J.L. Adam: A Method for Stochastic Optimization. In Proceedings of the ICLR 2015: The Third International Conference on Learning Representations, San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Ester, M.; Kriegel, H.P.; Sander, J.; Xu, X. A density-based algorithm for discovering clusters in large spatial Databases with Noise. In Proceedings of the 2nd International Conference on Knowledg Discovery and Data Mining (KDD-96), Portland, OR, USA, 2–4 August 1996; pp. 226–231. [Google Scholar]
- Zhong, Z.; Zheng, L.; Luo, Z.; Li, S.; Yang, Y. Invariance Matters: Exemplar Memory for Domain Adaptive Person Re-Identification. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019; pp. 598–607. [Google Scholar]
- Chen, Y.; Zhu, X.; Gong, S. Instance-Guided Context Rendering for Cross-Domain Person Re-Identification. In Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea, 27 October–2 November 2019; pp. 232–242. [Google Scholar]
- Fu, Y.; Wei, Y.; Wang, G.; Zhou, Y.; Shi, H.; Uiuc, U.; Huang, T. Self-Similarity Grouping: A Simple Unsupervised Cross Domain Adaptation Approach for Person Re-Identification. In Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea, 27 October–2 November 2019; pp. 6111–6120. [Google Scholar]
- Wang, D.; Zhang, S. Unsupervised Person Re-Identification via Multi-Label Classification. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 10981–10990. [Google Scholar]
- Yang, F.; Li, K.; Zhong, Z.; Luo, Z.; Sun, X.; Cheng, H.; Guo, X.; Huang, F.; Ji, R.; Li, S. Asymmetric Co-Teaching for Unsupervised Cross-Domain Person Re-Identification. Proc. AAAI Conf. Artif. Intell. 2020, 34, 12597–12604. [Google Scholar] [CrossRef]
- Jin, X.; Lan, C.; Zeng, W.; Chen, Z.; Zhang, L. Style Normalization and Restitution for Generalizable Person Re-Identification. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 3143–3152. [Google Scholar]
- Zhong, Z.; Zheng, L.; Luo, Z.; Li, S.; Yang, Y. Learning to Adapt Invariance in Memory for Person Re-identification. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 43, 2723–2738. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Zhai, Y.; Lu, S.; Ye, Q.; Shan, X.; Chen, J.; Ji, R.; Tian, Y. AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-Identification. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 9021–9030. [Google Scholar]
- Huang, Y.; Peng, P.; Jin, Y.; Xing, J.; Lang, C.; Feng, S. Domain Adaptive Attention Model for Unsupervised Cross-Domain Person Re-Identification. arXiv 2019, arXiv:1905.10529. [Google Scholar]
- Jin, X.; Lan, C.; Zeng, W.; Chen, Z. Global Distance-Distributions Separation for Unsupervised Person Re-identification. In European Conference on Computer Vision (ECCV); Springer: Cham, Switzerland, 2020; pp. 735–751. [Google Scholar]
- Liu, C.T.; Lee, M.Y.; Chen, T.S.; Chien, S.Y. Hard samples rectification for unsupervised cross-domain person re-identification. In Proceedings of the 2021 IEEE International Conference on Image Processing (ICIP), Anchorage, AK, USA, 19–22 September 2021; pp. 429–433. [Google Scholar]
Dataset | Rank-1 | mAP |
---|---|---|
Market1501 | 94.5 | 91.0 |
M→D | 47.5 | 37.6 |
DukeMTMC-reID | 86.7 | 84.1 |
D→M | 61.3 | 43.5 |
Dataset | mAP | Rank-1 | Rank-5 | Rank-10 |
---|---|---|---|---|
market(pre-train) | 83.2 | 93.9 | 97.7 | 98.5 |
directly transfer | 6.9 | 19.4 | 28.7 | 33.9 |
msmt2m_0.2 | 12.3 | 30.1 | 40.7 | 46.1 |
msmt2m_0.5 | 19.0 | 42.6 | 53.5 | 59.0 |
msmt2m_0.8 | 22.8 | 48.3 | 59.8 | 65.0 |
msmt17 | 27.6 | 55.5 | 66.8 | 71.7 |
Dataset | mAP | Rank-1 | Rank-5 | Rank-10 |
---|---|---|---|---|
duke(pre-train) | 73.7 | 87.4 | 93.4 | 94.9 |
directly transfer | 9.4 | 27.4 | 38.4 | 43.5 |
msmt2d_0.2 | 19.0 | 41.9 | 53.3 | 58.7 |
msmt2d_0.5 | 23.6 | 49.1 | 61.6 | 66.8 |
msmt2d_0.8 | 25.3 | 52.7 | 65.1 | 69.9 |
msmt17 | 29.5 | 58.3 | 69.7 | 74.3 |
Metdods | DukeMTMC-reID→Market1501 | Market1501→DukeMTMC-reID | |||||||
mAP | Rank-1 | Rank-5 | Rank-10 | mAP | Rank-1 | Rank-5 | Rank-10 | ||
ECN [32] | CVPR’19 | 43.0 | 75.1 | 87.6 | 91.6 | 40.4 | 63.3 | 75.8 | 80.4 |
CR-GAN [33] | ICCV’19 | 54.0 | 77.7 | 89.7 | 92.7 | 48.6 | 68.9 | 80.2 | 84.7 |
SSG [34] | ICCV’19 | 58.3 | 80.0 | 90.0 | 92.4 | 53.4 | 73.0 | 80.6 | 83.2 |
MMCL [35] | CVPR’20 | 60.4 | 84.4 | 92.8 | 95.0 | 51.4 | 72.4 | 82.9 | 85.0 |
ACT [36] | AAAI’20 | 60.6 | 80.5 | - | - | 54.5 | 72.4 | - | - |
SNR [37] | CVPR’20 | 61.7 | 82.8 | - | - | 58.1 | 76.3 | - | - |
ECN++ [38] | TPAMI’20 | 63.8 | 84.1 | 92.8 | 95.4 | 54.4 | 74.0 | 83.7 | 87.4 |
AD-Cluster [39] | CVPR’20 | 68.3 | 86.7 | 94.4 | 96.5 | 54.1 | 72.6 | 82.5 | 85.5 |
DAAM [40] | AAAI’20 | 67.8 | 86.4 | - | - | 63.9 | 77.6 | - | - |
NRMT [17] | ECCV’20 | 71.7 | 87.8 | 94.6 | 96.5 | 62.2 | 77.8 | 86.9 | 98.5 |
MMT [5] | ICLR’20 | 71.2 | 87.7 | 94.9 | 96.9 | 65.1 | 78.0 | 88.8 | 92.5 |
B-SNR+GDS-H [41] | ECCV’20 | 72.5 | 89.3 | - | - | 59.7 | 76.7 | - | - |
MEB-Net [18] | ECCV’20 | 76.0 | 89.9 | 96.0 | 97.5 | 66.1 | 79.6 | 88.3 | 92.2 |
SpCL [16] | NeurIPS’20 | 76.7 | 90.3 | 96.2 | 97.7 | 68.8 | 82.9 | 90.1 | 92.5 |
HSR [42] | ICIP’21 | 65.2 | 85.3 | - | - | 58.1 | 76.1 | - | - |
UNRN [6] | AAAI’21 | 78.1 | 91.9 | 96.1 | 97.8 | 69.1 | 82.0 | 90.7 | 93.5 |
Baseline | 78.7 | 92.0 | 96.5 | 97.9 | 68.5 | 82.1 | 90.1 | 92.3 | |
Ours | 80.2 | 92.4 | 97.1 | 98.1 | 69.1 | 82.3 | 90.8 | 92.8 | |
Metdods | Market1501→MSMT17 | DukeMTMC-reID→MSMT17 | |||||||
mAP | Rank-1 | Rank-5 | Rank-10 | mAP | Rank-1 | Rank-5 | Rank-10 | ||
ECN [32] | CVPR’19 | 8.5 | 25.3 | 36.3 | 42.1 | 10.2 | 30.2 | 41.5 | 46.8 |
SSG [34] | ICCV’19 | 13.2 | 31.6 | - | 49.6 | 13.3 | 32.3 | - | 51.2 |
MMCL [35] | CVPR’20 | 15.1 | 40.8 | 51.8 | 56.7 | 16.2 | 43.6 | 54.3 | 58.9 |
ECN++ [38] | TPAMI’20 | 15.2 | 40.4 | 53.1 | 58.7 | 16.0 | 42.5 | 55.9 | 61.5 |
NRMT [17] | ECCV’20 | 19.8 | 43.7 | 56.5 | 62.2 | 20.6 | 45.2 | 57.8 | 63.3 |
MMT [5] | ICLR’20 | 22.9 | 49.2 | 63.1 | 68.8 | 23.3 | 50.1 | 63.9 | 68.8 |
SpCL [16] | NeurIPS’20 | 25.4 | 51.6 | 64.3 | 69.7 | 26.5 | 53.1 | 65.8 | 70.5 |
UNRN [6] | AAAI’21 | 25.3 | 52.4 | 64.7 | 69.7 | 26.2 | 54.9 | 67.3 | 70.6 |
Baseline | 25.1 | 51.1 | 62.4 | 67.6 | 27.6 | 55.4 | 68.3 | 72.7 | |
Ours | 27.6 | 55.5 | 66.8 | 71.7 | 29.5 | 58.3 | 69.7 | 74.3 |
Numbers of Intermediate Domains | Duke→Market | Market→Duke | ||
---|---|---|---|---|
mAP | Rank-1 | mAP | Rank-1 | |
2 | 78.8 | 92.1 | 67.6 | 81.1 |
3 | 80.2 | 92.4 | 69.1 | 82.3 |
4 | 79.7 | 92.0 | 67.8 | 81.4 |
Methods | Duke→Market | Market→Duke | ||
---|---|---|---|---|
mAP | Rank-1 | mAP | Rank-1 | |
cycleGAN | 77.8 | 90.4 | 68.2 | 81.5 |
SPGAN | 80.2 | 92.4 | 69.1 | 82.3 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Xie, H.; Luo, H.; Gu, J.; Jiang, W. Unsupervised Domain Adaptive Person Re-Identification via Intermediate Domains. Appl. Sci. 2022, 12, 6990. https://doi.org/10.3390/app12146990
Xie H, Luo H, Gu J, Jiang W. Unsupervised Domain Adaptive Person Re-Identification via Intermediate Domains. Applied Sciences. 2022; 12(14):6990. https://doi.org/10.3390/app12146990
Chicago/Turabian StyleXie, Haonan, Hao Luo, Jianyang Gu, and Wei Jiang. 2022. "Unsupervised Domain Adaptive Person Re-Identification via Intermediate Domains" Applied Sciences 12, no. 14: 6990. https://doi.org/10.3390/app12146990
APA StyleXie, H., Luo, H., Gu, J., & Jiang, W. (2022). Unsupervised Domain Adaptive Person Re-Identification via Intermediate Domains. Applied Sciences, 12(14), 6990. https://doi.org/10.3390/app12146990