Age-Invariant Face Retrieval Based on Hybrid Metric Learning Framework (HMLF)
Abstract
1. Introduction
- We propose a Hybrid Metric Learning Framework (HMLF) that mingles ArcFace with supervised contrastive learning for face retrieval under large intra-class appearance variations. Without explicitly using age annotations, the proposed loss effectively enhances identity discriminability under age progression, pose, and illumination changes.
- Unlike most existing cross-age studies that focus on verification (1:1), we treat the problem as large-scale cross-age face retrieval (1:N) and utilize a unified evaluation protocol with gallery/query splits, mAP and Rank-k metrics, which better reflects practical search scenarios and provides reproducible benchmarks for future research.
- Extensive experiments are conducted on five public cross-age datasets (CACD [28], MORPH Album 2 [29], FG-NET [30], AgeDB [31], IMDB-clean [32]) with several representative backbones under the HMLF. Our method demonstrates consistent improvements across multiple datasets and provides insights into how different backbone–loss combinations affect cross-age retrieval precision.
2. Related Work
2.1. The Evolution of Backbone Architectures for Age-Invariant Face Recognition (AIFR)
2.2. Contrastive Learning
3. Approach
3.1. Overview Framework
3.2. TAL
3.2.1. ArcFace Loss
3.2.2. Triplet Loss
3.2.3. Mixed Online Sampling Strategy Centering Semi-Hard and Hard Samples
| Algorithm 1: Adaptive Triplet–ArcFace hybrid loss for face recognition |
| Input: Mini-batch with samples, network , margins , fixed weight , learnable weight Output: Total loss Extract and normalize embeddings: ; Compute cosine distances: Initialize , ; ![]() Compute ArcFace Loss with angular margin : Update via gradient descent on return |
3.3. IAL
3.3.1. InfoNCE Loss
3.3.2. Memory-Augmented InfoNCE Loss
3.3.3. Adaptive Parameters in TAL and IAL
4. Experiments
4.1. Implementation Details
4.1.1. Network Architecture
4.1.2. Data Preprocessing
4.1.3. Training Details
4.1.4. Evaluation Protocol
4.2. Experiments on AIFR Datasets
4.2.1. Result on CACD
4.2.2. Result on MORPH Album 2
4.2.3. Result on FG-NET Dataset
4.2.4. Result on AgeDB
4.2.5. Result on IMDB-Clean
4.2.6. Comparison of Different Backbones
4.3. Ablation Study
4.4. Visualization Analysis
4.4.1. t-SNE Feature Distribution Visualization Analysis
4.4.2. Retrieval Visualization
4.4.3. Visualization of Intra-Class and Inter-Class Distance Distributions
4.4.4. Attention Map Comparison
4.5. Hyper-Parameter Analysis
4.6. Model Comparison and Analysis
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Jalal, A.S.; Sharma, D.K.; Sikander, B. Suspect face retrieval system using multicriteria decision process and deep learning. Multimed. Tools Appl. 2023, 82, 38189–38216. [Google Scholar] [CrossRef]
- Zhang, Z.; Yin, S.; Cao, L. Age-invariant face recognition based on identity-age shared features. Vis. Comput. 2024, 40, 5465–5474. [Google Scholar] [CrossRef]
- Wang, M.; Deng, W. Deep face recognition: A survey. Neurocomputing 2021, 429, 215–244. [Google Scholar] [CrossRef]
- Ouyang, Y.; Shao, Y.; Shi, B. Research on Open Set Recognition Method for Aerial Infrared Targets. In Proceedings of the 2024 2nd International Conference on Artificial Intelligence and Automation Control (AIAC); IEEE: New York, NY, USA, 2024; pp. 517–521. [Google Scholar] [CrossRef]
- Chandaliya, P.K.; Nain, N. ChildGAN: Face aging and rejuvenation to find missing children. Pattern Recognit. 2022, 129, 108761. [Google Scholar] [CrossRef]
- Shen, Y.; Yang, C.; Tang, X.; Zhou, B. InterFaceGAN: Interpreting the Disentangled Face Representation Learned by GANs. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 2004–2018. [Google Scholar] [CrossRef] [PubMed]
- Ritharson P, I.; Vidhya, K.; G, M.; D, B.; Sathish Kumar, K. GAN-Based Facial Feature Reconstruction for Improved Masked Face Recognition During Covid. In Proceedings of the 2023 International Conference on Circuit Power and Computing Technologies (ICCPCT); IEEE: New York, NY, USA, 2023; pp. 1–5. [Google Scholar] [CrossRef]
- Park, U.; Tong, Y.; Jain, A.K. Age-Invariant Face Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 2010, 32, 947–954. [Google Scholar] [CrossRef]
- Liao, J.; Sanchez, V.; Guha, T. Self-Supervised Frontalization and Rotation Gan with Random Swap for Pose-Invariant Face Recognition. In Proceedings of the 2022 IEEE International Conference on Image Processing (ICIP); IEEE: New York, NY, USA, 2022; pp. 911–915. [Google Scholar] [CrossRef]
- Goodfellow, I.J.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial nets. Adv. Neural Inf. Process. Syst. 2014, 27, 2672–2680. [Google Scholar] [CrossRef]
- Wang, H.; Gong, D.; Li, Z.; Liu, W. Decorrelated Adversarial Learning for Age-Invariant Face Recognition. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2019; pp. 3522–3531. [Google Scholar] [CrossRef]
- Wang, Y.; Gong, D.; Zhou, Z.; Ji, X.; Wang, H.; Li, Z.; Liu, W.; Zhang, T. Orthogonal Deep Features Decomposition for Age-Invariant Face Recognition. In Proceedings of the Computer Vision–ECCV 2018; Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y., Eds.; Springer: Cham, Switzerland, 2018; pp. 764–779. [Google Scholar] [CrossRef]
- Han, Y.; Zhou, Y.; Li, M.; Zhang, X. Cross-Age Face Recognition Based on Feature Disentanglement and Age Regression. In Proceedings of the 2025 7th International Academic Exchange Conference on Science and Technology Innovation (IAECST); IEEE: New York, NY, USA, 2025; pp. 229–232. [Google Scholar] [CrossRef]
- Dong, Z.; Zhu, H.; Li, Y. An Innovative Approach to Cross-Age Face Recognition: Combining Deformable Convolutional Networks with VGG-16 Network. In Proceedings of the 2024 5th International Conference on Computers and Artificial Intelligence Technology (CAIT); IEEE: New York, NY, USA, 2024; pp. 100–104. [Google Scholar] [CrossRef]
- Ermao, L.; Min, Z. Review of Cross-Age Face Recognition in Discriminative Models. In Proceedings of the 2023 8th International Conference on Image, Vision and Computing (ICIVC); IEEE: New York, NY, USA, 2023; pp. 124–130. [Google Scholar] [CrossRef]
- Huang, Z.; Zhang, J.; Shan, H. When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework and a New Benchmark. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 7917–7932. [Google Scholar] [CrossRef]
- Yan, C.; Meng, L.; Li, L.; Zhang, J.; Zhan, W.; Yin, J.; Zhang, J.; Sun, Y.; Zheng, B. Age-Invariant Face Recognition by Multi-Feature Fusion and Decomposition with Self-attention. ACM Trans. Multimed. Comput. Commun. Appl. 2022, 18, 1–18. [Google Scholar] [CrossRef]
- Elboushaki, A.; Hannane, R.; Afdel, K. Similarity-based face image retrieval using sparsely embedded deep features and binary code learning. Int. J. Multimed. Inf. Retr. 2024, 13, 28. [Google Scholar] [CrossRef]
- Jang, Y.K.; Cho, N.I. Similarity Guided Deep Face Image Retrieval. arXiv 2021, arXiv:2107.05025. [Google Scholar]
- Yang, Y.; Tian, X.; Ng, W.W.Y.; Wang, R.; Gao, Y.; Kwong, S. Generative face inpainting hashing for occluded face retrieval. Int. J. Mach. Learn. Cybern. 2023, 14, 1725–1738. [Google Scholar] [CrossRef]
- Prathiba, T.; Selva Kumari, R.; Murugesan, C.S. ALMEGA-VIR: Face video retrieval system. Imaging Sci. J. 2023, 72, 766–776. [Google Scholar] [CrossRef]
- Deng, J.; Guo, J.; Xue, N.; Zafeiriou, S. ArcFace: Additive Angular Margin Loss for Deep Face Recognition. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2019; pp. 4685–4694. [Google Scholar] [CrossRef]
- Hermans, A.; Beyer, L.; Leibe, B. In Defense of the Triplet Loss for Person Re-Identification. arXiv 2017, arXiv:1703.07737. [Google Scholar] [CrossRef]
- Fang, L. Lightweight face recognition neural network based on MobileNetV4. In Proceedings of the Sixth International Conference on Signal Processing and Computer Science (SPCS 2025); Mathiopoulos, P.T., Feng, Y., Eds.; International Society for Optics and Photonics, SPIE: Bellingham, WA, USA, 2025; Volume 13978, p. 1397817. [Google Scholar] [CrossRef]
- Schroff, F.; Kalenichenko, D.; Philbin, J. FaceNet: A unified embedding for face recognition and clustering. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2015; pp. 815–823. [Google Scholar] [CrossRef]
- Chen, S.; Liu, Y.; Gao, X.; Han, Z. MobileFaceNets: Efficient CNNs for Accurate Real-Time Face Verification on Mobile Devices. In Biometric Recognition; Zhou, J., Wang, Y., Sun, Z., Jia, Z., Feng, J., Shan, S., Ubul, K., Guo, Z., Eds.; Springer: Cham, Switzerland, 2018; pp. 428–438. [Google Scholar] [CrossRef]
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2021; pp. 9992–10002. [Google Scholar] [CrossRef]
- Chen, B.C.; Chen, C.S.; Hsu, W.H. Face Recognition and Retrieval Using Cross-Age Reference Coding With Cross-Age Celebrity Dataset. IEEE Trans. Multimed. 2015, 17, 804–815. [Google Scholar] [CrossRef]
- Ricanek, K.; Tesafaye, T. MORPH: A longitudinal image database of normal adult age-progression. In Proceedings of the 7th International Conference on Automatic Face and Gesture Recognition (FGR06); IEEE: New York, NY, USA, 2006; pp. 341–345. [Google Scholar] [CrossRef]
- Lanitis, A.; Taylor, C.; Cootes, T. Toward automatic simulation of aging effects on face images. IEEE Trans. Pattern Anal. Mach. Intell. 2002, 24, 442–455. [Google Scholar] [CrossRef]
- Moschoglou, S.; Papaioannou, A.; Sagonas, C.; Deng, J.; Kotsia, I.; Zafeiriou, S. AgeDB: The First Manually Collected, In-the-Wild Age Database. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW); IEEE: New York, NY, USA, 2017; pp. 1997–2005. [Google Scholar] [CrossRef]
- Lin, Y.; Shen, J.; Wang, Y.; Pantic, M. FP-Age: Leveraging Face Parsing Attention for Facial Age Estimation in the Wild. IEEE Trans. Image Process. 2025, 34, 4767–4777. [Google Scholar] [CrossRef]
- Lecun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef]
- Wen, Y.; Li, Z.; Qiao, Y. Latent Factor Guided Convolutional Neural Networks for Age-Invariant Face Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2016; pp. 4893–4901. [Google Scholar] [CrossRef]
- Agarwal, A.; Susan, S. Attention-augmented squeeze-and-excitation enhanced mobile network for occluded facial expression recognition in resource-constrained environments. Signal Image Video Process. 2025, 19, 687. [Google Scholar] [CrossRef]
- Chandhana, G.; Hemalatha, P.; Reddy, P.; Sesadri, U.; Madhavi, K. CNN-Powered Facial Analysis for Gender, Age and Facial shape Classification. In Proceedings of the 2025 2nd Asia Pacific Conference on Innovation in Technology (APCIT); IEEE: New York, NY, USA, 2025; pp. 1–6. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention is all you need. In Proceedings of the NIPS’17, Long Beach, CA, USA, 4–9 December 2017; pp. 6000–6010. [Google Scholar] [CrossRef]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. arXiv 2021, arXiv:2010.11929. [Google Scholar] [CrossRef]
- Touvron, H.; Cord, M.; Douze, M.; Massa, F.; Sablayrolles, A.; J’egou, H. Training data-efficient image transformers & distillation through attention. arXiv 2020, arXiv:2012.12877. [Google Scholar] [CrossRef]
- Yuan, L.; Chen, Y.; Wang, T.; Yu, W.; Shi, Y.; Tay, F.E.H.; Feng, J.; Yan, S. Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2021; pp. 538–547. [Google Scholar] [CrossRef]
- Wu, H.; Xiao, B.; Codella, N.; Liu, M.; Dai, X.; Yuan, L.; Zhang, L. CvT: Introducing Convolutions to Vision Transformers. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2021; pp. 22–31. [Google Scholar] [CrossRef]
- Darcet, T.; Oquab, M.; Mairal, J.; Bojanowski, P. Vision Transformers Need Registers. arXiv 2024, arXiv:2309.16588. [Google Scholar] [CrossRef]
- He, K.; Fan, H.; Wu, Y.; Xie, S.; Girshick, R. Momentum Contrast for Unsupervised Visual Representation Learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2020. [Google Scholar]
- Chen, T.; Kornblith, S.; Norouzi, M.; Hinton, G. A simple framework for contrastive learning of visual representations. arXiv 2020, arXiv:2106.09215v5. [Google Scholar] [CrossRef]
- Khosla, P.; Teterwak, P.; Wang, C.; Sarna, A.; Tian, Y.; Isola, P.; Maschinot, A.; Liu, C.; Krishnan, D. Supervised Contrastive Learning. arXiv 2021, arXiv.2004.11362. [Google Scholar] [CrossRef]
- Li, H.; Zhu, J.; Wen, G.; Zhong, H. Structural self-contrast learning based on adaptive weighted negative samples for facial expression recognition. Vis. Comput. 2025, 41, 579–590. [Google Scholar] [CrossRef]
- Liu, B.; Wang, B.; Li, T. Bayesian Self-Supervised Contrastive Learning. arXiv 2024, arXiv:2301.11673. [Google Scholar] [CrossRef]
- Dong, H.; Long, X.; Li, Y. Synthetic Hard Negative Samples for Contrastive Learning. Neural Process. Lett. 2024, 56, 33. [Google Scholar] [CrossRef]
- Guo, Y.; Zhang, L.; Hu, Y.; He, X.; Gao, J. MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition. In Proceedings of the Computer Vision–ECCV 2016; Leibe, B., Matas, J., Sebe, N., Welling, M., Eds.; Springer: Cham, Switzerland, 2016; pp. 87–102. [Google Scholar] [CrossRef]
- Kemelmacher-Shlizerman, I.; Seitz, S.M.; Miller, D.; Brossard, E. The MegaFace Benchmark: 1 Million Faces for Recognition at Scale. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2016; pp. 4873–4882. [Google Scholar] [CrossRef]
- Cao, Q.; Shen, L.; Xie, W.; Parkhi, O.; Zisserman, A. VGGFace2: A Dataset for Recognising Faces across Pose and Age. In Proceedings of the 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), Xi’an, China, 15–19 May 2018; pp. 67–74. [Google Scholar] [CrossRef]
- Li, J.; Zhou, L.; Chen, J. Age-invariant face network (AFN): A discriminative model towards age-invariant face recognition. Neural Comput. Appl. 2024, 36, 13689–13702. [Google Scholar] [CrossRef]
- Zhu, J.; Cai, Z.; Lin, S. Multi-scale Feature Fusion and Multi-task Learning for Cross-age Face Recognition. In Proceedings of the 2024 5th International Conference on Computers and Artificial Intelligence Technology (CAIT); IEEE: New York, NY, USA, 2024; pp. 76–81. [Google Scholar] [CrossRef]
- Yuan, H.; He, Y.; Du, P.; Song, L. Multi-Task Learning Using Uncertainty to Weigh Losses for Heterogeneous Face Attribute Estimation. IEEE Trans. Affect. Comput. 2024, 14, 2033–2047. [Google Scholar] [CrossRef]
- Ali, H.; Ijaz, A. Machine and Deep Learning Based CCTV Surveillance Using FaceNet, MTCNN, and Haar Cascade for Enhanced Security. In Proceedings of the 2025 6th International Conference on Computer Vision, Image and Deep Learning (CVIDL); IEEE: New York, NY, USA, 2025; pp. 672–677. [Google Scholar] [CrossRef]
- Yi, D.; Lei, Z.; Liao, S.; Li, S.Z. Learning Face Representation from Scratch. arXiv 2014, arXiv.1411.7923. [Google Scholar] [CrossRef]
- Hou, X.; Li, Y.; Wang, S. Disentangled Representation for Age-Invariant Face Recognition: A Mutual Information Minimization Perspective. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV); IEEE: New York, NY, USA, 2021; pp. 3672–3681. [Google Scholar] [CrossRef]
- Gong, D.; Li, Z.; Lin, D.; Liu, J.; Tang, X. Hidden Factor Analysis for Age Invariant Face Recognition. In Proceedings of the 2013 IEEE International Conference on Computer Vision; IEEE: New York, NY, USA, 2013; pp. 2872–2879. [Google Scholar] [CrossRef]
- Lin, L.; Wang, G.; Zuo, W.; Feng, X.; Zhang, L. Cross-Domain Visual Matching via Generalized Similarity Measure and Feature Learning. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1089–1102. [Google Scholar] [CrossRef]
- Xu, C.; Liu, Q.; Ye, M. Age invariant face recognition and retrieval by coupled auto-encoder networks. Neurocomputing 2017, 222, 62–71. [Google Scholar] [CrossRef]
- Zheng, T.; Deng, W.; Hu, J. Age Estimation Guided Convolutional Neural Network for Age-Invariant Face Recognition. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW); IEEE: New York, NY, USA, 2017; pp. 503–511. [Google Scholar] [CrossRef]
- Yu, J.; Jing, L. A Joint Multi-Task CNN for Cross-Age Face Recognition. In Proceedings of the 2018 25th IEEE International Conference on Image Processing (ICIP); IEEE: New York, NY, USA, 2018; pp. 2411–2415. [Google Scholar] [CrossRef]
- Gong, D.; Li, Z.; Tao, D.; Liu, J.; Li, X. A maximum entropy feature descriptor for age invariant face recognition. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); IEEE: New York, NY, USA, 2015; pp. 5289–5297. [Google Scholar] [CrossRef]
- Zhao, J.; Cheng, Y.; Cheng, Y.; Yang, Y.; Zhao, F.; Li, J.; Liu, H.; Yan, S.; Feng, J. Look across elapse: Disentangled representation learning and photorealistic cross-age face synthesis for age-invariant face recognition. In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence and Thirty-First Innovative Applications of Artificial Intelligence Conference and Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, AAAI’19/IAAI’19/EAAI’19, 2019; AAAI Press: Washington, DC, USA, 2019. [Google Scholar] [CrossRef]
- Wang, H.; Sanchez, V.; Li, C.T. Cross-Age Contrastive Learning for Age-Invariant Face Recognition. In Proceedings of the ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); IEEE: New York, NY, USA, 2023; pp. 4600–4604. [Google Scholar] [CrossRef]
- Li, Z.; Park, U.; Jain, A.K. A Discriminative Model for Age Invariant Face Recognition. IEEE Trans. Inf. Forensics Secur. 2011, 6, 1028–1037. [Google Scholar] [CrossRef]
- Truong, T.D.; Duong, C.N.; Quach, K.G.; Le, N.; Bui, T.D.; Luu, K. LIAAD: Lightweight attentive angular distillation for large-scale age-invariant face recognition. Neurocomputing 2023, 543, 126198. [Google Scholar] [CrossRef]
- Kuprashevich, M.; Tolstykh, I. MiVOLO: Multi-input Transformer for Age and Gender Estimation. In Proceedings of the Analysis of Images, Social Networks and Texts: 11th International Conference, AIST 2023, Yerevan, Armenia, 28–30 September 2023; Revised Selected Papers; Springer: Berlin/Heidelberg, Germany, 2023; pp. 212–226. [Google Scholar] [CrossRef]
- Balamurugan, R.; Sekar, K.T.A.; Nandalal, V.; Wu, C.P. Face-Based Kinship Verification Using Deep Embeddings for Low-Cost Health Record Linkage. In Proceedings of the 2025 IEEE First International Conference on Innovations in Engineering and Next-Generation Technologies for Sustainability (ICINVENTS); IEEE: New York, NY, USA, 2025; Volume 1, pp. 1–6. [Google Scholar] [CrossRef]











| Method | 2004–2006 | 2007–2009 | 2010–2012 |
|---|---|---|---|
| HFA [58] | 50.58 | 53.01 | 56.12 |
| CARC [28] | 52.72 | 55.48 | 61.38 |
| GSM-1 [59] | 53.79 | 57.83 | 63.92 |
| GSM-2 [59] | 55.45 | 58.74 | 64.58 |
| CAN [60] | 62.33 | 67.69 | 73.24 |
| AE-CNN [61] | 70.01 | 72.87 | 78.25 |
| JM-CNN [62] | 82.53 | 85.26 | 88.28 |
| MT-MIM [57] | 92.63 | 93.95 | 96.09 |
| MFD [17] | 92.41 | 95.44 | 97.51 |
| IResnet + TAL | 96.43 | 97.38 | 98.23 |
| IResnet + IAL | 96.23 | 97.44 | 98.34 |
| Method | Setting-1/Setting-2 |
|---|---|
| HFA [58] | 91.14/- |
| CARC [28] | 92.80/- |
| MEFA [63] | 93.80/- |
| MEFA + SIFT + MLBP [63] | 94.59/- |
| LF-CNN [34] | 97.51/- |
| GSM [59] | -/94.40 |
| AE-CNN [61] | -/98.13 |
| OE-CNN [12] | 98.55/98.67 |
| DAL [11] | 98.93/98.97 |
| AIM [64] | 99.13/98.81 |
| AIM + CAFR [64] | 99.65/99.26 |
| MT-MIM [57] | -/99.43 |
| CACon [65] | 99.57/99.52 |
| IResnet + TAL | 99.73/99.78 |
| IResnet + IAL | 99.83/99.80 |
| Method | Rank-1 (%) |
|---|---|
| Park et al. [8] | 37.40 |
| Li et al. [66] | 47.50 |
| HFA [58] | 69.00 |
| MEFA [63] | 76.20 |
| LF-CNN [34] | 88.10 |
| CAN [60] | 86.50 |
| DAL [11] | 94.50 |
| AIM [64] | 93.20 |
| MT-MIM [57] | 94.21 |
| ISF [2] | 94.67 |
| VGG16-DCN [14] | 80.50 |
| CACon [65] | 94.61 |
| MFNR-LIAAD [67] | 95.11 |
| IResnet + TAL | 95.43 |
| IResnet + IAL | 95.30 |
| Method | mAP (%) | Rank-1 (%) |
|---|---|---|
| FaceNet | 56.12 | 72.30 |
| Swin-T | 86.60 | 92.62 |
| MobileFaceNet | 71.19 | 86.62 |
| IResnet + TAL | 89.43 | 93.21 |
| IResnet + IAL | 88.78 | 93.43 |
| Method | mAP (%) | Rank-1 (%) |
|---|---|---|
| FaceNet | 16.15 | 23.53 |
| Swin-T | 61.87 | 64.7 |
| MobileFaceNet | 21.25 | 29.41 |
| IResnet + TAL | 91.23 | 96.01 |
| IResnet + IAL | 91.33 | 95.87 |
| Method | CACD (2011) | CACD (2008) | CACD (2005) | MORPH | FG-NET | AgeDB | IMDB-Clean |
|---|---|---|---|---|---|---|---|
| IResnet + ArcFace | 99.88 | 99.56 | 99.54 | 99.75 | 93.44 | 91.40 | 94.12 |
| IResnet + TAL | 99.87 | 99.54 | 99.43 | 99.78 | 95.43 | 93.21 | 96.01 |
| IResnet + IAL | 99.77 | 99.56 | 99.34 | 99.80 | 95.30 | 93.43 | 95.87 |
| FaceNet + ArcFace | 99.23 | 99.05 | 99.01 | 87.23 | 79.57 | 72.90 | 23.53 |
| FaceNet + TAL | 99.30 | 99.12 | 99.23 | 88.43 | 84.23 | 80.34 | 27.37 |
| FaceNet + IAL | 99.43 | 99.22 | 99.03 | 89.23 | 85.01 | 82.23 | 27.89 |
| Swin-T + ArcFace | 99.08 | 99.08 | 99.15 | 99.86 | 93.81 | 92.62 | 64.70 |
| Swin-T + TAL | 99.06 | 99.06 | 99.14 | 99.76 | 95.45 | 92.78 | 67.19 |
| Swin-T + IAL | 99.00 | 99.08 | 99.15 | 99.87 | 96.01 | 92.97 | 66.56 |
| MobileFaceNet + ArcFace | 99.15 | 99.0 | 98.92 | 98.93 | 91.69 | 86.62 | 29.41 |
| MobileFaceNet + TAL | 99.03 | 98.88 | 98.78 | 98.89 | 93.23 | 89.14 | 35.43 |
| MobileFaceNet + IAL | 99.04 | 98.78 | 98.77 | 98.78 | 94.21 | 88.68 | 36.78 |
| Method | CACD (2005) | CACD (2008) | CACD (2011) | MORPH | FG-NET | AgeDB | IMDB-Clean |
|---|---|---|---|---|---|---|---|
| IResnet + ArcFace | 98.33 | 98.01 | 96.54 | 99.7 | 81.28 | 88.01 | 88.87 |
| IResnet + TAL | 98.23 | 97.38 | 96.43 | 99.75 | 87.23 | 89.43 | 91.23 |
| IResnet + IAL | 98.34 | 97.44 | 96.23 | 99.77 | 85.34 | 88.78 | 91.33 |
| FaceNet + ArcFace | 96.44 | 93.12 | 92.32 | 88.92 | 44.00 | 56.23 | 16.15 |
| FaceNet + TAL | 96.33 | 93.11 | 92.12 | 89.55 | 50.12 | 66.78 | 21.23 |
| FaceNet + IAL | 95.99 | 93.01 | 92.33 | 90.12 | 52.23 | 68.34 | 20.89 |
| Swin-T + ArcFace | 98.62 | 97.60 | 97.24 | 99.88 | 76.42 | 86.60 | 61.87 |
| Swin-T + TAL | 98.44 | 97.54 | 97.15 | 99.78 | 79.32 | 86.88 | 65.32 |
| Swin-T + IAL | 98.58 | 97.5 | 97.17 | 99.67 | 80.12 | 86.89 | 66.09 |
| MobileFaceNet + ArcFace | 97.87 | 96.26 | 95.66 | 99.07 | 65.62 | 71.19 | 21.25 |
| MobileFaceNet + TAL | 97.45 | 96.02 | 95.44 | 98.90 | 72.23 | 77.01 | 33.87 |
| MobileFaceNet + IAL | 97.54 | 96.11 | 95.43 | 99.05 | 74.10 | 76.19 | 34.09 |
| # | Settings | AgeDB | ||||
|---|---|---|---|---|---|---|
| Baseline | Mix Mining | Memory Bank | Adaptive Weight | mAP | Rank-1 | |
| 1 | ✓ | - | - | - | 56.23 | 72.90 |
| 2 | ✓ | ✓ | - | - | 66.72 | 80.27 |
| 3 | ✓ | ✓ | - | ✓ | 66.78 | 80.34 |
| 4 | ✓ | - | ✓ | - | 68.21 | 81.98 |
| 5 | ✓ | - | ✓ | ✓ | 68.34 | 82.23 |
| # | Settings | AgeDB | ||||
|---|---|---|---|---|---|---|
| Baseline | Mix Mining | Memory Bank | Adaptive Weight | mAP | Rank-1 | |
| 1 | ✓ | - | - | - | 71.19 | 86.62 |
| 2 | ✓ | ✓ | - | - | 75.14 | 88.27 |
| 3 | ✓ | ✓ | - | ✓ | 77.01 | 89.14 |
| 4 | ✓ | - | ✓ | - | 74.58 | 87.98 |
| 5 | ✓ | - | ✓ | ✓ | 76.19 | 88.68 |
| # | Settings | AgeDB | ||||
|---|---|---|---|---|---|---|
| Baseline | Mix Mining | Memory Bank | Adaptive Weight | mAP | Rank-1 | |
| 1 | ✓ | - | - | - | 88.01 | 91.40 |
| 2 | ✓ | ✓ | - | - | 89.01 | 92.88 |
| 3 | ✓ | ✓ | - | ✓ | 89.43 | 93.21 |
| 4 | ✓ | - | ✓ | - | 88.60 | 92.78 |
| 5 | ✓ | - | ✓ | ✓ | 88.78 | 93.43 |
| Backbone | Type | Params (M) | FLOPs (G) | Latency (ms) | Peak GPU Mem (MB) |
|---|---|---|---|---|---|
| IResnet-50 | CNN | 43.6 | 6.36 | 8.5 | 1250.3 |
| Swin-T | Transformer | 28.3 | 4.51 | 12.3 | 1580.7 |
| FaceNet | CNN | 23.0 | 3.78 | 7.2 | 980.2 |
| MobileFaceNet | Efficient CNN | 0.99 | 0.45 | 3.8 | 420.5 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Cao, J.; Zhang, T.; Wang, Z.; Lian, B. Age-Invariant Face Retrieval Based on Hybrid Metric Learning Framework (HMLF). Electronics 2026, 15, 1851. https://doi.org/10.3390/electronics15091851
Cao J, Zhang T, Wang Z, Lian B. Age-Invariant Face Retrieval Based on Hybrid Metric Learning Framework (HMLF). Electronics. 2026; 15(9):1851. https://doi.org/10.3390/electronics15091851
Chicago/Turabian StyleCao, Jingtian, Tingshuo Zhang, Ziyi Wang, and Bobo Lian. 2026. "Age-Invariant Face Retrieval Based on Hybrid Metric Learning Framework (HMLF)" Electronics 15, no. 9: 1851. https://doi.org/10.3390/electronics15091851
APA StyleCao, J., Zhang, T., Wang, Z., & Lian, B. (2026). Age-Invariant Face Retrieval Based on Hybrid Metric Learning Framework (HMLF). Electronics, 15(9), 1851. https://doi.org/10.3390/electronics15091851


