Facial Beauty Prediction Using a Generative Adversarial Network for Dataset Augmentation
Abstract
1. Introduction
- Through investigation of the class distribution imbalance issue in FBP datasets, a new data category distribution that is suitable for FBP is proposed.
- Based on the feature space of the SCUT-FBP5500 dataset, images in LSAFBD are reconstructed to alleviate the class imbalance issue in SCUT-FBP5500, thereby enhancing the classification accuracy of the MobileViT and ResNeXt networks on this dataset.
- An effective solution is provided for FBP in scenarios with scarce data, offering new insights regarding the application of generative adversarial networks (GANs) in computer vision tasks.
2. Related Works
2.1. Generative Adversarial Networks
2.2. Lightweight General Vision Transformer
2.3. Improved Residual Network
3. Methods
3.1. Overall Framework
3.2. Style Information Transfer
| Algorithm 1. Face Generation in Latent Space |
| Input: Input vector , generator , feature extraction network , real face image Output: Synthetic face image 1: while do 2: ▷ Generate synthetic face 3: ▷ Embedding of synthetic face in latent space 4: ▷ Embedding of real face in latent space 5: ▷ Difference between synthetic and real face 6: end while |
3.3. FBP-GAN Method
3.4. Ethical Considerations
4. Experiments and Analysis
4.1. Experimental Subjects
4.2. Experimental Environment
4.3. Evaluation Metrics and Augmentation Parameters
4.4. Training and Evaluation Based on the Transformer Model
4.5. Training and Evaluation Based on CNN Models
4.6. Migrate to Other Datasets
4.7. Statistical Significance Test
4.8. Comparison with Other Methods
4.9. Methodological Limitations
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Saeed, J.; Abdulazeez, A.M. Facial beauty prediction and analysis based on deep convolutional neural network: A review. J. Soft Comput. Data Min. 2021, 2, 1–12. [Google Scholar] [CrossRef]
- Chen, C.L.P.; Liu, Z.; Feng, S. Universal approximation capability of broad learning system and its structural variations. IEEE Trans. Neural Netw. Learn. Syst. 2018, 30, 1191–1204. [Google Scholar] [CrossRef]
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. Commun. ACM 2017, 60, 84–90. [Google Scholar] [CrossRef]
- Lin, L.; Liang, L.; Jin, L. Regression Guided by Relative Ranking Using Convolutional Neural Network (R3 CNN) for Facial Beauty Prediction. IEEE Trans. Affect. Comput. 2022, 13, 122–134. [Google Scholar] [CrossRef]
- Lin, L.; Liang, L.; Jin, L.; Chen, W. Attribute-Aware Convolutional Neural Networks for Facial Beauty Prediction. In Proceedings of the International Joint Conference on Artificial Intelligence, Macao, China, 10–16 August 2019; pp. 847–853. [Google Scholar]
- Xu, L.; Xiang, J.; Yuan, X. Transferring rich deep features for facial beauty prediction. arXiv 2018, arXiv:1803.07253. [Google Scholar] [CrossRef]
- Zhai, Y.; Cao, H.; Deng, W.; Gan, J.; Piuri, V.; Zeng, J. BeautyNet: Joint multiscale CNN and transfer learning method for unconstrained facial beauty prediction. Comput. Intell. Neurosci. 2019, 2019, 1910624. [Google Scholar] [CrossRef] [PubMed]
- Huang, G.B.; Mattar, M.; Berg, T.; Learned-Miller, E. Labeled faces in the wild: A database for studying face recognition in unconstrained environments. In Proceedings of the Workshop on Faces in ‘Real-Life’ Images: Detection, Alignment, and Recognition, Marseille, France, 17 October 2008. [Google Scholar]
- Liu, Z.; Luo, P.; Wang, X.; Tang, X. Deep learning face attributes in the wild. In Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 13–16 December 2015; pp. 3730–3738. [Google Scholar]
- Guo, Y.; Zhang, L.; Hu, Y.; He, X.; Gao, J. Ms-celeb-1m: A dataset and benchmark for large-scale face recognition. In Proceedings of the European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands, 11–14 October 2016; Springer: Berlin/Heidelberg, Germany, 2016; pp. 87–102. [Google Scholar]
- Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial nets. In Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada, 8–13 December 2014; Volume 27. [Google Scholar]
- Mehta, S.; Rastegari, M. MobileViT: Light-weight, general-purpose, and mobile-friendly vision transformer. arXiv 2021, arXiv:2110.02178. [Google Scholar]
- Xie, S.; Girshick, R.; Dollár, P.; Tu, Z.; He, K. Aggregated residual transformations for deep neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 1492–1500. [Google Scholar]
- Liang, L.; Lin, L.; Jin, L.; Xie, D.; Li, M. SCUT-FBP5500: A diverse benchmark dataset for multi-paradigm facial beauty prediction. In Proceedings of the 24th International Conference on Pattern Recognition (ICPR), Beijing, China, 20–24 August 2018; IEEE: New York, NY, USA, 2018; pp. 1598–1603. [Google Scholar]
- Zhai, Y.; Huang, Y.; Xu, Y.; Gan, J.; Cao, H.; Deng, W.; Labati, R.D.; Piuri, V.; Scotti, F. Asian female facial beauty prediction using deep neural networks via transfer learning and multi-channel feature fusion. IEEE Access 2020, 8, 56892–56907. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30, 5998–6008. [Google Scholar]
- Radford, A.; Metz, L.; Chintala, S. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv 2015, arXiv:1511.06434. [Google Scholar]
- Arjovsky, M.; Chintala, S.; Bottou, L. Wasserstein generative adversarial networks. In Proceedings of the International Conference on Machine Learning, PMLR, Sydney, Australia, 6–11 August 2017; pp. 214–223. [Google Scholar]
- Gulrajani, I.; Ahmed, F.; Arjovsky, M.; Dumoulin, V.; Courville, A.C. Improved training of wasserstein GANs. Adv. Neural Inf. Process. Syst. 2017, 30, 5767–5777. [Google Scholar]
- Karras, T.; Aila, T.; Laine, S.; Lehtinen, J. Progressive growing of GANs for improved quality, stability, and variation. arXiv 2017, arXiv:1710.10196. [Google Scholar]
- Karras, T.; Laine, S.; Aila, T. A style-based generator architecture for generative adversarial networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 16–20 June 2019; pp. 4401–4410. [Google Scholar]
- Karras, T.; Aittala, M.; Hellsten, J.; Laine, S.; Lehtinen, J.; Aila, T. Training generative adversarial networks with limited data. Adv. Neural Inf. Process. Syst. 2020, 33, 12104–12122. [Google Scholar]
- Karras, T.; Laine, S.; Aittala, M.; Hellsten, J.; Lehtinen, J.; Aila, T. Analyzing and improving the image quality of StyleGAN. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 14–19 June 2020; pp. 8110–8119. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Dong, C.; Loy, C.C.; He, K.; Tang, X. Image super-resolution using deep convolutional networks. IEEE Trans. Pattern Anal. Mach. Intell. 2015, 38, 295–307. [Google Scholar] [CrossRef]
- Mirza, M.; Osindero, S. Conditional generative adversarial nets. arXiv 2014, arXiv:1411.1784. [Google Scholar] [CrossRef]
- Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2014, arXiv:1409.1556. [Google Scholar]
- Zhu, J.Y.; Park, T.; Isola, P.; Efros, A.A. Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2223–2232. [Google Scholar]
- Wen, F.F.; Zuo, B. The influence of masculinity and femininity on face preference: Evidence from image processing technology and eye movement. Acta Psychol. Sin. 2012, 44, 14–29. [Google Scholar] [CrossRef]
- Dai, L.Q.; Jin, Z.; Sun, M.M. Facial geometric beauty score based on semi-supervised regression learning. Comput. Appl. Softw. 2015, 32, 209–211. [Google Scholar]
- Lebedeva, I.; Guo, Y.; Ying, F. MEBeauty: A multi-ethnic facial beauty dataset in-the-wild. Neural Comput. Appl. 2022, 34, 14169–14183. [Google Scholar] [CrossRef]
- Sumsion, A.; Torrie, S.; Lee, D.J.; Sun, Z. Surveying racial bias in facial recognition: Balancing datasets and algorithmic enhancements. Electronics 2024, 13, 2317. [Google Scholar] [CrossRef]
- Deng, J.; Dong, W.; Socher, R.; Li, L.; Li, K.; Li, F. ImageNet: A large-scale hierarchical image database. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009; IEEE: New York, NY, USA, 2009; pp. 248–255. [Google Scholar]
- Buda, M.; Maki, A.; Mazurowski, M.A. A systematic study of the class imbalance problem in convolutional neural networks. Neural Netw. 2018, 106, 249–259. [Google Scholar] [CrossRef] [PubMed]
- Pearson, J. The human imagination: The cognitive neuroscience of visual mental imagery. Nat. Rev. Neurosci. 2019, 20, 624–634. [Google Scholar] [CrossRef] [PubMed]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; et al. An Image is Worth 16×16 Words: Transformers for Image Recognition at Scale. In Proceedings of the 9th International Conference on Learning Representations (ICLR 2021), Virtual Event, 3–7 May 2021. [Google Scholar]
- Lee, P.; Li, J.; Rafiee, Y.; Jones, B.C.; Shiramizu, V.K.M. Further evidence that averageness and femininity, rather than symmetry and masculinity, predict facial attractiveness judgments. Sci. Rep. 2025, 15, 5498. [Google Scholar] [CrossRef] [PubMed]
- Gan, J.; Wu, B.; Zhai, Y.; He, G.; Mai, C.; Bai, Z. Self-correcting noise labels for facial beauty prediction. Chin. J. Image Graph. 2022, 27, 2487–2495. [Google Scholar]
- Gan, J.; Xie, X.; Zhai, Y.; He, G.; Mai, C.; Luo, H. Facial beauty prediction fusing transfer learning and broad learning system. Soft Comput. 2023, 27, 13391–13404. [Google Scholar] [CrossRef]
- Gan, J.; Xie, X.; He, G.; Luo, H. TransBLS: Transformer combined with broad learning system for facial beauty prediction. Appl. Intell. 2023, 53, 26110–26125. [Google Scholar] [CrossRef]
- Gan, J.; Luo, H.; Xiong, J.; Xie, X.; Li, H.; Liu, J. Facial beauty prediction combined with multi-task learning of adaptive sharing policy and attentional feature fusion. Electronics 2023, 13, 179. [Google Scholar] [CrossRef]
- Yeung, M.; Teramoto, T.; Wu, S.; Fujiwara, T.; Suzuki, K.; Kojima, T. VariFace: Fair and Diverse Synthetic Dataset Generation for Face Recognition. arXiv 2024, arXiv:2412.06235. [Google Scholar] [CrossRef]
- Xie, E.; Chen, J.; Chen, J.; Cai, H.; Tang, H.; Lin, Y.; Zhang, Z.; Li, M.; Zhu, L.; Lu, Y.; et al. SANA: Efficient high-resolution text-to-image synthesis with linear diffusion transformers. In Proceedings of the Thirteenth International Conference on Learning Representations, Singapore, 24–28 April 2025. [Google Scholar]













| Parameters | Recommended Values |
|---|---|
| mapping depth | 2 |
| lr | 0.0025 |
| batch size | 16 |
| kimg | 3500 |
| snapshot | 50 |
| r1 | 0.8192 |
| optimizer | Adam |
| Seed count | 1 |
| Method | ER | EP | Macro-F1 | ACC | Evaluate ACC |
|---|---|---|---|---|---|
| Baseline | 0 | 100% | 45.10 | 75.83 | 74.59 |
| FBP-GAN-MA | 10% | 118% | 45.15 | 75.92 | 74.86 |
| FBP-GAN-MA | 20% | 136% | 50.78 | 76.75 | 76.96 |
| Method | ER | EP | Macro-F1 | ACC | Evaluate ACC |
|---|---|---|---|---|---|
| Baseline | 0 | 100% | 45.10 | 75.83 | 74.59 |
| FBP-GAN-MB | 25% | 125% | 46.38 | 76.01 | 75.96 |
| FBP-GAN-MB | 50% | 150% | 50.42 | 76.84 | 77.32 |
| Method | ER | EP | Macro-F1 | ACC | Evaluate ACC |
|---|---|---|---|---|---|
| Baseline | 0 | 100% | 45.10 | 75.83 | 74.59 |
| FBP-GAN-MO | 3% | 103% | 49.07 | 76.38 | 77.69 |
| Method | ER | EP | Macro-F1 | ACC | Evaluate ACC |
|---|---|---|---|---|---|
| Baseline | 0 | 100% | 51.20 | 76.29 | 75.14 |
| FBP-GAN-RO | 3% | 103% | 51.56 | 77.94 | 77.05 |
| Method | ER | EP | Macro-F1 | ACC | Evaluate ACC |
|---|---|---|---|---|---|
| Baseline | 0 | 100% | 43.18 | 71.43 | 70.88 |
| FBP-GAN-RO | 1% | 101% | 57.34 | 72.99 | 72.38 |
| Method | Baseline | SCUT-FBP5500 (p-Value) |
|---|---|---|
| FBP-GAN-MO | MobileViT-S | p = 0.0025 |
| FBP-GAN-RO | ResNeXt-50 | p < 0.001 |
| Method | SCUT-FBP5500 | MEBeauty |
|---|---|---|
| ResNeXt-50 [13] | 76.29 | 71.43 |
| Noise Labels [38] | 75.30 | — |
| E-BLS [39] | 73.13 | — |
| ER-BLS [39] | 74.69 | — |
| TransBLS-T [40] | 75.23 | — |
| Adaptive multi-task method [41] | 75.41 | — |
| FBP-GAN-MO (ours) | 76.38 | — |
| FBP-GAN-RO (ours) | 77.94 | 72.99 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Gan, J.; Chen, Z.; Chen, H.; Xu, W.; Zhuang, Z.; Xiong, J. Facial Beauty Prediction Using a Generative Adversarial Network for Dataset Augmentation. Electronics 2026, 15, 615. https://doi.org/10.3390/electronics15030615
Gan J, Chen Z, Chen H, Xu W, Zhuang Z, Xiong J. Facial Beauty Prediction Using a Generative Adversarial Network for Dataset Augmentation. Electronics. 2026; 15(3):615. https://doi.org/10.3390/electronics15030615
Chicago/Turabian StyleGan, Junying, Zhen Chen, Hantian Chen, Wenchao Xu, Zhenxin Zhuang, and Junling Xiong. 2026. "Facial Beauty Prediction Using a Generative Adversarial Network for Dataset Augmentation" Electronics 15, no. 3: 615. https://doi.org/10.3390/electronics15030615
APA StyleGan, J., Chen, Z., Chen, H., Xu, W., Zhuang, Z., & Xiong, J. (2026). Facial Beauty Prediction Using a Generative Adversarial Network for Dataset Augmentation. Electronics, 15(3), 615. https://doi.org/10.3390/electronics15030615

