Evaluation of the Performance of an Artificial Intelligence-Based Classification Model for Pediatric Maxillofacial Morphology †
Abstract
1. Introduction
2. Materials and Methods
2.1. Data Collection
2.2. Training Data Preparation
2.3. AI Model Development
- (1)
- Random forest classifier (RF);
- (2)
- Logistic regression (LR);
- (3)
- Support vector classification (SVC).
2.4. Analysis of Input Features Importance
3. Results
3.1. Orthodontist Classification Results
3.2. Comparison with ML Model Classification Performance
3.3. Classification Performance for Horizontal Classification Alone
3.4. Classification Performance for Vertical Classification Alone
3.5. Classification Performance for Sassouni Classifications
3.6. Feature Importance
4. Discussion
4.1. AI Model Machine Learning Algorithm
4.2. RF Model Horizontal and Vertical Classification
4.3. Sassouni Classification
4.4. Input Feature Importance
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| RF | Random Forest |
| LR | Logistic Regression |
| SVC | Support Vector Classification |
References
- Davis, S.; Zhang., J.; Lee, I.; Rezaei, M.; Greiner, R.; McAlister, F.A.; Padwal, R. Effective hospital readmission prediction models using machine-learned features. BMC Health Serv. Res. 2022, 22, 1415. [Google Scholar] [CrossRef]
- Miotto, R.; Li, L.; Kidd, B.A.; Dudley, J.T. Deep patient: An unsupervised representation to predict the future of patients from the electronic health records. Sci. Rep. 2016, 6, 26094. [Google Scholar] [CrossRef]
- Choi, H.-I.; Jung, S.-K.; Baek, S.-H.; Lim, W.H.; Ahn, S.-J.; Yang, I.-H.; Kim, T.-W. Artificial intelligent model with neural network machine learning for the diagnosis of orthognathic surgery. J. Craniofac. Surg. 2019, 30, 1986–1989. [Google Scholar] [CrossRef] [PubMed]
- Kunz, F.; Stellzig-Eisenhauer, A.; Zeman, F.; Boldt, J. Artificial intelligence in orthodontics: Evaluation of a fully automated cephalometric analysis using a customized convolutional neural network. J. Orofac. Orthop. 2020, 81, 52–68. [Google Scholar] [CrossRef] [PubMed]
- Tanikawa, C.; Yamashiro, T. Development of novel artificial intelligence systems to predict facial morphology after orthognathic surgery and orthodontic treatment in Japanese patients. Sci. Rep. 2021, 11, 15853. [Google Scholar] [CrossRef] [PubMed]
- Li, Z.; Wang, L. Multi-task reinforcement learning and explainable AI-Driven platform for personalized planning and clinical decision support in orthodontic-orthognathic treatment. Sci. Rep. 2025, 15, 24502. [Google Scholar] [CrossRef]
- Olawade, D.B.; Leena, N.; Egbon, E.; Rai, J.; Mohammed, A.P.E.K.; Oladapo, B.I.; Boussios, S. AI-Driven Advancements in Orthodontics for Precision and Patient Outcomes. Dent. J. 2025, 13, 198. [Google Scholar] [CrossRef]
- Sassouni, V. A classification of skeletal facial types. Am. J. Orthod. 1969, 55, 109–123. [Google Scholar] [CrossRef]
- Hiroki, S.; Akane, U.; Tussie, C.; Kim, S.; Yukinori, K.; Emiko, K.; Shigemi, N.; Kazuro, S. Classification of Maxillofacial Morphology in Japanese Children Using Artificial Intelligence and Verification of Accuracy. In Proceedings of the 84th Annual Meeting of the Japanese Orthodontic Society, Sapporo, Japan, 29 September–1 October 2025. [Google Scholar]
- Downs, W.B. Variations in facial relationships: Their significance in treatment and prognosis. Am. J. Orthod. 1948, 34, 812–840. [Google Scholar] [CrossRef]
- Steiner, C.C. Cephalometrics in clinical practice. Angle. Orthod. 1959, 29, 8–29. [Google Scholar] [CrossRef]
- Montella, A.; Aria, M.; D’aMbrosio, A.; Mauriello, F. Analysis of powered two-wheeler crashes in Italy by classification trees and rules discovery. Accid. Anal. Prev. 2012, 49, 58–72. [Google Scholar] [CrossRef]
- Morgan, E.W.; Perdew, G.H.; Patterson, A.D. Multi-Omics Strategies for Investigating the Microbiome in Toxicology Research. Toxicol. Sci. 2022, 187, 189–213. [Google Scholar] [CrossRef] [PubMed]
- Svetnik, V.; Liaw, A.; Tong, C.; Culberson, J.C.; Sheridan, R.P.; Feuston, B.P. Random forest: A classification and regression tool for compound classification and QSAR modeling. J. Chem. Inf. Comput. Sci. 2003, 43, 1947–1958. [Google Scholar] [CrossRef] [PubMed]
- Schonlau, M.; Zou, R.Y. The random forest algorithm for statistical learning. Stata. J. 2020, 20, 3–29. [Google Scholar] [CrossRef]
- Tong, T.; Li, Z. Predicting learning achievement using ensemble learning with result explanation. PLoS ONE 2025, 20, e0312124. [Google Scholar] [CrossRef]
- Gnardellis, C.; Notara, V.; Papadakaki, M.; Gialamas, V.; Chliaoutakis, J. Overestimation of relative risk and prevalence ratio: Misuse of logistic modeling. Diagnostics. 2022, 12, 2851. [Google Scholar] [CrossRef]
- Levy, J.J.; O’Malley, A.J. Don’t dismiss logistic regression: The case for sensible extraction of interactions in the era of machine learning. BMC Med. Res. Methodol. 2020, 20, 171. [Google Scholar] [CrossRef]
- Boateng, E.; Abaye, D. A Review of the Logistic Regression Model with Emphasis on Medical Research. J. Data Anal. Inform. Process. 2019, 7, 190–207. [Google Scholar] [CrossRef]
- Sperandei, S. Understanding logistic regression analysis. Biochem. Med. 2014, 24, 12–18. [Google Scholar] [CrossRef] [PubMed]
- Moral, R.D.A.; Díaz-Orueta, U.; Oltra-Cucarella, J. Logistic versus linear regression-based reliable change index: A simulation study with implications for clinical studies with different sample sizes. Psychol. Assess. 2022, 34, 731–741. [Google Scholar] [CrossRef]
- Haubold., A.; Peterson., B.S.; Bansal., R. Progress in using brain morphometry as a clinical tool for diagnosing psychiatric disorders. J. Child Psychol. Psychiatry 2012, 53, 519–535. [Google Scholar] [CrossRef] [PubMed]
- Amari, S.; Wu, S. Improving support vector machine classifiers by modifying kernel functions. Neural Netw. 1999, 12, 783–789. [Google Scholar] [CrossRef] [PubMed]
- Abukhait, J.; Mansour, A.M.; Obeidat, M. Classification based on Gaussian-kernel Support Vector Machine with Adaptive Fuzzy Inference System. Przegl. Elektrotech. 2018, 1, 16–24. [Google Scholar] [CrossRef]
- Eraydin, F.; Cakan, D.G.; Tozlu, M.; Ozdemir, F. Evaluation of buccolingual molar inclinations among different vertical facial types. Korean J. Orthod. 2018, 48, 333–338. [Google Scholar] [CrossRef]
- Fontenele, R.C.; Gomes, A.F.; Moreira, N.R.; Costa, E.D.; Oliveira, M.L.; Freitas, D.Q. Do the location and dimensions of the mental foramen differ among individuals of different facial types and skeletal classes? A CBCT study. J. Prosthet Dent. 2023, 129, 741–747. [Google Scholar] [CrossRef]
- Miralles, R.; Hevia, R.; Contreras, L.; Carvajal, R.; Bull, R.; Manns, A. Patterns of electromyographic activity in subjects with different skeletal facial types. Angle. Orthod. 1991, 61, 277–284. [Google Scholar] [CrossRef]
- Ueda, A.; Tussie, C.; Kim, S.; Kuwajima, Y.; Matsumoto, S.; Kim, G.; Satoh, K.; Nagai, S. Classification of maxillofacial morphology by artificial intelligence using cephalometric analysis measurements. Diagnostics 2023, 13, 2134. [Google Scholar] [CrossRef]










| RF | LR | SVC | |
|---|---|---|---|
| Accuracy | 0.907 ± 0.051 | 0.837 ± 0.057 | 0.770 ± 0.055 |
| F1 score | 0.740 ± 0.160 | 0.623 ± 0.132 | 0.507 ± 0.102 |
| Sensitivity | 0.750 ± 0.147 | 0.631 ± 0.116 | 0.534 ± 0.102 |
| Positive predictive value (PPV) | 0.741 ± 0.170 | 0.641 ± 0.155 | 0.499 ± 0.113 |
| Horizontal Classification | Vertical Classification | |
|---|---|---|
| Accuracy | 0.963 ± 0.031 | 0.973 ± 0.025 |
| F1 score | 0.948 ± 0.044 | 0.937 ± 0.055 |
| Sensitivity | 0.937 ± 0.053 | 0.915 ± 0.075 |
| Positive predictive value (PPV) | 0.971 ± 0.030 | 0.983 ± 0.028 |
| Classification | Precision | Recall | F1 Score | Support |
|---|---|---|---|---|
| Class I | 0.95 | 0.98 | 0.97 | 158 |
| Class II | 0.98 | 0.98 | 0.98 | 102 |
| Class III | 0.97 | 0.85 | 0.91 | 40 |
| Accuracy | 0.96 | 300 | ||
| Macro avg. | 0.97 | 0.94 | 0.95 | 300 |
| Weighted avg. | 0.96 | 0.96 | 0.96 | 300 |
| Classification | Precision | Recall | F1 Score | Support |
|---|---|---|---|---|
| Short | 1.00 | 0.82 | 0.90 | 17 |
| Medium | 0.97 | 1.00 | 0.98 | 247 |
| Long | 0.97 | 0.89 | 0.93 | 36 |
| Accuracy | 0.97 | 300 | ||
| Macro avg. | 0.98 | 0.90 | 0.94 | 300 |
| Weighted avg. | 0.97 | 0.97 | 0.97 | 300 |
| Classification | Precision | Recall | F1 Score | Support |
|---|---|---|---|---|
| Class II Short | 1.00 | 0.75 | 0.86 | 4 |
| Class I Short | 1.00 | 0.56 | 0.71 | 9 |
| Class III Short | 0.00 | 0.00 | 0.00 | 4 |
| Class II Medium | 0.97 | 0.98 | 0.97 | 86 |
| Class I Medium | 0.88 | 0.98 | 0.93 | 130 |
| Class III Medium | 0.84 | 0.84 | 0.84 | 31 |
| Class II Long | 1.00 | 0.92 | 0.96 | 12 |
| Class I Long | 0.83 | 0.79 | 0.81 | 19 |
| Class III Long | 1.00 | 0.20 | 0.33 | 5 |
| Accuracy | 0.91 | 300 | ||
| Macro avg. | 0.84 | 0.67 | 0.71 | 300 |
| Weighted avg. | 0.90 | 0.91 | 0.89 | 300 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Sato, H.; Ueda, A.; Tussie, C.; Kim, S.; Kuwajima, Y.; Kikuchi, E.; Nagai, S.; Satoh, K. Evaluation of the Performance of an Artificial Intelligence-Based Classification Model for Pediatric Maxillofacial Morphology. Diagnostics 2025, 15, 2958. https://doi.org/10.3390/diagnostics15232958
Sato H, Ueda A, Tussie C, Kim S, Kuwajima Y, Kikuchi E, Nagai S, Satoh K. Evaluation of the Performance of an Artificial Intelligence-Based Classification Model for Pediatric Maxillofacial Morphology. Diagnostics. 2025; 15(23):2958. https://doi.org/10.3390/diagnostics15232958
Chicago/Turabian StyleSato, Hiroki, Akane Ueda, Camila Tussie, Sophie Kim, Yukinori Kuwajima, Emiko Kikuchi, Shigemi Nagai, and Kazuro Satoh. 2025. "Evaluation of the Performance of an Artificial Intelligence-Based Classification Model for Pediatric Maxillofacial Morphology" Diagnostics 15, no. 23: 2958. https://doi.org/10.3390/diagnostics15232958
APA StyleSato, H., Ueda, A., Tussie, C., Kim, S., Kuwajima, Y., Kikuchi, E., Nagai, S., & Satoh, K. (2025). Evaluation of the Performance of an Artificial Intelligence-Based Classification Model for Pediatric Maxillofacial Morphology. Diagnostics, 15(23), 2958. https://doi.org/10.3390/diagnostics15232958

