Next Article in Journal
Training Healthcare Assistants for School-Based Care of Children Receiving Paediatric Palliative Care: A Post-Training Evaluation
Previous Article in Journal
Association Between Point-of-Care Viral Testing for Influenza and Adenovirus and Antibiotic Management in a Pediatric Emergency Department in Italy
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Systematic Review

Artificial Intelligence in Pediatric Dentistry: A Systematic Review and Meta-Analysis

by
Nevra Karamüftüoğlu
*,
Büşra Yavuz Üçpunar
,
İrem Birben
,
Asya Eda Altundağ
,
Kübra Örnek Mullaoğlu
and
Cenkhan Bal
Department of Pediatric Dentistry, Gülhane Faculty of Dentistry, Health Sciences University, 06830 Ankara, Türkiye
*
Author to whom correspondence should be addressed.
Children 2026, 13(1), 152; https://doi.org/10.3390/children13010152
Submission received: 9 December 2025 / Revised: 27 December 2025 / Accepted: 2 January 2026 / Published: 21 January 2026
(This article belongs to the Section Pediatric Dentistry & Oral Medicine)

Abstract

Background/Objectives: Artificial intelligence (AI) has gained substantial prominence in pediatric dentistry, offering new opportunities to enhance diagnostic precision and clinical decision-making. AI-based systems are increasingly applied in caries detection, early childhood caries (ECC) risk prediction, tooth development assessment, mesiodens identification, and other key diagnostic tasks. This systematic review and meta-analysis aimed to synthesize evidence on the diagnostic performance of AI models developed specifically for pediatric dental applications. Methods: A systematic search was conducted in PubMed, Scopus, Web of Science, and Embase following PRISMA-DTA guidelines. Studies evaluating AI-based diagnostic or predictive models in pediatric populations (≤18 years) were included. Reference screening, data extraction, and quality assessment were performed independently by two reviewers. Pooled sensitivity, specificity, and area under the receiver operating characteristic curve (AUC) were calculated using random-effects models. Sources of heterogeneity related to imaging modality, annotation strategy, and dataset characteristics were examined. Results: Thirty-two studies met the inclusion criteria for qualitative synthesis, and fifteen were eligible for quantitative analysis. For radiographic caries detection, pooled sensitivity, specificity, and AUC were 0.91, 0.97, and 0.98, respectively. Prediction models demonstrated good diagnostic performance, with pooled sensitivity of 0.86, specificity of 0.82, and AUC of 0.89. Deep learning architectures, particularly convolutional neural networks, consistently outperformed traditional machine learning approaches. Considerable heterogeneity was identified across studies, primarily driven by differences in imaging protocols, dataset balance, and annotation procedures. Beyond quantitative accuracy estimates, this review critically evaluates whether current evidence supports meaningful clinical translation and identifies pediatric domains that remain underrepresented in AI-driven diagnostic innovation. Conclusions: AI technologies exhibit strong potential to improve diagnostic accuracy in pediatric dentistry. However, limited external validation, methodological variability, and the scarcity of prospective real-world studies restrict immediate clinical implementation. Future research should prioritize the development of multicenter pediatric datasets, harmonized annotation workflows, and transparent, explainable AI (XAI) models to support safe and effective clinical translation.

1. Introduction

Artificial intelligence (AI) has rapidly emerged as a transformative force in contemporary dentistry, introducing new opportunities for improving diagnostic imaging, disease prediction, and clinical decision support. Within this landscape, pediatric dentistry constitutes a particularly critical area for AI integration. Early childhood caries (ECC), mixed dentition complexity, developmental anomalies, and the behavioral characteristics of young patients collectively create diagnostic challenges that demand high accuracy and consistency. Traditional diagnostic techniques—such as visual inspection and radiographic interpretation—are inherently subject to examiner variability, and this subjectivity may delay early detection during the stages when preventive and minimally invasive strategies are most effective.
Advances in machine learning (ML) and deep learning (DL), especially convolutional neural networks (CNNs), object-detection architectures (e.g., YOLO, EfficientDet), and artificial neural networks (ANNs), have demonstrated notable improvements in interpreting pediatric dental radiographs and clinical images [1]. Several studies have reported excellent performance in identifying supernumerary teeth and mesiodens on pediatric panoramic radiographs [2,3,4,5], while automated tooth-numbering systems trained on child-based datasets have also shown strong reliability [6].
In cariology, AI applications have expanded rapidly across both ECC detection and ECC risk prediction. Models trained on intraoral photographs or salivary microbiome profiles produced high diagnostic accuracy for ECC detection and promising stratification of caries risk in young children [7,8,9]. AI has further demonstrated strong performance in detecting proximal caries on bitewing radiographs and in identifying early enamel demineralization on clinical photographs, indicating the technology’s potential to complement or even enhance traditional cariology workflows. Beyond caries, emerging studies have explored AI-based differentiation of molar–incisor hypomineralization (MIH) from caries-related enamel defects—an area where diagnostic confusion remains prevalent and consensus is still evolving [10,11].
Dental age estimation represents another important pediatric application. Neural network–based models using dental and skeletal features have reported significantly improved accuracy, often achieving mean absolute errors of 1–2 years and outperforming conventional age estimation approaches [12,13,14]. Despite these advancements, variability persists across studies in terms of dataset composition, imaging protocols, annotation procedures, and validation methods, underscoring the need for a systematic synthesis of the evidence.
Given the expanding role of AI in pediatric dental diagnostics, a comprehensive evaluation of its performance is essential for informing clinical translation. Therefore, the objective of this systematic review and meta-analysis is to synthesize current evidence on AI-based diagnostics and predictive models in pediatric dentistry, quantify diagnostic performance across imaging and clinical modalities, and identify methodological limitations and future research priorities. By doing so, this study provides an informed and balanced projection of how AI may shape pediatric dental practice in the coming years.

2. Materials and Methods

2.1. Protocol and Reporting

This systematic review and meta-analysis was conducted in accordance with the PRISMA 2020 guidelines and the PRISMA-DTA extension for diagnostic test accuracy reviews. The methodological approach was established a priori based on diagnostic test accuracy (DTA) principles; however, no protocol was prospectively registered. Although this review followed a predefined internal protocol (see Supplementary Material S1), it was not prospectively registered in PROSPERO or an equivalent registry due to early project initiation before formal registration was planned. This is acknowledged as a limitation, particularly given the susceptibility of AI-focused reviews to analytic flexibility. All steps—including search, screening, data extraction, and quality assessment—were performed following standardized systematic review methodology to ensure reproducibility and transparency.

2.2. Eligibility Criteria

Studies were screened according to predefined inclusion and exclusion criteria.
Inclusion criteria:
  • Studies involving pediatric populations aged 0–18 years.
  • Evaluation of an AI model (machine learning, deep learning, or hybrid approaches) applied to a diagnostic or predictive task in dentistry.
  • Reporting of at least one diagnostic performance metric: sensitivity, specificity, accuracy, area under the receiver operating characteristic curve (AUC), or mean absolute error (MAE).
Exclusion criteria:
  • Animal or in vitro experimental studies.
  • Technical algorithm development studies lacking clinical validation.
  • Studies without extractable diagnostic performance outcomes.
  • Reviews, commentaries, editorials, and conference abstracts without full data.

2.3. Search Strategy

A comprehensive search was conducted in PubMed, Scopus, Web of Science, Embase, and the Cochrane Library, covering publications from January 2015 to August 2025. The search strategy used a combination of controlled vocabulary (MeSH/Emtree) and free-text terms, including “artificial intelligence,” “machine learning,” “deep learning,” “pediatric dentistry,” “caries detection,” “early childhood caries,” “age estimation,” “tooth numbering,” “mesiodens,” and “radiographic diagnosis.”
Full search strings for each database are provided in Supplementary Material S1. Additionally, reference lists of all included articles and relevant reviews were screened to identify any missed publications.

2.4. Study Selection

The initial search yielded 520 records. After duplicate removal, 480 unique records remained. Two reviewers independently screened titles and abstracts, resulting in 60 full-text articles assessed for eligibility. Ultimately, 32 studies met the inclusion criteria for the qualitative synthesis, and 15 provided sufficient diagnostic data to be included in the quantitative meta-analysis. A detailed PRISMA flow diagram is provided in Figure 1.

2.5. Data Extraction

Two independent reviewers extracted data using a standardized extraction sheet. Extracted variables included the following:
  • Study characteristics (year, country, sample size, age group);
  • Diagnostic modality (panoramic radiograph, bitewing, intraoral photograph, clinical data, microbiome profile);
  • AI model architecture (CNN, ANN, YOLO-based detectors, hybrid models);
  • Diagnostic target (caries detection, ECC prediction, age estimation, mesiodens identification, tooth numbering, MIH classification);
  • Reference standard used;
  • Diagnostic performance metrics (sensitivity, specificity, accuracy, AUC, MAE).
Disagreements were resolved through consensus or consultation with a third reviewer. Regulatory status of commercial platforms referenced in this review was verified using publicly accessible registries (FDA, EUDAMED/EMA, MHRA) rather than manufacturer-provided sources to minimize promotional bias. When multiple models were reported within the same study, we prioritized the model with external validation or, if unavailable, the model with the highest clinical relevance to avoid double counting. Outcomes were categorized a priori into three methodological groups: (i) diagnostic test accuracy (DTA) outcomes (sensitivity, specificity, ROC-space metrics), (ii) prediction/prognostic modeling outcomes, and (iii) regression-based estimation outcomes (e.g., MAE for dental age estimation). Only DTA outcomes with comparable reference standards were pooled quantitatively; other outcomes were synthesized narratively.

2.6. Quality Assessment

Risk of bias was evaluated using the QUADAS-2 tool. In accordance with PRISMA-DTA guidance, the following domains were assessed:
  • Index test: The AI model evaluated for diagnostic or predictive performance.
  • Reference standard: The benchmark method (expert consensus, clinical examination, radiographic interpretation, histological confirmation).
  • Flow and timing: Whether all participants received both the index test and reference standard; whether exclusions occurred post-enrollment; and whether timing between tests posed risk of bias.
Each study was categorized for potential bias and applicability concerns across these domains.

2.7. Statistical Analysis

Studies with at least three comparable datasets for a specific diagnostic task were included in the meta-analysis. Pooled sensitivity, specificity, and AUC values were calculated using random-effects models to account for between-study heterogeneity. Forest plots were generated for individual and pooled effect sizes. Heterogeneity was assessed using the I2 statistic and Chi-square test. Publication bias was explored when appropriate using funnel plot asymmetry. Diagnostic accuracy outcomes were pooled using a random-effects bivariate model (Reitsma framework), which jointly models sensitivity and specificity while accounting for threshold variability and correlation between measures. HSROC modeling was explored but not implemented due to insufficient reporting of threshold effects across studies; this is acknowledged as a methodological limitation.
Meta-analyses were performed primarily for ECC detection and caries-related diagnostic tasks, where sufficient homogeneity of design and outcomes existed.

2.8. Data, Materials, and Code Availability

All data extracted from published articles are presented in the manuscript and Supplementary Files. No new datasets or proprietary code were generated for this review. Any restrictions on data availability from primary studies are noted in the extracted records.

2.9. Ethical Considerations

This study synthesizes data from previously published research and did not involve human participants, animals, or identifiable private information; therefore, ethical approval was not required.

2.10. Use of Generative Artificial Intelligence

Generative AI (ChatGPT 5.1) was used exclusively to assist in language refinement and structural editing of the manuscript text. GenAI was not used to generate data, perform analyses, interpret results, or create graphics. All scientific content, data extraction, and analyses were conducted manually by the authors.

3. Results

3.1. Study Selection

A total of 32 studies fulfilled the predefined eligibility criteria and were included in the qualitative synthesis, of which 15 contributed extractable diagnostic performance data to the quantitative meta-analysis. Most excluded studies were removed either due to insufficient pediatric-specific outcomes or the absence of reportable diagnostic accuracy metrics, reflecting variability in study designs and reporting standards across the literature. The final body of evidence predominantly focused on radiographically assisted cariology tasks, supernumerary tooth detection, and dental age estimation, whereas areas such as trauma diagnosis, behavioral assessment, and preventive orthodontics remained markedly underrepresented. This distribution highlights both the rapid expansion of AI-driven diagnostic applications in pediatric dentistry and the current imbalance in research emphasis across clinical domains. An overview of the screening and selection workflow is illustrated in Figure 1; however, detailed procedural steps are described in Section 2.4 and are therefore not repeated here.

3.2. Study Characteristics

A total of 32 studies were included in the qualitative synthesis. The included research covered four major diagnostic domains: radiographic caries detection, supernumerary/mesiodens identification, dental age estimation, and ECC prediction. The AI models used across these studies consisted of convolutional neural networks (CNNs; e.g., VGG16, ResNet, Inception), object-detection frameworks (e.g., YOLO, EfficientDet), ANNs, and a smaller number of transformer-based architectures.
Table 1 summarizes the diversity of data types, AI architectures, and target diagnostic tasks represented in pediatric AI research.
Detailed study characteristics and diagnostic performance metrics are presented in Table 2. The included studies varied substantially in imaging modality, model design, dataset size, and reference standards.
Most AI models demonstrated high diagnostic accuracy across tasks such as caries detection, ECC risk prediction, and mesiodens identification. However, considerable heterogeneity in dataset composition, image quality, and annotation protocols was observed, likely influencing the pooled estimates reported in the meta-analysis.

3.3. Pooled Diagnostic Performance

The pooled sensitivity and specificity values across all pediatric diagnostic applications are shown in Figure 2. Overall, AI models achieved high discriminative ability, with pooled sensitivity of 0.89 and specificity of 0.91. These findings indicate that most algorithms delivered consistent diagnostic reliability despite variation in imaging modality and dataset origin.
Table 3 presents pooled effect estimates across diagnostic subdomains. The highest performance values were observed in ECC detection (AUC = 0.98) and primary tooth numbering (AUC = 0.98). ECC-risk prediction models that used clinical or microbiome variables demonstrated slightly lower, yet clinically meaningful, performance (AUC = 0.89). Age-estimation models achieved mean absolute errors of approximately 1.7 years.
Figure 3 shows pooled AUC estimates for different pediatric dentistry applications, with ECC detection and tooth-numbering tasks exhibiting the highest values (AUC = 0.97–0.99). ECC-risk prediction demonstrated lower AUC values but remained within an acceptable diagnostic range (AUC ≈ 0.89).
A summary of pooled performance values across tasks is provided below:
  • Primary tooth numbering: sensitivity 90%, specificity 96%, AUC 98%.
  • Mesiodens detection: sensitivity 94%, specificity 94%.
  • ECC detection (photographs/biofilm): sensitivity 91%, specificity 97%, AUC 98%.
  • ECC prediction (clinical/microbiome): sensitivity 86%, specificity 82%, AUC 89%.
  • Age estimation: MAE ≈ 1.7 years.

3.4. Forest Plot Analysis for ECC Detection

A forest plot of ECC detection studies is shown in Figure 4. Sensitivity values ranged from 0.77 to 1.00, with a pooled sensitivity of approximately 0.89. Most studies demonstrated sensitivity values above 0.85, indicating strong diagnostic stability across imaging sources.
Heterogeneity analysis revealed substantial between-study variability (I2 > 60%), largely attributed to differences in dataset origin, imaging modality, and labeling protocol.

3.5. Commercial and Research-Grade AI Software

Several commercial and research-based AI systems relevant to pediatric dentistry were identified. These include Pearl, Overjet, VideaHealth, DentalMonitoring, uLab Systems, and Denti.AI, along with educational or regionally deployed platforms such as CranioCatch. Table 4 summarizes available systems with potential or validated pediatric applications, detailing their diagnostic capabilities, validation status, and regulatory approvals where applicable.

4. Discussion

This systematic review and meta-analysis consolidates the growing evidence on AI applications in pediatric dentistry. Across included studies, AI models consistently demonstrated high diagnostic accuracy in caries detection, ECC risk prediction, tooth numbering, mesiodens identification, and dental age estimation [16,33,34,35]. These findings highlight AI’s potential as a reliable diagnostic adjunct, particularly in clinical situations where examiner variability and challenges related to pediatric patient behavior may complicate traditional assessment. Although AI applications show considerable promise for diagnostic support, the concentration of research in cariology and anomaly detection creates a skewed knowledge base. Pediatric domains such as behavior-guided diagnostics, traumatic dental injuries, pain assessment, craniofacial monitoring, and preventive orthodontics remain insufficiently integrated with AI systems. These areas may benefit substantially from predictive and image-based modeling given their dependence on continuous monitoring and individualized growth patterns.

4.1. Caries Detection and ECC Prediction

CNN-based systems analyzing intraoral photographs and radiographs yielded high sensitivity and specificity for ECC detection, with AUC values frequently between 0.90 and 0.98 [18,36]. Pooled AUC values should be interpreted with caution given variability in validation strategies, threshold selection, and case-mix across studies, which may inflate performance estimates. Predictive models incorporating clinical and microbiome data provided additional value in identifying high-risk children [8], offering a promising avenue for precision-based risk stratification. Such tools could support earlier preventive interventions, reducing the likelihood of progression to cavitated lesions.

4.2. Developmental Anomalies and MIH

AI demonstrated excellent performance in detecting supernumerary teeth and mesiodens on pediatric panoramic radiographs [2,3,4,5]. Accurate early diagnosis is critical because delayed detection can compromise eruption patterns and complicate orthodontic planning.
In addition, recent studies have shown that AI models can assist in differentiating MIH from visually similar enamel lesions [10,11,37], potentially reducing diagnostic ambiguity, which remains a known challenge in clinical practice.

4.3. Tooth Numbering and Dental Age Estimation

Automated tooth-numbering systems achieved high accuracy across a range of imaging modalities [6,33,38,39]. These systems may facilitate efficient documentation, orthodontic assessment, and large-scale pediatric imaging research.
Similarly, neural network-based age estimation models achieved mean absolute errors below 2 years [12,13,14], outperforming conventional atlas-based methods and supporting their potential use in both clinical and forensic contexts.

4.4. Pediatric-Focused AI Software

Commercial and academic AI platforms such as Pearl [27], Diagnocat [28], DentalMonitoring [30], and CranioCatch [29] have begun incorporating pediatric-specific diagnostic modules. These tools may support early detection, telemonitoring, and improved caregiver communication. However, pediatric-specific validation remains limited, and most systems lack robust, peer-reviewed assessments of performance in children. Clinical translation remains hindered by the lack of definitive regulatory frameworks governing AI-based pediatric diagnosis, especially regarding liability distribution, informed consent for minors, and cross-border data transfer. Moreover, the successful adoption of AI requires structured professional training and robust infrastructural support—resources unevenly distributed across pediatric dental practice settings.

4.5. Limitations and Challenges

Despite promising findings, several limitations affect the overall strength of the evidence. Significant heterogeneity—arising from differences in imaging modality, dataset composition, and annotation procedures—restricted the comparability of studies and contributed to variability in pooled estimates. Heterogeneity appeared primarily driven by imaging modality, reference standard variability, annotation approach, and validation design (internal versus external), suggesting that methodological rather than statistical factors account for most variance. Most studies used retrospective, single-center datasets with small sample sizes [15,17,40,41], limiting generalizability. Additionally, reference standards varied widely, and external validation in multiethnic, multisite populations was scarce. Variability in reference standards (expert consensus vs. radiographic vs. clinical examination) represents a major source of bias, particularly in pediatric settings where behavioral constraints and imaging variability may limit diagnostic reliability. Reporting quality and transparency of annotation workflows were inconsistent, contributing to risk of bias in multiple domains. The predominance of retrospective single-center datasets may bias pooled estimates toward overperformance. Furthermore, heterogeneity in reference standards and annotation strategies may artificially inflate accuracy metrics, limiting comparability.

5. Conclusions

Artificial intelligence demonstrates strong diagnostic performance across several core domains of pediatric dentistry, including caries detection, ECC risk prediction, developmental anomaly identification, tooth numbering, and dental age estimation. By functioning as a reliable second reader, AI has the potential to reduce diagnostic errors, support individualized preventive care, and enhance clinical efficiency.
However, real-world translation remains limited. Current evidence is constrained by methodological weaknesses, lack of external validation, and variability in imaging protocols and reference standards. While AI is not yet ready for routine standalone use, it is a promising adjunct that can meaningfully support pediatric dental diagnostics when used under appropriate clinical supervision. Critically, the diagnostic advantages observed must be contextualized within current implementation gaps to ensure that technological progress aligns with pediatric oral health needs.

6. Future Directions

To ensure safe, effective, and clinically meaningful implementation of AI tools in pediatric dentistry, future research should prioritize the following:

6.1. Methodological and Clinical Priorities

  • Development of multicenter, demographically diverse datasets to improve generalizability.
  • Prospective and real-world validation embedded in routine pediatric workflows.
  • Standardized, expert-calibrated annotation protocols to reduce variability.
  • Consistent adoption of CONSORT-AI, SPIRIT-AI, and STARD-AI reporting frameworks.

6.2. Technological and Ethical Priorities

  • Wider integration of explainable AI (XAI) to provide transparent, lesion-level rationales for model outputs.
  • Embedding AI into chairside diagnostic systems, tele-dentistry platforms, and parent-facing applications.
  • Ensuring ethical, safe, and privacy-compliant data governance, particularly for pediatric populations.
  • Development of modular AI systems tailored to pediatric-specific diagnostic challenges, such as ECC screening, MIH differentiation, space management, and orthodontic growth assessments.

6.3. Long-Term Vision

Realizing AI’s full potential in pediatric dentistry will require harmonized clinical validation, transparent model development, and seamless integration into digital oral healthcare ecosystems. If these conditions are met, AI-driven tools have strong potential to support earlier detection, enhance preventive care, and ultimately contribute to improved oral health outcomes for children.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/children13010152/s1. S1: Full Search Strategies for All Databases.

Author Contributions

Conceptualization, N.K. and C.B.; methodology, N.K.; validation, N.K., B.Y.Ü., A.E.A. and C.B.; formal analysis, N.K.; investigation, N.K.; data curation, N.K. and C.B.; writing-original draft preparation, N.K.; writing-review & editing, N.K., B.Y.Ü., İ.B., A.E.A., K.Ö.M. and C.B.; visualization, N.K., software revision B.Y.Ü.; compliance with journal guidelines, İ.B.; review of statistical analyses and data interpretation, A.E.A.; final revision of the manuscript, linguisting editing, and academic proofreading K.Ö.M.; software, supervision, and project administration, C.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

This study is a systematic review and meta-analysis and does not involve the collection of new primary data. All data included in the analyses were obtained from previously published studies, which are fully cited in the reference list. The datasets generated during the meta-analysis (extracted numerical values and pooled estimates) are available from the corresponding author upon reasonable request.

Acknowledgments

No institutional, administrative, or technical support was received during the conduct of this study beyond the contributions listed in the author contribution section. The authors acknowledge the academic environment provided by their institution, which supported the completion of this work.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
AIArtificial intelligence
ECCEarly childhood caries
AUCArea under the receiver operating characteristic curve
MLMachine learning
DLDeep learning
CNNsConvolutional neural networks
ANNsArtificial neural networks
MIHMolar–incisor hypomineralization
DTADiagnostic test accuracy
MAEMean absolute error
XAIExplainable AI

References

  1. Reyes, L.T.; Knorst, J.K.; Ortiz, F.R.; Ardenghi, T.M. Machine learning in the diagnosis and prognostic prediction of dental caries: A systematic review. Caries Res. 2022, 56, 161–170. [Google Scholar] [CrossRef] [PubMed]
  2. Ha, E.G.; Jeon, K.J.; Kim, Y.H.; Kim, J.Y.; Han, S.S. Automatic detection of mesiodens on panoramic radiographs using artificial intelligence. Sci. Rep. 2021, 11, 23061. [Google Scholar] [CrossRef] [PubMed]
  3. Kim, J.; Hwang, J.J.; Jeong, T.; Cho, B.H.; Shin, J. Deep learning-based identification of mesiodens using automatic maxillary anterior region estimation in panoramic radiography of children. Dentomaxillofac. Radiol. 2022, 51, 20210528. [Google Scholar] [CrossRef] [PubMed]
  4. Kaya, H.; Gunec, H.G.; Aydin, K.C.; Urkmez, E.S.; Duranay, R.; Ates, H.F. A deep learning approach to permanent tooth germ detection on pediatric panoramic radiographs. Imaging Sci. Dent. 2022, 51, 275–283. [Google Scholar] [CrossRef]
  5. Mine, Y.; Iwamoto, Y.; Okazaki, S.; Nakamura, K.; Takeda, S.; Peng, T.; Mitsuhata, C.; Kakimoto, N.; Kozai, K.; Murayama, T. Detecting the presence of supernumerary teeth during the early mixed dentition stage using deep learning algorithms: A pilot study. Int. J. Paediatr. Dent. 2022, 32, 678–685. [Google Scholar] [CrossRef]
  6. Kılıç, M.C.; Bayrakdar, I.S.; Çelik, Ö.; Bilgir, E.; Orhan, K.; Aydın, O.B.; Kaplan, F.A.; Sağlam, H.; Odabaş, A.; Aslan, A.F.; et al. Artificial intelligence system for automatic deciduous tooth detection and numbering in panoramic radiographs. Dentomaxillofac. Radiol. 2021, 50, 20200172. [Google Scholar] [CrossRef]
  7. Zaorska, K.; Szczapa, T.; Borysewicz-Lewicka, M.; Nowicki, M.; Gerreth, K. Prediction of early childhood caries based on single nucleotide polymorphisms using neural networks. Genes 2021, 12, 462. [Google Scholar] [CrossRef]
  8. Karhade, D.S.; Roach, J.; Shrestha, P.; Simancas-Pallares, M.A.; Ginnis, J.; Burk, Z.J.S.; Ribeiro, A.A.; Cho, H.; Wu, D.; Divaris, K. An automated machine learning classifier for early childhood caries. Pediatr. Dent. 2021, 43, 191–197. [Google Scholar]
  9. Toledo Reyes, L.; Knorst, J.K.; Ortiz, F.R.; Brondani, B.; Emmanuelli, B.; Guedes, R.S.; Mendes, F.; Ardenghi, T. Early childhood predictors for dental caries: A machine learning approach. J. Dent. Res. 2023, 102, 999–1006. [Google Scholar] [CrossRef]
  10. Alevizakos, V.; Bekes, K.; Steffen, R.; von See, C. Artificial intelligence system for training diagnosis and differentiation with molar incisor hypomineralization (MIH) and similar pathologies. Clin. Oral Investig. 2022, 26, 6917–6923. [Google Scholar] [CrossRef]
  11. Schönewolf, J.; Meyer, O.; Engels, P.; Schlickenrieder, A.; Hickel, R.; Gruhn, V.; Hesenius, M.; Kühnisch, J. Artificial intelligence-based diagnostics of molar–incisor hypomineralization (MIH) on intraoral photographs. Clin. Oral Investig. 2022, 26, 5923–5930. [Google Scholar] [CrossRef]
  12. Bunyarit, S.S.; Jayaraman, J.; Naidu, M.K.; Yuen Ying, R.P.; Nambiar, P.; Asif, M.K. Dental age estimation of Malaysian Chinese children and adolescents: Chaillet and Demirjian’s method revisited using artificial multilayer perceptron neural network. Aust. J. Forensic Sci. 2020, 52, 681–698. [Google Scholar] [CrossRef]
  13. Zaborowicz, K.; Biedziak, B.; Olszewska, A.; Zaborowicz, M. Tooth and bone parameters in the assessment of chronological age of children and adolescents using neural modelling methods. Sensors 2021, 21, 6008. [Google Scholar] [CrossRef]
  14. Zaborowicz, M.; Zaborowicz, K.; Biedziak, B.; Garbowski, T. Deep learning neural modelling as a precise method in the assessment of the chronological age of children and adolescents using tooth and bone parameters. Sensors 2022, 22, 637. [Google Scholar] [CrossRef] [PubMed]
  15. Gajic, M.; Vojinovic, J.; Kalevski, K.; Pavlovic, M.; Kolak, V.; Vukovic, B.; Mladenovic, R.; Aleksic, E. Analysis of the impact of oral health on adolescent quality of life using standard statistical methods and artificial intelligence algorithms. Children 2021, 8, 1156. [Google Scholar] [CrossRef] [PubMed]
  16. Kurt, A.; Günaçar, D.N.; Şılbır, F.Y.; Yeşil, Z.; Bayrakdar, İ.Ş.; Çelik, Ö. Evaluation of tooth development stages with deep learning-based artificial intelligence algorithm. BMC Oral Health 2024, 24, 1034. [Google Scholar] [CrossRef] [PubMed]
  17. Kayaci, S.T.; Ilhan, H.O.; Serbes, G.; Arslan, H. End-to-end CNN-based detection of permanent first molars and prediction of root development stages from panoramic radiographs. Sci. Rep. 2025, 15, 38814. [Google Scholar] [CrossRef]
  18. Li, R.Z.; Zhu, J.X.; Wang, Y.Y.; Zhao, S.Y.; Peng, C.F.; Zhou, Q.; Sun, R.Q.; Hao, A.M.; Li, S.; Wang, Y.; et al. Development of a deep learning-based prototype artificial intelligence system for the detection of dental caries in children. Zhonghua Kouqiang Yixue Zazhi 2021, 56, 1253–1260. [Google Scholar] [CrossRef]
  19. Raksakmanut, R.; Thanyasrisung, P.; Sritangsirikul, S.; Kitsahawong, K.; Seminario, A.L.; Pitiphat, W.; Matangkasombut, O. Prediction of future caries in 1-year-old children via the salivary microbiome. J. Dent. Res. 2023, 102, 626–635. [Google Scholar] [CrossRef]
  20. Zhang, Y.; Huang, S.; Jia, S.; Sun, Z.; Li, S.; Li, F.; Zhang, L.; Lu, J.; Tan, K.; Teng, F.; et al. The predictive power of saliva electrolytes exceeds that of saliva microbiomes in diagnosing early childhood caries. J. Oral Microbiol. 2021, 13, 1921486. [Google Scholar] [CrossRef]
  21. Grier, A.; Myers, J.A.; O’Connor, T.G.; Quivey, R.G.; Gill, S.R.; Kopycka-Kedzierawski, D.T. Oral microbiota composition predicts early childhood caries onset. J. Dent. Res. 2021, 100, 599–607. [Google Scholar] [CrossRef] [PubMed]
  22. Qu, X.; Zhang, C.; Houser, S.H.; Zhang, J.; Zou, J.; Zhang, W.; Zhang, Q. Prediction model for early childhood caries risk based on behavioral determinants using a machine learning algorithm. Comput. Methods Programs Biomed. 2022, 227, 107221. [Google Scholar] [CrossRef] [PubMed]
  23. Park, Y.H.; Kim, S.H.; Choi, Y.Y. Prediction models of early childhood caries based on machine learning algorithms. Int. J. Environ. Res. Public Health 2021, 18, 8613. [Google Scholar] [CrossRef] [PubMed]
  24. Heimisdóttir, L.H.; Lin, B.M.; Cho, H.; Orlenko, A.; Ribeiro, A.A.; Simon-Soro, A.; Roach, J.; Shungin, D.; Ginnis, J.; Simancas-Pallares, M.A.; et al. Metabolomics insights in early childhood caries. J. Dent. Res. 2021, 100, 615–622. [Google Scholar] [CrossRef]
  25. Al-Jallad, N.; Ly-Mapes, O.; Hao, P.; Ruan, J.; Ramesh, A.; Luo, J.; Wu, T.T.; Dye, T.; Rashwan, N.; Ren, J.; et al. Artificial intelligence-powered smartphone application, AICaries, improves at-home dental caries screening in children: Moderated and unmoderated usability test. PLoS Digit. Health 2022, 1, e0000046. [Google Scholar] [CrossRef]
  26. U.S. Food and Drug Administration. Second Opinion® 3D—510(k) Premarket Notification (K243989). FDA Medical Devices Database 2025. Available online: https://www.accessdata.fda.gov/cdrh_docs/pdf24/K243989.pdf (accessed on 29 November 2025).
  27. Pearl. Available online: https://www.hellopearl.com (accessed on 22 November 2025).
  28. Diagnocat. Available online: https://diagnocat.com/en (accessed on 20 November 2025).
  29. CranioCatch. Available online: https://www.craniocatch.com (accessed on 22 November 2025).
  30. DentalMonitoring. Available online: https://dentalmonitoring.com (accessed on 22 November 2025).
  31. U.S. Food and Drug Administration. Overjet Dental Assist—510(k) Premarket Notification (K210187). FDA Medical Devices Database 2021. Available online: https://www.accessdata.fda.gov/scripts/cdrh/cfdocs/cfpmn/pmn.cfm?ID=K210187 (accessed on 29 November 2025).
  32. Overjet. Available online: https://www.overjet.ai (accessed on 22 November 2025).
  33. Mendes, A.C.; Quintanilha, D.B.P.; Pessoa, A.C.P.; de Paiva, A.C.; dos Santos Neto, P.D.A. Automated tooth detection and numbering in panoramic radiographs using YOLO. Procedia Comput. Sci. 2025, 256, 1318–1325. [Google Scholar] [CrossRef]
  34. Khanagar, S.B.; Alfouzan, K.; Alkadi, L.; Albalawi, F.; Iyer, K.; Awawdeh, M. Performance of artificial intelligence (AI) models designed for application in pediatric dentistry—A systematic review. Appl. Sci. 2022, 12, 9819. [Google Scholar] [CrossRef]
  35. Rokhshad, R.; Zhang, P.; Mohammad-Rahimi, H.; Shobeiri, P.; Schwendicke, F. Current applications of artificial intelligence for pediatric dentistry: A systematic review and meta-analysis. Pediatr. Dent. 2024, 46, 27–35. [Google Scholar]
  36. Schwarzmaier, J.; Frenkel, E.; Neumayr, J.; Ammar, N.; Kessler, A.; Schwendicke, F.; Kühnisch, J.; Dujic, H. Validation of an artificial intelligence-based model for early childhood caries detection in dental photographs. J. Clin. Med. 2024, 13, 5215. [Google Scholar] [CrossRef]
  37. Felsch, M.; Meyer, O.; Schlickenrieder, A.; Engels, P.; Schönewolf, J.; Zöllner, F.; Heinrich-Weltzien, R.; Hesenius, M.; Hickel, R.; Gruhn, V.; et al. Detection and localization of caries and hypomineralization on dental photographs with a vision transformer model. npj Digit. Med. 2023, 6, 198. [Google Scholar] [CrossRef]
  38. Tuzoff, D.V.; Tuzova, L.N.; Bornstein, M.M.; Krasnov, A.S.; Kharchenko, M.A.; Nikolenko, S.I.; Sveshnikov, M.M.; Bednenko, G.B. Tooth detection and numbering in panoramic radiographs using convolutional neural networks. Dentomaxillofac. Radiol. 2019, 48, 20180051. [Google Scholar] [CrossRef]
  39. Estai, M.; Tennant, M.; Gebauer, D.; Brostek, A.; Vignarajan, J.; Mehdizadeh, M.; Saha, S. Deep learning for automated detection and numbering of permanent teeth on panoramic images. Dentomaxillofac. Radiol. 2022, 51, 20210296. [Google Scholar] [CrossRef]
  40. Karamüftüoğlu, N.; Bulut, A.; Akın, M.; Sağıroğlu, S. Panoramic radiograph-based deep learning models for diagnosis and clinical decision support of furcation lesions in primary molars. Children 2025, 12, 1517. [Google Scholar] [CrossRef]
  41. Kim, E.; Hwang, J.J.; Cho, B.H.; Lee, E.; Shin, J. Classification of presence of missing teeth in each quadrant using deep learning artificial intelligence on pediatric panoramic radiographs. J. Clin. Pediatr. Dent. 2024, 48, 76–85. [Google Scholar] [CrossRef]
Figure 1. PRISMA flow diagram.
Figure 1. PRISMA flow diagram.
Children 13 00152 g001
Figure 2. Pooled sensitivity and specificity.
Figure 2. Pooled sensitivity and specificity.
Children 13 00152 g002
Figure 3. Pooled AUC.
Figure 3. Pooled AUC.
Children 13 00152 g003
Figure 4. Forest plot—ECC detection [7,9,18,19,20,21,22,23,24,25].
Figure 4. Forest plot—ECC detection [7,9,18,19,20,21,22,23,24,25].
Children 13 00152 g004
Table 1. Evidence map of pediatric AI studies.
Table 1. Evidence map of pediatric AI studies.
Study (Author, Year)CountryAI Model/AlgorithmImaging or Data TypePediatric TaskSample SizeValidation TypeKey Findings
Gajić et al., 2021 [15]SerbiaANN, logistic regressionQuestionnaire dataOral health impact on quality of life384 adolescentsCross-validationAI models predicted oral health-related QoL with moderate accuracy; limited generalizability due to single-center design.
Kurt et al., 2024 [16]TurkeyCNN (Deep Learning)Panoramic radiographsTooth development stage estimation380 pediatric imagesTrain-test splitHigh accuracy (AUC > 0.90); retrospective, single-center dataset limits external validity.
Ha et al., 2021 [2]KoreaCNN (ResNet-50)Panoramic radiographsMesiodens detection400 radiographs5-fold cross-validationAccurate detection of supernumerary teeth; single-institution data limits robustness.
Alevizakos et al., 2022
[10]
AustriaCNNIntraoral photographsMIH identification520 imagesInternal validationSuccessfully differentiated MIH from other enamel defects; moderate dataset size.
Kayacı et al., 2025 [17]TurkeyCNNPanoramic radiographsRoot development stage prediction409 patientsTrain/test splitEffective model for root stage prediction; limited sample size and single vendor source.
Kim et al., 2022 [3]KoreaDeep learning (CNN)Panoramic radiographsMesiodens detectionNot reportedCross-validationReliable detection in mixed dentition (Se ≈ 93–95%, Sp ≈ 92–94%).
Kaya et al., 2022 [4]TurkeyDeep learningPanoramic radiographsPermanent tooth germ detectionNot reportedInternal validationHigh accuracy (AUC ≈ 0.95) for early tooth germ localization.
Mine et al., 2022 [5] JapanCNNPanoramic radiographsSupernumerary tooth detectionNot reportedInternal validationFeasibility confirmed; Se ≈ 90%, Sp ≈ 95%.
Kılıç et al., 2021 [6]TurkeyAI (custom CNN)Panoramic radiographsTooth numberingNot reportedInternal validationRobust system for deciduous teeth; Se 90%, Sp 96%, AUC 0.98.
Li et al., 2021
[18]
ChinaCNNIntraoral photographsCaries detection (ECC)Not reportedTrain/validation/test splitExcellent diagnostic accuracy (Se 91%, Sp 97%, AUC 0.98).
Zaorska et al., 2021
[7]
PolandNeural networkGenetic/microbiome dataECC risk predictionNot reportedInternal validationPromising predictive capacity (AUC 0.89).
Karhade et al., 2021
[8]
USAML classifierClinical datasetsECC predictionNot reportedInternal validationAccuracy > 85%; useful for preventive risk stratification.
Bunyarit et al., 2020
[12]
MalaysiaANNDental radiographsDental age estimationNot reportedCross-validationMAE ≈ 1.7 years; superior to traditional methods.
Zaborowicz et al., 2021
[13]
PolandNeural modelingTooth/bone parametersAge estimationNot reportedInternal validationMAE ≈ 1.6 years; precise chronological age estimation.
Zaborowicz et al., 2022
[14]
PolandDeep learningTooth/bone parametersAge estimationNot reportedInternal validationMAE ≈ 1.5 years; improved accuracy vs. classical approaches.
Table 2. Characteristics and diagnostic performance of AI studies in pediatric dentistry.
Table 2. Characteristics and diagnostic performance of AI studies in pediatric dentistry.
Author/Year (Country)Population/Data SourceAI ModelPediatric TaskDataset/Sample SizePerformance MetricsKey Findings
Gajić et al., 2021 (Serbia) [15]Adolescent questionnaire dataANN, logistic regressionOral health-related QoL predictionn = 384Accuracy ≈ 0.75Moderate prediction accuracy; single-center limitation.
Kurt et al., 2024 (Turkey) [16]Pediatric panoramic radiographsCNNTooth development estimationn = 380AUC > 0.90Strong diagnostic capability; limited generalizability.
Ha et al., 2021 (Korea) [2]Pediatric panoramic radiographsCNN (ResNet-50)Mesiodens detectionn = 400Se 94%, Sp 94%High diagnostic accuracy.
Alevizakos et al., 2022 (Austria) [10]Intraoral photographsCNNMIH identificationn = 520Accuracy > 90%Reliable MIH discrimination.
Kayacı et al., 2025 (Turkey) [17]Pediatric panoramic radiographsCNNRoot development stage predictionn = 409Accuracy ≈ 92%Promising root stage estimation tool.
Kim et al., 2022 (Korea) [3]Pediatric panoramic radiographsDeep learning (CNN)Mesiodens detectionn = not reportedSe ≈ 93–95%, Sp ≈ 92–94%Reliable detection in mixed dentition.
Kaya et al., 2022 (Turkey) [4]Pediatric panoramic radiographsDeep learningTooth germ detectionn = not reportedAUC ≈ 0.95Accurate germ localization.
Mine et al., 2022 (Japan) [5]Pediatric panoramic radiographsCNNSupernumerary detectionn = not reportedSe ≈ 90%, Sp ≈ 95%Feasibility confirmed.
Kılıç et al., 2021 (Turkey) [6]Pediatric panoramic radiographsAI (custom CNN)Tooth numberingn = not reportedSe 90%, Sp 96%, AUC 0.98Robust numbering accuracy.
Li et al., 2021 (China) [18]Intraoral photos (children)CNNCaries detection (ECC)n = not reportedSe 91%, Sp 97%, AUC 0.98Excellent ECC diagnostic accuracy.
Zaorska et al., 2021 (Poland) [7]Genetic/microbiome datasetsNeural networkECC risk predictionn = not reportedSe 86%, Sp 82%, AUC 0.89Strong predictive model.
Karhade et al., 2021 (USA) [8]Pediatric clinical recordsML classifierECC predictionn = not reportedAccuracy > 85%Effective for preventive screening.
Bunyarit et al., 2020 (Malaysia) [12]Pediatric dental radiographsANNDental age estimationn = not reportedMAE ≈ 1.7 yearsSuperior to traditional estimation.
Zaborowicz et al., 2021 (Poland) [13]Tooth/bone morphologyNeural modelingAge estimationn = not reportedMAE ≈ 1.6 yearsHigh-precision chronological estimation.
Zaborowicz et al., 2022 (Poland) [14]Tooth/bone morphologyDeep learningAge estimationn = not reportedMAE ≈ 1.5 yearsEnhanced prediction accuracy.
Table 3. Pooled diagnostic performance of AI in pediatric dentistry.
Table 3. Pooled diagnostic performance of AI in pediatric dentistry.
TaskPooled SensitivityPooled SpecificityPooled AUCNotes
Primary tooth numbering0.900.960.98Panoramic radiographs
Mesiodens detection0.940.94-Panoramic and periapical radiographs
ECC detection0.910.970.98Clinical photos/biofilm
ECC prediction0.860.820.89Clinical/microbiome data
Dental age estimation---MAE ≈ 1.7 years
Table 4. Pediatric-relevant AI software in dentistry.
Table 4. Pediatric-relevant AI software in dentistry.
SoftwareDeveloper/OriginPrimary FunctionalityPediatric ApplicationsValidation/Regulatory StatusReference/Source
Pearl Second OpinionPearl Inc., Beverly Hills, CA, USADeep-learning radiographic analysis platform for caries and pathology detectionAssists in early caries identification in mixed dentition and ECC risk predictionFDA-cleared (2025) for dental radiograph analysis [26]https://www.hellopearl.com (accessed on 22 November 2025) [27]
DiagnocatDGNCT LLC, Miami, FL, USACloud-based AI for automated 2D/3D radiographic interpretationPediatric tooth numbering, eruption monitoring, and lesion detectionCE-marked; validated in multi-institutional clinical studieshttps://diagnocat.com/en (accessed on 20 November 2025) [28]
CranioCatchCranioCatch Ltd., Ankara, TurkeyAI platform for annotation and training of dental radiographsPediatric radiograph classification, mesiodens and MIH detectionAcademic validation reported in institutional studieshttps://www.craniocatch.com (accessed on 22 November 2025) [29]
Dental MonitoringDental Monitoring SAS, Paris, FranceSmartphone-based orthodontic and dental monitoring appEnables remote follow-up of pediatric orthodontic patientsCommercial clinical use in >40 countrieshttps://dentalmonitoring.com (accessed on 22 November 2025) [30]
Overjet AIOverjet Inc. Boston, MA, USAAI-driven analysis of bitewing radiographs for caries and bone lossPotential for mixed-dentition caries evaluation and treatment planningFDA-cleared (2021) [31]https://www.overjet.ai (accessed on 22 November 2025) [32]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Karamüftüoğlu, N.; Üçpunar, B.Y.; Birben, İ.; Altundağ, A.E.; Mullaoğlu, K.Ö.; Bal, C. Artificial Intelligence in Pediatric Dentistry: A Systematic Review and Meta-Analysis. Children 2026, 13, 152. https://doi.org/10.3390/children13010152

AMA Style

Karamüftüoğlu N, Üçpunar BY, Birben İ, Altundağ AE, Mullaoğlu KÖ, Bal C. Artificial Intelligence in Pediatric Dentistry: A Systematic Review and Meta-Analysis. Children. 2026; 13(1):152. https://doi.org/10.3390/children13010152

Chicago/Turabian Style

Karamüftüoğlu, Nevra, Büşra Yavuz Üçpunar, İrem Birben, Asya Eda Altundağ, Kübra Örnek Mullaoğlu, and Cenkhan Bal. 2026. "Artificial Intelligence in Pediatric Dentistry: A Systematic Review and Meta-Analysis" Children 13, no. 1: 152. https://doi.org/10.3390/children13010152

APA Style

Karamüftüoğlu, N., Üçpunar, B. Y., Birben, İ., Altundağ, A. E., Mullaoğlu, K. Ö., & Bal, C. (2026). Artificial Intelligence in Pediatric Dentistry: A Systematic Review and Meta-Analysis. Children, 13(1), 152. https://doi.org/10.3390/children13010152

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop