Artificial Intelligence for Color Prediction and Esthetic Design in CAD/CAM Ceramic Restorations: A Systematic Review and Meta-Analyses
Abstract
1. Introduction
2. Materials and Methods
2.1. Eligibility Criteria
- Participants (P): In vitro, in silico with experimental validation, and clinical studies involving CAD/CAM ceramic restorations.
- Intervention (I): Application of artificial intelligence (AI) or machine learning (ML) for color prediction of CAD/CAM ceramic restorations or automated esthetic crown design (morphology, occlusion, proximal contacts, finish-line detection, or internal fit).
- Comparisons (C): Conventional or reference methods, such as spectrophotometry, spectroradiometry, colorimetry, technician-designed CAD/CAM crowns, or established commercial CAD software.
- Outcomes (O): For color prediction: L*, a*, and b* coordinates, ΔE/ΔE00 values, RMSE, mean absolute error (MAE), R2 values, and thresholds of perceptibility/acceptability. For crown design: morphological accuracy (3D similarity, RMS deviation, Hausdorff distance, or Intersection over Union), occlusal and proximal contact accuracy, finish-line detection error, internal fit, and design time.
- Study design (S): Original experimental or observational studies with quantitative results.
2.2. Exclusion Criteria
2.3. Information Sources and Search Strategy
2.4. Selection Process
2.5. Data Collection Process
2.6. Data Items
2.7. Outcome Definitions
- Design time: The total time (in seconds or minutes) required for the software or operator to generate a complete crown design, including automated steps and operator-adjusted phases when applicable.
- Internal fit: The discrepancy (µm) between the intaglio surface of the designed crown and the prepared abutment surface, typically quantified using root-mean-square (RMS) error or equivalent gap-measurement techniques.
- Finish-line accuracy: The deviation (mm) between the AI-detected margin line and the reference margin determined by experienced technicians or validated gold standards, most commonly expressed as mean Hausdorff distance.
- Color-prediction acceptability: The proportion of AI-generated color predictions falling within each study’s prespecified CIEDE2000 acceptability threshold (AT00), reflecting clinically acceptable shade matching.
- Morphology deviation: The three-dimensional geometric discrepancy between the AI-generated crown and the reference model, measured using RMS deviation, Chamfer distance, or other 3D surface comparison metrics.
- Functional contacts (occlusal and proximal): The presence, number, position, or intensity of simulated occlusal and proximal contacts, typically assessed through intermesh distance thresholds, contact-point loss metrics, or virtual articulating papers.
2.8. Study Risk of Bias Assessment
2.9. Certainty of Evidence (GRADE Assessment)
2.10. Data Synthesis and Statistical Analysis
3. Results
3.1. Characteristics of Included Studies
3.2. Design Time for Crown Design
3.3. Internal Fit
3.4. Finish-Line Accuracy (Hausdorff Distance, mm)
3.5. Color-Prediction Acceptability
3.6. Morphology Deviation
3.7. Occlusal and Proximal Contacts
3.8. Study Risk of Bias Assessment
3.9. Certainty of Evidence (GRADE)
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Miyazaki, T.; Hotta, Y.; Kunii, J.; Kuriyama, S.; Tamaki, Y. A review of dental CAD/CAM: Current status and future perspectives from 20 years of experience. Dent. Mater. J. 2009, 28, 44–56. [Google Scholar] [CrossRef]
- Miyazaki, T.; Nakamura, T.; Matsumura, H.; Ban, S.; Kobayashi, T. Current status of zirconia restoration. J. Prosthodont. Res. 2013, 57, 236–261. [Google Scholar] [CrossRef]
- Rexhepi, I.; Santilli, M.; D’Addazio, G.; Tafuri, G.; Manciocchi, E.; Caputi, S.; Sinjari, B. Clinical Applications and Mechanical Properties of CAD-CAM Materials in Restorative and Prosthetic Dentistry: A Systematic Review. J. Funct. Biomater. 2023, 14, 431. [Google Scholar] [CrossRef]
- Saravi, B.; Vollmer, A.; Hartmann, M.; Lang, G.; Kohal, R.J.; Boeker, M.; Patzelt, S.B.M. Clinical Performance of CAD/CAM All-Ceramic Tooth-Supported Fixed Dental Prostheses: A Systematic Review and Meta-Analysis. Materials 2021, 14, 2672. [Google Scholar] [CrossRef] [PubMed]
- Morsy, N.; Holiel, A.A. Color difference for shade determination with visual and instrumental methods: A systematic review and meta-analysis. Syst. Rev. 2023, 12, 95. [Google Scholar] [CrossRef] [PubMed]
- Cho, J.H.; Çakmak, G.; Yi, Y.; Yoon, H.I.; Yilmaz, B.; Schimmel, M. Tooth morphology, internal fit, occlusion and proximal contacts of dental crowns designed by deep learning-based dental software: A comparative study. J. Dent. 2024, 141, 104830. [Google Scholar] [CrossRef] [PubMed]
- Kelly, J.R.; Benetti, P. Ceramic materials in dentistry: Historical evolution and current practice. Aust. Dent. J. 2011, 56 (Suppl. S1), 84–96. [Google Scholar] [CrossRef] [PubMed]
- Heffernan, M.J.; Aquilino, S.A.; Diaz-Arnold, A.M.; Haselton, D.R.; Stanford, C.M.; Vargas, M.A. Relative translucency of six all-ceramic systems. Part II: Core and veneer materials. J. Prosthet. Dent. 2002, 88, 10–15. [Google Scholar] [CrossRef]
- Kose, C., Jr.; Oliveira, D.; Pereira, P.N.R.; Rocha, M.G. Using artificial intelligence to predict the final color of leucite-reinforced ceramic restorations. J. Esthet. Restor. Dent. 2023, 35, 105–115. [Google Scholar] [CrossRef]
- Mascaro, B.A.; Conejo, R.V.; Tejada-Casado, M.; Fonseca, R.G.; Reis, J.M.D.S.N.; Pérez, M.M. Machine learning regression models for color prediction of CAD-CAM materials against different tooth-colored backgrounds. J. Dent. 2025, 161, 105994. [Google Scholar] [CrossRef]
- Yang, J.; Hao, Z.; Xu, J.; Wang, J.; Jiang, X. Fusion machine learning model predicts CAD-CAM ceramic colors and the corresponding minimal thicknesses over various clinical backgrounds. Dent. Mater. 2024, 40, 285–296. [Google Scholar] [CrossRef]
- Ding, H.; Cui, Z.; Maghami, E.; Chen, Y.; Matinlinna, J.P.; Pow, E.H.N.; Fok, A.S.L.; Burrow, M.F.; Wang, W.; Tsoi, J.K.H. Morphology and mechanical performance of dental crown designed by 3D-DCGAN. Dent. Mater. 2023, 39, 320–332. [Google Scholar] [CrossRef]
- Wu, Z.; Zhang, C.; Ye, X.; Dai, Y.; Zhao, J.; Zhao, W.; Zheng, Y. Comparison of the Efficacy of Artificial Intelligence-Powered Software in Crown Design: An In Vitro Study. Int. Dent. J. 2025, 75, 127–134. [Google Scholar] [CrossRef]
- Choi, J.; Ahn, J.; Park, J.M. Deep learning-based automated detection of the dental crown finish line: An accuracy study. J. Prosthet. Dent. 2024, 132, 1286.e1–1286.e9. [Google Scholar] [CrossRef]
- Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef]
- Barker, T.H.; Stone, J.C.; Sears, K.; Klugar, M.; Leonardi-Bee, J.; Tufanaru, C.; Aromataris, E.; Munn, Z. Revising the JBI quantitative critical appraisal tools to improve their applicability: An overview of methods and the development process. JBI Evid. Synth. 2023, 21, 478–493. [Google Scholar] [CrossRef] [PubMed]
- Sterne, J.A.; Hernán, M.A.; Reeves, B.C.; Savović, J.; Berkman, N.D.; Viswanathan, M.; Henry, D.; Altman, D.G.; Ansari, M.T.; Boutron, I.; et al. ROBINS-I: A tool for assessing risk of bias in non-randomised studies of interventions. BMJ 2016, 355, i4919. [Google Scholar] [CrossRef]
- Guyatt, G.H.; Oxman, A.D.; Vist, G.E.; Kunz, R.; Falck-Ytter, Y.; Alonso-Coello, P.; Schünemann, H.J.; GRADE Working Group. GRADE: An emerging consensus on rating quality of evidence and strength of recommendations. BMJ 2008, 336, 924–926. [Google Scholar] [CrossRef] [PubMed]
- Sawangsri, K.; Bekkali, M.; Lutz, N.; Alrashed, S.; Hsieh, Y.L.; Lai, Y.C.; Arreaza, C.; Nassani, L.M.; Hammoudeh, H.S. Acceptability and deviation of finish line detection and restoration contour design in single-unit crown: Comparative evaluation between 2 AI-based CAD software programs and dental laboratory technicians. J. Prosthet. Dent. 2025, 134, 409–417. [Google Scholar] [CrossRef] [PubMed]
- Nagata, K.; Inoue, E.; Nakashizu, T.; Seimiya, K.; Atsumi, M.; Kimoto, K.; Kuroda, S.; Hoshi, N. Verification of the accuracy and design time of crowns designed with artificial intelligence. J. Adv. Prosthodont. 2025, 17, 1–10. [Google Scholar] [CrossRef]
- Broll, A.; Goldhacker, M.; Hahnel, S.; Rosentritt, M. Morphological effects of input data quantity in AI-powered dental crown design. J. Dent. 2025, 159, 105767. [Google Scholar] [CrossRef]
- Çakmak, G.; Cho, J.H.; Choi, J.; Yoon, H.I.; Yilmaz, B.; Schimmel, M. Can deep learning-designed anterior tooth-borne crown fulfill morphologic, aesthetic, and functional criteria in clinical practice? J. Dent. 2024, 150, 105368. [Google Scholar] [CrossRef]
- Chen, Y.; Lee, J.K.Y.; Kwong, G.; Pow, E.H.N.; Tsoi, J.K.H. Morphology and fracture behavior of lithium disilicate dental crowns designed by human and knowledge-based AI. J. Mech. Behav. Biomed. Mater. 2022, 131, 105256. [Google Scholar] [CrossRef]
- Chau, R.C.W.; Hsung, R.T.; McGrath, C.; Pow, E.H.N.; Lam, W.Y.H. Accuracy of artificial intelligence-designed single-molar dental prostheses: A feasibility study. J. Prosthet. Dent. 2024, 131, 1111–1117. [Google Scholar] [CrossRef]
- Cho, J.H.; Yi, Y.; Choi, J.; Ahn, J.; Yoon, H.I.; Yilmaz, B. Time efficiency, occlusal morphology, and internal fit of anatomic contour crowns designed by dental software powered by generative adversarial network: A comparative study. J. Dent. 2023, 138, 104739. [Google Scholar] [CrossRef] [PubMed]
- Tian, S.; Wang, M.; Dai, N.; Ma, H.; Li, L.; Fiorenza, L.; Sun, Y.; Li, Y. DCPR-GAN: Dental Crown Prosthesis Restoration Using Two-Stage Generative Adversarial Networks. IEEE J. Biomed. Health Inform. 2022, 26, 151–160. [Google Scholar] [CrossRef]
- Shim, J.S.; Lee, J.S.; Lee, J.Y.; Choi, Y.J.; Shin, S.W.; Ryu, J.J. Effect of software version and parameter settings on the marginal and internal adaptation of crowns fabricated with the CAD/CAM system. J. Appl. Oral Sci. 2015, 23, 515–522. [Google Scholar] [CrossRef] [PubMed]
- Tribst, J.P.M.; Hosseini, F.; Pilecco, R.O.; Serrano, C.M.; Kleverlaan, C.J.; Dal Piva, A.M.O. The Influence of Extra-Fine Milling Protocol on the Internal Fit of CAD/CAM Composite and Ceramic Crowns. Materials 2024, 17, 5601. [Google Scholar] [CrossRef]
- Farag, E.A.A.; Rizk, A.; Ashraf, R.; Emad Eldin, F. Effect of the scanner type on the marginal gap and internal fit of two monolithic CAD/CAM esthetic crown materials: An in vitro study. Dent. Med. Probl. 2024. ahead of print. [Google Scholar] [CrossRef]
- Litzenburger, A.P.; Hickel, R.; Richter, M.J.; Mehl, A.C.; Probst, F.A. Fully automatic CAD design of the occlusal morphology of partial crowns compared to dental technicians’ design. Clin. Oral Investig. 2013, 17, 491–496. [Google Scholar] [CrossRef] [PubMed]
- Uribe-Hernández, L.; Latorre-Correa, F.; Perea-Lowery, L.; Ardila, C.M. Effect of preheating and curing lamp distance on the degree of conversion of four nanohybrid resins: An in vitro study. J. Clin. Exp. Dent. 2024, 16, e975–e983. [Google Scholar] [CrossRef] [PubMed]










| Author | Domain | Setting | Sample Size | AI Method | Comparator | Outcomes | Thresholds |
|---|---|---|---|---|---|---|---|
| Cho et al. (2024) [6] | Design | In vivo digital datasets; 30 partial-arch scans of prepared posterior teeth (16 maxillary, 14 mandibular) from a single dental laboratory; STL files; IRB-exempt. | 30, 16, 14 | Two deep-learning–based commercial CAD tools: Automate (3Shape; AA) and Dentbird Crown (Imagoworks; AD); AD employs GAN/CNN per prior work; designs auto-generated without technician intervention. | Technician-designed CAD comparator (Dental System, 3Shape; NC) by an experienced technician. | 3D morphology deviation (RMS, positive/negative averages; % within ±50 µm), internal fit RMS (µm), margin-line location RMS (µm), cusp angle (°), occlusal contact count and intensity, proximal contact intensity. | Applied criteria/tolerances: ±50 µm for morphology; ±40 µm for internal fit (cement space); occlusal contact defined as intermesh distance < 20 µm; proximal contact preset −20 µm with intensity classes. |
| Kose et al. (2023) [9] | Color | In vitro (CAD/CAM leucite-reinforced ceramics; IPS Empress CAD HT/LT A1/A3; thicknesses: 0.3–1.2 mm; substrates: black, A1, A3, white) | 1, 3, 0, 3, 1, 2, 1, 3 | Multiple machine-learning regressors to predict L*, a*, b*, and ΔE00. | Spectrophotometric reference. | ΔE00 prediction error (and ΔL/Δa/Δb), MAE, R2 | ΔE00 acceptability threshold = 1.77. |
| Mascaro et al. (2025) [10] | Color | In vitro (four CAD/CAM materials—Lava Ultimate, Grandio Blocs, VITA Enamic, VITA Mark II; thicknesses 0.5–1.5 mm with n = 3 per material/thickness; measured against white, black, and nine tooth-colored ND1–ND9 backgrounds using a spectroradiometer). | 0, 5, 1, 5, 3, 1, 9 | Partial Least Squares (PLS) regression to predict L*, a*, and b* from material, thickness, and background. | Spectroradiometric ground truth (predicted vs. measured CIE L*a*b*). | ΔE00 between predicted and measured; RMSE for L*, a*, b* (e.g., overall mean ΔE00 = 1.04 in approach 1; by-material mean ≈ 0.97: LU 1.01, GB 1.05, VE 1.10, VM 0.71; best RMSE for a* ≈ 0.14 in approach 3). | ΔE00 thresholds applied: PT00 = 0.80; AT00 = 1.81. |
| Yang et al. (2024) [11] | Color | In vitro (4 CAD/CAM materials: IPS e.max CAD, IPS e.max ZirCAD, Upcera Li CAD, Upcera TT CAD; thicknesses: 0.5/1.0/2.0 mm; n = 10 per thickness → 120 specimens; 7 backgrounds: A1, A2, A3.5, ND2, ND7, cobalt–chromium [CC], medium-precious alloy [MPA]). | 4, 0, 5, 1, 0, 2, 0, 10, 120, 7, 1, 2, 3, 5, 2, 7 | Fusion/stacking ML model combining ExtraTreesRegressor and XGBRegressor; SHAP used for feature importance/explainability. | Spectrophotometric ground truth (CIELab measured with a digital spectrophotometer; predictions benchmarked to measured values). | ΔE (CIE76) and ΔL prediction performance (R2 and RMSE); external-test performance of fusion model for ΔE: R2 ≈ 0.906, RMSE ≈ 0.348; minimal thickness estimates per background to meet targets. | ΔE perceptibility = 2.6, acceptability = 5.5; ΔL threshold = 0.5 (used to derive minimal-thickness recommendations. |
| Ding et al. (2023) [12] | Design | In silico + experimental: training on 600 full-arch digital casts from healthy subjects (intraoral scans; teeth 44–46); independent test set of 12 mandibular second-premolar cases. | 600, 44, 46, 12 | 3D-DCGAN (true 3D deep-convolutional GAN) implemented in PyTorch for automated crown design. | Natural tooth (NT), CEREC Biogeneric individual design (BI), and technician CAD design (TD). | 3D morphology discrepancy (RMS, mean positive/negative deviation)—e.g., NT vs. AI RMS = 0.3611 (0.1160); cusp angle (means: NT 54.05°, AI 49.43°, BI 67.11°, TD 63.34°); occlusal contact number/area using virtual articulating papers (100 µm, 200 µm); dynamic FEA (max principal and shear stresses; fatigue lifetime under 100–400 N). | None explicitly; operational parameters included virtual articulating paper thicknesses (100/200 µm) and FEA loading range of 100–400 N. |
| Wu et al. (2025) [13] | Design | In vitro (33 intraoral scan datasets of posterior crowns; reference built from clinically adapted crowns; TRIOS 3 scans; STL files). | 33, 3 | Two AI-powered design programs: Automate (3Shape; AA) and Dentbird Crown (Imagoworks; AD—reports using GAN/CNN); fully automated crown generation. | Exocad DentalCAD by experienced technician (CE) and novice technician (CN); Std crown set as reference. | Design time (seconds, median [IQR]); morphological accuracy as RMS deviation (mm) for occlusal, mesial, distal surfaces and margin lines. | Applied criteria/tolerances: deviation map range ±500 µm with “green” tolerance ±50 µm; occlusal contact defined as intermesh distance < 20 µm; Std marginal fit noted as <120 µm. |
| Choi et al. (2024) [14] | Design (finish-line) | Retrospective dataset of 182 jaw scans; evaluation on two sets—desktop-scanner trimmed casts (DS: 58 anterior, 83 posterior) and intraoral scans (IS: 35 anterior, 58 posterior). | 182, 58, 83, 35, 58 | Hybrid deep learning + CAD (Dentbird): CNN preparation-tooth detector (CenterNet with Stacked Hourglass), UneXt encoder–decoder finish-line segmentor, spline curve-on-mesh. | 3Shape Dental System 2021-1, exocad DentalCAD 3.1, MEDIT Margin Lines 1.0. | Hausdorff distance (mm) and Chamfer distance (a.u.) with mean ± SD by group (e.g., IS posterior HD means—Dentbird 0.543, Dental System 0.712, DentalCAD 0.635, Medit 0.694); counts of clinically acceptable preparations. | Clinical thresholds applied: DS—HD ≤ 0.366 mm, CD ≤ 0.026; IS—HD ≤ 0.566 mm, CD ≤ 0.100. |
| Sawangsri et al. (2025) [19] | Design (finish-line detection and restoration contour) | In vitro/retrospective convenience sample; 100 single-crown abutment scans with antagonists (TRIOS 4) replicated 3× and assigned to dental technicians (DT), Dentbird (DB), and Automate (AM) | 100, 4, 3 | Two fully automated AI-based CAD programs: Dentbird Crown 4.1.1 (hybrid deep learning + CAD) and Automate 1.02 (proprietary AI CAD) for autonomous finish-line detection and crown design. | Four Certified Dental Technicians using 3Shape Dental Manager (manual margin registration; library of choice) as the reference designs. | Acceptability scores (finish- line; 8-criterion restoration design), Hausdorff distance (mm) for finish-line deviation, and RMS error (mm) for restoration design; key results—AM lower deviation than DB (overall HD 0.132 ± 0.057 vs. 0.380 ± 0.431 mm; overall RMS 0.195 ± 0.059 vs. 0.253 ± 0.060 mm); DT highest acceptability. | Clinical thresholds/criteria: authors propose an acceptability threshold of ~130 µm for AI-generated finish-line deviation based on study findings; design parameters included occlusal clearance 0.00 mm, interproximal distance −0.03 mm, cement space 0.03 mm. |
| Nagata et al. (2025) [20] | Design | In vitro model study: master models for #15 (maxillary 2nd premolar) and #26 (maxillary 1st molar); TRIOS 3 intraoral scanning to STL; five dental technicians each designed one crown per tooth with both systems (total 20 crowns); milled from CAD-CAM blocks | 15, 2, 26, 1, 3, 20 | AI-equipped CAD (Dentbird; commercial deep-learning-assisted platform with automatic abutment/margin detection and auto crown design); conventional CAD: 3Shape Dental System. | Conventional CAD (3Shape Dental System) vs. AI-equipped CAD (Dentbird), both used by the same five technicians. | Occlusal-surface “accuracy” (RMS-type misfit across points a–f)—means (±SD) overall: conventional 275.5 ± 116.8 µm vs. AI 25.7 ± 13.0 µm; design time (min): #15—3Shape 397.2 ± 80.4 vs. Dentbird 99.4 ± 17.1; #26—3Shape 516.4 ± 61.3 vs. Dentbird 97.6 ± 11.1; proximal contact intensity distribution; marginal fit by micro-CT (e.g., #15 sites 52–72 µm; #26 sites 60–76 µm; no significant differences between systems). | Clinical thresholds/criteria: desirable marginal fit ≤ 120 µm cited from the prior literature; results for both systems remained <120 µm. |
| Broll et al. (2025) [21] | Design | Retrospective in vitro; n = 30 patients (11M/19F, age 22–31); CEREC PrimeScan intraoral scans; target lower first molar (#36/46); input data groups: full jaw (full), quadrant (quad), adjacent teeth (adj); antagonists included. | 30, 11, 19, 22, 31, 36, 46 | Commercial AI-based CAD: Dentbird Crown (Imagoworks Inc.); authors note a 2D deep-learning approach underlying the software. | Original tooth (ground truth) as reference; comparisons between full vs. quad vs. adj data quantities using the same software. | Morphology—Chamfer Distance (L2), complemented IoU; occlusion-specific—penetration loss, contact-point distance loss, contact-point position loss (Chamfer on CPs), contact-point number loss; failure rates for reconstruction/occlusion establishment reported. | None explicit; computational thresholds specified (e.g., occlusion threshold d_thresh = 0 mm; cluster convergence ε_conv = 0.45 mm; GT extraction ε = 1 × 10−6 mm); lower values indicate better performance for normalized metrics. |
| Çakmak et al. (2024) [22] | Design (clinical; anterior crown) | In vivo datasets: 25 complete/nearly complete arch scans with a prepared maxillary central incisor abutment; STL files; retrospective lab collection with recorded maxillomandibular relations. | 25 | Deep learning–based CAD (Dentbird Crown; GAN + CNN): as-generated output (DB) and technician-modified output (DM) versus conventional CAD without DL (NC). | DL as-generated (DB), DL technician-modified (DM), and technician-designed conventional CAD (NC). | Morphology—TDV (mm3), TDV ratio (%), linear deviation (RMS, PA, NA) by regions; function—incisal path deviation (RMS µm), length, mean inclination; aesthetics—width, height, width/height ratio, mesioincisal angle radius, proximal contact length, tooth axis. | Applied criteria/tolerances: operational design presets as above; no explicit external clinical acceptability thresholds reported. |
| Chen et al. (2022) [23] | Crown design (occlusal morphology and fracture behavior) | In vivo-derived datasets; 12 participants with mandibular premolar #45; lithium–disilicate crowns fabricated on 3D-printed casts | 12, 45, 3 | Knowledge-based AI (CEREC “biogeneric individual”, BI) vs. human CAD designs (experienced technician, TD; trained dental students, AD). | Human-designed CAD (TD, AD); original tooth used as morphology reference. | Occlusal profile discrepancy (avg ± SD), RMS_estimate, z-difference, volume/area discrepancy, cusp angle; load-to-fracture (N) and failure modes. | Clinical thresholds: fracture loads judged “clinically acceptable”; no explicit numerical fit tolerance reported. |
| Chau et al. (2024) [24] | Design | In vivo-derived casts and digital workflow; 169 participants recruited; 159 training casts and 10 validation pairs (maxillary right first molar removed/retained); evaluation on 10 × 10 comparisons. | 169, 159, 10, 10, 10 | 3D generative adversarial network (GAN) system to generate a biomimetic single-molar prosthesis from the remaining dentition. | The subject’s original natural tooth (ground truth) for morphology matching. | Mean Hausdorff distance (mm) and Intersection over Union (IoU) for “true reconstruction”; validation results—mean HD across matched pairs = 0.633 ± 0.961 mm; IoU = 0.600 (6/10 true reconstructions). | Clinical thresholds: none explicitly stated; authors note 0.500 (50%) as a commonly accepted IoU threshold in the segmentation literature; morphological error interpreted via HD (lower is better). |
| Cho et al. (2023) [25] | Design | In vivo-derived digital datasets; 30 partial-arch scans of prepared posterior abutments (12 intraoral, 18 cast) with jaw-relation records; retrospective single-lab STL collection; IRB-exempt. | 30, 12, 18 | GAN-based deep-learning dental design software (Dentbird Crown): CNN modules for prepared-tooth detection and margin-line segmentation; StyleGAN-based occlusal-surface generator; inner-surface generator for intaglio. | Conventional CAD (3Shape Dental System, non-AI) designed by an experienced technician (use of Auto Crown/Auto Placement per manufacturer). | Design time (T1–T6); 3D occlusal morphology deviation (RMS, µm) between initial and final CAD; internal fit RMS (µm) to abutment; finish-line deviation (µm). | Applied criteria/tolerances: deviation maps with nominal ±50 µm for morphology and ±40 µm for internal fit (reflecting 40-µm cement space); no explicit external clinical acceptability threshold. |
| Tian et al. (2022) [26] | Design | In silico (large 3D dental crown database of mandibular first molars #36/#46; depth-map representation; 780 samples total; 700 for training, 80 for testing). | 3, 36, 46, 780, 700, 80 | Two-stage GAN (DCPR-GAN): Stage-I conditional GAN for global morphology; Stage-II adds occlusal fingerprint constraint and GroNet groove-parsing loss; TensorFlow implementation. | Ground-truth target crowns and DL baselines. | PSNR, RMSE, SSIM, FSIM; deviation (RMS, SD) between generated occlusal surface and target crown < 0.161 mm; deviation boxplots and runtime (~11–15 s). | None explicit (computational similarity metrics; functional target enforced via occlusal fingerprint constraint). |
| Study | Setting and Study Design | Tool | Overall RoB | Key Concerns |
|---|---|---|---|---|
| Mascaro et al. [10] | In vitro color-prediction vs. spectrophotometry; AT00 = 1.81. | JBI | Some concerns | Small n per condition; blinding of measurements unclear; internal cross-validation. |
| Kose et al. [9] | In vitro ML regression for leucite-reinforced ceramics; spectrophotometer reference. | JBI | Some concerns | Specimen preparation/replicate independence unclear; no preregistration. |
| Yang et al. [11] | In vitro fusion ML for color/thickness across backgrounds; external test set. | JBI | Some concerns | Indirectness to clinical performance; assessor blinding not explicit. |
| Broll et al. [21] | In vitro retrospective dataset assessing morphology/occlusion from AI-CAD with varying input. | JBI | Some concerns | Convenience sample; custom metrics; analytical flexibility. |
| Sawangsri et al. [19] | In vitro comparison of two AI-CAD vs. technicians for finish-line detection and restoration design. | JBI | Some concerns | Potential conflicts of interest; qualitative and quantitative mix; assessor blinding unclear. |
| Nagata et al. [20] | In vitro master models; AI-equipped vs. conventional CAD; accuracy and design time. | JBI | Low–some concerns | Standardized protocol; generalizability limited; learning-curve effects. |
| Cho et al. [6] | In vitro partial-arch scans; DL-CAD vs. technician; morphology, internal fit, occlusion, proximal contacts. | JBI | Some concerns | Multiple outcomes; ROI/threshold choices; blinding not explicit. |
| Çakmak et al. [22] | In vivo nonrandomized comparison; morphology and functional outcomes. | ROBINS-I | Moderate | Confounding and selection; outcome assessment without blinding; few missing data. |
| Chen et al. [23] | In vivo-derived designs (12 participants) compared across knowledge-based AI vs. human CAD; ex vivo tests. | ROBINS-I | Moderate | Nonrandomized; blinding unclear; small sample. |
| Chau et al. [24] | In vitro; 3D-GAN evaluation with Hausdorff/IoU. | JBI | Some concerns | Validation n ≈ 10; best-fit alignment; non-clinical thresholds. |
| Choi et al. [14] | In silico/retrospective; DL for finish-line detection with proposed HD/Chamfer thresholds. | JBI | Some concerns | Convenience sampling; outlier removal rules; generalizability. |
| Tian et al. [26] | In silico two-stage DCPR-GAN; large bench dataset. | JBI | Some concerns | Validation on bench data; indirect clinical transferability. |
| Ding et al. [12] | In vitro/in silico 3D-DCGAN with mechanical/FE analysis. | JBI | Some concerns | Small test set; multiple endpoints; blinding not reported. |
| Wu et al. [13] | In vitro; two AI-CAD vs. expert/novice CAD; surface-wise accuracy. | JBI | Some concerns | Nonrandomized; per-surface metrics; mixed unit reporting. |
| Cho et al. [25] | In vivo clinical datasets; AI-CAD vs. conventional CAD. | ROBINS-I | Moderate | Nonrandomized design; possible operator bias; small samples. |
| Outcome | Evidence Base (k, Setting) | Summary of Findings | Main Limitations (GRADE Domains) | Certainty |
|---|---|---|---|---|
| Color-prediction acceptability (ΔE00 within AT00) | k = 2 in vitro ML (Kose et al. AT00 = 1.77 [9]; Mascaro et al. AT00 = 1.81 [10]) | High proportion of predictions within each study’s prespecified acceptability threshold. | Indirectness (bench → clinical), imprecision (low k/n), threshold heterogeneity (1.77 vs. 1.81). | Low |
| Internal fit | k = 2 (Cho et al. 2023 [25]; Nagata et al. 2025 [20]) | AI-assisted CAD showed better internal fit than conventional CAD in both studies. | Imprecision (small samples), indirectness (lab/retrospective), possible inconsistency across systems. | Very low–Low |
| Morphology deviation | k ≥ 7 (Chau et al. 2024 [24]; Tian et al. 2022 [26]; Ding et al. 2023 [12]; Cho et al. 2024 [6]; Nagata et al. 2025 [20]; Broll et al. 2025 [21]; Wu et al. 2025 [13]) | AI often matches or exceeds comparators on RMS/HD, but effects depend on metric/ROI/software. | Serious inconsistency (RMS µm vs. HD/Chamfer mm; custom indices), indirectness, imprecision. | Very low |
| Finish-line detection/restoration design | k = 2 (Choi et al. [14] 2024; Sawangsri et al. 2025 [19]) | Hybrid DL and CAD comparable or better; software-specific differences; thresholds proposed for HD and Chamfer. | Single-study per subtasks, potential COI, indirectness. | Very low–Low |
| Design time | k = 2 (Cho et al. 2023 [25]; Nagata et al. 2025 [20]) | AI-equipped CAD reduced design time compared with conventional CAD. | Imprecision (low k), indirectness. | Low |
| Functional contacts (occlusal and proximal) | k = 3 (Ding et al. 2023 [12]; Broll et al. 2025 [21]; Cho et al. 2024 [6]) | Contact counts/intensity varied; AI generally comparable, not consistently superior. | Heterogeneous definitions/units, low k, indirectness. | Very low |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ardila, C.M.; Pulgarín-Medina, D.M.; Pineda-Vélez, E.; Vivares-Builes, A.M. Artificial Intelligence for Color Prediction and Esthetic Design in CAD/CAM Ceramic Restorations: A Systematic Review and Meta-Analyses. Prosthesis 2025, 7, 160. https://doi.org/10.3390/prosthesis7060160
Ardila CM, Pulgarín-Medina DM, Pineda-Vélez E, Vivares-Builes AM. Artificial Intelligence for Color Prediction and Esthetic Design in CAD/CAM Ceramic Restorations: A Systematic Review and Meta-Analyses. Prosthesis. 2025; 7(6):160. https://doi.org/10.3390/prosthesis7060160
Chicago/Turabian StyleArdila, Carlos M., Diana María Pulgarín-Medina, Eliana Pineda-Vélez, and Anny M. Vivares-Builes. 2025. "Artificial Intelligence for Color Prediction and Esthetic Design in CAD/CAM Ceramic Restorations: A Systematic Review and Meta-Analyses" Prosthesis 7, no. 6: 160. https://doi.org/10.3390/prosthesis7060160
APA StyleArdila, C. M., Pulgarín-Medina, D. M., Pineda-Vélez, E., & Vivares-Builes, A. M. (2025). Artificial Intelligence for Color Prediction and Esthetic Design in CAD/CAM Ceramic Restorations: A Systematic Review and Meta-Analyses. Prosthesis, 7(6), 160. https://doi.org/10.3390/prosthesis7060160

