Author Contributions
Conceptualization, K.T., K.K., B.A., S.T. and P.P.; methodology, K.T., R.W., K.K. and P.P.; software, K.T. and P.P.; validation, K.T., R.W., S.T. and P.P.; formal analysis, K.T., R.W. and P.P.; investigation, K.T., R.W. and P.P.; resources, K.T., R.W., K.K. and B.A.; data curation, K.T., R.W. and P.P.; writing—original draft preparation, K.T., R.W. and H.S.; writing—review and editing, K.T., R.W., H.S., S.T. and P.P.; visualization, K.T., R.W. and S.T.; supervision, K.T. and P.P.; project administration, K.T.; funding acquisition, K.T. All authors have read and agreed to the published version of the manuscript.
Figure 1.
AE signal preprocessing: (a) Raw AE signal before denoising. (b) AE signal after Kalman filtering.
Figure 1.
AE signal preprocessing: (a) Raw AE signal before denoising. (b) AE signal after Kalman filtering.
Figure 2.
CWT scalograms of AE signals from different sample components and crown regions: (a) root component; (b) incisal; (c) labial; (d) left; (e) palatal; (f) right.
Figure 2.
CWT scalograms of AE signals from different sample components and crown regions: (a) root component; (b) incisal; (c) labial; (d) left; (e) palatal; (f) right.
Figure 3.
Proposed hierarchical deep learning framework for AE-based crack diagnosis.
Figure 3.
Proposed hierarchical deep learning framework for AE-based crack diagnosis.
Figure 4.
Experimental setup of zirconia sample: (a) implant-like cylindrical fixture with top-mounted AE sensor; (b) zirconia crown with labial-mounted AE sensor and data acquisition system.
Figure 4.
Experimental setup of zirconia sample: (a) implant-like cylindrical fixture with top-mounted AE sensor; (b) zirconia crown with labial-mounted AE sensor and data acquisition system.
Figure 5.
PLB source used for AE signal generation in accordance with ASTM E976.
Figure 5.
PLB source used for AE signal generation in accordance with ASTM E976.
Figure 6.
Mean ± SD of peak AE frequency (kHz) and peak amplitude (dB) per PLB excitation position for (a) Crown sample and (b) Root sample.
Figure 6.
Mean ± SD of peak AE frequency (kHz) and peak amplitude (dB) per PLB excitation position for (a) Crown sample and (b) Root sample.
Figure 7.
Individual PLB measurements of peak AE frequency (kHz) and peak amplitude (dB): (a) Crown and (b) Root. Stars denote the overall centroid of each dataset.
Figure 7.
Individual PLB measurements of peak AE frequency (kHz) and peak amplitude (dB): (a) Crown and (b) Root. Stars denote the overall centroid of each dataset.
Figure 8.
CNN–BiGRU architecture for Stage 1 component classification.
Figure 8.
CNN–BiGRU architecture for Stage 1 component classification.
Figure 9.
Architecture of the ResNet-18–BiLSTM model with multi-head attention for crown position classification.
Figure 9.
Architecture of the ResNet-18–BiLSTM model with multi-head attention for crown position classification.
Figure 10.
Mechanical loading setup for crown crack localization: (a) UTM; (b) close-up of the loading tip applied to the zirconia crown.
Figure 10.
Mechanical loading setup for crown crack localization: (a) UTM; (b) close-up of the loading tip applied to the zirconia crown.
Figure 11.
Zirconia crown sample mounted at a 135° inclination in a customized fixture.
Figure 11.
Zirconia crown sample mounted at a 135° inclination in a customized fixture.
Figure 12.
Training and validation loss curves of the CNN2D–BiGRU model.
Figure 12.
Training and validation loss curves of the CNN2D–BiGRU model.
Figure 13.
Confusion Matrix of the CNN2D–BiGRU of the CNN2D–BiGRU Model.
Figure 13.
Confusion Matrix of the CNN2D–BiGRU of the CNN2D–BiGRU Model.
Figure 14.
ROC Curve of the CNN2D–BiGRU Model.
Figure 14.
ROC Curve of the CNN2D–BiGRU Model.
Figure 15.
Training and Validation Loss Curves of ResNet-18 with BiLSTM–Attention model for Crown Position Classification.
Figure 15.
Training and Validation Loss Curves of ResNet-18 with BiLSTM–Attention model for Crown Position Classification.
Figure 16.
Confusion Matrix of the ResNet-18 with BiLSTM–Attention model for Multi-Class Crown Position Classification.
Figure 16.
Confusion Matrix of the ResNet-18 with BiLSTM–Attention model for Multi-Class Crown Position Classification.
Figure 17.
ROC Curves of the ResNet-18 with BiLSTM–Attention for Multi-Class Crown Position Classification.
Figure 17.
ROC Curves of the ResNet-18 with BiLSTM–Attention for Multi-Class Crown Position Classification.
Figure 18.
Example scalograms of AE signals acquired during onsite mechanical loading experiments in panels (a,b).
Figure 18.
Example scalograms of AE signals acquired during onsite mechanical loading experiments in panels (a,b).
Figure 19.
Confusion Matrix of CNN2D–BiGRU Component Classification under Onsite Mechanical Loading Conditions: (a) n = 30 and (b) n = 60.
Figure 19.
Confusion Matrix of CNN2D–BiGRU Component Classification under Onsite Mechanical Loading Conditions: (a) n = 30 and (b) n = 60.
Figure 20.
Confusion Matrix of ResNet-18 with BiLSTM–Attention for Crown Position Classification under Onsite Mechanical Loading Conditions: (a) n = 30 and (b) n = 60.
Figure 20.
Confusion Matrix of ResNet-18 with BiLSTM–Attention for Crown Position Classification under Onsite Mechanical Loading Conditions: (a) n = 30 and (b) n = 60.
Figure 21.
SEM micrographs of the zirconia crown surface after indentation testing from the 60 crown sample subjected to onsite mechanical loading: (a) overall surface morphology; (b) low-magnification image (100×) showing global deformation features; (c) high-magnification image (300×) highlighting crack propagation and chipping around the indentation site.
Figure 21.
SEM micrographs of the zirconia crown surface after indentation testing from the 60 crown sample subjected to onsite mechanical loading: (a) overall surface morphology; (b) low-magnification image (100×) showing global deformation features; (c) high-magnification image (300×) highlighting crack propagation and chipping around the indentation site.
Figure 22.
Comparative Force–Time and AE Response during Mechanical Loading, Highlighting Early AE-Based Detection and Late UTM Force-Drop Failure Identification.
Figure 22.
Comparative Force–Time and AE Response during Mechanical Loading, Highlighting Early AE-Based Detection and Late UTM Force-Drop Failure Identification.
Table 1.
Material specifications of SHOFU Disc ZR Lucent.
Table 1.
Material specifications of SHOFU Disc ZR Lucent.
| Parameter | Specification |
|---|
| Zirconia material | SHOFU Disc Lucent |
| Material type/classification | High-translucent pre-sintered zirconia (5Y-PSZ) |
| Flexural strength | >1000 MPa |
| Translucency | High |
| Disk diameter | ~98.5 mm |
| Shade/Layers | Monolayer or multilayer with gradual shade gradient (incisal ~30% to dentin/cervical ~35%) |
| Sintering temperature | ~1450 °C |
| Thermal expansion coefficient | ~10.2 × 10−6 K−1 (25–500 °C) |
| Dimensional verification precision | Digital caliper ± 0.01 mm |
| Electrical requirements | 100–240 V, 50/60 Hz |
Table 2.
Technical specifications of the WM500/1 AE sensor.
Table 2.
Technical specifications of the WM500/1 AE sensor.
| Parameter | Specification |
|---|
| Manufacturer | QingCheng AE Institute Co., Ltd., Guangzhou, China |
| Model | WM500/1 |
| Device type | Passive piezoelectric wideband AE sensor |
| Operating Frequency Range | 0.1–1 MHz |
| Resonant frequency | 500 kHz |
| Sensitivity peak | >65 dB (ref. V/(m/s)) |
| Operating temperature | −20 to 50 °C |
| Dimensions (D × H) | 4.7 × 4.0 mm |
| Weight | 13 g |
| Case material | SUS-304 stainless steel |
| Face material | SUS-304 stainless steel |
| Protection grade | IP68 |
| Connector type | BNC (integral cable) |
| Calibration standard | ISO 12714; Metallic Materials—Compression Testing at Room Temperature. International Organization for Standardization (ISO): Geneva, Switzerland, 2015 GB/T 19801; Metallic Materials—Compression Test Method at Room Temperature. Standardization Administration of China: Beijing, China, 2005 |
Table 3.
Technical specifications of the Picoscope 4262 data acquisition system.
Table 3.
Technical specifications of the Picoscope 4262 data acquisition system.
| Parameter | Specification |
|---|
| Manufacturer | Pico Technology, St Neots, UK |
| Model | Picoscope 4262 |
| Device Type | Digital Oscilloscope (Data Acquisition System) |
| Number of channels | 2 Channels |
| Resolution | 16-bit |
| Sampling Rate (Maximum) | 10 MS/s |
| Analog bandwidth | 5 MHz |
| Buffer memory | 16 MS |
| Input coupling | AC/DC |
| Input impedance | 1 MΩ‖15 pF |
| Operating Temperature | 5 to 40 °C |
Table 4.
Technical specifications of the Shimadzu AG-100kNX2 universal testing machine.
Table 4.
Technical specifications of the Shimadzu AG-100kNX2 universal testing machine.
| Parameter | Specification |
|---|
| Model | AG-100kNX2 (AUTOGRAPH AGS-X2 Series) |
| Testing type | Universal testing machine (tension/compression) |
| Maximum load capacity | 100 kN |
| Load measurement accuracy | ±0.5% of indicated value (ISO 7500-1 Class 0.5) |
| Load cell precision range | 1/500 to 1/1 of rated capacity |
| Crosshead speed range | 0.0001–500 mm/min (stepless) |
| Crosshead speed accuracy | ±0.1% |
| Maximum return speed | 550 mm/min |
| Software | TRAPEZIUM X-V materials testing software, Shimadzu Corporation: Kyoto, Japan, 2020. |
| Operating temperature | 5–40 °C |
| Applicable standards | ISO 7500-1; Metallic Materials—Calibration and Verification of Static Uniaxial Testing Machines—Part 1: Tension/Compression Testing Machines—Calibration and Verification of the Force-Measuring System. International Organization for Standardization (ISO): Geneva, Switzerland, 2018 ASTM E4; Standard Practices for Force Verification of Testing Machines. ASTM International: West Conshohocken, PA, USA, 2020 JIS B7721; Testing Machines for Tensile, Compressive and Bending Tests—Verification and Calibration of the Force-Measuring System. Japanese Standards Association (JSA): Tokyo, Japan, 2018 |
Table 5.
Hyperparameter Configuration of Hierarchical Models.
Table 5.
Hyperparameter Configuration of Hierarchical Models.
| Parameter | Stage 1 | Stage 2 |
|---|
| Task | Classification between Crown and Root | 5-Class Crown Position |
| Training Samples | 3000 | 1500 |
| Train/Test Split | 80%/20% | 80%/20% |
| Loss Function | Cross-Entropy | Cross-Entropy |
| Optimizer | AdamW | AdamW |
| Learning Rate | 1 × | 5 × |
| Weight Decay | 1 × | 1 × |
| Batch Size | 64 | 64 |
| Max Epochs | 500 | 500 |
| Early Stopping | 20 epochs | 20 epochs |
Table 6.
Architecture configuration of CNN2D–BiGRU. (Stage 1).
Table 6.
Architecture configuration of CNN2D–BiGRU. (Stage 1).
| Module | Configuration | Output Size |
|---|
| Input | CWT scalogram | Output Size |
| Conv Block 1 | Conv (3 → 32), 3 × 3, BN + ReLU × 2, MaxPool (2 × 2), Dropout (0.2) | 32 × 112 × 112 |
| Conv Block 2 | Conv (32 → 64), 3 × 3, BN + ReLU × 2, MaxPool (2 × 2), Dropout (0.2) | 64 × 56 × 56 |
| Conv Block 3 | Conv (64 → 128), 3 × 3, BN + ReLU × 2, MaxPool (2 × 2), Dropout (0.2) | 128 × 28 × 28 |
| Conv Block 4 | Conv (128 → 256), 3 × 3, BN + ReLU, MaxPool (2 × 2), Dropout (0.2) | 256 × 14 × 14 |
| CNN Output | Feature Map | 256 × 14 × 14 |
| Reshape | Spatial-to-Sequence (14 × 14 → 196 steps) | 196 × 256 |
| BiGRU | Hidden size = 128, 1 layer, Bidirectional | 196 × 256 |
| Attention | Linear (256 → 128), Softmax over 196 steps | 256 |
| Classifier | Fully Connected + Softmax (2 classes) | 2 |
| Output | Root/Crown | 2 |
Table 7.
Architecture configuration of ResNet-18 with BiLSTM–Attention. (Stage 2).
Table 7.
Architecture configuration of ResNet-18 with BiLSTM–Attention. (Stage 2).
| Module/Block | Configuration | Output Size |
|---|
| Input | RGB CWT Image | 3 × 224 × 224 |
| Preprocessing | Resize + Normalize | 3 × 224 × 224 |
| CNN Stem | Conv 7 × 7, s = 2, 64 ch + BN + ReLU + MaxPool 3 × 3, s = 2 | 64 × 56 × 56 |
| Residual Stage 1 | 2 × Residual Blocks, 64 ch, s = 1 | 64 × 56 × 56 |
| Residual Stage 2 | 2 × Residual Blocks, 128 ch, s = 2 + projection shortcut | 128 × 28 × 28 |
| Residual Stage 3 | 2 × Residual Blocks, 256 ch, s = 2 + projection shortcut | 256 × 14 × 14 |
| Residual Stage 4 | 2 × Residual Blocks, 512 ch, s = 2 + projection shortcut | 512 × 7 × 7 |
| Dropout2D | p = 0.2 | 512 × 7 × 7 |
| Spatial-to-Sequence | Reshape 7 × 7 → 49 tokens | 49 × 512 |
| BiLSTM | 2 layers, bidirectional, hidden size = 512 | 49 × 1024 |
| Multi-Head Attention | 8 heads, hidden dim = 1024 | 49 × 1024 |
| LayerNorm + Dropout | LayerNorm, p = 0.2 | 49 × 1024 |
| Global Average Pooling | AdaptiveAvgPool1D | 1024 |
| Global Max Pooling | AdaptiveMaxPool1D | 1024 |
| Feature Fusion | Concatenate (Avg + Max) | 2048 |
| FC Layer 1 | Linear 2048 → 512 + BN + ReLU + Dropout | 512 |
| FC Layer 2 | Linear 512 → 256 + BN + ReLU + Dropout | 256 |
| FC Layer 3 | Linear 256 → 128 + BN + ReLU + Dropout | 128 |
| Output Layer | Linear 128 → 5 + Softmax | 5 |
Table 8.
Classification Performance Metrics of the CNN2D–BiGRU.
Table 8.
Classification Performance Metrics of the CNN2D–BiGRU.
| Class | Precision | Recall | F1-Score |
|---|
| Crown | 0.9934 | 1.000 | 0.9967 |
| Root | 1.000 | 0.9933 | 0.9966 |
| Overall (Macro F1) | - | - | 0.9967 |
Table 9.
Classification Performance Metrics of CNN2D–BiLSTM.
Table 9.
Classification Performance Metrics of CNN2D–BiLSTM.
| Class | Precision | Recall | F1-Score |
|---|
| Incisal | 0.9672 | 0.9833 | 0.9752 |
| Labial | 0.9836 | 1.0000 | 0.9917 |
| Left | 1.0000 | 1.0000 | 1.0000 |
| Palatal | 1.0000 | 1.0000 | 1.0000 |
| Right | 1.0000 | 0.9667 | 0.9831 |
Table 10.
Five-fold stratified cross-validation results—Stage 1: CNN2D–BiGRU component classification Root vs. Crown, n = 3000, balanced 1500/class).
Table 10.
Five-fold stratified cross-validation results—Stage 1: CNN2D–BiGRU component classification Root vs. Crown, n = 3000, balanced 1500/class).
| Metric | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean ± SD |
|---|
| Accuracy | 0.9917 | 0.9967 | 0.9933 | 0.9950 | 0.9900 | 0.9933 ± 0.0024 |
| Precision | 0.9917 | 0.9967 | 0.9934 | 0.9951 | 0.9901 | 0.9934 ± 0.0024 |
| Recall | 0.9917 | 0.9967 | 0.9934 | 0.9950 | 0.9900 | 0.9934 ± 0.0024 |
| F1-Score | 0.9917 | 0.9967 | 0.9934 | 0.995 | 0.9900 | 0.9934 ± 0.0024 |
Table 11.
Five-fold stratified cross-validation results—Stage 2: ResNet-18–BiLSTM–MHA crown position classification (5-class, n = 1500, balanced 300/class).
Table 11.
Five-fold stratified cross-validation results—Stage 2: ResNet-18–BiLSTM–MHA crown position classification (5-class, n = 1500, balanced 300/class).
| Metric | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean ± SD |
|---|
| Accuracy | 0.9867 | 0.9833 | 0.9900 | 0.9800 | 0.9733 | 0.9827 ± 0.0062 |
| Precision | 0.9871 | 0.9837 | 0.9902 | 0.9806 | 0.9740 | 0.9831 ± 0.0059 |
| Recall | 0.9867 | 0.9833 | 0.9900 | 0.9800 | 0.9733 | 0.9827 ± 0.0062 |
| F1-Score (Incisal) | 0.9672 | 0.9748 | 0.9752 | 0.9672 | 0.9500 | 0.9669 ± 0.0092 |
| F1-Score (Labial) | 0.9917 | 0.9836 | 0.9917 | 0.9917 | 0.9756 | 0.9869 ± 0.0067 |
| F1-Score (Left) | 1.0000 | 0.9836 | 1.0000 | 0.9836 | 0.9836 | 0.9902 ± 0.0083 |
| F1-Score (Palatal) | 0.9916 | 1.0000 | 1.0000 | 0.9916 | 1.0000 | 0.9966 ± 0.0042 |
| F1-Score (Right) | 0.9831 | 0.9744 | 0.9831 | 0.9655 | 0.9565 | 0.9725 ± 0.0108 |
Table 12.
Performance Comparison of Different Deep Learning Architectures for Crown Position Classification.
Table 12.
Performance Comparison of Different Deep Learning Architectures for Crown Position Classification.
| Model | Accuracy (%) | F1-Score | Parameters | Training Time per Epoch(s) | Improvement (%) |
|---|
| CNN2D + FC | 93.50 | 0.9340 | 5.2 M | 45 | −5.50% |
| CNN2D + BiLSTM | 96.50 | 0.9645 | 8.5 M | 68 | −2.50% |
| CNN + Transformer Encoder | 97.50 | 0.9747 | 18.3 M | 112 | −1.50% |
| ResNet-18 + BiLSTM + MHA (Proposed) | 99.00 | 0.9900 | 27.1 M | 95 | - |
Table 13.
Ablation Study—Stage 1: CNN2D–BiGRU component classification.
Table 13.
Ablation Study—Stage 1: CNN2D–BiGRU component classification.
| Model | Performance Metrics |
|---|
| Accuracy | Precision | Recall | F1-Score |
|---|
| CNN2D + FC | 0.9383 | 0.9386 | 0.9383 | 0.9383 |
CNN2D + GRU (unidirectional) | 0.9583 | 0.9585 | 0.9583 | 0.9583 |
CNN2D + BiGRU (without attention) | 0.9800 | 0.9801 | 0.9800 | 0.9800 |
CNN2D + BiGRU + Attention (proposed) | 0.9967 | 0.9967 | 0.9967 | 0.9967 |
| Difference (GRU inidirectional to BiGRU) | +0.0217 | +0.0216 | +0.0217 | +0.0217 |
| Difference (Attention gain, BiGRU to proposed) | +0.0167 | +0.0166 | +0.0167 | +0.0167 |
Table 14.
Ablation Study—Stage 2: ResNet-18 + BiLSTM + Multi-Head Attention crown position classification.
Table 14.
Ablation Study—Stage 2: ResNet-18 + BiLSTM + Multi-Head Attention crown position classification.
| Model | Performance Metrics |
|---|
| Accuracy | Precision | Recall | F1-Score |
|---|
| ResNet-18 + FC | 0.9717 | 0.9715 | 0.9717 | 0.9714 |
| ResNet-18 + BiLSTM (without attention) | 0.9800 | 0.9798 | 0.9800 | 0.9797 |
| ResNet-18 + LSTM + MHA (unidirectional) | 0.9850 | 0.9848 | 0.9850 | 0.9847 |
| ResNet-18 + BiLSTM + MHA (proposed) | 0.9900 | 0.9900 | 0.9900 | 0.9900 |
| Difference (BiLSTM to LSTM + MHA unidirectional) | +0.0050 | +0.0050 | +0.0050 | +0.0050 |
| Difference (BiLSTM gain, LSTM unidirectional to proposed) | +0.0050 | +0.0052 | +0.0050 | +0.0053 |
Table 15.
Inference Performance Metrics for Operational Deployment.
Table 15.
Inference Performance Metrics for Operational Deployment.
| Metric | Single Sample | Batch (100) |
|---|
| Latency | 12.3 ms | 847 ms (total for samples) |
| Throughput | 81.3 sample/s | 118 sample/s |