Self-Supervised Learning of Deep Embeddings for Classification and Identification of Dental Implants
Abstract
1. Introduction
- We replace MAE’s reconstruction of masked patches with the reconstruction of patch embeddings. Consequently, our loss is the simple distance between predicted and computed embeddings over masked patches.
- Our proposed method yields substantial performance improvement, surpassing all state-of-the-art methods in the dental implant detection task.
- The labeling of implant design extends the horizon of possible dental applications.
2. Dental Implant Design
3. Methods
3.1. Two-Stage Implant Detection Methodology
- Handling Missing Implant Parts by developing post-processing strategies to infer or estimate missing parts based on the detected components. We implement techniques such as predictive models [32], spline interpolation [33], and adaptive thresholds [34] to enhance robustness in the presence of incomplete information.
- Analyzing spatial relationships between detected parts to refine the assembly process and improve the accuracy of the final representation.
- Employing clustering algorithms, such as K-Means Clustering [35], to group related implant design parts, adapting to variations in implant geometry, and aiding in the identification of missing components.
- Implementing heuristics based on known implant geometries to guide the assembly process, especially when dealing with missing parts.
3.2. Self-Supervised Pre-Training with Masked Autoencoders
3.3. Architectures for Downstream Tasks
4. Experiments
4.1. Dataset
4.2. Evaluation Metric
4.3. Implementation Details
5. Results and Analysis
5.1. MDE Reconstruction
5.2. Dental Implant Classification and Identification
5.3. Qualitative Results
5.4. Parameter Setting
6. Conclusions
Limitations
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Shulman, L.; Driskell, T. Dental implants: A historical perspective. In Implants in Dentistry; WB Saunders: Philadelphia, PA, USA, 1997; p. 6. [Google Scholar]
- Boven, G.; Raghoebar, G.; Vissink, A.; Meijer, H. Improving masticatory performance, bite force, nutritional state and patient’s satisfaction with implant overdentures: A systematic review of the literature. J. Oral Rehabil. 2015, 42, 220–233. [Google Scholar] [CrossRef]
- Kanehira, Y.; Arai, K.; Kanehira, T.; Nagahisa, K.; Baba, S. Oral health-related quality of life in patients with implant treatment. J. Adv. Prosthodont. 2017, 9, 476–481. [Google Scholar] [CrossRef]
- De Kok, I.J.; Duqum, I.S.; Katz, L.H.; Cooper, L.F. Management of implant/prosthodontic complications. Dent. Clin. 2019, 63, 217–231. [Google Scholar] [CrossRef] [PubMed]
- Hashim, D.; Cionca, N.; Combescure, C.; Mombelli, A. The diagnosis of peri-implantitis: A systematic review on the predictive value of bleeding on probing. Clin. Oral Implant. Res. 2018, 29, 276–293. [Google Scholar] [CrossRef]
- Hamet, P.; Tremblay, J. Artificial intelligence in medicine. Metabolism 2017, 69, S36–S40. [Google Scholar] [CrossRef]
- Lee, J.H.; Kim, D.H.; Jeong, S.N.; Choi, S.H. Detection and diagnosis of dental caries using a deep learning-based convolutional neural network algorithm. J. Dent. 2018, 77, 106–111. [Google Scholar] [CrossRef]
- Yamaguchi, S.; Lee, C.; Karaer, O.; Ban, S.; Mine, A.; Imazato, S. Predicting the debonding of CAD/CAM composite resin crowns with AI. J. Dent. Res. 2019, 98, 1234–1238. [Google Scholar] [CrossRef] [PubMed]
- Takahashi, T.; Nozaki, K.; Gonda, T.; Ikebe, K. A system for designing removable partial dentures using artificial intelligence. Part 1. Classification of partially edentulous arches using a convolutional neural network. J. Prosthodont. Res. 2021, 65, 115–118. [Google Scholar] [CrossRef] [PubMed]
- Joda, T.; Waltimo, T.; Probst-Hensch, N.; Pauli-Magnus, C.; Zitzmann, N.U. Health data in dentistry: An attempt to master the digital challenge. Public Health Genom. 2019, 22, 1–7. [Google Scholar] [CrossRef]
- Hamamci, I.E.; Er, S.; Simsar, E.; Sekuboyina, A.; Gundogar, M.; Stadlinger, B.; Mehl, A.; Menze, B. Diffusion-Based Hierarchical Multi-label Object Detection to Analyze Panoramic Dental X-Rays. In Proceedings of the Medical Image Computing and Computer Assisted Intervention—MICCAI 2023, Vancouver, BC, Canada, 8–12 October 2023; Springer: Cham, Switzerland, 2023; pp. 389–399. [Google Scholar]
- Shi, J.; Sun, B.; Ye, X.; Wang, Z.; Luo, X.; Liu, J.; Gao, H.; Li, H. Semantic Decomposition Network With Contrastive and Structural Constraints for Dental Plaque Segmentation. IEEE Trans. Med. Imaging 2022, 42, 935–946. [Google Scholar] [CrossRef]
- Liu, Z.; He, X.; Wang, H.; Xiong, H.; Zhang, Y.; Wang, G.; Hao, J.; Feng, Y.; Zhu, F.; Hu, H. Hierarchical self-supervised learning for 3D tooth segmentation in intra-oral mesh scans. IEEE Trans. Med. Imaging 2022, 42, 467–480. [Google Scholar] [CrossRef]
- Chen, G.; Qin, J.; Amor, B.B.; Zhou, W.; Dai, H.; Zhou, T.; Huang, H.; Shao, L. Automatic detection of tooth-gingiva trim lines on Dental Surfaces. IEEE Trans. Med. Imaging 2023, 42, 3194–3204. [Google Scholar] [CrossRef]
- Hwang, J.J.; Jung, Y.H.; Cho, B.H.; Heo, M.S. An overview of deep learning in the field of dentistry. Imaging Sci. Dent. 2019, 49, 1–7. [Google Scholar] [CrossRef]
- Implants Dataset. 2023. Available online: https://universe.roboflow.com/al-xfpta/implants-meckw (accessed on 20 February 2024).
- Bao, H.; Dong, L.; Piao, S.; Wei, F. Beit: Bert pre-training of image transformers. arXiv 2021, arXiv:2106.08254. [Google Scholar]
- He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask r-cnn. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2961–2969. [Google Scholar]
- Xie, Z.; Zhang, Z.; Cao, Y.; Lin, Y.; Bao, J.; Yao, Z.; Dai, Q.; Hu, H. Simmim: A simple framework for masked image modeling. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 9653–9663. [Google Scholar]
- Bozorgtabar, B.; Mahapatra, D.; Thiran, J.P. AMAE: Adaptation of Pre-trained Masked Autoencoder for Dual-Distribution Anomaly Detection in Chest X-Rays. In Proceedings of the Medical Image Computing and Computer Assisted Intervention—MICCAI 2023, Vancouver, BC, Canada, 8–12 October 2023; Springer: Cham, Switzerland, 2023; pp. 195–205. [Google Scholar]
- Zhou, L.; Liu, H.; Bae, J.; He, J.; Samaras, D.; Prasanna, P. Self pre-training with masked autoencoders for medical image classification and segmentation. In Proceedings of the 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI), Cartagena, Colombia, 18–21 April 2023; IEEE: New York, NY, USA, 2023; pp. 1–6. [Google Scholar]
- Almalki, A.; Latecki, L.J. Self-Supervised Learning With Masked Autoencoders for Teeth Segmentation From Intra-Oral 3D Scans. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA, 3–8 January 2024; pp. 7820–7830. [Google Scholar]
- Almalki, A.; Latecki, L.J. Self-supervised learning with masked image modeling for teeth numbering, detection of dental restorations, and instance segmentation in dental panoramic radiographs. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA, 2–7 January 2023; pp. 5594–5603. [Google Scholar]
- Almalki, A.; Latecki, L.J. Enhanced Masked Image Modeling for Analysis of Dental Panoramic Radiographs. In Proceedings of the 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI), Cartagena, Colombia, 18–21 April 2023; IEEE: New York, NY, USA, 2023; pp. 1–5. [Google Scholar]
- He, K.; Chen, X.; Xie, S.; Li, Y.; Dollár, P.; Girshick, R. Masked autoencoders are scalable vision learners. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 16000–16009. [Google Scholar]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; et al. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. In Proceedings of the ICLR, Virtual, 3–7 May 2021. [Google Scholar]
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, QC, Canada, 10–17 October 2021; pp. 10012–10022. [Google Scholar]
- Almalki, A.; Almalki, A.; Latecki, L.J. Self-supervised masked deep embeddings for dental caries detection. IEEE Access 2025, 13, 161206–161216. [Google Scholar] [CrossRef]
- Almalki, A.; Latecki, L.J. Masked Deep Embeddings of Patches for Teeth Segmentation in Cone Beam CT Images. In Proceedings of the 2024 IEEE International Symposium on Biomedical Imaging (ISBI), Athens, Greece, 27–30 May 2024; IEEE: New York, NY, USA, 2024; pp. 1–5. [Google Scholar]
- Almalki, A.H. Advancements in Artificial Intelligence and Computer Vision for Dental Imaging Analysis: Self-Supervised Learning Innovations; Temple University: Philadelphia, PA, USA, 2024. [Google Scholar]
- Brooks, J. COCO Annotator. 2019. Available online: https://github.com/jsbroks/coco-annotator/ (accessed on 14 March 2022).
- Yao, W.; Li, L. A new regression model: Modal linear regression. Scand. J. Stat. 2014, 41, 656–671. [Google Scholar] [CrossRef]
- Habermann, C.; Kindermann, F. Multidimensional spline interpolation: Theory and applications. Comput. Econ. 2007, 30, 153–169. [Google Scholar] [CrossRef]
- Siqueira, B.R.; Ferrari, F.C.; Souza, K.E.; Camargo, V.V.; de Lemos, R. Testing of adaptive and context-aware systems: Approaches and challenges. Softw. Test. Verif. Reliab. 2021, 31, e1772. [Google Scholar] [CrossRef]
- Lloyd, S. Least squares quantization in PCM. IEEE Trans. Inf. Theory 1982, 28, 129–137. [Google Scholar] [CrossRef]
- Assran, M.; Duval, Q.; Misra, I.; Bojanowski, P.; Vincent, P.; Rabbat, M.; LeCun, Y.; Ballas, N. Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 17–24 June 2023. [Google Scholar]
- Lin, T.Y.; Dollár, P.; Girshick, R.; He, K.; Hariharan, B.; Belongie, S. Feature pyramid networks for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 2117–2125. [Google Scholar]
- Paszke, A.; Gross, S.; Massa, F.; Lerer, A.; Bradbury, J.; Chanan, G.; Killeen, T.; Lin, Z.; Gimelshein, N.; Antiga, L.; et al. Pytorch: An imperative style, high-performance deep learning library. Adv. Neural Inf. Process. Syst. 2019, 32, 8026–8037. [Google Scholar]
- Loshchilov, I.; Hutter, F. Decoupled weight decay regularization. arXiv 2017, arXiv:1711.05101. [Google Scholar]
- Ultralytics. YOLOv5: A State-of-the-Art Real-Time Object Detection System. 2021. Available online: https://docs.ultralytics.com (accessed on 20 February 2024).





| Coronal | Middle | Apical |
|---|---|---|
| Bone level | Parallel fin | Hole round |
| Tissue level | Tapered fin | Hole oblong |
| Microthread | Parallel square | Parallel groove |
| Fin | Tapered square | Tapered groove |
| Square | Parallel no threads | Parallel no groove |
| No threads | Tapered no threads | Tapered no groove |
| V-shaped | Parallel V-shaped | Apex shape flat |
| Rounded | Tapered V-shaped | Apex shape cone |
| Buttress | Parallel rounded | Apex shape dome |
| Reverse buttress | Tapered rounded | Apex shape semi-dome |
| Parallel buttress | ||
| Tapered buttress | ||
| Parallel reverse buttress | ||
| Tapered reverse buttress |
| Design Category | Class | Count |
|---|---|---|
| Coronal | Bone level | 1240 |
| Tissue level | 870 | |
| Microthread | 410 | |
| No threads | 208 | |
| Middle | Parallel body | 1935 |
| Tapered body | 621 | |
| V-shaped threads | 382 | |
| Apical | Hole round | 1710 |
| Hole oblong | 545 | |
| Apex cone | 364 | |
| Apex flat | 188 |
| Initialization | Backbone | Pre-Training Data | |
|---|---|---|---|
| YOLOv5 [40] | CSPDarknet53 | IN-1K w/Labels | 91.5 |
| Random | ViT-B | None | 91.9 |
| Supervised | ViT-B | IN-1K w/Labels | 92.6 |
| MAE | ViT-B | IN-1K | 93.2 |
| MDE (ours) | ViT-B | IN-1K | 94.9 |
| Initialization | Backbone | Pre-Training Data | |
|---|---|---|---|
| Random | ViT-B | None | 92.4 |
| Supervised | ViT-B | IN-1K w/Labels | 93.2 |
| MAE | ViT-B | IN-1K | 94.0 |
| MDE (ours) | ViT-B | IN-1K | 96.1 |
| Pipeline Stage | Description | AP Contribution () | FPS |
|---|---|---|---|
| Part Detection (Stage 1) | Detection of implant design parts | +2.1 | 32 |
| Bounding-Box Inference (Stage 2) | Assembly of parts into final implant box | +1.3 | 58 |
| Full Two-Stage Pipeline | End-to-end detection system | +3.4 (total) | 28 |
| Mask Ratio | Pre-Training Epochs | |
|---|---|---|
| 65% | 100 | 92.5 |
| 55% | 100 | 93.2 |
| 55% | 800 | 91.6 |
| 45% | 100 | 94.0 |
| 35% | 100 | 94.4 |
| 25% | 100 | 94.9 |
| 15% | 100 | 94.3 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Almalki, A.; Almalki, A.; Latecki, L.J. Self-Supervised Learning of Deep Embeddings for Classification and Identification of Dental Implants. J. Imaging 2026, 12, 39. https://doi.org/10.3390/jimaging12010039
Almalki A, Almalki A, Latecki LJ. Self-Supervised Learning of Deep Embeddings for Classification and Identification of Dental Implants. Journal of Imaging. 2026; 12(1):39. https://doi.org/10.3390/jimaging12010039
Chicago/Turabian StyleAlmalki, Amani, Abdulrahman Almalki, and Longin Jan Latecki. 2026. "Self-Supervised Learning of Deep Embeddings for Classification and Identification of Dental Implants" Journal of Imaging 12, no. 1: 39. https://doi.org/10.3390/jimaging12010039
APA StyleAlmalki, A., Almalki, A., & Latecki, L. J. (2026). Self-Supervised Learning of Deep Embeddings for Classification and Identification of Dental Implants. Journal of Imaging, 12(1), 39. https://doi.org/10.3390/jimaging12010039

