Next Article in Journal
“One and a Half Years of Things We Could Have Done”: Multi-Method Analysis of the Narratives of Adolescents with Type 1 Diabetes during the COVID-19 Pandemic
Previous Article in Journal
Public Perceptions on the Policy of Electronic Cigarettes as Medical Products on Twitter
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Dentist’s Ability to Detect Different Restorative Procedures on Periapical Radiographs—Results from a Reliability Study

Department of Conservative Dentistry and Periodontology, University Hospital, LMU, 80336 Munich, Germany
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2023, 20(3), 2619; https://doi.org/10.3390/ijerph20032619
Submission received: 12 December 2022 / Revised: 26 January 2023 / Accepted: 30 January 2023 / Published: 1 February 2023
(This article belongs to the Section Oral Health)

Abstract

:
(1) Background: This in vitro reliability study aimed to determine the inter- and intra-examiner reliability for the detection of direct fillings, indirect crown restorations, root canal fillings and implants on periapical radiographs. (2) Methods: Fourteen dentists (<2 years of clinical experience = 11; >2 years of clinical experience = 3) participated in this diagnostic reliability study in which included a theoretical and practical educational training prior to data collection. The image set of periapical radiographs (N = 150) was examined in two evaluation rounds by all the dentists. Cohen’s Kappa (CK) and a binary logistic regression model were computed. (3) Results: The inter- and intra-examiner reliability were found to be in almost perfect agreement: direct fillings (1st round 0.859/2nd round 0.844/intra 0.910), indirect crown restorations (0.932/0.926/0.955), root canal fillings (0.920/0.886/0.941) and dental implants (0.994/0.988/0.987). The binary logistic regression model revealed that the “evaluation round” and “dentist’s clinical experience” had no significant influence, but for the “diagnostic category”; small, but statistically significant differences were documented. (4) Conclusions: The reliability for detecting direct and indirect restorations, root canal fillings or implants on periapical radiographs was found to be high in the present reliability study on periapical radiographs.

1. Introduction

Periapical radiographs are frequently used to detect and evaluate different pathologies or restorative procedures in dentistry. In the case of dental restorations, direct fillings (Figure 1a), indirectly manufactured crowns (Figure 1b), root canal fillings (Figure 1c) or dental implants (Figure 1d) must be mainly considered and will appear on X-rays in variant shapes, configurations and different radiopacities according to their clinical specifications and material properties. When conducting diagnostic studies—also under the inclusion of dental restorations on dental radiographs—then, it is important to safeguard an excellent familiarity of all the examiners with the corresponding diagnostic categories. Here, reliability studies are the study type of choice to document the diagnostic outcome between examiners. This approach is well established for the detection of caries, and numerous reliability or reproducibility studies have been conducted in recent decades aimed at assessing mostly new diagnostic criteria and/or detection methods [1,2,3,4,5,6]. In contrast, little information is available about the dentist’s ability to reliably detect different restorative procedures on dental radiographs. Here, only a few reliability studies have been published thus far that have considered the assessment of direct restorations [7], the marginal adaptation of indirect restorations [8] or the presence and quality of endodontic treatments [9,10]. With respect to the limited amount of available information, therefore, this study aimed to determine the inter- and intra-examiner reliability for the detection of direct fillings, indirect crown restorations, root canal fillings and dental implants on periapical radiographs. The hypothesis of this study was that the dentist’s clinical experience and the evaluation round influence the detection of different restorative procedures.

2. Materials and Methods

This study on diagnostic reliability was part of the scientific project on „Detection and diagnostics of dental pathologies with artificial intelligence” at the Department of Conservative Dentistry and Periodontology of the Ludwig-Maximilians University (LMU) of Munich. This project was approved by the Ethics Committee of the Medical Faculty of the LMU (project number 20-0798). The reporting of this investigation followed the GRRAS guidelines for reporting reliability and agreement studies [11].

2.1. Study Participants

A group of 14 dentists participated in this reliability study. Eleven participants had less than two years of clinical experience; three dentists (all doctors of medicine in dentistry) had worked longer than two years at the university-based dental school. The whole study group took part in theoretical and practical educational training under supervision of the principal investigator (JK) prior to data collection. During the introduction, the study design and diagnostic categories were explained. Furthermore, numerous dental radiographs (N > 500) with frequent and rare pathological findings and restorative procedures were viewed, evaluated and discussed during the training workshop. Subsequently, the same set of periapical radiographs—which was not part of the training course—was analyzed in two evaluation rounds by all the dentists with a minimum interval of 4 weeks to ensure blinding between the diagnostic decisions. All participants were encouraged to perform all the evaluations independently from each other.

2.2. Set of Dental Radiographs

For this investigation, a few hundred fully anonymized periapical digital radiographs were primarily selected. All the images were taken at the Department of Conservative Dentistry and Periodontology of the LMU using dental X-ray machines with a 203 mm tube (Heliodent DS, Sirona, Bensheim, Germany), including an X-ray field limitation (30 × 40 mm) and a charge-coupled device (CCD) sensor (Intraoral II, sensor size 30.7 × 40.7 mm, Sirona, Bensheim, Germany). The indication-related exposure time was set to 0.06–0.08 s at a cathode voltage of 60 kV and an amperage of 7 mA. A sensor-holding device (XPP-DS digital sensor holders for Sirona, Dentsply Rinn, Elgin, IL, USA) was available and used if applicable.
The primary selected set of images was screened according to the following inclusion criteria: (1) Each X-ray should present at least one direct restoration, crown, root canal filling or dental implant. Here, no distinction between different types of restoration, root canal filling materials or implant fabricates was made. (2) X-rays from anterior/posterior and upper/lower teeth were selected. (3) The main teeth need to be fully imaged. (4) All the periapical radiographs had to be correctly exposed and should have good contrast and brightness. In the case of distortions, prominent superimposing effects or other qualitative deficiencies, the corresponding X-ray was excluded. Furthermore, any treatment-related radiographs or those to estimate the working length during endodontic therapy were excluded from this reliability study. Finally, 150 periapical radiographs were identified according to the inclusion criteria and a unique identification number was assigned to each image.

2.3. Diagnostic Standards

The diagnostic evaluation of all the periapical radiographs used the following standards. Direct restorations, such as composite, cement or amalgam restoration, were diagnosed independently from its location, shape and radiopacity in one category as illustrated in Figure 1a. Indirectly manufactured crowns—detectable as ceramic, porcelain fused to metal or metal crowns—were scored in an additional category (Figure 1b). Further distinctions between restorative materials or insufficiencies [12,13,14] were not made in this study. Root canal fillings were recorded independently from the filling material (Figure 1c). In addition, in this category, no differentiation between a sufficient (full-length, complete and homogenous definitive root canal filling) or an insufficient root canal filling (e.g., over- and underfilling, inconsistencies or voids) was made [15]. The image set further contained images with implants from several manufacturers (Figure 1d). Again, each of the abovementioned categories was chosen when at least one dental finding was detectable. This resulted in a dichotomous decision for each restorative procedure. Again, a distinction between different restorative or root canal filling materials, implant types and qualities was not performed.

2.4. Consensus Decision (Reference Standard)

After completion of the two evaluation rounds, the whole study group re-assessed all the X-rays together during a separate meeting to determine a consensus decision for each periapical radiograph. Here, a dichotomous decision (0 vs. 1) was made again for each of the abovementioned diagnostic categories. If an examiner expressed a divergent finding, the group re-assessed the corresponding X-ray and discussed their points until consensus was reached. This final evaluation served as a reference standard. In total, 101 direct restorations, 63 indirect restorations, 9 root canal fillings and 31 implants were identified.

2.5. Data Management and Statistical Analysis

All the data from all the participating dentists (N = 14), two evaluation rounds (N = 2) and the reference standard were entered into an Excel spreadsheet (Excel 2019, Microsoft, Redmond, WA, USA) and checked for plausibility before analysis. Descriptive and explorative data analysis was performed using Excel and SPSS (SPSS Statistics 27, 2020, IBM Corporation, Armonk, NY, USA). The statistical analysis included the calculation of the inter- and intra-examiner reliability among all the participants and the reference standard. For the explorative analysis, Cohen’s Kappa (CK) was computed. To provide an overall value, the arithmetic mean of these estimates was calculated. CK values within the below-mentioned ranges need to be interpreted as follows: 0.00 to 0.20—slight agreement, 0.21 to 0.40—fair agreement, 0.41 to 0.60—moderate agreement, 0.61 to 0.80—substantial agreement and 0.81 to 1.00—(almost) perfect agreement [16,17,18]. In addition, binary logistic regression analysis using a backward elimination model was performed for the outcome (correct/incorrect diagnostic decision in relation to the reference standard). The analysis was adjusted for the diagnostic category, evaluation round and dentist experience (less vs. more than two years of clinical experience).

3. Results

The inter- and intra-examiner reliability data in terms of CK can be taken from the corresponding Table for all the participating dentists in relation to the detection of direct fillings (Table 1), indirectly manufactured crowns (Table 2), root canal fillings (Table 3) and implants (Table 4). The mean CK values are summarized in Table 5. In detail, the inter-examiner reliability of the first evaluation round was in an almost perfect agreement range, with the highest CK for dental implants (0.994). The mean CK values for root canal fillings (0.920) and crown restorations (0.932) were nearly the same, followed by direct fillings (0.859). The mean inter-examiner CK values for the first and second evaluation round were found to be of the same order of magnitude (Table 5). The intra-examiner reliability was documented to be excellent throughout all the diagnostic categories (0.910 to 0.987; Table 5).
The level of agreement in comparison to the reference standard indicated only small differences in all four diagnostic categories between the two different evaluation rounds (Table 5). The mean CK values ranged from 0.854/0.837 (direct restorations) to 0.996/0.991 (implants) and were found to be in almost perfect agreement.
The reliability data were further explored by using a binary logistic regression model (Table 6). The results showed that the variables “evaluation round” and “dentist’s clinical experience” had no significant influence on reliability. In contrast, the detection of the included restorative procedure showed statistically significant odds ratios when using implants as a reference. Here, the detection of direct fillings, crown restorations and root canal fillings on periapical radiographs was found to be associated with less reliable diagnostic decisions.

4. Discussion

The aim of this diagnostic study was to determine the inter- and intra-examiner reliability for different restorative procedures—direct and indirect restorations, root canal fillings and implants—on periapical radiographs. In general, the overall documented CK values were in perfect agreement for each diagnostic category (Table 5). In contrast, the binary logistic regression model (Table 6) revealed significant differences. While the evaluation round and the dentist’s clinical experience had no statistical influence on reliability, direct fillings, crown restorations and root canal fillings were detected with a significantly lower reliability in comparison to implants. With respect to this significant difference, the above formulated hypothesis had to be rejected.
When discussing this finding, it should be noted that the documented CK values signal an almost perfect agreement and the recorded deviations between the diagnostic categories were found to be small. However, these differences were tested as significant and should not be overrated. Dental implants are a large-sized and radiopaque structure that can be easily detected on all types of X-rays. Miss-classification might predominantly be caused by documentation errors rather than detection errors. This also applies for metal crowns or amalgam fillings, which show a high radiopacity on X-rays [19]. In contrast, it might be more probable to miss small direct fillings made of composite or gracile temporary endodontic treatments, which are typically less radiopaque than other materials [20].
In daily dental practice, the evaluation of X-rays has to be considered routine; and many different variables have to be screened, detected and, if present, evaluated by the dentist. In contrast, there are surprisingly little scientific data available on how reliable professionals can identify dental restorations, root canal fillings or implants on X-rays. This primarily justified the present study in combination with the aim of determining the diagnostic competence of more or less experienced dentists. With respect to the spectrum of available restoration measures, we prioritized the selection of restoration measures (Figure 1) and further agreed on the dichotomous detection. However, this might be linked to a simpler decision making in comparison to the use of complex classification systems with multiple scores. When considering our results for direct restorations in detail, it must be noted that both the mean inter-examiner CK values amounted to 0.859/0.844 (between all the examiners) and 0.854/0.837 (in relation to the reference standard) and the intra-examiner CK value of 0.910 (Table 5) were found to be high. With respect to the lack of comparable studies, the discussion of the data is subject to restrictions. In a forensic dental study [21], dentists specialized in conservative dentistry reached a high CK value for restoration detection (0.83), followed by dentists specialized in oral surgery (0.64) and dental students in their final semester (0.57). The data support the assumption that there were significant differences between beginners and more experienced practitioners, which were not documented in our study (Table 1 and Table 5). This may show that intensive theoretical and practical educational training prior to data collection had an obviously positive impact on the outcome. This resulted in the fact that the level of knowledge of the young dentists was found to be similar to that of the experienced dentists (Table 6). For indirect restorations, high CK values for inter- and intra-examiner reliability (Table 2 and Table 5) were also observed in the present study. In this context, Mauad et al. [8] investigated the reliability of measuring crown adaptation and found good CK values for inter- (0.633–0.841) and intra-examiner reliability (0.721–0.982), which indicate substantial to perfect agreement between examiners. Unfortunately, we were unable to identify other reliability studies that investigated the dentist’s performance to identify different types of crown restorations.
When viewing, interpreting and discussing the data for the detection of endodontic treatment measures, the previously drawn statements need to be repeated. On the one hand, high CK values for inter- and intra-examiner reliability were registered (Table 3 and Table 5); on the other hand, further work groups documented similar [10,22] or moderate reliability data [9] for the detection of endodontic findings. The latter investigation [9] corroborates the assumption that the interpretation of endodontic information on radiographs may involve errors. The authors highlight the need to train examiners appropriately aiming at improving the data quality. As noted above, the CK values for implant detection (Table 4 and Table 5) achieved the highest agreement compared to all the other categories examined. Again, for this category, no equivalent studies were identified, which hinders comparisons.
Interestingly, no improvement between the first and second evaluation round was observed compared to other studies [22]. This could be addressed by the fact that all the examiners were extensively trained before participating in this study. Furthermore, it must be mentioned that none of the participants were fully inexperienced when agreeing to study participation.
This investigation also had strengths and limitations that need to be discussed. When considering the small number of reliability studies in this field of interest, the relevance of this study project is underlined. The involvement of four diagnostic categories, 14 examiners and 150 periapical radiographs are further strengths of this project. It is noteworthy that the image set represented a broad spectrum of radiological findings that had to be evaluated by participating dentists.
In addition to the mentioned strength, some factors may have had an impact on the examiner’s choice; thus, leading to higher reliability values. It needs to be stated that several inclusion and exclusion criteria were formulated that harmonized the used image set. Therefore, the set of periapical radiographs may not be representative of clinical practice, where often less-than-perfect images need to be assessed. Similarly, a dichotomous decision without performing a thorough classification was made, which may also contribute to the above average reliability. When viewing reliability data from caries categorization studies, it becomes obvious that the CK values are typically found in a range from moderate to excellent reproducibility [3,4]. Likewise, it must be emphasized that only periapical radiographs were used; and therefore, the findings are not fully generalizable for other X-ray projection techniques, e.g., panoramic or bitewing radiographs. Another weakness of the study might be the distribution of diagnostic categories among the set of periapical radiographs, which was not fully balanced over the four selected categories. Variations in the data values may also result from different viewing conditions (lighting, image size and masking of extraneous light). The use of nonstandard monitors for image viewing by the examiners may have affected the radiological diagnosis as well [23,24], but had obviously no influence in the present study, as this variable remained uncontrolled. This matches the typical circumstances in a dentist’s practice where different hardware is omnipresent.

5. Conclusions

In result of this diagnostic study, it can be concluded that the reliability for detecting direct and indirect restorations, root canal fillings or implants on periapical radiographs was excellent in terms of Cohen’s Kappa values. Significant differences among the diagnostic categories must be interpreted as small. The dentist’s clinical experience had no influence on the outcome in the present study.

Author Contributions

Conceptualization and project administration, J.K. and R.H.; study design, J.K.; investigations, A.W., T.M., H.D., A.K. and J.K.; data management and statistical analysis, K.H.; writing—original draft preparation, A.W., T.M., H.D. and J.K.; writing—review and editing: all authors. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study project was approved by the local ethics committee at the medical faculty of the LMU (project no. 20-0798).

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated and analyzed during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Neuhaus, K.W.; Eggmann, F.; Kühnisch, J.; Kapor, S.; Janjic Rankovic, M.; Schüler, I.; Krause, F.; Lussi, A.; Michou, S.; Ekstrand, K.; et al. Standard Reporting of Caries Detection and Diagnostic Studies (STARCARDDS). Clin. Oral Investig. 2022, 26, 1947–1955. [Google Scholar] [CrossRef] [PubMed]
  2. Kühnisch, J.; Janjic Rankovic, M.; Kapor, S.; Schüler, I.; Krause, F.; Michou, S.; Ekstrand, K.; Eggmann, F.; Neuhaus, K.W.; Lussi, A.; et al. Identifying and Avoiding Risk of Bias in Caries Diagnostic Studies. J. Clin. Med. 2021, 10, 3223. [Google Scholar] [CrossRef] [PubMed]
  3. Janjic Rankovic, M.; Kapor, S.; Khazaei, Y.; Crispin, A.; Schüler, I.; Krause, F.; Ekstrand, K.; Michou, S.; Eggmann, F.; Lussi, A.; et al. Systematic review and meta-analysis of diagnostic studies of proximal surface caries. Clin. Oral Investig. 2021, 25, 6069–6079. [Google Scholar] [CrossRef] [PubMed]
  4. Kapor, S.; Rankovic, M.J.; Khazaei, Y.; Crispin, A.; Schüler, I.; Krause, F.; Lussi, A.; Neuhaus, K.; Eggmann, F.; Michou, S.; et al. Systematic review and meta-analysis of diagnostic methods for occlusal surface caries. Clin. Oral Investig. 2021, 25, 4801–4815. [Google Scholar] [CrossRef]
  5. Schwendicke, F.; Tzschoppe, M.; Paris, S. Radiographic caries detection: A systematic review and meta-analysis. J. Dent. 2015, 43, 924–933. [Google Scholar] [CrossRef]
  6. Twetman, S.; Axelsson, S.; Dahlén, G.; Espelid, I.; Mejàre, I.; Norlund, A.; Tranæus, S. Adjunct methods for caries detection: A systematic review of literature. Acta Odontol. Scand. 2013, 71, 388–397. [Google Scholar] [CrossRef]
  7. Mesinger, S.; Heck, K.; Crispin, A.; Frankenberger, R.; Cadenaro, M.; Burgess, J.; Peschke, A.; Heintze, S.D.; Loomans, B.; Opdam, N.; et al. Evaluation of direct restorations using the revised FDI criteria: Results from a reliability study. Clin. Oral Investig. 2022. Online ahead of print. [Google Scholar] [CrossRef]
  8. Mauad, L.; Doriguêtto, P.V.T.; Almeida, D.; Fardim, K.A.C.; Machado, A.H.; Devito, K.L. Quantitative assessment of artefacts and identification of gaps in prosthetic crowns: A comparative in vitro study between periapical radiography and CBCT images. Dentomaxillofac. Radiol. 2021, 50, 20200134. [Google Scholar] [CrossRef]
  9. Eckerbom, M.; Andersson, J.E.; Magnusson, T. Interobserver variation in radiographic examination of endodontic variables. Endod. Dent. Traumatol. 1986, 2, 243–246. [Google Scholar] [CrossRef]
  10. Saunders, M.B.; Gulabivala, K.; Holt, R.; Kahan, R.S. Reliability of radiographic observations recorded on a proforma measured using inter- and intra-observer variation: A preliminary study. Int. Endod. J. 2000, 33, 272–278. [Google Scholar] [CrossRef]
  11. Kottner, J.; Audige, L.; Brorson, S.; Donner, A.; Gajewski, B.J.; Hróbjartsson, A.; Roberts, C.; Shoukri, M.; Streiner, D.L. Guidelines for Reporting Reliability and Agreement Studies (GRRAS) were proposed. Int. J. Nurs. Stud. 2011, 48, 661–671. [Google Scholar] [CrossRef] [PubMed]
  12. Hickel, R.; Roulet, J.F.; Bayne, S.; Heintze, S.D.; Mjör, I.A.; Peters, M.; Rousson, V.; Randall, R.; Schmalz, G.; Tyas, M.; et al. Recommendations for conducting controlled clinical studies of dental restorative materials. Clin. Oral Investig. 2007, 11, 5–33. [Google Scholar] [CrossRef] [PubMed]
  13. Hickel, R.; Peschke, A.; Tyas, M.; Mjör, I.; Bayne, S.; Peters, M.; Hiller, K.A.; Randall, R.; Vanherle, G.; Heintze, S.D. FDI World Dental Federation: Clinical criteria for the evaluation of direct and indirect restorations-update and clinical examples. Clin. Oral Investig. 2010, 14, 349–366. [Google Scholar] [CrossRef] [PubMed]
  14. Hickel, R.; Mesinger, S.; Opdam, N.; Loomans, B.; Frankenberger, R.; Cadenaro, M.; Burgess, J.; Peschke, A.; Heintze, S.D.; Kühnisch, J. Revised FDI criteria for evaluating direct and indirect dental restorations-recommendations for its clinical use, interpretation, and reporting. Clin. Oral Investig. 2022. Online ahead of print. [Google Scholar] [CrossRef]
  15. Ng, Y.L.; Mann, V.; Rahbaran, S.; Lewsey, J.; Gulabivala, K. Outcome of primary root canal treatment: Systematic review of the literature—Part 2. Influence of clinical factors. Int. Endod. J. 2008, 41, 6–31. [Google Scholar] [CrossRef]
  16. Cohen, J.A. A Coefficient of Agreement for Nominal Scales. Educ. Psychol. Meas. 1960, 20, 37–46. [Google Scholar] [CrossRef]
  17. Fleiss, J.L.; Cohen, J. The Equivalence of Weighted Kappa and the Intraclass Correlation Coefficient as Measures of Reliability. Educ. Psychol. Meas. 1973, 33, 613–619. [Google Scholar] [CrossRef]
  18. Landis, J.R.; Koch, G.G. The measurement of observer agreement for categorical data. Biometrics 1977, 33, 159–174. [Google Scholar] [CrossRef]
  19. Okuda, Y.; Noda, M.; Kono, H.; Miyamoto, M.; Sato, H.; Ban, S. Radio-opacity of core materials for all-ceramic restorations. Dent. Mater. J. 2010, 29, 35–40. [Google Scholar] [CrossRef]
  20. Tagger, M.; Katz, A. A standard for radiopacity of root-end (retrograde) filling materials is urgently needed. Int. Endod. J. 2004, 37, 260–264. [Google Scholar] [CrossRef]
  21. Auderset, F.C.; Connert, T.; Meller, C.; Filippi, A.; Dagassan-Berndt, D.C. Evaluation of five methods to identify composite restorations in human teeth on a forensic purpose—An ex vivo comparative study. Int. J. Legal. Med. 2022. Online ahead of print. [Google Scholar] [CrossRef] [PubMed]
  22. Sebring, D.; Kvist, T.; Buhlin, K.; Jonasson, P.; EndoReCo; Lund, H.L. Calibration improves observer reliability in detecting periapical pathology on panoramic radiographs. Acta Odontol. Scand. 2021, 79, 554–561. [Google Scholar] [CrossRef] [PubMed]
  23. Patel, N.; Rushton, V.E.; Macfarlane, T.V.; Horner, K. The influence of viewing conditions on radiological diagnosis of periapical inflammation. Br. Dent. J. 2000, 189, 40–42. [Google Scholar] [CrossRef] [PubMed]
  24. Moshfeghi, M.; Shahbazian, M.; Sajadi, S.S.; Sajadi, S.; Ansari, H. Effects of Different Viewing Conditions on Radiographic Interpretation. J. Dent. 2015, 12, 853–858. [Google Scholar]
Figure 1. Representative examples of periapical radiographs for the evaluated diagnostic categories: (a) directly applied composite or amalgam restorations; (b) various types of indirectly manufactured ceramic, porcelain-fused to metal or metal crowns; (c) different types of temporary and definitive root canal fillings; (d) typical dental implants from divergent manufactures.
Figure 1. Representative examples of periapical radiographs for the evaluated diagnostic categories: (a) directly applied composite or amalgam restorations; (b) various types of indirectly manufactured ceramic, porcelain-fused to metal or metal crowns; (c) different types of temporary and definitive root canal fillings; (d) typical dental implants from divergent manufactures.
Ijerph 20 02619 g001
Table 1. Inter- and intra-examiner reliability values for the detection of direct restorations between all examiners (N = 14) and in relation to the reference diagnosis (green background color). Gray background colors indicate inter-examiner CK values of the first evaluation round and blue background colors for the second evaluation round. Fields with a white background color represent intra-examiner CK values.
Table 1. Inter- and intra-examiner reliability values for the detection of direct restorations between all examiners (N = 14) and in relation to the reference diagnosis (green background color). Gray background colors indicate inter-examiner CK values of the first evaluation round and blue background colors for the second evaluation round. Fields with a white background color represent intra-examiner CK values.
Cohen’s Kappa
Direct Restorations
1st Evaluation Round
Ref. Stand.1 *234567 *891011121314 *Mean
Ref. Stand.1.0000.6980.8800.8970.8500.8240.8960.9100.7760.8530.8800.8220.8530.8970.9250.854
2nd evaluation round1 *0.7230.9730.7260.7680.6990.7280.7400.7260.6860.7550.7260.6450.7550.7680.7120.726
20.8520.7800.9410.9560.8810.9130.9260.9700.8370.9131.0000.8820.9420.9270.8360.901
30.8660.7660.9560.9710.8680.8990.9120.9560.8250.8990.9560.8690.9280.9710.8530.897
40.7670.7540.8570.8420.8840.8250.8670.9110.7780.8250.8810.8230.8540.8390.8660.840
50.8100.7400.8700.8840.8150.9860.8690.9130.8110.9430.9130.8550.9710.8700.7800.868
60.8790.7510.8810.8960.8250.8680.9250.9560.7940.8980.9260.8390.8980.9120.8510.876
7 *0.9400.7520.8530.8370.7980.7820.8500.8810.8370.9130.9700.8820.9420.9270.8660.905
80.8520.7800.9120.9270.8280.8700.9110.8530.8680.8400.8370.8100.8400.7960.7920.806
90.8380.7660.8980.9120.8140.8560.8970.8100.8980.9280.9130.8840.9430.8990.8100.880
100.7960.7540.8850.8990.7740.8150.8550.7690.8570.8430.8840.8820.9420.9270.8360.901
110.8200.7240.8820.8970.7980.8400.8800.7920.8820.8390.8560.8820.8550.8400.7780.834
120.8960.7660.9560.9710.8700.9130.9250.8670.9270.9120.8990.8970.9560.8990.8100.891
130.7670.6790.7720.7860.7190.7620.7980.7390.7720.7890.7480.7700.8170.7890.8530.879
14 *0.9100.7520.9410.9560.8560.8980.9400.8810.9120.8970.8840.8810.9850.8000.8660.819
Mean0.8370.7510.8800.8870.8120.8390.8670.8140.8710.8560.8340.8410.9000.7650.8910.910
* Dentists with more than 2 years of clinical experience.
Table 2. Inter- and intra-examiner reliability values for the detection of crown restorations between all examiners (N = 14) and in relation to the reference diagnosis (green background color). Gray background colors indicate inter-examiner CK values of the first evaluation round and blue background colors for the second evaluation round. Fields with a white background color represent intra-examiner CK values.
Table 2. Inter- and intra-examiner reliability values for the detection of crown restorations between all examiners (N = 14) and in relation to the reference diagnosis (green background color). Gray background colors indicate inter-examiner CK values of the first evaluation round and blue background colors for the second evaluation round. Fields with a white background color represent intra-examiner CK values.
Cohen’s Kappa
Crown Restorations
1st Evaluation Round
Ref. Stand.1 *234567 *891011121314 *Mean
Ref. Stand.1.0000.8750.9730.9730.9590.9460.8910.9450.9170.9590.9730.9860.9450.9590.9860.949
2nd evaluation round1 *0.8890.9570.8750.8750.8340.8480.8490.8470.8450.8340.8750.8620.8470.8610.8610.855
20.9860.9020.9861.0000.9590.9730.9190.9730.9170.9591.0000.9590.9730.9860.9590.958
30.9590.8740.9730.9860.9590.9730.9190.9730.9170.9591.0000.9590.9730.9860.9590.958
40.9730.8610.9590.9860.9860.9320.8780.9590.9040.9730.9590.9450.9590.9450.9450.935
50.9320.8750.9450.9450.9320.9860.9460.9730.8900.9590.9730.9320.9730.9590.9320.943
60.9320.8480.9460.9460.9320.9460.9590.9190.8360.9050.9190.8780.9190.9050.8780.898
7 *0.9860.8740.9730.9450.9590.9180.9180.9310.9170.9590.9730.9321.0000.9590.9310.947
80.9590.8450.9450.9450.9590.8900.8910.9450.9030.9040.9170.9040.9170.9030.9030.898
90.9590.8470.9450.9730.9860.9180.9190.9450.9450.9730.9590.9450.9590.9450.9450.939
100.8910.8050.9040.9310.9180.8770.9320.8770.8760.9320.9180.9590.9730.9860.9590.958
110.9730.8890.9860.9860.9730.9590.9590.9590.9310.9590.9180.9590.9320.9450.9730.933
120.9730.8890.9860.9860.9730.9590.9590.9590.9310.9590.9181.0000.9730.9590.9310.947
130.8910.7790.8770.9040.9180.8780.9050.9040.8770.9320.9180.8910.8910.8870.9450.945
14 *0.9730.8610.9590.9590.9730.9040.9050.9590.9590.9590.8910.9450.9450.8910.9590.932
Mean0.9480.8580.9460.9500.9480.9190.9240.9330.9180.9400.9000.9500.9500.8900.9320.955
* Dentists with more than 2 years of clinical experience.
Table 3. Inter- and intra-examiner reliability values for the detection of root canal fillings between all examiners (N = 14) and in relation to the reference diagnosis (green background color). Gray background colors indicate inter-examiner CK values of the first evaluation round and blue background colors for the second evaluation round. Fields with a white background color represent intra-examiner CK values.
Table 3. Inter- and intra-examiner reliability values for the detection of root canal fillings between all examiners (N = 14) and in relation to the reference diagnosis (green background color). Gray background colors indicate inter-examiner CK values of the first evaluation round and blue background colors for the second evaluation round. Fields with a white background color represent intra-examiner CK values.
Cohen’s Kappa
Root Canal Filling
1st Evaluation Round
Ref. Stand.1 *234567 *891011121314 *Mean
Ref. Stand.1.0000.9380.9380.9380.8820.9380.9380.9380.8310.8820.9380.8470.9380.9380.9380.916
2nd evaluation round1 *0.9381.0001.0001.0000.9381.0000.8681.0000.8820.9381.0000.7861.0001.0001.0000.955
20.9381.0001.0001.0000.9381.0000.8681.0000.8820.9381.0000.7861.0001.0001.0000.955
31.0000.9380.9380.9380.9381.0000.8681.0000.8820.9381.0000.7861.0001.0001.0000.955
40.7900.8500.8500.7900.7900.9380.8130.9380.8310.8820.9380.7440.9380.9380.9380.901
50.9381.0001.0000.9380.8501.0000.8681.0000.8820.9381.0000.7861.0001.0001.0000.955
60.9440.8820.8820.9440.7370.8820.8820.8680.7640.8130.8680.7860.8680.8680.8680.845
7 *0.9381.0001.0000.9380.8501.0000.8821.0000.8820.9381.0000.7861.0001.0001.0000.955
80.8310.8820.8820.8310.7370.8820.7860.8820.8930.9440.8820.7060.8820.8820.8820.860
90.9381.0001.0000.9380.8501.0000.8821.0000.8820.9380.9380.7440.9380.9380.9380.910
100.9381.0001.0000.9380.8501.0000.8821.0000.8821.0001.0000.7861.0001.0001.0000.955
110.7300.6730.6730.7300.5450.6730.6960.6730.6960.6730.6730.7970.7860.7860.7860.773
120.9381.0001.0000.9380.8501.0000.8821.0000.8821.0001.0000.6731.0001.0001.0000.955
130.8680.9300.9300.8680.7590.9300.8130.9300.8130.9300.9300.6120.9300.9301.0000.955
14 *0.9381.0001.0000.9380.8501.0000.8821.0000.8821.0001.0000.6731.0000.9301.0000.955
Mean0.9050.9350.9350.8970.7980.9350.8490.9350.8400.9350.9350.6660.9350.8700.9350.941
* Dentists with more than 2 years of clinical experience.
Table 4. Inter- and intra-examiner reliability values for the detection of dental implants between all examiners (N = 14) and in relation to the reference diagnosis (green background color). Gray background colors indicate inter-examiner CK values of the first evaluation round and blue background colors for the second evaluation round. Fields with a white background color represent intra-examiner CK values.
Table 4. Inter- and intra-examiner reliability values for the detection of dental implants between all examiners (N = 14) and in relation to the reference diagnosis (green background color). Gray background colors indicate inter-examiner CK values of the first evaluation round and blue background colors for the second evaluation round. Fields with a white background color represent intra-examiner CK values.
Cohen’s Kappa
Implants
1st Evaluation Round
Ref. Stand.1 *234567 *891011121314 *Mean
Ref. Stand.1.0001.0001.0001.0001.0001.0001.0001.0000.9591.0001.0001.0001.0000.9801.0000.996
2nd evaluation round1 *1.0001.0001.0001.0001.0001.0001.0001.0000.9591.0001.0001.0001.0000.9801.0000.995
21.0001.0001.0001.0001.0001.0001.0001.0000.9591.0001.0001.0001.0000.9801.0000.995
31.0001.0001.0001.0001.0001.0001.0001.0000.9591.0001.0001.0001.0000.9801.0000.995
40.9790.9790.9790.9790.9791.0001.0001.0000.9591.0001.0001.0001.0000.9801.0000.995
51.0001.0001.0001.0001.0001.0001.0001.0000.9591.0001.0001.0001.0000.9801.0000.995
61.0001.0001.0001.0001.0001.0001.0001.0000.9591.0001.0001.0001.0000.9801.0000.995
7 *1.0001.0001.0001.0001.0001.0001.0001.0000.9591.0001.0001.0001.0000.9801.0000.995
80.9580.9580.9580.9580.9580.9580.9580.9580.9171.0001.0001.0001.0000.9801.0000.976
91.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.9801.0000.998
100.9790.9790.9790.9790.9790.9790.9790.9790.9790.9790.9791.0001.0000.9801.0000.998
111.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.9801.0000.998
121.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0000.9801.0000.998
130.9580.9580.9580.9580.9580.9580.9580.9580.9580.9580.9580.9580.9580.9381.0000.982
14 *1.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.0001.000
Mean0.9910.9900.9900.9900.9870.9920.9920.9920.9730.9950.9820.9970.9970.9611.0000.987
* Dentists with more than 2 years of clinical experience.
Table 5. Mean CK values of the inter- and intra-examiner reliability for all evaluated categories.
Table 5. Mean CK values of the inter- and intra-examiner reliability for all evaluated categories.
VariableInter-
Examiner
Reliability
Inter-
Examiner
Reliability
Intra-
Examiner
Reliability
Inter-
Examiner
Reliability
Inter-
Examiner
Reliability
1st Round2nd Round-1st Round/Reference Stand.2nd Round/Reference Stand.
Direct restorations0.859
(0.726–0.905)
0.844
(0.751–0.891)
0.910
(0.789–0.986)
0.854
(0.698–0.925)
0.837
(0.723–0.940)
Indirect restorations0.932
(0.855–0.958)
0.926
(0.858–0.950)
0.955
(0.887–0.986)
0.949
(0.875–0.986)
0.948
(0.891–0.986)
Root canal filling0.920
(0.831–0.938)
0.886
(0.730–1.000)
0.941
(0.790–1.000)
0.916
(0.773–0.955)
0.905
(0.666–0.935)
Implant0.994
(0.959–1.000)
0.988
(0.958–1.000)
0.987
(0.917–1.000)
0.996
(0.976–1.000)
0.991
(0.961–1.000)
Table 6. Odds ratios with corresponding 95% confidence intervals (CI) and p-values were computed according to the logistic regression model. Bold numbers illustrate a significant influence.
Table 6. Odds ratios with corresponding 95% confidence intervals (CI) and p-values were computed according to the logistic regression model. Bold numbers illustrate a significant influence.
VariablesGroupsOdds Ratio95% CIp-Value
CategoriesImplant1--
Direct restorations0.030.02–0.06<0.001
Crown restorations0.080.04–0.17<0.001
Root canal filling0.210.10–0.43<0.001
Evaluation round1st1--
2nd0.900.74–1.090.266
Clinical experience≤2 years1--
>2 years1.040.82–1.310.755
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wülk, A.; Meusburger, T.; Dujic, H.; Hickel, R.; Kessler, A.; Heck, K.; Kühnisch, J. The Dentist’s Ability to Detect Different Restorative Procedures on Periapical Radiographs—Results from a Reliability Study. Int. J. Environ. Res. Public Health 2023, 20, 2619. https://doi.org/10.3390/ijerph20032619

AMA Style

Wülk A, Meusburger T, Dujic H, Hickel R, Kessler A, Heck K, Kühnisch J. The Dentist’s Ability to Detect Different Restorative Procedures on Periapical Radiographs—Results from a Reliability Study. International Journal of Environmental Research and Public Health. 2023; 20(3):2619. https://doi.org/10.3390/ijerph20032619

Chicago/Turabian Style

Wülk, Annika, Theresa Meusburger, Helena Dujic, Reinhard Hickel, Andreas Kessler, Katrin Heck, and Jan Kühnisch. 2023. "The Dentist’s Ability to Detect Different Restorative Procedures on Periapical Radiographs—Results from a Reliability Study" International Journal of Environmental Research and Public Health 20, no. 3: 2619. https://doi.org/10.3390/ijerph20032619

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop