1. Introduction
Oral health is an integral component of overall health and well-being. The perceived condition of one’s oral cavity significantly impacts daily functioning and is commonly assessed through the concept of oral-health-related quality of life (OHRQoL). This measures how oral diseases and conditions affect a person’s physical, emotional and social well-being [
1,
2]. Numerous studies have shown that OHRQoL can be substantially impaired by oral diseases and their consequences, such as pain, discomfort, esthetic concerns and functional limitations [
3].
Among the most prevalent oral diseases globally is dental caries, affecting individuals across all age groups [
4]. Periodontal diseases are also widespread and represent a major public health burden. While advanced oral conditions often lead individuals to seek professional care, many diseases frequently progress without noticeable symptoms [
5].
Gingivitis is the minor form of periodontal disease, which is one of the most common oral diseases and conditions in adults [
6]. It is also the most prevalent form of periodontal disease in children and adolescents [
7] due to poor oral hygiene and hormonal changes.
Periodontitis, the severe form of periodontal disease, is a non-communicable chronic inflammatory disease that affects all parts of the periodontium and causes largely irreversible damage. Advanced forms of periodontitis are characterized by significant destruction of the periodontal ligament and marginal alveolar bone, resulting in tooth mobility, tooth migration, gingival recession, and finally, tooth loss [
8]. Since the respective inflammation is primarily chronic and painless, periodontitis is often described as a “silent” disease [
9].
The inflammation is triggered by components from microbial plaque accumulating on the non-shedding tooth surfaces at or near the gingival sulcus [
10]. Inflammation is clinically manifested by the main features of redness, swelling and bleeding on periodontal probing. If untreated, enhanced pocket depths due to inflammatory destruction of the periodontal tissues are found, and the disease is then called periodontitis [
11]. In gingivitis, the redness of the gingival margin arises partly from the dilatation and enhanced perfusion of capillary blood vessels in the subepithelial connective tissue [
12].
Bleeding on brushing and bleeding on interdental cleaning have been suggested as identifying features for patients. Since self-diagnosis remains difficult, periodontal diseases are usually detected at a late stage, and proper localization and extent can only be diagnosed by trained dental professionals. For gingivitis, the clinical parameter of bleeding on gentle probing is the accepted gold standard [
11].
Regular dental appointments for early detection are therefore essential for prevention and (if needed) adequate treatment of this disease. Nevertheless, many individuals, especially younger adults, do not regularly seek dental care and tend to underestimate the early signs of gingival inflammation. The asymptomatic nature of mild gingivitis, combined with low awareness of its potential long-term consequences, often results in delayed diagnosis and treatment. Given the high prevalence and potential for progression into irreversible disease, the demand for easily accessible, low-barrier diagnostic support systems is growing.
For self-diagnosis of gingivitis that can easily be performed by the patient themselves, the first attempts were proposed in 1992 [
13]. Gingivitis was examined using photographs for its characteristic redness and swelling of the gums. Over the years, more studies based on clinical photos of the gingiva and additional compatible software like Serif photo plus-6 [
14], Image Pro Plus software (V4.0; Media Cybernetics, Silver Spring, MD, USA) [
15] had been developed. All respective studies are based on measuring redness and swelling before and after gingivitis treatment. The advancement in the development of digital technology also enabled the exploration of the tissues’ volumes via more refined imaging techniques, such as 3D scans (3Shape A/S, Copenhagen, Denmark) [
16].
To detect oral diseases at an early stage and to eventually treat them via specific home care treatment, recently developed computer applications may be of benefit. For example, apps to improve toothbrushing technique have been invented. Examples of these include the Oral-B app and the Philips Sonicare app, which are both available in the App Store and Google Play Store. Since 2021, another provider aimed for diagnosis and developed a filter application for an intraoral camera, which allows patients to perform a self-check on their respective oral hygiene and gingivitis status. The application claims to recognize discolorations, calculus and gingivitis [
17].
This intraoral camera-based filter application is based on a supervised machine-learning algorithm, which is based on the principles of artificial intelligence (AI) and machine learning. In contrast to the studies mentioned earlier that relied on static photographs and manual or semi-automated analysis of gingival inflammation, this AI-based system enables automated and scalable detection by identifying complex visual patterns associated with gingivitis. For the present instrument, this means that the algorithm is trained with hundreds of thousands of images. These had previously been assessed for gingivitis by experienced dentists, based on the main characteristics of red or swollen gums in the vicinity of calculus. The neural network classifies each pixel with a so-called segmentation algorithm [
18]. This type of gadget is primarily aimed at a target group of young adults with interest in both easy computerized applications and their oral hygiene.
Despite many studies involving photographic assessment of gingivitis [
13,
14,
15,
16], a proper validation of an app-based detection feature for redness and swelling is still missing. Therefore, the fact that only the surface but no deeper tissue layers can be assessed by an optical system is a relevant limitation in the detection of periodontal diseases so far. Accordingly, the proof of correlation with gingivitis diagnosis using merely optical means like photo assessment or filter-based application is still missing.
Therefore, the aim of the study was to determine whether an intraoral camera-based application can visually detect gingivitis. Accordingly, we hypothesized that gingivitis could be reliably detected either by an app using a filter application or by a dentist based solely on optical features.
3. Results
The present study was conducted as a single examination from May to the end of August 2022 at the Clinic for Conservative and Preventive Dentistry at the Center for Dental Medicine of University of Zurich. The study assessed gingivitis in 110 subjects.
Due to the one-time examination, there were no dropouts. Participants ranged in age from 18 to 78 years, with most subjects between 20 and 29 years old. Three designated test sites were analyzed per subject. Of the 330 initially targeted sites, 320 were ultimately included in the final evaluation. While all subjects remained in the study, 10 sites had to be excluded. Specifically, three sites were excluded due to ethnic pigmentation of the gingiva (two at tooth 16, one at tooth 41), two sites due to extensive amalgam restorations (both at tooth 36) and one site because of the presence of an orthodontic bracket (at tooth 16). In four cases, assessment was not possible, as the corresponding test teeth were missing (all four at tooth 36). Of the 320 test sites evaluated for bleeding on probing, 118 tested positive: 33 at tooth 16, 43 at tooth 36 and 42 at tooth 41. The filter-based assessment identified 125 sites as indicative of gingivitis: 33 at tooth 16, 32 at tooth 36 and 60 at tooth 41. In comparison, the clinical evaluation by the dentist classified a total of 89 sites as gingivitis: 34 at tooth 16, 19 at tooth 36 and 36 at tooth 41.
Analysis
The agreement between gingivitis diagnosis via filter application and the clinical findings of gingivitis (BOP) was expressed by a kappa value of 0.055 as statistically significant agreement. The corresponding diagnostic accuracy metrics were as follows: sensitivity 66.3%, specificity 48.3% and positive predictive value 68.7%. If the results were assessed separately, the kappa value was 0.036 at tooth 16, 0.114 at tooth 36 and 0.247 at tooth 41, respectively (see
Table 1).
The total kappa value for the inter-group correlation of photo evaluation by the dentist and the clinical finding of gingivitis was 0.087, with a significant agreement of
p < 0.001. To complement the kappa statistic, diagnostic accuracy metrics were calculated: the sensitivity was 79.7%, specificity 40.7% and positive predictive value 69.7%. The kappa values for individual teeth were 0.196 for tooth 16, 0.265 for tooth 36 and 0.204 for tooth 41. The gingiva was classified as healthy by the dentist in case of doubt, which led to an increased number of false negative results (see
Table 2).
To determine the extent to which the results of the filter-based application differ from the dentist’s visual assessment of the photographs,
Table 3 presents a comparative analysis. All data were evaluated based on whether gingivitis was visually detected by either the filter application or the dentist. The total kappa value was 0.280, indicating a statistically significant agreement (
p < 0.001), which is higher than the agreement between either the dentist and the clinical reference or the app and the clinical reference. When examining individual teeth, the kappa values were as follows: 0.153 for tooth 16, 0.262 for tooth 36 and 0.361 for tooth 41. The agreement for teeth 36 and 41 was statistically significant, whereas the value for tooth 16 did not reach statistical significance (
p = 0.114). In numerical terms, the assessments were concordant in 212 out of 320 test sites. At the time of data analysis, no comparison with the Bleeding on Probing index had been performed.
Finally, the kappa value for the correlation between calculus and BOP was evaluated. Since calculus was mainly located on test tooth 41, only the values of this tooth were assessed. No calculus was detected on test tooth 36 (lingual), and only two instances were found on test tooth 16 (vestibular). Due to the limited relevance of these findings, they were not included in further analysis. Among the 109 test teeth analyzed, calculus was detected on 61. The kappa value was 0.089 and did not show a significant association with 0.323.
4. Discussion
The aim of the study was to assess whether gingivitis may be diagnosed by a filter application and an intraoral camera operating solely on an optical base. Optical assessment of the respective photos by a dentist on the one hand and clinical assessment on the other hand served as a comparison and validation. Based on the data of the present assessment, it can be concluded that there is only limited agreement between the app and the clinical finding of gingivitis (see
Table 1). The diagnosis by the dentist for recognizing gingivitis based on the images, however, is low to a similar degree (see
Table 2). Therefore,
Table 1 and
Table 2 show kappa values that reflect no agreement beyond chance. This indicates that even trained dental professionals are unable to reliably detect gingivitis through purely optical evaluation of intraoral photographs. Therefore, our hypothesis that gingivitis can be detected by an app and its filter application or a dentist based on purely optical features must be rejected.
Many aspects make it difficult to assess whether gingivitis is present. During the investigation, testing for BOP was sometimes hampered in cases of abundant presence of calculus because access to the sulcus is sometimes limited. The analysis shows that BOP and calculus do not exhibit a significant association. It is difficult to determine whether this is due to limitations in the study, such as the challenge of accurate validation in the presence of calculus, or whether there truly is no association. This methodological limitation may have affected the validity of BOP as a diagnostic reference and potentially contributed to false negative results.
In the present study, the operator was well trained to handle the camera. Nevertheless, technical challenges were encountered. Achieving the correct distance from the tooth of interest was essential to obtain focused, diagnostically usable images. Particularly in the posterior molar area, this was sometimes difficult to achieve due to limitation by cheek or tongue. Saliva with air bubbles at the gingival margin posed a considerable challenge in acquiring diagnostically adequate images. Additionally, intermittent lens fogging may have further compromised the quality of the intraoral photographs.
Furthermore, there are also some inherent limitations of the application. To prevent a potential bias, amalgam tattoos, ethnic pigmentation of the gingiva and gingival hyperplasia had to be excluded from the study because the filter application had not been trained sufficiently to cope with these changes. Other problems were noted with the recognition of teeth with large metal restorations like amalgam fillings, gold crowns or when orthodontic brackets were attached on the teeth.
Clinically, gingivitis is diagnosed by the dentist based on a comprehensive set of symptoms. Optically, redness, the texture of the gingiva (stippling, swelling), spontaneous and provoked bleeding may be used. While the latter is a well-assessed and scientifically approved way to clinically assess gingivitis when correct pressure is used [
20], assessment of a purely optical analysis has rarely been performed. Furthermore, the diagnostic value of intraoral photographs is inherently limited by lighting conditions, such as shadows or overexposure, especially under non-standardized settings. Nevertheless, the agreement between the dentist and the filter-based application was higher than with the clinical gold standard (
Table 3), suggesting some consistency between both visual assessments. The kappa value demonstrated a fair agreement between the assessed methods.
Notably, the dentist tended to classify ambiguous cases as healthy, which likely increased specificity at the expense of sensitivity and contributed to false negative results. Therefore, the discrepancy between the results from the filter application and the dentist may have been distorted. A closer look at the raw data supports this assumption: among a total of 320 images, only 89 were diagnosed as showing signs of gingivitis by the dentist, while the filter application identified 125 cases. Optimizing both the camera system and the filter application may help reduce the respective bias and improve diagnostic performance to a certain extent. The filter, which is based on deep-learning algorithms, might also show some potential for improvements. Artificial intelligence is already being applied and actively investigated in several fields of dentistry. In endodontology, artificial intelligence is helping to study the anatomy of the root canal system, forecasting the viability of stem cells of the dental pulp, measuring working lengths, pinpointing root fractures and periapical lesions and forecasting the success of retreatment procedure [
21]. In cariology, artificial intelligence is successful in detecting caries lesions of varying radiological extent on bitewings [
22]. In the research of orthodontics, artificial intelligence has been applied in the analysis of clinical imagery, such as automated landmark detection in lateral cephalograms and evaluation of intraoral scans or photographic data. Furthermore, artificial intelligence supports orthodontists with treatment decisions, such as the need for orthognathic surgery or orthognathic tooth extractions [
23]. One of the most recent studies demonstrates that AI can also be effective in patient education. It compares the effectiveness of AI-generated educational materials with traditional methods in improving patient understanding and reducing anxiety prior to dental procedures [
24]. Another contemporary study supports dentists in treatment decision making by assisting in the detection of periapical radiolucencies on panoramic radiographs [
25].
Since the dentist’s image-based assessment did not show a high agreement with the clinical gold standard, visual diagnosis of gingivitis from photographs is likely to show fundamental limitations, regardless of human or artificial interpretation. This raises the question of whether improvements in the current filter algorithm alone are sufficient to substantially increase diagnostic reliability. Alternative strategies, such as the combination of visual, tactile and biochemical data, may be options to optimize both the device and the application.
Future developmental steps could involve the integration of additional diagnostic modalities. One promising approach is fluorescence-based detection of gingival biomarkers. A study by Rana et al. [
26], which followed a similar study design, used light with a wavelength ranging from 405 to 450 nm to stimulate fluorescence of porphyrin-producing bacteria associated with periodontal inflammation. Using a specialized oral imaging device (ACTEON, North America), the emitted fluorescence signals were detected, and healthy tissue could be disclosed by its characteristic coloration. The study demonstrated that the method could differentiate between inflamed and healthy gingiva with an accuracy of more than 50%. Other studies have explored visual gingivitis detection using standardized photographic comparisons before and after professional tooth cleaning [
14,
15]. However, these studies often involved patients with clearly visible plaque and calculus, which makes diagnosis easier and could lead to an overestimation of diagnostic performance. In contrast, the present study included subjects with mild and therefore less clinically evident gingival inflammation. Since the age of participants primarily ranged between 20 and 30 years and they exhibited above-average oral hygiene, often lacking visible signs, such as plaque or swelling, the presence of gingivitis was not obvious. Therefore, the identification of gingivitis represented a more challenging diagnostic task. The relatively young age of the study population was mainly due to the limited willingness of older individuals to participate. Then, older volunteers often presented with amalgam restorations, increased STI (>6) in the molar area or even missing teeth, which led to the exclusion of certain sites from evaluation. A more balanced age distribution among participants—and consequently, a broader range of gingivitis severity—might have enhanced the study’s generalizability. The generalizability was further limited by the exclusion of pigmentation, as this occurs predominantly in certain ethnicities due to their darker skin color.
It would also be of interest if, at the time of clinical photo acquisition and BOP assessment, two independent examiners performed the evaluation. This would allow for an assessment of inter-examiner variability and the extent to which diagnostic outcomes may differ between clinicians. Furthermore, conducting the study using an improved version of the filter application would be very valuable. At the same time, images of people in more age groups should be evaluated, with a clear diagnosis of gingivitis. This would likely improve the generalizability of the results and enable more robust, broadly applicable conclusions to be drawn. Additionally, further training and refinement of the AI model could reveal whether diagnostic accuracy and overall performance could be significantly enhanced, thereby increasing its clinical utility.
The integration of artificial intelligence into oral care is an expanding and increasingly accessible field. Numerous digital health technologies are being developed to enhance preventive care, improve patient adherence and support remote monitoring. For example, smart toothbrushing systems are now capable of analyzing brushing patterns, detecting missed areas and providing real-time feedback through app-based interfaces [
27]. These systems often use similar algorithmic principles to the gingivitis detection application described in this study.
In the future, the combination of different AI-supported tools—such as intraoral imaging, brushing analysis and symptom tracking—could provide users with a more comprehensive picture of their oral health and serve as a valuable adjunct in teledentistry and preventive care. However, the present results underline that current AI-based applications, particularly those relying solely on visual information, still face significant limitations in diagnostic accuracy and should be regarded (in future) as complementary tools rather than replacements for clinical examination.