Skip to Content
DiagnosticsDiagnostics
  • Systematic Review
  • Open Access

24 February 2026

Diagnostic Accuracy of Raman Spectroscopy for Oral Potentially Malignant Disorders: A Systematic Review and Meta-Analysis

,
,
,
,
and
1
Division of Maxillofacial Surgery, Department of Stomatology, Peking Union Medical College Hospital, Chinese Academy of Medical Science & Peking Union Medical College, Beijing 100730, China
2
Department of Plastic and Reconstructive Surgery, Peking Union Medical College Hospital, Chinese Academy of Medical Science & Peking Union Medical College, Beijing 100730, China
*
Author to whom correspondence should be addressed.

Abstract

Objective: The aim of this study was to summarize the accuracy and efficacy of Raman spectroscopy (RS) technology in identifying Oral Potentially Malignant Disorders (OPMDs). Methods: This systematic review was registered in PROSPERO (CRD420251124866). A literature search was conducted in databases including PubMed, Embase, and Web of Science from inception up to September 2025. The risk of bias of included studies was evaluated using the QUADAS-2 tool. Appropriate statistical techniques were applied to perform heterogeneity analysis, subgroup analyses, and calculation of pooled diagnostic efficacy indices. Results: A total of 12 studies involving 2810 samples were included. In the “OPMDs vs. Normal” group, RS achieved a pooled sensitivity of 0.84 (95% CI: 0.78–0.88), specificity of 0.90 (95% CI: 0.73–0.97), and AUC of 0.89 (95% CI: 0.86–0.91). In the “OPMDs vs. Oral squamous cell carcinoma (OSCC)” group, the pooled sensitivity was 0.89 (95% CI: 0.82–0.94), specificity was 0.91 (95% CI: 0.85–0.95), and AUC was 0.94 (95% CI: 0.91–0.96). Heterogeneity analysis revealed low heterogeneity in the “OPMDs vs. OSCC” group (I2 = 22.8%) and moderate heterogeneity in the “OPMDs vs. Normal oral mucosa” group (I2 = 58.9%). Subgroup analyses were performed: sample type exerted no significant impact on heterogeneity (p > 0.19), while Raman type (micro vs. fiber) showed potential to modulate diagnostic efficacy, micro-Raman had higher sensitivity (0.916 vs. 0.811), and fiber-Raman had better specificity (0.917 vs. 0.843). No significant publication bias was observed (Egger’s p > 0.3). Conclusions: Raman spectroscopy is an effective and reliable tool for screening and differentiating OPMDs.

1. Introduction

Oral squamous cell carcinoma (OSCC) accounts for approximately 90% of oral cancer, making it the thirteenth most common cancer globally [1]. Epidemiological data indicates that 389,846 new cases of oral and mouth cancer were reported in 2022 [2]. The irritation of the oral mucosa by substances such as alcohol, tobacco, betel nuts, HPV infection, and inadequate dental restorations results in an increased incidence of malignant transformation. Oral cancer usually progresses by a sequence of histological alterations, starting with normal mucosa and progressing to mild to severe atypical hyperplasia [3] and oral squamous cell carcinoma. In this carcinogenic process, Oral Potentially Malignant Disorders (OPMDs) act as the key intermediate link between normal oral mucosa and malignant tumors [4]; the most frequent types include oral leukoplakia (OLK), erythroplakia, proliferative verrucous leukoplakia (PVL), oral submucous fibrosis (OSMF), oral lichenoid diseases, graft-versus-host disease, and actinic cheilitis. These OPMDs exhibit heterogeneous malignant transformation rates, ranging from approximately 0.44% (e.g., oral lichen planus) to 43.87% (e.g., PVL) [5]. Given this variability, there is an immediate clinical necessity to explore and create innovative detection technologies that can enable early diagnosis and treatment of OPMDs, hence improving patient outcomes.
Surgical biopsy, in conjunction with histopathological diagnosis, is regarded as the gold standard [6] to evaluate suspicious oral lesions. Nevertheless, because of its invasive characteristics, this method is inappropriate for the early identification of OPMDs, especially when such lesions manifest in apparently “normal” or asymptomatic oral mucosa. There is a pressing necessity to investigate and advance novel detection technologies that enable early diagnosis and treatment of OPMDs, thus enhancing the prognosis for patients with oral cancer.
Raman spectroscopy (RS) is a non-elastic light scattering technique regarded as a potential diagnostic approach, capable of acquiring extensive molecular information from biological tissues, referred to as “molecular fingerprints.” In the carcinogenesis process within biological tissues, internal proteins, nucleic acids, lipids, carbohydrates, and other biomolecules experience alterations in conformation and amount, resulting in changes in Raman frequency shifts, peak intensities, and spectrum bands [7]. In recent years, numerous RS techniques have been extensively utilized for the differential detection of ex vivo oral cancer tissues and normal tissues [8,9,10,11]. Clinically, RS shows considerable promise in early risk stratification of suspicious oral lesions, streamlining clinical triage, and guiding precise biopsy targeting, serving as a potentially valuable adjunctive diagnostic tool to histopathological examination.
To date, a substantial body of literature has confirmed the diagnostic accuracy of Raman spectroscopy (RS) in head and neck carcinomas, with specific studies investigating its in vivo performance in oral cancer diagnosis [9,12,13,14]. For instance, Zhan (2020) [15] conducted a meta-analysis of 41 studies and reported a pooled sensitivity of 0.91 and specificity of 0.85 for in vivo oral cancer diagnosis using Raman spectroscopy (RS) [10]. Notably, this study also separately included 15 studies focusing on OPMDs, with subgroup analysis demonstrating excellent diagnostic performance of RS for OPMDs: the pooled diagnostic odds ratio (DOR) was as high as 839.25 (95% CI [274.50, 2565.92]), and the area under the summary receiver operating characteristic (SROC) curve (AUC) was 0.9932, confirming its robust potential for OPMD identification. Similarly, Shrivastava (2023) [16] performed a meta-analysis of 8 studies and reported that RS achieved a pooled sensitivity of 1.00 (95% CI: 0.72–1.00) and specificity of 0.93 (95% CI: 0.80–0.98) for detecting oral malignant and potentially malignant conditions.
However, despite these promising findings, existing meta-analyses on RS and OPMDs have not evaluated RS’s efficacy in distinguishing OPMDs from oral squamous cell carcinoma (OSCC), nor have they systematically assessed its overall diagnostic value in differentiating OPMDs, normal oral mucosa, and OSCC. Identifying the malignant transformation of OPMDs is clinically crucial for improving early cancer screening and reducing unnecessary invasive procedures, yet this remains an unmet clinical need inadequately addressed by prior research. Therefore, this meta-analysis aims to systematically evaluate the diagnostic accuracy of RS for the rapid identification of OPMDs and the differentiation of their malignant progression, clarify its efficacy in distinguishing OPMDs from normal oral mucosa and OSCC, fill the aforementioned research gap, and support the early detection of oral mucosal carcinogenesis.

2. Materials and Methods

This meta-analysis adhered to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. A completed PRISMA 2020 checklist is provided as Supplementary Materials. The study was registered under PROSPERO number: CRD420251124866.
This systematic review and meta-analysis was guided by a clearly defined PICO (Population, Index test, Comparator, Outcomes) framework to ensure transparency, reproducibility, and methodological rigor in diagnostic accuracy assessment.
Population (P): Patients with OPMDs, OSCC, or normal oral mucosa, regardless of age, gender, OPMD subtype, or study setting.
Index test (I): Raman spectroscopy (RS), including micro-Raman and fiber-Raman systems, used for the detection and differentiation of oral mucosal lesions.
Comparator (C): Histopathological diagnosis (gold standard) for OPMDs and OSCC; normal oral mucosa as a reference group; and direct comparison between OPMDs and OSCC.
Outcomes (O): Primary outcomes: Pooled diagnostic accuracy measures including sensitivity, specificity, positive likelihood ratio (PLR), negative likelihood ratio (NLR), and area under the summary receiver operating characteristic (SROC) curve (AUC).
Secondary outcomes: Subgroup performance by Raman system type, sample type, and sources of heterogeneity.

2.1. Search Strategy

A systematic search of four databases—PubMed, Embase, Web of Science, and the Cochrane Library—was conducted through September 2025 to identify studies assessing the accuracy of RS in diagnosing OPMDs. Taking the PubMed database as an example, the search expression was: ((“Precancerous Conditions”[MeSH Terms] AND “Oral”[Title/Abstract]) OR “Leukoplakia, Oral”[MeSH Terms] OR “Erythroplasia”[MeSH Terms] OR “Oral Submucous Fibrosis”[MeSH Terms] OR “Lichen Planus, Oral”[MeSH Terms] OR “Oral Premalignant Lesions”[Title/Abstract] OR “Oral Potentially Malignant Disorders”[Title/Abstract] OR “Oral Dysplasia”[Title/Abstract]) AND “Spectrum Analysis, Raman”[MeSH Terms]. In addition to electronic database searches, a manual search was conducted of reference lists from previously published systematic reviews and meta-analyses on Raman spectroscopy in oral oncology (identified via initial database searches) to identify potentially eligible studies missed by electronic searches.

2.2. Selection Criteria

Inclusion criteria: (1) original research studies; (2) studies focusing on specimens from patients with OPMDs; (3) studies employing histopathology as the diagnostic gold standard; (4) studies utilizing RS as a diagnostic tool for screening, assessing lesion risks, delineating surgical margins, and/or guiding subsequent actions; (5) studies conducting tests on specimens from human bodies, ex vivo human tissues, or human body fluids; (6) studies providing the number of samples in each test group along with true positive (TP), false positive (FP), true negative (TN), and false negative (FN) test results.
Exclusion criteria: (1) case reports, case series, review articles, editorials, letters, comments, conference proceedings, or non-original articles; (2) studies with low sample sizes (normal/benign < 5; OPMDs/OSCC < 5); (3) studies based on animal models and cell lines; (4) studies focusing on specimens outside the oral region; (5) studies without histopathological diagnosis for the collected samples.

2.3. Data Extraction

Employing a uniform data extraction template, two reviewers conducted independent data retrieval from each included article and appraised its quality, resolving any disagreements through consensus. The following data were extracted: author name; publication year; patient count; sample type; spectral details; diagnostic criteria; the RS methodology of the primary investigator; the numbers of true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN); and various methodological and technical specifics. The quality of each study was assessed in accordance with the Quality Assessment of Diagnostic Accuracy Studies-2 (QUADAS-2) [17].

2.4. Statistical Analysis

Data analysis was conducted using Stata 18.0, Meta-Disc 2.0, and Revman 5.3 software. Bivariate random-effects models were consistently applied across all diagnostic meta-analyses to synthesize the diagnostic accuracy indices. Sensitivity, specificity, and their 95% confidence intervals (CIs), along with the positive likelihood ratio (PLR), negative likelihood ratio (NLR), and diagnostic odds ratio (DOR), were calculated, and the area under the curve (AUC) was derived by plotting the SROC curve. The significance level for the meta-analysis was set at α = 0.05. The I2 metric was utilized to assess heterogeneity. A fixed-effects model was employed for meta-analysis when studies showed no statistical heterogeneity; otherwise, a random-effects model was utilized after investigating and excluding clear clinical heterogeneity sources. Deek’s funnel plot assessed publication bias.

3. Results

3.1. Literature Selection

A total of 17 articles were obtained from the PubMed database, 32 from the Embase database, 17 from the Web of Science database, and 13 from the Cochrane Library, resulting in a cumulative total of 76 papers. Following the exclusion of 12 duplicate papers, 64 papers remained. Excluding papers that lacked full text, had missing data, did not meet established criteria, were repetitive, or exhibited significant design flaws resulted in the inclusion of 12 papers [18,19,20,21,22,23,24,25,26,27,28,29] for systematic review and meta-analysis (Figure 1). A manual search of reference lists from two prior systematic reviews [15,16] and meta-analyses was performed. Zhan (2020) [15] included 16 references and Shrivastava (2023) [16] included 8 references, all of which were duplicates of studies already identified in the initial screening. After excluding non-original studies, studies without a gold standard, and studies with duplicate data, no new eligible studies were included. The article selection and systematic review process adhered to the PRISMA guidelines [17].
Figure 1. PRISMA flow chart of literature selection process.
Table 1 summarizes the study’s characteristics of included articles, including author, publication year, sample type and corresponding sample sizes, RS parameters, and data processing methods. Among these studies, 11 provided diagnostic data for differentiating Oral Potentially Malignant Disorders (OPMDs) from oral squamous cell carcinoma (OSCC), while another 11 documented data for distinguishing OPMDs from normal samples.
Table 1. Study characteristics.

3.2. Risk and Publication Bias Assessment

The relevant data and information from the included studies were systematically organized and analyzed utilizing the QUADAS-2 tool. Figure 2 presents the evaluations of bias risk and applicability concerns across the included studies. The majority of included studies exhibited a low risk of bias across all QUADAS-2 domains, while a minority indicated a high risk of bias in certain domains.
Figure 2. Methodological quality summary (2010 Li [18], 2012 Singh [19], 2013 Krishna [20], 2015 Sahu [21], 2015 Xue [22], 2016 Brindha [25], 2016 Krishna [23], 2016 Rekha [26], 2016 Sahu [24], 2019 Ghosh [27], 2020 Behl [28], 2025 Kaushik [29]).
To identify potential publication bias in the 12 included articles, distinct funnel plots were created for two comparisons: OPMDs vs. Normal (Figure 3) and OPMDs vs. OSCC (Figure 4). In the comparison of OPMDs vs. OSCC, Deeks’ funnel plot asymmetry test produced a p-value of 0.73. For the comparison of OPMDs vs. Normal, the p-value was 0.19. Both results demonstrated the absence of significant publication bias in each comparison. Egger’s Test and analogous asymmetry tests demonstrate limited statistical power with a small number of included studies. The inclusion of merely 12 studies in this research may undermine the sensitivity of the test. The slight asymmetry observed in the funnel plot suggests that the potential for publication bias cannot be entirely dismissed. The lack of gray literature, including unpublished studies and conference abstracts, may influence the results.
Figure 3. Deeks’ funnel plot asymmetry test (OPMDs vs. Normal). The numbers 1–11 correspond to references [18,19,20,21,22,23,24,25,26,27,29], respectively.
Figure 4. Deeks’ funnel plot asymmetry test (OPMDs vs. OSCC). The numbers 1–11 correspond to references [18,19,20,21,22,23,24,25,26,27,28], respectively.
The findings of this meta-analysis indicate a low risk of publication bias, suggesting that the research conclusions are relatively reliable. Future research must incorporate additional high-quality studies for further verification.

3.3. Pooled Diagnostic Accuracy Results

3.3.1. OPMDs vs. Normal

A total of 11 out of 12 included studies were eligible for meta-analysis regarding the differentiation of OPMDs from normal oral mucosa. Complete data extraction was achieved for true positive (TP), false positive (FP), true negative (TN), and false negative (FN) across all eligible studies.
Individual studies exhibited marked variability in diagnostic performance, as visualized by two forest plots. For sensitivity (Figure 5), values ranged from 0.44 (Sahu 2016 [24]) to 0.93 (Rekha 2016 [26]). For specificity (Figure 6), the range was 0.64 (Li 2010 [18]) to 1.00 (Xue 2015 [22], Sahu 2016 [24]). The observed variability suggests potential heterogeneity, likely due to variations in study design, sample type, or RS parameters.
Figure 5. Sensitivity forest plot (OPMDs vs. Normal) (Li 2010 [18], Singh 2012 [19], Krishna 2013 [20], Sahu 2015 [21], Xue 2015 [22], Krishna 2016 [23], Sahu 2016 [24], Brindha 2016 [25], Rekha 2016 [26], Ghosh 2019 [27], Kaushik 2025 [29]).
Figure 6. Specificity forest plot (OPMDs vs. Normal) (Li 2010 [18], Singh 2012 [19], Krishna 2013 [20], Sahu 2015 [21], Xue 2015 [22], Krishna 2016 [23], Sahu 2016 [24], Brindha 2016 [25], Rekha 2016 [26], Ghosh 2019 [27], and Kaushik 2025 [29]).
Using a bivariate random-effects meta-analysis model, the pooled diagnostic efficacy indicators for OPMDs vs. Normal, including sensitivity, specificity, diagnostic odds ratio (DOR), positive likelihood ratio (PLR), and negative likelihood ratio (NLR), are summarized in Table 2. These indicators illustrate the strong discriminative performance of RS. The pooled sensitivity of 0.843 reduces missed screenings for OPMDs, confirming its effectiveness as an initial detection tool. Additionally, the pooled specificity of 0.900 (FPR = 0.100) decreases the likelihood of unnecessary invasive procedures for individuals with normal oral mucosa. Additionally, high pooled DOR (48.359) attests to strong overall diagnostic efficacy, while elevated pooled PLR (8.414) and low pooled NLR (0.174) strengthen confidence in positive result validity and negative-case exclusion reliability.
Table 2. Pooled diagnostic efficacy indicators (OPMDs vs. Normal).
An SROC curve was created to assess diagnostic abilities (Figure 7), resulting in an area under the curve of 0.91 (95% CI: 0.88–0.93). This is “excellent” (AUC > 0.90), validating RS’s ability to distinguish OPMDs from normal oral mucosa.
Figure 7. SROC curve plot (OPMDs vs. Normal).

3.3.2. OPMDs vs. OSCC

This group comprised 11 studies from a total of 12, with one study excluded. Data extraction adhered to the same methodology and encompassed similar diagnostic data as the previous group (Figure 8 and Figure 9). The individual studies demonstrated variability in diagnostic performance: sensitivity varied from 0.50 (Rekha 2016 [26]) to 0.95 (Li 2010 [18]), and specificity ranged from 0.44 (Sahu 2016 [24]) to 0.94 (Brindha 2016 [25]), as illustrated in the forest plots.
Figure 8. Sensitivity forest plot (OPMDs vs. OSCC) (Li 2010 [18], Singh 2012 [19], Krishna 2013 [20], Sahu 2015 [21], Xue 2015 [22], Krishna 2016 [23], Sahu 2016 [24], Brindha 2016 [25], Rekha 2016 [26], Ghosh 2019 [27], Behl 2020 [28]).
Figure 9. Specificity forest plot (OPMDs vs. OSCC) (Li 2010 [18], Singh 2012 [19], Krishna 2013 [20], Sahu 2015 [21], Xue 2015 [22], Krishna 2016 [23], Sahu 2016 [24], Brindha 2016 [25], Rekha 2016 [26], Ghosh 2019 [27], Behl 2020 [28]).
The combined diagnostic effectiveness demonstrates significant clinical value (Table 3). A pooled sensitivity of 0.838 facilitates the accurate identification of approximately 84% of cases, thereby minimizing the risk of missed diagnoses and endorsing its application in screening processes. The pooled specificity of 0.791 indicates that approximately 79% of non-cases are excluded. A DOR of 19.61, significantly greater than 1, indicates strong overall discriminatory ability. A PLR of 4.018 indicates that a positive result is approximately four times more likely in cases, thereby enhancing confidence in positive findings. An NLR of 0.205 suggests that a negative result is approximately five times more probable in non-cases, thereby rendering negative results highly dependable for the exclusion of disease. In summary, RS exhibits robust comprehensive performance, notably excelling in sensitivity and overall differentiation.
Table 3. Pooled diagnostic efficacy indicators (OPMDs vs. OSCC).
The SROC curve resides in the upper-left region (Figure 10), consistent with an AUC of 0.89 (95% CI: 0.86–0.91), a value that falls into the “good” category (typically defined as AUC 0.8–0.9) and confirms RS’s robust ability to differentiate OPMDs from OSCC.
Figure 10. SROC curve plot (OPMDs vs. OSCC).

3.3.3. Cross-Group Comparison

To clarify RS’s performance across scenarios, we compared pooled diagnostic efficacy between the “OPMDs vs. Normal” and “OPMDs vs. OSCC” groups. The “OPMDs vs. OSCC” group exhibited higher sensitivity (0.89 vs. 0.84), specificity (0.91 vs. 0.90), DOR (85.2 vs. 48.4), and AUC (0.94 vs. 0.89), indicating superior efficacy in differentiating OPMDs from OSCC than from normal mucosa.
A combined analysis of the SROC curve (Figure 11) and meta-regression results reveals important details: although the “OPMDs vs. Normal” curve (red) is positioned nearer to the upper-left region, meta-regression indicated no statistically significant differences in relative sensitivity (p = 0.918), relative specificity (p = 0.197), or global test comparison (p = 0.425) between the two groups. Despite this statistical parity, RS retains distinct value in the “screening to differentiation” workflow: it serves as a dependable preliminary screening tool for OPMDs compared to normal mucosa, utilizing high sensitivity (0.84) and AUC (0.89) to reduce missed diagnoses. In differentiating OPMDs from OSCC, RS also exhibits favorable specificity (0.91), DOR (85.2), and AUC (0.94), facilitating accurate treatment decisions (e.g., conservative management for OPMDs versus radical therapy for OSCC). RS facilitates thorough management of oral lesions, encompassing detection and accurate classification, and demonstrates consistent performance in various clinical contexts.
Figure 11. Combined analysis of the SROC curve.

3.3.4. Heterogeneity and Subgroups Analysis

The heterogeneity analysis (Table 4) revealed that the bivariate I2 statistic was 22.8% for the OPMDs vs. OSCC group, reflecting low heterogeneity, while it reached 58.9% for the OPMDs vs. Normal group, suggesting moderate heterogeneity. In terms of variance components, the variance of logit-transformed sensitivity (Var logit(sen) = 0.349) and specificity (Var logit(spe) = 0.686) in the OPMDs vs. OSCC group were both lower than those in the OPMDs vs. Normal group (0.494 and 3.529, respectively), indicating less between-study variation in the former. The multiplicative odds ratios (MORs) for sensitivity (1.757) and specificity (2.203) in the OPMDs vs. OSCC group were also significantly lower than those in the OPMDs vs. Normal group (1.955 and 6.001, respectively), further confirming weaker heterogeneity in the former. Additionally, the area of the 95% prediction ellipse was much smaller in the OPMDs vs. OSCC group (0.068) than in the OPMDs vs. Normal group (0.26), intuitively reflecting better consistency of results. In conclusion, RS exhibits lower between-study heterogeneity and stronger result stability in the “OPMDs vs. OSCC” scenario, while there is moderate heterogeneity in the “OPMDs vs. Normal” scenario, which requires further exploration of sources through subgroup analysis.
Table 4. Heterogeneity analysis.
To identify the potential drivers of moderate heterogeneity in the “OPMDs vs. Normal” group, we conducted subgroup analyses based on type of sample (Table 5) and type of RS (Table 6), respectively, with results as follows:
Table 5. Subgroup analysis by type of sample.
Table 6. Subgroup analysis by type of RS.
The tissue subgroup demonstrated superior performance in specificity and DOR. Subsequent meta-regression analysis indicated that the p-value for relative sensitivity level (tissue vs. other) was 0.194, that for relative specificity level was 0.197, and that for global test comparison was 0.349. All p-values were >0.05, indicating that “type of sample” had no statistically significant impact on the heterogeneity of this group.
The micro subgroup performed better in sensitivity, while the fiber subgroup was superior in specificity. Meta-regression analysis showed that the p-value for relative sensitivity level (micro vs. fiber) was 0.086 (approaching statistical significance), that for relative specificity level was 0.586, and that for global test comparison was 0.186. This suggests that the type of RS may potentially affect sensitivity.

4. Discussion

This study assessed the diagnostic efficacy of RS in differentiating OPMDs from normal oral mucosa and OSCC, emphasizing the resolution of clinical unmet needs and methodological deficiencies in previous research.
The diagnostic efficacy of RS aligns with the clinical workflow in oral oncology, and its potential clinical scenarios are clearly defined herein to enhance translational relevance. Specifically, RS is well-suited for three core clinical settings: First, population-based preliminary screening in primary care or community health centers, where its high sensitivity (0.84) can effectively identify suspicious lesions in asymptomatic individuals or those with a high risk factor, addressing the issue of overlooked early lesions when pathological biopsy is not readily accessible. Second, it serves as a pre-biopsy adjunctive tool during specialist assessment in stomatology departments, as its favorable specificity (0.90) helps triage suspicious lesions, minimizing unnecessary invasive biopsies and alleviating patient anxiety. Third, it holds promise for long-term follow-up of patients with OPMDs, enabling non-invasive dynamic monitoring of lesion progression and aiding in timely identification of malignant transformation. The enhanced efficacy for differentiating OPMDs from OSCC aids in addressing the significant challenges of over-treatment and under-treatment in contemporary clinical practice. The lack of statistically significant efficacy differences between the two scenarios (global p = 0.425) further underscores RS’s adaptability across these clinical stages.
Notably, our pooled diagnostic accuracy (sensitivity = 0.84, specificity = 0.90) for differentiating OPMDs from normal mucosa and OSCC is consistent with the findings of Zhan (2020) [15], who reported a pooled sensitivity of 0.96 and specificity of 0.90 for RS in diagnosing oral precancerous lesions [10]. This alignment validates RS’s reproducible diagnostic value in oral mucosal lesions, while our study extends their work by specifically clarifying RS’s efficacy in distinguishing OPMDs from OSCC. This critical aspect was not addressed in their meta-analysis and fills a key research gap in the field.
Heterogeneity analysis identifies key patterns: the low heterogeneity in differentiating OPMDs from OSCC (I2 = 22.8%) is due to OSCC’s unique pathology causing consistent spectral changes, while moderate heterogeneity in distinguishing OPMDs from normal tissue (I2 = 58.9%) is linked to normal mucosal variations. Subgroup analyses indicate that sample type does not affect heterogeneity (p > 0.19), supporting RS’s non-invasive use. Raman type is a modifiable factor; micro-Raman is more sensitive (0.916 vs. 0.811) for early lesions, and fiber-Raman is more specific (0.917 vs. 0.843) for bedside screening.
Important limitations of this investigation should be acknowledged. We fully recognize these constraints and their potential impact on the precision, robustness, and interpretability of the pooled findings. First, stratified analysis according to OPMD subtypes could not be performed, largely due to incomplete reporting of subtype classifications and the general absence of prospective, subtype-stratified research designs in the majority of included primary studies. Given the divergent malignant potential across distinct OPMDs entities, this may reduce the specificity and granularity of RS diagnostic efficacy estimates. Second, inconsistent and incomplete reporting of key sample and clinical characteristics—including histopathological differentiation of oral cancer, patient lifestyle habits, and detailed sample source information—limited further exploratory subgroup analyses and may affect the external generalizability of the synthesized results. Third, the micro-Raman subgroup comprised only three included studies, which reduces statistical precision and restricts the stability and generalizability of corresponding subgroup estimates.
Despite a comprehensive search and screening strategy implemented across multiple databases and relevant prior reviews, the overall number of high-quality diagnostic accuracy studies eligible for inclusion remains relatively modest. This largely reflects the current breadth and maturity of peer-reviewed evidence focused on RS for OPMDs diagnosis. To strengthen future evidence in this field, subsequent research is encouraged to implement standardized OPMD subtype categorization, adopt prospectively planned subtype-stratified diagnostic evaluations, improve uniformity in reporting sample and patient-level characteristics, expand sample sizes in underrepresented technical subgroups, and perform head-to-head comparative studies between RS and established non-invasive diagnostic modalities. These steps may better define the clinical performance and implementation potential of RS for OPMD detection and risk stratification.

5. Conclusions

This study validates that RS technology shows promising diagnostic performance for OPMDs, offering non-invasive, real-time, and rapid identification capabilities that support a range of adjunctive clinical applications. Nonetheless, important limitations persist that require further refinement. In conclusion, RS holds potential as a complementary screening, triage, or biopsy-guidance tool for OPMDs, but should not be regarded as a replacement for histopathological diagnosis (the current gold standard). Ongoing research and development are needed to advance its clinical translation and standardization.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/diagnostics16050648/s1, File S1: PRISMA 2020 Main Checklist.

Author Contributions

T.Z. played a role in design of the study; X.X. wrote the main manuscript text; X.L. extracted data, Z.Z., Q.S. and N.H. reviewed the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by CAMS Innovation Fund for Medical Science (CIFMS) (2024-I2M-C&T-B-027).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

This systematic review and meta-analysis is based on secondary analysis of publicly available data from previously published studies. All included primary studies are listed in the reference section, and their original data can be obtained via the journal websites or by requesting from the corresponding authors of those studies. The literature search strategy (including databases, keywords, and retrieval timeframes) and PRISMA flow diagram are presented in the main text. Additionally, the aggregated data used for meta-analysis (e.g., sensitivity, specificity, pooled effect sizes) are available from the corresponding author upon reasonable request for academic purposes.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
RSRaman Spectroscopy
OPMDsOral Potentially Malignant Disorders
OSCCOral Squamous Cell Carcinoma
PVLProliferative Verrucous Leukoplakia
OSMFOral Submucous Fibrosis
PRISMAPreferred Reporting Items for Systematic Reviews and Meta-Analyses
TPTrue Positive
FPFalse Positive
TNTrue Negative
FNFalse Negative
CIConfidence Interval
PLRPositive Likelihood Ratio
NLRNegative likelihood Ratio
DORDiagnostic Odds Ratio
AUCArea Under the Curve
MORMultiplicative Odds Ratio
Var logit(sen)Variance of logit-transformed sensitivity
Var logit(spe)Variance of logit-transformed specificity

References

  1. World Health Organization. Oral Health. 2023. Available online: https://www.who.int/zh/news-room/fact-sheets/detail/oral-health (accessed on 1 December 2024).
  2. World Cancer Research Fund. Mouth and Oral Cancer Statistics. 2023. Available online: https://www.wcrf.org/preventing-cancer/cancer-statistics/mouth-and-oral-cancer-statistics/#latest-mouth-and-oral-cancer-data (accessed on 1 December 2024).
  3. Pritzker, K.P.; Darling, M.R.; Hwang, J.T.; Mock, D. Oral Potentially Malignant Disorders (OPMDs): What is the clinical utility of dysplasia grade? Expert Rev. Mol. Diagn. 2021, 21, 289–298. [Google Scholar] [CrossRef]
  4. Warnakulasuriya, S.; Kujan, O.; Aguirre-Urizar, J.M.; Bagan, J.V.; González-Moles, M.Á.; Kerr, A.R.; Lodi, G.; Mello, F.W.; Monteiro, L.; Ogden, G.R.; et al. Oral potentially malignant disorders: A consensus report from an international seminar on nomenclature and classification, convened by the WHO Collaborating Centre for Oral Cancer. Oral Dis. 2021, 27, 1862–1880. [Google Scholar] [CrossRef] [PubMed]
  5. Iocca, O.; Sollecito, T.P.; Alawi, F.; Weinstein, G.S.; Newman, J.G.; De Virgilio, A.; Di Maio, P.; Spriano, G.; Pardiñas López, S.; Shanti, R.M. Potentially malignant disorders of the oral cavity and oral dysplasia: A systematic review and meta-analysis of malignant transformation rate by subtype. Head Neck 2020, 42, 539–555. [Google Scholar] [CrossRef]
  6. Sankaranarayanan, R.; Ramadas, K.; Amarasinghe, H.; Subramanian, S.; Johnson, N. Oral Cancer: Prevention, Early Detection, and Treatment. In Cancer: Disease Control Priorities, 3rd ed.; The International Bank for Reconstruction and Development/The World Bank: Washington, DC, USA, 2015; Volume 3. [Google Scholar] [CrossRef]
  7. Li, Y.; Su, S.; Zhang, Y.; Liu, S.; Jin, H.; Zeng, Q.; Cheng, L. Accuracy of Raman spectroscopy in discrimination of nasopharyngeal carcinoma from normal samples: A systematic review and meta-analysis. J. Cancer Res. Clin. Oncol. 2019, 145, 1811–1821. [Google Scholar] [CrossRef]
  8. Johnson, D.E.; Burtness, B.; Leemans, C.R.; Lui, V.W.Y.; Bauman, J.E.; Grandis, J.R. Head and neck squamous cell carcinoma. Nat. Rev. Dis. Primers 2020, 6, 92. [Google Scholar]
  9. Matthies, L.; Gebrekidan, M.T.; Tegtmeyer, J.F.; Oetter, N.; Rohde, M.; Vollkommer, T.; Smeets, R.; Wilczak, W.; Stelzle, F.; Gosau, M.; et al. Optical diagnosis of oral cavity lesions by label-free Raman spectroscopy. Biomed. Opt. Express 2021, 12, 836–851. [Google Scholar] [CrossRef] [PubMed]
  10. Barroso, E.M.; Smits, R.W.H.; Bakker Schut, T.C.; Ten Hove, I.; Hardillo, J.A.; Wolvius, E.B.; Baatenburg de Jong, R.J.; Koljenović, S.; Puppels, G.J. Discrimination between oral cancer and healthy tissue based on water content determined by Raman spectroscopy. Anal. Chem. 2015, 87, 2419–2426. [Google Scholar] [CrossRef] [PubMed]
  11. Li, B.; Gu, Z.Y.; Yan, K.X.; Wen, Z.N.; Zhao, Z.H.; Li, L.J.; Li, Y. Evaluating oral epithelial dysplasia classification system by near-infrared Raman spectroscopy. Oncotarget 2017, 8, 76257–76265. [Google Scholar] [CrossRef][Green Version]
  12. Han, R.; Lin, N.; Huang, J.; Ma, X. Diagnostic accuracy of Raman spectroscopy in oral squamous cell carcinoma. Front. Oncol. 2022, 12, 925032. [Google Scholar] [CrossRef]
  13. Sharma, M.; Li, Y.C.; Manjunatha, S.N.; Tsai, C.L.; Lin, R.M.; Huang, S.F.; Chang, L.B. Identification of Healthy Tissue from Malignant Tissue in Surgical Margin Using Raman Spectroscopy in Oral Cancer Surgeries. Biomedicines 2023, 11, 1984. [Google Scholar] [CrossRef]
  14. Cals, F.L.; Koljenović, S.; Hardillo, J.A.; de Jong, R.J.B.; Schut, T.C.B.; Puppels, G.J. Development and validation of Raman spectroscopic classification models to discriminate tongue squamous cell carcinoma from non-tumorous tissue. Oral Oncol. 2016, 60, 41–47. [Google Scholar] [CrossRef]
  15. Zhan, Q.; Li, Y.; Yuan, Y.; Liu, J.; Li, Y. The accuracy of Raman spectroscopy in the detection and diagnosis of oral cancer: A systematic review and meta-analysis. J. Raman Spectrosc. 2020, 51, 2377–2397. [Google Scholar] [CrossRef]
  16. Shrivastava, P.K.; Kumar, A.; Aggarwal, Y.; Kumar, A.; Agrawal, A.; Rai, A. Diagnostic accuracy of Vibrational spectroscopy in the diagnosis of oral potentially malignant and malignant disorders: A systematic review and meta-analysis. J. Cancer Res. Ther. 2023, 19, 151–158. [Google Scholar] [CrossRef]
  17. Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef] [PubMed]
  18. Li, Y.; Wen, Z.-N.; Li, L.-J.; Li, M.-L.; Gao, N.; Guo, Y.-Z. Research on the Raman spectral character and diagnostic value of squamous cell carcinoma of oral mucosa. J. Raman Spectrosc. 2010, 41, 142–147. [Google Scholar] [CrossRef]
  19. Singh, S.P.; Deshmukh, A.; Chaturvedi, P.; Murali Krishna, C. In vivo Raman spectroscopic identification of premalignant lesions in oral buccal mucosa. J. Biomed. Opt. 2012, 17, 105002. [Google Scholar] [CrossRef]
  20. Krishna, H.; Majumder, S.K.; Chaturvedi, P.; Gupta, P.K. Anatomical variability of in vivo Raman spectra of normal oral cavity and its effect on oral tissue classification. Biomed. Spectrosc. Imaging 2013, 2, 199–217. [Google Scholar] [CrossRef]
  21. Sahu, A.K.; Dhoot, S.; Singh, A.; Sawant, S.S.; Nandakumar, N.; Talathi-Desai, S.; Garud, M.; Pagare, S.; Srivastava, S.; Nair, S.; et al. Oral cancer screening: Serum Raman spectroscopic approach. J. Biomed. Opt. 2015, 20, 115006. [Google Scholar] [CrossRef]
  22. Xue, L.; Li, Y.; Cai, Q.; Sun, P.; Luo, X.; Yan, B. Raman spectral characteristics of oral squamous cell carcinoma, epithelial dysplasia and normal mucosa. Chin. J. Stomatol. 2015, 50, 18–22. [Google Scholar]
  23. Krishna, H.; Muttagi, S.; Ingole, P.; Chaturvedi, P.; Majumder, S.K. Tobacco Consumption Induced Changes in the Healthy Oral Mucosa and its Effect on Differential Diagnosis of Oral Lesions-A Clinical In Vivo Raman Spectroscopic Study. J. Anal. Oncol. 2016, 5, 110–123. [Google Scholar] [CrossRef]
  24. Sahu, A.; Deshmukh, A.; Hole, A.R.; Chaturvedi, P.; Krishna, C.M. In vivo subsite classification and diagnosis of oral cancers using Raman spectroscopy. J. Innov. Opt. Health Sci. 2016, 9, 1650017. [Google Scholar] [CrossRef]
  25. Brindha, E.; Rajasekaran, R.; Aruna, P.; Koteeswaran, D.; Ganesan, S. High wavenumber Raman spectroscopy in the characterization of urinary metabolites of normal subjects, oral premalignant and malignant patients. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2017, 171, 52–59. [Google Scholar] [CrossRef] [PubMed]
  26. Rekha, P.; Aruna, P.; Brindha, E.; Koteeswaran, D.; Baludavid, M.; Ganesan, S. Near-infrared Raman spectroscopic characterization of salivary metabolites in the discrimination of normal from oral premalignant and malignant conditions. J. Raman Spectrosc. 2016, 47, 763–772. [Google Scholar] [CrossRef]
  27. Ghosh, A.; Raha, S.; Dey, S.; Chatterjee, K.; Chowdhury, A.R.; Barui, A. Chemometric analysis of integrated FTIR and Raman spectra obtained by non-invasive exfoliative cytology for the screening of oral cancer. Analyst 2019, 144, 1309–1325. [Google Scholar] [CrossRef]
  28. Behl, I.; Calado, G.; Malkin, A.; Flint, S.; Galvin, S.; Healy, C.M.; Pimentel, M.L.; Byrne, H.J.; Lyng, F.M. A pilot study for early detection of oral premalignant diseases using oral cytology and Raman micro-spectroscopy: Assessment of confounding factors. J. Biophotonics 2020, 13, e202000079. [Google Scholar] [CrossRef]
  29. Senapati, S.; Kaushik, A.; Rajan; Singh, A.; Gupta, I.; Virkar, R.; Kulkarni, S.S.; Arankalle, V.; Singh, J.P. Rapid, On-Site SARS-CoV-2 Variant Detection and Differentiation Using GLAD-Pristine Silver Nanorod Arrays and Machine Learning-Enhanced SERS. ACS Omega 2025, 10, 44978–44988. [Google Scholar] [CrossRef] [PubMed]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.