Simple Summary
Cervical lymph node status is critical when planning surgery for papillary thyroid carcinoma, yet most tools treat the BRAF V600E mutation as a simple yes/no finding. We asked whether the amount of BRAF (reported as a percentage in fine-needle aspiration samples) provides more useful, patient-specific information. In a single-center cohort, we measured BRAF abundance preoperatively and linked it with surgical pathology, using flexible models that allow for a relationship where risk increases as abundance rises rather than a fixed cut-off. We found a steep rise in nodal risk at around 20.7% BRAF abundance, followed by a near-plateau, and built an easy-to-understand risk tool that combines BRAF with routine ultrasound features and demographics. This approach may help clinicians discuss options before surgery, focus attention on higher-risk patients, and avoid unnecessary procedures in lower-risk cases. Prospective, multi-center validation is still needed.
Abstract
Objectives: Papillary thyroid carcinoma (PTC) frequently presents with cervical lymph node metastasis (CLNM), yet preoperative tools often encode BRAF V600E as a binary variable, potentially overlooking information contained in mutation abundance. We sought to quantify the dose–response relationship between BRAF V600E abundance and CLNM and to develop an interpretable model for preoperative risk stratification. Methods: We performed a single-center retrospective study of consecutive PTC patients who underwent preoperative BRAF V600E testing and surgery from 2019 to 2023. Patients were randomly split 70/30 into training and test sets. Candidate predictors included clinical and ultrasound features and BRAF V600E abundance. We used multivariable logistic regression and restricted cubic splines (RCS) to assess nonlinearity and compared six machine-learning algorithms (LR, KNN, SVM, XGB, LightGBM, and NN). Model performance was evaluated by F1, AUC, calibration, and decision-curve analyses; SHAP aided interpretation. Ethics approval: SYSKY-2024-169-01. Results: The cohort included 667 patients; CLNM occurred in 391 (58.6%). CLNM cases had higher BRAF abundance (median 23% vs. 17%) and characteristic clinical/sonographic differences. RCS revealed a nonlinear association between abundance and CLNM, with a steep risk rise of up to ~20.7% followed by a plateau. Among six algorithms, XGBoost showed the best validation performance (AUC 0.752; F1 0.73). SHAP indicated that maximum tumor diameter, BRAF abundance, age, and microcalcifications contributed most to predictions. Conclusions: Modeling BRAF V600E as a quantitative abundance—rather than a binary status—improves preoperative CLNM risk assessment in PTC. An interpretable XGBoost model integrating abundance with routine features demonstrates acceptable discrimination and potential clinical utility for individualized surgical planning and counseling.
1. Introduction
Thyroid cancer is the most common malignant tumor of the endocrine system, among which papillary thyroid carcinoma (PTC) accounts for more than 80% [,]. Although PTC generally has a favorable prognosis (approximately 90% 10-year overall survival), occult cervical lymph node metastasis (CLNM) or distant metastasis can still lead to adverse outcomes [,]. As a key prognostic factor, CLNM significantly increases the risk of recurrence and reduces treatment efficacy [,,,], and its occurrence is closely associated with patient prognosis []. Accordingly, timely and accurate prediction of CLNM risk is crucial for optimizing individualized treatment and improving outcomes [,]. Against this background, developing an effective and readily interpretable early-identification model to target populations at high risk for CLNM has become one of the core objectives for improving long-term efficacy.
In recent years, progress has been made in predicting CLNM risk in PTC; however, models based on ultrasonography and conventional clinical parameters remain limited: they often rely on single predictors, offer insufficient discriminative accuracy and clinical decision support, and fail to meet the demands of precise preoperative evaluation [,,]. Machine learning (ML), capable of integrating multidimensional data, has shown value in disease prediction—particularly in risk assessment for PTC [,,]—and multiple models have been constructed based on clinical and imaging features [,]. Nevertheless, many studies have not effectively incorporated genomic data, or have encoded gene mutations simply as binary variables (present/absent), overlooking the potential information contained in mutation abundance, which has resulted in suboptimal performance and inconsistent findings [,,].
Among these genetic alterations, BRAF V600E is one of the most common mutations in PTC, carried by approximately 45–60% of patients and closely associated with tumor initiation, progression, and adverse prognosis [,,]. At the same time, some studies have not identified a significant association with prognosis; such work often remains at the level of mutation presence/absence without further examination of mutation abundance, limiting understanding of tumor heterogeneity and mutational burden [,,]. As an important indicator for quantifying heterogeneity and mutational burden, mutation abundance is expected to provide a more refined delineation of risk; however, existing explorations are often constrained by small sample sizes, limited evaluative dimensions, and a lack of interpretable predictive models [,,,].
This study aims to integrate BRAF V600E mutation abundance with clinical characteristics to evaluate its clinical value in PTC: (i) to elucidate the dose–response relationship between BRAF V600E mutation abundance and CLNM and to propose a clinically actionable risk threshold; and (ii) to develop and validate an interpretable ML predictive tool that combines BRAF mutation abundance with clinical variables, thereby improving the accuracy and operability of preoperative CLNM assessment in PTC and providing theoretical rationale and practical guidance for individualized treatment planning.
2. Materials and Methods
2.1. Data Sources
This study was approved by the Ethics Committee of Sun Yat-sen Memorial Hospital and conducted in accordance with the Declaration of Helsinki. Consecutive patients hospitalized in the Department of Thyroid Surgery between 1 September 2019 and 1 September 2023 who underwent preoperative fine-needle aspiration (FNA) with BRAF testing and were postoperatively confirmed as papillary thyroid carcinoma (PTC) were screened.
The primary outcome was any cervical lymph node metastasis (CLNM), encompassing the central (level VI) and/or lateral compartments (levels II–V). Pathology served as the reference standard for compartments that underwent dissection (routine CND; selective LND when indicated).
Inclusion criteria: (i) preoperative BRAF V600E testing with available mutation abundance; (ii) postoperative paraffin pathology confirming PTC; (iii) preoperative ultrasound (US) evaluation of tumor characteristics; (iv) definitive surgery with complete postoperative pathology. Exclusion criteria: (i) concomitant malignancies; (ii) incomplete preoperative imaging/genetic data or incomplete pathology; (iii) prior thyroid surgery. All data were de-identified; the requirement for informed consent was waived due to the retrospective use of anonymized data. To ensure data completeness, cases with incomplete records were excluded at pre-enrollment screening; a variable-by-variable audit of the final analytic cohort confirmed 0% missingness for the outcome and all candidate predictors. Accordingly, no imputation or complete-case/sensitivity analyses were performed, and all analyses were conducted on the complete dataset. The overall study workflow is shown in Figure 1.
Figure 1.
Study design flow chart.
2.2. Clinical Characteristics and Data Processing
Preoperative predictors and outcome. Preoperative variables included demographics; US features (margins, hypoechogenicity, microcalcifications, aspect ratio, CDFI grade, US-suspected cervical lymph node enlargement, US-defined multifocality, US-suspected capsular invasion/ETE, TI-RADS); maximum tumor diameter (cm); and BRAF V600E mutation abundance (%) from FNA. CLNM was defined as metastasis in the central (VI) and/or lateral (II–V) compartments, with postoperative pathology as the reference standard for compartments dissected (CND for all; LND selectively).
2.3. NGS
Ultrasound-guided fine-needle aspiration (FNA) was performed on the index (most suspicious) nodule, with two passes per nodule, deliberately sampling distinct regions (quadrants when feasible) to mitigate intratumoral heterogeneity. In multifocal (including bilateral) disease, one of the most suspicious nodules per side was sampled as the index lesion, and other foci were not routinely aspirated. Part of the aspirate was submitted for routine cytopathology and the remainder was immediately transferred to a DNA stabilization tube. Genomic DNA was extracted and quantified on a Qubit fluorometer. Library preparation (fragmentation, end-repair, and adapter ligation) was carried out according to standard procedures, and sequencing was performed on an Illumina MiniSeq platform. Targeted next-generation sequencing was restricted to the BRAF locus (V600E hotspot), using a single-gene capture design optimized for cytology specimens. A minimum on-target depth of ≥500× was required to ensure analytical sensitivity for variant detection. Bioinformatic processing included quality control, alignment to the hg19 reference genome, and variant calling with a GATK-based workflow, followed by functional annotation using ANNOVAR (version:2024Jun02). Variant interpretation was curated against thyroid cancer–related resources. All testing was conducted in the hospital’s molecular diagnostics laboratory under established quality assurance procedures
2.4. Surgical Procedures
Thyroid surgery was performed by experienced thyroid surgeons using either an open or an endoscopic approach, according to surgeon expertise and patient preference. The extent of surgery followed American Thyroid Association recommendations and our institutional protocol. In this cohort, the typical procedure consisted of unilateral lobectomy with isthmusectomy combined with routine central compartment lymph node dissection (CND, level VI). Lateral neck dissection (LND) was not performed routinely and was undertaken selectively when preoperative ultrasound suggested lateral nodal involvement and/or when intraoperative assessment indicated lateral disease.
2.5. Model Development and Validation
To predict the risk of cervical lymph node metastasis (CLNM) in PTC patients, we compared six representative algorithms spanning complementary families suitable for tabular clinical data—linear (logistic regression), distance-based (k-nearest neighbors), margin-based non-linear (RBF-SVM), tree-based ensembles (XGBoost, LightGBM), and a shallow neural network—to ensure diversity of inductive biases and capture both linear and non-linear effects. To optimize the predictive models, based on 10 rounds of 10-fold cross-validation and the default hyperparameter grid of the “caret” package, the final hyperparameters for each model were obtained on the optimal feature subset. Finally, the models were refit on the training set with the optimal feature subset and the final hyperparameters (based on 10 rounds of 10-fold internal cross-validation).
2.6. Statistical Analysis
Patients were randomly allocated 7:3 to training and validation sets; between-set differences were assessed using χ2 or Fisher’s exact tests for categorical variables, and non-normal data were summarized as median (IQR), while the t-test or the rank-sum test was used for continuous variables. An independent regression method was used to screen independent risk factors for cervical lymph node metastasis. Model performance was evaluated using receiver operating characteristic (ROC) curves and calibration curves, and decision curve analysis (DCA) was subsequently performed to determine the threshold of net benefit, including the area under the receiver operating characteristic curve (AUC), accuracy, precision, sensitivity, specificity, and F1 score. Final model selection was determined a priori by test-set performance to minimize overfitting, prioritizing F1 and then AUC. The model with the highest F1 score was considered the best-performing model; if F1 scores were identical, AUC values were compared. Calibration curves and DCA curves visualized the net benefit of the models under different thresholds. SHapley additive explanations (SHAP) values were used to determine the contribution of each feature to the final model. All analyses were performed in R (version 4.3.2; R Foundation for Statistical Computing). Reproducibility was ensured by setting a fixed random seed (1234) prior to data partitioning and model training.
3. Results
3.1. Baseline Clinical Information
Among 667 patients, females accounted for 75.7% and CLNM occurred in 391 (58.6%). The mean maximum tumor diameter was 1.08 cm (SD 0.73), and the mean BRAF V600E abundance was 21% (SD 12). Compared with the non-CLNM group, the CLNM group showed larger tumors (1.25 vs. 0.85 cm) and higher abundance (23% vs. 18%; both p < 0.001), and younger age (39 vs. 42 years, p < 0.001), and patients with microcalcifications, multifocality, and lymph node enlargement were more common in the CLNM group (all p < 0.05); meanwhile, the incidence of capsular invasion was also higher (p = 0.009). Details are shown in Table 1.
Table 1.
Baseline characteristics of 667 patients with papillary thyroid carcinoma.
3.2. Independent Risk Factors
Using stepwise regression, we ultimately identified seven factors independently associated with CLNM: irregular margins, microcalcifications, lymph node enlargement, lesion group, maximum tumor diameter, BRAF V600E mutation abundance, and age. The specific odds ratios (ORs) are shown in Figure 2 (Table S1).
Figure 2.
Forest plot of multivariable logistic regression analysis.
3.3. Dose–Response Relationship
Based on the above stepwise regression results, we further used restricted cubic splines (RCS) to examine the dose–response relationships between the above continuous variables and CLNM risk, including BRAF V600E mutation abundance, maximum tumor diameter, and age. We used RCS for nonlinear testing to analyze the quantitative response relationships (Figure 3). The results showed a significant nonlinear relationship between BRAF V600E mutation abundance and cervical lymph node metastasis (overall p < 0.001, nonlinearity p = 0.027). Specifically, when mutation abundance increased from a low level to a moderate level (20.7%), the risk of cervical lymph node metastasis rose rapidly; when mutation abundance increased further, the risk began to flatten or decline. Maximum lesion diameter and age in PTC patients showed linear relationships with cervical lymph node metastasis (overall p < 0.05, nonlinearity p > 0.05), with risk thresholds of 0.8 (cm) and 40 (years), respectively.
Figure 3.
Restricted cubic spline (RCS) depiction of the association between (A) BRAF V600E mutation abundance, (B) maximum tumor diameter, and (C) age and the odds of cervical lymph node metastasis (CLNM). The solid line shows the estimated odds ratio; the shaded band denotes the 95% confidence interval. Light-gray histograms indicate the distribution of each predictor in the study cohort.
3.4. Model Development and Performance Comparison
After determining the key variables, we constructed six machine learning (ML) models for performance comparison, namely LR, KNN, SVM, XGB, LightGBM, and NN (Figure 4). ROC curves and AUCs were computed for the training and validation sets (Figure 4A,B); F1 scores (the harmonic mean of sensitivity and precision) were also calculated, and F1 scores were prioritized when comparing the discriminative performance of the six algorithms, followed by AUC values. In the training set, LightGBM performed best (AUC = 0.894, 95% CI 0.866–0.922), with sensitivity and specificity of 0.89 and 0.66, respectively; it was followed by XGBoost (AUC = 0.848, 95% CI 0.814–0.883), with an F1 score of 0.84. In the test set, XGBoost performed best (AUC = 0.752, 95% CI 0.686–0.818), with sensitivity and specificity of 0.78 and 0.49, respectively, and an F1 score of 0.73 (Table S2). Therefore, XGBoost (version 1.7.11.1), which performed best in the test set, was ultimately chosen as the optimal predictive model for subsequent clinical risk stratification and decision support. In addition, we used calibration curves (Figure 4C) to evaluate the accuracy of the predicted probabilities, and decision curve analysis (Figure 4D) showed that XGBoost had higher net benefit across clinically relevant thresholds.
Figure 4.
Performance of ML models for predicting CLNM in PTC. (A) ROC—training set; (B) ROC—test set; (C) calibration curves on the test set; the dashed 45° line indicates perfect calibration; (D) decision-curve analysis (DCA) across threshold probabilities 0.00–0.80, showing standardized net benefit.
3.5. Model Interpretation
We used SHAP to interpret the output of the final model by calculating the contribution of each variable to the prediction. The SHAP summary dot plot (Figure 5A) shows that larger tumor diameter, higher mutation abundance, younger age, and the presence of microcalcifications significantly increased the risk of CLNM in PTC. The SHAP summary bar chart (Figure 5B), which uses mean SHAP values to evaluate feature contributions and displays them in descending order, indicates that maximum lesion diameter, BRAF V600E mutation abundance, age, and microcalcifications are the four most important features in the predictive model. Irregular margins showed an independent association in the multivariable model but contributed less to the SHAP ranking (Figure 5B), likely due to partial collinearity with other sonographic primitives. SHAP dependence plots illustrate how individual features affect the model’s output. In Figure 5D, the actual values of these seven features are compared with the SHAP values; higher SHAP values indicate higher predicted risk of lymph node metastasis. For example, larger tumor diameter and younger age push the prediction toward cervical lymph node metastasis; the presence of microcalcifications, multifocal lesions, and lymph node enlargement also promotes a positive prediction for CLNM. For BRAF V600E mutation abundance, when the abundance is <10%, most SHAP values are below zero, indicating lower risk; when the abundance is 10–20%, SHAP values increase rapidly; subsequently, as abundance continues to increase, SHAP values remain greater than zero, indicating a sustained and significant positive effect of high abundance on the risk of CLNM, with a certain plateau phase.

Figure 5.
SHAP-based interpretation of the final model at the global and local levels: (A) summary dot plot; (B) summary bar plot; (C) waterfall plot; (D) dependence plots.
In addition, the SHAP waterfall plot (Figure 5C) shows the contribution of each feature to the model’s prediction of CLNM for a randomly selected PTC patient (Case No. 52). The specific values of each feature and their corresponding SHAP values indicate the positive and negative effects of that feature on the prediction. By accumulating SHAP values, the waterfall plot intuitively presents the formation process of the prediction for a specific patient, helping us to gain deeper insight into the model’s decision-making mechanism.
4. Discussion
This study, based on 667 patients with papillary thyroid carcinoma (PTC), integrated BRAF V600E mutation abundance with clinical data and developed multiple machine learning models to predict the risk of cervical lymph node metastasis (CLNM). The results showed that BRAF V600E mutation abundance was significantly higher in the CLNM group than in the non-metastatic group (median 23% vs. 17%, p < 0.001), suggesting a close association between higher mutation abundance and lymph node metastasis. This finding indicates that, beyond the binary “presence/absence” of BRAF V600E, its abundance—as a continuous variable—more effectively reflects tumor biology. Among the six models compared, XGBoost performed best, with an AUC of 0.848 in the training set and the highest F1 score of 0.73 and AUC of 0.75 in the test set, outperforming traditional logistic regression and other algorithms. In view of the combined performance on F1 and AUC, XGBoost was ultimately selected to build the predictive model. Notably, RCS analysis revealed a nonlinear dose–response relationship between BRAF V600E mutation abundance and CLNM risk: as abundance increased from a low level to approximately 20.7%, the risk of lymph node metastasis rose rapidly, whereas beyond this threshold the risk tended to plateau or even slightly decline. This “threshold effect” indicates that the driving effect of BRAF mutation on metastasis risk is most pronounced at intermediate abundance, whereas very high abundance may reflect different biological behavior. Mechanistically, the plateau at very high BRAF V600E abundance may reflect a ceiling (saturation) of MAPK/ERK pathway output with adaptive negative feedback once a dominant mutant clone is established, such that further increases in mutant-allele burden add little incremental invasiveness []. An alternative, not mutually exclusive interpretation is clonal sweep with reduced subclonal diversity; despite sustained MAPK activation, the loss of competing subclones may not further elevate the propensity for nodal dissemination []. Together, these hypotheses can be tested by multi-omic profiling (e.g., co-mutations such as TERT, copy-number burden, and immune contexture) and by external validation in prospective cohorts. By incorporating BRAF V600E mutation abundance as a continuous variable into the model and quantifying its nonlinear effect, this study achieved superior performance and provided new biological insights for risk assessment of CLNM in PTC compared with conventional approaches.
Prior studies on the association between BRAF V600E and PTC aggressiveness have yielded inconsistent conclusions: most reports link BRAF positivity to adverse outcomes such as lymph node metastasis, whereas some studies found no significant association, leaving the conclusions controversial [,,,,]. This controversy may stem from viewing BRAF mutation solely as a binary variable (present or absent), without considering differences in intratumoral mutant clonal burden. The innovation of this study lies in introducing the concept of mutation abundance, using a continuous variable to reflect tumor heterogeneity and mutational burden, thereby enabling a more refined evaluation of the impact of BRAF mutation on metastasis []. Our results confirm that high-abundance BRAF V600E significantly increases the risk of CLNM and establishes an approximate risk threshold of 20.7%, indicating that BRAF V600E mutation abundance may need to accumulate to a certain level before disease occurrence is promoted [,]. This echoes prior evidence indicating that higher BRAF V600E mutation abundance is linked to more aggressive behavior []. However, Wang et al. considered the risk to increase linearly with abundance, without an obvious inflection point; in contrast, our RCS analysis revealed a significant nonlinear relationship, suggesting a sharp risk surge at intermediate abundance and a plateau at high abundance. In addition, Abdulhaleem et al. found in a study of PTC that tumors with aggressive pathological features had a mean BRAF V600E abundance as high as 25.8%, significantly higher than 10.3% in non-aggressive tumors, and that patients with lymph node metastasis had markedly higher mutation abundance (approximately 29% vs. 17%) []. Although they did not delineate the dose–response relationship of BRAF V600E mutation abundance, they proposed that BRAF mutation abundance could serve as an indicator for preoperative assessment of tumor aggressiveness and determination of surgical extent, which is consistent with our findings. Prior paired-sample studies report high concordance of BRAF V600E between preoperative FNA and matched surgical specimens (~90–98%), supporting the reliability of FNA-based molecular testing for our abundance-based risk modeling [,]. Collectively, these results indicate that the presence of BRAF V600E alone does not necessarily lead to disease, and that the degree of mutation must also be considered. Differences in dose–response patterns across studies may result from variations in sample size and analytical methods; with validation in larger samples, the “threshold effect” of BRAF mutation abundance is expected to be further confirmed. In addition, by incorporating BRAF abundance into a multivariable machine learning model, this study markedly improved predictive performance, providing a useful complement to previous models that relied solely on clinical imaging features [,,,,,,,].
Beyond BRAF mutation abundance, the other key predictors identified by the model are consistent with PTC biology. By definition, BRAF V600E abundance represents the proportion of mutant alleles among the total measured alleles, comprehensively reflecting tumor cell clonal architecture and tumor purity; an increase indicates dominance of the mutant clone, with more cells persistently in an activated MAPK/ERK pathway state, thereby elevating overall invasive potential. Multiple studies support this mechanism: Huang et al. found in PTC patients with intermediate-to-high recurrence risk that for each 1% increase in BRAF V600E mutation abundance, the risk of recurrence increased by approximately 8%, and proposed a threshold of 28.2% for recurrence risk stratification; Schumm et al., in JCEM, reported a significant association between high mutation abundance and adverse pathological outcomes and emphasized its clinical reportability; Tatar et al. also showed that tumors with high mutation abundance are closely associated with more aggressive behavior and higher risk of lymph node metastasis [,,]. Consistent with these findings, we identified an abundance threshold of approximately 20.7% corresponding to a significant increase in CLNM risk, which constitutes a callback to the RCS conclusion in the first paragraph. In terms of interpretability, SHAP (SHapley Additive exPlanations, SHAP) bar plots showed that maximum tumor diameter, microcalcifications, multifocality, lymph node enlargement, and younger age were the most important contributors; their directions are concordant with findings from Liu et al., who reported in a multicenter prospective study that maximum diameter > 10 mm was a predictor of CLNM with OR = 1.81 (p < 0.001), and from Han et al., who reported that microcalcifications, age < 45 years, and multifocality increase the risk of lymphatic metastasis [,]. These points outline the significance of our study and its relation to prior work. Thus, the two tracks of performance and biology converge here, indicating that this continuous measure has practical stratification value.
Nevertheless, we acknowledge several limitations and uncertainties. First, this was a single-center retrospective study without an external validation set; its generalizability needs to be examined in prospective, multicenter cohorts. Because quantitative BRAF V600E mutation–abundance testing is embedded in our preoperative pathway, entry-into-testing selection bias and unmeasured confounding cannot be fully excluded. We plan cell-based BRAF V600E mutation abundance assays and a multicenter registry to validate/recalibrate the model and reassess the ~20% threshold/plateau. Second, the current model considered only the BRAF V600E mutation and did not integrate other potentially important molecular features, such as TERT mutations, RAS mutations, RET/PTC fusions, and immune markers; future work should expand toward multi-gene and multimodal model construction. Third, long-term follow-up has not yet been completed, and the model’s value in predicting postoperative recurrence, enabling dynamic monitoring, and assessing treatment response remains to be verified. Finally, although SHAP provides strong model interpretability, the causal mechanisms for some features require further mechanistic studies. The apparent plateau beyond ~20.7% is hypothesis-generating—possibly reflecting pathway saturation or immune surveillance—but requires external validation and mechanistic confirmation. We will investigate this in subsequent basic experiments, and jointly incorporate mutation abundance, radiomics, and immune transcriptomic features to build a multidimensional predictive framework integrating genomic data, artificial intelligence, and radiomics, with a prospective design to evaluate its potential for clinical decision support; this also echoes the earlier note that the “threshold effect” requires external validation.
This interpretable CLNM risk prediction model shows promise for clinical translation. Although the RCS curve delineates clinically intuitive risk bands for BRAF V600E mutation abundance—for example, <10% (low), 10–20.7% (intermediate), and ≥20.7% (high)—we regard these bands as aids for communication and stratification rather than prescriptive cut points. In practice, a low predicted risk (<10%) generally supports conservative cervical nodal management with routine surveillance; a high predicted risk (≥20.7%) would prompt consideration of formal preoperative cervical nodal mapping and multidisciplinary team (MDT) discussion, more often favoring operative management; and an intermediate estimate (10–20.7%) warrants individualized appraisal that integrates tumor size, ultrasound features (e.g., microcalcifications, suspicious nodes), cytology/Bethesda category, comorbidities, and patient preferences. These bands should be locally recalibrated using decision-curve analysis alongside local service constraints. Importantly, the model is an aid—not a directive—and the proposed bands remain exploratory pending external validation and mechanistic studies. In preoperative decision-making, the model helps identify high-risk individuals and thereby assists in formulating individualized surgical plans; given that the incidence of CLNM in PTC can be as high as 30–80%, and that preoperative imaging has limited sensitivity for detecting micrometastases [], performing a second operation after postoperative detection increases patient burden and the risk of complications [], making earlier risk stratification of practical significance. From the perspective of precision medicine, quantitative BRAF V600E abundance provides an individual with a molecular risk indicator, aligning with the concept of stratification by molecular features; moreover, the interpretability of the model (including SHAP plots) facilitates physician–patient communication by visually presenting the main factors driving individual risk, which helps achieve an actionable management plan. Overall, we prefer to regard this as an actionable starting point to be integrated in parallel with clinical experience and resource constraints, rather than as a predetermined single endpoint.
5. Conclusions
In summary, our study confirms a nonlinear dose–response relationship between BRAF V600E mutation abundance and the risk of lymph node metastasis in PTC, with an abundance threshold of approximately 20.7% being associated with a markedly increased metastasis risk; a machine learning model that combines mutation abundance with clinical features can effectively predict lymph node metastasis risk, with good interpretability and potential clinical applicability.
Supplementary Materials
The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/cancers17213562/s1, Supplementary Table S1 (DOCX): Independent risk factors for CLNM in PTC identified by multivariable logistic regression (odds ratio [OR] with 95% confidence intervals); Supplementary Table S2 (DOCX): Performance of six machine-learning models on the training and test sets (AUC, F1 score, accuracy, sensitivity, specificity, PPV, NPV).
Author Contributions
Conceptualization, M.L. and Y.Z.; methodology, Y.Y. and Y.S.; software, Y.Y.; validation, Y.Y. and Y.S.; formal analysis, Y.Y. and Y.S.; investigation, Y.S., A.M., Q.Z. and J.W.; data curation, Y.S., A.M., Q.Z. and J.W.; writing—original draft preparation, M.L.; writing—review and editing, M.L., Y.Y., Y.S., A.M., Q.Z., J.W. and Y.Z.; visualization, Y.Y.; supervision, Y.Z. and M.L.; funding acquisition, Y.Z. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the National Natural Science Foundation of China (NSFC, 82203329) and the Guangzhou Basic and Applied Basic Research Foundation (2024A04J10015). The APC was funded by institutional funds of the Department of Thyroid Surgery, The Sixth Affiliated Hospital of Sun Yat-sen University. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
Institutional Review Board Statement
The study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee of Sun Yat-sen Memorial Hospital of Sun Yat-sen University (protocol code SYSKY-2024-169-01; date of approval: 5 March 2024).
Informed Consent Statement
Patient consent was waived by the Ethics Committee because all data were anonymized and retrospectively collected.
Data Availability Statement
The analysis code and a synthetic demonstration dataset are openly available in the Open Science Framework (OSF) repository at: https://osf.io/n37ch/?view_only=32723588ee2f4c0aa7b5a5854003b4b3 (accessed on 17 September 2025). The demo dataset reproduces the analysis workflow and expected file structure; its outputs verify functionality and do not match the manuscript’s numerical results. The full de-identified patient-level dataset underlying the findings cannot be made publicly available due to institutional ethics and privacy restrictions. Access for editors and peer reviewers can be provided under controlled conditions in accordance with the approved protocol (IRB: SYSKY-2024-169-01). Subject to ethics approval and a data use agreement, qualified investigators may obtain the dataset upon reasonable request to the corresponding author.
Acknowledgments
Supported by National Key Clinical Discipline. We thank Gulilalai Abuduwaili for assistance with data curation and for encouragement during this work.
Conflicts of Interest
The authors declare no conflicts of interest.
Abbreviations
The following abbreviations are used in this manuscript:
| AUC | Area under the receiver operating characteristic curve |
| CDFI | Color Doppler flow imaging |
| CI | Confidence interval |
| CLNM | Cervical lymph node metastasis |
| DCA | Decision curve analysis |
| F1 | F1 score |
| FNA | Fine-needle aspiration |
| KNN | k-nearest neighbors |
| LightGBM | Light Gradient Boosting Machine |
| LR | Logistic regression |
| MAPK | Mitogen-activated protein kinase |
| ML | Machine learning |
| NN | Neural network |
| OR | Odds ratio |
| PTC | Papillary thyroid carcinoma |
| RAS | RAS proto-oncogene family |
| RET/PTC | RET proto-oncogene/Papillary Thyroid Carcinoma fusion |
| RCS | Restricted cubic splines |
| ROC | Receiver operating characteristic |
| SHAP | SHapley Additive exPlanations |
| SVM | Support vector machine |
| TI-RADS | Thyroid Imaging Reporting and Data System |
| XGB | Extreme Gradient Boosting (XGBoost) |
References
- Xu, B.; Ghossein, R. Genomic Landscape of Poorly Differentiated and Anaplastic Thyroid Carcinoma. Endocr. Pathol. 2016, 27, 205–212. [Google Scholar] [CrossRef]
- Boucai, L.; Zafereo, M.; Cabanillas, M.E. Thyroid Cancer: A Review. JAMA 2024, 331, 425–435. [Google Scholar] [CrossRef]
- Pizzato, M.; Li, M.; Vignat, J.; Laversanne, M.; Singh, D.; La Vecchia, C.; Vaccarella, S. The Epidemiological Landscape of Thyroid Cancer Worldwide: GLOBOCAN Estimates for Incidence and Mortality Rates in 2020. Lancet Diabetes Endocrinol. 2022, 10, 264–272. [Google Scholar] [CrossRef]
- Lundgren, C.I.; Hall, P.; Dickman, P.W.; Zedenius, J. Clinically Significant Prognostic Factors for Differentiated Thyroid Carcinoma: A Population-Based, Nested Case—Control Study. Cancer 2006, 106, 524–531. [Google Scholar] [CrossRef]
- Pastorello, R.; Valerio, E.; Lobo, A.; Maia, A.; Saieg, M. Do Thyroid Nodules That Arise in the Isthmus Have a Higher Risk of Malignancy? Cancer Cytopathol. 2020, 128, 520–522. [Google Scholar] [CrossRef]
- Haugen, B.R.; Alexander, E.K.; Bible, K.C.; Doherty, G.M.; Mandel, S.J.; Nikiforov, Y.E.; Pacini, F.; Randolph, G.W.; Sawka, A.M.; Schlumberger, M.; et al. 2015 American Thyroid Association Management Guidelines for Adult Patients with Thyroid Nodules and Differentiated Thyroid Cancer: The American Thyroid Association Guidelines Task Force on Thyroid Nodules and Differentiated Thyroid Cancer. Thyroid 2016, 26, 1–133. [Google Scholar] [CrossRef]
- Ruan, J.; Chen, Z.; Chen, S.; Zhang, J.; Wang, S.; Qi, M.; Fu, Y.; Zhou, J.; Zhang, Y.; Pang, M.; et al. Lateral Lymph Node Metastasis in Papillary Thyroid Microcarcinoma: A Study of 5241 Follow-Up Patients. Endocrine 2024, 83, 414–421. [Google Scholar] [CrossRef]
- Ito, Y.; Tomoda, C.; Uruno, T.; Takamura, Y.; Miya, A.; Kobayashi, K.; Matsuzuka, F.; Kuma, K.; Miyauchi, A. Preoperative Ultrasonographic Examination for Lymph Node Metastasis: Usefulness When Designing Lymph Node Dissection for Papillary Microcarcinoma of the Thyroid. World J. Surg. 2004, 28, 498–501. [Google Scholar] [CrossRef]
- Adam, M.A.; Pura, J.; Goffredo, P.; Dinan, M.A.; Reed, S.D.; Scheri, R.P.; Hyslop, T.; Roman, S.A.; Sosa, J.A. Presence and Number of Lymph Node Metastases Are Associated with Compromised Survival for Patients Younger Than Age 45 Years with Papillary Thyroid Cancer. J. Clin. Oncol. 2015, 33, 2370–2375. [Google Scholar] [CrossRef]
- Tian, H.; Yu, Z.; Dong, T.; Xie, Q.; Mu, Y.; Liao, W.; Ma, N. Risk Factors of Cervical Central Lymph Node Metastasis in Stage T1a Unifocal Papillary Thyroid Carcinoma. Sci. Rep. 2024, 14, 25577. [Google Scholar] [CrossRef]
- Lan, X.; Zhang, S.; Zhou, K. Lobectomy vs. Total Thyroidectomy for Papillary Thyroid Carcinoma with Lymph Node Metastasis. Radiother. Oncol. 2024, 192 (Suppl. S1), 46. [Google Scholar] [CrossRef]
- Li, M.; Liu, L.; Feng, L.; Li, Z.; Xu, Q.; Zhang, Y.; Zhang, F.; Feng, L.N. Prediction of Cervical Lymph Node Metastasis in Solitary Papillary Thyroid Carcinoma Based on Ultrasound Radiomics Analysis. Front. Oncol. 2024, 14, 1291767. [Google Scholar] [CrossRef]
- Yao, F.; Yang, Z.; Li, Y.; Chen, W.; Wu, T.; Peng, J.; Jiao, Z.; Yang, A. Real-World Evidence on the Sensitivity of Preoperative Ultrasound in Evaluating Central Lymph Node Metastasis of Papillary Thyroid Carcinoma. Front. Endocrinol. 2022, 13, 865911. [Google Scholar] [CrossRef]
- He, L.; Chen, X.; Hu, J.; Meng, Y.; Zhang, Y.; Chen, W.; Fan, Y.; Li, T.; Fang, J. Score Based on Contrast-Enhanced Ultrasound Predicts Central Lymph Node Metastasis in Papillary Thyroid Cancer. Front. Endocrinol. 2024, 15, 1336787. [Google Scholar] [CrossRef]
- Yu, J.; Deng, Y.; Liu, T.; Zhou, J.; Jia, X.; Xiao, T.; Zhou, S.; Li, J.; Guo, Y.; Wang, Y.; et al. Lymph Node Metastasis Prediction of Papillary Thyroid Carcinoma Based on Transfer Learning Radiomics. Nat. Commun. 2020, 11, 4807. [Google Scholar] [CrossRef]
- Foersch, S.; Glasner, C.; Woerl, A.-C.; Eckstein, M.; Wagner, D.-C.; Schulz, S.; Kellers, F.; Fernandez, A.; Tserea, K.; Kloth, M.; et al. Multistain Deep Learning for Prediction of Prognosis and Therapy Response in Colorectal Cancer. Nat. Med. 2023, 29, 430–439. [Google Scholar] [CrossRef]
- Moon, I.; LoPiccolo, J.; Baca, S.C.; Sholl, L.M.; Kehl, K.L.; Hassett, M.J.; Liu, D.; Schrag, D.; Gusev, A. Machine Learning for Genetics-Based Classification and Treatment Response Prediction in Cancer of Unknown Primary. Nat. Med. 2023, 29, 2057–2067. [Google Scholar] [CrossRef]
- Shen, P.; Yang, Z.; Sun, J.; Wang, Y.; Qiu, C.; Wang, Y.; Ren, Y.; Liu, S.; Cai, W.; Lu, H.; et al. Explainable Multimodal Deep Learning for Predicting Thyroid Cancer Lateral Lymph Node Metastasis Using Ultrasound Imaging. Nat. Commun. 2025, 16, 7052. [Google Scholar] [CrossRef]
- Saygili, E.S.; Elhassan, Y.S.; Prete, A.; Lippert, J.; Altieri, B.; Ronchi, C.L. Machine Learning–Based Survival Prediction Tool for Adrenocortical Carcinoma. J. Clin. Endocrinol. Metab. 2025, 110, e3185–e3192. [Google Scholar] [CrossRef]
- Yao, S.; Shen, P.; Dai, F.; Deng, L.; Qiu, X.; Zhao, Y.; Gao, M.; Zhang, H.; Zheng, X.; Yu, X.; et al. Thyroid Cancer Central Lymph Node Metastasis Risk Stratification Based on Homogeneous Positioning Deep Learning. Research 2024, 7, 0432. [Google Scholar] [CrossRef]
- Chen, B.D.; Zhang, Z.; Wang, K.K.; Shang, M.Y.; Zhao, S.S.; Ding, W.B.; Du, R.; Yu, Z.; Xu, X.M. A Multivariable Model of BRAF V600E and Ultrasonographic Features for Predicting the Risk of Central Lymph Node Metastasis in cN0 Papillary Thyroid Microcarcinoma. Cancer Manag. Res. 2019, 11, 7211–7217. [Google Scholar] [CrossRef]
- Shi, C.; Cao, J.; Shi, T.; Liang, M.; Ding, C.; Lv, Y.; Zhang, W.; Li, C.; Gao, W.; Wu, G.; et al. BRAF V600E Mutation, BRAF-Activated Long Non-Coding RNA and miR-9 Expression in Papillary Thyroid Carcinoma, and Their Association with Clinicopathological Features. World J. Surg. Oncol. 2020, 18, 194. [Google Scholar] [CrossRef]
- Xing, M. Genetic-Guided Risk Assessment and Management of Thyroid Cancer. Endocrinol. Metab. Clin. N. Am. 2019, 48, 109–124. [Google Scholar] [CrossRef]
- Kim, T.H.; Park, Y.J.; Lim, J.A.; Ahn, H.Y.; Lee, E.K.; Lee, Y.J.; Kim, K.W.; Hahn, S.K.; Youn, Y.K.; Kim, K.H.; et al. The Association of the BRAF(V600E) Mutation with Prognostic Factors and Poor Clinical Outcome in Papillary Thyroid Cancer: A Meta-Analysis. Cancer 2012, 118, 1764–1773. [Google Scholar] [CrossRef]
- Xing, M. BRAF Mutation in Papillary Thyroid Cancer: Pathogenic Role, Molecular Bases, and Clinical Implications. Endocr. Rev. 2007, 28, 742–762. [Google Scholar] [CrossRef]
- Zhang, J.; Xu, S. High Aggressiveness of Papillary Thyroid Cancer: From Clinical Evidence to Regulatory Cellular Networks. Cell Death Discov. 2024, 10, 378. [Google Scholar] [CrossRef]
- Ciarrocchi, A.; Cavuto, S.; Piana, S. BRAF V600E Mutation and Papillary Thyroid Cancer. JAMA 2013, 310, 534. [Google Scholar] [CrossRef]
- Gouveia, C.; Can, N.T.; Bostrom, A.; Grenert, J.P.; van Zante, A.; Orloff, L.A. Lack of Association of BRAF Mutation with Negative Prognostic Indicators in Papillary Thyroid Carcinoma: The University of California, San Francisco, Experience. JAMA Otolaryngol. Head Neck Surg. 2013, 139, 1164–1170. [Google Scholar] [CrossRef]
- Guerra, A.; Fugazzola, L.; Marotta, V.; Cirillo, M.; Rossi, S.; Cirello, V.; Forno, I.; Moccia, T.; Budillon, A.; Vitale, M. A High Percentage of BRAFV600E Alleles in Papillary Thyroid Carcinoma Predicts a Poorer Outcome. J. Clin. Endocrinol. Metab. 2012, 97, 2333–2340. [Google Scholar] [CrossRef]
- Vijg, J.; Dong, X. Pathogenic Mechanisms of Somatic Mutation and Genome Mosaicism in Aging. Cell 2020, 182, 12–23. [Google Scholar] [CrossRef]
- Liu, L.; Chang, J.W.; Jung, S.-N.; Park, H.S.; Oh, T.; Lim, Y.-C. Clinical Implications of the Extent of BRAFV600E Alleles in Patients with Papillary Thyroid Carcinoma. Oral Oncol. 2016, 62, 72–77. [Google Scholar] [CrossRef]
- Schumm, M.A.; Nikiforov, Y.E.; Nikiforova, M.N.; Wald, A.I.; Tseng, C.-H.; Smooke-Praw, S.; Wu, J.X.; Yeh, M.W.; Livhits, M.J. Association of BRAF V600E Allele Frequency with Clinicopathologic Outcomes in Papillary Thyroid Cancer. J. Clin. Endocrinol. Metab. 2025, 110, 2164–2171. [Google Scholar] [CrossRef]
- Lake, D.; Corrêa, S.A.L.; Müller, J. Negative feedback regulation of the ERK1/2 MAPK pathway. Cell. Mol. Life Sci. 2016, 73, 4397–4413. [Google Scholar] [CrossRef]
- McGranahan, N.; Swanton, C. Biological and Therapeutic Impact of Intratumor Heterogeneity in Cancer Evolution. Cancer Cell 2015, 27, 15–26. [Google Scholar] [CrossRef]
- Ying, H.; Ling, H.; Zhang, B.; Li, G.; Li, S. Risk Factors for Neck Nodal Metastasis in Papillary Thyroid Cancer with BRAF V600E Mutation. Front. Endocrinol. 2022, 13, 884428. [Google Scholar] [CrossRef]
- Henke, L.E.; Pfeifer, J.D.; Ma, C.; Perkins, S.M.; DeWees, T.; El-Mofty, S.; Moley, J.F.; Nussenbaum, B.; Haughey, B.H.; Baranski, T.J.; et al. BRAF Mutation Is Not Predictive of Long-Term Outcome in Papillary Thyroid Carcinoma. Cancer Med. 2015, 4, 791–799. [Google Scholar] [CrossRef]
- Chakraborty, A.; Narkar, A.A.; Mukhopadhyaya, R.; Kane, S.; D’Cruz, A.K.; Rajan, M.G.R. BRAF V600E Mutation in Papillary Thyroid Carcinoma: Significant Association with Node Metastases and Extrathyroidal Invasion. Endocr. Pathol. 2012, 23, 83–93. [Google Scholar] [CrossRef]
- Chan, T.A.; Yarchoan, M.; Jaffee, E.; Swanton, C.; Quezada, S.A.; Stenzinger, A.; Peters, S. Development of Tumor Mutation Burden as an Immunotherapy Biomarker: Utility for the Oncology Clinic. Ann. Oncol. 2019, 30, 44–56. [Google Scholar] [CrossRef]
- Itan, Y.; Shang, L.; Boisson, B.; Ciancanelli, M.J.; Markle, J.G.; Martinez-Barricarte, R.; Scott, E.; Shah, I.; Stenson, P.D.; Gleeson, J.; et al. The Mutation Significance Cutoff: Gene-Level Thresholds for Variant Predictions. Nat. Methods 2016, 13, 109–110. [Google Scholar] [CrossRef]
- Tatar, L.; Bandargal, S.; Pusztaszeri, M.P.; Forest, V.-I.; Hier, M.P.; Kouz, J.; Chowdhury, R.; Payne, R.J. Association Between BRAF V600E Allele Frequency and Aggressive Behavior in Papillary Thyroid Microcarcinoma. Cancers 2025, 17, 2553. [Google Scholar] [CrossRef]
- Abdulhaleem, M.; Bandargal, S.; Pusztaszeri, M.P.; Rajab, M.; Greenspoon, H.; Krasner, J.R.; Da Silva, S.D.; Forest, V.-I.; Payne, R.J. The Impact of BRAF V600E Mutation Allele Frequency on the Histopathological Characteristics of Thyroid Cancer. Cancers 2023, 16, 113. [Google Scholar] [CrossRef]
- Xing, M.; Clark, D.; Guan, H.; Ji, M.; Dackiw, A.; Carson, K.A.; Kim, M.; Tufaro, A.; Ladenson, P.; Zeiger, M.; et al. BRAF Mutation Testing of Thyroid Fine-Needle Aspiration Biopsy Specimens for Preoperative Risk Stratification in Papillary Thyroid Cancer. J. Clin. Oncol. 2009, 27, 2977–2982. [Google Scholar] [CrossRef]
- Fu, G.; Chazen, R.S.; MacMillan, C.; Witterick, I.J. Development of a Molecular Assay for Detection and Quantification of the BRAF Variation in Residual Tissue from Thyroid Nodule FNA Biopsy Specimens. JAMA Netw. Open 2021, 4, e2127243. [Google Scholar] [CrossRef]
- Yoon, J.; Lee, E.; Koo, J.S.; Yoon, J.-H.; Nam, K.-H.; Lee, J.; Jo, Y.S.; Moon, H.J.; Park, V.Y.; Kwak, J.Y. Artificial Intelligence to Predict the BRAFV600E Mutation in Patients with Thyroid Cancer. PLoS ONE 2020, 15, e0242806. [Google Scholar] [CrossRef]
- Lin, C.; Wang, D.; He, L.; Li, D.; Fu, Z.; Xue, S.; Su, X.; Zhou, J. Explainable Machine Learning Model for Predicting Paratracheal Lymph Node Metastasis in cN0 Papillary Thyroid Cancer. Sci. Rep. 2024, 14, 22361. [Google Scholar] [CrossRef]
- Huang, J.; Wang, J.; Xv, J.; Wang, J.; Wang, G.; Zhao, Y. Genetic Alterations and Allele Frequency of BRAF V600E and TERT Mutation in Papillary Thyroid Carcinoma with Intermediate-to-High Recurrence Risk: A Retrospective Study. Clin. Exp. Med. 2024, 24, 76. [Google Scholar] [CrossRef]
- Liu, J.; Jia, X.; Gu, Y.; Zhang, Y.; Huang, P.; Liu, X.; Wang, Z.; Liu, P.; Yang, W.; Peng, X.; et al. Thyroid Parenchyma Microcalcifications on Ultrasound for Predicting Lymph Node Metastasis in Papillary Thyroid Carcinoma: A Prospective Multicenter Study in China. Front. Oncol. 2021, 11, 609075. [Google Scholar] [CrossRef]
- Huang, J.; Li, Z.; Zhong, Q.; Fang, J.; Chen, X.; Zhang, Y.; Huang, Z. Developing and Validating a Multivariable Machine Learning Model for the Preoperative Prediction of Lateral Lymph Node Metastasis of Papillary Thyroid Cancer. Gland Surg. 2023, 12, 101–109. [Google Scholar] [CrossRef]
- Chen, L.; Zhu, Y.; Zheng, K.; Zhang, H.; Guo, H.; Zhang, L.; Wu, K.; Kong, L.; Ruan, W.; Hu, J.; et al. The Presence of Cancerous Nodules in Lymph Nodes Is a Novel Indicator of Distant Metastasis and Poor Survival in Patients with Papillary Thyroid Carcinoma. J. Cancer Res. Clin. Oncol. 2017, 143, 1035–1042. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).