Next Article in Journal
First Glance at Myeloid Leukaemia Factor 2 in Cardiomyocytes
Previous Article in Journal
Vericiguat Therapy Is Associated with Reverse Myocardial Remodeling in Chronic Heart Failure with Reduced Ejection Fraction
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Subclinical Atrial Fibrillation Prediction in Patients with CIED by a Novel Deep Learning Framework

1
Department of Cardiovascular Diseases, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
2
Shanghai Artificial Intelligence Laboratory, Shanghai 200233, China
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
J. Cardiovasc. Dev. Dis. 2026, 13(1), 18; https://doi.org/10.3390/jcdd13010018 (registering DOI)
Submission received: 12 November 2025 / Revised: 15 December 2025 / Accepted: 23 December 2025 / Published: 30 December 2025
(This article belongs to the Section Cardiovascular Clinical Research)

Abstract

Background: Subclinical atrial fibrillation (SCAF), a key risk factor for cryptogenic stroke, is difficult to predict with current tools. This study aimed to develop a novel deep learning framework, ResKAN-Attention, using only routine clinical data to predict SCAF in patients with cardiac implantable electronic device (CIED). Methods: In this retrospective study, the ResKAN-Attention model was developed using 27 routine parameters from 124 CIED patients without prior AF. This framework features a dual-path architecture combining a Kolmogorov–Arnold Network (KAN) with a traditional multilayer perceptron, fused via a cross-attention mechanism. The model’s performance was evaluated against common baselines using five-fold cross-validation, while its decision-making process was assessed through interpretability analysis. A clinically applicable risk scoring system was subsequently derived via knowledge distillation. Results: Over a 12-month follow-up period, SCAF occurred in 31.5% of patients (39/124). The ResKAN-Attention model significantly outperformed all baseline models, achieving a mean AUC of 0.837 in cross-validation and 0.788 in external validation. Interpretability analysis identified left atrial diameter (LAD), gender, lactate dehydrogenase, BMI, and hypertension as top predictors. The simplified risk score exhibited excellent predictive power (AUC 0.882), retaining 99.1% of the complex model’s performance on the fifth fold validation set. Conclusions: The ResKAN-Attention model demonstrated promising preliminary results for SCAF prediction with enhanced interpretability. The distilled risk score provided a potential method for early risk stratification in clinical settings, demonstrating that advanced artificial intelligence (AI) can effectively predict complex cardiovascular events using readily available data.

Graphical Abstract

1. Introduction

SCAF, characterized by asymptomatic rapid atrial arrhythmias, is increasingly recognized as a significant cause of cryptogenic stroke [1,2]. Previous studies have revealed a high prevalence of atrial high-rate episodes (AHREs) in elderly patients recorded by CIED [3]. Most AHREs are caused by atrial fibrillation, while device-detected SCAF has been proven to be a predictor of future clinical AF [4]. However, the lack of reliable clinical tools to predict SCAF occurrence, together with the impracticality of long-term wearable monitoring in asymptomatic populations, hinders timely intervention and outcome improvement. Consequently, developing a simple and effective clinical prediction model for early identification of SCAF is critical.
The emergence of AI technologies offers new possibilities for significantly expanding the diagnostic and prognostic potential of clinical tools [5]. In recent years, the application of deep learning in medicine has advanced significantly, evolving from multilayer perceptron (MLP) to residual networks (ResNet), Transformers, and the recently introduced KAN. As a novel architecture, the KAN places learnable activation functions on network edges, replacing fixed functions, and abandoning linear weight matrices in favor of parameterized univariate spline functions. This enhances nonlinear modeling capabilities, enabling the precise capture of complex data patterns [6].
In this study, we developed a new deep learning framework, termed ResKAN-Attention, which integrates demographic, echocardiographic, and biochemical parameters to predict the risk of SCAF. We further distilled the model into a simplified score via knowledge distillation and externally validated it in an independent cohort, demonstrating its strong potential for practical clinical application.

2. Methods

2.1. Patient Enrollment and Clinical Data Collection

This study retrospectively enrolled consecutive patients who underwent permanent dual-chamber pacemaker implantation at our institution from August 2023 to February 2024 and completed a routine 12-month follow-up in the CIED clinic. A total of 237 patients were initially enrolled in this study. Follow-up CIED clinic visits were scheduled at 1, 3, 6, and 12 months post-implantation. All arrhythmia data recorded by CIED were collected during visits. Any patient with a CIED-detected AHRE with irregular A-A intervals lasting >30 s was considered to have SCAF. AF-related symptoms were also collected. Ninety-one patients who were previously diagnosed with paroxysmal AF or developed symptoms related to AHRE after CIED implantation were excluded from the study. Twenty-two patients with significant cardiomyopathy who received cardiac resynchronization therapy (CRT) or implantable cardioverter-defibrillator (ICD) implantation were also excluded from the final analysis (Figure 1). Clinical features before CIED implantation of the remaining 124 patients were then investigated. A total of 27 parameters were collected, including both demographic and clinical characteristics. Demographic characteristics included gender, age, and body mass index (BMI). Clinical characteristics encompassed medical history (hypertension, diabetes, coronary heart disease, valvular heart disease), biochemical markers (NT-proBNP, hsTnI, CRP, Myoglobin, CK-MB, lactate dehydrogenase (LDH), Lp(a), LDL-C, serum creatinine, eGFR calculated by simplified MDRD formula), standard transthoracic echocardiographic parameters (left atrial diameter [LAD], left ventricular ejection fraction [LVEF], left ventricular end-diastolic diameter [LVEDD], left ventricular end-diastolic volume [LVEDV], interventricular septal [IVS] thickness, left ventricular posterior wall [LVPW] thickness), as well as tissue Doppler imaging parameters (septal E/e′, lateral E/e′).

2.2. Data Preprocessing

To address missing data and class imbalance, we employed IterativeImputer for missing value imputation [7], followed by log1p transformation for skewed features (NT-proBNP, CRP) and StandardScaler for normalization [8]. To prevent data leakage, all preprocessing steps were fitted exclusively on training data within each cross-validation fold. Synthetic minority over-sampling technique (SMOTE) was applied only to training sets to balance the SCAF-positive and SCAF-negative classes [9]. Detailed preprocessing parameters are listed in Table S1.

2.3. Model Development

The ResKAN-Attention model employs a dual-pathway architecture comprising a ResKAN pathway utilizing Kolmogorov–Arnold Network layers for flexible nonlinear modeling and an MLP pathway for conventional feature extraction. Outputs from both pathways are integrated through a cross-attention fusion module to generate the final SCAF risk prediction [10]. The complete architecture is illustrated in Figure 2.

2.4. Baseline Models

To comprehensively evaluate the performance of the proposed ResKAN-Attention model, we constructed a series of classic models as baselines. These include traditional machine learning models: Logistic Regression, XGBoost, Random Forest, CatBoost, LightGBM, K-Nearest Neighbors (KNN), Gaussian NB and Support Vector Machine (SVM). Standard deep learning models: MLP, ResNet, Transformer-based FT-Transformer model [11], and KAN. The hyperparameter search spaces for all baseline models are detailed in Table S2.

2.5. Model Training and Evaluation

All deep learning models were implemented in PyTorch and optimized using AdamW [12]. Early stopping was applied to prevent overfitting. Five-fold cross-validation was performed, with AUC as the primary metric and accuracy, precision, recall, and F1 score as secondary metrics. Hyperparameters were optimized via grid search. Statistical significance between models was assessed using paired-samples t-tests and Cohen’s d effect sizes [13]. All experiments were conducted in Python 3.9; the ResKAN-Attention model hyperparameters are presented in Table S3.

2.6. Model Interpretability Analysis

SHAP (SHapley Additive exPlanations) was applied to identify global feature importance and LIME (Local Interpretable Model-agnostic Explanations) for patient-specific interpretations [14,15]. Additionally, activation patterns in the KAN layers were visualized to reveal the nonlinear relationships learned by the model, providing further insight into key clinical predictors of SCAF.

2.7. Knowledge Distillation for Clinical Risk Scoring

To enhance clinical applicability, knowledge distillation was employed to transfer knowledge from the ResKAN-Attention teacher model to a logistic regression student model [16]. The distillation process focused on the top 10 features identified by SHAP analysis. Specific distillation parameters and loss functions are provided in Table S4. The resulting student model converts the linear risk score into a SCAF probability, with a threshold of 0.5 for high-risk classification.

2.8. External Validation

We prospectively enrolled 35 patients who underwent permanent dual-chamber pacemaker implantation at our institution between December 2024 and March 2025, with a SCAF prevalence of 28.6% (10/35 patients). All patients routinely completed a 6-month follow-up visit in the CIED clinic, ensuring comprehensive post-implantation monitoring. Clinical data were collected in alignment with the primary cohort, encompassing the same 27 parameters, and preprocessed consistently with the training dataset to maintain comparability.

3. Results

3.1. Patient Characteristics

According to the pacemaker recordings, SCAF was detected in 39 out of 124 patients (31.5%), while the remaining 85 patients (68.5%) comprised the non-SCAF group. The mean age was 72.9 ± 7.8 years in the SCAF group and 71.5 ± 12.0 years in the non-SCAF group, with male proportions of 35.9% and 63.5%, respectively. The proportion of patients with a history of severe mitral valve disease was significantly higher in the SCAF group than in the non-SCAF group (12.8% vs. 1.2%), while no significant difference was found in all the other comorbidities between the two groups. For biochemical markers, NT-proBNP level was significantly higher in the SCAF group (926.2 ± 1271.3 pg/mL) compared to the non-SCAF group (393.8 ± 620.0 pg/mL). Left atrial diameter (LAD) measured through transthoracic echocardiography was 43.9 ± 5.0 mm in the SCAF group and 40.3 ± 4.1 mm in the non-SCAF group, which was statistically significant. The detailed data and comparison of clinical characteristics between the two groups are shown in Table 1. During the 12-month follow-up, no stroke or systemic embolic events were documented in either group.

3.2. Model Performance

To explore the best model for the prediction of SCAF, both common machine learning models and deep learning models were attempted. Among traditional machine learning models, the SVM model exhibited the best overall performance, with an AUC of 0.753 ± 0.134, accuracy (ACC) of 0.742 ± 0.071, and F1 score of 0.624 ± 0.093. In the deep learning baseline models, KAN model performed best, achieving an AUC of 0.825 ± 0.037, ACC of 0.735 ± 0.058, and F1 score of 0.589 ± 0.096 (Table 2).
The performance of the ResKAN-Attention model was then analyzed. In the five-fold cross-validation, the model achieved the highest average AUC of 0.837 ± 0.057 (Table 2, Figure 3A,B) with low standard deviation, suggesting its accuracy and stability (Figure 3C). We then analyzed the effect size of the performance difference using Cohen’s d. A large effect size was revealed compared to most baseline models, further indicating the superiority of the model (Figure 3D).
The peak performance of the ResKAN-Attention model was observed in the fifth cross-validation fold. In this specific fold, it achieved a validation set AUC of 0.891, demonstrating excellent discriminative ability (Figure 4A). The corresponding confusion matrix detailed the classification results, correctly identifying 4 true positives and 15 true negatives (Figure 4B). The histogram of predicted probabilities for this fold further illustrates the model’s capacity to effectively separate SCAF-positive from SCAF-negative cases (Figure 4C).
In the external validation cohort, the ResKAN-Attention model demonstrated robust generalization, achieving an AUC of 0.788 (Supplementary Figure S3). Additional performance metrics included accuracy of 0.800, precision of 0.667, recall of 0.600, and F1-score of 0.632.

3.3. Model Interpretability Analysis

To elucidate the decision-making process and further explore the potential risk factors of SCAF, interpretability analysis was conducted at three distinct levels.
Global feature importance analysis using SHAP was initially applied to identify the key predictors of SCAF (Figure 5). Among the 27 features, LAD exhibited the strongest predictive contribution, with its SHAP value significantly surpassing that of other features. Following LAD, the most influential features were gender, LDH, BMI, and hypertension.
We applied the LIME method to generate local attribution explanations in several representative patients, elucidating the model’s case-specific prediction logic (Supplementary Figure S1). Elevated LAD was shown to be the dominant factor in most predictions, while the contribution of other parameters seemed to vary in different cases. Active neurons were also extracted from the KAN input layer to visualize their responses in comparison with the top eight most important features identified by SHAP analysis (Supplementary Figure S2). Each single input parameter elicited diverse responses in different neurons, as no specific tendency suitable for all the clinical features could be identified. All these analyses suggested that SCAF prediction was complex and multifactorial, demonstrating the necessity of applying AI-assisted models.

3.4. Development and Validation of Clinical SCAF Risk Scoring

In an attempt to achieve a simplified, clinically applicable predictive tool for SCAF, we transferred the predictive capability of the complex ResKAN-Attention model to a simplified logistic regression model based on the top 10 key clinical features selected according to their SHAP contribution scores, with the assistance of knowledge distillation. The linear SCAF Risk Score is calculated as follows:
S C A F   R i s k   S c o r e = 0.407314 × L A D ( m m ) 1.810896 × M a l e   g e n d e r   +   2.026237 × l n ( L D H ( I U / L ) + 1 ) 0.128719 × B M I ( k g / m 2 )     0.571839 × H y p e r t e n s i o n 0.439865 × L V P W   t h i c k n e s s ( m m )     0.000892 × e G F R ( m L / m i n / 1.73   m 2 ) + 0.005358 × S e p t a l E / e     0.023019 × L V E D V ( m L ) 1.335999 × C o r o n a r y   h e a r t   d i s e a s e     17.815156
The probability of SCAF was subsequently calculated. On the fifth-fold validation set, this formula achieved an AUC of 0.882, closely aligning with the teacher model’s AUC of 0.891, with an AUC retention rate of 99.1%. The correlation coefficient between the predicted probabilities of the two models was 0.9091, with a mean absolute error of only 0.1164, demonstrating the effectiveness of the distillation process. The formula achieved an AUC of 0.788 on the external validation set (Supplementary Figure S3).
These results indicate that the simplified clinical risk score maintains high predictive performance while significantly enhancing its applicability in clinical settings.

4. Discussion

In this study, we introduced the ResKAN-Attention model, a novel deep learning framework designed to predict the occurrence of SCAF within one year using routinely collected clinical parameters. By leveraging a dual-path architecture that integrates the KAN with MLP and a cross-attention fusion mechanism, our model demonstrated promising predictive performance compared to existing models in this pilot cohort. The incorporation of SHAP and LIME techniques enhances interpretability, providing both global and individual-level explanations of predictions. Furthermore, through knowledge distillation, we derived a simplified, clinically applicable formula that maintains predictive accuracy while being practical for resource-constrained settings. These findings suggest that this approach may help address gaps in SCAF prediction.
Electrocardiogram (ECG) data have been widely applied for the prediction of AF. Although typical ECG-based models utilizing standard 12-lead ECG or Holter monitoring as data input have shown high accuracy [5,17,18], they face limitations in monitoring continuity and are especially impractical for the detection of SCAF. Wearable devices employing photoplethysmography (PPG) have emerged as an alternative, yet their signal instability and impracticality for population-wide monitoring pose significant challenges [19,20]. In contrast, the ResKAN-Attention model utilizes widely available clinical parameters, such as echocardiographic measurements and basic biochemical markers. As all of the parameters could be routinely collected in elderly patients, our approach attempted to offer broader clinical applicability and reduce dependence on specialized monitoring devices. However, risk stratification may vary over time, which requires frequent adjustment to ensure the credibility of the model [21]. Whether our model could maintain its prediction capability after 12 months and a better time interval for the reassessment of all the parameters remains to be clarified in future studies.
The interpretability analysis not only elucidated the feature importance underlying AI algorithm predictions but also provided clinically relevant insights into SCAF. Firstly, several key findings were consistent with established clinical knowledge, which reinforced the biological plausibility of our model. LAD emerged as the most critical predictor of SCAF across all analyses, aligning with the clinical understanding of atrial pressure and remodeling as key drivers of AF pathogenesis [22]. LDH, another important predictor in our model, has also been reported to be positively related to AF occurrence in several studies [23]. Conversely, some results challenge conventional clinical assumptions. For instance, while epidemiological data suggest a higher prevalence of symptomatic AF in males [24], our model identified higher SCAF risk in females. This discrepancy may reflect differences in symptom tolerance, as females potentially under-report AF-related symptoms, leading to a higher proportion of undetected SCAF. Similar unexpected findings were observed for BMI and hypertension [25], suggesting that single-factor predictions may be insufficient for SCAF. These observations underscore the necessity of multifactorial, data-driven models like ResKAN-Attention to capture complex interactions among clinical variables. NT-proBNP, despite being significantly elevated in the SCAF group, was not among the final top 10 predictors. This suggests that NT-proBNP may primarily reflect the atrial pressure state at the time of measurement, with limited predictive efficacy for long-term arrhythmic events. All these findings emphasize the value of comprehensive feature integration in this model. Further studies are needed to validate these hypotheses and elucidate their underlying mechanisms.
The ResKAN-Attention model was built on recent advancements in AI, particularly the KAN architecture, which offers distinct advantages over traditional neural networks. By placing learnable activation functions on network edges and replacing linear weight matrices with parameterized univariate spline functions, KAN enhances nonlinear modeling capabilities [6]. This enables the model to capture intricate patterns in complex, feature-diverse datasets, such as those encountered in cardiology. The dual-path architecture, combining KAN with MLP and a cross-attention mechanism, effectively integrates linear and nonlinear data relationships, resulting in a more robust and accurate predictive framework compared to single-model architectures like ResNet or Transformer models.
Despite these promising results, our study has several limitations. As a single-center study, the model was developed and validated using data from one institution, which may limit its generalizability. The study population was restricted to patients with CIED, who were at higher risk of AHRE. The model’s performance in other patient groups remains to be validated. Future multicenter studies in diverse populations and with prospective validation are essential to confirm the model’s broader applicability and robustness.

5. Conclusions

In conclusion, the ResKAN-Attention model exhibited promising preliminary results for SCAF prediction with enhanced interpretability and potential clinical applicability. By utilizing routine clinical parameters and a novel AI architecture, our model provided a framework for SCAF risk stratification. This framework represents a potential tool that warrants further investigation for improving the early detection and management of SCAF in clinical practice.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/jcdd13010018/s1, Table S1: Data Preprocessing Parameters; Table S2: Baseline Models Hyperparameter Search Space; Table S3: ResKAN-Attention Training Hyperparameters; Table S4: Knowledge Distillation Parameters; Figure S1: LIME local interpretability analysis; Figure S2: Visualization of learned activation functions within the KAN model; Figure S3: External Validation ROC Curve: ResKAN-Attention Model and Distilled Clinical Formula; TRIPOD Checklist: Prediction Model Development.

Author Contributions

Conceptualization, Y.L., C.L. (Chengze Lin) and Y.X.; methodology, Y.L., C.L. (Chengze Lin), N.Z. and Y.X.; model, Y.L. and C.L. (Chengze Lin); validation, Q.J., Q.L., Y.W., Y.B., C.L. (Changjian Lin) and W.P.; formal analysis, Y.L. and C.L. (Chengze Lin); investigation, N.Z. and Q.C.; resources, L.W. and K.C.; data curation, Y.L., C.L. (Chengze Lin), N.Z., Y.X. and Q.C.; writing—original draft preparation, Y.L.; writing—review and editing, Y.L. and C.L. (Chengze Lin); visualization, C.L. (Chengze Lin); supervision, L.W., K.C. and Y.X.; project administration, L.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by National Key R&D Program of China (Grant No. 2022ZD0160704), National Natural Science Foundation of China (Grant Number: 82100329, 8227020136) and Medical-Engineering Interdisciplinary Research Fund of Shanghai Jiao Tong University (Grant No. YG2022ZD003).

Institutional Review Board Statement

This study was approved by the Ethics Committee of Ruijin Hospital affiliated with Shanghai Jiao Tong University School of Medicine (2024 LinLunShen No. 220) on 3 December 2023. Data were anonymized prior to analysis.

Informed Consent Statement

Patient consent was waived because this study was a retrospective investigation involving only the collection and analysis of anonymized clinical data from patients, presenting no more than minimal risk to the subjects. In accordance with the regulations of our institution’s Ethics Committee, approval for the waiver of patient informed consent was obtained from the Institutional Review Board/Ethics Committee. (Ethics Approval No: 2024 LinLunShen No. 220).

Data Availability Statement

The code is available at https://github.com/LCZ0824/ResKan-Attention (accessed on 4 August 2025) to protect patient privacy, the dataset for this study is not publicly available but can be obtained from the corresponding author upon reasonable request (xy11992@rjh.com.cn).

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
SCAFSubclinical atrial fibrillation
CIEDcardiac implantable electronic device
KANKolmogorov–Arnold Network
LADleft atrial diameter
AHREatrial high-rate episode
MLPmultilayer perceptrons
ResNetresidual networks
LDHlactate dehydrogenase
SHAPShapley Additive exPlanations
LIMELocal Interpretable Model-Agnostic Explanations

References

  1. Healey, J.S.; Connolly, S.J.; Gold, M.R.; Israel, C.W.; Gelder, I.C.V.; Capucci, A.; Lau, C.P.; Fain, E.; Yang, S.; Bailleul, C.; et al. Subclinical Atrial Fibrillation and the Risk of Stroke. N. Engl. J. Med. 2012, 366, 120–129. [Google Scholar] [CrossRef] [PubMed]
  2. Patel, S.M.; Ruff, C.T. Subclinical atrial fibrillation and anticoagulation: Weighing the absolute risks and benefits. Circulation 2024, 149, 989–992. [Google Scholar] [CrossRef]
  3. Miyazawa, K.; Pastori, D.; Li, Y.-G.; Székely, O.; Shahid, F.; Boriani, G.; Lip, G.Y.H. Atrial high rate episodes in patients with cardiac implantable electronic devices: Implications for clinical outcomes. Clin. Res. Cardiol. 2019, 108, 1034–1041. [Google Scholar] [CrossRef]
  4. Van Gelder, I.C.; Rienstra, M.; Bunting, K.V.; Casado-Arroyo, R.; Caso, V.; Crijns, H.J.; De Potter, T.J.; Dwight, J.; Guasti, L.; Hanke, T. 2024 ESC Guidelines for the management of atrial fibrillation developed in collaboration with the European Association for Cardio-Thoracic Surgery (EACTS) Developed by the task force for the management of atrial fibrillation of the European Society of Cardiology (ESC), with the special contribution of the European Heart Rhythm Association (EHRA) of the ESC. Endorsed by the European Stroke Organisation (ESO). Eur. Heart J. 2024, 45, 3314–3414. [Google Scholar] [PubMed]
  5. Attia, Z.I.; Noseworthy, P.A.; Lopez-Jimenez, F.; Asirvatham, S.J.; Deshmukh, A.J.; Gersh, B.J.; Carter, R.E.; Yao, X.; Rabinstein, A.A.; Erickson, B.J.; et al. An artificial intelligence-enabled ECG algorithm for the identification of patients with atrial fibrillation during sinus rhythm: A retrospective analysis of outcome prediction. Lancet 2019, 394, 861–867. [Google Scholar] [CrossRef] [PubMed]
  6. Liu, Z.; Wang, Y.; Vaidya, S.; Ruehle, F.; Halverson, J.; Soljačić, M.; Hou, T.; Tegmark, M. KAN: Kolmogorov-arnold networks. arXiv 2024, arXiv:240419756. [Google Scholar]
  7. Ming, C.; Lee, G.J.; Teo, Y.H.; Teo, Y.N.; Toh, E.M.; Li, T.Y.; Guo, C.Y.; Ding, J.; Zhou, X.; Teoh, H.L. Machine Learning Modeling to Predict Atrial Fibrillation Detection in Embolic Stroke of Undetermined Source Patients. J. Pers. Med. 2024, 14, 534. [Google Scholar] [CrossRef]
  8. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  9. Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
  10. Gheini, M.; Ren, X.; May, J. Cross-attention is all you need: Adapting pretrained transformers for machine translation. arXiv 2021, arXiv:2104.08771. [Google Scholar] [CrossRef]
  11. Gorishniy, Y.; Rubachev, I.; Khrulkov, V.; Babenko, A. Revisiting deep learning models for tabular data. Adv. Neural Inf. Process. Syst. 2021, 34, 18932–18943. [Google Scholar]
  12. Loshchilov, I.; Hutter, F. Decoupled weight decay regularization. arXiv 2017, arXiv:1711.05101. [Google Scholar]
  13. Cohen, J. Statistical Power Analysis for the Behavioral Sciences; Routledge: Oxfordshire, UK, 2013. [Google Scholar]
  14. Lundberg, S.M.; Lee, S.-I. A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems 30, Proceedings of 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA, 4–9 December 2017; Guyon, I., von Luxburg, U., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2017. [Google Scholar]
  15. Garreau, D.; Luxburg, U. Explaining the explainer: A first theoretical analysis of LIME. In Proceedings of the International Conference on Artificial Intelligence and Statistics, Sicily, Italy, 26–28 August 2020; pp. 1287–1296. [Google Scholar]
  16. Hinton, G.; Vinyals, O.; Dean, J. Distilling the knowledge in a neural network. arXiv 2015, arXiv:1503.02531. [Google Scholar] [CrossRef]
  17. Raghunath, S.; Pfeifer, J.M.; Ulloa-Cerna, A.E.; Nemani, A.; Carbonati, T.; Jing, L.; vanMaanen, D.P.; Hartzel, D.N.; Ruhl, J.A.; Lagerman, B.F.; et al. Deep Neural Networks Can Predict New-Onset Atrial Fibrillation From the 12-Lead ECG and Help Identify Those at Risk of Atrial Fibrillation–Related Stroke. Circulation 2021, 143, 1287–1298. [Google Scholar] [CrossRef] [PubMed]
  18. Choi, J.; Kim, J.Y.; Cho, M.S.; Kim, M.; Kim, J.; Oh, I.-Y.; Cho, Y.; Lee, J.H. Artificial intelligence predicts undiagnosed atrial fibrillation in patients with embolic stroke of undetermined source using sinus rhythm electrocardiograms. Heart Rhythm 2024, 21, 1647–1655. [Google Scholar] [CrossRef] [PubMed]
  19. Guo, Y.; Wang, H.; Zhang, H.; Liu, T.; Liang, Z.; Xia, Y.; Yan, L.; Xing, Y.; Shi, H.; Li, S. Mobile photoplethysmographic technology to detect atrial fibrillation. J. Am. Coll. Cardiol. 2019, 74, 2365–2375. [Google Scholar] [CrossRef]
  20. Steinhubl, S.R.; Waalen, J.; Edwards, A.M.; Ariniello, L.M.; Mehta, R.R.; Ebner, G.S.; Carter, C.; Baca-Motes, K.; Felicione, E.; Sarich, T.; et al. Effect of a Home-Based Wearable Continuous ECG Monitoring Patch on Detection of Undiagnosed Atrial Fibrillation: The mSToPS Randomized Clinical Trial. JAMA 2018, 320, 146–155. [Google Scholar] [CrossRef]
  21. Compagnucci, P.; D’Onofrio, A.; Vitulano, G.; Calo’, L.; Bertini, M.; Santini, L.; Savarese, G.; Santobuono, V.E.; Lavalle, C.; Viscusi, M. Clinical value of multiple sensors contributing to a composite heart failure ICD monitoring index. Eur. Heart J. 2024, 45, ehae666.3540. [Google Scholar] [CrossRef]
  22. Henry, W.L.; Morganroth, J.; Pearlman, A.S.; Clark, C.E.; Redwood, D.R.; Itscoitz, S.B.; Epstein, S.E. Relation between echocardiographically determined left atrial size and atrial fibrillation. Circulation 1976, 53, 273–279. [Google Scholar] [CrossRef]
  23. Hua, C.; Liu, P.; Zhang, M.; Niu, J.; Wang, L.; Zhong, G. Low-normal free triiodothyronine as a predictor of post-operative atrial fibrillation after surgical coronary revascularization. J. Thorac. Dis. 2024, 16, 1191. [Google Scholar] [CrossRef]
  24. Westerman, S.; Wenger, N. Gender differences in atrial fibrillation: A review of epidemiology, management, and outcomes. Curr. Cardiol. Rev. 2019, 15, 136–144. [Google Scholar] [CrossRef] [PubMed]
  25. Joglar, J.A.; Chung, M.K.; Armbruster, A.L.; Benjamin, E.J.; Chyou, J.Y.; Cronin, E.M.; Deswal, A.; Eckhardt, L.L.; Goldberger, Z.D.; Gopinathannair, R. 2023 ACC/AHA/ACCP/HRS guideline for the diagnosis and management of atrial fibrillation: A report of the American College of Cardiology/American Heart Association Joint Committee on Clinical Practice Guidelines. J. Am. Coll. Cardiol. 2024, 83, 109–279. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Flowchart illustrating the patient selection process for this study.
Figure 1. Flowchart illustrating the patient selection process for this study.
Jcdd 13 00018 g001
Figure 2. Schematic diagram of the ResKAN-Attention model architecture.
Figure 2. Schematic diagram of the ResKAN-Attention model architecture.
Jcdd 13 00018 g002
Figure 3. Performance comparison of the ResKAN-Attention model with common models. (A) Boxplot comparing the distribution of Area Under the Curve (AUC) scores. The central line, box, and whiskers represent the median, interquartile range, and minimum/maximum values, respectively. (B) Heatmap of the fold-specific AUC score for each model. (C) Scatterplot of mean AUC versus standard deviation. (D) Bar chart of Cohen’s d. Asterisks denote statistical significance.
Figure 3. Performance comparison of the ResKAN-Attention model with common models. (A) Boxplot comparing the distribution of Area Under the Curve (AUC) scores. The central line, box, and whiskers represent the median, interquartile range, and minimum/maximum values, respectively. (B) Heatmap of the fold-specific AUC score for each model. (C) Scatterplot of mean AUC versus standard deviation. (D) Bar chart of Cohen’s d. Asterisks denote statistical significance.
Jcdd 13 00018 g003
Figure 4. Performance of the ResKAN-Attention model in the fifth cross-validation fold. (A) ROC curves from five-fold cross-validation of the ResKAN-Attention model. (B) Confusion matrix for the fifth cross-validation fold, detailing true and false classifications. (C) Histogram of predicted probabilities.
Figure 4. Performance of the ResKAN-Attention model in the fifth cross-validation fold. (A) ROC curves from five-fold cross-validation of the ResKAN-Attention model. (B) Confusion matrix for the fifth cross-validation fold, detailing true and false classifications. (C) Histogram of predicted probabilities.
Jcdd 13 00018 g004
Figure 5. Global feature importance derived from SHapley Additive exPlanations analysis of the ResKAN-Attention model.
Figure 5. Global feature importance derived from SHapley Additive exPlanations analysis of the ResKAN-Attention model.
Jcdd 13 00018 g005
Table 1. Baseline characteristics of study population.
Table 1. Baseline characteristics of study population.
CharacteristicsTotalSCAF GroupNon-SCAF Groupp-Value
(N = 124)(N = 39, 31.5%)(N = 85, 68.5%)
Demographics
  Age (years)72.0 ± 10.872.9 ± 7.871.5 ± 12.00.43
  Male gender 68 (54.8)14 (35.9)54 (63.5)0.006
  BMI (kg/m2)24.0 ± 3.723.6 ± 4.224.1 ± 3.40.47
Comorbidities
  Hypertension84 (67.7)24 (61.5)60 (70.6)0.41
  Diabetes mellitus38 (30.6)10 (25.6)28 (32.9)0.53
  Coronary heart disease 33 (26.6)6 (15.4)27 (31.8)0.08
  Aortic valve disease6 (4.8)2 (5.1)4 (4.7)1
  Mitral valve disease6 (4.8)5 (12.8)1 (1.2)0.01
  Tricuspid valve disease 9 (7.3)5 (12.8)4 (4.7)0.14
Biochemical Parameters
  NT-proBNP (pg/mL) 564.0 ± 911.1926.2 ± 1271.3393.8 ± 620.00.02
  hsTnI (pg/mL)135.7 ± 1224.125.9 ± 56.3183.3 ± 1465.30.33
  CRP (mg/L)10.2 ± 23.24.0 ± 6.213.1 ± 27.30.03
  Myoglobin (ng/mL)31.8 ± 21.129.3 ± 14.632.9 ± 23.30.3
  CK-MB (ng/mL)1.8 ± 1.31.8 ± 1.51.8 ± 1.30.98
  LDH (IU/L)218.3 ± 73.0224.1 ± 50.5215.6 ± 81.30.5
  Lp(a) (g/L)0.2 ± 0.20.3 ± 0.20.2 ± 0.20.33
  LDL-C (mmol/L)2.3 ± 0.92.4 ± 0.92.2 ± 0.90.29
  Serum creatinine (µmol/L)83.1 ± 25.683.5 ± 29.882.9 ± 23.60.92
  eGFR (mL/min/1.73 m2)74.9 ± 17.871.7 ± 19.176.4 ± 17.10.2
Echocardiographic Parameters
  Left atrial diameter (mm)41.5 ± 4.743.9 ± 5.040.3 ± 4.1<0.001
  LVEF (%)65.2 ± 7.065.2 ± 5.065.3 ± 7.80.99
  LVEDD (mm)49.5 ± 6.048.2 ± 5.250.1 ± 6.20.08
  LVEDV (mL)118.2 ± 32.1112.4 ± 28.6120.7 ± 33.40.18
  IVS thickness (mm) 10.1 ± 1.710.0 ± 1.910.1 ± 1.70.89
  LVPW thickness (mm)9.1 ± 1.18.9 ± 1.39.2 ± 1.10.34
  Septal E/e′ 12.4 ± 6.513.8 ± 6.511.8 ± 6.50.14
  Lateral E/e′9.0 ± 3.69.9 ± 4.48.5 ± 3.10.09
Data are presented as mean ± SD for continuous variables and as number (percentage) for categorical variables. Statistical significance was tested using Student’s t-test for continuous variables and Fisher’s exact test for categorical variables. p < 0.05 was considered statistically significant.
Table 2. Performance comparison of machine learning models.
Table 2. Performance comparison of machine learning models.
ModelAUCACCPrecisionRecallF1
Traditional Machine Learning
  Logistic Regression0.7395 ± 0.12960.7110 ± 0.09760.5406 ± 0.10560.6750 ± 0.20310.5897 ± 0.1344
  XGBoost0.6668 ± 0.10610.6943 ± 0.05700.5107 ± 0.08960.4179 ± 0.15590.4529 ± 0.1185
  Random Forest0.7084 ± 0.10800.7017 ± 0.04010.5589 ± 0.10790.4107 ± 0.04920.4644 ± 0.0228
  CatBoost0.7782 ± 0.08900.6860 ± 0.04360.5000 ± 0.07770.3393 ± 0.14850.3881 ± 0.1273
  LightGBM 0.6893 ± 0.08920.6943 ± 0.05700.5107 ± 0.08960.4179 ± 0.15590.4529 ± 0.1185
  KNN0.7124 ± 0.05690.7013 ± 0.10690.5466 ± 0.13360.6679 ± 0.09560.5920 ± 0.1041
  Gaussian NB0.6191 ± 0.11880.5393 ± 0.09210.3411 ± 0.05120.5214 ± 0.30630.3752 ± 0.1388
  SVM0.7529 ± 0.13420.7417 ± 0.07070.6052 ± 0.12790.7000 ± 0.20310.6239 ± 0.0931
Deep Learning
  MLP0.7311 ± 0.06350.7183 ± 0.04800.5750 ± 0.10000.4429 ± 0.15710.4857 ± 0.1092
  ResNet0.8004 ± 0.04830.7823 ± 0.05950.6819 ± 0.10090.5643 ± 0.12510.5643 ± 0.1251
  FT-Transformer0.7987 ± 0.06090.7190 ± 0.09300.5483 ± 0.10090.7000 ± 0.21790.6027 ± 0.1351
  KAN0.8250 ± 0.03670.7347 ± 0.05760.5829 ± 0.07950.6179 ± 0.15300.5885 ± 0.0959
Advanced Deep Learning Model
  ResKAN-Attention0.8370 ± 0.05720.7503 ± 0.11640.6603 ± 0.21220.6393 ± 0.05910.6333 ± 0.1124
Data are presented as mean ± standard deviation.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lan, Y.; Lin, C.; Zhang, N.; Cao, Q.; Jin, Q.; Luo, Q.; Wei, Y.; Bao, Y.; Lin, C.; Pan, W.; et al. Subclinical Atrial Fibrillation Prediction in Patients with CIED by a Novel Deep Learning Framework. J. Cardiovasc. Dev. Dis. 2026, 13, 18. https://doi.org/10.3390/jcdd13010018

AMA Style

Lan Y, Lin C, Zhang N, Cao Q, Jin Q, Luo Q, Wei Y, Bao Y, Lin C, Pan W, et al. Subclinical Atrial Fibrillation Prediction in Patients with CIED by a Novel Deep Learning Framework. Journal of Cardiovascular Development and Disease. 2026; 13(1):18. https://doi.org/10.3390/jcdd13010018

Chicago/Turabian Style

Lan, Yongying, Chengze Lin, Ning Zhang, Qing Cao, Qi Jin, Qingzhi Luo, Yue Wei, Yangyang Bao, Changjian Lin, Wenqi Pan, and et al. 2026. "Subclinical Atrial Fibrillation Prediction in Patients with CIED by a Novel Deep Learning Framework" Journal of Cardiovascular Development and Disease 13, no. 1: 18. https://doi.org/10.3390/jcdd13010018

APA Style

Lan, Y., Lin, C., Zhang, N., Cao, Q., Jin, Q., Luo, Q., Wei, Y., Bao, Y., Lin, C., Pan, W., Chen, K., Wu, L., & Xie, Y. (2026). Subclinical Atrial Fibrillation Prediction in Patients with CIED by a Novel Deep Learning Framework. Journal of Cardiovascular Development and Disease, 13(1), 18. https://doi.org/10.3390/jcdd13010018

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop