You are currently on the new version of our website. Access the old version .
DiagnosticsDiagnostics
  • Article
  • Open Access

3 May 2025

Leveraging Subjective Parameters and Biomarkers in Machine Learning Models: The Feasibility of lnc-IL7R for Managing Emphysema Progression

,
,
,
,
,
,
,
,
1
Division of Pulmonary Medicine, Department of Internal Medicine, Shuang Ho Hospital, Taipei Medical University, New Taipei City 23561, Taiwan
2
Division of Pulmonary Medicine, Department of Internal Medicine, School of Medicine, College of Medicine, Taipei Medical University, Taipei 11031, Taiwan
3
TMU Research Center for Thoracic Medicine, Taipei Medical University, Taipei 11031, Taiwan
4
Graduate Institute of Clinical Medicine, College of Medicine, Taipei Medical University, Taipei 11031, Taiwan
This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics

Abstract

Background/Objectives: Chronic obstructive pulmonary disease (COPD) remains a leading cause of death worldwide, with emphysema progression providing valuable insights into disease development. Clinical assessment approaches, including pulmonary function tests and high-resolution computed tomography, are limited by accessibility constraints and radiation exposure. This study, therefore, proposed an alternative approach by integrating the novel biomarker long non-coding interleukin-7 receptor α-subunit gene (lnc-Il7R), along with other easily accessible clinical and biochemical metrics, into machine learning (ML) models. Methods: This cohort study collected baseline characteristics, COPD Assessment Test (CAT) scores, and biochemical details from the enrolled participants. Associations with emphysema severity, defined by a low attenuation area percentage (LAA%) threshold of 15%, were evaluated using simple and multivariate-adjusted models. The dataset was then split into training and validation (80%) and test (20%) subsets. Five ML models were employed, with the best-performing model being further analyzed for feature importance. Results: The majority of participants were elderly males. Compared to the LAA% <15% group, the LAA% ≥15% group demonstrated a significantly higher body mass index (BMI), poor pulmonary function, and lower expression levels of lnc-Il7R (all p < 0.01). Fold changes in lnc-IL7R were strongly and negatively associated with LAA% (p < 0.01). The random forest (RF) model achieved the highest accuracy and area under the receiver operating characteristic curve (AUROC) across datasets. A feature importance analysis identified lnc-IL7R fold changes as the strongest predictor for emphysema classification (LAA% ≥15%), followed by CAT scores and BMI. Conclusions: Machine learning models incorporated accessible clinical and biochemical markers, particularly the novel biomarker lnc-IL7R, achieving classification accuracy and AUROC exceeding 75% in emphysema assessments. These findings offer promising opportunities for improving emphysema classification and COPD management.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.