Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (896)

Search Parameters:
Keywords = breast cancer classification

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
17 pages, 2614 KB  
Article
Exploring the Use of Functional Data for Binary Classifications: The Case of Tissue Doppler Imaging in Cardiotoxicity Related-Therapy Cardiac Dysfunction Detection
by Pablo Martínez-Camblor and Susana Díaz-Coto
Axioms 2026, 15(2), 120; https://doi.org/10.3390/axioms15020120 - 6 Feb 2026
Abstract
Functional data are nowadays routinely collected and stored in a wide variety of fields. Their adequate use and analysis are a challenge for the scientific community. Mathematically, each function can be understood as a sequence of infinite related numbers. Therefore, for statisticians, functional [...] Read more.
Functional data are nowadays routinely collected and stored in a wide variety of fields. Their adequate use and analysis are a challenge for the scientific community. Mathematically, each function can be understood as a sequence of infinite related numbers. Therefore, for statisticians, functional data can be read as a collection of a strongly correlated infinite-dimensional variable. Most existing statistical procedures have been adapted to functional data scenarios. In this manuscript, we are interested in understanding the use of functions for constructing adequate ROC curves and, therefore, for carrying out binary classifications. In particular, we consider the problem of studying the real capacity of functions derived from tissue doppler imaging (TDI) for identifying cardiac dysfunction related to cardiotoxicity therapy (CRTCD) in breast cancer women with high levels of the protein human epidermal growth factor receptor 2 (HER2). With this goal, we use public and freely available data that has been already used for illustrating the use of functional data in the binary classification problem with very different take-home messages. This variability in the conclusions made us question the reproducibility of the results. Here, we explore five different functional approaches, and we think about the clinical use of the provided solutions and their potential overfitting. The main aim of this manuscript is identifying whether published results are excessively optimistic or if they adequately capture the actual capacity of TDI for accurately diagnostic CRTCD. Full article
(This article belongs to the Special Issue Functional Data Analysis and Its Application)
Show Figures

Figure 1

26 pages, 5240 KB  
Article
Enhanced Assumption-Aware Linear Discriminant Analysis for the Wisconsin Breast Cancer Dataset: A Guide to Dimensionality Reduction and Prediction with Performance Comparable to Machine Learning Methods
by Vasiliki Pantoula, Vasileios Mandikas and Tryfon Daras
AppliedMath 2026, 6(2), 20; https://doi.org/10.3390/appliedmath6020020 - 3 Feb 2026
Viewed by 72
Abstract
The analysis of multivariate data is a central issue in biomedical research, where the accurate classification of patients and the extraction of reliable conclusions are of critical importance. Linear Discriminant Analysis (LDA) remains one of the most established methods for both dimensionality reduction [...] Read more.
The analysis of multivariate data is a central issue in biomedical research, where the accurate classification of patients and the extraction of reliable conclusions are of critical importance. Linear Discriminant Analysis (LDA) remains one of the most established methods for both dimensionality reduction and classification of data. In this paper, we examine in detail the theoretical foundations, assumptions, and statistical properties of LDA, and apply the method step by step to real data from the Breast Cancer Wisconsin (Diagnostic) database, which includes cellular features from breast biopsy samples with the aim of distinguishing benign from malignant tumors. Emphasis is placed on the importance of the method’s assumptions, such as multivariate normality, equality of covariance matrices, and absence of multicollinearity, demonstrating that their fulfillment leads to significant improvements in model performance. Specifically, careful preprocessing and strict adherence to these assumptions increase classification accuracy from 95.6% (94.7% cross-validated) to 97.8% (97.4% cross-validated). To our knowledge, this study is the first to demonstrate the dual use of LDA as both a dimensionality-reduction tool and a predictive classification model for this medical database within the same biomedical analysis framework. Moreover, we provide, for the first time, a systematic comparison between our assumption-aware LDA model and related studies employing the most accurate machine-learning classifiers reported in the literature for this dataset, showing that classical LDA achieves accuracy comparable to these more complex methods. The resulting discriminant model, which uses 13 variables out of the original 30, can be applied easily by clinical researchers to classify new cases as benign or malignant, while simultaneously providing interpretable coefficients that reveal the underlying relationships among variables. The implementation is carried out in the SPSS environment, following the theoretical steps described in the paper, thus offering a user-friendly and reproducible framework for reliable application. In addition, the study establishes a structured and transparent workflow for the proper application of LDA in biomedical research by explicitly linking assumption verification, preprocessing, dimensionality reduction, and classification. Full article
Show Figures

Figure 1

26 pages, 4105 KB  
Article
Robust Dual-Stream Diagnosis Network for Ultrasound Breast Tumor Classification with Cross-Domain Segmentation Priors
by Xiaokai Jiang, Xuewen Ding, Jinying Ma, Chunyu Liu and Xinyi Li
Sensors 2026, 26(3), 974; https://doi.org/10.3390/s26030974 - 2 Feb 2026
Viewed by 154
Abstract
Ultrasound imaging is widely used for early breast cancer screening to enhance patient survival. However, interpreting these images is inherently challenging due to speckle noise, low lesion-to-tissue contrast, and highly variable tumor morphology within complex anatomical structures. Additionally, variations in image characteristics across [...] Read more.
Ultrasound imaging is widely used for early breast cancer screening to enhance patient survival. However, interpreting these images is inherently challenging due to speckle noise, low lesion-to-tissue contrast, and highly variable tumor morphology within complex anatomical structures. Additionally, variations in image characteristics across institutions and devices further impede the development of robust and generalizable computer-aided diagnostic systems. To alleviate these issues, this paper presents a cross-domain segmentation prior guided classification strategy for robust breast tumor diagnosis in ultrasound imaging, implemented through a novel Dual-Stream Diagnosis Network (DSDNet). DSDNet adopts a decoupled dual-stream architecture, where a frozen segmentation branch supplies spatial priors to guide the classification backbone. This design enables stable and accurate performance across diverse imaging conditions and clinical settings. To realize the proposed DSDNet framework, three novel modules are created. The Dual-Stream Mask Attention (DSMA) module enhances lesion priors by jointly modeling foreground and background cues. The Segmentation Prior Guidance Fusion (SPGF) module integrates multi-scale priors into the classification backbone using cross-domain spatial cues, improving tumor morphology representation. The Mamba-Inspired Linear Transformer (MILT) block, built upon the Mamba-Inspired Linear Attention (MILA) mechanism, serves as an efficient attention-based feature extractor. On the BUSI, BUS, and GDPH_SYSUCC datasets, DSDNet achieves ACC values of 0.878, 0.836, and 0.882, and Recall scores of 0.866, 0.789, and 0.878, respectively. These results highlight the effectiveness and strong classification performance of our method in ultrasound breast cancer diagnosis. Full article
(This article belongs to the Section Biomedical Sensors)
Show Figures

Figure 1

43 pages, 2173 KB  
Review
The Complex Path from Mammary Ductal Hyperplasia to Breast Cancer: Elevated Malignancy Risk in Atypical Forms
by Bogdan-Alexandru Gheban, Lavinia Patricia Mocan, Adina Bianca Boșca, Rada Teodora Suflețel, Eleonora Dronca, Mihaela Elena Jianu, Carmen Crivii, Tudor Cristian Pașcalău, Mădălin Mihai Onofrei, Andreea Moise-Crintea and Alina Simona Șovrea
Biomedicines 2026, 14(2), 349; https://doi.org/10.3390/biomedicines14020349 - 2 Feb 2026
Viewed by 141
Abstract
Background: Mammary ductal hyperplasia represents a spectrum of benign proliferative breast lesions, some of which pose elevated risks for malignant transformation into ductal carcinoma in situ and invasive breast cancer. This narrative review explores why only specific types, particularly those with atypia, exhibit [...] Read more.
Background: Mammary ductal hyperplasia represents a spectrum of benign proliferative breast lesions, some of which pose elevated risks for malignant transformation into ductal carcinoma in situ and invasive breast cancer. This narrative review explores why only specific types, particularly those with atypia, exhibit higher progression potential, synthesizing epidemiologic, histopathologic, molecular, and environmental insights. Methods: We reviewed key literature from databases, including PubMed, focusing on classification, risk stratification, genetic/epigenetic mechanisms, tumor microenvironment dynamics, and modifiable factors influencing progression. Results: Benign breast lesions are categorized into non-proliferative, proliferative without atypia, and proliferative with atypia, such as atypical ductal hyperplasia and atypical lobular hyperplasia. Atypia represents a morphologic continuum toward low-grade ductal carcinoma in situ, driven by genetic alterations, epigenetic reprogramming, and changes in the tumor microenvironment, including stromal remodeling, immune infiltration, hypoxia-induced angiogenesis, and extracellular matrix degradation. Dietary factors, such as high-fat intake and obesity, exacerbate progression through inflammation, insulin resistance, and adipokine imbalance, while environmental toxins, including endocrine disruptors, pesticides, and ionizing radiation, amplify genomic instability. Conclusions: Understanding differential risks and mechanisms underscores the need for stratified surveillance, biomarker-driven interventions, and lifestyle modifications to mitigate progression. Future research should prioritize molecular profiling for personalized prevention in high-risk hyperplasia. Full article
(This article belongs to the Special Issue Advanced Research in Breast Diseases and Histopathology)
23 pages, 3441 KB  
Article
Integrating Large Language Models with Deep Learning for Breast Cancer Treatment Decision Support
by Heeseung Park, Serin Ok, Taewoo Kang and Meeyoung Park
Diagnostics 2026, 16(3), 394; https://doi.org/10.3390/diagnostics16030394 - 26 Jan 2026
Viewed by 289
Abstract
Background/Objectives: Breast cancer is one of the most common malignancies, but its heterogeneous molecular subtypes make treatment decision-making complex and patient-specific. Both the pathology reports and the electronic medical record (EMR) play a critical role for an appropriate treatment decision. This study [...] Read more.
Background/Objectives: Breast cancer is one of the most common malignancies, but its heterogeneous molecular subtypes make treatment decision-making complex and patient-specific. Both the pathology reports and the electronic medical record (EMR) play a critical role for an appropriate treatment decision. This study aimed to develop an integrated clinical decision support system (CDSS) that combines a large language model (LLM)-based pathology analysis with deep learning-based treatment prediction to support standardized and reliable decision-making. Methods: Real-world data (RWD) obtained from a cohort of 5015 patients diagnosed with breast cancer were analyzed. Meta-Llama-3-8B-Instruct automatically extracted the TNM stage and tumor size from the pathology reports, which were then integrated with EMR variables. A multi-label classification of 16 treatment combinations was performed using six models, including Decision Tree, Random Forest, GBM, XGBoost, DNN, and Transformer. Performance was evaluated using accuracy, macro/micro-averaged precision, recall, F1 score, and AUC. Results: Using combined LLM-extracted pathology and EMR features, GBM and XGBoost achieved the highest and most stable predictive performance across all feature subset configurations (macro-F1 ≈ 0.88–0.89; AUC = 0.867–0.868). Both models demonstrated strong discrimination ability and consistent recall and precision, highlighting their robustness for multi-label classification in real-world settings. Decision Tree and Random Forest showed moderate but reliable performance (macro-F1 = 0.84–0.86; AUC = 0.849–0.821), indicating their applicability despite lower predictive capability. By contrast, the DNN and Transformer models produced comparatively lower scores (macro-F1 = 0.74–0.82; AUC = 0.780–0.757), especially when using the full feature set, suggesting limited suitability for structured clinical data without strong contextual dependencies. These findings indicate that gradient-boosting ensemble approaches are better optimized for tabular medical data and generate more clinically reliable treatment recommendations. Conclusions: The proposed artificial intelligence-based CDSS improves accuracy and consistency in breast cancer treatment decision support by integrating automated pathology interpretation with deep learning, demonstrating its potential utility in real-world cancer care. Full article
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)
Show Figures

Figure 1

27 pages, 1579 KB  
Article
Quadra Sense: A Fusion of Deep Learning Classifiers for Mitosis Detection in Breast Cancer Histopathology
by Afnan M. Alhassan and Nouf I. Altmami
Diagnostics 2026, 16(3), 393; https://doi.org/10.3390/diagnostics16030393 - 26 Jan 2026
Viewed by 238
Abstract
Background/Objectives: The difficulties caused by breast cancer have been addressed in a number of ways. Since it is said to be the second most common cause of death from cancer among women, early intervention is crucial. Early detection is difficult because of [...] Read more.
Background/Objectives: The difficulties caused by breast cancer have been addressed in a number of ways. Since it is said to be the second most common cause of death from cancer among women, early intervention is crucial. Early detection is difficult because of the existing detection tools’ shortcomings in objectivity and accuracy. Quadra Sense, a fusion of deep learning (DL) classifiers for mitosis detection in breast cancer histopathology, is proposed to address the shortcomings of current approaches. It demonstrates a greater capacity to produce more accurate results. Methods: Initially, the raw dataset is preprocessed by using a normalization by means of color channel normalization (zero-mean normalization) and stain normalization (Macenko Stain Normalization), and the artifact can be removed via median filtering and contrast enhancement using histogram equalization; ROI identification is performed using modified Fully Convolutional Networks (FCNs) followed by the feature extraction (FE) with Modified InceptionV4 (M-IV4), by which the deep features are retrieved and the feature are selected by means of a Self-Improved Seagull Optimization Algorithm (SA-SOA), and finally, classification is performed using Mito-Quartet. Results: Ultimately, using a performance evaluation, the suggested approach achieved a higher accuracy of 99.2% in comparison with the current methods. Conclusions: From the outcomes, the recommended technique performs well. Full article
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)
Show Figures

Figure 1

23 pages, 2542 KB  
Article
Class-Balanced Convolutional Neural Networks for Digital Mammography Image Classification in Breast Cancer Diagnosis
by Evangelos Mavropoulos, Paraskevi Zacharia, Nikolaos Laskaris and Evangelos Pallis
Electronics 2026, 15(2), 486; https://doi.org/10.3390/electronics15020486 - 22 Jan 2026
Viewed by 110
Abstract
This study introduces a class-balanced Convolutional Neural Network (CNN) framework specifically designed for the binary classification of breast tumors in digital mammography. The proposed method systematically addresses the pervasive issue of class imbalance in medical imaging datasets by implementing advanced dataset balancing strategies, [...] Read more.
This study introduces a class-balanced Convolutional Neural Network (CNN) framework specifically designed for the binary classification of breast tumors in digital mammography. The proposed method systematically addresses the pervasive issue of class imbalance in medical imaging datasets by implementing advanced dataset balancing strategies, which resulted in a significant reduction in false negatives that is critical in early breast cancer detection. The proposed architecture is designed for high-resolution mammograms and employs regularization techniques, such as dropout and L2 weight decay, which are intended to enhance generalization and reduce the risk of overfitting. Comprehensive data augmentation and normalization further enhance the model’s robustness and adaptability to real-world clinical variability. Evaluated on the MIAS dataset, our balanced CNN achieved an accuracy of 98.84%, exhibiting both sensitivity and overall reliability. This work demonstrates that a class-balanced CNN can deliver both high diagnostic accuracy and computational efficiency, indicating potential for future use in clinical screening workflows. The system’s ability to minimize diagnostic errors and support radiologists with reliable, data-driven predictions represents an exploratory step toward improving automated breast cancer detection. Full article
Show Figures

Figure 1

31 pages, 1485 KB  
Article
Explainable Multi-Modal Medical Image Analysis Through Dual-Stream Multi-Feature Fusion and Class-Specific Selection
by Naeem Ullah, Ivanoe De Falco and Giovanna Sannino
AI 2026, 7(1), 30; https://doi.org/10.3390/ai7010030 - 16 Jan 2026
Viewed by 439
Abstract
Effective and transparent medical diagnosis relies on accurate and interpretable classification of medical images across multiple modalities. This paper introduces an explainable multi-modal image analysis framework based on a dual-stream architecture that fuses handcrafted descriptors with deep features extracted from a custom MobileNet. [...] Read more.
Effective and transparent medical diagnosis relies on accurate and interpretable classification of medical images across multiple modalities. This paper introduces an explainable multi-modal image analysis framework based on a dual-stream architecture that fuses handcrafted descriptors with deep features extracted from a custom MobileNet. Handcrafted descriptors include frequency-domain and texture features, while deep features are summarized using 26 statistical metrics to enhance interpretability. In the fusion stage, complementary features are combined at both the feature and decision levels. Decision-level integration combines calibrated soft voting, weighted voting, and stacking ensembles with optimized classifiers, including decision trees, random forests, gradient boosting, and logistic regression. To further refine performance, a hybrid class-specific feature selection strategy is proposed, combining mutual information, recursive elimination, and random forest importance to select the most discriminative features for each class. This hybrid selection approach eliminates redundancy, improves computational efficiency, and ensures robust classification. Explainability is provided through Local Interpretable Model-Agnostic Explanations, which offer transparent details about the ensemble model’s predictions and link influential handcrafted features to clinically meaningful image characteristics. The framework is validated on three benchmark datasets, i.e., BTTypes (brain MRI), Ultrasound Breast Images, and ACRIMA Retinal Fundus Images, demonstrating generalizability across modalities (MRI, ultrasound, retinal fundus) and disease categories (brain tumor, breast cancer, glaucoma). Full article
(This article belongs to the Special Issue Digital Health: AI-Driven Personalized Healthcare and Applications)
Show Figures

Figure 1

23 pages, 5052 KB  
Article
Exploratory Study on Hybrid Systems Performance: A First Approach to Hybrid ML Models in Breast Cancer Classification
by Francisco J. Rojas-Pérez, José R. Conde-Sánchez, Alejandra Morlett-Paredes, Fernando Moreno-Barbosa, Julio C. Ramos-Fernández, José Luna-Muñoz, Genaro Vargas-Hernández, Blanca E. Jaramillo-Loranca, Juan M. Xicotencatl-Pérez and Eucario G. Pérez-Pérez
AI 2026, 7(1), 29; https://doi.org/10.3390/ai7010029 - 15 Jan 2026
Viewed by 310
Abstract
The classification of breast cancer using machine learning techniques has become a critical tool in modern medical diagnostics. This study analyzes the performance of hybrid models that combine traditional machine learning algorithms (TMLAs) with a convolutional neural network (CNN)-based VGG16 model for feature [...] Read more.
The classification of breast cancer using machine learning techniques has become a critical tool in modern medical diagnostics. This study analyzes the performance of hybrid models that combine traditional machine learning algorithms (TMLAs) with a convolutional neural network (CNN)-based VGG16 model for feature extraction to improve accuracy for classifying eight breast cancer subtypes (BCS). The methodology consists of three steps. First, image preprocessing is performed on the BreakHis dataset at 400× magnification, which contains 1820 histopathological images classified into eight BCS. Second, the CNN VGG16 is modified to function as a feature extractor that converts images into representative vectors. These vectors constitute the training set for TMLAs, such as Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Naive Bayes (NB), leveraging VGG16’s ability to capture relevant features. Third, k-fold cross-validation is applied to evaluate the model’s performance by averaging the metrics obtained across all folds. The results reveal that hybrid models leveraging a CNN-based VGG16 model for feature extraction, followed by TMLAs, achieve accuracy outstanding experimental accuracy. The KNN-based hybrid model stood out with a precision of 0.97, accuracy of 0.96, sensitivity of 0.96, specificity of 0.99, F1-score of 0.96, and ROC-AUC of 0.97. These findings suggest that, with an appropriate methodology, hybrid models based on TMLA have strong potential in classification tasks, offering a balance between performance and predictive capability. Full article
Show Figures

Figure 1

21 pages, 830 KB  
Article
Predicting Breast Cancer Mortality Using SEER Data: A Comparative Analysis of L1-Logistic Regression and Neural Networks
by Mayra Cruz-Fernandez, Francisco Antonio Castillo-Velásquez, Carlos Fuentes-Silva, Omar Rodríguez-Abreo, Rafael Rojas-Galván, Marcos Avilés and Juvenal Rodríguez-Reséndiz
Technologies 2026, 14(1), 66; https://doi.org/10.3390/technologies14010066 - 15 Jan 2026
Viewed by 283
Abstract
Breast cancer remains a leading cause of mortality among women worldwide, motivating the development of transparent and reproducible risk models for clinical decision making. Using the open-access SEER Breast Cancer dataset (November 2017 release), we analyzed 4005 women diagnosed between 2006 and 2010 [...] Read more.
Breast cancer remains a leading cause of mortality among women worldwide, motivating the development of transparent and reproducible risk models for clinical decision making. Using the open-access SEER Breast Cancer dataset (November 2017 release), we analyzed 4005 women diagnosed between 2006 and 2010 with infiltrating duct and lobular carcinoma (ICD-O-3 8522/3). Thirty-one clinical and demographic variables were preprocessed with one-hot encoding and z-score standardization, and the lymph node ratio was derived to characterize metastatic burden. Two supervised models, L1-regularized logistic regression and a feedforward artificial neural network, were compared under identical preprocessing, fixed 60/20/20 data splits, and stratified five-fold cross-validation. To define clinically meaningful endpoints and handle censoring, we reformulated mortality prediction as fixed-horizon classification at 3 and 5 years, and evaluated discrimination, calibration, and operating thresholds. Logistic regression demonstrated consistently strong performance, achieving test ROC-AUC values of 0.78 at 3 years and 0.75 at 5 years, with substantially superior calibration (Brier score less than or equal to 0.12, ECE less than or equal to 0.03). A structured hyperparameter search with repeated-seed evaluation identified optimal neural network architectures for each horizon, yielding test ROC-AUC values of 0.74 at 3 years and 0.73 at 5 years, but with markedly poorer calibration (ECE 0.19 to 0.23). Bootstrap analysis showed no significant AUC difference between models at 3 years, but logistic regression exhibited greater stability across folds and lower sensitivity to feature pruning. Overall, L1-regularized logistic regression provides competitive discrimination (ROC-AUC 0.75 to 0.78), markedly superior probability calibration (ECE below 0.03 versus 0.19 to 0.23 for the neural network), and approximately 40% lower cross-validation variance, supporting its use for scalable screening, risk stratification, and triage workflows on structured registry data. Full article
(This article belongs to the Section Assistive Technologies)
Show Figures

Figure 1

22 pages, 3134 KB  
Article
Experimental Mis-Splicing Assessment and ACMG/AMP-Guided Classification of 47 ATM Splice-Site Variants
by Inés Llinares-Burguet, Lara Sanoguera-Miralles, Elena Bueno-Martínez, Ada Esteban-Sanchez, Daniel Romano-Medina, Lobna Ramadane-Morchadi, Alicia García-Álvarez, Pedro Pérez-Segura, Doug F. Easton, Peter Devilee, Maaike P. G. Vreeswijk, Miguel de la Hoya and Eladio A. Velasco-Sampedro
Int. J. Mol. Sci. 2026, 27(2), 765; https://doi.org/10.3390/ijms27020765 - 12 Jan 2026
Viewed by 377
Abstract
Pathogenic germline variants in the ATM gene are associated with a 20–30% lifetime risk of breast cancer. Crucially, a relevant fraction of loss-of-function variants in breast cancer susceptibility genes disrupts pre-mRNA splicing. We aimed to perform splicing analysis of ATM splice-site variants identified [...] Read more.
Pathogenic germline variants in the ATM gene are associated with a 20–30% lifetime risk of breast cancer. Crucially, a relevant fraction of loss-of-function variants in breast cancer susceptibility genes disrupts pre-mRNA splicing. We aimed to perform splicing analysis of ATM splice-site variants identified in the large-scale sequencing project BRIDGES (Breast Cancer After Diagnostic Gene Sequencing). To this end, we bioinformatically selected 47 splice-site variants across 17 exons that were genetically engineered into three minigenes and assayed in MCF-7 cells. Aberrant splicing was observed in 38 variants. Of these, 30 variants, including 7 missense, yielded no or negligible expression of the minigene full-length (mgFL) transcript. A total of 69 different transcripts were characterized, 48 of which harboured a premature termination codon. Some variants, such as c.2922-1G>A, generated complex patterns with up to 10 different transcripts. Alternative 3′ or 5′ splice-site usage was the predominant event. Integration of ATM minigene read-outs into the ACMG/AMP (American College of Medical Genetics and Genomics/Association for Molecular Pathology)-based specifications for the ATM gene enabled the classification of 30 ATM variants as pathogenic or likely pathogenic and 9 as likely benign. Overall, splicing assays provide key information for variant interpretation and the clinical management of patients. Full article
Show Figures

Figure 1

15 pages, 2695 KB  
Article
Opportunistic Osteoporosis Screening in Breast Cancer Using AI-Derived Vertebral BMD from Routine CT: Validation Against QCT and Multivariable Diagnostic Modeling
by Jiayi Pu, Wenqin Zhou, Miao Wei, Wen Li, Yan Xiao, Jia Xie and Fajin Lv
J. Clin. Med. 2026, 15(2), 512; https://doi.org/10.3390/jcm15020512 - 8 Jan 2026
Viewed by 288
Abstract
Background/Objectives: Breast cancer survivors face elevated risk of treatment-related bone loss, yet routine bone health assessment remains underutilized. Opportunistic bone density extraction from routine CT may address this gap. This study validated AI-derived vertebral bone mineral density (AI-vBMD) from non-contrast thoracoabdominal CT [...] Read more.
Background/Objectives: Breast cancer survivors face elevated risk of treatment-related bone loss, yet routine bone health assessment remains underutilized. Opportunistic bone density extraction from routine CT may address this gap. This study validated AI-derived vertebral bone mineral density (AI-vBMD) from non-contrast thoracoabdominal CT for osteoporosis screening and assessed its diagnostic value beyond clinical variables. Methods: This retrospective study included 332 breast cancer patients; AI-vBMD was successfully extracted in 325 (98%). Quantitative CT (QCT) served as reference standard. Agreement between AI-vBMD and QCT-vBMD was assessed using Pearson correlation, Bland–Altman analysis, and weighted kappa for QCT-defined osteoporosis (<80 mg/cm3). Nested logistic regression models compared a clinical model with and without AI-vBMD. Discrimination [area under the curve (AUC)], calibration, and clinical utility [decision-curve analysis (DCA)] were evaluated. Results: AI-vBMD showed strong correlation with QCT-vBMD (r = 0.98, p < 0.001), minimal bias (mean difference +1.82 mg/cm3), and excellent agreement for osteoporosis classification (weighted κ = 0.90). AI-vBMD alone achieved excellent discrimination for osteoporosis (AUC = 0.986). Integrating AI-vBMD into the clinical model yielded significantly higher diagnostic performance (AUC 0.988 vs. 0.879; p < 0.001) and demonstrated superior net benefit across relevant decision thresholds. Conclusions: AI-derived vertebral BMD from routine CT serves as a reliable QCT-aligned imaging biomarker for opportunistic osteoporosis assessment in breast cancer patients and adds significant incremental diagnostic value beyond clinical information alone. Full article
Show Figures

Figure 1

29 pages, 17558 KB  
Article
Subtype-Specific m6A circRNA Methylation Patterns Identify Epigenetic Biomarker Candidates of Potential Diagnostic and Prognostic Significance in Breast Cancer
by Amal Qattan, Wafa Alkhayal, Kausar Suleman, Taher Al-Tweigeri and Asma Tulbah
Int. J. Mol. Sci. 2026, 27(1), 529; https://doi.org/10.3390/ijms27010529 - 4 Jan 2026
Viewed by 678
Abstract
Breast cancer subtypes are known to have important pathobiological and clinical features. For example, triple-negative breast cancer (TNBC) remains one of the most aggressive and treatment-resistant breast cancer subtypes, lacking hormone and HER2 targets. Increasing evidence suggests that circular RNAs (circRNAs) and their [...] Read more.
Breast cancer subtypes are known to have important pathobiological and clinical features. For example, triple-negative breast cancer (TNBC) remains one of the most aggressive and treatment-resistant breast cancer subtypes, lacking hormone and HER2 targets. Increasing evidence suggests that circular RNAs (circRNAs) and their N6-methyladenosine (m6A) modifications play critical roles in cancer biology through the regulation of gene expression, stability, and signaling networks. This study aimed to identify m6A methylation patterns in circRNAs among breast cancer subtypes, explore their potential biological functions, and assess their diagnostic and prognostic relevance compared with luminal breast cancer subtypes. Genome-wide profiling of m6A-modified circRNAs was conducted in TNBC and luminal breast tumor samples using methylated RNA immunoprecipitation followed by microarray analysis. Differential methylation and expression analyses were integrated with pathway enrichment, survival correlation, and receiver operating characteristic (ROC) curve assessments to identify subtype-specific and clinically relevant circRNA candidates. Distinct m6A circRNA methylation signatures were identified across breast cancer subtypes, with TNBC showing enrichment in pathways related to Wnt/β-catenin, CDC42 GTPase signaling, and cytoskeletal remodeling. Several circRNAs, including those derived from ZBTB16, DOCK1, METTL8, and VAV3, exhibited significant hypermethylation and high diagnostic accuracy (AUC > 0.80). Survival analyses revealed associations between circRNAs from key host genes and overall or relapse-free survival, suggesting prognostic potential. These findings uncover subtype-specific m6A circRNA methylation landscapes that may contribute to tumor aggressiveness and heterogeneity. Identified circRNAs represent candidates for investigation as biomarkers for subtype classification and prognosis and may inform future research into epigenetic and post-transcriptional therapeutic targets in breast cancer. Full article
(This article belongs to the Special Issue The Role of RNAs in Cancers: Recent Advances)
Show Figures

Figure 1

22 pages, 1277 KB  
Article
Clinically Aware Learning: Ordinal Loss Improves Medical Image Classifiers
by Arsenii Litvinov, Egor Ushakov, Sofia Senotrusova, Kirill Lukianov, Yury Markin, Liudmila Mikhailova and Evgeny Karpulevich
J. Clin. Med. 2026, 15(1), 365; https://doi.org/10.3390/jcm15010365 - 3 Jan 2026
Viewed by 491
Abstract
Background: BI-RADS (Breast Imaging Reporting and Data System) mammogram classification is central to early breast cancer detection. Despite being an ordinal scale that reflects increasing levels of malignancy suspicion, most models treat BI-RADS as a nominal task using cross-entropy loss, thereby disregarding the [...] Read more.
Background: BI-RADS (Breast Imaging Reporting and Data System) mammogram classification is central to early breast cancer detection. Despite being an ordinal scale that reflects increasing levels of malignancy suspicion, most models treat BI-RADS as a nominal task using cross-entropy loss, thereby disregarding the inherent class order. This mismatch between the clinical severity of misclassification and the model’s optimization objective remains underexplored. Methods: We systematically evaluate whether incorporating ordinal-aware loss functions improves BI-RADS classification performance under controlled, architecture-fixed conditions and dataset imbalance. Using a unified training pipeline across multiple datasets, we compare ordinal losses to standard cross-entropy, analyzing the effect of dataset- and label-level balancing. Area under the receiver operating characteristic curve (AUROC) and macro-F1 scores are reported as averages over five seeds. Results: Balanced sampling across datasets during training led to statistically significant improvements. Ordinal loss functions, such as Earth Mover Distance (EMD), consistently achieved higher performance across multiple metrics compared to conventional cross-entropy approaches commonly reported in the literature. Improvements were particularly evident in reducing severe misclassifications, demonstrating that aligning the learning objective with the ordinal structure of BI-RADS enhances robustness and clinical relevance. Conclusions: Aligning the learning objective with the ordinal BI-RADS structure substantially improves classification accuracy without changing the underlying architecture. These findings emphasize the importance of loss design, regularization, and data-balancing strategies in medical AI, supporting more reliable breast cancer screening. Full article
Show Figures

Figure 1

18 pages, 1077 KB  
Article
Machine Learning Modeling of Hospital Length of Stay After Breast Cancer Surgery: Comparison of Random Forest and Linear Regression Approaches
by Iulian Slavu, Raluca Tulin, Alexandru Dogaru, Ileana Dima, Cristina Orlov Slavu, Daniela-Elena Gheoca Mutu and Adrian Tulin
Medicina 2026, 62(1), 88; https://doi.org/10.3390/medicina62010088 - 31 Dec 2025
Viewed by 396
Abstract
Background and Objectives: Hospital length of stay (LOS) after breast cancer surgery is a key indicator of postoperative recovery, healthcare quality, and hospital resource utilization. Traditional statistical approaches have identified general correlates of LOS but remain limited in predictive accuracy, particularly in [...] Read more.
Background and Objectives: Hospital length of stay (LOS) after breast cancer surgery is a key indicator of postoperative recovery, healthcare quality, and hospital resource utilization. Traditional statistical approaches have identified general correlates of LOS but remain limited in predictive accuracy, particularly in heterogeneous real-world surgical populations. Machine learning (ML) models may offer improved performance by capturing nonlinear interactions among clinical, pathological, and operative factors. This study aimed to evaluate ML algorithms for LOS prediction and to identify determinants of prolonged hospitalization in a contemporary breast cancer cohort. Materials and Methods: We conducted a retrospective cross-sectional study of 198 consecutive breast cancer patients who underwent surgery between January 2022 and December 2023 at a single tertiary care center. Clinical, pathological, and surgical data were extracted from electronic medical records. Three regression models—multiple linear regression, Random Forest, and Gradient Boosting—were trained to predict continuous LOS, and three classification models were applied to prolonged LOS (≥10 days). Model performance was assessed using mean absolute error (MAE), root mean square error (RMSE), coefficient of determination (R2), and area under the curve (AUC). Feature importance was analyzed for the best-performing model. Results: The median LOS was 7 days (IQR 5–10), ranging from 1 to 26 days. Breast-conserving surgery showed the shortest LOS (median 3 days), while mastectomy with immediate reconstruction resulted in the longest stays (median 8 days). Random Forest regression achieved the lowest prediction error (MAE 2.31 days; RMSE 2.82; R2 = 0.37), outperforming Gradient Boosting and substantially surpassing linear regression (MAE 8.63 days; R2 = –8.17). Key predictors included age, surgical complexity, reconstruction modality, BMI, implant capacity, and tumor burden. Classification models yielded modest AUCs (0.545–0.589) with low sensitivity, indicating limited discriminative performance for dichotomized LOS outcomes. Conclusions: Machine-learning models, particularly Random Forest, substantially improve LOS prediction compared with classical regression and provide clinically meaningful insights into the drivers of hospitalization after breast cancer surgery. Continuous LOS modeling is more informative than binary thresholds. These findings support integrating ML-based tools into perioperative planning, resource allocation, and patient counseling in breast surgical care. Full article
Show Figures

Figure 1

Back to TopTop