AI-Based Characterization of Breast Cancer in Mammography and Tomosynthesis: A Review of Radiomics and Deep Learning for Subtyping, Staging, and Prognosis

Mota, Ana M.

doi:10.3390/cancers17203387

Open AccessReview

AI-Based Characterization of Breast Cancer in Mammography and Tomosynthesis: A Review of Radiomics and Deep Learning for Subtyping, Staging, and Prognosis

by

Ana M. Mota

Instituto de Biofísica e Engenharia Biomédica, Faculdade de Ciências, Universidade de Lisboa, 1749-016 Lisboa, Portugal

Cancers 2025, 17(20), 3387; https://doi.org/10.3390/cancers17203387

Submission received: 14 September 2025 / Revised: 11 October 2025 / Accepted: 19 October 2025 / Published: 21 October 2025

(This article belongs to the Special Issue Advances in Breast Cancer Imaging: From Detection to Personalized Diagnosis)

Download

Browse Figures

Versions Notes

Simple Summary

Breast cancer is the most common cancer in women. Today, treatment decisions still rely on biopsy, an invasive procedure that samples only a small part of the tumor and may not fully reflect its complexity. In recent years, artificial intelligence (AI) has shown promise in extracting biological and clinical information directly from standard breast images such as mammography and tomosynthesis. This review summarizes the latest research on how AI models based on radiomics and deep learning can predict tumor subtype, stage, and likely outcomes, without the need for tissue sampling. These tools could one day help personalize care, reduce invasive procedures, and make advanced diagnostics more accessible.

Abstract

Background: Biopsy remains the gold standard for characterizing breast cancer, but it is invasive, costly, and may not fully capture tumor heterogeneity. Advances in artificial intelligence (AI) now allow for the extraction of biological and clinical information from medical images, raising the possibility of using imaging as a non-invasive alternative. Methods: A semi-systematic review was conducted to identify AI-based approaches applied to mammography (MM) and breast tomosynthesis (BT) for tumor subtyping, staging, and prognosis. A PubMed search retrieved 1091 articles, of which 81 studies met inclusion criteria (63 MM, 18 BT). Studies were analyzed by clinical target, modality, AI pipeline, number of cases, dataset type, and performance metrics (AUC, accuracy, or C-index). Results: Most studies focused on tumor subtyping, particularly receptor status and molecular classification. Contrast-enhanced spectral mammography (CESM) was frequently used in radiomics pipelines, while end-to-end deep learning (DL) approaches were increasingly applied to MM. Deep models achieved strong performance for ER/PR and HER2 status prediction, especially in large datasets. Fewer studies addressed staging or prognosis, but promising results were obtained for axillary lymph node (ALN) metastasis and pathological complete response (pCR). Multimodal and longitudinal approaches—especially those combining MM or BT with MRI or ultrasound—show improved accuracy but remain rare. Public datasets were used in only a minority of studies, limiting reproducibility. Conclusions: AI models can predict key tumor characteristics directly from MM and BT, showing promise as non-invasive tools to complement or even replace biopsy. However, challenges remain in terms of generalizability, external validation, and clinical integration. Future work should prioritize standardized annotations, larger multicentric datasets, and integration of histological or transcriptomic validation to ensure robustness and real-world applicability.

Keywords:

breast cancer; artificial intelligence; mammography; breast tomosynthesis; radiomics; deep learning; tumor subtyping; tumor staging; prognosis

1. Introduction

Breast cancer remains the most commonly diagnosed cancer among women worldwide and is the leading cause of cancer-related mortality in this population [1]. The importance of early detection is well established, with multiple studies showing that cancers diagnosed at earlier stages are associated with significantly better outcomes [2].

When a suspicious lesion is detected on an imaging exam, typically mammography (MM), the patient is referred for additional tests, including further imaging and, imperatively, a biopsy to characterize the lesion at the tissue level. Core needle biopsy remains the gold standard for confirming malignancy, as it enables the extraction and histopathological analysis of a tissue sample from the lesion, often under image guidance.

While biopsy provides highly reliable diagnostic results, it presents several limitations. First, biopsies may provide an incomplete representation of the lesion, as the sampled tissue might not reflect the full heterogeneity of the tumor. Breast tumors often consist of multiple subclones and diverse cell populations, each potentially exhibiting different molecular or phenotypic profiles. Since the biopsy typically samples only a very small fraction of the lesion (sometimes as little as 0.0005% of the total tumor volume [3]), more aggressive or clinically relevant regions may be missed [4]. Second, biopsies are inherently invasive and frequently painful, often resulting in local trauma. Although these effects are typically temporary, some patients may experience longer-term complications or residual discomfort. Moreover, initial biopsy samples may produce inconclusive results, requiring additional procedures to ensure accurate pathological assessment. However, acquiring additional tissue may be challenging due to biopsy-induced alterations at the original site, potentially compromising subsequent histopathological evaluation [5]. Third, biopsies do not capture the surrounding tumor microenvironment, which plays a critical role in cancer progression and therapeutic response. This environment includes hypoxic regions, infiltrating immune cells and dynamic stromal interactions that may influence tumor aggressiveness, likelihood of invasion and resistance to treatment [6,7]. Lastly, biopsies are resource-intensive procedures as they require trained radiologists, specialized pathologists and advanced laboratory infrastructure. This makes them expensive and potentially less accessible in low-resource settings, where a higher threshold for biopsy referral may lead to delayed diagnosis and poorer clinical outcomes [8].

Medical imaging is the foundation of initial breast cancer detection. When considering breast imaging, there is a well-established set of modalities used in clinical practice: mammography (MM), breast tomosynthesis (BT), breast ultrasound (US) and magnetic resonance imaging (MRI). Among these, MM plays the central role, as it is the primary modality used in population-based screening programs. BT is increasingly used, particularly for women with dense breast tissue or for those at higher risk, and is progressively replacing MM in several screening settings [9,10]. In standard diagnostic workflows outside of organized screening, MM or BT is always the first imaging step, usually combined with US. MRI is generally used as a second-line imaging modality, depending on clinical findings, particularly in complex cases, for high-risk patients, or for pre-surgical planning. Therefore, imaging data is routinely available at the time of diagnosis.

In recent years, increasing attention has been directed toward utilizing these images not only for visual interpretation or basic quantitative assessment, but also as a rich source of complementary information about the lesion. This perspective has led to the emergence of advanced image analysis techniques, aiming to extract features that go beyond what is perceptible to the human eye. Among the most promising ideas is the hypothesis that certain genomic and proteomic patterns may be reflected in macroscopic imaging features, raising the possibility that imaging could serve as a non-invasive alternative or complement to tissue biopsy in the near future [11,12].

With this in mind, and driven by recent advances in computational power and the growing availability of data, interest in artificial intelligence (AI)-based methods has grown over the past few years. These approaches are based on the assumption that quantitative features extracted from imaging data may correlate with the molecular phenotype or genotype of a tumor. As a result, there has been a growing body of research exploring the potential of AI models to predict key aspects of tumor biology—such as tumor subtyping, tumor staging, and treatment response and prognosis—directly from imaging, potentially bypassing the need for invasive procedures.

Tumor subtyping and staging are currently determined through histopathological analysis of biopsy samples. In contrast, prediction of treatment response and prognosis is guided by predefined clinical protocols. Treatment response is typically assessed through longitudinal imaging studies, while prognosis is estimated based on a combination of clinical factors, including tumor subtype, size and stage. The prospect of inferring these critical parameters non-invasively from routine imaging data using AI-based approaches has opened new avenues for more accessible, patient-friendly workflows and earlier and more informative decision-making in breast cancer care.

Several existing reviews have explored the application of AI in breast cancer imaging, primarily focusing on lesion detection or segmentation and binary classification between benign and malignant findings [13,14]. However, despite the increasing volume of research in the last five years, reviews that specifically address the use of AI for biological and clinical characterization of breast tumors remain scarce, particularly those centered on MM and BT, which are the most widely used frontline imaging modalities in breast cancer detection.

The most relevant effort in this direction is the review by Elahi and Nazari [15], which provides a valuable overview of breast cancer characterization studies using various imaging modalities; however, it does not focus specifically on MM or BT. In contrast, Hussain et al. [16] present a comprehensive review on deep learning, radiomics and radiogenomics in breast image analysis, with a specific focus on BT and MM. Still, it does not include a dedicated analysis of studies addressing tumor biology prediction tasks.

The current review aims to fill that gap by providing a comprehensive overview of recent studies that employ AI to extract biological and prognostic information directly from MM and BT imaging data. It analyzes the clinical targets addressed, the datasets used, and the AI-based pipelines adopted (ranging from traditional radiomics to deep radiomics and end-to-end deep learning approaches). It includes a methodology section (Section 2) and an overview of AI-based pipelines for breast cancer characterization (Section 3). It then focuses on three core topics aligned with the clinical characterization pathway: tumor subtyping (Section 4.1), tumor staging (Section 4.2), and treatment response and prognosis (Section 4.3), followed by conclusions. Unless otherwise specified, the studies discussed refer to invasive breast carcinoma, which represents the most common histological subtype of breast cancer.

By focusing exclusively on biological and prognostic characterization tasks derived from standard mammography and tomosynthesis—the most widely available breast imaging modalities—this review offers a focused synthesis that bridges radiomics and deep learning approaches, providing a practical framework for clinical translation.

2. Methodology

This review followed a structured, semi-systematic literature search strategy to identify relevant studies addressing the use of AI for the biological and clinical characterization of breast tumors using MM or BT. A PubMed search was conducted using two separate queries: (“mammography” AND (“radiomics” OR “deep learning”)) and (“breast tomosynthesis” AND (“radiomics” OR “deep learning”)). Articles were screened based on title and abstract, and selection was conducted independently for each modality, as illustrated in Figure 1 and Figure 2.

The initial search yielded 942 articles for MM and 149 for BT. The same inclusion and exclusion criteria were applied to both sets. In the first stage, articles were excluded if they were not in English, if they were reviews, editorials, incomplete studies, or abstracts only. Studies that were not AI-based, not focused on MM or BT, or addressed detection or segmentation tasks rather than classification were also discarded. In the second stage, studies limited to binary classification tasks not directly related to the scope of this review (such as benign versus malignant lesion classification, presence or absence of lesions, breast density assessment, or general risk prediction) were excluded. Additionally, papers were excluded when their primary aim, despite involving classification, was the evaluation of other aspects such as radiomic stability, image quality, methodological validation, simulations, report generation, clinical performance assessment, or image reconstruction (in BT).

Following this process, 63 studies were retained for MM (30 addressing tumor subtyping, 23 staging, and 10 prognosis) and 18 studies for BT (9 addressing subtyping, 7 staging, and 2 prognosis). The MM group includes conventional mammography (MM), and contrast-enhanced spectral mammography (CESM).

A full list of included studies is presented in the following sections. They were systematically grouped into tables according to the following descriptors: clinical target, modality, AI approach, number of cases, type of dataset (public or private), and performance metric. Performance metrics considered were the area under the receiver operating characteristic curve (AUC, with 95% confidence intervals whenever available), accuracy, and in one case where both were absent, the concordance index (C-index). Several studies address more than one clinical target simultaneously.

3. AI-Based Pipelines Used in Breast Cancer Characterization

As previously mentioned, this review focuses on studies that applied AI methods to extract biological characteristics of breast tumors from imaging data. These AI-based approaches can be broadly categorized into three groups: traditional radiomics, deep radiomics and end-to-end deep learning (DL), as illustrated in Figure 3.

All three approaches begin with image acquisition, an evident prerequisite for any image-based AI workflow. However, the subsequent steps differ substantially.

Traditional radiomics is based on the concept introduced in 2012 by Lambin et al. [11], in which a set of quantitative imaging features is extracted from medical images and analyzed to predict clinical targets. The key initial step is the precise segmentation of a region of interest (ROI), usually performed manually by radiologists, although semi-automatic and AI-assisted methods are increasingly available. This step is critical, as all subsequent analysis depends on the accuracy of the segmented area. Once the ROI is defined (typically as a binary mask), a large number of handcrafted features (e.g., intensity, texture, shape) are extracted. These are then passed through a feature selection step, which aims to retain only the most relevant and non-redundant features. Finally, a machine learning classifier is trained on the selected features to predict the target of interest (e.g., molecular subtype, stage, or prognosis).

Deep radiomics reduces some of the constraints of traditional radiomics, particularly regarding the segmentation step and the selection of features. Instead of a precise pixel-level mask, the information needed may be obtained through a broader region such as a bounding box surrounding the lesion. This region (either a precise mask or bounding box) is used as input to a convolutional neural network (CNN) to automatically extract deep features from specific intermediate layers of the network. These features are not predefined but rather learned directly from data. Because CNNs inherently learn hierarchical representations, explicit feature selection is often not necessary. These deep features are then passed to traditional machine learning classifier algorithms for prediction, similar to the traditional radiomics pipeline.

The end-to-end DL approach represents the most integrated but computationally intensive strategy. It can take as input the entire image, a bounding box, or a segmentation mask, thus offering greater flexibility and minimizing the need for precise lesion delineation. It uses CNNs which are trained to directly map input data to the desired prediction, learning both feature representations and classification boundaries simultaneously. Unlike deep radiomics, where features are extracted from intermediate layers, the end-to-end pipeline outputs the final classification or prediction directly, hence the term “end-to-end”. While powerful, this approach requires large, well-annotated datasets to avoid overfitting and achieve robust performance, a requirement that has only recently become more feasible with increased data availability and computational resources.

4. AI-Based Characterization of Breast Cancer from Mammography and Tomosynthesis

Following the description of the main AI-based pipelines, this section explores their application to clinically meaningful prediction tasks, grouped into three main categories: tumor subtyping, tumor staging and treatment response and prognosis.

Figure 4 presents the distribution of studies according to the specific clinical targets addressed and imaging modalities (MM and BT). The majority of works are focused on tumor subtyping, particularly the prediction of molecular subtypes and receptor status, followed by ki-67expression and tumor grade. Staging-related tasks were also frequently reported, with emphasis on axillary lymph node (ALN) metastasis, followed by the distinction between invasive and in situ cancers and the identification of lymphovascular invasion. In contrast, fewer studies have addressed treatment response and prognosis, including the prediction of pathological complete response (pCR) to neoadjuvant chemotherapy and the estimation of recurrence risk. Only one study focused on the assessment of tumor-infiltrating lymphocytes.

In the following subsections, the studies are reviewed within three main categories—tumor subtyping, tumor staging, and treatment response and prognosis—reflecting key steps in clinical decision-making. Each category is accompanied by summary tables and a narrative discussion of methodological approaches, performance and clinical relevance.

4.1. Tumor Subtyping

Tumor subtyping refers to the stratification of breast cancer based on key biological markers that define the tumor’s molecular identity and clinical behavior. This includes the assessment of hormone receptor status (estrogen receptor (ER) and progesterone receptor (PR)), human epidermal growth factor receptor 2 (HER2) expression, ki-67proliferation index and tumor grade. With the exception of tumor grade, all other parameters determine the classification of tumors into molecular subtypes such as Luminal A (ER+, PR+, ki-67low, HER2−), Luminal B1 (ER+, PR− or ki-67high, HER2−), Luminal B2 (ER+, PR any, ki-67any, HER2+), HER2-enriched (ER−, PR−, ki-67any, HER2+) and Triple-Negative (ER−, PR−, ki-67any, HER2−), which are central to therapeutic decision-making and prognosis.

Subtyping is currently performed through histopathological analysis of biopsy samples, primarily using immunohistochemistry (IHC). However, recent studies suggest that imaging-based AI models may offer a non-invasive alternative. This section reviews studies that explore the use of MM and BT data to predict tumor subtype-related biomarkers, highlighting advances in receptor status classification, molecular subtype prediction, proliferation index estimation and tumor grading. Table 1 summarizes studies using MM- and BT-based imaging alone, while Table 2 lists those using multimodal approaches for breast cancer subtyping.

Among the reviewed works, radiomics-based pipelines remain dominant, particularly in CESM studies. La Forgia et al. (2020) [17] proposed a radiomics-based approach using CESM to predict multiple biomarkers, including ER, PR, HER2, Ki-67, tumor grade, and triple-negative (TN) status. Despite a small dataset (n = 52), the model yielded competitive AUCs of 0.8348 for HER2, 0.8213 for PR, and 0.9089 for TN. The pipeline employed manual lesion segmentation, 14 handcrafted features, and a leave one out cross-validation scheme. Although the risk of overfitting remains considerable given the sample size, the study illustrates the feasibility of multi-biomarker prediction from CESM.

DL-based approaches have also demonstrated strong performance, particularly for receptor status. Zhang et al. (2023) [18] developed a weakly supervised DL framework (BSNet) using a large MM dataset (n = 2321), achieving an AUC of 0.921 and accuracy of 0.802 for ER/PR classification. By training directly on full multi-view mammograms without manual ROIs and avoiding pixel-level annotations, BSNet supports scalability. It also provides visual feature maps that highlight relevant image regions, supporting interpretability. Duan et al. (2024) [19], using a smaller cohort (n = 358), implemented a ResNet-based architecture with attention mechanisms, achieving comparable performance. Similarly, it also generated attention maps highlighting suspicious tumor regions associated with ER status. These studies underscore that end-to-end DL pipelines can achieve robust results even without explicit lesion annotation, provided sufficient data is available.

For HER2 status prediction, Wang et al. (2025) [20] proposed a dual-modality framework combining MM and MRI data, combining handcrafted and deep features to differentiate HER2-zero, HER2-low, and HER2-positive tumor, aligning with the growing clinical importance of accurately identifying HER2-low patients for emerging targeted therapies. Their model achieved 0.8815 accuracy for HER2-positive detection and was validated across private and public datasets, including CMMD and Duke. The authors combined manual 3D tumor segmentation on MRI with automatic ResNet-50-based feature extraction on mammograms, followed by ComBat harmonization and synthetic minority oversampling technique (SMOTE) rebalancing. Though the exclusion of functional MRI sequences and semantic features is a limitation, the framework’s multimodal architecture and rigorous validation make it a benchmark in imaging-based HER2 classification.

In terms of molecular subtypes, Zhang et al. (2021) [21] used CESM-based radiomics to classify TN breast cancer. Following a univariate analysis and LASSO for feature selection, prediction models achieved a strong performance with an AUC of 0.90. On the other hand, Zhu et al. (2023) [22] explored a binary classification strategy to distinguish HER2-enriched vs. non-HER2-enriched, Luminal vs. non-Luminal and TN vs. non-TN subtypes using CESM. After feature selection with intraclass correlation coefficient (ICC) analysis, minimum Redundancy Maximum Relevance (mRMR) algorithm, and recursive feature elimination, support vector machine classifiers were trained for each task. The model performance was robust for HER2-enriched and Luminal tumors, but weaker for TN, reflecting a common challenge in capturing the heterogeneity of TN lesions. Importantly, their model incorporated clinical variables alongside radiomic features in a combined nomogram, providing a more clinically actionable framework. While their lower performance for TN may reflect class imbalance and subtle textural signs, the binary classification framework provides a pragmatic modelling strategy that mirrors how clinical decisions are often dichotomized, though its real-world applicability will require further validation in larger, balanced cohorts.

Other studies have approached the spatial heterogeneity of tumor signatures by analyzing not only the intratumoral region but also the surrounding peritumoral tissue for molecular subtype characterization. Niu et al. (2022) [23] analyzed intra- and peritumoral regions across MM, BT, and multiparametric MRI (T2WI, DCE, DWI). Despite a moderate sample (n = 241), the multimodal and multi-regional radiomic analysis led to strong predictive performance: AUCs of 0.938 for TN (DW + DCE) and 0.877 for HER2-enriched subtypes (DW + DCE). By constructing radiomics signatures separately for intra-, peri-, and combined regions in each modality, and comparing aggregated performances of DM + DBT versus DW + DCE, the study proposed a robust yet complex framework. While the methodology highlights the relevance of tumor microenvironment characterization, its reliance on advanced MRI sequences may limit immediate clinical translation, underscoring the need for simplified and effective pipelines.

Regarding Ki-67, Oba et al. (2024) [24] used DL on BT slices and achieved an AUC of 0.883 and accuracy of 0.912. Using Xception CNN as backbone, performance varied by lesion type, with better results in masses than in focal asymmetries or calcifications. The study also provided a nuanced analysis of preprocessing limitations, particularly how downsampling might obscure calcification detail. Jiang et al. (2022) [25] took a radiomics-based approach using MM, BT and MRI and analyzing intra- and peritumoral regions. The combined intra- and peritumoral radiomics signatures achieved a predictive performance with an AUC of 0.87, similar to Oba et al. Although their pipeline did not outperform the DL model from Oba et al., both studies suggest that ki-67 prediction is feasible using BT or multimodal imaging.

Tumor grade prediction has also been addressed. Mao et al. (2021) [26] used CESM-based radiomics to distinguish low- from high-grade tumors in a single-center study (n = 205), achieving an AUC of 0.80 and accuracy of 0.85. A two-step logistic regression model combined features from both low-energy and recombined CESM images, with the latter proving the most discriminative power, highlighting the role of contrast enhancement in grading. Petrillo et al. (2024) [27] extended the evidence provided by Mao et al.’s single-center CESM radiomics study to a multicenter setting, including four institutions. In this broader context, they reported an accuracy of 0.8205 for tumor grading using a decision tree classifier, thereby enhancing the generalizability. Marino et al. (2020) [28] compared CESM and MRI-based radiomics in 48 patients and showed that both modalities could effectively predict histological grade (accuracy > 0.77). While CESM outperformed MRI for hormone receptor status classification (95.6% vs. 82.6%), their performance in grading was comparable. This cross-modality comparison suggests that CESM could offer a cost-effective alternative to MRI for tumor grading, while also highlighting the value of combining imaging biomarkers from different modalities in future research.

In summary, tumor subtyping prediction has evolved from handcrafted radiomics applied to MM images toward increasingly sophisticated DL and multimodal pipelines. While radiomics remains dominant in MM and BT applications due to its interpretability and lower data demands, DL models, particularly weakly supervised or attention-based, have begun to demonstrate competitive performance with less reliance on manual annotation. Moreover, spatial and multi-region analyses are emerging as key strategies to capture tumor heterogeneity, especially for complex targets such as TN and Ki-67.

4.2. Tumor Staging

Tumor staging in breast cancer follows the TNM classification system, which integrates tumor size and extent of local invasion (T), presence of regional lymph node metastasis (N), and evidence of distant metastasis (M). Beyond size, a key histopathological concern is whether malignant cells have breached the basement membrane, invaded lymphatic or vascular structures, or disseminated to lymph nodes—factors that significantly impact prognosis and treatment decisions. Traditionally, these parameters are assessed through surgical excision, histological grading and sentinel lymph node biopsy. However, there is increasing interest in developing non-invasive alternatives based on imaging and artificial intelligence to infer these staging features preoperatively. Table 3 and Table 4 summarize the studies using MM- and BT-based AI models for tumor staging.

Table 1. Studies using mammography-based and tomosynthesis-based imaging for breast cancer subtyping.

Study	Clinical Target	Modality	AI Approach	No. of Cases	Dataset	Performance Metric
Zhou et al., 2019 [29]	Receptor status (HER2)	MM	TradRad	306 patients	private	AUC = 0.787 (95% CI, 0.673–0.885), Acc = 0.770
Ma et al., 2019 [30]	Molecular subtypes	MM	TradRad	182 patients	private	AUC = 0.865 (Acc = 0.796) TN vs. non-TN AUC = 0.784 (Acc = 0.748) HER2-enriched vs. non-HER2-enriched AUC = 0.752 (Acc = 0.788) luminal vs. non-luminal
Tagliafico et al., 2019 [31]	ki-67	BT (2D synthetic)	TradRad	70 patients	private	AUC = 0.698
La Forgia et al., 2020 [17]	Receptor status, tumor grade and ki-67	CESM	TradRad	52 patients	private	AUC = 0.8276 ER-positive vs. ER-negative AUC = 0.8213 PR-positive vs. PR-negative AUC = 0.8348 HER2-positive vs. HER2-negative AUC = 0.7985 ki-67+ vs. ki-67- AUC = 0.7680 High-grade vs. Low-grade AUC = 0.9089 TN vs. non-TN
Wang et al., 2020 [32]	Molecular subtype (TN vs. non-TN)	MM	TradRad	112 patients	private	AUC = 0.84 (95% CI, 0.73–0.96), Acc = 0.783
Marino et al., 2020 [33]	Receptor status and tumor grade	CESM	TradRad	100 patients	private	Acc = 0.784 HR-positive vs. HR-negative Acc = 0.972 HR-positive/HER2-negative vs. HR-negative/HER2-positive Acc = 1.00 HR-positive/HER2-positive (triple positive) vs. HR-negative/HER2-negative (TN) Acc = 0.821 TN vs. HR-positive Acc = 0.900 G1 vs. G2 + G3 for invasive cancers Acc = 1.00 G1 vs. G2 + G3 for non-invasive cancers
Son et al., 2020 [34]	Molecular subtypes	BT (2D synthetic)	TradRad	221 patients	private	AUC = 0.677 (95% CI, 0.552–0.802) Luminal vs. non-luminal AUC = 0.582 (95% CI, 0.361–0.804) HER2 vs. non-HER2 AUC = 0.868 (95% CI, 0.730–1.000) TN vs. non-TN
Kanbayti et al., 2021 [35]	Receptor status (PR)	MM	TradRad	184 patients	private	AUC = 0.652 (95% CI, 0.560–0.744) PR status
Ueda et al., 2021 [36]	Receptor status	MM	End-to-end DL	1373 images	private	AUC = 0.67 (95% CI, 0.58–0.76) ER-positive vs. ER-negative AUC = 0.61 (95% CI, 0.53–0.68) PR-positive vs. PR-negative AUC = 0.75 (95% CI, 0.68–0.82) HER2-positive vs. HER2-negative
Zhang et al., 2021 [21]	Molecular subtype (TN vs. non-TN)	CESM	TradRad	367 patients	private	AUC = 0.90 (95% CI, 0.85–0.96)
Mao et al., 2021 [26]	Tumor grade	CESM	TradRad	205 patients	private	AUC = 0.80 (95%CI, 0.67–0.92), Acc = 0.85
Ge et al., 2022 [37]	Molecular subtype (TN vs. non-TN)	MM	TradRad	319 patients	private	AUC = 0.809 (95% CI, 0.711–0.907), Acc = 0.806
Petrillo et al., 2022 [38]	Receptor status and tumor grade	CESM	TradRad	182 patients	private	AUC = 0.8237, Acc = 0.9167 G1 vs. G2 + G3 AUC = 0.7081, Acc = 0.8929 HER2-positive vs. HER2-negative AUC = 0.7500, Acc = 0.8500 HR-positive vs. HR-negative
Dominique et al., 2022 [39]	Tumor grade, ki-67, receptor status	CESM	End-to-end DL	389 patients	private	AUC = 0.621, Acc = 0.6028 G1 + G2 vs. G3 AUC = 0.858, Acc = 0.8125 ER-positive vs. ER-negative AUC = 0.630, Acc = 0.5958 PR-positive vs. PR-negative AUC = 0.644, Acc = 0.5544 HER2-positive vs. HER2-negative AUC = 0.610, Acc = 0.5645 Ki-67 (high or not) AUC = 0.908, Acc = 0.8468 TN
Jiang et al., 2022 [40]	ki-67	BT (slice-based)	TradRad and DeepRad	266 patients	private	AUC = 0.792 (95% CI, 0.647–0.897), Acc = 0.652 (95% CI, 0.404–0.913)
Zhang et al., 2023 [18]	Receptor status (ER and PR)	MM	End-to-end DL	2321 patients	private	AUC = 0.921, Acc = 0.802
Zhu et al., 2023 [22]	Molecular subtypes	CESM	TradRad	356 patients	private	AUC = 0.82 (95% CI, 0.69–0.92), Acc = 0.72 Luminal vs. Non-Luminal AUC = 0.83 (95% CI, 0.70–0.94), Acc = 0.83 HER2-enriched vs. Non-HER2-enriched AUC = 0.68 (95% CI, 0.47–0.86), Acc = 0.75 TN vs. Non-TN
Rong et al., 2023 [41]	Tumor grade	MM	TradRad	534 patients	private	AUC = 0.75 (95% CI, 0.66–0.83), Acc = 0.68 G3 vs. G1 + G2
Deng et al., 2024 [42]	Receptor status (HER2)	MM	TradRad	1512 patients	private	AUC = 0.724 (95% CI, 0.637–0.811), Acc = 0.722
Duan et al., 2024 [19]	Receptor status (ER)	MM	End-to-end DL	358 patients	private	AUC = 0.886 (95% CI, 0.809–0.934), Acc = 0.831
Petrillo et al., 2024 [27]	Receptor status, molecular subtype (luminal vs. non-luminal) and tumor grade	CESM	TradRad	169 patients	private	Acc = 0.6071 HER2-positive Acc = 0.6786 HR-positive Acc = 0.8205 grade Acc = 0.9375 luminal vs. non-luminal
Mota et al., 2024 [43]	Molecular subtypes	MM	End-to-end DL	660 patients	semiPublic—OPTIMAM	AUC = 0.6599, Acc = 0.7372 Luminal A vs. non-Luminal A AUC = 0.6545, Acc = 0.7491 Luminal B1 vs. non-Luminal B1 AUC = 0.6530, Acc = 0.6284 HER2 vs. non-HER2, AUC = 0.6445, Acc = 0.7706 TN vs. non-TN Multiclassification: AUC = 0.5992 Luminal A AUC = 0.5981 Luminal B1 AUC = 0.5920 Luminal B2 AUC = 0.6440 HER2 AUC = 0.6089 TN
Bakker et al., 2024 [44]	Molecular subtypes	MM	TradRad	186 patients	semipublic-OPTIMAM	AUC = 0.855 (95% CI, 0.779–0.930), Acc = 0.815 luminal A AUC = 0.812 (95% CI, 0.736–0.889), Acc = 0.734 luminal B AUC = 0.755 (95% CI, 0.644–0.867), Acc = 0.637 HER2 AUC = 0.789 (95% CI, 0.701–0.878), Acc = 0.718 TN
Nissar et al., 2024 [45]	Molecular subtypes	MM	End-to-end DL	749 patients	public—CMMD	Subtype classification (4-class: Luminal A, Luminal B, HER2-enriched, TN): Acc = 0.90 (mass-based) Acc = 0.90 (calcification-based) Acc = 0.92 (combined mass + calcification)
Oba et al., 2024 [24]	ki-67	BT (slice-based)	End-to-end DL	126 patients	private	AUC = 0.883, Acc = 0.912 AUC = 0.890 mass AUC = 0.750 calcification AUC = 0.870 distortion AUC = 0.660 focal asymmetric density
Zeng et al., 2025 [46]	Receptor status	MM	End-to-end DL	352 patients	private	AUC = 0.708 (95% CI, 0.609–0.808), Acc = 0.651 HER2 status AUC = 0.785 (95% CI, 0.673–0.897), Acc = 0.845 ER status AUC = 0.706 (95% CI, 0.603–0.809), Acc = 0.678 PR status
Rabah et al., 2025 [47]	Molecular subtypes	MM	End-to-end DL	1775 patients (public)	public—CMMD	AUC = 0.8887, Acc = 0.6379 (multiclass classification) AUC = 0.67 Luminal A vs. Others AUC = 0.74 Luminal B vs. Others AUC = 0.78 HER2 vs. Others AUC = 0.78 TN vs. Others
Hu et al., 2025 [48]	ki-67	BT (slice-based)	TradRad	289 patients	private	AUC = 0.755 (95% CI, 0.629–0.880), Acc = 0.782
Mota et al., 2025 [49]	Molecular subtypes	BT (slice-based)	End-to-end DL	453 images	semiPublic—OPTIMAM	AUC = 0.5928 Luminal B2 vs. non-Luminal B2 AUC = 0.7317 HER2+ vs. non-HER2+ AUC = 0.6522 TN vs. non-TN

Table 2. Studies using multimodal imaging approaches (including mammography and tomosynthesis) for breast cancer subtyping.

Study	Clinical Target	Modality	AI Approach	No. of Cases	Dataset	Performance Metric
Marino et al., 2020 [28]	Receptor status and tumor grade	CESM and MRI	TradRad	48 patients	private	CESM: Acc = 0.956 HR positive vs. HR negative Acc = 0.778 G1 + G2 vs. G3 invasive cancers MRI: Acc = 0.826 HR positive vs. HR negative Acc = 0.778 G1 + G2 vs. G3 cancers invasive cancers
Jiang et al., 2022 [25]	ki-67	MM and MRI and BT (slice-based)	TradRad	209 patients	private	AUC = 0.866
Niu et al., 2022 [23]	Molecular subtypes	MM and MRI and BT (slice-based)	TradRad	241 patients	private	AUC = 0.773 (95% CI, 0.650–0.895) Luminal A (MM + BT) AUC = 0.747 (0.578–0.917) Luminal A (DW + DCE) AUC = 0.807 (0.706–0.908) Luminal B (MM + BT) AUC = 0.784 (0.675–0.894) Luminal B (DW + DCE) AUC = 0.802 (0.658–0.946) HER2 (MM + BT) AUC = 0.877 (0.795–0.959) HER2 (DW + DCE) AUC = 0.874 (0.759–0.990) TN (MM + BT) AUC = 0.938 (0.883–0.992) TN (DW + DCE)
Liu et al., 2023 [50]	Molecular subtype (Luminal A vs. non-Luminal A)	MM and MRI	End-to-end DL	158 patients	private	AUC = 0.802 (95% CI, 0.657–0.906), Acc = 0.711 (MM + MRI) AUC = 0.593 (95%CI, 0.436–0.737), Acc = 0.533 (only MM)
Zhang et al., 2023 [51]	Molecular subtypes	MM and US	End-to-end DL	4162 images	private	AUC = 0.929 (95% CI, 0.903, 0.951), Acc = 88.5 (86.0–90.9) Luminal (Luminal A and Luminal B) vs. Non-Luminal (HER2-enriched and TN)
Liu et al., 2024 [52]	ki-67	BT (slice-based) and US	TradRad	149 patients	private	AUC = 0.818 (95% CI, 0.685–0.950)
Wang et al., 2025 [20]	Receptor status (HER2)	MM and MRI	TradRad and DeepRad	550 patients (private)	private and public (4validation)—Duke, CMMD	AUC = 0.824 (95% CI, 0.749–0.884), Acc = 0.8074, HER2-Positive vs. HER2 Zero/Low AUC = 0.811 (95% CI, 0.735–0.874), Acc = 0. 7926, HER2-Zero vs. HER2-Low/Positive Acc = 0.8444 (HER2-zero) Acc = 0.8000 (HER2-low) Acc = 0.8815 (HER2-positive)
Yang et al., 2025 [53]	Molecular subtypes	MM and MRI	TradRad	243 patients	private	AUC = 0.648, Acc = 0.627 Luminal A vs. Luminal B AUC = 0.819, Acc = 0.793 luminal A vs. HER2-enriched AUC = 0.725, Acc = 0.696 luminal A vs. TN AUC = 0.644, Acc = 0.560 luminal B vs. HER2-enriched AUC = 0.625, Acc = 0.636 luminal B vs. TN AUC = 0.598, Acc = 0.500 TN vs. HER2-enriched

Ductal carcinoma in situ (DCIS) progression to invasive carcinoma has been a major focus. Alaeikhanehshir et al. (2024) [54] investigated whether DL applied to MM could distinguish low- from high-risk DCIS, aiming to guide eligibility for active surveillance. Using a fully convolutional network based on U-Net trained on mammograms from 464 patients with preoperative DCIS, the study achieved an AUC of 0.72 for distinguishing low- from high-grade DCIS, which increased to 0.76 when cases upstaged to invasive cancer were also considered high-risk. Although the performance was modest, the study addresses a clinically pressing challenge—avoiding overtreatment of indolent DCIS—and reports a high negative predictive value (0.91), supporting the model’s utility in safely identifying low-risk cases. As one of the few AI studies focused on the transition from in situ to invasive disease, it contributes meaningfully to efforts toward personalized, less invasive management of DCIS, even if further validation is needed before clinical translation.

Wu et al. (2024) [55] extended the investigation of DCIS staging using a larger cohort (n = 733) and a multimodal input strategy combining US with MM, clinical information, and core needle biopsy pathology. Their end-to-end DL framework achieved strong performance across multiple classification tasks, with AUCs ranging from 0.859 to 0.907 for the three-class grading task (low-, intermediate/high-grade, upstaged DCIS), from 0.829 to 0.861 for DCIS versus upstaged DCIS, and up to 0.987 for distinguishing low-grade from upstaged low-grade DCIS, with accuracies reaching 0.939 Notably, the model was evaluated across different nuclear grades and histological presentations, including the challenging task of distinguishing low-grade DCIS from upstaged low-grade DCIS. This stratified analysis enhances clinical relevance by addressing uncertainty in DCIS progression risk. The model’s high performance may be partly attributable to the added diagnostic value of US, which improves visualization of soft tissue. The absence of external validation and cohort-specific performance analysis, however, leaves questions about generalizability.

In a related study, Wu et al. (2025) [56] focused on the preoperative prediction of low nuclear grade DCIS using traditional radiomics in a smaller cohort (n = 237). Their ensemble model—built using Elastic Net, Generalized Linear Models with Boosting, and Ranger—achieved an AUC of 0.92 and accuracy of 0.88 in the validation set, outperforming a baseline model based solely on clinical and conventional imaging variables. Importantly, the authors applied Shapley Additive Explanations (SHAP) analysis to interpret the contribution of individual radiomic features, emphasizing that transparency and interpretability can be preserved while maintaining high performance. Although limited by its modest sample size and lack of external validation, the integration of radiomic features with clinical information supports the role of interpretable radiomics in preoperative decision-making for DCIS.

Invasiveness assessment has also been addressed using CESM and MRI. Marino et al. (2020) [28] evaluated the ability of radiomics applied to both CESM and MRI to differentiate invasive from non-invasive breast cancers in a small cohort (n = 48), reporting an accuracy of 0.92 for CESM and 0.90 for MRI. In an earlier study with 100 patients [33], the same group had already demonstrated solid performance using CESM alone (accuracy = 0.874). The consistency across studies supports the hypothesis that vascular enhancement patterns captured by CESM are highly informative for assessing invasive potential within preoperative DCIS cases. These findings reinforce the role of CESM not only in lesion detection but also as a functional imaging tool for early biological characterization.

Axillary lymph node (ALN) metastasis has emerged as the most studied clinical target in recent AI-based breast cancer staging research, reflecting its central role in treatment decisions such as axillary dissection or systemic therapy. Several studies have explored this using both DL and traditional radiomics. Abel et al. (2022) [57] developed an end-to-end DL model based on MM alone and achieved a remarkable accuracy of 0.9596 for suspicious lymph nodes in a small sample size (n = 74). This work stands out by showing that even conventional MM alone can carry predictive signals for ALN status. However, the absence of external validation and small, homogeneous cohorts limit clinical applicability. Xu et al. (2023) [58] adopted a slice-based radiomics strategy on BT and achieved an AUC of 0.92 and accuracy of 0.833 in 120 patients. While radiomics may offer greater interpretability and model transparency, its sensitivity to segmentation protocols and image preprocessing requires caution when comparing across datasets.

Larger studies by Wang et al. (2024) [59] and Hua et al. (2024) [60] combined MM and MRI for ALN prediction, achieving consistent results (AUC ~0.90, accuracy ~0.82–0.85) in cohorts of nearly 500 patients. These multimodal pipelines emphasize the value of integrating structural and functional data when targeting complex endpoints such as lymphatic spread. However, both studies lack stratification by tumor subtype or size, which may influence nodal involvement patterns. They also underscore a practical limitation of MRI availability, which may restrict model implementation in standard clinical settings.

As for lymphovascular invasion, Wang et al. (2022) [61] and Xu et al. (2024) [62] proposed two different BT-based radiomics strategies. Wang et al. used a volumetric approach with 3D features extracted from the entire tumor volume, achieving an AUC of 0.835 in a cohort of 135 patients. Xu et al., using slice-based 2D feature extraction from intra- and peritumoral regions, reported superior performance (AUC = 0.905, Acc = 0.815) in a slightly larger cohort (n = 178). The higher performance of the latter may stem from differences in ROI selection, feature robustness, or even cohort composition. Both studies underscore the value of addressing lymphovascular invasion non-invasively, a target rarely explored in AI-based breast cancer imaging research, despite its clinical relevance. Moreover, they demonstrate how BT can provide meaningful 3D tissue characterization, particularly in detecting subtle signs of microinvasion.

Overall, AI has shown to be able of assisting in complex staging tasks, from detecting occult invasion to predicting nodal or vascular involvement. While CESM shows potential for evaluating invasiveness, and BT demonstrates value for lymphovascular invasion detection, the most consistent gains are observed when multimodal data or sophisticated DL architectures are used.

Table 3. Studies using mammography-based and tomosynthesis-based imaging for breast cancer staging.

Study	Clinical Target	Modality	AI Approach	No. of Cases	Dataset	Performance Metric
Shi et al., 2018 [63]	Pure DCIS vs. DCIS with invasion	MM	TradRad and DeepRad	99 patients	private	TradRad: AUC = 0.68 (95% CI, 0.66–0.71) DeepRad: AUC = 0.70 (95% CI, 0.68–0.73)
Yang et al., 2019 [64]	ALN metastasis	MM	TradRad	147 patients	private	AUC = 0.875 (95% CI, 0.698–0.891), Acc = 0.800 (95% CI, 66.4–83.2%)
Li et al., 2019 [65]	Pure DCIS vs. DCIS with invasion	MM	TradRad	362 patients	private	AUC = 0.72 (95% CI, 0.63–0.81)
Tan et al., 2020 [66]	ALN metastasis	MM	TradRad	216 patients	private	AUC = 0.863 (95% CI, 0.821–0.897), Acc = 0.7917
Mao et al., 2020 [67]	ALN metastasis	CESM	TradRad	394 patients	private	AUC = 0.767 (95% CI, 0.583–0.857) internal validation AUC = 0.790 (95% CI, 0.63–0.94) external validation
Marino et al., 2020 [33]	Pure DCIS vs. DCIS with invasion	CESM	TradRad	100 patients	private	Acc = 0.874 invasive vs. noninvasive cancer
Kanbayti et al., 2021 [35]	ALN metastasis	MM	TradRad	184 patients	private	AUC = 0.681 (95% CI, 0.559–0.804)
Wu et al., 2022 [68]	ALN metastasis	CESM	DeepRad	182 patients	private	for non-sentinel lymph node metastasis status: AUC = 0.85 (95% CI, 0.71–0.99), Acc = 0.81 (95% CI, 0.63–0.93) testing AUC = 0.82 (95% CI, 0.67–0.97), Acc = 0.74 (95% CI, 0.55–0.88) temporal validation axillary tumor burden: AUC = 0.82 (95% CI, 0.66–0.97), Acc = 0.75 (95% CI, 0.55–0.88) testing AUC = 0.77 (95% CI, 0.62–0.93), Acc = 0.74 (95% CI, 0.55–0.88) temporal validation
Abel et al., 2022 [57]	ALN metastasis	MM	End-to-end DL	74 patients	private	Acc = 0.9596
Hou et al., 2022 [69]	Pure DCIS vs. DCIS with invasion	MM	TradRad	700 patients	private	AUC = 0.71 (95% CI, 0.62–0.79)
Wang et al., 2022 [61]	Lymphovascular Invasion	BT (volume-based)	TradRad	135 patients	private	AUC = 0.835 (95% CI, 0.712–0.958)
Lin et al., 2023 [70]	ALN metastasis	CESM	TradRad	365 patients	private	AUC = 0.7567 (95% CI, 0.6717–0.8678), Acc = 0.7551 (0.6113–0.8666)
Wang et al., 2023 [71]	ALN metastasis	CESM	TradRad	809 patients	private	AUC = 0.732, Acc = 0.681
Xu et al., 2023 [58]	ALN metastasis	BT (slice-based)	TradRad	120 patients	private	AUC = 0.920 (95% CI, 0.806–1.000), Acc = 0.833
Shimokawa et al., 2023 [72]	Stromal invasion/pure DCIS vs. DCIS with invasion	BT (slice-based)	End-to-end DL	140 patients	private	AUC = 0.75 (95% CI, 0.69–0.81)
Tsai et al., 2024 [73]	Pure DCIS vs. DCIS with invasion	MM	End-to-end DL	1436 patients	private	AUC = 0.747 (95% CI, 0.677–0.813)
Alaeikhanehshir et al., 2024 [54]	Pure DCIS vs. DCIS with invasion	MM	End-to-end DL	464 patients	private	AUC = 0.72 low-risk DCIS vs. high risk AUC = 0.76 low-risk DCIS vs. high-risk and/or upstaged DCIS (occult IBC)
Xu et al., 2024 [62]	Lymphovascular Invasion	BT (slice-based)	TradRad	178 patients	private	AUC = 0.905 (95% CI, 0.823–0.986), Acc = 0.815
He et al., 2025 [74]	ALN metastasis	BT (volume-based)	TradRad	536 patients	private	AUC = 0.792, Acc = 0.726

Table 4. Studies using multimodal imaging approaches (including mammography and tomosynthesis) for breast cancer staging.

Study	Clinical Target	Modality	AI Approach	No. of Cases	Dataset	Performance Metric
Marino et al., 2020 [28]	Pure DCIS vs. DCIS with invasion	CESM + MRI	TradRad	48 patients	private	CESM Acc = 0.92 MRI Acc = 0.90
Cheng et al., 2022 [75]	ALN metastasis	MM + MRI + BT (slice-based)	TradRad	208 patients	private	AUC = 0.786, Acc = 0.771 (MM + BT) AUC = 0.826, Acc = 0.829 (DCE-MRI + DWI)
Haraguchi et al., 2023 [76]	ALN metastasis	MM + BT (2D synthetic)	TradRad	77 patients	private	AUC = 0.738 (95% CI, 0.608–0.867), MM AUC = 0.742 (95% CI, 0.613–0.871), BT (2D synthetic)
Wang et al., 2024 [59]	ALN metastasis	MM + MRI	TradRad	485 patients	private	AUC = 0.892 (95% CI, 0.826–0.939), Acc = 0.8182
Hua et al., 2024 [60]	ALN metastasis	MM + MRI	TradRad	492 patients	private	AUC = 0.902 (95% CI, 0.833–0.972), Acc = 0.847
Guo et al., 2024 [77]	ALN metastasis	MM + MRI	TradRad and DeepRad	270 patients	private	AUC = 0.846, Acc = 0.765
Wu et al., 2024 [55]	Pure DCIS vs. DCIS with invasion	MM + US	End-to-end DL	733 patients	private	AUC = 0.859–0.907, Acc = 0.752–0.766 in low-grade DCIS vs. intermediate-to-high-grade DCIS vs. upstaged DCIS AUC = 0.829–0.861, Acc = 0.751–0.780 in DCIS vs. upstaged DCIS AUC = 0.769–0.987, Acc = 0.818–0.939 in low-grade DCIS vs. upstaged low-grade DCIS
Cheng et al., 2025 [78]	ALN metastasis	MM + MRI	TradRad and End-to-end DL	326 patients	private	AUC = 0.877, Acc = 0.727
Wu et al., 2025 [56]	Pure DCIS vs. DCIS with invasion	MM + US	TradRad	237 patients	private	AUC = 0.92 (95% CI, 0.84–0.96), Acc = 0.88 low nuclear grade DCIS vs. Intermediate to high nuclear grade

4.3. Treatment Response and Prognosis

Breast cancer management is increasingly shaped by the ability to anticipate treatment efficacy and long-term outcomes. In this context, AI offers a non-invasive path to predict clinically meaningful endpoints such as pathological complete response (pCR) to neoadjuvant chemotherapy, recurrence risk, and immunological markers like tumor-infiltrating lymphocytes. Predicting pCR preoperatively can help tailor therapy intensity and avoid overtreatment, while recurrence risk models may guide follow-up strategies and adjuvant therapy decisions. The studies summarized in Table 5 and Table 6 reflect a growing interest in integrating AI into therapeutic planning and outcome prediction, though their clinical translation remains limited by methodological challenges and dataset constraints.

Mao et al. (2022) [79] employed CESM-based radiomics to predict pCR in 118 breast cancer patients receiving neoadjuvant chemotherapy. Their pipeline integrated intratumoral and peritumoral features, achieving an AUC of 0.85. This inclusion of peritumoral regions reflects an emerging recognition that tumor-stroma interactions influence therapeutic outcomes. The model, based on multivariate logistic regression, offers interpretability while maintaining competitive performance. A notable strength lies in the detailed radiomic feature selection and the clarity of the biological rationale, particularly the correlation of peritumoral enhancement with response. However, the study would benefit from a broader validation across imaging centers to assess robustness.

Xing et al. (2024) [80] adopted a different approach to predict pCR in ER+/HER2− patients undergoing neoadjuvant chemotherapy, integrating CESM with an end-to-end DL model. In a cohort of 265 patients, the model achieved excellent results (AUC = 0.95; Acc = 0.94), by integrating deep features from low-energy and recombined CESM images with selected clinical and pathological variables. The study included a nomogram incorporating these fused inputs, offering a more interpretable and potentially deployable tool for clinical use. The inclusion of an independent test set lends credibility to the performance metrics, which are often overestimated in small internal validations. However, the restriction to a single molecular subtype (ER+/HER2−) raises questions about generalizability, especially since neoadjuvant chemotherapy response is known to vary substantially across subtypes. While the study presents a strong case for CESM-based DL in the pCR setting, broader validation across subtypes and imaging centers remains a necessary next step.

Förnvik et al. (2024) [81] explored the potential of temporal modeling using volume-based BT images to predict pCR in a prospective NAC setting. The study stands out methodologically by integrating volumetric changes across different timepoints of chemotherapy, simulating a longitudinal assessment of treatment response. A 3D ResNet with an attention module was trained on volumetric inputs from three timepoints of chemotherapy, simulating a longitudinal assessment of treatment response, rather than relying on static pre-treatment imaging. Despite a relatively modest sample size (n = 149), the model achieved an AUC of 0.83, highlighting the added value of temporal information. GradCAM analyses showed that the network consistently focused on similar tumor regions across timepoints, suggesting that it captured relevant treatment dynamics. However, BT’s limited soft tissue contrast compared to CESM or MRI may reduce its ability to detect more subtle changes in tumor microenvironment, possibly constraining its utility in borderline or heterogeneous responses.

Beyond response prediction, several studies addressed recurrence and long-term prognosis. Jiang et al. (2020) [82] developed a MM-based radiomics model to construct a nomogram for predicting invasive disease-free survival in patients with triple-negative breast cancer. From 136 extracted features, a 14-feature Rad-score was derived using LASSO and incorporated with pathological nodal stage into a multivariate regression nomogram. This combined model achieved a C-index of 0.944 in the validation cohort, indicating strong prognostic discrimination and outperforming a clinical model based solely on nodal stage. While promising, the limited number of events and absence of external validation raise concerns about potential overfitting and generalizability. Mao et al. (2021) [83] conducted a larger, multicenter retrospective study involving 304 patients with ER-positive, LN-negative invasive breast cancer to predict recurrence risk based on Oncotype DX (the 21- gene recurrence risk score test). Their radiomics model, based on preoperative MM, achieved AUCs of 0.88 in the internal cohort and 0.84 in an external validation set. While lower than Jiang et al.’s results, the inclusion of an independent external test set makes these findings very significant. Additionally, the study incorporated multiple feature selection and classifier strategies, improving methodological transparency. Nonetheless, the retrospective design, and limited discussion on feature stability across scanners and acquisition protocols suggest that further standardization is needed before real-world translation.

Ma et al. (2024) [84] explored prognosis prediction in a rare setting—phyllodes tumors, which account for less than 1% of breast tumors—using a multimodal radiomics model combining MM and MRI. Their pipeline achieved an AUC of 0.95 and accuracy of 0.923 for disease-free survival prediction after surgery. While phyllodes tumors are typically understudied in AI literature due to their low prevalence, this work demonstrates the feasibility of predictive modeling even in rare breast cancer subtypes. Despite the encouraging results, the model’s generalizability remains uncertain due to the small cohort (n = 131) and the uniqueness of the tumor type, which may limit direct extrapolation to more common breast cancers. Nonetheless, the study marks an important step toward expanding AI applicability beyond the dominant focus on invasive ductal carcinoma.

Yu et al. (2021) [85] investigated the potential of MM-based radiomics to predict tumor-infiltrating lymphocyte levels, achieving moderate performance (AUC = 0.79; accuracy = 0.639). Despite that, the study addresses an innovative and clinically relevant target, as tumor-infiltrating lymphocytes reflect tumor immunogenicity and are increasingly recognized as predictive biomarkers for immunotherapy response. By identifying imaging correlates of immune infiltration, the authors open avenues for integrating radiomics into immuno-oncology workflows. However, the relatively low accuracy highlights the complexity of capturing microenvironmental features from standard mammograms, which lack functional or contrast information. Future work could benefit from multimodal data fusion and inclusion of histopathological or transcriptomic validation to improve robustness and clinical value.

In sum, CESM dominates the treatment response subfield, likely due to its functional imaging capabilities. Meanwhile, multimodal and longitudinal approaches are emerging as promising directions, but reproducibility, interpretability, and standardization remain key obstacles before AI-based prognostic tools can become routine in clinical oncology.

Table 5. Studies using mammography-based and tomosynthesis-based imaging for treatment response assessment and prognosis prediction in breast cancer.

Study	Clinical Target	Modality	AI Approach	No. of Cases	Dataset	Performance Metric
Jiang et al., 2020 [82]	Prognostic/recurrence prediction	MM	TradRad	200 patients	private	C-index = 0.944 (95% CI, 0.883–1.004) for predicting iDFS (invasive disease-free survival)
Wang et al., 2021 [86]	pCR to neoadjuvant chemotherapy	CESM	TradRad	117 patients	private	AUC = 0.81 (95% CI, 0.575–0.948), Acc = 0.80
Mao et al., 2021 [83]	Prognostic/recurrence prediction	MM	TradRad	304 patients	private	AUC = 0.88 (95% CI, 0.75–1.00) internal test set AUC = 0.84 (95% CI, 0.69–0.99) external test set
Yu et al., 2021 [85]	Tumor-infiltrating lymphocyte	MM	TradRad	121 patients	private	AUC = 0.79 (95% CI, 0.615–0.964), Acc = 0.639
Skarping et al., 2022 [87]	pCR to neoadjuvant chemotherapy	MM	End-to-end DL	453 patients	private	AUC = 0.71 (95% CI, 0.53–0.90)
Mao et al., 2022 [79]	pCR to neoadjuvant chemotherapy	CESM	TradRad	118 patients	private	AUC = 0.85 (95% CI, 0.72–0.98)
Zhang et al., 2023 [88]	pCR to neoadjuvant chemotherapy	CESM	TradRad	118 patients	private	AUC = 0.790 (95% CI, 0.554–0.952), Acc = 0.861
Xing et al., 2024 [80]	pCR to neoadjuvant chemotherapy	CESM	End-to-end DL	265 patients	private	AUC = 0.95 (95% CI, 0.847–1.0), Acc = 0.94
Förnvik et al., 2024 [81]	pCR to neoadjuvant chemotherapy	BT (volume-based)	End-to-end DL	149 patients	private	AUC = 0.83 (95% CI, 0.63–1.00)

Table 6. Studies using multimodal imaging approaches (including mammography and tomosynthesis) for treatment response assessment and prognosis prediction in breast cancer.

Study	Clinical Target	Modality	AI Approach	No. of Cases	Dataset	Performance Metric
Han et al., 2024 [89]	Prognostic/recurrence prediction	MM + US	End-to-end DL	1242 patients	private	AUC = 0.739 (95% CI, 0.608–0.871), Acc = 0.798 (95% CI, 0.797–0.800) disease-free survival
Ma et al., 2024 [84]	Prognostic/recurrence prediction	MM + MRI	TradRad	131 patients	private	AUC = 0.95 (95% CI, 0.92–0.98), Acc = 0.923 disease-free survival
Cai et al., 2024 [90]	pCR to neoadjuvant chemotherapy	BT (slice-based) + US	TradRad	720 patients	private	AUC = 0.81 (95% CI, 0.75–0.87), Acc =0. 764

5. Conclusions

This review highlights how AI, through radiomics and DL, is reshaping breast cancer characterization using MM and BT. From hormone receptor status to recurrence prediction, AI models have demonstrated the ability to non-invasively extract biological and clinical information directly from standard imaging. Radiomics remains widely used, particularly in contrast-enhanced spectral mammography (CESM), due to its interpretability and lower data requirements. DL strategies, especially end-to-end CNNs, are gaining momentum in MM, driven by larger datasets and reduced reliance on manual annotations.

A clear trend emerges: multimodal and weakly supervised approaches, integration of peritumoral regions, and temporal modeling are showing promise across multiple clinical targets. However, challenges persist. Most studies are retrospective, monocentric, and based on private data, which hampers reproducibility and comparison. Of the 81 studies included, only 6 used public or semi-public datasets, limiting transparency and external benchmarking. Furthermore, several studies address multiple clinical targets simultaneously, reflecting the feasibility of comprehensive, multiparametric models.

Beyond the limited availability of multicentric and public datasets, additional methodological challenges continue to hinder clinical translation. The variability in biomarker assessment—such as differences in immunohistochemical thresholds for ER, PR, HER2, and Ki-67—introduces inconsistencies in the ground truth used to train AI models. Many studies also face class imbalance, with underrepresented subtypes such as HER2-positive and triple-negative cancers reducing model generalizability. Moreover, segmentation strategies, image preprocessing and feature extraction methods remain heterogeneous across studies, complicating reproducibility and cross-cohort comparison. Addressing these limitations will require stronger harmonization of imaging and annotation protocols, transparent sharing of data and code, and larger, prospectively collected cohorts to support the development of reliable and clinically applicable models.

No filter was applied to the publication year during article selection, but all relevant studies were published between 2018 and 2025, underscoring the recent and fast-evolving nature of this field. Despite the growing number of publications, robust clinical integration of AI tools remains limited by the lack of external validation, absence of standardized image acquisition and annotation protocols, and variable outcome definitions.

Funding

Research supported by Fundação para a Ciência e Tecnologia–Portugal (FCT IBEB Strategic Project UID/645/2025).

Conflicts of Interest

The author declares no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

MM	mammography
BT	breast tomosynthesis
US	breast ultrasound
MRI	magnetic resonance imaging
AI	artificial intelligence
CESM	contrast-enhanced spectral mammography
DL	deep learning
CNN	convolutional neural networks
ROI	region of interest
ALN	axillary lymph node
pCR	pathological complete response
ER	estrogen receptor
PR	progesterone receptor
HER2	human epidermal growth factor receptor 2
IHC	immunohistochemistry
TN	triple-negative
DCIS	ductal carcinoma in situ

References

Wilkinson, L.; Gathani, T. Understanding breast cancer as a global health concern. Br. J. Radiol. 2021, 95, 20211033. [Google Scholar] [CrossRef]
Tabár, L.; Dean, P.B.; Chen, T.H.-H.; Yen, A.M.-F.; Chen, S.L.-S.; Fann, J.C.-Y.; Chiu, S.Y.-H.; Ku, M.M.-S.; Wu, W.Y.-Y.; Hsu, C.-Y.; et al. The incidence of fatal breast cancer measures the increased effectiveness of therapy in women participating in mammography screening. Cancer 2019, 125, 515–523. [Google Scholar] [CrossRef] [PubMed]
Litchfield, K.; Stanislaw, S.; Spain, L.; Gallegos, L.L.; Rowan, A.; Schnidrig, D.; Rosenbaum, H.; Harle, A.; Au, L.; Hill, S.M.; et al. Representative Sequencing: Unbiased Sampling of Solid Tumor Tissue. Cell Rep. 2020, 31, 107550. [Google Scholar] [CrossRef] [PubMed]
McGranahan, N.; Swanton, C. Clonal Heterogeneity and Tumor Evolution: Past, Present, and the Future. Cell 2017, 168, 613–628. [Google Scholar] [CrossRef]
Chen, Y.; Liu, H.; Sun, Y. Effect of acute inflammatory reaction induced by biopsy on tumor microenvironment. J. Cancer Res. Clin. Oncol. 2024, 150, 177. [Google Scholar] [CrossRef]
Crouigneau, R.; Li, Y.-F.; Auxillos, J.; Goncalves-Alves, E.; Marie, R.; Sandelin, A.; Pedersen, S.F. Mimicking and analyzing the tumor microenvironment. Cell Rep. Methods 2024, 4, 100866. [Google Scholar] [CrossRef]
Bareche, Y.; Buisseret, L.; Gruosso, T.; Girard, E.; Venet, D.; Dupont, F.; Desmedt, C.; Larsimont, D.; Park, M.; Rothé, F.; et al. Unraveling Triple-Negative Breast Cancer Tumor Microenvironment Heterogeneity: Towards an Optimized Treatment Approach. J. Natl. Cancer Inst. 2020, 112, 708–719. [Google Scholar] [CrossRef]
Kumar Yadav, S.; Sharma, D.; Bala Sharma, D.; Kintu-Luwaga, R.; Jha, C.K.; Shekhar, S. Barriers and challenges in providing standard breast cancer care in low resource settings. Trop. Dr. 2022, 52, 532–537. [Google Scholar] [CrossRef]
Zackrisson, S.; Lång, K.; Rosso, A.; Johnson, K.; Dustler, M.; Förnvik, D.; Förnvik, H.; Sartor, H.; Timberg, P.; Tingberg, A.; et al. One-view breast tomosynthesis versus two-view mammography in the Malmö Breast Tomosynthesis Screening Trial (MBTST): A prospective, population-based, diagnostic accuracy study. Lancet Oncol. 2018, 19, 1493–1503. [Google Scholar] [CrossRef] [PubMed]
Hofvind, S.; Hovda, T.; Holen, Å.S.; Lee, C.I.; Albertsen, J.; Bjørndal, H.; Brandal, S.H.B.; Gullien, R.; Lømo, J.; Park, D.; et al. Digital Breast Tomosynthesis and Synthetic 2D Mammography versus Digital Mammography: Evaluation in a Population-based Screening Program. Radiology 2018, 287, 787–794. [Google Scholar] [CrossRef]
Lambin, P.; Rios-Velazquez, E.; Leijenaar, R.; Carvalho, S.; van Stiphout, R.G.P.M.; Granton, P.; Zegers, C.M.L.; Gillies, R.; Boellard, R.; Dekker, A.; et al. Radiomics: Extracting more information from medical images using advanced feature analysis. Eur. J. Cancer 2012, 48, 441–446. [Google Scholar] [CrossRef]
Gillies, R.J.; Kinahan, P.E.; Hricak, H. Radiomics: Images Are More than Pictures, They Are Data. Radiology 2016, 278, 563–577. [Google Scholar] [CrossRef]
Al-Karawi, D.; Al-Zaidi, S.; Helael, K.A.; Obeidat, N.; Mouhsen, A.M.; Ajam, T.; Alshalabi, B.A.; Salman, M.; Ahmed, M.H. A Review of Artificial Intelligence in Breast Imaging. Tomography 2024, 10, 705–726. [Google Scholar] [CrossRef] [PubMed]
Lei, Y.M.; Yin, M.; Yu, M.H.; Yu, J.; Zeng, S.E.; Lv, W.Z.; Li, J.; Ye, H.R.; Cui, X.W.; Dietrich, C.F. Artificial Intelligence in Medical Imaging of the Breast. Front. Oncol. 2021, 11, 600557. [Google Scholar] [CrossRef] [PubMed]
Elahi, R.; Nazari, M. An updated overview of radiomics-based artificial intelligence (AI) methods in breast cancer screening and diagnosis. Radiol. Phys. Technol. 2024, 17, 795–818. [Google Scholar] [CrossRef]
Hussain, S.; Lafarga-Osuna, Y.; Ali, M.; Naseem, U.; Ahmed, M.; Tamez-Peña, J.G. Deep learning, radiomics and radiogenomics applications in the digital breast tomosynthesis: A systematic review. BMC Bioinform. 2023, 24, 401. [Google Scholar] [CrossRef]
La Forgia, D.; Fanizzi, A.; Campobasso, F.; Bellotti, R.; Didonna, V.; Lorusso, V.; Moschetta, M.; Massafra, R.; Tamborra, P.; Tangaro, S.; et al. Radiomic Analysis in Contrast-Enhanced Spectral Mammography for Predicting Breast Cancer Histological Outcome. Diagnostics 2020, 10, 708. [Google Scholar] [CrossRef] [PubMed]
Zhang, M.; Wang, C.; Cai, L.; Zhao, J.; Xu, Y.; Xing, J.; Sun, J.; Zhang, Y. Developing a weakly supervised deep learning framework for breast cancer diagnosis with HR status based on mammography images. Comput. Struct. Biotechnol. J. 2023, 22, 17–26. [Google Scholar] [CrossRef]
Duan, W.; Wu, Z.; Zhu, H.; Zhu, Z.; Liu, X.; Shu, Y.; Zhu, X.; Wu, J.; Peng, D. Deep learning modeling using mammography images for predicting estrogen receptor status in breast cancer. Am. J. Transl. Res. 2024, 16, 2411–2422. [Google Scholar] [CrossRef]
Wang, Q.; Zhang, Z.Q.; Huang, C.C.; Xue, H.W.; Zhang, H.; Bo, F.; Guan, W.T.; Zhou, W.; Bai, G.J. Dual-Modality Virtual Biopsy System Integrating MRI and MG for Noninvasive Predicting HER2 Status in Breast Cancer. Acad Radiol. 2025, 32, 3858–3869. [Google Scholar] [CrossRef]
Zhang, Y.; Liu, F.; Zhang, H.; Ma, H.; Sun, J.; Zhang, R.; Song, L.; Shi, H. Diagnostic Value of Radiomics Analysis in Contrast-Enhanced Spectral Mammography for Identifying Triple-Negative Breast Cancer. Front. Oncol. 2021, 11, 773196. [Google Scholar] [CrossRef]
Zhu, S.; Wang, S.; Guo, S.; Wu, R.; Zhang, J.; Kong, M.; Pan, L.; Gu, Y.; Yu, S. Contrast-Enhanced Mammography Radiomics Analysis for Preoperative Prediction of Breast Cancer Molecular Subtypes. Acad. Radiol. 2023, 31, 2228–2238. [Google Scholar] [CrossRef]
Niu, S.; Jiang, W.; Zhao, N.; Jiang, T.; Dong, Y.; Luo, Y.; Yu, T.; Jiang, X. Intra- and peritumoral radiomics on assessment of breast cancer molecular subtypes based on mammography and MRI. J. Cancer Res. Clin. Oncol. 2022, 148, 97–106. [Google Scholar] [CrossRef]
Oba, K.; Adachi, M.; Kobayashi, T.; Takaya, E.; Shimokawa, D.; Fukuda, T.; Takahashi, K.; Yagishita, K.; Ueda, T.; Tsunoda, H. Deep learning model to predict Ki-67 expression of breast cancer using digital breast tomosynthesis. Breast Cancer 2024. [Google Scholar] [CrossRef]
Jiang, T.; Song, J.; Wang, X.; Niu, S.; Zhao, N.; Dong, Y.; Wang, X.; Luo, Y.; Jiang, X. Intratumoral and Peritumoral Analysis of Mammography, Tomosynthesis, and Multiparametric MRI for Predicting Ki-67 Level in Breast Cancer: A Radiomics-Based Study. Mol. Imaging Biol. 2022, 24, 550–559. [Google Scholar] [CrossRef]
Mao, N.; Jiao, Z.; Duan, S.; Xu, C.; Xie, H. Preoperative prediction of histologic grade in invasive breast cancer by using contrast-enhanced spectral mammography-based radiomics. J. X-Ray Sci. Technol. 2021, 29, 763–772. [Google Scholar] [CrossRef]
Petrillo, A.; Fusco, R.; Petrosino, T.; Vallone, P.; Granata, V.; Rubulotta, M.R.; Pariante, P.; Raiano, N.; Scognamiglio, G.; Fanizzi, A.; et al. A multicentric study of radiomics and artificial intelligence analysis on contrast-enhanced mammography to identify different histotypes of breast cancer. La Radiol. Medica 2024, 129, 864–878. [Google Scholar] [CrossRef] [PubMed]
Marino, M.A.; Leithner, D.; Sung, J.; Avendano, D.; Morris, E.A.; Pinker, K.; Jochelson, M.S. Radiomics for Tumor Characterization in Breast Cancer Patients: A Feasibility Study Comparing Contrast-Enhanced Mammography and Magnetic Resonance Imaging. Diagnostics 2020, 10, 492. [Google Scholar] [CrossRef] [PubMed]
Zhou, J.; Tan, H.; Bai, Y.; Li, J.; Lu, Q.; Chen, R.; Zhang, M.; Feng, Q.; Wang, M. Evaluating the HER-2 status of breast cancer using mammography radiomics features. Eur. J. Radiol. 2019, 121, 108718. [Google Scholar] [CrossRef] [PubMed]
Ma, W.; Zhao, Y.; Ji, Y.; Guo, X.; Jian, X.; Liu, P.; Wu, S. Breast Cancer Molecular Subtype Prediction by Mammographic Radiomic Features. Acad Radiol. 2019, 26, 196–201. [Google Scholar] [CrossRef]
Tagliafico, A.S.; Bignotti, B.; Rossi, F.; Matos, J.; Calabrese, M.; Valdora, F.; Houssami, N. Breast cancer Ki-67 expression prediction by digital breast tomosynthesis radiomics features. Eur. Radiol. Exp. 2019, 3, 36. [Google Scholar] [CrossRef] [PubMed]
Wang, L.; Yang, W.; Xie, X.; Liu, W.; Wang, H.; Shen, J.; Ding, Y.; Zhang, B.; Song, B. Application of digital mammography-based radiomics in the differentiation of benign and malignant round-like breast tumors and the prediction of molecular subtypes. Gland Surg. 2020, 9, 2005–2016. [Google Scholar] [CrossRef]
Marino, M.A.; Pinker, K.; Leithner, D.; Sung, J.; Avendano, D.; Morris, E.A.; Jochelson, M. Contrast-Enhanced Mammography and Radiomics Analysis for Noninvasive Breast Cancer Characterization: Initial Results. Mol. Imaging Biol. 2020, 22, 780–787. [Google Scholar] [CrossRef]
Son, J.; Lee, S.E.; Kim, E.-K.; Kim, S. Prediction of breast cancer molecular subtypes using radiomics signatures of synthetic mammography from digital breast tomosynthesis. Sci. Rep. 2020, 10, 21566. [Google Scholar] [CrossRef]
Kanbayti, I.H.; Rae, W.I.D.; McEntee, M.F.; Gandomkar, Z.; Ekpo, E.U. Clinicopathologic breast cancer characteristics: Predictions using global textural features of the ipsilateral breast mammogram. Radiol. Phys. Technol. 2021, 14, 248–261. [Google Scholar] [CrossRef]
Ueda, D.; Yamamoto, A.; Takashima, T.; Onoda, N.; Noda, S.; Kashiwagi, S.; Morisaki, T.; Honjo, T.; Shimazaki, A.; Miki, Y. Training, Validation, and Test of Deep Learning Models for Classification of Receptor Expressions in Breast Cancers From Mammograms. JCO Precis. Oncol. 2021, 5, 543–551. [Google Scholar] [CrossRef]
Ge, S.; Yixing, Y.; Jia, D.; Ling, Y. Application of mammography-based radiomics signature for preoperative prediction of triple-negative breast cancer. BMC Med. Imaging 2022, 22, 166. [Google Scholar] [CrossRef]
Petrillo, A.; Fusco, R.; Di Bernardo, E.; Petrosino, T.; Barretta, M.L.; Porto, A.; Granata, V.; Di Bonito, M.; Fanizzi, A.; Massafra, R.; et al. Prediction of Breast Cancer Histological Outcome by Radiomics and Artificial Intelligence Analysis in Contrast-Enhanced Mammography. Cancers 2022, 14, 2132. [Google Scholar] [CrossRef]
Dominique, C.; Callonnec, F.; Berghian, A.; Defta, D.; Vera, P.; Modzelewski, R.; Decazes, P. Deep learning analysis of contrast-enhanced spectral mammography to determine histoprognostic factors of malignant breast tumours. Eur. Radiol. 2022, 32, 4834–4844. [Google Scholar] [CrossRef] [PubMed]
Jiang, T.; Jiang, W.; Chang, S.; Wang, H.; Niu, S.; Yue, Z.; Yang, H.; Wang, X.; Zhao, N.; Fang, S.; et al. Intratumoral analysis of digital breast tomosynthesis for predicting the Ki-67 level in breast cancer: A multi-center radiomics study. Med. Phys. 2022, 49, 219–230. [Google Scholar] [CrossRef] [PubMed]
Rong, X.-C.; Kang, Y.-H.; Shi, G.-F.; Ren, J.-L.; Liu, Y.-H.; Li, Z.-G.; Yang, G. The use of mammography-based radiomics nomograms for the preoperative prediction of the histological grade of invasive ductal carcinoma. J. Cancer Res. Clin. Oncol. 2023, 149, 11635–11645. [Google Scholar] [CrossRef]
Deng, Y.; Lu, Y.; Li, X.; Zhu, Y.; Zhao, Y.; Ruan, Z.; Mei, N.; Yin, B.; Liu, L. Prediction of human epidermal growth factor receptor 2 (HER2) status in breast cancer by mammographic radiomics features and clinical characteristics: A multicenter study. Eur. Radiol. 2024, 34, 5464–5476. [Google Scholar] [CrossRef]
Mota, A.M.; Mendes, J.; Matela, N. Breast Cancer Molecular Subtype Prediction: A Mammography-Based AI Approach. Biomedicines 2024, 12, 1371. [Google Scholar] [CrossRef] [PubMed]
Bakker, M.A.G.; Ovalho, M.d.L.; Matela, N.; Mota, A.M. Decoding Breast Cancer: Using Radiomics to Non-Invasively Unveil Molecular Subtypes Directly from Mammographic Images. J. Imaging 2024, 10, 218. [Google Scholar] [CrossRef] [PubMed]
Nissar, I.; Alam, S.; Masood, S.; Kashif, M. MOB-CBAM: A dual-channel attention-based deep learning generalizable model for breast cancer molecular subtypes prediction using mammograms. Comput. Methods Programs Biomed. 2024, 248, 108121. [Google Scholar] [CrossRef] [PubMed]
Zeng, S.; Chen, H.; Jing, R.; Yang, W.; He, L.; Zou, T.; Liu, P.; Liang, B.; Shi, D.; Wu, W.; et al. An assessment of breast cancer HER2, ER, and PR expressions based on mammography using deep learning with convolutional neural networks. Sci. Rep. 2025, 15, 4826. [Google Scholar] [CrossRef]
Ben Rabah, C.; Sattar, A.; Ibrahim, A.; Serag, A. A Multimodal Deep Learning Model for the Classification of Breast Cancer Subtypes. Diagnostics 2025, 15, 995. [Google Scholar] [CrossRef]
Hu, Z.; Xu, M.; Yang, H.; Hao, H.; Zhao, P.; Yang, Y.; Liu, G. Development of an Intra- and Peritumoral Radiomics Nomogram Using Digital Breast Tomosynthesis for Preoperative Assessment of Ki-67 Expression in Invasive Breast Cancer. Acad. Radiol. 2025, 32, 2465–2476. [Google Scholar] [CrossRef]
Mota, A.M. AI and Tomosynthesis for Breast Cancer Molecular Subtyping: A Step Toward Precision Medicine. In Proceedings of the 47th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC 2025), Copenhagen, Denmark, 14–17 July 2025. [Google Scholar]
Liu, M.; Zhang, S.; Du, Y.; Zhang, X.; Wang, D.; Ren, W.; Sun, J.; Yang, S.; Zhang, G. Identification of Luminal A breast cancer by using deep learning analysis based on multi-modal images. Front. Oncol. 2023, 13, 1243126. [Google Scholar] [CrossRef]
Zhang, T.; Tan, T.; Han, L.; Appelman, L.; Veltman, J.; Wessels, R.; Duvivier, K.M.; Loo, C.; Gao, Y.; Wang, X.; et al. Predicting breast cancer types on and beyond molecular level in a multi-modal fashion. npj Breast Cancer 2023, 9, 16. [Google Scholar] [CrossRef]
Liu, J.; Yan, C.; Liu, C.; Wang, Y.; Chen, Q.; Chen, Y.; Guo, J.; Chen, S. Predicting Ki-67 expression levels in breast cancer using radiomics-based approaches on digital breast tomosynthesis and ultrasound. Front. Oncol. 2024, 14, 1403522. [Google Scholar] [CrossRef] [PubMed]
Yang, X.; Li, J.; Sun, H.; Chen, J.; Xie, J.; Peng, Y.; Shang, T.; Pan, T. Radiomics Integration of Mammography and DCE-MRI for Predicting Molecular Subtypes in Breast Cancer Patients. Breast Cancer (Dove Med. Press) 2025, 17, 187–200. [Google Scholar] [CrossRef] [PubMed]
Alaeikhanehshir, S.; Voets, M.M.; van Duijnhoven, F.H.; Lips, E.H.; Groen, E.J.; van Oirsouw, M.C.J.; Hwang, S.E.; Lo, J.Y.; Wesseling, J.; Mann, R.M.; et al. Application of deep learning on mammographies to discriminate between low and high-risk DCIS for patient participation in active surveillance trials. Cancer Imaging 2024, 24, 48. [Google Scholar] [CrossRef] [PubMed]
Wu, H.; Jiang, Y.; Tian, H.; Ye, X.; Cui, C.; Shi, S.; Chen, M.; Ding, Z.; Li, S.; Huang, Z.; et al. Sonography-based multimodal information platform for identifying the surgical pathology of ductal carcinoma in situ. Comput. Methods Programs Biomed. 2024, 245, 108039. [Google Scholar] [CrossRef]
Wu, Y.; Xu, D.; Zha, Z.; Gu, L.; Chen, J.; Fang, J.; Dou, Z.; Zhang, P.; Zhang, C.; Wang, J. Integrating radiomics into predictive models for low nuclear grade DCIS using machine learning. Sci. Rep. 2025, 15, 7505. [Google Scholar] [CrossRef]
Abel, F.; Landsmann, A.; Hejduk, P.; Ruppert, C.; Borkowski, K.; Ciritsis, A.; Rossi, C.; Boss, A. Detecting Abnormal Axillary Lymph Nodes on Mammograms Using a Deep Convolutional Neural Network. Diagnostics 2022, 12, 1347. [Google Scholar] [CrossRef]
Xu, M.; Yang, H.; Yang, Q.; Teng, P.; Hao, H.; Liu, C.; Yu, S.; Liu, G. Radiomics nomogram based on digital breast tomosynthesis: Preoperative evaluation of axillary lymph node metastasis in breast carcinoma. J. Cancer Res. Clin. Oncol. 2023, 149, 9317–9328. [Google Scholar] [CrossRef]
Wang, Q.; Lin, Y.; Ding, C.; Guan, W.; Zhang, X.; Jia, J.; Zhou, W.; Liu, Z.; Bai, G. Multi-modality radiomics model predicts axillary lymph node metastasis of breast cancer using MRI and mammography. Eur. Radiol. 2024, 34, 6121–6131. [Google Scholar] [CrossRef]
Hua, Y.; Peng, Q.; Han, J.; Fei, J.; Sun, A. A two-center study of a combined nomogram based on mammography and MRI to predict ALN metastasis in breast cancer. Magn. Reson. Imaging 2024, 110, 128–137. [Google Scholar] [CrossRef]
Wang, D.; Liu, M.; Zhuang, Z.; Wu, S.; Zhou, P.; Chen, X.; Zhu, H.; Liu, H.; Zhang, L. Radiomics Analysis on Digital Breast Tomosynthesis: Preoperative Evaluation of Lymphovascular Invasion Status in Invasive Breast Cancer. Acad. Radiol. 2022, 29, 1773–1782. [Google Scholar] [CrossRef]
Xu, M.; Yang, H.; Sun, J.; Hao, H.; Li, X.; Liu, G. Development of an Intratumoral and Peritumoral Radiomics Nomogram Using Digital Breast Tomosynthesis for Preoperative Assessment of Lymphovascular Invasion in Invasive Breast Cancer. Acad. Radiol. 2024, 31, 1748–1761. [Google Scholar] [CrossRef]
Shi, B.; Grimm, L.J.; Mazurowski, M.A.; Baker, J.A.; Marks, J.R.; King, L.M.; Maley, C.C.; Hwang, E.S.; Lo, J.Y. Prediction of Occult Invasive Disease in Ductal Carcinoma in Situ Using Deep Learning Features. J. Am. Coll. Radiol. 2018, 15, 527–534. [Google Scholar] [CrossRef]
Yang, J.; Wang, T.; Yang, L.; Wang, Y.; Li, H.; Zhou, X.; Zhao, W.; Ren, J.; Li, X.; Tian, J.; et al. Preoperative Prediction of Axillary Lymph Node Metastasis in Breast Cancer Using Mammography-Based Radiomics Method. Sci. Rep. 2019, 9, 4429. [Google Scholar] [CrossRef]
Li, J.; Song, Y.; Xu, S.; Wang, J.; Huang, H.; Ma, W.; Jiang, X.; Wu, Y.; Cai, H.; Li, L. Predicting underestimation of ductal carcinoma in situ: A comparison between radiomics and conventional approaches. Int. J. Comput. Assist. Radiol. Surg. 2019, 14, 709–721. [Google Scholar] [CrossRef]
Tan, H.; Wu, Y.; Bao, F.; Zhou, J.; Wan, J.; Tian, J.; Lin, Y.; Wang, M. Mammography-based radiomics nomogram: A potential biomarker to predict axillary lymph node metastasis in breast cancer. Br. J. Radiol. 2020, 93, 20191019. [Google Scholar] [CrossRef]
Mao, N.; Yin, P.; Li, Q.; Wang, Q.; Liu, M.; Ma, H.; Dong, J.; Che, K.; Wang, Z.; Duan, S.; et al. Radiomics nomogram of contrast-enhanced spectral mammography for prediction of axillary lymph node metastasis in breast cancer: A multicenter study. Eur. Radiol. 2020, 30, 6732–6739. [Google Scholar] [CrossRef]
Wu, X.; Guo, Y.; Sa, Y.; Song, Y.; Li, X.; Lv, Y.; Xing, D.; Sun, Y.; Cong, Y.; Yu, H.; et al. Contrast-Enhanced Spectral Mammography-Based Prediction of Non-Sentinel Lymph Node Metastasis and Axillary Tumor Burden in Patients With Breast Cancer. Front. Oncol. 2022, 12, 823897. [Google Scholar] [CrossRef] [PubMed]
Hou, R.; Grimm, L.J.; Mazurowski, M.A.; Marks, J.R.; King, L.M.; Maley, C.C.; Lynch, T.; van Oirsouw, M.; Rogers, K.; Stone, N.; et al. Prediction of Upstaging in Ductal Carcinoma in Situ Based on Mammographic Radiomic Features. Radiology 2022, 303, 54–62. [Google Scholar] [CrossRef]
Lin, F.; Li, Q.; Wang, Z.; Shi, Y.; Ma, H.; Zhang, H.; Zhang, K.; Yang, P.; Zhang, R.; Duan, S.; et al. Intratumoral and peritumoral radiomics for preoperatively predicting the axillary non-sentinel lymph node metastasis in breast cancer on the basis of contrast-enhanced mammography: A multicenter study. Br. J. Radiol. 2023, 96, 20220068. [Google Scholar] [CrossRef] [PubMed]
Wang, Z.; Zhang, H.; Lin, F.; Zhang, R.; Ma, H.; Shi, Y.; Yang, P.; Zhang, K.; Zhao, F.; Mao, N.; et al. Intra- and Peritumoral Radiomics of Contrast-Enhanced Mammography Predicts Axillary Lymph Node Metastasis in Patients With Breast Cancer: A Multicenter Study. Acad. Radiol. 2023, 30, S133–S142. [Google Scholar] [CrossRef] [PubMed]
Shimokawa, D.; Takahashi, K.; Oba, K.; Takaya, E.; Usuzaki, T.; Kadowaki, M.; Kawaguchi, K.; Adachi, M.; Kaneno, T.; Fukuda, T.; et al. Deep learning model for predicting the presence of stromal invasion of breast cancer on digital breast tomosynthesis. Radiol. Phys. Technol. 2023, 16, 406–413. [Google Scholar] [CrossRef]
Tsai, H.Y.; Kao, Y.W.; Wang, J.C.; Tsai, T.Y.; Chung, W.S.; Hsu, J.S.; Hou, M.F.; Weng, S.F. Multitask deep learning on mammography to predict extensive intraductal component in invasive breast cancer. Eur. Radiol. 2024, 34, 2593–2604. [Google Scholar] [CrossRef]
He, S.; Deng, B.; Chen, J.; Li, J.; Wang, X.; Li, G.; Long, S.; Wan, J.; Zhang, Y. Preoperative DBT-based radiomics for predicting axillary lymph node metastasis in breast cancer: A multi-center study. BMC Med. Imaging 2025, 25, 169. [Google Scholar] [CrossRef] [PubMed]
Cheng, Y.; Xu, S.; Wang, H.; Wang, X.; Niu, S.; Luo, Y.; Zhao, N. Intra- and peri-tumoral radiomics for predicting the sentinel lymph node metastasis in breast cancer based on preoperative mammography and MRI. Front. Oncol. 2022, 12, 1047572. [Google Scholar] [CrossRef]
Haraguchi, T.; Goto, Y.; Furuya, Y.; Nagai, M.T.; Kanemaki, Y.; Tsugawa, K.; Kobayashi, Y. Use of machine learning with two-dimensional synthetic mammography for axillary lymph node metastasis prediction in breast cancer: A preliminary study. Transl. Cancer Res. 2023, 12, 1232–1240. [Google Scholar] [CrossRef] [PubMed]
Guo, F.; Sun, S.; Deng, X.; Wang, Y.; Yao, W.; Yue, P.; Wu, S.; Yan, J.; Zhang, X.; Zhang, Y. Predicting axillary lymph node metastasis in breast cancer using a multimodal radiomics and deep learning model. Front. Immunol. 2024, 15, 1482020. [Google Scholar] [CrossRef]
Cheng, M.Y.; Wu, C.G.; Lin, Y.Y.; Zou, J.C.; Wang, D.Q.; Haffty, B.G.; Wang, K. Development and validation of a multivariable risk model based on clinicopathological characteristics, mammography, and MRI imaging features for predicting axillary lymph node metastasis in patients with upgraded ductal carcinoma in situ. Gland Surg. 2025, 14, 738–753. [Google Scholar] [CrossRef] [PubMed]
Mao, N.; Shi, Y.; Lian, C.; Wang, Z.; Zhang, K.; Xie, H.; Zhang, H.; Chen, Q.; Cheng, G.; Xu, C.; et al. Intratumoral and peritumoral radiomics for preoperative prediction of neoadjuvant chemotherapy effect in breast cancer based on contrast-enhanced spectral mammography. Eur. Radiol. 2022, 32, 3207–3219. [Google Scholar] [CrossRef]
Xing, D.; Lv, Y.; Sun, B.; Chu, T.; Bao, Q.; Zhang, H. Develop and Validate a Nomogram Combining Contrast-Enhanced Spectral Mammography Deep Learning with Clinical-Pathological Features to Predict Neoadjuvant Chemotherapy Response in Patients with ER-Positive/HER2-Negative Breast Cancer. Acad. Radiol. 2024, 31, 3524–3534. [Google Scholar] [CrossRef]
Förnvik, D.; Borgquist, S.; Larsson, M.; Zackrisson, S.; Skarping, I. Deep learning analysis of serial digital breast tomosynthesis images in a prospective cohort of breast cancer patients who received neoadjuvant chemotherapy. Eur. J. Radiol. 2024, 178, 111624. [Google Scholar] [CrossRef]
Jiang, X.; Zou, X.; Sun, J.; Zheng, A.; Su, C. A Nomogram Based on Radiomics with Mammography Texture Analysis for the Prognostic Prediction in Patients with Triple-Negative Breast Cancer. Contrast Media Mol. Imaging 2020, 2020, 5418364. [Google Scholar] [CrossRef]
Mao, N.; Yin, P.; Zhang, H.; Zhang, K.; Song, X.; Xing, D.; Chu, T. Mammography-based radiomics for predicting the risk of breast cancer recurrence: A multicenter study. Br. J. Radiol. 2021, 94, 20210348. [Google Scholar] [CrossRef] [PubMed]
Ma, X.; Zhang, L.; Xiao, Q.; Huang, Y.; Lin, L.; Peng, W.; Gong, J.; Gu, Y. Predicting Prognosis of Phyllodes Tumors Using a Mammography- and Magnetic Resonance Imaging-Based Radiomics Model: A Preliminary Study. Clin. Breast Cancer 2024, 24, e571–e582.e571. [Google Scholar] [CrossRef] [PubMed]
Yu, H.; Meng, X.; Chen, H.; Liu, J.; Gao, W.; Du, L.; Chen, Y.; Wang, Y.; Liu, X.; Liu, B.; et al. Predicting the Level of Tumor-Infiltrating Lymphocytes in Patients With Breast Cancer: Usefulness of Mammographic Radiomics Features. Front. Oncol. 2021, 11, 628577. [Google Scholar] [CrossRef]
Wang, Z.; Lin, F.; Ma, H.; Shi, Y.; Dong, J.; Yang, P.; Zhang, K.; Guo, N.; Zhang, R.; Cui, J.; et al. Contrast-Enhanced Spectral Mammography-Based Radiomics Nomogram for the Prediction of Neoadjuvant Chemotherapy-Insensitive Breast Cancers. Front. Oncol. 2021, 11, 605230. [Google Scholar] [CrossRef]
Skarping, I.; Larsson, M.; Förnvik, D. Analysis of mammograms using artificial intelligence to predict response to neoadjuvant chemotherapy in breast cancer patients: Proof of concept. Eur. Radiol. 2022, 32, 3131–3141. [Google Scholar] [CrossRef] [PubMed]
Zhang, K.; Lin, J.; Lin, F.; Wang, Z.; Zhang, H.; Zhang, S.; Mao, N.; Qiao, G. Radiomics of contrast-enhanced spectral mammography for prediction of pathological complete response to neoadjuvant chemotherapy in breast cancer. J. X-Ray Sci. Technol. 2023, 31, 669–683. [Google Scholar] [CrossRef]
Han, J.; Hua, H.; Fei, J.; Liu, J.; Guo, Y.; Ma, W.; Chen, J. Prediction of Disease-Free Survival in Breast Cancer using Deep Learning with Ultrasound and Mammography: A Multicenter Study. Clin. Breast Cancer 2024, 24, 215–226. [Google Scholar] [CrossRef]
Cai, L.; Sidey-Gibbons, C.; Nees, J.; Riedel, F.; Schäfgen, B.; Togawa, R.; Killinger, K.; Heil, J.; Pfob, A.; Golatta, M. Can multi-modal radiomics using pretreatment ultrasound and tomosynthesis predict response to neoadjuvant systemic treatment in breast cancer? Eur. Radiol. 2024, 34, 2560–2573. [Google Scholar] [CrossRef]

Figure 1. Flow diagram showing inclusion and exclusion criteria for MM-based studies.

Figure 2. Flow diagram showing inclusion and exclusion criteria for BT-based studies.

Figure 3. Overview of AI-based pipelines for breast cancer characterization. Traditional radiomics relies on handcrafted features from segmented ROIs. Deep radiomics used masks or bounding boxes and extracts deep features from Convolutional Neural Networks (CNNs). End-to-end DL predicts outcomes directly from the input region without explicit feature extraction.

Figure 4. Distribution of studies by imaging modality (MM or BT) and clinical target. Studies were grouped into three main categories: tumor subtyping, tumor staging and treatment response/prognosis.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Mota, A.M. AI-Based Characterization of Breast Cancer in Mammography and Tomosynthesis: A Review of Radiomics and Deep Learning for Subtyping, Staging, and Prognosis. Cancers 2025, 17, 3387. https://doi.org/10.3390/cancers17203387

AMA Style

Mota AM. AI-Based Characterization of Breast Cancer in Mammography and Tomosynthesis: A Review of Radiomics and Deep Learning for Subtyping, Staging, and Prognosis. Cancers. 2025; 17(20):3387. https://doi.org/10.3390/cancers17203387

Chicago/Turabian Style

Mota, Ana M. 2025. "AI-Based Characterization of Breast Cancer in Mammography and Tomosynthesis: A Review of Radiomics and Deep Learning for Subtyping, Staging, and Prognosis" Cancers 17, no. 20: 3387. https://doi.org/10.3390/cancers17203387

APA Style

Mota, A. M. (2025). AI-Based Characterization of Breast Cancer in Mammography and Tomosynthesis: A Review of Radiomics and Deep Learning for Subtyping, Staging, and Prognosis. Cancers, 17(20), 3387. https://doi.org/10.3390/cancers17203387

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

AI-Based Characterization of Breast Cancer in Mammography and Tomosynthesis: A Review of Radiomics and Deep Learning for Subtyping, Staging, and Prognosis

Simple Summary

Abstract

1. Introduction

2. Methodology

3. AI-Based Pipelines Used in Breast Cancer Characterization

4. AI-Based Characterization of Breast Cancer from Mammography and Tomosynthesis

4.1. Tumor Subtyping

4.2. Tumor Staging

4.3. Treatment Response and Prognosis

5. Conclusions

Funding

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI