Integrating Primary and Metastatic scRNA–Seq and Bulk Data to Develop an Immune–Based Prognosis Signature for Colorectal Cancer

Xing, Kaiyuan; Li, Liangshuang; Ma, Yingnan; Zhu, Jiang

doi:10.3390/cimb47080652

Open AccessArticle

Integrating Primary and Metastatic scRNA–Seq and Bulk Data to Develop an Immune–Based Prognosis Signature for Colorectal Cancer

School of Medical Informatics, Daqing Campus, Harbin Medical University, Daqing 163319, China

^*

Author to whom correspondence should be addressed.

Curr. Issues Mol. Biol. 2025, 47(8), 652; https://doi.org/10.3390/cimb47080652

Submission received: 13 June 2025 / Revised: 3 August 2025 / Accepted: 8 August 2025 / Published: 13 August 2025

(This article belongs to the Section Bioinformatics and Systems Biology)

Download

Browse Figures

Versions Notes

Abstract

Colorectal cancer (CRC) is a highly aggressive cancer, with its treatment and prognosis particularly challenging due to metastasis. The immune response is involved in the whole process of CRC development, and immunotherapy has increasingly become a part of CRC patients’ treatment. However, comprehensive research on the immune microenvironment driving CRC metastasis remains limited. Given this limitation, we proposed a bioinformatics method to construct a metastasis–based immune prognostic model (MIPM) by integrating CRC single–cell RNA sequencing (scRNA–seq) and bulk data. Our study identified several MIPM genes significantly associated with CRC metastasis and progression. MIPM reliably predicted overall survival (OS) and tumor recurrence in CRC across eleven bulk validation datasets. Notably, MIPM could independently predict outcomes beyond traditional clinical factors such as age, sex, and stage. It showed high predictive accuracy in CRC patients treated with chemotherapy. Drug sensitivity and multifaceted immune analyses further underscored the importance of MIPM in therapeutic and immunotherapy response modulation. In conclusion, our findings have profound implications for the illustration of MIPM, which could serve as a new plausible prognostic marker for CRC patients and provide new insights for treatment strategies. The further evaluation and investigation of MIPM will enhance the prognosis and precision therapy of CRC patients.

Keywords:

colorectal cancer; scRNA–seq; metastasis; immune; prognosis; biomarkers

1. Introduction

Colorectal cancer (CRC), the third most prevalent malignancy globally, significantly impacts global health [1,2]. Metastasis is the primary cause of CRC–related mortality, with the liver (~50% of cases) being the predominant site, followed by lymph nodes and lungs [3,4,5]. Although treatment methods like surgery and chemotherapy have improved the survival outcome of CRC patients, metastatic CRC still carries a dismal prognosis, with a 5-year overall survival (OS) rate of just 14% [6,7]. This highlights the critical need for research focused on identifying metastasis-associated genes to enhance patient prognosis and facilitate the development of personalized treatment strategies.

The tumor microenvironment plays a crucial role in tumor biology [8,9]. Immunotherapy and related combination therapies have offered hope for patients with malignant tumors, especially for CRC [10,11,12]. Therefore, many studies have recently focused on developing immune–related prognostic biomarkers for CRC patients. For example, Li et al. constructed a CRC–specific immune–related gene prognostic index (IRGPI). They demonstrated the ability of IRGPI to predict patient outcomes in the different risk groups [13]. Based on the study of the expression of immune–related and inflammatory markers in CRC, An et al. found that APRIL/TNFSF13, BAFF, and MMP-3 were highly expressed in CRC, which may serve as diagnostic or prognostic markers for CRC [14]. An immune risk model was also developed by integrating immune cells and immune–related genes, which could effectively predict the prognosis of CRC patients undergoing chemotherapy [15]. These analyses have significantly advanced the field of immune–related prognostic studies.

Nevertheless, despite the promise of immunotherapy in CRC, its effectiveness remains limited by the heterogeneity of the tumor immune microenvironment (TIME), often resulting in drug resistance, recurrence, and poor prognosis. Meanwhile, while numerous factors influence the effectiveness of immunotherapy, there is lack of reliable markers to predict patient prognosis. Advancements in single–cell RNA sequencing (scRNA–seq) have greatly facilitated the identification of immune–related biomarkers across various cancers by integrating both scRNA–seq and bulk data. For instance, Zhang et al. developed a prognostic risk score based on the combination of scRNA–seq and bulk data from CRC patients. They found its strong correlation with immunotherapy sensitivity, which was conducive to developing more accurate treatment strategies for CRC patients [16]. Similarly, Liu et al. developed an immune prediction model that has significant prognostic value by integrating bulk and scRNA–seq data and they observed that it could precisely discriminate the immune status of patients [17]. Wang et al. integrated the scRNA–seq data with bulk data to construct a prognostic model, whose prognostic efficacy was validated in five independent cohorts [18]. However, the absence of robust immune and metastasis prognostic markers remains a major challenge in oncology. Constructing prognostic signatures related to immunity and metastasis is of paramount importance in the fight against cancer and other metastatic diseases. These signatures enable clinicians to predict disease progression more accurately and develop new immunotherapeutic strategies, ultimately leading to improved survival rates and better outcomes for patients.

In the study, we utilized CRC bulk and scRNA–seq data, including primary and matched metastases samples to develop a metastasis–based immune prognostic model (MIPM) that effectively predicts CRC prognosis. This MIPM’s robust predictive capabilities for OS, tumor recurrence, and chemotherapy benefits in CRC patients were validated in eleven datasets. Additionally, the MIPM’s associations with clinical characteristics and drug sensitivities were explored, and we also evaluated the immune checkpoint inhibitors (ICIs) therapy response in MIPM subgroups, which together established its potential as a significant prognostic biomarker. Our findings highlight MIPM as a crucial tool for advancing CRC metastasis and immune research, potentially guiding more precise treatment approaches for CRC patients.

2. Materials and Methods

2.1. CRC scRNA–Seq and Bulk Data Acquisition and Pre–Procession

Colorectal cancer (CRC) single–cell RNA sequencing (scRNA–seq) data from GSE178318, encompassing 113,331 cells, were downloaded from the Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/geo/ (accessed on 1 June 2022)), which contains primary and matched liver metastasis samples derived from six CRC patients (Table 1) [19,20]. Three of the six patients (specifically those identified as COL15, COL17, and COL18), received preoperative chemotherapy, while the others were treatment–naïve. To capture a broad range of cellular responses and gene expressions relevant to both treatment–naïve and treated CRC conditions, we aggregated and analyzed the scRNA–seq data from all six patients. The raw expression data with unique molecular identifier (UMI) counts were normalized and scaled using the R package “Seurat” (version 5.0.1), while low–quality cells and genes were filtered out as follows: (1) the exclusion of cells with <300 expressed genes or >15% mitochondrial gene content; (2) filtered out mitochondrial genes and ribosomal genes, and genes expressed in <3 cells; (3) retention of protein–coding genes. We stratified cell clusters and annotated them based on the known marker genes from Che et al. (Supplementary Table S1). According to the annotation results, the expression data of cancer cells were extracted for downstream processing.

The clinical and expression data of 1836 patients of 11 CRC bulk datasets were obtained from the GEO database, including GSE39582 [21], GSE159216 [22], GSE87211 [23], GSE29621 [24], GSE72968 [25], GSE72970 [25,26], GSE12945 [27], GSE17536 [28], GSE17537 [28], GSE17538 [28], GSE37892 [29] (Table 1). All selected bulk expression datasets were log2-transformed.

2.2. Identification of Gene Expression Signature from Primary and Metastatic Cells

To further distinguish primary and metastatic cancer cells, the signal–to–noise statistics for identifying feature genes that could be used to classify these two group cells were calculated. The signal–to–noise statistic for each gene was calculated by

S_{i} = (μ_{p} - μ_{m} / σ_{p} + σ_{m})

(1)

where

S_{i}

represents the signal–to–noise statistic for gene

i

,

μ_{p}

and

μ_{m}

separately represent the mean expression values of gene

i

in primary and metastatic cells, and

σ_{p}

and

σ_{m}

represent the standard deviation of gene

i

in primary and metastatic cells.

Genes were ranked by the signal–to–noise statistic, followed by weighted–voting classification and ‘leave–one–out’ cross–validation algorithm to determine the optimal gene set distinguishing primary and metastatic cells [30,31]. Specifically, given a dataset with N cells, each time leaving out one cell as the test set and using the remaining N−1 cells for training. First, calculate the

S_{i}

of each gene in the signature in the training set. Next, the weighted–voting classification algorithm found the decision boundaries between the primary and metastatic means for each gene:

b_{i} = (μ_{p} - μ_{m}) / 2

(2)

For a test sample

x

in cross–validation analysis, each gene

i

in the feature gene set casts a vote:

v_{i} = S_{i} (g_{i}^{x} - b_{i})

(3)

where

g_{i}^{x}

represents the expression value of gene

i

in sample

x

. And,

\sum v_{i}

is used for the final vote for primary or metastases.

Finally, the feature gene set with the highest cross–validation accuracy was selected as the most discriminating gene expression signature.

2.3. Screening High–Contribution Immune–Related Genes

A total of 2013 immune–related genes were downloaded from the ImmPort (https://www.immport.org/resources (accessed on 8 May 2024)) database [32]. CRC patients in GSE39582 were divided into high–immunity and low–immunity groups based on the median immune score which was calculated by a single sample gene set enrichment analysis (ssGSEA) method. Then, we used the random forest method to calculate the contribution of 2013 immune genes, and extracted genes with mean decrease Gini > 1 (refer to the analysis of Shen et al.) as high–contribution immune–related genes (HIRGs) [33].

2.4. Construction of Metastasis–Based Immune Prognostic Model (MIPM)

We intersected the most discriminating gene expression signature and HIRGs. Subsequently, in the GSE39582 dataset, we used univariate Cox regression analysis to identify prognosis–related genes (p < 0.05) significantly correlated with the survival among the intersected genes. These identified prognosis–related genes were used for the subsequent construction of prognostic model. Lasso regression is a type of penalized regression technique commonly used for variable selection and dimensionality reduction. It is based on the minimum mean cross–validated error criterion and determines the optimal value of the tuning parameter (

λ

) through ten–fold cross–validation. Then, based on the expression profiles of each sample, Lasso selected the prognosis–related genes that met the criteria and assigned a corresponding coefficient to each, which is then used to establish MIPM:

M I P M r i s k s c o r e = \sum_{i = 1}^{n} r_{i} E x p (i)

(4)

where

r_{i}

is the regression coefficient of gene

i

,

n

is the number of MIPM genes, and

E x p (i)

is the expression value of gene

i

in sample.

2.5. Functional Enrichment Analysis

Based on the canonical pathway gene set collections of the MSigDB database (https://www.gsea-msigdb.org/gsea/msigdb/ (accessed on 14 May 2024)), we conducted the enrichment analysis for intersection genes and MIPM genes utilizing the “clusterProfiler” package (version 4.10.0) [34].

2.6. Evaluation of MIPM in Validation Cohorts

The prognostic power of MIPM was evaluated in 11 validation cohorts. The cutoff points were defined by the “survminer” package (version 0.4.9) and the differences in overall survival (OS) for patients were compared using Kaplan–Meier (K–M) curves with the log–rank test [35]. Subsequently, the decision curve analysis (DCA), calibration curves, and receiver operating characteristic (ROC) curves were used to evaluate the predictive ability and application value of MIPM. Univariate and multivariate Cox regression analyses were used to assess independent prognostic factors. The comprehensive nomograms were built to predict the survival of CRC patients, and the calibration curves and ROC curves to validate the accuracy of the nomograms for predicting the OS of CRC patients at 3– and 5–year follow–up [36,37]. Additionally, we evaluated the ability of MIPM to predict chemotherapy benefits and tumor recurrence outcomes (disease–free survival, DFS) in validation datasets by prognostic analysis. We further compared the prognostic performance of MIPM with several previously published immune–related prognostic models (including IRGPI and models proposed by Xiao et al., Zheng et al., and Li et al.) [13,38,39,40]. In all 11 validation datasets, we calculated a risk score for each sample based on the respective model (with detailed formulas provided in Supplementary Table S2), and then compared the OS differences between risk groups and AUC values of the predictive efficacy.

2.7. Analysis of Clinical Features

In the GSE39582 dataset, we portrayed the differences in MIPM genes expression and clinical features (including age, sex, stage, chemotherapy, survival time, and survival event) between high– and low–risk groups. The ROC curves were applied to compare the accuracy of MIPM to other clinical variables (age, sex, stage) in predicting patients’ survival. Additionally, we also assessed differences in MIPM risk scores between different clinical variables and consensus molecular subtypes (CMSs), and the prognostic efficacy of MIPM in CMS subtypes.

2.8. Drug Sensitivity Analysis

Expression and drug sensitivity data (the half maximal inhibitory concentration (IC50)) for 44 CRC cell lines were downloaded from the Genomics of Drug Sensitivity in Cancer (GDSC) (https://www.cancerrxgene.org/ (accessed on 1 June 2024)) database [41]. We used the oncoPredict method to predict the sensitivities of patients in the high– and low–risk groups in the GSE39582 dataset to drugs [42]. We linked the predicted drug sensitivities to enriched pathways to support the prediction conclusion. Spearman correlation analysis was applied to further examine the associations between MIPM risk scores and drug sensitivity to 286 compounds.

2.9. Comprehensive Immune-Related Analysis

Using CIBERSORT, we quantified 22 immune cell signatures in GSE39582 and compared their infiltration levels between high– and low–risk groups [43,44]. We utilized the tumor immune dysfunction and exclusion (TIDE) method to assess the immunotherapy response, and described the differences in TIDE scores among different MIPM subgroups [45]. We further assessed the correlations between risk scores and immune–related characteristics. Finally, we employed the immunophenoscore (IPS) to quantify patient response to immune checkpoint therapy (PD1 and PD–L1), comparing the differences of IPS between risk groups [46,47].

2.10. Statistical Analysis

All the statistical analyses were performed in R software (version 4.3.1). The Wilcoxon test was used to compare the differences in values between the test and control groups. A difference of p < 0.05 was regarded as statistically significant.

3. Results

3.1. Single–Cell RNA Sequencing (scRNA–Seq) Data Acquisition and Processing

The largest scRNA–seq dataset of GSE178318 was downloaded for the construction of the metastasis–based immune prognostic model (MIPM) for the criterion containing both primary and matched liver metastasis samples. The flowchart of this study is shown in Figure 1. After quality control (QC), as described in the Methods section, a total of 92,675 single cells (48,865 cells in the liver metastases and 43,810 cells in the primary tumor) from different patients with 16,499 protein-coding genes were obtained, which were shown in Figure 2A. T–distributed stochastic neighbor embedding (t–SNE) was used for nonlinear dimension reduction, as shown in Figure 2B. Then, these single cells were stratified into 21 clusters, and we used marker genes to annotate the main cell populations, including T cells, epithelial cells, plasma cells, myeloid cells, CAFs, mast cells, pDCs, endothelial cells, B cells, and NK cells (Figure 2C–E). Next, the inferCNV method was utilized to identify 7587 cancer cells from all epithelial cells, and the expression profiles of these cancer cells were extracted for further analysis (Figure 2F).

3.2. Establishment of Metastasis–Based Immune Prognostic Model (MIPM)

By measuring the classified ability of feature gene sets in the scRNA–seq dataset, we found a gene expression signature containing 4000 genes that had the maximum prediction accuracy (84.41%) for distinguishing primary cells from metastatic ones, and the area under the ROC curve (AUC) for evaluating the predictive accuracy of the 4000 genes reached the maximum value of 0.9388, outperforming other gene sets (Figure 3A). To further validate the robustness of the signature’s classification performance, we applied the signature to an independent scRNA–seq dataset (GSE221575). The signature consistently discriminated between primary and metastatic cells (AUC = 0.8611, accuracy = 0.7924), highlighting its stability and cross-dataset applicability (Figure 3B). In addition, we also performed cross–validation at the patient level to avoid patient–specific bias. Specifically, we used the cells from one patient as the test set and the cells from the remaining patients as the training set to predict the types of cells from that held–out patient. The results consistently demonstrated the strong classification power of the signature. For example, when using patient COL12/COL15 as the test set, it achieved an accuracy of 0.8182/0.7929 and an AUC value of 0.9227/0.9051 (Figure 3C). Given the significant relationship between the immune landscape of cancer and the prognosis of the patient, we employed enrichment scores derived from 2013 immune–related genes to stratify CRC patients into high–immunity and low–immunity groups, and selected 55 high–contribution immune–related genes (HIRGs) with a mean decrease in Gini > 1 by the random forest method (Figure 3D). Subsequently, 22 intersection genes between metastases–associated gene signature and HIRGs were obtained (Figure 3E). Functional enrichment analysis revealed that these intersection genes were mainly enriched in canonical pathways such as “signaling by interleukins”, “cytokine–cytokine receptor interaction”, “chemokine receptors bind chemokines”, “cell interactions of the pancreatic cancer microenvironment”, and “selective expression of chemokine receptors during T cell polarization” (Figure 3F).

Subsequently, we used univariate the Cox regression analysis method to identify six genes that were significantly correlated with CRC patients’ overall survival (OS) from intersection genes (Figure 3G,H), and by Lasso regression analysis, six genes were selected to establish MIPM using the expression of genes weighted by the Lasso regression coefficient as follows: MIPM risk score = (0.077 × exp(C5AR1)) + (−0.141 × exp(CCR7)) + (−0.359 × exp(ICOS)) + (−0.191 × exp(IL2RB)) + (0.340 × exp(NRP1)) + (0.191 × exp(VIM)).

The results of the functional enrichment analysis of MIPM genes indicated that MIPM was mainly involved in the pathways “cytokine–cytokine receptor interaction”, “GPCR ligand binding”, “signaling by Interleukins”, “caspase–mediated cleavage of cytoskeletal proteins”, etc. (Figure 3I).

Notably, we found that six MIPM genes were potentially involved in immune regulation, metastasis, and the progression of CRC. Among them, C5AR1 was found to be able to predict poor survival in CRC patients, and it played a prominent role in tumorigenesis and the development of CRC by modulating the immune response [48]. A previous study using an animal model of CRC demonstrated that CCR7 expression was associated with lymph node metastasis [49]. Zhang et al. found a significant negative correlation between ICOS expression and enhanced survival of CRC patients, especially in the case of tumor metastasis, suggesting that ICOS may be a useful predictor of progression in CRC patients [50]. Additionally, IL2RB was discovered as the most common gene associated with immune checkpoint genes in CRC, and the potential predictive value of IL2RB for immune checkpoint therapy response was investigated [51]. A much higher expression level of NRP1 was discovered in metastatic CRC tumors than in primary, and the knockdown of NRP1 also has strong inhibitory effects on the metastasis of CRC cells [52,53]. VIM was known as a potential cancer therapeutic target, and reports found that VIM could promote the invasion and metastasis of CRC cells by binding to overexpressed FSTL1 [54]. These findings further demonstrated that the MIPM was closely related to CRC.

3.3. Associations of MIPM and Clinical Characteristics in Internal Validation Datasets

When the MIPM was applied to the internal validation dataset (GSE39582) to assess the prognostic value of MIPM in terms of OS, 556 patients were divided into high–risk and low–risk groups using the cutoff value (−0.17), and patients in the high–risk group had significantly shorter OS time than those in the low–risk group (p < 0.0001) (Figure 4A–C). The results of the receiver operating characteristic (ROC) curves showed that MIPM had the highest accuracy in predicting patients’ survival compared to other clinical variables (age, sex, stage) (Figure 4D). There were significant differences in MIPM genes expression and various clinical features between high– and low–risk groups (p < 0.05) (Figure 4E). Different MIPM risk score distributions were found in stage and sex subgroups, but the differences were not significant in age and chemotherapy subgroups (Figure 4F). Previous studies have indicated that, among four consensus molecular subtypes (CMS,) for CRC, the CMS4 subtype has the worst OS [55]. So, we evaluated the MIPM risk scores in the context of CMS status, we found that high–risk patients accounted for the highest proportion in the CMS4 subtype and the MIPM risk scores were significantly higher in the CMS4 subtype than in other subtypes (p < 2 × 10⁻¹⁶) (Figure 4G). To evaluate whether MIPM provided additional prognostic value beyond known CMS subtypes, we performed CMS–stratified survival analyses. The MIPM exhibited significant prognostic capacity in most subtypes, indicating its robustness independent of CMS classification (Figure 4H). These results illustrated that MIPM might play an essential role in the prognostic prediction of CRC patients and there were strong relationships between it and clinical characteristics.

3.4. MIPM’s Prognostic Power Across Multiple External Validation Datasets

To evaluate the robustness of MIPM, we tested its prognostic power using ten external validation datasets, including GSE159216, GSE87211, GSE29621, GSE72968, GSE72970, GSE12945, GSE17536, GSE17537, GSE17538, and GSE37892. We found that MIPM could effectively categorize patients into high− and low−risk groups with significantly different OS in all datasets, and patients with the high−risk group had worse OS outcomes (Figure 5A and Table 2). The MIPM demonstrated good and consistent prognostic performance across the validation cohorts, as indicated by AUC values in ROC analysis, great concordance between the predicted and observed outcomes in calibration curves, and substantial net clinical benefit in the decision curve analysis (DCA), highlighting its potential clinical applicability in prognostic risk stratification (Figure 5B,C). Subsequently, we also confirmed the effectiveness of the MIPM for recurrence risk prediction in seven validation datasets containing information on tumor recurrence outcomes (DFS) (Figure 5D). Similarly, MIPM maintained consistent predictive accuracy to DFS and a practical clinical benefit across validation datasets, supported by AUC, calibration, and DCA analyses (Figure 5E,F). Taken together, these findings suggested that MIPM had important clinical implications and might serve as an important biomarker for predicting survival in CRC patients.

3.5. MIPM Predicts Chemotherapy Benefits

Chemotherapy significantly prolonged the OS time in CRC patients and remains a primary therapeutic strategy [56]. Therefore, we evaluated the predictive power of MIPM for CRC patients with chemotherapy in six validation datasets (Table 1). The results showed that MIPM successfully classified CRC patients receiving chemotherapy into high– and low–risk groups with markedly different OS, demonstrating that MIPM had good ability to predict OS in CRC patients who received chemotherapy (Figure 6A). In addition, ROC analysis, calibration curves, and DCA provided evidence supporting the predictive accuracy of MIPM for chemotherapy benefit and its potential value in clinical practice (Figure 6B,C). We further confirmed this finding through response stratification analyses. Chemotherapy significantly improved patients’ survival, and those who received chemotherapy exhibited a higher survival rate, indicating a substantial clinical benefit from chemotherapy (Figure 6D). Furthermore, the ROC curves revealed that MIPM’s prognostic performance remains stable regardless of chemotherapy, suggesting that MIPM can reliably predict patient risk inde-pendent of treatment status Figure 6E).

3.6. Independent Prognostic Factor Evaluation and Nomogram Construction

To investigate the independence of MIPM from other clinical factors, such as age, sex, and stage, univariate and multivariate Cox regression analyses were performed in three validation cohorts (GSE39582, GSE17536, and GSE17538) that included the aforementioned clinical variables (Table 3). The results from the GSE39582 cohort showed that MIPM, age, and stage were independent prognostic factors for CRC patients (p < 0.001). In GSE17536, MIPM and stage were associated with the OS of CRC patients (p < 0.001). Likewise, MIPM and stage were two independent prognostic factors for CRC patients in GSE17538 (p < 0.001). These results confirmed a strong association between MIPM and OS in CRC patients.

Then, we designed a comprehensive nomogram based on the above variables to predict the 3– and 5–year OS of CRC patients (Figure 7A). The calibration and ROC curves were used to evaluate the predictive accuracy of the nomogram. The calibration curves displayed good agreement between predicted and actual OS, and in GSE39582, GSE17538, and GSE17536 datasets, the AUC values of 3– and 5–year were 0.662 and 0.686, 0.737 and 0.748, and 0.757 and 0.768, respectively (Figure 7B,C), which indicated that nomogram models had a good performance for OS prediction.

3.7. MIPM’s Influence on Drug Sensitivity and Resistance in CRC

To enhance the clinical utility of MIPM, we performed drug sensitivity prediction and correlation analysis. The results of drug sensitivity analysis predicted by the oncoPredict method in the GSE39582 datasets showed that significantly higher drug sensitivity in high–risk patients to several drugs, including Bortezomib, Dabrafenib, Talazoparib, and Teniposide (Figure 8A). To explore the mechanisms underlying increased drug sensitivity in the high–risk group, we identified pathways specifically enriched in this subgroup by gene set variation analysis (GSVA). A total of fourty–one KEGG pathways were significantly enriched in high–risk patients (FDR < 0.05), including MAPK, TGF–

β

, and mTOR signaling, ECM–receptor interaction, focal adhesion, regulation of autophagy, and multiple cancer–related pathways (Supplementary Table S3). Notably, several pathways were closely linked to the predicted sensitivities to Bortezomib, Dabrafenib, Talazoparib, and Teniposide. Specifically, since Bortezomib modulates NF–

κ

B and induces ER stress and apoptosis, there is extensive interactive regulation between NF–κB and TGF–β signaling pathway [57]. Dabrafenib targets BRAF, a key component of the MAPK signaling pathway, which was markedly activated in the high–risk group [58,59]. Talazoparib, a poly (ADP–ribose) polymerase (PARP) inhibitor, exploits synthetic lethality in tumors with impaired DNA repair [60]. The enrichment of autophagy regulation and cell–cell junction pathways (e.g., tight junction, adherens junction) suggested possible epithelial stress and genomic instability, which may sensitize cells to PARP inhibition. Teniposide, a topoisomerase II inhibitor, induces DNA double–strand breaks [61]. Its efficacy may be enhanced in tumors with active MAPK, mTOR, or ECM–related signaling, all enriched in the high–risk group. The correlation analysis results indicated that higher MIPM risk scores could increase the sensitivity of CRC cell lines to Dactolisib, Trametinib, Ulixertinib, and so on, but led to increased resistance to several compounds, such as Veliparib, Motesanib, and Erlotinib (p < 0.05), which may provide some assistance in the treatment of CRC patients (Figure 8B). In summary, this part might identify promising therapeutic candidates for CRC treatment.

3.8. Immune Characteristics Between the High– and Low–Risk Groups

The tumor microenvironment (TME) plays a fundamental role in tumor progression and metastasis [62,63]. First, we found that high–risk patients showed higher immunocyte infiltration degrees in M0 macrophages, M2 macrophages, activated mast cells, and neutrophils; in comparison, the low–risk group showed higher infiltration degrees in activated memory CD4 T cells, regulatory T cells, CD8 T cells, and resting NK cells, etc., illustrating that MIPM might partly reflect TME status (Figure 8C,D).

Subsequently, we evaluated the immune checkpoint inhibitors (ICIs) therapy response of different MIPM subgroups using the tumor immune dysfunction and exclusion (TIDE) approach. An elevated TIDE score represented an increased likelihood of immune escape and was associated with poorer immunotherapy efficacy [45]. Compared with the low–risk group, TIDE scores and T cell exclusion scores in the high–risk group were significantly increased, while T cell dysfunction scores and microsatellite instability (MSI) scores were decreased, indicating greater immune escape potential of high–risk patients, and the fact that they were less likely to benefit from ICI therapy (Figure 8E). We also found that patients with lower TIDE scores consistently had a worse prognosis among four patient groups stratified by MIPM risk scores and TIDE scores, patients with high–risk and low–TIDE really tended to have the highest risk for death (Figure 8F,G).

The expression of immune checkpoints has been recognized as a biomarker for the selection of CRC patients for immunotherapy [64,65]. So, we evaluated the relationships between MIPM risk scores and immune–related features and the expression of key immune checkpoints in sequence (Figure 8H). Based on the results of ESTIMATE method, we found that MIPM risk scores were correlated with stromal scores, immune scores, estimate scores, and tumor purity. Meanwhile, we identified high associations between MIPM risk scores and the expression of three important immune checkpoints (PD–1, PD–L1, and CTLA–4). We also investigated the differences in the expression of immune checkpoints between the high– and low–risk groups, and the results showed that the expression of the three immune checkpoints in the high–risk group was significantly higher than that in the low–risk group, suggesting poor prognosis in the high–risk group due in part to the immunosuppressive environment (Figure 8I).

The individual PD1 and PD–L1 responses could be quantified using the immunophenoscore (IPS), which consists of four different scores (molecules (MHCs), effector cells (ECs), suppressor cells (SCs), and immune checkpoints (CPs)) [47,66]. The results illustrated that there were significant differences in all four scores between MIPM subgroups (Figure 8J). Among them, in the high–risk group, MHC IPS, EC IPS, and SC IPS were significantly lower, but CP IPS was significantly higher.

Consistent with our results, numerous previous studies have also confirmed the involvement of MIPM genes in regulating the tumor immune microenvironment or therapy resistance. C5AR1, a pleiotropic regulator, promoted CRC initiation by fostering a tumor-supportive immune response and might serve as a potential preventive target [48]. CCR7 was linked to immune dysregulation and tumorigenesis, with studies highlighting its key role in coordinating immune cell trafficking [67,68]. ICOS exerted dual roles in tumor immunity by enhancing cytotoxic T cell activity while also promoting Treg–mediated immunosuppression, thus contributing to both anti– and pro–tumor effects [69,70]. IL2RB, essential for T cell proliferation and survival, was closely associated with immune checkpoint signaling in CRC. IL2RB–positive immune cells were often linked to immune suppression and T cell exhaustion [51,71]. NRP1, highly expressed on Tregs, supported their suppressive function and contributed to immune evasion and therapy resistance by sustaining immunosuppressive signaling and dampening effective T cell responses [72,73]. These results indirectly suggested that MIPM might play a crucial role in predicting immunotherapy outcomes.

4. Discussion

Colorectal cancer (CRC) exhibits high heterogeneity, leading to substantial variability in patient survival outcomes [74,75]. A major challenge in treating CRC is metastasis, particularly to the liver, which affects approximately 35–55% of CRC patients and complicates effective treatment strategies [76]. Concurrently, the intricate interplay between the immune system and cancer metastasis adds further complexity to CRC progression. Numerous genetic markers, including mutations in KRAS, BRAF, and TP53, have been identified as prognostic indicators. For instance, studies by Lievre A. et al. and Lee et al. had highlighted the prognostic value of these genetic alterations in CRC [77,78]. However, these markers fail to capture the full complexity of CRC, as they overlook the immune landscape and metastatic potential. This limitation underscores the urgent need to integrate prognostic signatures that include both immunological and metastatic factors. To address this limitation, we employed a bioinformatics methodology to construct the metastasis-based immune prognostic model (MIPM) by integrating CRC single–cell RNA sequencing (scRNA–seq) and bulk data. By leveraging both bulk sequencing for extensive phenotypic and survival data, and scRNA–seq to address cellular heterogeneity, our analysis synergized the strengths of these techniques [79,80].

To develop MIPM, we employed various computational approaches in scRNA–seq data, including signal–to–noise statistics for gene expression comparison and supervised learning to distinguish between primary and metastatic cells. This approach led to the identification of a gene expression signature comprising 4000 genes, which could well distinguish primary and metastatic cells. Further refinement, focusing on high–contribution immune–related genes selected by random forest algorithm, combined with the Cox and Lasso regression analysis to screen prognostic genes, resulted in a concise MIPM with six genes. Our validation confirmed the associations between MIPM and various clinical characteristics, revealing significant differences in MIPM risk scores across subgroups with different clinical features. And, to test the robustness of MIPM, our verification across multiple bulk datasets, we found the effectiveness of MIPM in predicting overall survival (OS) and its significant implications for disease–free survival (DFS) outcomes of patients. Notably, compared to other existing immune–related models, MIPM demonstrated a more robust and superior predictive performance in eleven independent validation datasets (Figure 8K). The efficacy of MIPM extended to predicting chemotherapy benefits, as evidenced in validation datasets. Among these, we highlighted findings not previously discussed, and MIPM demonstrated consistent predictive power in the chemotherapy-naïve subgroup, supporting its robustness independent of treatment status (Figure 8L). It showed a strong correlation with drug sensitivity, and some potential candidate drugs for CRC have also been identified through analysis. Multiple immunoassays revealed the powerful ability of MIPM to predict immunotherapy response. This evidence positioned MIPM as a promising prognostic marker, potentially guiding future CRC metastasis, treatment, and prognosis research.

However, our study still has several limitations. First, the key prognostic features identified in MIPM require further experimental validation to clarify their underlying biological mechanisms, including comprehensive functional characterization through both in vitro and in vivo studies. Second, although OncoPredict enables transcriptome–based drug sensitivity prediction, its extrapolation from GDSC cell lines to clinical CRC samples is constrained by biological differences, including the lack of tumor microenvironment and immune context. These limitations may affect the generalizability of our results and highlight the need for in vivo or patient–derived model validation. Furthermore, CIBERSORT and TIDE analyses provide valuable insights into the immune landscape, but they have limitations in CRC. CRC exhibits pronounced inter– and intra–tumoral heterogeneity. CIBERSORT depends on the predefined signatures of twenty–two immune cell types, which may inadequately capture the complex composition or plasticity of the immune microenvironment in CRC, especially within the microsatellite instability–high (MSI–H) or consensus molecular subtype subgroups. Similarly, TIDE simplifies immune evasion into a few gene markers (e.g., PD–1, CTLA–4), and its model is mainly trained on melanoma and non-small-cell lung cancer (NSCLC) data, potentially limiting its applicability to CRC due to distinct tumor biology and immune contexts. Currently, publicly available CRC cohorts with both transcriptomic data and clinical response to ICIs are very limited. Most ICI–treated cohorts focus on melanoma, lung, or bladder cancer. Only a few small CRC datasets exist, often with small sample sizes or lacking raw expression data. We acknowledge this as a limitation of our study. To address this, we plan to validate our findings using prospective CRC–ICI cohorts or multi-omics integration in the future.

5. Conclusions

In summary, through our bioinformatics approach, integrating single–cell RNA sequencing (scRNA–seq) and bulk data, we constructed metastasis–based immune prognostic model (MIPM). Our comprehensive evaluation highlighted MIPM’s association with metastasis and its predictive accuracy for CRC patient survival. Its integration with clinical variables improved risk prediction accuracy, laying a foundation for further investigation toward clinical relevance. In addition, MIPM integrated genes involved in both immune regulation and metastasis, providing new insights into the interplay between immune microenvironment and tumor dissemination in CRC. This could contribute to a deeper understanding of the molecular mechanisms underlying disease progression and therapeutic resistance. MIPM emerges as a promising prognostic tool, offering new avenues for understanding CRC metastasis and enhancing clinical research.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/cimb47080652/s1.

Author Contributions

Formal analysis, K.X.; Writing–original draft, K.X.; Funding acquisition, J.Z.; Project administration, J.Z.; Visualization, Y.M.; Conceptualization, L.L.; Data curation, L.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China [62301194]; University Nursing Program for Young Scholars with Creative Talents in Heilongjiang Province [UNPYSCT–2020164]; Fundamental Research Funds for the Provincial Universities of Harbin Medical University-Daqing [JFQN202303]; Key Discipline Construction Project of Harbin Medical University-Daqing [HD–ZDXK–202004]; Construction Project of Scientific Research and Innovation Team of Harbin Medical University-Daqing [HD–CXTD–202002].

Data Availability Statement

The data used in this study are openly available in GEO (https://www.ncbi.nlm.nih.gov/geo/browse/?view=series, accessed on 13 June 2025), ImmPort (https://www.immport.org/resources, accessed on 13 June 2025), and GDSC (https://www.cancerrxgene.org/, accessed on 13 June 2025) databases.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

CRC	Colorectal cancer
MIPM	Metastasis–based immune prognostic model
scRNA–seq	Single–cell RNA sequencing
OS	Overall survival
IRGPI	Immune–related gene prognostic index
TIME	Tumor immune microenvironment
ICIs	Immune checkpoint inhibitors
GEO	Gene Expression Omnibus
UMI	Unique molecular identifier
ssGSEA	Single-sample gene set enrichment analysis
HIRG	High–contribution immune-related genes
K-M	Kaplan–Meier
DCA	Decision curve analysis
ROC	Receiver operating characteristic
DFS	Disease–free survival
IC50	Half maximal inhibitory concentration
GDSC	Genomics of Drug Sensitivity in Cancer
TIDE	Tumor immune dysfunction and exclusion
IPS	Immunophenoscore
QC	Quality control
t-SNE	T–distributed stochastic neighbor embedding
AUC	Area under the ROC
CMS	Consensus molecular subtypes
GSVA	Gene set variation analysis
PDOX	Patient-derived organoid–based xenograft
PARP	Poly (ADP–ribose) polymerase
TME	Tumor microenvironment
MSI	Microsatellite instability
MHC	MHC molecules
EC	Effector cells
SC	Suppressor cells
CP	Immune checkpoints
NSCLC	Non-small-cell lung cancer

References

Siegel, R.L.; Wagle, N.S.; Cercek, A.; Smith, R.A.; Jemal, A. Colorectal cancer statistics, 2023. CA Cancer J. Clin. 2023, 73, 233–254. [Google Scholar] [CrossRef] [PubMed]
Siegel, R.L.; Miller, K.D.; Wagle, N.S.; Jemal, A. Cancer statistics, 2023. CA Cancer J. Clin. 2023, 73, 17–48. [Google Scholar] [CrossRef] [PubMed]
Bhullar, D.S.; Barriuso, J.; Mullamitha, S.; Saunders, M.P.; O’Dwyer, S.T.; Aziz, O. Biomarker concordance between primary colorectal cancer and its metastases. EBioMedicine 2019, 40, 363–374. [Google Scholar] [CrossRef] [PubMed]
Xia, J.; Ma, N.; Shi, Q.; Liu, Q.C.; Zhang, W.; Cao, H.J.; Wang, Y.K.; Zheng, Q.W.; Ni, Q.Z.; Xu, S.; et al. XAF1 promotes colorectal cancer metastasis via VCP-RNF114-JUP axis. J. Cell Biol. 2024, 223, e202303015. [Google Scholar] [CrossRef]
Cañellas-Socias, A.; Cortina, C.; Hernando-Momblona, X.; Palomo-Ponce, S.; Mulholland, E.J.; Turon, G.; Mateo, L.; Conti, S.; Roman, O.; Sevillano, M.; et al. Metastatic recurrence in colorectal cancer arises from residual EMP1(+) cells. Nature 2022, 611, 603–613. [Google Scholar] [CrossRef]
Huang, X.; Zhu, X.; Yu, Y.; Zhu, W.; Jin, L.; Zhang, X.; Li, S.; Zou, P.; Xie, C.; Cui, R. Dissecting miRNA signature in colorectal cancer progression and metastasis. Cancer Lett. 2021, 501, 66–82. [Google Scholar] [CrossRef]
Shin, A.E.; Giancotti, F.G.; Rustgi, A.K. Metastatic colorectal cancer: Mechanisms and emerging therapeutics. Trends Pharmacol. Sci. 2023, 44, 222–236. [Google Scholar] [CrossRef]
Hanahan, D.; Weinberg, R.A. Hallmarks of cancer: The next generation. Cell 2011, 144, 646–674. [Google Scholar] [CrossRef]
Xue, R.; Zhang, Q.; Cao, Q.; Kong, R.; Xiang, X.; Liu, H.; Feng, M.; Wang, F.; Cheng, J.; Li, Z.; et al. Liver tumour immune microenvironment subtypes and neutrophil heterogeneity. Nature 2022, 612, 141–147. [Google Scholar] [CrossRef]
Li, J.; Wu, C.; Hu, H.; Qin, G.; Wu, X.; Bai, F.; Zhang, J.; Cai, Y.; Huang, Y.; Wang, C.; et al. Remodeling of the immune and stromal cell compartment by PD-1 blockade in mismatch repair-deficient colorectal cancer. Cancer Cell 2023, 41, 1152–1169.e7. [Google Scholar] [CrossRef]
Chalabi, M.; Fanchi, L.F.; Dijkstra, K.K.; Van den Berg, J.G.; Aalbers, A.G.; Sikorska, K.; Lopez-Yurda, M.; Grootscholten, C.; Beets, G.L.; Snaebjornsson, P.; et al. Neoadjuvant immunotherapy leads to pathological responses in MMR-proficient and MMR-deficient early-stage colon cancers. Nat. Med. 2020, 26, 566–576. [Google Scholar] [CrossRef]
Ganesh, K.; Stadler, Z.K.; Cercek, A.; Mendelsohn, R.B.; Shia, J.; Segal, N.H.; Diaz, L.A., Jr. Immunotherapy in colorectal cancer: Rationale, challenges and potential. Nat. Rev. Gastroenterol. Hepatol. 2019, 16, 361–375. [Google Scholar] [CrossRef]
Li, C.; Wirth, U.; Schardey, J.; Ehrlich-Treuenstätt, V.V.; Bazhin, A.V.; Werner, J.; Kühn, F. An immune-related gene prognostic index for predicting prognosis in patients with colorectal cancer. Front. Immunol. 2023, 14, 1156488. [Google Scholar] [CrossRef] [PubMed]
An, S.; Kim, S.K.; Kwon, H.Y.; Kim, C.S.; Bang, H.J.; Do, H.; Kim, B.; Kim, K.; Kim, Y. Expression of Immune-Related and Inflammatory Markers and Their Prognostic Impact in Colorectal Cancer Patients. Int. J. Mol. Sci. 2023, 24, 11579. [Google Scholar] [CrossRef] [PubMed]
Mo, X.; Huang, X.; Feng, Y.; Wei, C.; Liu, H.; Ru, H.; Qin, H.; Lai, H.; Wu, G.; Xie, W.; et al. Immune infiltration and immune gene signature predict the response to fluoropyrimidine-based chemotherapy in colorectal cancer patients. Oncoimmunology 2020, 9, 1832347. [Google Scholar] [CrossRef]
Zhang, X.; Yang, L.; Deng, Y.; Huang, Z.; Huang, H.; Wu, Y.; He, B.; Hu, F. Single-cell RNA-Seq and bulk RNA-Seq reveal reliable diagnostic and prognostic biomarkers for CRC. J. Cancer Res. Clin. Oncol. 2023, 149, 9805–9821. [Google Scholar] [CrossRef] [PubMed]
Liu, W.; Luo, X.; Zhang, Z.; Chen, Y.; Dai, Y.; Deng, J.; Yang, C.; Liu, H. Construction of an immune predictive model and identification of TRIP6 as a prognostic marker and therapeutic target of CRC by integration of single-cell and bulk RNA-seq data. Cancer Immunol. Immunother. 2024, 73, 69. [Google Scholar] [CrossRef]
Wang, Q.; Zhang, Y.F.; Li, C.L.; Wang, Y.; Wu, L.; Wang, X.R.; Huang, T.; Liu, G.L.; Chen, X.; Yu, Q.; et al. Integrating scRNA-seq and bulk RNA-seq to characterize infiltrating cells in the colorectal cancer tumor microenvironment and construct molecular risk models. Aging 2023, 15, 13799–13821. [Google Scholar] [CrossRef]
Che, L.H.; Liu, J.W.; Huo, J.P.; Luo, R.; Xu, R.M.; He, C.; Li, Y.Q.; Zhou, A.J.; Huang, P.; Chen, Y.Y.; et al. A single-cell atlas of liver metastases of colorectal cancer reveals reprogramming of the tumor microenvironment in response to preoperative chemotherapy. Cell Discov. 2021, 7, 80. [Google Scholar] [CrossRef]
Edgar, R.; Domrachev, M.; Lash, A.E. Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002, 30, 207–210. [Google Scholar] [CrossRef]
Marisa, L.; de Reyniès, A.; Duval, A.; Selves, J.; Gaub, M.P.; Vescovo, L.; Etienne-Grimaldi, M.C.; Schiappa, R.; Guenot, D.; Ayadi, M.; et al. Gene expression classification of colon cancer into molecular subtypes: Characterization, validation, and prognostic value. PLoS Med. 2013, 10, e1001453. [Google Scholar] [CrossRef]
Eide, P.W.; Moosavi, S.H.; Eilertsen, I.A.; Brunsell, T.H.; Langerud, J.; Berg, K.C.G.; Røsok, B.I.; Bjørnbeth, B.A.; Nesbakken, A.; Lothe, R.A.; et al. Metastatic heterogeneity of the consensus molecular subtypes of colorectal cancer. NPJ Genom. Med. 2021, 6, 59. [Google Scholar] [CrossRef]
Hu, Y.; Gaedcke, J.; Emons, G.; Beissbarth, T.; Grade, M.; Jo, P.; Yeager, M.; Chanock, S.J.; Wolff, H.; Camps, J.; et al. Colorectal cancer susceptibility loci as predictive markers of rectal cancer prognosis after surgery. Genes. Chromosomes Cancer 2018, 57, 140–149. [Google Scholar] [CrossRef]
Chen, D.T.; Hernandez, J.M.; Shibata, D.; McCarthy, S.M.; Humphries, L.A.; Clark, W.; Elahi, A.; Gruidl, M.; Coppola, D.; Yeatman, T. Complementary strand microRNAs mediate acquisition of metastatic potential in colonic adenocarcinoma. J. Gastrointest. Surg. 2012, 16, 905–912, discussion 912–903. [Google Scholar] [CrossRef] [PubMed]
Del Rio, M.; Mollevi, C.; Bibeau, F.; Vie, N.; Selves, J.; Emile, J.F.; Roger, P.; Gongora, C.; Robert, J.; Tubiana-Mathieu, N.; et al. Molecular subtypes of metastatic colorectal cancer are associated with patient response to irinotecan-based therapies. Eur. J. Cancer 2017, 76, 68–75. [Google Scholar] [CrossRef] [PubMed]
Cherradi, S.; Ayrolles-Torro, A.; Vezzo-Vi, N.; Gueguinou, N.; Denis, V.; Combes, E.; Boissière, F.; Busson, M.; Canterel-Thouennon, L.; Mollevi, C.; et al. Antibody targeting of claudin-1 as a potential colorectal cancer therapy. J. Exp. Clin. Cancer Res. 2017, 36, 89. [Google Scholar] [CrossRef]
Staub, E.; Groene, J.; Heinze, M.; Mennerich, D.; Roepcke, S.; Klaman, I.; Hinzmann, B.; Castanos-Velez, E.; Pilarsky, C.; Mann, B.; et al. An expression module of WIPF1-coexpressed genes identifies patients with favorable prognosis in three tumor types. J Mol Med. 2009, 87, 633–644. [Google Scholar] [CrossRef] [PubMed]
Smith, J.J.; Deane, N.G.; Wu, F.; Merchant, N.B.; Zhang, B.; Jiang, A.; Lu, P.; Johnson, J.C.; Schmidt, C.; Bailey, C.E.; et al. Experimentally derived metastasis gene expression profile predicts recurrence and death in patients with colon cancer. Gastroenterology 2010, 138, 958–968. [Google Scholar] [CrossRef]
Laibe, S.; Lagarde, A.; Ferrari, A.; Monges, G.; Birnbaum, D.; Olschwang, S. A seven-gene signature aggregates a subgroup of stage II colon cancers with stage III. Omics 2012, 16, 560–565. [Google Scholar] [CrossRef]
Ramaswamy, S.; Ross, K.N.; Lander, E.S.; Golub, T.R. A molecular signature of metastasis in primary solid tumors. Nat. Genet. 2003, 33, 49–54. [Google Scholar] [CrossRef]
Golub, T.R.; Slonim, D.K.; Tamayo, P.; Huard, C.; Gaasenbeek, M.; Mesirov, J.P.; Coller, H.; Loh, M.L.; Downing, J.R.; Caligiuri, M.A.; et al. Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science 1999, 286, 531–537. [Google Scholar] [CrossRef]
Bhattacharya, S.; Andorf, S.; Gomes, L.; Dunn, P.; Schaefer, H.; Pontius, J.; Berger, P.; Desborough, V.; Smith, T.; Campbell, J.; et al. ImmPort: Disseminating data to the public for the future of immunology. Immunol. Res. 2014, 58, 234–239. [Google Scholar] [CrossRef]
Shen, M.; Xie, Q.; Zhang, R.; Yu, C.; Xiao, P. Metabolite-assisted models improve risk prediction of coronary heart disease in patients with diabetes. Front. Pharmacol. 2023, 14, 1175021. [Google Scholar] [CrossRef]
Subramanian, A.; Tamayo, P.; Mootha, V.K.; Mukherjee, S.; Ebert, B.L.; Gillette, M.A.; Paulovich, A.; Pomeroy, S.L.; Golub, T.R.; Lander, E.S.; et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 2005, 102, 15545–15550. [Google Scholar] [CrossRef]
Sun, Z.; Dang, Q.; Liu, Z.; Shao, B.; Chen, C.; Guo, Y.; Chen, Z.; Zhou, Q.; Hu, S.; Liu, J.; et al. LINC01272/miR-876/ITGB2 axis facilitates the metastasis of colorectal cancer via epithelial-mesenchymal transition. J. Cancer 2021, 12, 3909–3919. [Google Scholar] [CrossRef]
Xie, X.; Liu, Q.; Zhou, X.; Liang, R.; Yang, S.; Xu, M.; Zhao, H.; Li, C.; Chen, Y.; Cai, X. Comprehensive analysis of cuproptosis-related genes in immune infiltration and prognosis in lung adenocarcinoma. Comput. Biol. Med. 2023, 158, 106831. [Google Scholar] [CrossRef] [PubMed]
Liu, L.; Xiao, Y.; Wei, D.; Wang, Q.; Zhang, J.K.; Yuan, L.; Bai, G.Q. Development and validation of a nomogram for predicting suicide risk and prognostic factors in bladder cancer patients following diagnosis: A population-based retrospective study. J. Affect. Disord. 2024, 347, 124–133. [Google Scholar] [CrossRef] [PubMed]
Xiao, Y.; Zhang, G.; Wang, L.; Liang, M. Exploration and validation of a combined immune and metabolism gene signature for prognosis prediction of colorectal cancer. Front. Endocrinol 2022, 13, 1069528. [Google Scholar] [CrossRef] [PubMed]
Zheng, L.; Xu, Z.; Zhang, W.; Lin, H.; Zhang, Y.; Zhou, S.; Liu, Z.; Gu, X. Identification and validation of a prognostic signature based on six immune-related genes for colorectal cancer. Discov. Oncol. 2024, 15, 192. [Google Scholar] [CrossRef] [PubMed]
Li, M.; Wang, H.; Li, W.; Peng, Y.; Xu, F.; Shang, J.; Dong, S.; Bu, L.; Wang, H.; Wei, W.; et al. Identification and validation of an immune prognostic signature in colorectal cancer. Int. Immunopharmacol. 2020, 88, 106868. [Google Scholar] [CrossRef]
Yang, W.; Soares, J.; Greninger, P.; Edelman, E.J.; Lightfoot, H.; Forbes, S.; Bindal, N.; Beare, D.; Smith, J.A.; Thompson, I.R.; et al. Genomics of Drug Sensitivity in Cancer (GDSC): A resource for therapeutic biomarker discovery in cancer cells. Nucleic Acids Res. 2013, 41, D955–D961. [Google Scholar] [CrossRef]
Maeser, D.; Gruener, R.F.; Huang, R.S. oncoPredict: An R package for predicting in vivo or cancer patient drug response and biomarkers from cell line screening data. Brief. Bioinform. 2021, 22, bbab260. [Google Scholar] [CrossRef]
Newman, A.M.; Liu, C.L.; Green, M.R.; Gentles, A.J.; Feng, W.; Xu, Y.; Hoang, C.D.; Diehn, M.; Alizadeh, A.A. Robust enumeration of cell subsets from tissue expression profiles. Nat. Methods 2015, 12, 453–457. [Google Scholar] [CrossRef]
Gui, C.P.; Wei, J.H.; Chen, Y.H.; Fu, L.M.; Tang, Y.M.; Cao, J.Z.; Chen, W.; Luo, J.H. A new thinking: Extended application of genomic selection to screen multiomics data for development of novel hypoxia-immune biomarkers and target therapy of clear cell renal cell carcinoma. Brief. Bioinform. 2021, 22, bbab173. [Google Scholar] [CrossRef]
Jiang, P.; Gu, S.; Pan, D.; Fu, J.; Sahu, A.; Hu, X.; Li, Z.; Traugh, N.; Bu, X.; Li, B.; et al. Signatures of T cell dysfunction and exclusion predict cancer immunotherapy response. Nat. Med. 2018, 24, 1550–1558. [Google Scholar] [CrossRef] [PubMed]
Huang, X.; Liu, Y.; Qian, C.; Shen, Q.; Wu, M.; Zhu, B.; Feng, Y. CHSY3 promotes proliferation and migration in gastric cancer and is associated with immune infiltration. J. Transl. Med. 2023, 21, 474. [Google Scholar] [CrossRef] [PubMed]
Charoentong, P.; Finotello, F.; Angelova, M.; Mayer, C.; Efremova, M.; Rieder, D.; Hackl, H.; Trajanoski, Z. Pan-cancer Immunogenomic Analyses Reveal Genotype-Immunophenotype Relationships and Predictors of Response to Checkpoint Blockade. Cell Rep. 2017, 18, 248–262. [Google Scholar] [CrossRef] [PubMed]
Ding, P.; Li, L.; Li, L.; Lv, X.; Zhou, D.; Wang, Q.; Chen, J.; Yang, C.; Xu, E.; Dai, W.; et al. C5aR1 is a master regulator in Colorectal Tumorigenesis via Immune modulation. Theranostics 2020, 10, 8619–8632. [Google Scholar] [CrossRef]
Müller, A.; Homey, B.; Soto, H.; Ge, N.; Catron, D.; Buchanan, M.E.; McClanahan, T.; Murphy, E.; Yuan, W.; Wagner, S.N.; et al. Involvement of chemokine receptors in breast cancer metastasis. Nature 2001, 410, 50–56. [Google Scholar] [CrossRef]
Zhang, Y.; Luo, Y.; Qin, S.L.; Mu, Y.F.; Qi, Y.; Yu, M.H.; Zhong, M. The clinical impact of ICOS signal in colorectal cancer patients. Oncoimmunology 2016, 5, e1141857. [Google Scholar] [CrossRef]
Alderdice, M.; Craig, S.G.; Humphries, M.P.; Gilmore, A.; Johnston, N.; Bingham, V.; Coyle, V.; Senevirathne, S.; Longley, D.B.; Loughrey, M.B.; et al. Evolutionary genetic algorithm identifies IL2RB as a potential predictive biomarker for immune-checkpoint therapy in colorectal cancer. NAR Genom. Bioinform. 2021, 3, lqab016. [Google Scholar] [CrossRef]
Liu, X.; Meng, X.; Peng, X.; Yao, Q.; Zhu, F.; Ding, Z.; Sun, H.; Liu, X.; Li, D.; Lu, Y.; et al. Impaired AGO2/miR-185-3p/NRP1 axis promotes colorectal cancer metastasis. Cell Death Dis. 2021, 12, 390. [Google Scholar] [CrossRef] [PubMed]
Petit, J.Y.; Cavana, P.; Thoumire, S.; Guillot, J.; Perrot, S. Use of a modified hair strand test to assess the antifungal activity kinetics of dog hair after a 2% climbazole shampoo application. Vet. Dermatol. 2016, 27, 148-e38. [Google Scholar] [CrossRef] [PubMed]
Gu, C.; Wang, X.; Long, T.; Wang, X.; Zhong, Y.; Ma, Y.; Hu, Z.; Li, Z. FSTL1 interacts with VIM and promotes colorectal cancer metastasis via activating the focal adhesion signalling pathway. Cell Death Dis. 2018, 9, 654. [Google Scholar] [CrossRef] [PubMed]
Guinney, J.; Dienstmann, R.; Wang, X.; de Reyniès, A.; Schlicker, A.; Soneson, C.; Marisa, L.; Roepman, P.; Nyamundanda, G.; Angelino, P.; et al. The consensus molecular subtypes of colorectal cancer. Nat. Med. 2015, 21, 1350–1356. [Google Scholar] [CrossRef]
Ohishi, T.; Kaneko, M.K.; Yoshida, Y.; Takashima, A.; Kato, Y.; Kawada, M. Current Targeted Therapy for Metastatic Colorectal Cancer. Int. J. Mol. Sci. 2023, 24, 1702. [Google Scholar] [CrossRef]
Mutlu, G.M.; Budinger, G.R.; Wu, M.; Lam, A.P.; Zirk, A.; Rivera, S.; Urich, D.; Chiarella, S.E.; Go, L.H.; Ghosh, A.K.; et al. Proteasomal inhibition after injury prevents fibrosis by modulating TGF-β(1) signalling. Thorax 2012, 67, 139–146. [Google Scholar] [CrossRef]
Chapman, P.B.; Hauschild, A.; Robert, C.; Haanen, J.B.; Ascierto, P.; Larkin, J.; Dummer, R.; Garbe, C.; Testori, A.; Maio, M.; et al. Improved survival with vemurafenib in melanoma with BRAF V600E mutation. N. Engl. J. Med. 2011, 364, 2507–2516. [Google Scholar] [CrossRef]
King, A.J.; Arnone, M.R.; Bleam, M.R.; Moss, K.G.; Yang, J.; Fedorowicz, K.E.; Smitheman, K.N.; Erhardt, J.A.; Hughes-Earle, A.; Kane-Carson, L.S.; et al. Dabrafenib; preclinical characterization, increased efficacy when combined with trametinib, while BRAF/MEK tool combination reduced skin lesions. PLoS ONE 2013, 8, e67583. [Google Scholar] [CrossRef]
Lakomy, D.S.; Urbauer, D.L.; Westin, S.N.; Lin, L.L. Phase I study of the PARP inhibitor talazoparib with radiation therapy for locally recurrent gynecologic cancers. Clin. Transl. Radiat. Oncol. 2020, 21, 56–61. [Google Scholar] [CrossRef]
Clark, P.I.; Slevin, M.L. The clinical pharmacology of etoposide and teniposide. Clin. Pharmacokinet. 1987, 12, 223–252. [Google Scholar] [CrossRef]
de Visser, K.E.; Joyce, J.A. The evolving tumor microenvironment: From cancer initiation to metastatic outgrowth. Cancer Cell 2023, 41, 374–403. [Google Scholar] [CrossRef]
Hou, S.; Zhao, Y.; Chen, J.; Lin, Y.; Qi, X. Tumor-associated macrophages in colorectal cancer metastasis: Molecular insights and translational perspectives. J. Transl. Med. 2024, 22, 62. [Google Scholar] [CrossRef]
Zhang, L.; Yu, X.; Zheng, L.; Zhang, Y.; Li, Y.; Fang, Q.; Gao, R.; Kang, B.; Zhang, Q.; Huang, J.Y.; et al. Lineage tracking reveals dynamic relationships of T cells in colorectal cancer. Nature 2018, 564, 268–272. [Google Scholar] [CrossRef] [PubMed]
Llosa, N.J.; Cruise, M.; Tam, A.; Wicks, E.C.; Hechenbleikner, E.M.; Taube, J.M.; Blosser, R.L.; Fan, H.; Wang, H.; Luber, B.S.; et al. The vigorous immune microenvironment of microsatellite instable colon cancer is balanced by multiple counter-inhibitory checkpoints. Cancer Discov. 2015, 5, 43–51. [Google Scholar] [CrossRef] [PubMed]
Xu, Y.; Wang, Z.; Li, F. Survival prediction and response to immune checkpoint inhibitors: A prognostic immune signature for hepatocellular carcinoma. Transl. Oncol. 2021, 14, 100957. [Google Scholar] [CrossRef]
Shi, W.; Zou, R.; Yang, M.; Mai, L.; Ren, J.; Wen, J.; Liu, Z.; Lai, R. Analysis of Genes Involved in Ulcerative Colitis Activity and Tumorigenesis Through Systematic Mining of Gene Co-expression Networks. Front. Physiol. 2019, 10, 662. [Google Scholar] [CrossRef]
McNamee, E.N.; Masterson, J.C.; Veny, M.; Collins, C.B.; Jedlicka, P.; Byrne, F.R.; Ng, G.Y.; Rivera-Nieves, J. Chemokine receptor CCR7 regulates the intestinal TH1/TH17/Treg balance during Crohn’s-like murine ileitis. J. Leukoc. Biol. 2015, 97, 1011–1022. [Google Scholar] [CrossRef]
Wikenheiser, D.J.; Stumhofer, J.S. ICOS Co-Stimulation: Friend or Foe? Front. Immunol. 2016, 7, 304. [Google Scholar] [CrossRef]
Amatore, F.; Gorvel, L.; Olive, D. Role of Inducible Co-Stimulator (ICOS) in cancer immunotherapy. Expert. Opin. Biol. Ther. 2020, 20, 141–150. [Google Scholar] [CrossRef]
Li, S.; Xie, Q.; Zeng, Y.; Zou, C.; Liu, X.; Wu, S.; Deng, H.; Xu, Y.; Li, X.C.; Dai, Z. A naturally occurring CD8(+)CD122(+) T-cell subset as a memory-like Treg family. Cell Mol. Immunol. 2014, 11, 326–331. [Google Scholar] [CrossRef] [PubMed]
Chuckran, C.A.; Liu, C.; Bruno, T.C.; Workman, C.J.; Vignali, D.A. Neuropilin-1: A checkpoint target with unique implications for cancer immunology and immunotherapy. J. Immunother. Cancer 2020, 8, e000967. [Google Scholar] [CrossRef] [PubMed]
Liu, C.; Somasundaram, A.; Manne, S.; Gocher, A.M.; Szymczak-Workman, A.L.; Vignali, K.M.; Scott, E.N.; Normolle, D.P.; John Wherry, E.; Lipson, E.J.; et al. Neuropilin-1 is a T cell memory checkpoint limiting long-term antitumor immunity. Nat. Immunol. 2020, 21, 1010–1021. [Google Scholar] [CrossRef] [PubMed]
van de Wetering, M.; Francies, H.E.; Francis, J.M.; Bounova, G.; Iorio, F.; Pronk, A.; van Houdt, W.; van Gorp, J.; Taylor-Weiner, A.; Kester, L.; et al. Prospective derivation of a living organoid biobank of colorectal cancer patients. Cell 2015, 161, 933–945. [Google Scholar] [CrossRef]
Sveen, A.; Kopetz, S.; Lothe, R.A. Biomarker-guided therapy for colorectal cancer: Strength in complexity. Nat. Rev. Clin. Oncol. 2020, 17, 11–32. [Google Scholar] [CrossRef]
Biller, L.H.; Schrag, D. Diagnosis and Treatment of Metastatic Colorectal Cancer: A Review. Jama 2021, 325, 669–685. [Google Scholar] [CrossRef]
Lièvre, A.; Bachet, J.B.; Boige, V.; Cayre, A.; Le Corre, D.; Buc, E.; Ychou, M.; Bouché, O.; Landi, B.; Louvet, C.; et al. KRAS mutations as an independent prognostic factor in patients with advanced colorectal cancer treated with cetuximab. J. Clin. Oncol. 2008, 26, 374–379. [Google Scholar] [CrossRef]
Lee, D.W.; Kim, K.J.; Han, S.W.; Lee, H.J.; Rhee, Y.Y.; Bae, J.M.; Cho, N.Y.; Lee, K.H.; Kim, T.Y.; Oh, D.Y.; et al. KRAS mutation is associated with worse prognosis in stage III or high-risk stage II colon cancer patients treated with adjuvant FOLFOX. Ann. Surg. Oncol. 2015, 22, 187–194. [Google Scholar] [CrossRef]
Thind, A.S.; Monga, I.; Thakur, P.K.; Kumari, P.; Dindhoria, K.; Krzak, M.; Ranson, M.; Ashford, B. Demystifying emerging bulk RNA-Seq applications: The application and utility of bioinformatic methodology. Brief. Bioinform. 2021, 22, bbab259. [Google Scholar] [CrossRef]
Khan, S.U.; Huang, Y.; Ali, H.; Ali, I.; Ahmad, S.; Khan, S.U.; Hussain, T.; Ullah, M.; Lu, K. Single-cell RNA Sequencing (scRNA-seq): Advances and Challenges for Cardiovascular Diseases (CVDs). Curr. Probl. Cardiol. 2024, 49, 102202. [Google Scholar] [CrossRef]

Figure 1. The workflow of this study.

Figure 2. Single–cell atlas of CRC. (A) Patient origin of cells in primary (left) and liver metastases (right) tumors. (B) T–SNE plot for cells in distinct patients (left) or sites (right). (C) Clustering of single cells and label colors according to separate clusters. (D) The plots of typical markers of some main cell types (epithelial/plasma/T cells). (E) Identification of main cell types based on the expression of marker genes, label colors by cell types. (F) Heatmap of large–scale CNVs to distinguish cancer cells from non–cancer cells.

Figure 3. Development of metastasis–based immune prognostic model (MIPM). (A) Plot for the prediction accuracy of multiple gene signatures distinguishing primary and metastatic CRC cancer cells. (B) To verify the classification efficacy of the 4000–gene signature in an additional scRNA–seq dataset. (C) Cross–validation at the patient level was used to assess the performance of the 4000–gene signature. (D) Immune-related genes importance plot. (E) Twenty–two intersection genes between metastases–associated gene expression signature and high–contribution immune–related genes. (F) The enrichment analysis results of twenty–two intersection genes. (G) Forest plot of univariable Cox regression analysis with six prognosis–related genes. (H) Plots of selection of MIPM genes based on Lasso regression, each curve represents the coefficient variation trajectory of each MIPM gene. (I) The functional enrichment analysis results of MIPM genes.

Figure 4. Associations between MIPM and clinical characteristics. (A) Kaplan–Meier survival curves of OS between high–risk and low–risk patients in GSE39582 datasets were used to compare the survival differences between the two groups. (B) The optimal threshold for grouping determined by the “survminer” method is −0.17. (C) Risk curve and risk scatter plot of CRC patients demonstrated a clear separation between high–risk and low–risk patients. (D) ROC curves of MIPM risk scores and other clinical variables (age, sex, stage) for predicting OS. Among them, MIPM had a better predictive ability compared with other clinical variables. (E) The heatmap showed significant differences in MIPM gene expression and various clinical features (age, sex, stage, etc.) between high− and low−risk groups. (F) The MIPM risk scores exhibited significant differences among subgroups with different clinical features. (G) There were differences in the MIPM risk scores among CMS groups, and Sankey map further demonstrated the corresponding relationships between CMS groups and MIPM risk groups. (H) The prognostic stratification ability of MIPM for patients with CMS subtypes in GSE39582 datasets.

Figure 5. Evaluation of the prognostic efficacy of MIPM in multiple bulk validation datasets. (A) Kaplan–Meier survival curves were plotted in ten external validation datasets to compare the differences in OS between high–risk and low–risk patients. (B) ROC curves and calibration curves were used to evaluate the predictive ability (OS) of MIPM in the validation datasets. (C) DCA was employed to evaluate the clinical utility of the MIPM. (D) Assessment of the MIPM’s effectiveness in predicting recurrence risk across seven validation datasets with available tumor recurrence outcomes. (E) The discriminative capacity and predictive accuracy of the MIPM for DFS were validated using ROC and calibration curves, while (F) DCA was applied to determine its net clinical benefit.

Figure 6. (A) Evaluation of the predictive power of MIPM in CRC patients receiving chemotherapy in six validation datasets containing chemotherapy information. Evaluating the ability of MIPM to predict chemotherapy benefit and its clinical applicability by (B) ROC, calibration curves, and (C) DCA. (D) Patients were categorized into four subgroups according to MIPM (high–risk/low–risk) and chemotherapy status (yes/no), allowing the assessment of interaction effects. (E) ROC curves were conducted to compare the performance of MIPM in predicting OS between chemotherapy-treated and untreated patients.

Figure 7. Construction and validation of nomogram models. (A) MIPM combined clinical variables (age, sex, stage) to build nomogram models to predict the 3– and 5–year OS of CRC patients (‘***’, p < 0.001). Evaluation of the predictive powers of nomogram models by (B) calibration curves. (C) ROC curves were used to test the nomograms’ predictive performance, and the results demonstrated that they achieved relatively high predictive accuracy.

Figure 8. Analysis of drug sensitivity and immunity. (A) The predicted drug sensitivities (IC50 values) of the four drug components (Bortezomib, Dabrafenib, Talazoparib, and Teniposide) differed in CRC cell lines between high– and low–risk groups (p < 0.05). (B) The significant correlations (p < 0.05) between MIPM risk scores and drug sensitivities. Red (or purple) means that the cell lines with higher risk scores were resistant (or sensitive) to the drug, which suggested that these drugs might serve as potential candidates for the CRC treatment. (C) The abundance of signatures was calculated for twenty-two immune cell subpopulations for high– and low–risk patients. (D) Differences in the abundance of immune infiltration of twenty-two immune cells between high– and low–risk groups. (E) Comparison of the differences in TIDE, T cell exclusion, T cell dysfunction, and MSI scores in different MIPM subgroups. Differences of the OS among patient groups stratified by (F) TIDE scores and (G) MIPM risk scores and the combined TIDE scores. (H) The correlations between MIPM risk scores and immune–related features (stromal scores, immune scores, estimate scores, and tumor purity calculated by ESTIMATE) and the expression of key immune checkpoints (PD–1, PD–L1, and CTLA4), red represents positive correlation and blue represents negative correlation. (I) The expression differences of the immune checkpoint genes PD–L1, PD–1, and CTLA4 in different MIPM subgroups. (J) Comparison of MHC, EC, SC, and CP scores differences between different MIPM subgroups. (K) Comparison of the prognostic efficacy of MIPM with other immune–related prognostic models [13,38,39,40] (p < 0.05 is represented by ‘significant’, and the values in the figure represent the AUC values). (L) The predictive performance of MIPM in chemotherapy–naïve patients. Intergroup comparisons were performed using the Wilcoxon test, with the p–value threshold of 0.05 for statistical significance. Group allocation was explicitly annotated in the figure. ‘****’, p < 0.0001; ‘***’, p < 0.001; ‘**’, p < 0.01; ‘*’, p < 0.05; ‘ns’, 0.05 < p.

Table 1. Summary of datasets used in this study.

	Series	Platform	Cells/Patients	Samples Treated with Chemo
scRNA–seq	GSE178318	GPL24676	113,331/6	3
Bulk	GSE39582	GPL570	556	233
	GSE159216	GPL17586	171	156
	GSE87211	GPL13497	196	196
	GSE29621	GPL17586	65	38
	GSE72968	GPL570	68	68
	GSE72970	GPL570	124	124
	GSE12945	GPL96	62	–
	GSE17536	GPL570	177	–
	GSE17537	GPL570	55	–
	GSE17538	GPL570	232	–
	GSE37892	GPL570	130	–

Table 2. Summary of validation cohorts and MIPM performance.

Series	Platform	Sample Size	N (OS Event)	N (DFS Event)	p Value	HR	95% CI	AUC
GSE39582	GPL570	556	187	177	<0.001	2.153	1.603–2.891	0.655
GSE159216	GPL17586	171	108	-	<0.001	1.967	1.347–2.873	0.624
GSE87211	GPL13497	196	28	28	<0.001	3.578	1.460–8.770	0.653
GSE29621	GPL17586	65	25	9	<0.001	5.068	1.811–14.182	0.748
GSE72968	GPL570	68	49	-	0.019	1.925	1.032–3.590	0.559
GSE72970	GPL570	124	92	-	0.049	1.510	1.032–2.273	0.572
GSE12945	GPL96	62	12	4	0.002	5.247	0.705–39.062	0.622
GSE17536	GPL570	177	73	36	<0.001	2.774	1.693–4.543	0.684
GSE17537	GPL570	55	20	19	0.023	2.769	1.132–6.770	0.584
GSE17538	GPL570	232	93	55	<0.001	2.439	1.574–3.780	0.656
GSE37892	GPL570	130	37	-	<0.001	2.870	1.433–5.747	0.621

Table 3. Univariate and multivariate Cox regression analysis in validation datasets.

	Univariate Analysis			Multivariate Analysis
Variables	HR	95% CI	p Value	HR	95% CI	p Value
GSE39582
Risk (high vs. low)	2.160	1.615–2.888	<0.001 ***	1.873	1.393–2.518	<0.001 ***
Sex (male vs. female)	1.328	0.990–1.780	0.058 .	1.346	0.999–1.814	0.051 .
Age	1.023	1.011–1.036	<0.001 ***	1.023	1.011–1.036	<0.001 ***
Stage (III/IV vs. I/II)	1.818	1.360–2.431	<0.001 ***	1.815	1.353–2.433	<0.001 ***
GSE17538
Risk (high vs. low)	2.489	1.623–3.817	<0.001 ***	2.225	1.443–3.429	<0.001 ***
Sex (male vs. female)	1.006	0.659–1.535	0.979	1.019	0.658–1.578	0.934
Age	1.012	0.995–1.029	0.171	1.022	1.005–1.039	0.013 *
Stage (III/IV vs. I/II)	3.563	2.139–5.935	<0.001 ***	3.604	2.146–6.054	<0.001 ***
GSE17536
Risk (high vs. low)	2.790	1.755–4.436	<0.001 ***	2.598	1.629–4.144	<0.001 ***
Sex (male vs. female)	1.105	0.694–1.759	0.674	1.032	0.630–1.689	0.902
Age	1.006	0.988–1.025	0.492	1.016	0.997–1.034	0.096 .
Stage (III/IV vs. I/II)	4.220	2.387–7.459	<0.001 ***	4.217	2.369–7.506	<0.001 ***

‘***’, p < 0.001; ‘*’, p < 0.05; ‘.’, 0.05 < p < 0.1.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Xing, K.; Li, L.; Ma, Y.; Zhu, J. Integrating Primary and Metastatic scRNA–Seq and Bulk Data to Develop an Immune–Based Prognosis Signature for Colorectal Cancer. Curr. Issues Mol. Biol. 2025, 47, 652. https://doi.org/10.3390/cimb47080652

AMA Style

Xing K, Li L, Ma Y, Zhu J. Integrating Primary and Metastatic scRNA–Seq and Bulk Data to Develop an Immune–Based Prognosis Signature for Colorectal Cancer. Current Issues in Molecular Biology. 2025; 47(8):652. https://doi.org/10.3390/cimb47080652

Chicago/Turabian Style

Xing, Kaiyuan, Liangshuang Li, Yingnan Ma, and Jiang Zhu. 2025. "Integrating Primary and Metastatic scRNA–Seq and Bulk Data to Develop an Immune–Based Prognosis Signature for Colorectal Cancer" Current Issues in Molecular Biology 47, no. 8: 652. https://doi.org/10.3390/cimb47080652

APA Style

Xing, K., Li, L., Ma, Y., & Zhu, J. (2025). Integrating Primary and Metastatic scRNA–Seq and Bulk Data to Develop an Immune–Based Prognosis Signature for Colorectal Cancer. Current Issues in Molecular Biology, 47(8), 652. https://doi.org/10.3390/cimb47080652

Article Menu

Integrating Primary and Metastatic scRNA–Seq and Bulk Data to Develop an Immune–Based Prognosis Signature for Colorectal Cancer

Abstract

1. Introduction

2. Materials and Methods

2.1. CRC scRNA–Seq and Bulk Data Acquisition and Pre–Procession

2.2. Identification of Gene Expression Signature from Primary and Metastatic Cells

2.3. Screening High–Contribution Immune–Related Genes

2.4. Construction of Metastasis–Based Immune Prognostic Model (MIPM)

2.5. Functional Enrichment Analysis

2.6. Evaluation of MIPM in Validation Cohorts

2.7. Analysis of Clinical Features

2.8. Drug Sensitivity Analysis

2.9. Comprehensive Immune-Related Analysis

2.10. Statistical Analysis

3. Results

3.1. Single–Cell RNA Sequencing (scRNA–Seq) Data Acquisition and Processing

3.2. Establishment of Metastasis–Based Immune Prognostic Model (MIPM)

3.3. Associations of MIPM and Clinical Characteristics in Internal Validation Datasets

3.4. MIPM’s Prognostic Power Across Multiple External Validation Datasets

3.5. MIPM Predicts Chemotherapy Benefits

3.6. Independent Prognostic Factor Evaluation and Nomogram Construction

3.7. MIPM’s Influence on Drug Sensitivity and Resistance in CRC

3.8. Immune Characteristics Between the High– and Low–Risk Groups

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI