Abstract
Triple-negative breast cancer (TNBC) is an aggressive subtype of breast cancer with a high tendency for developing a recurrent disease. Circulating micro-RNAs (cmiRNAs) obtained through liquid biopsy are potential prognostic biomarkers for the assessment of TNBC recurrence risk. In this study, we sequenced cmiRNAs from the serum samples of 14 recurrent and 19 non-recurrent TNBC cases and compared expression profiles in relation to recurrence status, survival data and miRNA expression in matched tumor samples. Differential expression analysis between recurrent and non-recurrent cases identified ten differentially expressed (DE) cmiRNAs, of which cmiRNAs miR-21-5p (p = 0.030, HR = 1.87, 95% CI 1.06–3.30), miR-16-5p (p = 0.032, HR = 0.53, 95% CI 0.30–0.95), and miR-26b-5p (p = 0.023, HR = 0.52, 95% CI 0.29–0.91) were associated with recurrence-free survival in multivariable survival analysis. Expression profiles of matched tumor and serum samples were shown to correlate with each other. DE cmiRNAs were associated with common cancer-related signaling pathways and improved the overall predictive performance of the logistic regression model assessing the recurrence risk. Our results indicate that recurrent and non-recurrent TNBC differ in their cmiRNA expression profiles, and that three specific cmiRNAs can be used to assess the risk of recurrence in TNBC.
1. Introduction
Triple-negative breast cancer (TNBC) is a heterogenous subtype of breast cancer (BC) that is characterized by the lack of estrogen receptor, progesterone receptor expression, along with normal expression levels of human epidermal growth factor receptor 2 (HER2). TNBC accounts for approximately 10–20% of all invasive BC cases [1,2,3] and has been associated with poor prognosis until the recent introduction of immunotherapies (ITs) in neoadjuvant and adjuvant setting [4]. However, not all patients benefit from ITs, and these treatments are also associated with the risk of developing irreversible endocrine disorders [5,6].
Although ITs are shown to decrease the recurrence rate of TNBC, the subtype is known for its high tendency to develop distant metastases [5,6,7]. Different prediction models have been introduced to estimate the risk of recurrent disease and distant metastases, but these models tend to have poor predictive performance for TNBC when compared to other BC subtypes [8,9,10]. The histopathological examination of tumor biopsies has remained the gold standard both for the diagnosis of TNBC and estimating its invasiveness. However, traditional tumor biopsies may not be representative of the entire tumor due to tumor heterogeneity, and procedures to collect biopsies are invasive and not always feasible in the clinical setting [11]. Moreover, there are no comprehensively validated biomarkers that would accurately predict the TNBC metastasis, and novel predictive biomarkers for metastatic TNBC are therefore needed to identify high-risk TNBC cases earlier.
Micro-RNAs (miRNAs) are noncoding RNAs that are involved in the post-transcriptional regulation of genes by inhibition of translation or mRNA cleavage [12]. The dysregulated expression of miRNAs and their direct impact on cancer-associated signaling pathways have been documented in a wide range of cancer diseases, and miRNAs are generally considered potential cancer biomarkers [13]. Recent studies have demonstrated that cancer cells release miRNAs to the extracellular environment through active secretion and cell death, thus suggesting the potential use of circulating miRNAs (cmiRNAs) as non-invasive biomarkers that could be used to predict the outcomes of BC [14,15,16,17].
Here, we isolated and sequenced circulating miRNAs from the serum samples of 33 TNBC cases to identify circulating miRNAs that are potentially associated with recurrent TNBC and could be potentially used as prognostic biomarkers to predict the clinical outcome of TNBC patients.
2. Materials and Methods
2.1. Patient Cohorts and Sample Material
This study included a cohort of 33 Eastern Finnish BC patients who had been diagnosed with a TNBC without locoregional or distant metastases at the time of diagnosis (Table 1). The cohort was collected to include both patients who remained recurrence-free (19 patients) and those who developed locoregional recurrence or distant metastases (14 patients) during the first five years of follow-up. Clinical data and sample materials were obtained from the Kuopio Breast Cancer Project (KBCP), which is a prospective population-based case-control study conducted in 1990–1995 in Eastern Finland [18,19,20]. To assess the impact of long-term freezing on cmiRNA expression levels, three serum samples from the more recent Itä-Länsi rintasyöpäprojekti (ILRS), a prospective population-based breast cancer study conducted in 2010 to 2014, were used as a control to investigate the effect of storage time.
Table 1.
Patient characteristics.
Analyzed serum samples were collected (1) at the time of diagnosis prior to the initiation of treatment, (2) after the end of curative treatment, and (3) at the latest follow-up examination prior to the diagnosis of recurrence or distant disease. Serum samples with similar sampling intervals were selected for cases who remained recurrence-free. All serum samples were stored in −70 °C. Matched fresh-frozen tumor samples from the primary tumors were available for 25 TNBC cases. All tumor samples were obtained during cancer surgery before the initiation of curative treatment. All tumor samples were immediately covered with optimum cutting temperature compound after resection, cooled in liquid isopentane and liquid nitrogen, and stored in −70 °C.
2.2. Isolation of cmiRNAs and Tumor RNA
Circulating miRNAs were isolated from patient serum samples with a miRNEasy Serum Plasma Kit (Qiagen, Hilden, Germany) according to the manufacturer’s protocol. The total RNA from FF tumor samples was isolated with an Ambion mirVana miRNA Isolation Kit (Life Technologies, Carlsbad, CA, USA) according to the manufacturer’s protocol. The purity and concentration of isolated RNA samples were assessed with a NanoDrop ND-1000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA).
2.3. Library Preparation and Sequencing
All cmiRNA libraries were prepared with a QIAseq miRNA Library Kit (Qiagen) according to the manufacturer’s protocol. The concentration and size of the cmiRNA libraries was confirmed with the Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA) with the High-Sensitivity DNA Kit (Agilent Technologies) and Qubit 2.0 fluorometer (Thermo Fisher Scientific) with the DNA High-Sensitivity assay (Invitrogen, Walthman, MA, USA). Libraries were sequenced on the Illumina NextSeq platform (Illumina, San Diego, CA, USA) with 75 bp reads.
Tumor miRNA libraries were prepared with a TruSeq Small RNA Library Preparation Kit (Illumina) according to the manufacturer’s protocol as previously described [21]. The concentration, size and quality of the tumor miRNA libraries was confirmed, and the sequencing was performed as previously described [21]. Total RNA-seq libraries were constructed from 800 ng of total RNA. First, ribosomal RNA was removed using the NEBNext rRNA Depletion Kit (New England BioLabs, Ipswich, MA, USA). Then, libraries were prepared as instructed using the NEBNext Ultra II Directional RNA Library Prep Kit for Illumina (New England BioLabs). The amount of library produced was measured with the Qubit 2.0 fluorometer using a DNA High-Sensitivity assay, and its quality was checked with the Agilent Bioanalyzer using the DNA 1000 kit (Agilent Technologies). After indexing, the libraries were combined and sequenced on the Illumina NextSeq 500 platform (Illumina) with 75 bp reads.
2.4. Bioinformatics
Sequencing data from cmiRNA-seq and miRNA-seq were processed with the in-house bioinformatics pipeline built with the Snakemake workflow management system (v.7.18.2) [22]. Unique molecular identifier (UMI) sequences were extracted and added to FASTQ header with UMI tools (v.1.1.4) [23] while discarding the 3′ adapter and primer sequences. Poor-quality reads and reads shorter than 16 bp were discarded with cutadapt (v.4.2) [23]. Trimmed reads were first aligned to human rRNA and tRNA sequences obtained from RFAM database (v.14) [24]. Unaligned reads were then aligned to human mature miRNA sequences obtained from the miRbase (v.22) [23] with Bowtie2 (v.2.5.1) [25]. Aligned reads were converted to BAM format with Samtools (v.1.16.1) [23] and deduplicated with UMI tools. Raw read counts were counted with the Samtools idxstats command. The bioinformatic analysis of serum and tumor samples slightly differed in terms of trimming and deduplication steps, as the sequenced tumor RNA libraries did not utilize UMI sequences.
The 75-nucleotide single-end RNA-seq reads were quality controlled using FastQC (v.0.11.7) [26]. Reads were then trimmed with Trimmomatic (v.0.39) [27] to remove Illumina sequencing adapters and poor-quality read ends, using the following as essential settings: ILLUMINACLIP:2:30:10, SLIDINGWINDOW:4:10, LEADING:3, TRAILING:3, MINLEN:50. Reads aligning to mtDNA or rRNA, the phiX174 genome, or composed of a single nucleotide were removed using STAR (v.2.7.9a) [28]. The remaining reads were aligned to the GENCODE human transcriptome version 38 (for genome version hg38) using STAR (v.2.7.9a) with essential non-default settings: --seedSearchStartLmax 12, --alignSJoverhangMin 15, --outFilterMultimapNmax 100, --outFilterMismatchNmax 33, --outFilterMatchNminOverLread 0, --outFilterScoreMinOverLread 0.3, and --outFilterType BySJout. The unstranded, uniquely mapping, gene-wise counts for primary alignments produced by STAR were collected in R (v.4.2.2) using Rsubread::featureCounts (v.2.12.3) [29].
Differential expression analyses were carried out using the DESeq2 R package (v.1.38.0) [30] with the Wald statistical test corrected for the multiple testing using the Benjamini–Hochberg correction. A false discovery rate (FDR) of ≤0.05 was considered as statistically significantly differentially expressed (DE). Variance stabilizing transformation (VST) and reads per million (RPM) transformations were used for statistical analyses and visualization, respectively.
2.5. Statistical Analyses
Survival data were analyzed with the univariable log–rank and Kaplan–Meier estimators followed by the multivariable Cox proportional hazard’s model. The used covariates in the multivariable survival analyses were age at the time of diagnosis, tumor grade, tumor stage, chemotherapy, and radiotherapy. Recurrence-free survival (RFS) was calculated as the time of diagnosis to the time of first local or distant metastasis or new BC. The overall survival (OS) and breast cancer-specific survival (BCSS) were calculated as the time from the date of diagnosis to the date of last follow-up or death where the cause of death was coded either caused by BC or not caused by BC. For all survival analyses, a p-value of ≤0.05 was considered as statistically significant. Survival analyses were conducted using IBM SPSS Statistics v.26 software (IBM).
To investigate the potential association between DE miRNAs and tumor features, all patients were divided into four groups based on the quartiles of miRNA expression. A Chi-squared test was used to test whether the distribution of categorical variables differs from the expected distribution. A p-value of ≤0.05 was considered as a statistically significant association. The prognostic value of DE cmiRNAs was further evaluated with univariable and multivariable logistic regression analyses. The variables’ age at the time of diagnosis, tumor grade and stage, and nodal status were used to build a reference multivariable regression model to predict the occurrence of recurrent disease. VST-normalized cmiRNA expressions were added to the reference model either as they were or as a composite variable obtained from the first principal component (PC1) of principal component analysis. The diagnostic performance of the models was estimated with a receiver operating characteristic (ROC) analysis and k-fold cross-validation. Bootstrapping with replacement was used to calculate the area under the curve (AUC) metric and its 95% confidence interval. A Chi-squared test and ROC analysis were conducted using the Scipy (v.1.11.0) [31] and scikit-learn (v.1.4.2) [32] Python packages.
2.6. Pathway Enrichment Analysis
The DIANA-mirPath (v.4) online tool with the Tarbase (v.8) database of experimentally validated miRNA–target gene interactions (MTIs) was used to identify known direct MTIs and affected biological pathways [33,34]. An FDR of ≤0.05 was considered as statistically significant pathway enrichment. The COSMIC Cancer Gene Census database (v.96) was used to identify MTIs that affect known oncogenes and tumor suppressors [35]. Identified MTIs were further analyzed and visualized with Cytoscape (v.3.10.1) software [36].
To validate the biological relevance of identified MTIs, we calculated Pearson correlation coefficient for the cmiRNA and tumor mRNA expression values of each MTI. A p-value of ≤0.10 was considered as statistically significant correlation. Hypergeometric gene set enrichment analysis with the hypeR R package (v.2.1.0) was used to estimate possible enrichment in biological pathways [37]. An FDR of ≤0.05 was considered as statistically significant pathway enrichment.
3. Results
3.1. Recurrent TNBC Is Characterized by Ten DE cmiRNAs
We detected 2656 unique miRNAs from sequenced serum and tumor samples, of which 549 miRNAs (20.7%) were expressed only in serum samples and 76 miRNAs only in tumor samples. A statistically significant correlation between tumor miRNA and cmiRNA expression levels was observed for all miRNAs with a median correlation coefficient of 0.633 (range 0.604–0.665) (Figure 1a).
Figure 1.
Differential expression analysis between tumor and serum samples of recurrent and non-recurrent TNBC cases. (a) Comparison between matched serum and tumor miRNA expression showed a moderate Pearson correlation in all cases, of which one is shown as an example. (b) Ten cmiRNAs were observed to be DE in recurrent TNBC at the time of diagnosis. (c) Two of these miRNAs were DE in matched tumor samples as well. The indicated cmiRNAs, let-7b-5p (d), let-7c-5p (e), and miR-21-5p (f), were DE prior to the detection of recurrent disease as well. In d–f, timepoints 1–3 refer to the time of diagnosis, follow-up sample, and the latest follow-up sample prior to the detection of recurrence, respectively. Asterisks in panels d-f indicate statistical significance levels, with * for p < 0.05, *** for p < 0.001, and NS for a non-significant result.
Differential expression analysis identified ten cmiRNAs that were DE between recurrent and non-recurrent TNBC cases at the time of diagnosis (Figure 1b–c, Table 2). Only two of these miRNAs, let-7c-5p and miR-128-3p, were DE in matched tumor samples as well. Differential expression analysis was repeated for each timepoint to investigate how the expression levels of DE cmiRNAs reflect the clinical status of patients. While no DE cmiRNAs were identified at the first follow-up serum sample, the second follow-up sample collected at the latest follow-up examination prior to the detection of recurrence showed a differential expression of cmiRNAs let-7b-5p, let-7c-5p, and miR-21-5p (Figure 1d–f).
Table 2.
Differentially expressed miRNAs in serum and tumor samples.
3.2. DE cmiRNAs Are Associated with RFS
Univariable survival analysis showed that the upregulated expression of cmiRNA miR-21-5p and the downregulated expression of miR-16-5p, miR-26b-5p, miR-146a-5p, and miR-199a-5p were associated with poor RFS when patients were divided into low and high-expression groups according to the median expression level (Table 3). Univariable survival analysis for BCSS and OS showed similar trends; the upregulated expression of miR-21-5p and the downregulated expression of miR-16-5p and miR-26b-5p were associated with poor BCSS and OS. In addition, the downregulated expression of miR-199a-5p and the upregulated expression of let-7b-5p were associated with poor BCSS.
Table 3.
Association between differentially expressed cmiRNA expression levels and RFS.
Multivariable survival analysis identified miR-16-5p, miR-21-5p, and miR-26b-5p as independent prognostic factors for RFS. The upregulated expression of miR-21-5p was associated with poor RFS (p = 0.040, HR = 1.87, 95% CI 1.06–3.30), while the upregulated expression of miR-16-5p (p = 0.032, HR = 0.53, 95% CI 0.30–0.95) and miR-26b-5p (p = 0.023, HR = 0.52, 95% CI 0.29–0.91) were both associated with improved RFS. Multivariable survival analysis did not observe any significant association with BCSS or OS, and the observed survival trends were mainly explained by the tumor features.
3.3. DE cmiRNAs Are Associated with Poor Tumor Characteristics
The downregulated expression of miR-128-3p and miR-199a-5p was associated with positive lymph node status (p = 0.048 and p = 0.002, respectively, Chi-squared test). The upregulated expression of miR-30e-5p was associated with higher tumor grade (p = 0.040, Chi-Squared test). No further associations were identified.
3.4. DE cmiRNAs Are Associated with Cancer-Associated Pathways
Our database search identified 2843 unique MTIs for DE cmiRNAs. The identified MTIs cover 1936 unique target genes, of which 89 were annotated as known tumor suppressors and 50 as known oncogenes. Correlation analysis identified 280 MTIs where the target gene expression and DE cmiRNA expression level showed significant correlation; 150 of these correlations were positive and 130 were negative (Figure 2). DE cmiRNAs miR-146a-5p, miR-199a-5p, and miR-3614-3p did not show significant correlation with their known target genes. None of these target genes were significantly DE in recurrent TNBC tumors, although some target genes showed notable fold changes.
Figure 2.
A network graph of DE cmiRNAs and their target genes. Size of the node corresponds to the number of cmiRNAs that have direct MTI with an indicated target gene. Color of the node represents fold change in differential gene expression analysis where the RNA-seq data of recurrent and non-recurrent TNBC tumor samples were compared. Only target genes that showed significant correlation with cmiRNA expression level are shown.
Pathway enrichment analysis with DIANA-miRPath identified 132 significantly enriched KEGG pathways when all DE cmiRNAs were used as an input. The identified signaling pathways include many cancer-associated pathways such as pathways regulating proteoglycans, cell cycles, and p53 signaling (Table 4). Hypergeometric gene set enrichment analysis showed that the correlating target genes were significantly enriched in the cell cycle regulating pathway (p = 0.001), while the rest of the observed enrichments were unreliable due to there being a high FDR. These enrichments included, for example, pathways in cancer (p = 0.240), the TGFβ signaling pathway (p = 0.260), and the mTOR signaling pathway (p = 0.260).
Table 4.
The most enriched signaling pathways in pathway enrichment analysis. Analysis was conducted with two alternative methods; DIANA-miRPath was run with all DE cmiRNA target genes that had an existing record in Tarbase, while hypeR was run with a list of target genes that showed significant correlation with matched tumor mRNA expression levels.
3.5. DE cmiRNAs Improve the Performance of Logistic Regression Models
Univariable logistic regression models predicted the presence of recurrent disease relatively well and showed moderate AUC values in ROC analysis (Figure 3a–c). None of the univariable models were able to outperform the reference model (AUC = 0.847, 95% CI 0.762–0.955). Including the cmiRNA expression level in the analysis was found to improve the diagnostic performance of the multivariable regression models (Figure 3d–f). Including PC1 in the multivariable regression model instead of single cmiRNA expression levels was found to provide slightly better overall performance (Figure 3g). All models showed similar behavior in cross-validation (Figure 3h).
Figure 3.
ROC analysis of logistic regression models that were trained to predict the occurrence of TNBC recurrence. Univariable regression models with DE cmiRNA expression level as the only explanatory variable (a–c) showed moderate AUC metrics. Including cmiRNA expression level into reference model improved the diagnostic performance (d–f). The best diagnostic performance was obtained when DE cmiRNA expression levels were included in the model as a component variable (g). All models were validated with k-fold cross-validation and showed relatively similar behavior in validation (h).
4. Discussion
TNBC is often considered the most aggressive subtype of BC, accounting for up to 30% of all BC deaths [38]. Although considerable achievements have been made in the molecular profiling of TNBC, only few of these prognostic and predictive biomarkers have notably improved the TNBC survival [39]. Blood-based biomarkers, such as cmiRNAs, might have the potential to identify BC patients at higher risk of recurrent disease and thus enable more proactive monitoring and treatment of these patients.
One of the key assumptions behind the concept of liquid biopsy is that tumor-specific biomarkers can be detected in body fluids. Elevated miRNA expression levels in both matched tumor and serum samples would suggest that the miRNA is tumor-specific and plays a role in tumorigenesis. Previous studies have reported significant differences and modest correlations between the expression levels of matched tumor and plasma samples [40,41,42]. Our results acknowledge similar challenges, as only a subset of detected cmiRNAs were expressed in the matched tumor samples. This suggests that the results of tumor sequencing are either affected by intra-tumoral heterogeneity and unrepresentative tumor samples or that the majority of detected cmiRNAs do not originate from the tumor tissue. This can be considered as a major challenge for the diagnostic use of cmiRNAs and emphasizes the need for careful evaluation of their biological sources. It is obvious that more extensive studies on the miRNA expression profiles of matched tumors and serum or plasma samples are required to identify factors that influence miRNA expression levels.
Our results indicate that three cmiRNAs—miR-16-5p, miR-21-5p, and miR-26b-5p, are independent prognostic factors for TNBC recurrence. Existing evidence regarding the association between these cmiRNAs and the risk of TNBC recurrence is conflicting. The most compelling evidence lies with miR-21-5p, whose overexpression is known to occur in wide range of solid tumors, including TNBC [43,44]. The upregulated expression of miR-21 has been connected to poor clinical outcome, and recent functional studies have demonstrated that miR-21 promotes tumor cell proliferation and invasion by targeting tumor suppressors PIK3R1 and PTEN [45,46,47,48]. Both tumor suppressors are negative regulators of the PI3K/Akt signaling pathway, whose activation has been shown to be associated with the aggressive disease profile of TNBC [49]. The upregulation of expression of miR-21-5p has been previously connected to recurrent BC as well [50].
Experimental evidence for the functional and prognostic role of miR-16-5p and miR-26b-5p is less comprehensive. Elevated expression levels of miR-16-5p have been observed in the plasma of BC patients and predict the response to immunotherapies [51,52]. Cell studies conducted with TNBC cell lines have linked the upregulated expression of miR-16-5p to favorable cellular behavior, such as suppressed proliferation and invasiveness [53,54], thus supporting the tumor-suppressive role of miR-16-5p suggested by our results. Likewise, functional studies have reported that miR-26b-5p inhibits cell proliferation and epithelial–mesenchymal transition (EMT) in TNBC cell lines MDA-MB-468 and BT-549, thus supporting the observed tumor-suppressive role [55]. To our knowledge, the association between the upregulated serum miR-16-5p and miR-26b-5p expression and improved RFS has not been previously reported in the context of TNBC.
Performed pathway enrichment analysis with predicted MTIs suggested significant enrichment with cancer-associated biological pathways that are known to be associated with TNBC recurrence [56,57]. Since the pathway enrichment analyses from the list of miRNAs may lead to inaccurate results, we carefully validated the results by comparing the list of identified target genes to total tumor RNA-seq data and by repeating the pathway enrichment analysis with another tool [58]. Although we were unable to demonstrate differential gene expressions of the target genes in the tumor, our results indicate that target genes involved in the cell cycle regulation are potential regulatory targets of DE cmiRNAs and might, to some extent, explain the association with TNBC recurrence risk and DE cmiRNA expression.
We performed logistic regression analysis to assess the diagnostic potential of cmiRNAs in the TNBC recurrence risk assessment. Although the used dataset is undeniably too small to reliably assess the diagnostic potential of the DE cmiRNAs, our results suggest that combining cmiRNA expression data with traditional clinicopathological tumor features might improve the performance of the TNBC recurrence risk prediction model and allow the early detection of TNBC patients who might benefit from proactive monitoring.
Our study has certain limitations that should be addressed. Firstly, it is important to note that the used sample material was collected in the 1990s and has undergone long-term freezing for approximately three decades. Although cmiRNAs in general seem to withstand long-term freezing relatively well, freezing may affect the observed cmiRNA expression levels and obtained results [59,60]. To address this challenge with our sample material, the effect of long-term freezing was assessed by comparing the sequencing data to more recent sample material that was collected in 2010–2014 (Table S1). However, the potential impact of long-term freezing on sequencing results cannot be entirely excluded despite our efforts. Another time-related challenge with the sample material is the recent advancement in TNBC treatments. At the time sample collection, ITs were not available, and TNBC was often treated with standard therapies due to limited understanding of its distinct biology. Therefore, our patient material does not reflect current TNBC treatment practices, and our findings cannot be directly generalized to patients treated according to modern standards. Moreover, our study focused on Finnish TNBC patients whose genetic background is unique due to Finland’s geographic location and isolated population history. Given the heterogeneous nature of TNBC, our results may not be directly generalizable to TNBC patients with different genetic backgrounds and environmental factors.
Secondly, the shown associations with biological signaling pathways are based on database records of experimentally validated direct MTIs that have been reported in breast cancer-related studies, and we have not provided any functional validation to support our findings. Functional in vivo and in vitro experiments are needed to validate our results and elucidate the cellular mechanism behind the observed association with TNBC recurrence. The objective of our study was to identify cmiRNAs associated with TNBC recurrence, and functional experiments are a natural continuation for this study in the future.
It is evident that our results require careful validation with a separate and larger patient cohort comprising a diverse group of TNBC patients treated according to modern treatment regimens. Although our results are promising and support the diagnostic potential of cmiRNAs in predicting TNBC recurrence, they should be considered preliminary until validated in other sample sets.
5. Conclusions
Circulating miRNAs are considered as potential biomarkers for TNBC recurrence. Our results show that cmiRNAs miR-16-5p, miR-21-5p, and miR-26b-5p are DE in recurrent TNBC, and their expression levels are associated both with the RFS and relevant biological pathways. Our results indicate that these cmiRNAs might possess predictive value and help to identify high-risk TNBC patients who have a higher risk for developing recurrent disease after curative treatment. Future studies to validate our results are well warranted.
Supplementary Materials
The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/cells13221884/s1, Table S1: Differentially expressed cmiRNAs in cmiRNA samples that were isolated from older KBCP and more recent ILRS serum samples to estimate the effect of long-term freezing on cmiRNA expression levels.
Author Contributions
Conceptualization, J.K., J.M.H., M.T. (Maria Tengström) and A.M.; methodology, J.K., J.M.H., S.H. and M.T. (Mari Taipale); software, J.K. and S.H.; validation, J.K.; formal analysis, J.K. and S.H.; investigation, J.K., J.M.H., S.H., M.T. (Mari Taipale), M.T. (Maria Tengström) and A.M.; resources, J.M.H., M.T. (Maria Tengström) and A.M.; writing—original draft, J.K., J.M.H. and A.M.; writing—review and editing, J.K., J.M.H., M.T. (Maria Tengström), S.H., M.T. (Mari Taipale), V.-M.K. and A.M.; visualization, J.K.; supervision, J.M.H., V.-M.K. and A.M.; project administration, J.M.H., V.-M.K. and A.M.; funding acquisition, J.K., J.M.H., V.-M.K. and A.M. All authors have read and agreed to the published version of the manuscript.
Funding
This study was funded by the research grants of Cancer Foundation Finland sr., Cancer Society of North Savo, Emil Aaltonen Foundation, Paavo Koistinen Foundation, Finnish Cultural Foundations’s Häme Regional Fund, The Finnish Medical Foundation, Orion Research Foundation, Governmental EVO, state research funding (VTR), Research Fund of Kuopio University Hospital, and the strategic funding of the University of Eastern Finland.
Institutional Review Board Statement
The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Research Ethics Committee of the University of Eastern Finland and Kuopio University Hospital (decision 1040/2019, 27 August 2019).
Informed Consent Statement
Written informed consent has been obtained from the patients for participation in this study.
Data Availability Statement
The datasets used and analyzed during the study are available from the corresponding authors on reasonable request.
Acknowledgments
We want to thank Eija Myöhänen for her technical assistance. We would also like to thank the Genome Center of Eastern Finland, UEF, Kuopio, Finland and Biocenter Kuopio, UEF, Kuopio, Finland for providing the sequencing infrastructure as well as the UEF Bioinformatics Center, UEF, Kuopio, Finland, for providing the computational servers used in bioinformatics and statistical analysis.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Boyle, P. Triple-Negative Breast Cancer: Epidemiological Considerations and Recommendations. Ann. Oncol. 2012, 23 (Suppl. S6), vi7–vi12. [Google Scholar] [CrossRef] [PubMed]
- Dawood, S. Triple-Negative Breast Cancer: Epidemiology and Management Options. Drugs 2010, 70, 2247–2258. [Google Scholar] [CrossRef] [PubMed]
- Howard, F.M.; Olopade, O.I. Epidemiology of Triple-Negative Breast Cancer: A Review. Cancer J. 2021, 27, 8–16. [Google Scholar] [CrossRef] [PubMed]
- Yin, L.; Duan, J.-J.; Bian, X.-W.; Yu, S.-C. Triple-Negative Breast Cancer Molecular Subtyping and Treatment Progress. Breast Cancer Res. 2020, 22, 61. [Google Scholar] [CrossRef] [PubMed]
- Schmid, P.; Rugo, H.S.; Adams, S.; Schneeweiss, A.; Barrios, C.H.; Iwata, H.; Diéras, V.; Henschel, V.; Molinero, L.; Chui, S.Y.; et al. Atezolizumab plus Nab-Paclitaxel as First-Line Treatment for Unresectable, Locally Advanced or Metastatic Triple-Negative Breast Cancer (IMpassion130): Updated Efficacy Results from a Randomised, Double-Blind, Placebo-Controlled, Phase 3 Trial. Lancet Oncol. 2020, 21, 44–59. [Google Scholar] [CrossRef]
- Schmid, P.; Cortes, J.; Dent, R.; Pusztai, L.; McArthur, H.; Kümmel, S.; Bergh, J.; Denkert, C.; Park, Y.H.; Hui, R.; et al. Event-Free Survival with Pembrolizumab in Early Triple-Negative Breast Cancer. N. Engl. J. Med. 2022, 386, 556–567. [Google Scholar] [CrossRef]
- Steward, L.; Conant, L.; Gao, F.; Margenthaler, J.A. Predictive Factors and Patterns of Recurrence in Patients with Triple Negative Breast Cancer. Ann. Surg. Oncol. 2014, 21, 2165–2171. [Google Scholar] [CrossRef]
- Li, S.-Y.; Li, Y.-W.; Ma, D.; Shao, Z.-M. Prediction of Axillary Lymph Node Metastasis in Triple-Negative Breast Cancer by Multi-Omics Analysis and an Integrated Model. Ann. Transl. Med. 2022, 10, 623. [Google Scholar] [CrossRef]
- Devi, N.L.; Dhall, A.; Patiyal, S.; Raghava, G.P.S. Transcriptomics Based Prediction of Metastasis in TNBC Patients: Challenges in Cross-Platforms Validation. bioRxiv 2021. [Google Scholar] [CrossRef]
- Chen, X.; Thomas, K.; Folkert, M.R.; Kim, D.N.; Rahimi, A.S.; Zhou, Z.; Wang, J. Predicting Recurrence in Triple Negative Breast Cancer Patients from Clinical Parameters Using a Multi-Objective Classifier. Int. J. Radiat. Oncol. Biol. Phys. 2017, 99, E53. [Google Scholar] [CrossRef][Green Version]
- Gilson, P.; Merlin, J.-L.; Harlé, A. Deciphering Tumour Heterogeneity: From Tissue to Liquid Biopsy. Cancers 2022, 14, 1384. [Google Scholar] [CrossRef] [PubMed]
- O’Brien, J.; Hayder, H.; Zayed, Y.; Peng, C. Overview of MicroRNA Biogenesis, Mechanisms of Actions, and Circulation. Front. Endocrinol. 2018, 9, 402. [Google Scholar] [CrossRef] [PubMed]
- Peng, Y.; Croce, C.M. The Role of MicroRNAs in Human Cancer. Signal Transduct. Target. Ther. 2016, 1, 15004. [Google Scholar] [CrossRef] [PubMed]
- Wang, H.; Peng, R.; Wang, J.; Qin, Z.; Xue, L. Circulating microRNAs as Potential Cancer Biomarkers: The Advantage and Disadvantage. Clin. Epigenetics 2018, 10, 59. [Google Scholar] [CrossRef]
- Davey, M.G.; McGuire, A.; Casey, M.C.; Waldron, R.M.; Paganga, M.; Holian, E.; Newell, J.; Heneghan, H.M.; McDermott, A.M.; Keane, M.M.; et al. Evaluating the Role of Circulating MicroRNAs in Predicting Long-Term Survival Outcomes in Breast Cancer: A Prospective, Multicenter Clinical Trial. J. Am. Coll. Surg. 2023, 236, 317–327. [Google Scholar] [CrossRef]
- Patellongi, I.; Amiruddin, A.; Massi, M.N.; Islam, A.A.; Pratama, M.Y.; Sutandyo, N.; Latar, N.H.M.; Faruk, M. Circulating miR-221/222 Expression as microRNA Biomarker Predicting Tamoxifen Treatment Outcome: A Case–Control Study. Ann. Med. Surg. 2023, 85, 3806. [Google Scholar] [CrossRef]
- Di Cosimo, S.; Ciniselli, C.M.; Pizzamiglio, S.; Cappelletti, V.; Silvestri, M.; El-Abed, S.; Izquierdo, M.; Bajji, M.; Nuciforo, P.; Huober, J.; et al. End-of-Neoadjuvant Treatment Circulating microRNAs and HER2-Positive Breast Cancer Patient Prognosis: An Exploratory Analysis from NeoALTTO. Front. Oncol. 2023, 12, 1028825. [Google Scholar] [CrossRef]
- Pellikainen, M.J.; Pekola, T.T.; Ropponen, K.M.; Kataja, V.V.; Kellokoski, J.K.; Eskelinen, M.J.; Kosma, V.-M. p21WAF1 Expression in Invasive Breast Cancer and Its Association with P53, AP-2, Cell Proliferation, and Prognosis. J. Clin. Pathol. 2003, 56, 214–220. [Google Scholar] [CrossRef]
- Hartikainen, J.M.; Tuhkanen, H.; Kataja, V.; Dunning, A.M.; Antoniou, A.; Smith, P.; Arffman, A.; Pirskanen, M.; Easton, D.F.; Eskelinen, M.; et al. An Autosome-Wide Scan for Linkage Disequilibrium–Based Association in Sporadic Breast Cancer Cases in Eastern Finland: Three Candidate Regions Found. Cancer Epidemiol. Biomark. Prev. 2005, 14, 75–80. [Google Scholar] [CrossRef]
- Kauppinen, J.M.; Kosma, V.-M.; Soini, Y.; Sironen, R.; Nissinen, M.; Nykopp, T.K.; Kärjä, V.; Eskelinen, M.; Kataja, V.; Mannermaa, A. ST14 Gene Variant and Decreased Matriptase Protein Expression Predict Poor Breast Cancer Survival. Cancer Epidemiol. Biomark. Prev. 2010, 19, 2133–2142. [Google Scholar] [CrossRef]
- Kärkkäinen, E.; Heikkinen, S.; Tengström, M.; Kosma, V.-M.; Mannermaa, A.; Hartikainen, J.M. The Debatable Presence of PIWI-Interacting RNAs in Invasive Breast Cancer. Cancer Med. 2021, 10, 3593–3603. [Google Scholar] [CrossRef] [PubMed]
- Köster, J.; Rahmann, S. Snakemake—A Scalable Bioinformatics Workflow Engine. Bioinformatics 2018, 34, 3600. [Google Scholar] [CrossRef] [PubMed]
- Smith, T.; Heger, A.; Sudbery, I. UMI-Tools: Modeling Sequencing Errors in Unique Molecular Identifiers to Improve Quantification Accuracy. Genome Res. 2017, 27, 491–499. [Google Scholar] [CrossRef] [PubMed]
- Kalvari, I.; Nawrocki, E.P.; Ontiveros-Palacios, N.; Argasinska, J.; Lamkiewicz, K.; Marz, M.; Griffiths-Jones, S.; Toffano-Nioche, C.; Gautheret, D.; Weinberg, Z.; et al. Rfam 14: Expanded Coverage of Metagenomic, Viral and microRNA Families. Nucleic Acids Res. 2021, 49, D192–D200. [Google Scholar] [CrossRef] [PubMed]
- Langmead, B.; Salzberg, S.L. Fast Gapped-Read Alignment with Bowtie 2. Nat. Methods 2012, 9, 357–359. [Google Scholar] [CrossRef]
- Andrews, S. FastQC: A Quality Control Tool for High Throughput Sequence Data. Available online: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed on 8 November 2020).
- Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A Flexible Trimmer for Illumina Sequence Data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef]
- Dobin, A.; Davis, C.A.; Schlesinger, F.; Drenkow, J.; Zaleski, C.; Jha, S.; Batut, P.; Chaisson, M.; Gingeras, T.R. STAR: Ultrafast Universal RNA-Seq Aligner. Bioinforma. Oxf. Engl. 2013, 29, 15–21. [Google Scholar] [CrossRef]
- Liao, Y.; Smyth, G.K.; Shi, W. The R Package Rsubread Is Easier, Faster, Cheaper and Better for Alignment and Quantification of RNA Sequencing Reads. Nucleic Acids Res. 2019, 47, e47. [Google Scholar] [CrossRef]
- Love, M.I.; Huber, W.; Anders, S. Moderated Estimation of Fold Change and Dispersion for RNA-Seq Data with DESeq2. Genome Biol. 2014, 15, 550. [Google Scholar] [CrossRef]
- Virtanen, P.; Gommers, R.; Oliphant, T.E.; Haberland, M.; Reddy, T.; Cournapeau, D.; Burovski, E.; Peterson, P.; Weckesser, W.; Bright, J.; et al. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat. Methods 2020, 17, 261–272. [Google Scholar] [CrossRef]
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Müller, A.; Nothman, J.; Louppe, G.; et al. Scikit-Learn: Machine Learning in Python. arXiv 2018, arXiv:1201.0490v4. [Google Scholar]
- Tastsoglou, S.; Skoufos, G.; Miliotis, M.; Karagkouni, D.; Koutsoukos, I.; Karavangeli, A.; Kardaras, F.S.; Hatzigeorgiou, A.G. DIANA-miRPath v4.0: Expanding Target-Based miRNA Functional Analysis in Cell-Type and Tissue Contexts. Nucleic Acids Res. 2023, 51, W154–W159. [Google Scholar] [CrossRef] [PubMed]
- Karagkouni, D.; Paraskevopoulou, M.D.; Chatzopoulos, S.; Vlachos, I.S.; Tastsoglou, S.; Kanellos, I.; Papadimitriou, D.; Kavakiotis, I.; Maniou, S.; Skoufos, G.; et al. DIANA-TarBase v8: A Decade-Long Collection of Experimentally Supported miRNA–Gene Interactions. Nucleic Acids Res. 2018, 46, D239–D245. [Google Scholar] [CrossRef] [PubMed]
- Sondka, Z.; Bamford, S.; Cole, C.G.; Ward, S.A.; Dunham, I.; Forbes, S.A. The COSMIC Cancer Gene Census: Describing Genetic Dysfunction across All Human Cancers. Nat. Rev. Cancer 2018, 18, 696–705. [Google Scholar] [CrossRef]
- Shannon, P.; Markiel, A.; Ozier, O.; Baliga, N.S.; Wang, J.T.; Ramage, D.; Amin, N.; Schwikowski, B.; Ideker, T. Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks. Genome Res. 2003, 13, 2498–2504. [Google Scholar] [CrossRef]
- Federico, A.; Monti, S. hypeR: An R Package for Geneset Enrichment Workflows. Bioinformatics 2020, 36, 1307–1308. [Google Scholar] [CrossRef]
- Fallahpour, S.; Navaneelan, T.; De, P.; Borgo, A. Breast Cancer Survival by Molecular Subtype: A Population-Based Analysis of Cancer Registry Data. CMAJ Open 2017, 5, E734–E739. [Google Scholar] [CrossRef]
- Cocco, S.; Piezzo, M.; Calabrese, A.; Cianniello, D.; Caputo, R.; Di Lauro, V.; Fusco, G.; di Gioia, G.; Licenziato, M.; de Laurentiis, M. Biomarkers in Triple-Negative Breast Cancer: State-of-the-Art and Future Perspectives. Int. J. Mol. Sci. 2020, 21, 4579. [Google Scholar] [CrossRef]
- Armstrong, D.A.; Green, B.B.; Seigne, J.D.; Schned, A.R.; Marsit, C.J. MicroRNA Molecular Profiling from Matched Tumor and Bio-Fluids in Bladder Cancer. Mol. Cancer 2015, 14, 194. [Google Scholar] [CrossRef]
- Zedan, A.H.; Hansen, T.F.; Assenholt, J.; Pleckaitis, M.; Madsen, J.S.; Osther, P.J.S. microRNA Expression in Tumour Tissue and Plasma in Patients with Newly Diagnosed Metastatic Prostate Cancer. Tumor Biol. 2018, 40, 1010428318775864. [Google Scholar] [CrossRef]
- Cojocneanu, R.; Braicu, C.; Raduly, L.; Jurj, A.; Zanoaga, O.; Magdo, L.; Irimie, A.; Muresan, M.-S.; Ionescu, C.; Grigorescu, M.; et al. Plasma and Tissue Specific miRNA Expression Pattern and Functional Analysis Associated to Colorectal Cancer Patients. Cancers 2020, 12, 843. [Google Scholar] [CrossRef] [PubMed]
- Feng, Y.-H.; Tsao, C.-J. Emerging Role of microRNA-21 in Cancer (Review). Biomed. Rep. 2016, 5, 395–402. [Google Scholar] [CrossRef] [PubMed]
- Lü, L.; Mao, X.; Shi, P.; He, B.; Xu, K.; Zhang, S.; Wang, J. MicroRNAs in the Prognosis of Triple-Negative Breast Cancer. Medicine 2017, 96, e7085. [Google Scholar] [CrossRef]
- Yan, L.-X.; Huang, X.-F.; Shao, Q.; Huang, M.-Y.; Deng, L.; Wu, Q.-L.; Zeng, Y.-X.; Shao, J.-Y. MicroRNA miR-21 Overexpression in Human Breast Cancer Is Associated with Advanced Clinical Stage, Lymph Node Metastasis and Patient Poor Prognosis. RNA 2008, 14, 2348–2360. [Google Scholar] [CrossRef]
- Tang, Y.; Zhou, X.; Ji, J.; Chen, L.; Cao, J.; Luo, J.; Zhang, S. High Expression Levels of miR-21 and miR-210 Predict Unfavorable Survival in Breast Cancer: A Systemic Review and Meta-Analysis. Int. J. Biol. Markers 2015, 30, e347–e358. [Google Scholar] [CrossRef]
- Yan, L.-X.; Liu, Y.-H.; Xiang, J.-W.; Wu, Q.-N.; Xu, L.-B.; Luo, X.-L.; Zhu, X.-L.; Liu, C.; Xu, F.-P.; Luo, D.-L.; et al. PIK3R1 Targeting by miR-21 Suppresses Tumor Cell Migration and Invasion by Reducing PI3K/AKT Signaling and Reversing EMT, and Predicts Clinical Outcome of Breast Cancer. Int. J. Oncol. 2016, 48, 471–484. [Google Scholar] [CrossRef]
- Fang, H.; Xie, J.; Zhang, M.; Zhao, Z.; Wan, Y.; Yao, Y. miRNA-21 Promotes Proliferation and Invasion of Triple-Negative Breast Cancer Cells through Targeting PTEN. Am. J. Transl. Res. 2017, 9, 953–961. [Google Scholar]
- Prvanović, M.; Nedeljković, M.; Tanić, N.; Tomić, T.; Terzić, T.; Milovanović, Z.; Maksimović, Z.; Tanić, N. Role of PTEN, PI3K, and mTOR in Triple-Negative Breast Cancer. Life 2021, 11, 1247. [Google Scholar] [CrossRef]
- Huo, D.; Clayton, W.M.; Yoshimatsu, T.F.; Chen, J.; Olopade, O.I. Identification of a Circulating microRNA Signature to Distinguish Recurrence in Breast Cancer Patients. Oncotarget 2016, 7, 55231–55248. [Google Scholar] [CrossRef]
- Feliciano, A.; González, L.; Garcia-Mayea, Y.; Mir, C.; Artola, M.; Barragán, N.; Martín, R.; Altés, A.; Castellvi, J.; Benavente, S.; et al. Five microRNAs in Serum Are Able to Differentiate Breast Cancer Patients From Healthy Individuals. Front. Oncol. 2020, 10, 586268. [Google Scholar] [CrossRef]
- Li, H.; Liu, J.; Chen, J.; Wang, H.; Yang, L.; Chen, F.; Fan, S.; Wang, J.; Shao, B.; Yin, D.; et al. A Serum microRNA Signature Predicts Trastuzumab Benefit in HER2-Positive Metastatic Breast Cancer Patients. Nat. Commun. 2018, 9, 1614. [Google Scholar] [CrossRef] [PubMed]
- Qu, Y.; Liu, H.; Lv, X.; Liu, Y.; Wang, X.; Zhang, M.; Zhang, X.; Li, Y.; Lou, Q.; Li, S.; et al. MicroRNA-16-5p Overexpression Suppresses Proliferation and Invasion as Well as Triggers Apoptosis by Targeting VEGFA Expression in Breast Carcinoma. Oncotarget 2017, 8, 72400–72410. [Google Scholar] [CrossRef] [PubMed]
- Ghafouri-Fard, S.; Khoshbakht, T.; Hussen, B.M.; Abdullah, S.T.; Taheri, M.; Samadian, M. A Review on the Role of Mir-16-5p in the Carcinogenesis. Cancer Cell Int. 2022, 22, 342. [Google Scholar] [CrossRef] [PubMed]
- Ma, S.; Wei, H.; Wang, C.; Han, J.; Chen, X.; Li, Y. MiR-26b-5p Inhibits Cell Proliferation and EMT by Targeting MYCBP in Triple-Negative Breast Cancer. Cell. Mol. Biol. Lett. 2021, 26, 52. [Google Scholar] [CrossRef]
- Iozzo, R.V.; Sanderson, R.D. Proteoglycans in Cancer Biology, Tumour Microenvironment and Angiogenesis. J. Cell. Mol. Med. 2011, 15, 1013–1031. [Google Scholar] [CrossRef]
- Ahrens, T.D.; Bang-Christensen, S.R.; Jørgensen, A.M.; Løppke, C.; Spliid, C.B.; Sand, N.T.; Clausen, T.M.; Salanti, A.; Agerbæk, M.Ø. The Role of Proteoglycans in Cancer Metastasis and Circulating Tumor Cell Analysis. Front. Cell Dev. Biol. 2020, 8, 749. [Google Scholar] [CrossRef]
- Godard, P.; van Eyll, J. Pathway Analysis from Lists of microRNAs: Common Pitfalls and Alternative Strategy. Nucleic Acids Res. 2015, 43, 3490–3497. [Google Scholar] [CrossRef]
- Glinge, C.; Clauss, S.; Boddum, K.; Jabbari, R.; Jabbari, J.; Risgaard, B.; Tomsits, P.; Hildebrand, B.; Kääb, S.; Wakili, R.; et al. Stability of Circulating Blood-Based MicroRNAs—Pre-Analytic Methodological Considerations. PLoS ONE 2017, 12, e0167969. [Google Scholar] [CrossRef]
- Matias-Garcia, P.R.; Wilson, R.; Mussack, V.; Reischl, E.; Waldenberger, M.; Gieger, C.; Anton, G.; Peters, A.; Kuehn-Steven, A. Impact of long-term storage and freeze-thawing on eight circulating microRNAs in plasma samples. PLoS ONE 2020, 15, e0227648. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).