Next Article in Journal
Interpretation of Tumor Response Grade following Preoperative Therapy for Gastric Cancer: An Overview
Next Article in Special Issue
Management of Primary Uterine Cervix B-Cell Lymphoma Stage IE and Fertility Sparing Outcome: A Systematic Review of the Literature
Previous Article in Journal
Optimization of Radiolabeling of a [90Y]Y-Anti-CD66-Antibody for Radioimmunotherapy before Allogeneic Hematopoietic Cell Transplantation
Previous Article in Special Issue
Diagnostic Potential of Endometrial Cancer DNA from Pipelle, Pap-Brush, and Swab Sampling
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Decreased Gene Expression of Antiangiogenic Factors in Endometrial Cancer: qPCR Analysis and Machine Learning Modelling

1
Department of Gynaecology and Obstetrics, Faculty of Medicine, University of Ljubljana, 1000 Ljubljana, Slovenia
2
Division of Gynaecology and Obstetrics, General Hospital Murska Sobota, 9000 Murska Sobota, Slovenia
3
Faculty of Electrical Engineering and Computer Science, University of Maribor, 2000 Maribor, Slovenia
4
Semantika Research, Semantika d.o.o., 2000 Maribor, Slovenia
5
Institute of Biochemistry and Molecular Genetics, Faculty of Medicine, University of Ljubljana, 1000 Ljubljana, Slovenia
6
Division of Gynaecology and Obstetrics, University Medical Centre, 1000 Ljubljana, Slovenia
*
Author to whom correspondence should be addressed.
Cancers 2023, 15(14), 3661; https://doi.org/10.3390/cancers15143661
Submission received: 15 June 2023 / Revised: 13 July 2023 / Accepted: 14 July 2023 / Published: 18 July 2023
(This article belongs to the Special Issue Gynecologic Cancer: From Diagnosis to Treatment)

Abstract

:

Simple Summary

Endometrial cancer (EC) is a prevalent gynaecological cancer, the growth and spread of which are facilitated by angiogenesis. Our study used publicly available datasets to compare the expression of angiogenesis-related genes and proteins in EC tissue and adjacent controls. We validated these findings in a cohort of 36 EC patients and built an EC-grade prediction model using machine learning. The results showed a significant up-regulation of IL8 and LEP and down-regulation of 11 other genes in EC tissue. These genes were differentially expressed in early-stage and lower-grade EC but not in more advanced forms or in patients with deep myometrial or lymphovascular invasion. Gene co-expressions were stronger in EC tissue, especially when the lymphovascular invasion was present. More extensive angiogenesis-related gene involvement was seen in postmenopausal women. Our findings suggest that angiogenesis in EC is primarily driven by reduced antiangiogenic factor expression, with altered regulation in the tumour-adjacent tissue of EC patients with less favourable prognoses.

Abstract

Endometrial cancer (EC) is an increasing health concern, with its growth driven by an angiogenic switch that occurs early in cancer development. Our study used publicly available datasets to examine the expression of angiogenesis-related genes and proteins in EC tissues, and compared them with adjacent control tissues. We identified nine genes with significant differential expression and selected six additional antiangiogenic genes from prior research for validation on EC tissue in a cohort of 36 EC patients. Using machine learning, we built a prognostic model for EC, combining our data with The Cancer Genome Atlas (TCGA). Our results revealed a significant up-regulation of IL8 and LEP and down-regulation of eleven other genes in EC tissues. These genes showed differential expression in the early stages and lower grades of EC, and in patients without deep myometrial or lymphovascular invasion. Gene co-expressions were stronger in EC tissues, particularly those with lymphovascular invasion. We also found more extensive angiogenesis-related gene involvement in postmenopausal women. In conclusion, our findings suggest that angiogenesis in EC is predominantly driven by decreased antiangiogenic factor expression, particularly in EC with less favourable prognostic features. Our machine learning model effectively stratified EC based on gene expression, distinguishing between low and high-grade cases.

1. Introduction

Endometrial cancer (EC) is the most frequent gynaecological cancer in developed countries. Its rates are increasing with the population ageing and with the epidemy of obesity, which is a known risk factor for EC, especially in the postmenopausal population, which presents with the most EC cases [1,2]. Nevertheless, 4% of EC patients are women aged 40 years or younger with a fertility sparing preference [3]. Despite similar early symptoms, i.e., abnormal uterine bleeding and discharge, EC does not present a homogenous malignancy. The pre-treatment diagnosis is set after the endometrial biopsy and histological verification.
Due to its histologic heterogeneity, classification into two types of EC was proposed in 1983: type I EC, which represents a more frequent, oestrogen dependent, and prognostically favourable form; and type II which includes diagnostically less favourable, oestrogen less dependent, and prognostically less favourable cases [4]. To further reduce the diagnostic and prognostic discrepancies, recent molecular classification was introduced, stratifying EC into four risk categories: POLE ultra-mutated, microsatellite instability hyper-mutated, copy-number low, and copy-number high [5,6]. This classification of endometrial cancer has been validated and incorporated in the ESMO/ESTRO risk stratification and is currently used in clinical practice to guide EC management decisions.
A simple hysterectomy with bilateral adnexectomy is a sufficient and final treatment in early-stage EC operative therapy. On the other hand, a suspicion of advanced disease or prognostically less favourable types of EC—with positive risk factors, such as the presence of lymphovascular invasion (LVI) or deep >50% myometrial invasion (DMI)—presents the need for the further advanced retroperitoneal procedure of additional pelvic and paraaortic lymphadenectomy, up to the level of the renal artery, and in many cases also adjuvant chemo- or radiotherapy [7]. Extended treatment is also associated with increased complications and longer recovery, which can vastly affect the patient’s quality of life. A particular EC category is represented by premenopausal EC patients wishing to retain fertility. In such cases, progestin-based therapy and the hysteroscopic resection of focal EC lesions are possible in the earliest stages of well-differentiated EC; however, hysterectomy is advised once the childbearing is completed [7,8].
However, besides fertility-sparing preference in younger patients and the burden of common comorbidities in older patients, the decision on the extent of treatment is based mainly on the histological findings acquired via endometrial sampling, which is only a modest predictor of surgical pathology features [9]. Additional EC stratification and an individually tailored treatment approach would reduce the possibility of both EC recurrence (due to under-treatment), and an increased rate of postoperative complications (due to over-treatment). Biomarkers may be pivotal in a more precise EC prognosis, in the clinician’s decision-making process, and in an individually tailored therapeutic approach.
Angiogenesis is one of the earliest processes promoted by cancer tissue, induced by the lack of oxygen and nutrition in a rapidly growing tumour mass through released pro-angiogenic molecules and suppressed antiangiogenic molecules, commonly named angiogenic factors (AFs). Cancer cells secrete AFs to the surrounding tissue, which provokes the growth of new vessels [10,11,12,13] and enables further cancer growth and metastasis. Our recent data and studies of other groups on preoperative plasma samples from EC patients revealed the potential of angiogenic factors (AFs) as biomarker candidates for the early diagnosis and risk stratification of EC [14,15,16].
Since AFs are controlled and produced directly in the cancerous tissue, confirming AFs’ expression from the tumour cell is needed. Thus, AFs gene expression levels in tumour tissue have the potential to become novel biomarkers as diagnostic and prognostic indicators of EC to guide therapies and promote an understanding of the carcinogenesis of EC.
In recent years, artificial intelligence (AI) and machine learning (ML) methods have been ubiquitously used in several fields, including in medical diagnosis and classification tasks, and relatively recent advances in the field have allowed for the use of advanced ML methods even on (very) small datasets [17].
The aims of this study were manifold and oriented towards evaluating the diagnostic and prognostic potential of AF-encoding gene expression. (i) We analysed publicly available datasets for the expression of angiogenesis-associated genes and proteins in EC tissues (T) compared to tumour-adjacent control tissue (TA). Genes and encoded proteins with the highest change in T versus TA expression were chosen for further analysis. (ii) An additional six genes (CSF3, IL8, LEP, NRP1, TEK, FST) were included in further research based on the results from our previous studies on biomarkers in the plasma samples of EC patients [14,18]. (iii) Ultimately, 15 genes were included in the validation study using the qPCR method on a cohort of 36 EC patients. (iv) By combining TCGA data and data from our study, we applied machine learning modelling to create an EC grade prediction model based on the T gene expressions in EC.

2. Materials and Methods

2.1. Analysis of Publicly Available Datasets

We analysed publicly available datasets for the expression of 91 angiogenesis-associated genes in EC tissues compared to adjacent control tissue, and for the presence of 64 angiogenesis-associated proteins in EC tissue compared to adjacent control tissue. Details about the datasets used in this study are listed in Table 1.
For further analysis and validation on EC tissue, genes/proteins of interest were selected when both criteria were fulfilled: more than 3-fold significant (adjusted p < 0.01) difference in gene expression between the tumour and normal adjacent tissue, and more than a 2-fold significant difference in protein levels between tumour and adjacent tissues. Additionally, six AFs from our previous studies [14,18]—leptin, IL-8, neuropilin-1, Tie-2, follistatin, and G-CSF—were included in the validation study in endometrial tissue. In plasma, leptin and IL-8 were significantly increased, whereas Tie-2 and G-CSF were significantly decreased in EC patients compared to control patients. Neuropilin-1 and follistatin were significantly increased in the plasma of higher-grade EC patients and LVI-positive patients, respectively.

2.2. Validation Study on Endometrial Cancer Tissue

2.2.1. Endometrial Tissue

The specimens of EC tissue and paired morphologically normal adjacent endometrial tissue were obtained from 36 patients undergoing hysterectomies for histologically proven EC (Table 2). All patients were treated at the Department of Gynaecology and Obstetrics at the University Medical Centre, Ljubljana, from 2003 to 2010. Samples were collected and processed according to the approved standard clinical operating procedures. All patients provided written informed consent, and the study was approved by the National Medical Ethics Committee of the Republic of Slovenia (ID 0120-429/2017/8, 5 November 2017).

2.2.2. RNA Isolation and Reverse Transcription

Immediately after surgery, the tissue samples were stored in RNAlater (Thermo Fisher Scientific, Waltham, MA, USA) at −20 °C to stabilize and protect cellular RNA. Tissues were then disrupted in the presence of liquid nitrogen using a mortar and pestle. The total RNA from tissue samples was isolated using Tri Reagent (Sigma-Aldrich, St. Louis, MO, USA), according to the manufacturer’s instructions. The RNA samples were additionally cleaned, and residual DNA was removed using RNeasy Mini kits and RNase-Free DNase sets (Qiagen, Düsseldorf, Germany), respectively. We then analysed the purity and quality of extracted RNA with the Agilent 2100 Bioanalyzer using the RNA 600 Nanokit (Agilent Technologies Inc., Santa Clara, CA, USA) and demonstrated that the RNA was of good quality (an average RIN 7.8 ± 0.80). Samples of the total RNA were reversely transcribed into cDNA using RT2 First Strand Kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions. The cDNA samples were stored at −20 °C.

2.2.3. Quantitative Real-Time PCR

The expression of 15 genes encoding AFs was examined in pairs of cancer and adjacent control endometrium using TaqMan Gene Expression Assays (Applied Biosystems; Foster City, CA, USA) listed in Table 3.
The quantification was accomplished using the Applied Biosystems ViiA 7 Real-Time PCR System (Thermo Fisher Scientific, Waltham, MA, USA), as described in our previous studies [23]. Shortly, each sample was run in triplicates (replicates of 0.25 μL cDNA in a total reaction volume of 5.0 μL) using the Applied Biosystems MicroAmp Optical 384-well plates (Thermo Fisher Scientific, Waltham, MA, USA). The amplification efficiency (E) was first calculated from the slope of the log-linear portion of the calibration curve for each gene and was accounted for in further calculations. Next, the normalization factor was calculated for each sample based on the geometric mean of the two most stably expressed reference genes (HPRT1 and POLR2A). Last, normalized RNA was calculated from the crossing-point value (Cq) as E−Cq, divided by the normalization factor. The Cq cut-off value was set to 36. We followed The Minimum Information for Publication of Quantitative Real-Time PCR Experiments guidelines in performing and interpreting qPCR reactions [24].

2.2.4. Statistics

In the first part of the study, protein and mRNA levels were evaluated in T and TA tissue in publicly available datasets in up to 24 paired samples (all samples where the data were available for both T and TA tissue) and analysed using Wilcoxon matched pairs signed rank test with Bonferroni–Šidák corrections for multiple comparisons; an adjusted p level < 0.01 was considered significant. Further on, in the clinical cohort, gene expression was evaluated in 36 paired samples, which were further stratified into two groups based on the clinical data (FIGO stage, menopausal status) and the histopathological data (tumour histological grade, depth of myometrial invasion, and presence of lymphovascular invasion). mRNA expressions in the tumour samples and the matched adjacent tissues were analysed using the Wilcoxon matched-pairs signed rank test with Bonferroni–Šidák corrections for multiple comparisons. Unmatched data within tissue groups were analysed using the Mann–Whitney U test with Bonferroni–Šidák corrections for multiple comparisons. Unless noted otherwise, data are presented as mean ± sd and p level < 0.05 is considered significant.

2.3. Machine Learning Modelling

Since the study included a very small dataset of 36 patients, as described in Section 2.2.1, the data modelling was performed by combining the data from the TCGA Pan-Cancer study (TCGA data obtained with new generation sequencing technology) and the data from our study (study data). The modelling used all available TCGA data records that contained both the tumour tissue measurements and the tumour-adjacent tissue measurements, combined with a part of the study data for training and the remaining study data for testing. The combined training dataset, therefore, included 44 records, whereas the test dataset included 14 records, which can be sufficient if adequate ML approaches are used [17].
The general steps performed were the following:
  • TCGA and study data were normalised for merging;
  • TCGA data (22 samples) were merged with 22 stratified randomly selected samples of study data; the remaining 14 samples were assigned to the test dataset;
  • An automated machine learning (AutoML) approach was used to create the models on the training dataset;
  • The models were tested on the test dataset.

2.3.1. Merging and Normalisation

Combining a generally available dataset with a part of the target dataset to increase dataset size and reduce model overfitting has been described previously [25], however, with directly mergeable data. Since TCGA and study data were measured using a different approach, a normalisation process needed to be devised to allow the data to be merged. Several approaches have been studied previously, for example, combining microarray data with RNA-seq data [26] or normalising for other divergent factors in gene expression measurements [27].
However, since we wanted to preserve the information of the ratios based on the original distribution of the data (thus making, e.g., quantile normalisation less appropriate), and the data were not normally distributed (thus making e.g., Z scoring less appropriate), we devised the following normalisation method:
  • The best fitting distribution for data was empirically selected by trying to fit the data to one of the common standard distributions (Normal, Log-normal, Poisson, Beta, Gamma).
  • For distributions that require positive data, the data were right-shifted to ensure that the smallest value was positive.
  • The best fitting distribution for most columns (Gamma) was then fitted for all columns, and distribution parameters were calculated, together with the correlation coefficient, significance, and estimated lower and upper bounds at the 95% confidence level using the MATLAB “corrcoef” function. It is worth noting at this point that the Gamma distribution has previously been linked to gene expression in multiple studies [28].
  • The original values were then transformed to the value of the cumulative distribution function (CDF) of the fitted distribution at the original value, thus obtaining a value between 0 and 1, indicating the relative (expected) ratio of the population with a value lower than the original value [29].
Distribution fitting and transformation to the CDF values were performed using MATLAB R2022b software; the fitted parameters are available in the Tables S4 and S5. For missing data, the value of 0.5 was used, effectively meaning that median-based imputation was used for missing data imputation.
The data were then merged into a single dataset, and finally split into the training and test datasets:
  • The training dataset was created by taking all 22 normalised samples from the TCGA dataset and combining them with 22 randomly selected samples from the study dataset, where a stratified random sampling approach was used to ensure the final dataset had a balanced distribution of the output variable (EC grade).
  • The remaining samples from the study dataset represented the test dataset.
The training and test datasets were compared using the Wilcoxon rank-sum (Mann–Whitney U) test using MATLAB’s built-in “ranksum” function and were then exported into CSV files for further processing/modelling.

2.3.2. Modelling and Testing

The case/control classification models were created using the previously described dataset using the mljar-supervised library 0.11.3 [30] and the underlying scikit-learn library version 1.1.1 [31], which can be used successfully for small dataset modelling, as was previously shown in similar datasets [14]. The library was configured using the “Compete” mode, and the model validation phase was customised to utilise a stratified 5-fold validation approach for model selection, where the models were optimised to improve the area under the curve (AUC) for the receiver operating characteristic (ROC) curve using the mljar-supervised, built-in roc_auc metric. The training was limited to 20 min per model.
Data were imported into Python using the Pandas library from the CSV format [32], and three subsets of the data were created based on the hypotheses tested:
  • A model utilising the data combining the tumour tissue, adjacent tissue data, and calculated ratios between the tumour tissue and adjacent tissue measurements;
  • A model utilising only the tumour tissue data;
  • A model utilising only the adjacent tissue data.
For each feature group, the best-performing model (calculated using the aforementioned 5-fold cross-validation method within the training dataset) was tested on the study data holdout (test) samples, containing four high-grade and ten low-grade samples. Confusion matrices were generated utilising the decision threshold calculated during training, providing the four standard metrics (true positive, false positive, true negative, false negative), based on which the model precision, recall, accuracy, sensitivity, specificity, and F1 score were estimated. Confusion matrices were further tested using Fisher’s exact test to confirm that the model result was statistically significantly divergent from random guessing.
All models were trained to predict whether the sample belonged to the high-grade or low-grade EC group. The complete MATLAB, Jupyter Notebook, and Python scripts are available at the following link: https://github.com/klokedm/EndometrialCancerGradePrediction.

3. Results

3.1. Public Databases Examination Revealed Twenty-One AF-Encoding Genes and Twenty-Two AF Proteins That Fulfilled Selection Criteria; Nine Gene/Protein Pairs Were in the Intersection

We first examined the publicly available mRNA and proteome datasets for the presence of angiogenesis-associated proteins (EC tissue, n = up to 95; TA tissue, n = up to 25; paired samples, n = up to 24) and the expression of angiogenesis-associated genes (EC tissue, n = up to 548; TA tissue, n = up to 35; paired samples, n = up to 23) in EC tissue compared to adjacent control tissue. The expression of 91 angiogenesis-associated genes and the levels of 64 angiogenesis-associated proteins in endometrial cancer tissues and adjacent control tissues are collected in Tables S1–S3. The most significantly down-regulated and the most significantly up-regulated genes are shown in Figure 1A, and differential levels of angiogenesis-associated proteins in EC compared to adjacent control tissue are presented in Figure 1B.
Nine genes/proteins of interest (CXCL12, ENPP2, FBLN5, FGF2, LYVE1, PDGFRB, SERPINF1, TIMP2, TIMP3) were then selected from analysed datasets based on the following criteria: (a) the significant difference (adjusted p < 0.01) in gene expression between tumour and tumour-adjacent tissue was more than 3-fold, and (b) the significant difference in protein levels between tumour and adjacent tissues was more than 2-fold. An additional six genes (CSF3, IL8, LEP, NRP1, TEK, FST) were included in further research based on the results from our previous studies of biomarkers in plasma samples of EC patients [14,18]. We ultimately chose 15 genes, which we further examined in the validation study on a cohort of 36 EC patients (Figure 1C).
Finally, we analysed the STRING database [33] for the protein–protein interaction between proteins encoded by 15 selected genes. This way, we detected several known (i.e., from curated databases and credible experimental data) and predicted interactions (i.e., based on gene co-occurrence, co-expression, and gene homology). The interactions are depicted in Figure 1D and are ranked (the thickness of the line) based on the interaction confidence score. The highest combined score (0.913) was assigned to the interaction between CXCL8 (also known as IL-8) and CSF3, coming in the main part from the database annotation score (0.900), enhanced with the co-expression score (0.172). This is followed by LEP/CSF3 and TIMP2/TIMP3, with combined scores of 0.800 and 0.714, respectively. High scores (0.600) from the database annotations are also assigned to PDGFRB/FGF2, TEK/FGF2, PDGFRB/LEP, and PDGFRB/CSF3. TIMP2/TIMP3 and PDGFRB/TEK show the highest gene homology, with scores of 0.925 and 0.588, respectively. The gene co-expression is highest between PDGFRB and each TIMP3, CXCL12, and FBLN5 (co-expression score 0.227, 0.211, and 0.180, respectively), as well as between TEK and LYVE1 (co-expression score 0.179).

3.2. Validation of Findings on the Clinical Cohort

3.2.1. Clinical Characteristics of Enrolled Patients

Paired samples of tumours and tumour-adjacent tissues were collected from 36 patients with histologically verified EC. Their mean age was 62.1 ± 14.0 years. In total, 25 patients (69.4%) were postmenopausal, 28 patients (77.8%) were diagnosed with endometrioid adenocarcinoma, and 8 (22.2%) with other histological types. Deep myometrial invasion was observed in nine EC patients (25.0%), <50% invasion into the myometrium in nineteen EC patients (52.8%), and no invasion into the myometrium in eight EC patients (22.2%). LVI was observed in eight patients (22.2%). According to the classification of the International Federation of Gynecology and Obstetrics [34], twenty-five patients were diagnosed with EC in stage IA (69.4%), nine patients in stage IB (25.0%), one patient in stage II (2.8%), and one patient in stage IV (2.8%). The detailed clinical characteristics are presented in Table 2.

3.2.2. Thirteen Genes Encoding AFs Are Differentially Expressed in Tumour Tissue Compared to Adjacent Control Tissue in EC Patients

The local expression of 15 genes encoding proteins involved in angiogenesis pathways was determined using quantitative polymerase chain reaction (qPCR) in 36 patient-matched samples of tumour and tumour-adjacent macroscopically normal tissue. A total of 13 genes were differentially expressed in EC versus adjacent control tissue samples. IL8 and LEP were up-regulated in tumour tissue (4.8-fold and 4.7-fold, respectively), while CXCL12, FGF2, LYVE2, NRP1, TIMP2, TIMP3, ENPP2, FBLN5, PDGFRB, TEK, and SERPINF1 were down-regulated in tumour tissue in comparison to morphologically normal tumour-adjacent tissue (Figure 2). The most profound differences in the gene expression between the two tissues were observed for CXCL12 and TIMP3 genes (18.2-fold and 14.8-fold change, respectively). Mean fold changes in tumour-to-adjacent tissue expression and the 95% CI for all genes are listed in Table 4.

3.2.3. Relationships of Gene Expression with Clinical Characteristics

In the next step, we analysed the effect of the clinicopathological conditions of patients on the expression of the selected genes. We stratified patients according to the clinical data, i.e., FIGO stage, menopausal status, and the histopathological data, i.e., histological tumour grade, depth of myometrial invasion, and the presence of lymphovascular invasion.

In Early Stages and Lower Grades of EC, but Not in More Advanced or Aggressive Forms of EC, Genes for AFs Tend to Be Differentially Expressed in Tumour Tissue Compared to Adjacent Control Tissue

We assessed endometrioid EC grade 3 cancers together with non-endometrioid tumours (serous and dedifferentiated EC) as high-grade EC since several important reports have firmly demonstrated that high-grade endometrioid cancers have molecular characteristics, risk factors, clinical behaviours, and prognoses overlapping with those of non-endometrioid cancers [35,36]. The low-grade EC group comprised endometrioid EC grade 1 and 2 and mucinous EC. In patients with low-grade EC (n = 26; Figure 3A), there was no difference in the expression of CSF3 and FST between the tumour and adjacent tissue. In addition to those two genes, in patients with FIGO stage IA (n = 25; Figure 3C), there was additionally no difference in the expression of LEP. In both groups of patients, IL8 was up-regulated in tumour tissue, by 5.4-fold and 5.6-fold, respectively. All other genes were down-regulated in low-grade and low-stage EC, most prominently CXCL12, which was down-regulated by 20.9-fold and 21.9-fold, respectively. In high-grade EC (n = 10; Figure 3B), NRP1, TIMP2, TIMP3, and SERPINF1 were significantly down-regulated in T tissue compared to TA tissue, by 6.0-fold, 6.2-fold, 17.2-fold, and 12.2-fold, respectively. In a group of patients with stages IB–IV EC (n = 11; Figure 3D), TIMP2, CSF3, ENPP2, and SERPINF1 were significantly down-regulated in tumour tissue, by 4.9-fold, 9.6-fold, 10.2-fold, and 9.2-fold, respectively.

Genes for AFs Are Differentially Expressed between Tumour and Adjacent Control Tissue Only in Patients without DMI or LVI

The stratification of data according to the presence or absence of deep myometrial or lymphovascular invasion revealed that genes for AFs are differentially expressed only in endometrial tissue from patients without DMI (n = 27; Figure 4A) or LVI (n = 28; Figure 4C). In both analyses, all genes except CSF3 and FST were differentially expressed between the tumour and tumour-adjacent tissue. In both patient categories, IL8 and LEP were up-regulated in tumour tissue: in patients without DMI, 5.8-fold and 5.4-fold, respectively, and in patients with absent LVI, 5.5-fold and 5.0-fold, respectively. Other genes were down-regulated in tumour versus adjacent tissue, most prominently CXCL12 (20.6-fold in patients without DMI and 21.7-fold in patients without LVI), followed by TIMP3 (15.0-fold in DMI absent and 16.2-fold in LVI-absent EC). No significant gene expression difference was detected in EC patients with DMI or LVI.
Noteworthy, the groups of patients with FIGO IB-IV, high-grade EC, patients with LVI, and patients with DMI included in the study were small; thus, data obtained in these groups must be considered cautiously, and additional studies in larger groups of patients are needed.

There Is Much Broader Angiogenesis-Related Gene Involvement in Postmenopausal Women with EC than in Women of Reproductive Age

Next, we stratified patients according to their menopausal status. In premenopausal patients (n = 11; Figure 5A), six genes were statistically significantly down-regulated in tumour tissue compared to tumour-adjacent tissue: TIMP3 (13.1-fold), ENPP2 (11.4-fold), FGF2 (7.6-fold), CXCL12 (7.5-fold), TIMP2 (5.2-fold), and PDGFRB (4.9-fold change). On the other hand, eleven genes were significantly down-regulated in postmenopausal women (n = 25; Figure 5B), most prominently CXCL12 (22.9-fold) and TIMP3 (15.5-fold change).
Finally, we separately compared the gene expression within the tumour and tumour-adjacent tissue in EC patients stratified according to EC grade, FIGO stage, presence of DMI, LVI, and menopausal status. ENPP2 was 2.4-fold down-regulated in tumour tissue, and LYVE1 was 2.8-fold down-regulated in high-grade cancer compared to low-grade cancer. On the other hand, ENPP2 was 2.3-fold up-regulated within tumour-adjacent tissue, and FGF2 was 2.4-fold up-regulated in postmenopausal women compared to premenopausal women. There was no statistically significant difference within the tumour tissue, nor in the tumour-adjacent tissue between patients with different FIGO stage EC, nor between the patients with a presence or absence of LVI or DMI (Figure S1).

3.2.4. Co-Expression Patterns of the Genes: Higher Number of Strong Correlations Was Identified in EC Patients with Present LVI

Correlations between gene expressions were performed to highlight trends and relationships in the expression profiles of selected angiogenesis-related genes in T and TA tissues. Correlations between gene expressions in different tissues and inter-tissue expression relations were analysed.
When all EC patients were considered, we identified five strong positive correlations with r > 0.85 and p < 0.05 within T tissue, with the strongest being between PDGFRB and SERPINF1. A list of all strong correlations is shown in Table 5. The pattern of gene correlations was similar within TA tissue; however, no correlations were considered strong (r > 0.85). No correlation between T and TA tissue gene expression reached r > 0.85 (Figure 6A).
Next, patients were stratified according to LVI status. We identified similar correlation patterns between genes in EC patients without LVI to those in a group of all EC patients (Figure 6B). In patients without LVI, the strongest correlation found within T tissue was between PDGFRB and SERPINF1 and within TA tissue between ENPP2 and TEK.
A drastically higher number of strong correlations was identified in EC patients with present LVI. Within T tissue, 24 strong correlations with r > 0.85 were detected; the strongest correlations were between PDGFRB and TEK, between LYVE1 and the genes NRP1, PDGFRB, and TEK, and between SERPINF1 and the genes NRP1, PDGFRB, TEK, and LYVE1. Nine strong positive correlations were identified within TA tissue, the strongest between CXCL12 and the genes SERPINF1 and TIMP2, between TIMP2 and the genes SERPINF1 and TIMP3, and between FBLN5 and SERPINF1. CSF3 was negatively correlated with the majority of the other genes, but no relationship reached r > 0.85.
Remarkably, when LVI was present, there was also a significant correlation between T versus TA tissue gene expression. We identified eight strong correlations between NRP1 in TA and the following genes in T: TIMP3, ENPP2, FGF2, NRP1, PDGFRB and TEK; and between PDGFRB in TA and the following genes in T: ENPP2 and FGF2. However, these results should be considered cautiously since the number of patients with LVI was low (n = 8).

3.3. Machine Learning Modelling Succeeded in Creating a Relatively Robust EC-Grade Prediction Model Based on the Tumour Gene Expressions

3.3.1. Data Normalisation Results

The results of the gamma distribution fitting (α, β) and the corresponding correlation coefficients for individual variables for the TCGA are shown in Table S4, and the results of the distribution fitting (α, β) and the corresponding correlation coefficients for the study dataset are shown in Table S5.

3.3.2. Comparison of Training and Test Datasets

The training dataset contained 44 patients (75.9%), the testing dataset contained 14 patients (24.1%), and none of the 62 variables used in modelling significantly differed between the training and test datasets. Each variable’s Wilcoxon rank-sum statistic results are presented in Table S6.
The training dataset contained 22 records (50%) from the TCGA dataset, and 22 records from the study dataset (50%), and the test dataset contained 14 records (100%) from the study dataset. Twenty-two records (50%) represented low-grade EC in the training dataset, and ten represented low-grade EC in the test dataset (71.4%).

3.3.3. Modelling Results

Of the three created models, all models showed a good prediction of the EC tumour grade (low/high) on the training dataset (AUC > 0.9), and two of the models kept good prediction capabilities on the holdout (test) data, including one of the models maintaining an AUC above 0.9. The ROC curves are presented in Figure 7, with the left (7A) figure representing the ROC curves for the training dataset and the right (7B) figure showing the ROC curves for the test dataset.
As shown in Table 6, all primary metrics for the EC tumour grade classification for all three models are high on the training data, with the accuracy, precision, F1, and specificity for all three models reaching or exceeding 90%, with AUC scores near 1.00. The remaining metrics exceeded 80% for all models on the training dataset.
The primary metrics remained high on two models for EC tumour grade prediction; specifically, the models utilising the data from the tumour tissue (all data and normalised tumour data), as shown in Table 7. The best-performing model on the test dataset is based only on tumour data, reaching an accuracy above 85%, with 100% recall and sensitivity, 80% specificity, and 66.7% precision; the AUC of the model remained very high at 0.98. The second model that performed well was the model using all data, which also reached an accuracy above 85%, with a slightly better precision at 75%, but reduced recall (75%) and F1 score, and a lower AUC score of 0.78. The model utilising only the tumour-adjacent tissue performed similarly to a random model, with an accuracy of 50%.
The confusion matrices for the EC tumour grade prediction on the test dataset, which form the basis for the metrics in Table 7, are presented in Figure 8 below. According to Fisher’s exact statistics, the confusion matrices for all data and normalised tumour models are statistically significant (all data: p < 0.05; tumour: p < 0.05). The confusion matrix for the model using the adjacent tissue data is not significant.

4. Discussion

In this study, we evaluated the expression of 15 genes known for their involvement in angiogenic processes, 9 of which were selected from publicly available libraries, TCGA [19,20,21] and CPTAC UCEC [22], out of 91 different angiogenesis-related genes and 64 encoded proteins, respectively, based on their differential detection in EC T tissue in comparison to morphologically normal adjacent endometrial tissue. Those nine genes were the following: CXCL12, ENPP2, FBLN5, FGF2, LYVE1, PDGFRB, SERPINF1, TIMP2, and TIMP3. Six genes, CSF3, IL8, LEP, NRP1, TEK, and FST, were preselected in our previous research [14,18] on plasma samples from EC patients. To the best of our knowledge, this is the largest set of genes involved in angiogenesis analysed in EC tissue.
Using a clinical cohort of 36 EC patients, we confirmed the difference between T and TA tissue expression in EC for thirteen out of fifteen analysed genes involved in angiogenesis. According to TCGA data, all nine genes selected from data libraries are down-regulated in T versus TA tissue, with an expression ratio range from −17.5 (FGF2) to −3.5 (PDGFRB). We observed similar T-to-TA expression ratios for individual genes in the clinical cohort. Out of six additional genes, LEP and IL8 were significantly up-regulated, and TEK was significantly down-regulated in T tissue, which is in accordance with our previous research, where plasma levels of EC patients were compared to those of control patients with benign gynaecological pathology [14,18]. In contrast to our plasma research, in the present tissue-based study, NRP1 was significantly down-regulated, and the differences in T versus TA expression for CSF3 and FST were insignificant.
Interestingly, all the genes selected from TCGA were down-regulated in T tissue compared to TA tissue (Figure 1A), highlighting the importance of the antiangiogenic lever of the angiogenic switch in the endometrial tissue. The endometrial tissue’s uniqueness is its cyclic exposure to extensive hormonal changes. During each monthly cycle, ovarian hormones trigger angiogenic processes and endometrium regeneration, followed by blood vessel loss. Endometrial tissue thus produces both pro- and anti-AFs [11,37]. The levels of various main AF groups, like angiopoietins, VEGFs, and MMPs, fluctuate during different menstrual cycle phases [38,39]. EC, however, occurs mainly in postmenopausal women where the expression of AF is not exposed to altered concentrations of ovarian hormones. When screening TCGA library data in EC, genes encoding these AFs were not differentially expressed between T and TA tissue, while on the other hand, genes for their endogenous inhibitors, i.e., TEK (binding to angiopoietins), NRP1, PDGFRB, FBLN5, SERPINF1 (receptors and inhibitors of VEGF), TIMP2, and TIMP3 (inhibitors of MMPs) were substantially down-regulated in T versus TA tissue. While pro-angiogenic factors promote angiogenesis during the normal monthly menstrual cycle [38], our data suggest that tumour angiogenesis in EC is promoted mainly by the decreased gene expression of various antiangiogenic factors.
In our study, CXCL12 was the most down-regulated gene in all analysed strata: it was 18.2-fold down-regulated in all EC patients and 20.9-fold, 21.9-fold, 20.6-fold, and 21.7-fold down-regulated, respectively, in low-grade EC, in low stage EC, in DMI-absent EC, and LVI-absent EC patients. CXCL12 is a chemokine that plays a critical role as a chemoattractant in the tumour niche. Its secretion by myofibroblasts stimulates tumour progression [40]. The CXCL12/CXCR4 axis plays a vital role in endometrial cancer’s proliferation, invasion, and metastasis [41]. CXCL12 primarily binds to its receptor CXCR4 to regulate the trafficking of both normal and malignant cells. In a paracrine manner, CXCL12 attracts CXCR4-expressing tumour cells to a new tumour niche, resulting in tumour cell invasion and metastasis [42,43]. CXCL12/CXCR4 also has essential roles in the muscular infiltration of endometrial cancer by activating the PI3K/Akt signalling pathway [44]. Different authors have reported that CXCL12/CXCR4 expression in human EC tissues was inversely related to the histological grade, whereas survival rates were significantly better in patients with higher levels of CXCR4 [45].
EC is a heterogeneous disease; different angiogenic mechanisms are expressed during different endometrial cancer phases. We stratified patients according to their clinical and histopathological characteristics. Similar observations were reached in all analyses. In patients with prognostically more favourable forms of EC—in less advanced stages and lower grades of EC, genes for AFs tended to be differentially expressed in T tissue compared to non-cancerous TA tissue. On the contrary, differences in gene expressions are less prominent in more advanced or aggressive forms of EC. Likewise, stratification according to the presence or absence of deep myometrial or lymphovascular invasion identified differentially expressed genes only in endometrial tissue from patients without DMI or LVI, whereas no significant difference was detected in gene expression in EC patients with DMI or LVI. The groups of patients with FIGO IB-IV, high-grade EC, patients with LVI, and patients with DMI included in our study were small; thus, data obtained in these groups must be considered cautiously, and additional studies in larger groups of patients are needed.
Nevertheless, the finding concurs with published data on various genes coding for AFs. The loss of TIMP3 correlated with advanced-stage disease and poor prognosis in various cancers [46]. A decreasing expression of TIMP-2 in EC tissue was correlated with the histological grade of EC, with the level of myometrial invasion, lymphovascular space involvement, and lymph node involvement [47].
We also examined the co-expressions of analysed genes within T and TA tissue in EC patients stratified according to LVI status, the cornerstone of risk stratification in EC [7]. Curiously, we identified a drastically higher number of strongly correlated mRNA expressions in both T and TA tissue when LVI was present. There was also a significant correlation between eight gene expressions in T versus TA tissue, which was not observed in the absence of LVI. Primarily, in T tissue and in LVI-positive samples, numerous strong correlations were found between the expression of SERPINF1 and other angiogenesis-related genes, which has not been acknowledged before in the STRING database. Due to the low number of samples with LVI, these findings should be additionally analysed in a larger cohort. SERPINF1 is a known antiangiogenic factor with many additional functions like anti-tumour, anti-inflammation, nutrition, and nerve protection functions, and is involved in fat metabolism. In various cancers, including EC, the expression of SERPINF1 is lower in tumour tissue than in control tissue [48]. It is expressed in the normal and cancerous endometrium, and its loss of expression is associated with endometrial hyperplasia, a precursor for EC, and increased EC proliferation [49].
Next, we stratified EC patients by menopausal status. Six genes were significantly down-regulated in the reproductive age group and eleven in postmenopausal women. The diversity of genes supporting angiogenesis in EC in younger populations was much lower than in postmenopausal patients. All the down-regulated genes in premenopausal patients were, with even higher significance, down-regulated in postmenopausal patients, while six additional genes were down-regulated in the latter group. This indicates broader angiogenesis-related gene involvement in postmenopausal women with EC, which is in accordance with findings of an age- and menopausal-status-related increase in somatic mutation frequency across many tumour types [50,51]. One of the genes with the most reduced expression in both menopausal status subgroups was tissue inhibitor of metalloproteinases, TIMP3, the silencing of which is consistently associated with cancer progression or poor patient prognosis in multiple human cancers, including EC. TIMP3 promoter is a frequently targeted methylation site, and its epigenetic silencing indicates a pro-tumorigenic outcome [46,52].
EC often involves patients with other comorbidities, like obesity, an established risk factor for EC [53,54,55,56,57]. Leptin, an adipokine encoded by the LEP gene, has an important role in energy balance and glucose metabolism. It plays an integral part in the link between obesity and EC, where the tyrosine kinase-dependent intracellular pathway promotes angiogenesis during cancer development [58,59,60]. There is a continuous debate with contradicting results on whether the effect of leptin on EC risk is related to higher BMI or whether it is an independent risk factor for EC [61,62,63]. In our previous research, we used an automated machine learning approach through which we showed that in univariate and multivariate models, leptin might predict EC better than BMI [14]. This supports findings that leptin might be involved in EC development through mechanisms beyond obesity-related pathophysiology, including through angiogenesis [59,60]. The overexpression of LEP in endometrioid EC compared to benign control patients was reported before [64]. Indeed, in our study, the expression of the LEP gene was significantly elevated in tumour tissue compared to patient-paired, morphologically normal, adjacent endometrial tissue (mean FR = 4.68; 95% CI 2.87–6.49; p = 0.0104; Table 4), which directly supports the hypothesis of leptin’s independent role in EC carcinogenesis.
Besides leptin, IL8 mRNA expression was up-regulated in T versus TA tissue (mean FR = 4.75; 95% CI 2.46–7.04; p = 0.0164), which supports our previous findings in plasma [14]. IL-8 is a pro-inflammatory cytokine secreted by adipocytes and represents another link between adipose tissue and EC [65,66]. It is chemotactic for lymphocytes and neutrophils and has an important role in angiogenesis [67,68]. Elevated IL-8 serum levels were independently associated with shorter disease-free and overall survival in EC cancer [69]. The protein–protein interaction analysis using the STRING database [33] revealed strong interaction and gene co-expression between IL8 and CSF3 (Figure 1D). Our data confirmed this interaction, but only within T tissue in EC patients with LVI present (r = 0.881; p = 7.2 × 10−3; Table 5).
Fertility-sparing treatment is considered for endometrioid patients with endometrial carcinoma of a low grade [8]. In order to verify whether the selected AF levels in the tumour or adjacent tissue could be used for EC grade stratification, we developed three different models utilising automated machine learning approaches using the differentially expressed genes identified on the TCGA dataset. While all models received near-perfect scores on the training dataset, the model based on the TA data did not generalise well (achieved random scores on the test data), which was probably caused by overfitting, as the sample was very small. However, the model based on the T tissue data retained excellent scores on the test data, retaining AUC at the 0.9 level, with a sensitivity of 100% and specificity of 80%. Caution is always required when interpreting results on such small datasets, but further multicentre validation studies, including one specifically for reproductive-age women, would probably be warranted, given the statistical significance of the result on the test dataset.
Our data indicate that the regulation of angiogenesis-related genes in EC with prognostically less favourable characteristics is not limited to T tissue alone but rather spreads onto the non-cancerous TA tissue of the surrounding endometrium. This is in accordance with the fact that TA tissues are often involved in the development and progression of the tumour [70,71,72]. TA tissue is a distinct tissue type that presents a unique intermediate state between healthy tissue and tumour tissue [72]. Several studies suggest that TA tissue may offer helpful information for predicting disease prognosis [73]. This may be due to either (1) field cancerisation theory, which suggests that paired TA tissues are in an intermediate state between normal and tumour, thus bearing information on early tumour initialisation and development, or (2) tumour microenvironment theory, which suggests that TA tissues contain information about the microenvironment surrounding tumours, which either promotes or suppresses tumour development [73]. There is increasing evidence that a single genetic mutation is insufficient to initiate the disease and that microenvironment-derived signals may be required to drive tumour progression. The neoplastic and non-neoplastic cells in the microenvironment communicate to produce a microenvironment favourable for the progression of endometrial carcinogenesis [43,45]. According to different lines of evidence, genomic data from non-cancerous TA tissue can independently predict cancer survival, and, in some cases, provide even superior performance relative to models based on tumour-derived data alone [71,73,74]. We found fewer differences between angiogenesis-related gene expression in T versus TA tissue in higher stages and grades of EC, indicating that the progression of the tumour was not only related to the expression of AFs in the tumorous tissue but also to the expression in the TA tissue samples. While our machine learning modelling did not confirm the ability to use TA tissue for cancer grade classification, it is worth stressing that the models for TA did overfit due to sample size limitations, and it is therefore impossible to draw direct conclusions from the result.
Additionally, in order to be able to model the combined data from the TCGA dataset and the data from our study, a novel approach to data merging (and normalisation) was utilised, building upon previous ideas on combining different datasets [25,28]. As the results are promising, a further study to identify whether the method could be generalised might be interesting.
It is again important to stress that the overall sample size used in the study is a limitation and results should be interpreted with that in mind. This could be addressed in the future by conducting a study in a larger group of patients, ideally in multiple geographically dispersed centres using the same clinical protocol.

5. Conclusions

We showed that angiogenesis in EC is promoted mainly by the decreased gene expression of antiangiogenic factors. Our data also indicate that the regulation of angiogenesis-related genes in EC with prognostically less favourable characteristics, i.e., higher cancer stage or grade, or the presence of LVI or DMI, is affected not only in T but also in the TA tissue of the surrounding endometrium, where gene expression is altered inside morphologically normal cells within the tumour microenvironment. However, we identified stronger gene co-expressions in T than in TA tissue; correlations were particularly strong when the lymphovascular invasion was present. We also confirmed broader AF gene involvement in postmenopausal compared to premenopausal women with EC. Additionally, by combining TCGA data and data from our study, we applied machine learning modelling to create a relatively robust model, able to differentiate between low-grade and high-grade EC based on the T gene expressions, which might be helpful in fertility-sparing settings in EC patients.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/cancers15143661/s1, Table S1: Normalized expression of angiogenesis-associated genes in endometrial cancer tissues and adjacent control tissues; data from the GDC TCGA Endometrioid Cancer (UCEC) study; Table S2: Normalized levels of angiogenesis-associated proteins in endometrial cancer tissues and adjacent control tissues; data from the CPTAC UCEC Discovery Study—Proteome; Table S3: Expression of angiogenesis-associated genes in endometrial cancer tissue (n = up to 527). Data are from the TCGA Pan-Cancer study; Table S4: The calculated distribution fit correlation with the original data, including the p-values and estimated upper and lower bounds for the 95% confidence intervals for the TCGA dataset; Table S5: The calculated distribution fit correlation with the original data, including the p-values and estimated upper and lower bounds for the 95% confidence intervals for the study dataset; Table S6: Comparison between the training and test datasets using the Wilcoxon rank-sum Test; Figure S1: Volcano plots identifying significant changes in gene expression within the tumour and tumour-adjacent tissue separately, in EC patients stratified according to the presence of LVI or DMI, EC grade, FIGO stage, and menopausal status.

Author Contributions

Conceptualization, T.L.R. and L.R.; methodology, R.P. and M.K.; software, M.K.; analysis, R.P., I.R., M.K. and L.R.; resources, Š.S. and T.L.R.; data curation, L.R., I.R. and M.K.; writing—original draft preparation, L.R., I.R. and M.K.; writing—review and editing, T.L.R. and Š.S.; visualization, I.R. and M.K.; supervision, T.L.R.; funding acquisition, T.L.R. and Š.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by two grants: the Slovenian Research Agency grants J3-2535 to T.L.R.; and the University Medical Centre Ljubljana tertiary project TP 202110160 to Š.S.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the National Medical Ethics Committee of the Republic of Slovenia (ID 0120-429/2017/8, 5 November 2017).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The datasets used and/or analysed during the current study are available from the corresponding authors on reasonable request. Machine learning modelling: the complete MATLAB, Jupyter Notebook, and Python scripts are available here: https://github.com/klokedm/EndometrialCancerGradePrediction.

Acknowledgments

The authors thank all patients included in this study.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Crosbie, E.J.; Kitson, S.J.; McAlpine, J.N.; Mukhopadhyay, A.; Powell, M.E.; Singh, N. Endometrial cancer. Lancet 2022, 399, 1412–1428. [Google Scholar] [CrossRef] [PubMed]
  2. Zhang, S.; Gong, T.T.; Liu, F.H.; Jiang, Y.T.; Sun, H.; Ma, X.X.; Zhao, Y.H.; Wu, Q.J. Global, Regional, and National Burden of Endometrial Cancer, 1990–2017: Results from the Global Burden of Disease Study, 2017. Front. Oncol. 2019, 9, 1440. [Google Scholar] [CrossRef] [PubMed]
  3. Lee, N.K.; Cheung, M.K.; Shin, J.Y.; Husain, A.; Teng, N.N.; Berek, J.S.; Kapp, D.S.; Osann, K.; Chan, J.K. Prognostic factors for uterine cancer in reproductive-aged women. Obstet. Gynecol. 2007, 109, 655–662. [Google Scholar] [CrossRef] [PubMed]
  4. Wilczyński, M.; Danielska, J.; Wilczyński, J. An update of the classical Bokhman’s dualistic model of endometrial cancer. Przegląd Menopauzalny 2016, 15, 63–68. [Google Scholar] [CrossRef]
  5. Murali, R.; Soslow, R.A.; Weigelt, B. Classification of endometrial carcinoma: More than two types. Lancet Oncol. 2014, 15, e268–e278. [Google Scholar] [CrossRef]
  6. Cancer Genome Atlas Research Network; Kandoth, C.; Schultz, N.; Cherniack, A.D.; Akbani, R.; Liu, Y.; Shen, H.; Robertson, A.G.; Pashtan, I.; Shen, R.; et al. Integrated genomic characterization of endometrial carcinoma. Nature 2013, 497, 67–73. [Google Scholar]
  7. Concin, N.; Matias-Guiu, X.; Vergote, I.; Cibula, D.; Mirza, M.R.; Marnitz, S.; Ledermann, J.; Bosse, T.; Chargari, C.; Fagotti, A.; et al. ESGO/ESTRO/ESP guidelines for the management of patients with endometrial carcinoma. Int. J. Gynecol. Cancer 2021, 31, 12–39. [Google Scholar] [CrossRef]
  8. Rodolakis, A.; Scambia, G.; Planchamp, F.; Acien, M.; Di Spiezio Sardo, A.; Farrugia, M.; Grynberg, M.; Pakiž, M.; Pavlakis, K.; Vermeulen, N.; et al. ESGO/ESHRE/ESGE Guidelines for the fertility-sparing treatment of patients with endometrial carcinoma. Int. J. Gynecol. Cancer 2023, 33, 208–222. [Google Scholar] [CrossRef]
  9. Helpman, L.; Kupets, R.; Covens, A.; Saad, R.S.; Khalifa, M.A.; Ismiil, N.; Ghorab, Z.; Dubé, V.; Nofech-Mozes, S. Assessment of endometrial sampling as a predictor of final surgical pathology in endometrial cancer. Br. J. Cancer 2014, 110, 609–615. [Google Scholar] [CrossRef] [Green Version]
  10. Risau, W. Mechanisms of angiogenesis. Nature 1997, 386, 671–674. [Google Scholar] [CrossRef]
  11. Abulafia, O.; Sherer, D.M. Angiogenesis of the endometrium. Obstet. Gynecol. 1999, 94, 148–153. [Google Scholar] [PubMed]
  12. Carmeliet, P.; Jain, R.K. Angiogenesis in cancer and other diseases. Nature 2000, 407, 249–257. [Google Scholar] [CrossRef] [PubMed]
  13. Roškar, L.; Roškar, I.; Rižner, T.L.; Smrkolj, Š. Diagnostic and Therapeutic Values of Angiogenic Factors in Endometrial Cancer. Biomolecules 2021, 12, 7. [Google Scholar] [CrossRef] [PubMed]
  14. Roškar, L.; Pušić, M.; Roškar, I.; Kokol, M.; Pirš, B.; Smrkolj, Š.; Rižner, T.L. Models including preoperative plasma levels of angiogenic factors, leptin and IL-8 as potential biomarkers of endometrial cancer. Front. Oncol. 2022, 12, 972131. [Google Scholar] [CrossRef] [PubMed]
  15. Karkia, R.; Wali, S.; Payne, A.; Karteris, E.; Chatterjee, J. Diagnostic Accuracy of Liquid Biomarkers for the Non-Invasive Diagnosis of Endometrial Cancer: A Systematic Review and Meta-Analysis. Cancers 2022, 14, 4666. [Google Scholar] [CrossRef] [PubMed]
  16. Obradović, D.D.; Milić, N.M.; Miladinović, N.; McClements, L.; Oprić, D.M. Loss of Expression of Antiangiogenic Protein FKBPL in Endometrioid Endometrial Carcinoma: Implications for Clinical Practice. Medicina 2022, 58, 1330. [Google Scholar] [CrossRef] [PubMed]
  17. Kokol, P.; Kokol, M.; Zagoranski, S. Machine learning on small size samples: A synthetic knowledge synthesis. Sci. Prog. 2022, 105, 00368504211029777. [Google Scholar] [CrossRef]
  18. Roškar, L.; Klančič, T.; Knific, T.; Rižner, T.L.; Smrkolj, Š. Tie-2, G-CSF, and Leptin as Promising Diagnostic Biomarkers for Endometrial Cancer: A Pilot Study. J. Clin. Med. 2021, 10, 765. [Google Scholar] [CrossRef]
  19. Gao, J.; Aksoy, B.A.; Dogrusoz, U.; Dresdner, G.; Gross, B.; Sumer, S.O.; Sun, Y.; Jacobsen, A.; Sinha, R.; Larsson, E.; et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci. Signal 2013, 6, pl1. [Google Scholar] [CrossRef] [Green Version]
  20. Weinstein, J.N.; Collisson, E.A.; Mills, G.B.; Mills Shaw, K.R.; Ozenberger, B.A.; Ellrott, K.; Shmulevich, I.; Sander, C.; Stuart, J.M. The Cancer Genome Atlas Pan-Cancer analysis project. Nat. Genet. 2013, 45, 1113–1120. [Google Scholar] [CrossRef]
  21. Goldman, M.J.; Craft, B.; Hastie, M.; Repečka, K.; McDade, F.; Kamath, A.; Banerjee, A.; Luo, Y.; Rogers, D.; Brooks, A.N.; et al. Visualizing and interpreting cancer genomics data via the Xena platform. Nat. Biotechnol. 2020, 38, 675–678. [Google Scholar] [CrossRef] [PubMed]
  22. Dou, Y.; Kawaler, E.A.; Cui Zhou, D.; Gritsenko, M.A.; Huang, C.; Blumenberg, L.; Karpova, A.; Petyuk, V.A.; Savage, S.R.; Satpathy, S.; et al. Proteogenomic Characterization of Endometrial Carcinoma. Cell 2020, 180, 729–748. [Google Scholar] [CrossRef] [PubMed]
  23. Pavlič, R.; Vidic, S.; Anko, M.; Knific, T.; Büdefeld, T.; Marton, K.; Sinreih, M.; Poschner, S.; Jäger, W.; Frković-Grazio, S.; et al. Altered Profile of E1-S Transporters in Endometrial Cancer: Lower Protein Levels of ABCG2 and OSTβ and Up-Regulation of SLCO1B3 Expression. Int. J. Mol. Sci. 2021, 22, 3819. [Google Scholar] [CrossRef] [PubMed]
  24. Bustin, S.A.; Benes, V.; Garson, J.A.; Hellemans, J.; Huggett, J.; Kubista, M.; Mueller, R.; Nolan, T.; Pfaffl, M.W.; Shipley, G.L.; et al. The MIQE Guidelines: Minimum Information for Publication of Quantitative Real-Time PCR Experiments. Clin. Chem. 2009, 55, 611–622. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Song, K.; Zhou, Y.-H. Leveraging Scheme for Cross-Study Microbiome Machine Learning Prediction and Feature Evaluations. Bioengineering 2023, 10, 231. [Google Scholar] [CrossRef]
  26. Lee, A.J.; Park, Y.S.; Doing, G.; Hogan, D.A.; Greene, C.S. Correcting for experiment-specific variability in expression compendia can remove underlying signals. GigaScience 2020, 9, giaa117. [Google Scholar] [CrossRef]
  27. Butler, A.; Hoffman, P.; Smibert, P.; Papalexi, E.; Satija, R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat. Biotechnol. 2018, 36, 411–420. [Google Scholar] [CrossRef]
  28. Friedman, N.; Cai, L.; Xie, X.S. Linking stochastic dynamics to population distribution: An analytical framework of gene expression. Phys. Rev. Lett. 2006, 97, 168302. [Google Scholar] [CrossRef] [Green Version]
  29. Deisenroth, M.P.; Faisal, A.A.; Ong, C.S. Mathematics for Machine Learning; Cambridge University Press: London, UK, 2021. [Google Scholar]
  30. Piońska, A.; Pioński, P. MLJAR: State-of-the-art Automated Machine Learning Framework for Tabular Data. Version 0.10.3; MLJAR: Bialystok, Poland, 2021. [Google Scholar]
  31. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn Res. 2011, 12, 2825–2830. [Google Scholar]
  32. The Pandas Development Team. Pandas-Dev/Pandas: Pandas. Zenodo 2020, 21, 1–9. [Google Scholar]
  33. Szklarczyk, D.; Gable, A.L.; Nastou, K.C.; Lyon, D.; Kirsch, R.; Pyysalo, S.; Doncheva, N.T.; Legeay, M.; Fang, T.; Bork, P.; et al. The STRING database in 2021: Customizable protein–protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res. 2021, 49, D605. [Google Scholar] [CrossRef] [PubMed]
  34. Amant, F.; Mirza, M.R.; Koskas, M.; Creutzberg, C.L. Cancer of the corpus uteri. Int. J. Gynecol. Obstet. 2018, 143, 37–50. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Voss, M.A.; Ganesan, R.; Ludeman, L.; McCarthy, K.; Gornall, R.; Schaller, G.; Wei, W.; Sundar, S. Should grade 3 endometrioid endometrial carcinoma be considered a type 2 cancer-A clinical and pathological evaluation. Gynecol. Oncol. 2012, 124, 15–20. [Google Scholar] [CrossRef]
  36. Setiawan, V.W.; Yang, H.P.; Pike, M.C.; McCann, S.E.; Yu, H.; Xiang, Y.B.; Wolk, A.; Wentzensen, N.; Weiss, N.S.; Webb, P.M.; et al. Type I and II endometrial cancers: Have they different risk factors? J. Clin. Oncol. 2013, 31, 2607–2618. [Google Scholar] [CrossRef]
  37. Yetkin-Arik, B.; Kastelein, A.W.; Klaassen, I.; Jansen, C.H.J.R.; Latul, Y.P.; Vittori, M.; Biri, A.; Kahraman, K.; Griffioen, A.W.; Amant, F.; et al. Angiogenesis in gynecological cancers and the options for anti-angiogenesis therapy. Biochim. Biophys. Acta Rev. Cancer 2021, 1875, 188446. [Google Scholar] [CrossRef]
  38. Demir, R.; Yaba, A.; Huppertz, B. Vasculogenesis and angiogenesis in the endometrium during menstrual cycle and implantation. Acta Histochem. 2010, 112, 203–214. [Google Scholar] [CrossRef] [PubMed]
  39. Grzechocińska, B.; Dąbrowski, F.; Cyganek, A.; Panek, G.; Wielgoś, M. The role of metalloproteinases in endometrial remodelling during menstrual cycle. Ginekol. Pol. 2017, 88, 337–342. [Google Scholar] [CrossRef] [Green Version]
  40. Sun, X.; Cheng, G.; Hao, M.; Zheng, J.; Zhou, X.; Zhang, J.; Taichman, R.S.; Pienta, K.J.; Wang, J. CXCL12/CXCR4/CXCR7 chemokine axis and cancer progression. Cancer Metastasis Rev. 2010, 29, 709–722. [Google Scholar] [CrossRef] [Green Version]
  41. Liu, P.; Long, P.; Huang, Y.; Sun, F.; Wang, Z. CXCL12/CXCR4 axis induces proliferation and invasion in human endometrial cancer. Am. J. Transl. Res. 2016, 8, 1719. [Google Scholar]
  42. Gelmini, S.; Mangoni, M.; Castiglione, F.; Beltrami, C.; Pieralli, A.; Andersson, K.L.; Fambrini, M.; Taddei, G.L.; Serio, M.; Orlando, C. The CXCR4/CXCL12 axis in endometrial cancer. Clin. Exp. Metastasis 2009, 26, 261–268. [Google Scholar] [CrossRef]
  43. Sahoo, S.; Zhang, X.; Hondermarck, H.; Tanwar, P. The Emerging Role of the Microenvironment in Endometrial Cancer. Cancers 2018, 10, 408. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Guo, F.; Wang, Y.; Liu, J.; Mok, S.C.; Xue, F.; Zhang, W. CXCL12/CXCR4: A symbiotic bridge linking cancer cells and their stromal neighbors in oncogenic communication networks. Oncogene 2015, 35, 816–826. [Google Scholar] [CrossRef] [PubMed]
  45. Felix, A.S.; Weissfeld, J.; Edwards, R.; Linkov, F. Future directions in the field of endometrial cancer research: The need to investigate the tumor microenvironment. Eur. J. Gynaecol. Oncol. 2010, 31, 139–144. [Google Scholar] [PubMed]
  46. Jackson, H.W.; Defamie, V.; Waterhouse, P.; Khokha, R. TIMPs: Versatile extracellular regulators in cancer. Nat. Rev. Cancer 2017, 17, 38–53. [Google Scholar] [CrossRef]
  47. Graesslin, O.; Cortez, A.; Fauvet, R.; Lorenzato, M.; Birembaut, P.; Daraï, E. Metalloproteinase-2, -7 and -9 and tissue inhibitor of metalloproteinase-1 and -2 expression in normal, hyperplastic and neoplastic endometrium: A clinical-pathological correlation study. Ann. Oncol. 2006, 17, 637–645. [Google Scholar] [CrossRef]
  48. Zhang, C.; Yang, W.; Zhang, S.; Zhang, Y.; Liu, P.; Li, X.; Zhi, W.; Yang, D.; Li, M.; Lu, Y. Pan-cancer analysis of osteogenesis imperfecta causing gene SERPINF1. Intractable Rare Dis. Res. 2022, 11, 15–24. [Google Scholar] [CrossRef]
  49. Brook, N.; Brook, E.; Dass, C.R.; Chan, A.; Dharmarajan, A. Pigment Epithelium-Derived Factor and Sex Hormone-Responsive Cancers. Cancers 2020, 12, 3483. [Google Scholar] [CrossRef]
  50. Risques, R.A.; Kennedy, S.R. Aging and the rise of somatic cancer-associated mutations in normal tissues. PLoS Genet. 2018, 14, e1007108. [Google Scholar] [CrossRef] [Green Version]
  51. Milholland, B.; Auton, A.; Suh, Y.; Vijg, J. Age-related somatic mutations in the cancer genome. Oncotarget 2015, 6, 24627–24635. [Google Scholar] [CrossRef] [Green Version]
  52. Catasus, L.; Pons, C.; Muñoz, J.; Espinosa, I.; Prat, J. Promoter hypermethylation contributes to TIMP3 down-regulation in high stage endometrioid endometrial carcinomas. Histopathology 2013, 62, 632–641. [Google Scholar] [CrossRef]
  53. Shaw, E.; Farris, M.; McNeil, J.; Friedenreich, C. Obesity and Endometrial Cancer. In Obesity and Cancer. Recent Results in Cancer Research; Pischon, T., Nimptsch, K., Eds.; Springer: Cham, Switzerland, 2016; pp. 107–136. [Google Scholar]
  54. Reeves, G.K.; Pirie, K.; Beral, V.; Green, J.; Spencer, E.; Bull, D. Cancer incidence and mortality in relation to body mass index in the Million Women Study: Cohort study. Br. Med. J. 2007, 335, 1134–1139. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Onstad, M.A.; Schmandt, R.E.; Lu, K.H. Addressing the role of obesity in endometrial cancer risk, prevention, and treatment. J. Clin. Oncol. 2016, 34, 4225–4230. [Google Scholar] [CrossRef] [PubMed]
  56. Renehan, A.G.; Tyson, M.; Egger, M.; Heller, R.F.; Zwahlen, M. Body-mass index and incidence of cancer: A systematic review and meta-analysis of prospective observational studies. Lancet 2008, 371, 569–578. [Google Scholar] [CrossRef] [PubMed]
  57. Daley-Brown, D.; Oprea-Ilies, G.M.; Lee, R.; Pattillo, R.; Gonzalez-Perez, R.R. Molecular cues on obesity signals, tumor markers and endometrial cancer. Horm. Mol. Biol. Clin Investig. 2015, 21, 89–106. [Google Scholar] [CrossRef] [Green Version]
  58. Mantzoros, C.S.; Magkos, F.; Brinkoetter, M.; Sienkiewicz, E.; Dardeno, T.A.; Kim, S.Y.; Hamnvik, O.P.; Koniaris, A. Leptin in human physiology and pathophysiology. Am. J. Physiol. Metab. 2011, 301, E567–E584. [Google Scholar] [CrossRef] [Green Version]
  59. Bouloumié, A.; Drexler, H.C.; Lafontan, M.; Busse, R. Leptin, the product of Ob gene, promotes angiogenesis. Circ. Res. 1998, 83, 1059–1066. [Google Scholar] [CrossRef]
  60. Sierra-Honigmann, M.R.; Nath, A.K.; Murakami, C.; García-Cardeña, G.; Papapetropoulos, A.; Sessa, W.C.; Madge, L.A.; Schechner, J.S.; Schwabb, M.B.; Polverini, P.J.; et al. Biological action of leptin as an angiogenic factor. Science 1998, 281, 1683–1686. [Google Scholar] [CrossRef]
  61. Ellis, P.E.; Barron, G.A.; Bermano, G. Adipocytokines and their relationship to endometrial cancer risk: A systematic review and meta-analysis. Gynecol. Oncol. 2020, 158, 507–516. [Google Scholar] [CrossRef]
  62. Wang, P.P.; He, X.Y.; Wang, R.; Wang, Z.; Wang, Y.G. High leptin level is an independent risk factor of endometrial cancer: A meta-analysis. Cell Physiol. Biochem. 2014, 34, 1477–1484. [Google Scholar] [CrossRef] [Green Version]
  63. Hazelwood, E.; Sanderson, E.; Tan, V.Y.; Ruth, K.S.; Frayling, T.M.; Dimou, N.; Gunter, M.J.; Dossus, L.; Newton, C.; Ryan, N.; et al. Identifying molecular mediators of the relationship between body mass index and endometrial cancer risk: A Mendelian randomization analysis. BMC Med. 2022, 20, 125. [Google Scholar] [CrossRef]
  64. Boroń, D.; Nowakowski, R.; Grabarek, B.O.; Zmarzły, N.; Opławski, M. Expression pattern of leptin and its receptors in endometrioid endometrial cancer. J. Clin. Med. 2021, 10, 2787. [Google Scholar] [CrossRef] [PubMed]
  65. Kim, C.S.; Park, H.S.; Kawada, T.; Kim, J.H.; Lim, D.; Hubbard, N.E.; Kwon, B.S.; Erickson, K.L.; Yu, R. Circulating levels of MCP-1 and IL-8 are elevated in human obese subjects and associated with obesity-related parameters. Int. J. Obes. 2006, 30, 1347–1355. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  66. Ciortea, R.; Mihu, D.; Mihu, C.M. Association between visceral fat, IL-8 and endometrial cancer. Anticancer Res. 2014, 34, 379–383. [Google Scholar] [PubMed]
  67. Koch, A.E.; Polverini, P.J.; Kunkel, S.L.; Harlow, L.A.; DiPietro, L.A.; Elner, V.M.; Elner, S.G.; Strieter, R.M. Interleukin-8 as a Macrophage-Derived Mediator of Angiogenesis. Science 1992, 258, 1798–1801. [Google Scholar] [CrossRef] [PubMed]
  68. Fujimoto, J.; Aoki, I.; Khatun, S.; Toyoki, H.; Tamaya, T. Clinical implications of expression of interleukin-8 related to myometrial invasion with angiogenesis in uterine endometrial cancers. Ann. Oncol. 2002, 13, 430–434. [Google Scholar] [CrossRef]
  69. Kotowicz, B.; Fuksiewicz, M.; Jonska-Gmyrek, J.; Berezowska, A.; Radziszewski, J.; Bidzinski, M.; Kowalska, M. Clinical significance of pretreatment serum levels of VEGF and its receptors, IL- 8, and their prognostic value in type I and II endometrial cancer patients. PLoS ONE 2017, 12, e0184576. [Google Scholar] [CrossRef] [Green Version]
  70. Raudenska, M.; Sztalmachova, M.; Gumulec, J.; Fojtu, M.; Polanska, H.; Balvan, J.; Feith, M.; Binkova, H.; Horakova, Z.; Kostrica, R.; et al. Prognostic significance of the tumour-adjacent tissue in head and neck cancers. Tumor. Biol. 2015, 36, 9929–9939. [Google Scholar] [CrossRef]
  71. Kulinczak, M.; Sromek, M.; Panek, G.; Zakrzewska, K.; Lotocka, R.; Szafron, L.M.; Chechlinska, M.; Siwicki, J.K. Endometrial Cancer-Adjacent Tissues Express Higher Levels of Cancer-Promoting Genes than the Matched Tumors. Genes 2022, 13, 1611. [Google Scholar] [CrossRef]
  72. Aran, D.; Camarda, R.; Odegaard, J.; Paik, H.; Oskotsky, B.; Krings, G.; Goga, A.; Sirota, M.; Butte, A.J. Comprehensive analysis of normal adjacent to tumor transcriptomes. Nat. Commun. 2017, 8, 1077. [Google Scholar] [CrossRef] [Green Version]
  73. Huang, X.; Stern, D.F.; Zhao, H. Transcriptional Profiles from Paired Normal Samples Offer Complementary Information on Cancer Patient Survival—Evidence from TCGA Pan-Cancer Data. Sci. Rep. 2016, 6, 20567. [Google Scholar] [CrossRef] [Green Version]
  74. Frost, H.R. Analyzing cancer gene expression data through the lens of normal tissue-specificity. PLoS Comput. Biol. 2021, 17, e1009085. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Selection of angiogenesis-related genes in the study. Volcano plot visualising fold change (FC) and the corresponding p-values of (A) normalised mRNA (data are from the GDC TCGA Endometrioid Cancer (UCEC) study, downloaded from UCSC Xena server [21]). Paired samples, n = up to 23; Wilcoxon matched-pairs signed rank test with Bonferroni–Šidák corrections for multiple comparisons, and (B) angiogenesis-associated proteins in tumour tissue versus control tissue (data are from the CPTAC UCEC Discovery Study—Proteome, PDC ID: PDC000125 [22]. Paired samples, n = up to 24; Wilcoxon matched-pairs signed rank test with Bonferroni–Šidák corrections for multiple comparisons. Vertical lines: log2FC cut-off values in a selection protocol; red dots: genes/proteins that reach more than 2- or 3-fold significant difference with adjusted p < 0.01 as a criterion for further evaluation. (C) Venn-diagram of a selection process, and 21 genes with more than 3-fold expression change in tumour versus adjacent tissue, and 22 proteins with more than 2-fold level change in tumour versus adjacent tissue. Nine proteins and their encoding genes fulfilled both criteria simultaneously and were chosen for further validation using a clinical cohort. Genes encoding six proteins from our previous research [14,18] were added, leading to further analysis of 15 genes. (D) Analysis of protein–protein interactions from the STRING database for association networks [33]. Several known (from curated databases) and predicted interactions (based on gene co-occurrence, co-expression, and gene homology) are shown; the line thickness indicates the strength of data support.
Figure 1. Selection of angiogenesis-related genes in the study. Volcano plot visualising fold change (FC) and the corresponding p-values of (A) normalised mRNA (data are from the GDC TCGA Endometrioid Cancer (UCEC) study, downloaded from UCSC Xena server [21]). Paired samples, n = up to 23; Wilcoxon matched-pairs signed rank test with Bonferroni–Šidák corrections for multiple comparisons, and (B) angiogenesis-associated proteins in tumour tissue versus control tissue (data are from the CPTAC UCEC Discovery Study—Proteome, PDC ID: PDC000125 [22]. Paired samples, n = up to 24; Wilcoxon matched-pairs signed rank test with Bonferroni–Šidák corrections for multiple comparisons. Vertical lines: log2FC cut-off values in a selection protocol; red dots: genes/proteins that reach more than 2- or 3-fold significant difference with adjusted p < 0.01 as a criterion for further evaluation. (C) Venn-diagram of a selection process, and 21 genes with more than 3-fold expression change in tumour versus adjacent tissue, and 22 proteins with more than 2-fold level change in tumour versus adjacent tissue. Nine proteins and their encoding genes fulfilled both criteria simultaneously and were chosen for further validation using a clinical cohort. Genes encoding six proteins from our previous research [14,18] were added, leading to further analysis of 15 genes. (D) Analysis of protein–protein interactions from the STRING database for association networks [33]. Several known (from curated databases) and predicted interactions (based on gene co-occurrence, co-expression, and gene homology) are shown; the line thickness indicates the strength of data support.
Cancers 15 03661 g001
Figure 2. Expression of genes encoding 15 angiogenic factors in 36 paired samples of EC (T) and tumour-adjacent (TA) tissue. Data were analysed using the Wilcoxon matched-pairs signed rank test with Bonferroni–Šidák corrections for multiple comparisons. Fold regulation (FR) is presented as a mean of pairwise T/TA expression ratios or their negative inverse values. * p-value ≤ 0.05, ** p-value ≤ 0.01, ns—not significant.
Figure 2. Expression of genes encoding 15 angiogenic factors in 36 paired samples of EC (T) and tumour-adjacent (TA) tissue. Data were analysed using the Wilcoxon matched-pairs signed rank test with Bonferroni–Šidák corrections for multiple comparisons. Fold regulation (FR) is presented as a mean of pairwise T/TA expression ratios or their negative inverse values. * p-value ≤ 0.05, ** p-value ≤ 0.01, ns—not significant.
Cancers 15 03661 g002
Figure 3. Expression of angiogenesis-related genes in EC patients stratified based on the cancer grade and stage. Expression of genes encoding 15 angiogenic factors in paired tissue samples from patients with (A) low-grade EC (n = 26) and (B) high-grade EC (n = 10); tumour tissue is shown in dark blue and tumour-adjacent tissue in light blue colour. Expression of genes in paired tissue samples from (C) patients with stage IA EC (n = 25) and (D) stage IB–IV EC (n = 11); tumour tissue is shown in dark green, and tumour-adjacent tissue is in light green colour. Wilcoxon matched pairs signed rank tests with Bonferroni–Šidák corrections for multiple comparisons. Data are shown as scattered dot plots with marked means with 95% CI, * p-value ≤ 0.05, ** p-value ≤ 0.01, *** p-value ≤ 0.001, **** p-value ≤ 0.0001.
Figure 3. Expression of angiogenesis-related genes in EC patients stratified based on the cancer grade and stage. Expression of genes encoding 15 angiogenic factors in paired tissue samples from patients with (A) low-grade EC (n = 26) and (B) high-grade EC (n = 10); tumour tissue is shown in dark blue and tumour-adjacent tissue in light blue colour. Expression of genes in paired tissue samples from (C) patients with stage IA EC (n = 25) and (D) stage IB–IV EC (n = 11); tumour tissue is shown in dark green, and tumour-adjacent tissue is in light green colour. Wilcoxon matched pairs signed rank tests with Bonferroni–Šidák corrections for multiple comparisons. Data are shown as scattered dot plots with marked means with 95% CI, * p-value ≤ 0.05, ** p-value ≤ 0.01, *** p-value ≤ 0.001, **** p-value ≤ 0.0001.
Cancers 15 03661 g003
Figure 4. Expression of angiogenesis-related genes in EC patients stratified based on the presence of deep myometrial (DMI) or lymphovascular invasion (LVI). Expression of genes encoding 15 angiogenic factors in paired tissue samples from (A) patients without DMI (n = 27) and (B) from patients with present DMI (n = 9); tumour tissue is shown in dark purple and tumour-adjacent tissue in light purple colour. Expression of genes in paired tissue samples from (C) patients without LVI (n = 28) and (D) from patients with present LVI (n = 8). Tumour tissue is shown in dark pink, and the adjacent tissue is in light pink. Wilcoxon matched pairs signed rank tests with Bonferroni–Šidák corrections for multiple comparisons. Data are shown as scattered dot plots with marked means with 95% CI, * p-value ≤ 0.05, ** p-value ≤ 0.01, *** p-value ≤ 0.001, **** p-value ≤ 0.0001.
Figure 4. Expression of angiogenesis-related genes in EC patients stratified based on the presence of deep myometrial (DMI) or lymphovascular invasion (LVI). Expression of genes encoding 15 angiogenic factors in paired tissue samples from (A) patients without DMI (n = 27) and (B) from patients with present DMI (n = 9); tumour tissue is shown in dark purple and tumour-adjacent tissue in light purple colour. Expression of genes in paired tissue samples from (C) patients without LVI (n = 28) and (D) from patients with present LVI (n = 8). Tumour tissue is shown in dark pink, and the adjacent tissue is in light pink. Wilcoxon matched pairs signed rank tests with Bonferroni–Šidák corrections for multiple comparisons. Data are shown as scattered dot plots with marked means with 95% CI, * p-value ≤ 0.05, ** p-value ≤ 0.01, *** p-value ≤ 0.001, **** p-value ≤ 0.0001.
Cancers 15 03661 g004
Figure 5. Expression of angiogenesis-related genes in paired tissue samples from (A) premenopausal (n = 11) and from (B) postmenopausal EC patients (n = 25). Tumour tissue is shown in dark red, and tumour-adjacent tissue is in orange colour. Wilcoxon matched pairs signed rank tests with Bonferroni–Šidák corrections for multiple comparisons. Data are shown as scattered dot plots with marked means with 95% CI, * p-value ≤ 0.05, *** p-value ≤ 0.001, **** p-value ≤ 0.0001.
Figure 5. Expression of angiogenesis-related genes in paired tissue samples from (A) premenopausal (n = 11) and from (B) postmenopausal EC patients (n = 25). Tumour tissue is shown in dark red, and tumour-adjacent tissue is in orange colour. Wilcoxon matched pairs signed rank tests with Bonferroni–Šidák corrections for multiple comparisons. Data are shown as scattered dot plots with marked means with 95% CI, * p-value ≤ 0.05, *** p-value ≤ 0.001, **** p-value ≤ 0.0001.
Cancers 15 03661 g005
Figure 6. Gene co-expression pattern in tumour tissue, tumour-adjacent tissue, and between both of them. (A) All EC patients; n = 36, (B) EC patients without LVI; n = 28, (C) EC patients with LVI; n = 8. A Heatmap of Spearman correlation coefficients is shown. For details on strong correlations, see Table 5.
Figure 6. Gene co-expression pattern in tumour tissue, tumour-adjacent tissue, and between both of them. (A) All EC patients; n = 36, (B) EC patients without LVI; n = 28, (C) EC patients with LVI; n = 8. A Heatmap of Spearman correlation coefficients is shown. For details on strong correlations, see Table 5.
Cancers 15 03661 g006
Figure 7. ROC Curves for the training (A) and test (B) datasets.
Figure 7. ROC Curves for the training (A) and test (B) datasets.
Cancers 15 03661 g007
Figure 8. Confusion matrices for all three models on the test dataset. The numbers represent the number of predictions falling into each of the combinations of actual (real) EC tumour grade and the EC tumour grade predicted by the respective model. The cell colours represent the relative numbers, with darker colours representing lower counts and lighter colours representing higher counts.
Figure 8. Confusion matrices for all three models on the test dataset. The numbers represent the number of predictions falling into each of the combinations of actual (real) EC tumour grade and the EC tumour grade predicted by the respective model. The cell colours represent the relative numbers, with darker colours representing lower counts and lighter colours representing higher counts.
Cancers 15 03661 g008
Table 1. List of publicly available datasets used in this study.
Table 1. List of publicly available datasets used in this study.
DatasetDownloaded fromDetailSamplesReferences
cBioPortalhttps://www.cbioportal.org/
accessed on 4 July 2022
TCGA Pan-Cancer studyn (T) = up to 527[19,20]
UCSC Xenahttps://xena.ucsc.edu/
accessed on 8 July 2022
GDC TCGA Endometrioid Cancer (UCEC) study
(TCGA data uniformly reanalysed at GDC by UCSC Xena group using the latest Human Genome Assembly hg38)
n (T) = up to 548;
n (TA) = up to 35;
n (paired samples) = up to 23.
[21]
NCI PDC serverhttps://proteomic.datacommons.cancer.gov/pdc/
accessed on 5 July 2022
CPTAC UCEC Discovery Study—Proteome,
PDC ID: PDC000125,
study ID: c935c587-0cd1-11e9-a064-0a9c39d33490
n (T) = up to 95;
n (TA) = up to 25;
n (paired samples) = up to 24.
[22]
TCGA—The Cancer Genome Atlas; CPTAC—Clinical Proteomic Tumor Analysis Consortium; UCEC—Uterine Corpus Endometrial Carcinoma.
Table 2. Detailed information about the patients included in the study.
Table 2. Detailed information about the patients included in the study.
SampleAgeMenopausal StatusHistological Type/GradeFIGO StageGradus HG/LGDepth of Myometrial InvasionLymphovascular Invasion
539premenopausaldedifferentiatedIBHG>50%yes
750premenopausalendometrioid G1IBLGnono
883postmenopausaldedifferentiatedIBHG>50%no
941premenopausalendometrioid G1IALG<50%no
1053postmenopausalendometrioid G1IALGnono
1364postmenopausalendometrioid G1IVLG<50%NA
1473postmenopausalendometrioid G1IBLG>50%no
1669postmenopausalendometrioid G1IALG<50%no
1879postmenopausalendometrioid G1IBLG>50%no
1974postmenopausalendometrioid G1IALG<50%no
2076postmenopausalendometrioid G1IA LG<50%no
2153premenopausalendometrioid G2IALGnono
2345premenopausalendometrioid G1IALGnono
2469postmenopausalendometrioid G2IBLG>50%yes
2554premenopausalendometrioid G3IAHG<50%no
2672postmenopausalendometrioid G1IA LG<50%no
3054premenopausalendometrioid G1IA LGnono
3377postmenopausalendometrioid G3IBHG>50%no
3457postmenopausalmucinousIALG<50%no
4071postmenopausalserousIA HG<50%no
4473postmenopausalserousIBHG>50%yes
4727premenopausaldedifferentiatedIAHG<50%no
4970postmenopausalendometrioid G1IALG<50%no
5073postmenopausalendometrioid G1IALG<50%no
5175postmenopausalendometrioid G2IALG>50%yes
5275postmenopausalendometrioid G2IALG<50%yes
5350postmenopausalendometrioid G3IAHG<50%yes
5471postmenopausalendometrioid G1IALG<50%no
5655postmenopausalendometrioid G1IALGnono
5743premenopausalendometrioid G1IALGnono
6259postmenopausalendometrioid G1IALGnono
6366postmenopausalendometrioid G1IALG<50%no
6580postmenopausalcarcinosarcomaIBHG>50%yes
6672postmenopausalendometrioid G1IALG<50%no
6845premenopausalendometrioid G1IILG<50%no
7148premenopausalserousIAHG<50%no
G—gradus, HG—high grade, LG—low grade.
Table 3. Details for the TaqMan “Assays on Demand” used for the 15 investigated genes and 2 reference genes.
Table 3. Details for the TaqMan “Assays on Demand” used for the 15 investigated genes and 2 reference genes.
Gene SymbolGene/AF NameAssay ID
CSF3colony stimulating factor 3Hs99999083_m1
CXCL12C-X-C motif chemokine ligand 12Hs00171022_m1
ENPP2ectonucleotide pyrophosphatase/phosphodiesterase 2Hs00196470_m1
FBLN5fibulin 5Hs00197064_m1
FGF2fibroblast growth factor 2Hs00266645_m1
FSTfollistatinHs00246256_m1
HPRT1 *hypoxanthine-guanine phosphoribosyltransferaseHs02758991_g1
IL8C-X-C motif chemokine ligand 8Hs00174103_m1
LEPleptinHs00174877_m1
LYVE1lymphatic vessel endothelial hyaluronan receptor 1Hs00272659_m1
NRP1neuropilin 1Hs00826128_m1
PDGFRBplatelet derived growth factor receptor betaHs00387364_m1
POLR2A *DNA-directed RNA polymerase II subunit RPB1Hs00426592_m1
SERPINF1serpin family F member 1Hs00171467_m1
TEKTEK receptor tyrosine kinase; Tie-2Hs00176096_m1
TIMP2TIMP metallopeptidase inhibitor 2Hs00234278_m1
TIMP3TIMP metallopeptidase inhibitor 3Hs00165949_m1
* Reference gene.
Table 4. Comparison of fold regulation (FR) in gene expression between tumour and tumour-adjacent control tissue.
Table 4. Comparison of fold regulation (FR) in gene expression between tumour and tumour-adjacent control tissue.
GenesMean FRFR CI 95%p-Value
IL84.752.467.040.0164*
CXCL12−18.21−31.95−4.460.0015**
FGF2−7.39−10.51−4.270.0015**
LEP4.682.876.490.0104*
LYVE1−10.98−16.52−5.440.0015**
NRP1−3.67−5.11−2.220.0015**
TIMP2−6.79−8.87−4.700.0015**
TIMP3−14.77−21.77−7.770.0015**
CSF3−10.43−20.51−0.350.8328ns
ENPP2−10.10−15.05−5.140.0015**
FBLN5−8.27−12.70−3.830.0015**
FST−6.77−9.98−3.560.0668ns
PDGFRB−4.50−6.08−2.920.0015**
TEK−5.20−7.50−2.910.0015**
SERPINF1−8.68−12.71−4.660.0015**
adjusted p-values after Wilcoxon matched-pairs signed rank test with Bonferroni–Šidák corrections for multiple comparisons. * p-value ≤ 0.05, ** p-value ≤ 0.01, ns—not significant.
Table 5. Strong correlations between mRNA expressions within the tumour and tumour-adjacent tissue and between both tissues, considering EC patients stratified according to LVI status.
Table 5. Strong correlations between mRNA expressions within the tumour and tumour-adjacent tissue and between both tissues, considering EC patients stratified according to LVI status.
Tumour TissueTumour-Adjacent TissueTumour Tissue vs. Tumour-Adjacent Tissue
Gene_TGene_Trp-ValueGene_TAGene_TArp-ValueGene_TGene_TArp-Value
All EC PatientsPDGFRBSERPINF10.9063.3 × 10−14
TEKSERPINF10.8884.9 × 10−13
TIMP2TIMP30.8791.7 × 10−12
FGF2TIMP30.8714.9 × 10−12
PDGFRBTEK0.8669.2 × 10−12
EC Patients without LVIPDGFRBSERPINF10.8787.9 × 10−10ENPP2TEK0.8547.2 × 10−9
CXCL12TIMP30.8751.1 × 10−9
FBLN5PDGFRB0.8731.4 × 10−9
TIMP2SERPINF10.8585.3 × 10−9
TIMP2TIMP30.8575.8 × 10−9
EC Patients with LVIPDGFRBTEK1.0004.9 × 10−5CXCL12SERPINF10.9520.001TIMP3NRP10.9290.002
LYVE1NRP10.9764.0 × 10−4CXCL12TIMP20.9290.002ENPP2PDGFRB0.9290.002
LYVE1PDGFRB0.9764.0 × 10−4TIMP2SERPINF10.9290.002FGF2PDGFRB0.9050.005
LYVE1TEK0.9764.0 × 10−4TIMP2TIMP30.9050.005ENPP2NRP10.9050.005
NRP1SERPINF10.9764.0 × 10−4FBLN5SERPINF10.9050.005FGF2NRP10.8810.007
PDGFRBSERPINF10.9764.0 × 10−4FGF2TIMP30.8570.011NRP1NRP10.8810.007
TEKSERPINF10.9764.0 × 10−4FGF2ENPP20.8570.011PDGFRBNRP10.8810.007
LYVE1SERPINF10.9521.1 × 10−3TIMP3ENPP20.8570.011TEKNRP10.8810.007
CXCL12FST0.9222.6 × 10−3PDGFRBTEK0.8570.011
FGF2TIMP30.9054.6 × 10−3
NRP1TIMP30.9054.6 × 10−3
TIMP3PDGFRB0.9054.6 × 10−3
TIMP3TEK0.9054.6 × 10−3
IL8CSF30.8817.2 × 10−3
CXCL12NRP10.8817.2 × 10−3
CXCL12PDGFRB0.8817.2 × 10−3
CXCL12TEK0.8817.2 × 10−3
TIMP3ENPP20.8817.2 × 10−3
TIMP3SERPINF10.8817.2 × 10−3
CXCL12TIMP30.8571.1 × 10−2
CXCL12SERPINF10.8571.1 × 10−2
FGF2TIMP20.8571.1 × 10−2
LYVE1TIMP30.8571.1 × 10−2
ENPP2SERPINF10.8571.1 × 10−2
r—Spearman correlation coefficient, p < 0.05 is considered significant.
Table 6. Primary metrics achieved on the training dataset.
Table 6. Primary metrics achieved on the training dataset.
ModelAccuracyPrecisionRecallF1AUCSensitivitySpecificity
All Data93.2%100%86.4%92.7%1.0086%100%
Tumour (Normalised T)90.9%100%81.8%90%0.9982%100%
Adjacent (Normalised TA)95.5%91.7%100%95.7%0.99100%91%
Table 7. Primary metrics achieved on the test dataset.
Table 7. Primary metrics achieved on the test dataset.
ModelAccuracyPrecisionRecallF1AUCSensitivitySpecificity
All Data85.7%75%75%75%0.7875%90%
Tumour (Normalised T)85.7%66.7%100%80%0.98100%80%
Adjacent (Normalised TA)50%28.6%50%36.4%0.4050%50%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Roškar, L.; Kokol, M.; Pavlič, R.; Roškar, I.; Smrkolj, Š.; Rižner, T.L. Decreased Gene Expression of Antiangiogenic Factors in Endometrial Cancer: qPCR Analysis and Machine Learning Modelling. Cancers 2023, 15, 3661. https://doi.org/10.3390/cancers15143661

AMA Style

Roškar L, Kokol M, Pavlič R, Roškar I, Smrkolj Š, Rižner TL. Decreased Gene Expression of Antiangiogenic Factors in Endometrial Cancer: qPCR Analysis and Machine Learning Modelling. Cancers. 2023; 15(14):3661. https://doi.org/10.3390/cancers15143661

Chicago/Turabian Style

Roškar, Luka, Marko Kokol, Renata Pavlič, Irena Roškar, Špela Smrkolj, and Tea Lanišnik Rižner. 2023. "Decreased Gene Expression of Antiangiogenic Factors in Endometrial Cancer: qPCR Analysis and Machine Learning Modelling" Cancers 15, no. 14: 3661. https://doi.org/10.3390/cancers15143661

APA Style

Roškar, L., Kokol, M., Pavlič, R., Roškar, I., Smrkolj, Š., & Rižner, T. L. (2023). Decreased Gene Expression of Antiangiogenic Factors in Endometrial Cancer: qPCR Analysis and Machine Learning Modelling. Cancers, 15(14), 3661. https://doi.org/10.3390/cancers15143661

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop