Abstract
Background/Objectives: Colorectal cancer (CRC) is one of the most common malignancies worldwide, with systemic inflammation increasingly recognised as a determinant of disease progression. This study aimed to establish a taxonomy-based classification of patients with newly diagnosed primary CRC using systemic inflammatory, haematological, and anthropometric markers, and to evaluate its association with tumour stage. Methods: A total of 229 patients (111 women, 118 men) undergoing surgery for primary CRC were included. Blood samples were analysed for haemoglobin, leukocytes, neutrophils, lymphocytes, platelets, C-reactive protein (CRP), and carcinoembryonic antigen (CEA). Anthropometric data were collected. Taxonomic clustering and ordinal logistic regression were used to explore associations with TNM and Astler–Coller classifications. Results: Men had higher neutrophil and leukocyte counts, elevated CEA concentrations (132.8 vs. 81.3 ng/mL), and higher NLR values (4.74 vs. 4.23) compared with women. Logistic regression confirmed that platelet count (OR 1.003; p = 0.004), PLR (OR 1.003; p = 0.003), and CEA (OR 1.03; p < 0.001) were positively associated with advanced TNM stage, while haemoglobin was inversely correlated (OR 0.88; p = 0.045). Among 84 clustering models, two taxonomies were the most clinically informative: Taxonomy I (BMI, neutrophils, platelets) and Taxonomy II (age, lymphocytes, platelets), both significantly associated with T, N, M, overall TNM stage, and Astler–Coller grade. Taxonomy I identified three patient groups. Type 3 represented the poorest phenotype, characterised by low BMI and haemoglobin, high platelets, elevated CEA and PLR, and predominance of TNM IIIC tumours, consistent with a cachectic–inflammatory profile. Type 1 displayed higher BMI, lower inflammation, and earlier-stage disease. Type 2 was characterized by elevated neutrophils and leukocytes. Taxonomy II distinguished four groups, with Type 2 demonstrating the most favourable profile (high haemoglobin and lymphocytes, low NLR and PLR, early TNM stage). Conclusions: Systemic inflammatory markers, haemoglobin, platelets, and CEA strongly predict CRC advancement. The proposed taxonomy provides clinically meaningful stratification of CRC patients and may support personalised risk assessment. This accessible approach may facilitate early identification of high-risk individuals, although validation in prospective studies is required.
1. Introduction
The diagnosis and treatment of cancer remain among the key challenges of contemporary medicine, with colorectal cancer (CRC) holding a particularly important position. CRC is characterized by high incidence and mortality rates and is a major focus of oncological research worldwide. The global number of new CRC cases is projected to reach 3.2 million by 2040, driven by population aging, growth, and changes in human development [1]. Furthermore, increases in CRC mortality among young adults are projected in several countries, including Italy, the United Kingdom, Poland, and Spain for men, and Germany for women [2].
Given the biological heterogeneity of CRC and the complexity of its progression, identifying novel diagnostic and prognostic tools remains an urgent priority. Increasing attention is being paid to inflammatory markers and indicators of systemic immune response as promising prognostic and predictive factors [3].
Chronic inflammation plays a significant role in the pathogenesis of many tumors, including CRC. It contributes to cancer cell proliferation, angiogenesis, and metastatic potential. Analysis of inflammatory processes and their associated molecular signatures offers valuable insight into mechanisms of carcinogenesis, tumor invasiveness, and response to treatment [4,5,6].
Among the most accessible and clinically relevant prognostic markers are hematological indicators derived from peripheral blood. The neutrophil-to-lymphocyte ratio (NLR) is considered a particularly strong predictor of overall survival (OS) and disease-free survival (DFS) in CRC patients [7,8]. A meta-analysis showed that NLR values ≥ 3.0 and platelet-to-lymphocyte ratio (PLR) ≥ 160 were associated with poor prognosis in stage III–IV CRC [9]. Moreover, recent research has demonstrated that pre-diagnostic levels of neutrophils, leukocytes, and NLR also serve as independent predictors of CRC outcomes, even before clinical diagnosis [10].
Other markers such as PLR, lymphocyte-to-monocyte ratio (LMR), and the systemic immune–inflammation index (SII) have also shown prognostic value in various stages of CRC, particularly in the postoperative setting [11,12]. Inflammation-related plasma proteins such as C-reactive protein (CRP), IL-6 (interleukin-6), TNF-alpha (tumor necrosis factor) and its receptors, adiponectin play complex roles in colorectal cancer development [13].
Understanding the role of inflammation in CRC development and progression is essential for refining risk stratification, improving early detection, and identifying potential targets for immunomodulatory therapies. Consequently, the investigation of inflammatory markers remains a dynamic and clinically relevant field in modern oncology.
Actually, CRC is the second most common cancer in women and the third in men in Poland [14,15,16]. In 2022, the number of new CRC cases exceeded 18,800, with an incidence rate of 57 per 100,000 in men and 44 per 100,000 in women [17]. Despite detailed consideration of the epidemiology of this pathology, prognostic interventions in this area were not discussed.
Some authors have considered CRC to be a “disease of affluence,” as its prevalence and dynamics are closely related to the level of development and living conditions in different regions worldwide [18,19]. Currently, the highest increase in cancer incidence is observed in countries where economic development has been the most rapid in recent decades.
Although the socioeconomic background of CRC is increasingly recognised, current prognostic approaches still rely largely on clinical staging and histopathological assessment. Meanwhile, growing evidence suggests that systemic inflammation plays a central role not only in tumorigenesis but also in tumour progression and patient prognosis. Therefore, accessible biomarkers reflecting the systemic immune response—such as leukocyte, neutrophil, lymphocyte, and platelet counts—are being actively explored as potential tools to support clinical decision-making [20]. Despite their simplicity, these markers may offer valuable insights into disease aggressiveness and progression.
The role of body mass index (BMI) in CRC progression is still under debate. While obesity is a recognized risk factor for CRC development, its relationship with tumour stage at diagnosis remains inconclusive. Some studies suggest that higher BMI may not correlate with more advanced TNM (tumor, nodes, metastases) stages, possibly due to earlier detection in overweight individuals or the so-called “obesity paradox” [20,21]. Conversely, underweight status and sarcopenia have been associated with more aggressive disease and poorer outcomes [22,23]. Furthermore, BMI does not reflect body fat distribution, which may be more relevant for cancer biology. Therefore, assessing the prognostic value of BMI alongside inflammatory markers may contribute to a more comprehensive understanding of disease progression in CRC.
By focusing on routinely measured hematological parameters and their correlation with tumour stage, the aim of the present study was to develop a clinical-biochemical taxonomy of patients with newly diagnosed primary colorectal cancer (CRC), identifying distinct taxa based on systemic inflammatory, hematological, and anthropometric markers, and to assess their relationship with tumour stage and disease progression.
2. Materials and Methods
2.1. Materials
A total of 229 patients with primary CRC who underwent surgical treatment at two departments of General and Colorectal Surgery in Poland (Sosnowiec, Bytom) were included. A comprehensive medical interview and physical examination were performed for each participant. Clinical data were obtained using a physician-administered questionnaire and included: clinical symptoms and their duration, body mass index (BMI; including weight loss within the previous three months), general condition assessed according to the Karnofsky performance status [23], family history of cancer, comorbidities, and tumor location. Furthermore, the exclusion criteria were as follows: emergency hospitalization, lack of histological confirmation of cancer, a concomitant inflammatory disease, previous non-adjuvant therapy, and no written informed consent.
The study was conducted in accordance with the Declaration of Helsinki and approved by the Bioethics Committee of the Medical University of Silesia. As this was a retrospective analysis based on anonymized clinical data, formal bioethical committee approval was not required. (application no. KNW/0022/KB/74/I/19, 21 March 2019). Statistical analyses were independently verified by consulting biostatisticians from the Department of Public Health, Wroclaw Medical University.
2.2. Methods
2.2.1. Patient’s Database Completing
The database also contained information on surgical interventions and histological findings. The evaluation criteria included parameters such as the TNM classification [24], the number of lymph nodes removed and their involvement in the cancer process, the degree of histological differentiation, stage, and the degree of clinical progression according to the TNM and Astler-Coller classification [25].
2.2.2. Analysis of Blood Samples
All venous blood samples were collected in the morning after an overnight fast and processed in a certified hospital laboratory according to standard clinical procedures.
Although routine biochemical and hematological panels were performed for all patients, the present analysis focused on selected parameters relevant to systemic inflammation and tumour progression. These included: hemoglobin, leukocytes, neutrophils, lymphocytes, C-reactive protein (CRP), and carcinoembryonic antigen (CEA).
Peripheral blood smears were examined microscopically for morphology, while serum CRP and CEA concentrations were determined using standard laboratory assays.
2.2.3. Statistical Analysis
All statistical analyses were performed using the R statistical environment (version 4.0.0) [26]. We also used the “cluster” statistical package” (Version 2.1.0. CRAN) [27].
The use of linear progression and causal relationships was not appropriate in the context of the analysed clinical phenomena. Therefore, the study was extended to include taxonomic analysis [28]. The formation of patient subgroups based on the principles of taxonomy was determined by the similarity of specific clinical and laboratory parameters.
The Marczewski–Steinhaus metric [29] was used to measure pairwise taxonomic distances between patients, defined as: d(A, B) = ∣AΔB∣ max (∣A∣, ∣B∣).
The use of this metric allowed a structural interpretation of relationships between patients considered as sets of features. The selection of this metric was motivated by the mixed nature of the dataset, which included both continuous and pseudo-categorical variables, making it suitable for assessing structural similarities between patients, especially for capturing heterogeneous biomedical traits.
Complete-linkage agglomeration was applied because it generates compact, internally consistent clusters, which is important for the clinical interpretation of patient phenotypes.
Hierarchical clustering was performed using the complete-linkage agglomeration method, defined as:
[30]
A total of 84 trivariate combinations of variables were evaluated during the exploratory phase. Cut-off thresholds for dendrogram segmentation were determined using a scree-plot criterion. Two taxonomies demonstrated the highest internal coherence and clinical interpretability:
- BMI + neutrophils + platelets,
- age + lymphocytes + platelets.
Only these two taxonomic models were retained in the final results of the manuscript for clarity. As the taxonomic analysis was exploratory in nature, no correction for multiple comparisons (Bonferroni or FDR) was applied, and the findings should be regarded as hypothesis-generating.
2.2.4. Limitations of the Study
This study has several limitations that should be acknowledged.
First, the analysis was cross-sectional and did not include survival or treatment-response data (e.g., overall or disease-free survival). Therefore, the prognostic value of the identified clusters should be interpreted cautiously until confirmed in prospective cohorts.
Second, although ordinal logistic regression was used to assess the association between clinical and hematological parameters and tumour stage, protein-level validation of the analysed markers was not performed due to the retrospective design of the study.
Third, the taxonomic analysis included multiple trivariate combinations (n = 84), which increases the probability of type I error. Given the exploratory nature of this approach, no correction for multiple comparisons (Bonferroni or FDR) was applied; hence, these findings should be considered hypothesis-generating rather than confirmatory.
Fourth, although sex-related differences were addressed in the statistical models, not all results were stratified by sex due to limited subgroup sizes.
Finally, this study was based on data from a single academic centre, which may limit generalizability to other populations.
3. Results
3.1. Study Group
The study group included 111 women (mean age 64.94 years; range 31–90 years). The mean height was 161.2 cm (range 143–176 cm), and the mean weight was 69.3 kg (range 32–140 kg). The mean BMI in women was 25.6 kg/m2. The study also included 118 men (mean age 63.98 years; range 36–94 years); the mean height was 174.1 cm (range 149–189 cm), and the mean weight was 81.6 kg (range 50–160 kg). The mean BMI was 26.9 in men—Table 1.
Table 1.
Characteristics of Patients.
3.2. Laboratory and Anthropometrics Parameters in Study Group
The analysis of laboratory and anthropometric parameters in the study groups showed different results depending on the gender. When laboratory parameters of the female subgroup were analysed, the mean values were as follows: haemoglobin—11.99 g/dl, leukocytes—8.04, neutrophils—5.82, lymphocytes—1.64, platelets—311.2 and CEA—81.31. The mean values of PLR and NRL were 221.54 and 4.23, respectively. In turn, the results of the laboratory findings in the male subgroup were as follows: haemoglobin—12.41 g/dl, leukocytes—8.6, neutrophils—6.26, lymphocytes—1.63, platelets—316.7, CEA—132.78, PLR—234.52 and NRL—4.74.
The mean haemoglobin level in women was 11.99 g/dl and 12.41 g/dl in men, which might have been due to physiological differences between the genders. Low haemoglobin levels in women may be an indicator of a higher risk of anaemia, particularly in terms of cancer. The mean white blood cell counts in women and men were 8.04 and 8.6, respectively. The mean lymphocyte counts in women were 1.64 and 1.63 in men. The similar values of the parameters indicate the preservation of basic immune reactivity in both genders, although some differences in other blood parameters were found. The mean neutrophil counts were 5.82 in women and 6.26 in men. Higher neutrophil counts in men might have been associated with a more pronounced inflammatory response or might have reflected the differences in the immune response to cancer.
The mean platelet counts in women and men did not differ significantly, indicating a similarity in the blood coagulation response between both genders, which is essential in terms of the risk of developing thrombotic complications in CC. PLR showed no significant dimorphism, which is natural given the similar levels of platelets. The mean CEA in women was 81.31. However, it was significantly higher in men (132.78). High levels of CEA in men may indicate a higher risk of cancer progression and metastasis, as CEA is often associated with more aggressive forms of CC. Determination of NRL showed that the mean value in women was significantly lower than in men, which may have indicated a greater influence of inflammation on cancer progression since a high neutrophil-to-lymphocyte ratio is often associated with a poor prognosis in cancer patients.
3.3. Nosological Structure
The nosological structure also demonstrated some sex-related dimorphism. Differentiation of patients according to the TNM classification showed that, among women, the following subgroups were identified: T1—3 patients, T2—27, T3—65, T4—13; N0—49, N1—28, N2—31; M0—81 and M1—25 patients. Among men, the corresponding distribution was: T1—2, T2—25, T3—78, T4—11; N0—43, N1—29, N2—44; M0—88 and M1—26 patients. The comparison of the nosological structure is presented in Figure 1.
Figure 1.
(A–C) Nosological structure of the disease according to TNM parameters (A—T, B—N, C—M) in female and male subgroups.
The interpretation of the above data shows that although the overall trends in the distribution of disease stages were similar between the genders, some differences were found. For instance, a higher proportion of women presented with T3 tumours, while a higher proportion of men presented with T3 and N2 tumours. A slight increase in the number of men with distant metastases (M1) was also found compared to women. The data may indicate the need for a more differentiated approach to the diagnosis and treatment of CRC, depending on the gender, given their specific characteristics in the clinical course of the disease.
Statistically significant results (p < 0.05) and the results approaching significance (p < 0.1) obtained by ordinal logistic regression are given in Table 2. This allowed us to better understand the potential impact of different biomarkers on the course and prognosis of CRC.
Table 2.
Statistically significant results were obtained using ordinal logistic regression (p < 0.05 and p < 0.1).
The statistical interpretation of the data in Table 2 showed that increased haemoglobin concentration was negatively correlated with the progression of the clinical stage (T stage). An increase in the blood level by 1 g/dL correlated with a decreased risk of progression according to the TNM classification by about 12%, while an increase by 5 g/dl correlated with a reduced progression risk by about 47%. Therefore, an inverse proportional relationship was found. In turn, platelet counts and PLR levels were directly correlated with disease progression (T parameter). The statistical interpretation of other results included in Table 2 was similar. The statistical results were presented in parallel on the forest plot shown in Figure 2.
Figure 2.
Odds ratios illustrating the impact of statistically significant (p < 0.05) and borderline significant (p < 0.1) risk factors on clinical tumour stage (TNM), tumour grade (G), Astler–Coller classification, and clinical complications (improvement vs. deterioration).
The method used here was based on the framework of logistic regression with a dichotomous response variable and represents its extension to a scale with more than two categories [31]. In ordinal logistic regression, the dependent (outcome) variable has an ordered scale, and the estimated regression parameters are interpreted in the same way as in standard logistic regression, i.e., by means of odds ratios (ORs).
However, adopting a linear dependence model and the causal relationships obtained under this assumption was not fully satisfactory in substantive terms for the clinical phenomena under analysis. Therefore, the statistical analysis was extended by an alternative, methodologically distinct approach to the data in the form of a taxonomic analysis.
3.4. Characteristics of Taxonomy Types
Next, hierarchical clustering was performed using the Marczewski–Steinhaus metric and the complete-linkage agglomeration method. A total of 84 taxonomic sets of clinical features were analysed, of which two were considered most informative and clinically interpretable: Taxonomy I (BMI, neutrophil count, platelet count) and Taxonomy II (age, lymphocyte count, platelet count). These two combinations yielded the highest number of statistically significant associations (p < 0.05) and results at the threshold of statistical significance (p < 0.1) with clinical responses such as T, N, M, overall TNM stage, and Astler–Coller classification.
3.4.1. Taxonomy I
Based on BMI, neutrophils, and PLT, three main patient types (“Type 1”, “Type 2” and “Type 3”) were identified. Group comparisons between these types were performed using one-way ANOVA or the Kruskal–Wallis test, depending on the distribution of variables. Statistically significant (p < 0.05) and borderline (p < 0.1) differences are presented in Table 3.
Table 3.
Clinical and laboratory characteristics of the three taxonomic patient types identified in Taxonomy I.
The clinical and laboratory profiles corresponding to the three clusters in Taxonomy I are illustrated in Figure 3.
Figure 3.
Taxonomic dendrogram and forest plot illustrating the three patient types identified in Taxonomy I based on BMI, neutrophil count, and platelet count (PLT).
In summary, the statistical analysis shows that the three patient types identified in Taxonomy I are characterised by distinct relative levels of features and clinical responses (Table 4). In the most extreme cases, patients with low levels of clinical response in terms of T, N, M, overall TNM stage, and Astler–Coller classification were relatively characterised by:
Table 4.
Summary of clinical parameter levels by Taxonomy I type.
- a moderate BMI,
- a high neutrophil count,
- a moderate PLT level,
- a moderate body weight,
- a moderate haemoglobin level,
- a high WBC count,
- a moderate CEA concentration,
- a high NLR, and
- a moderate PLR.
In contrast, patients with the poorest prognosis were those relatively characterised by:
- a low BMI,
- a moderate neutrophil count,
- a high PLT count,
- a low body weight,
- a low haemoglobin level,
- a moderate WBC count,
- a high CEA concentration,
- a moderate NLR, and
- a high PLR.
Based on statistical comparisons, Type 3 patients had the poorest prognosis and were characterised by, low BMI, mean neutrophil counts, high platelet counts, low weight, low haemoglobin levels, mean leukocyte counts, high CEA concentrations and mean NLR and PLR levels—Table 4.
3.4.2. Taxonomy II
Using the same computational procedures, Taxonomy II was constructed based on patient age, lymphocyte count and platelet count (PLT). In this feature space, no equally clear “harmonic” ordering of data and inter-variable dependencies was observed as in Taxonomy I; however, the clustering still yielded four patient types (“Type 1”, “Type 2”, “Type 3” and “Type 4”). The clinical characteristics of these four types are presented in Table 5 and illustrated in Figure 4.
Table 5.
Clinical characteristics of taxonomy II with the types 1–4.
Figure 4.
Taxonomic dendrogram of patients and their profiles based on age, lymphocyte count and platelet count (PLT) in Taxonomy II.
When a similar analysis was conducted in patients included in Taxonomy II, no similar homogeneity of data was found. Nevertheless, the group was divided into four types (“Type 1”, “Type 2”, “Type 3” and “Type 4”). The clinical characteristics of Taxonomy II, including the four identified types, are shown in Table 5.
The analysis of the data using the same analytical scheme as in the case of Taxonomy I showed that the most representative type in Taxonomy II was Type 2 (n = 96). These patients exhibited the highest levels of lymphocytes, haemoglobin, and leukocytes, along with the lowest levels of NLR and PLR. They also had a relatively lower clinical stage according to the TNM and Astler-Coller classification.
Importantly, Type 2 patients differed significantly in terms of clinical parameters from Type 1 subjects of the same age, although both groups represented a similar health status as measured by the oncological classification. The age of the patients was inversely proportional to their height. The statistical analysis showed that younger patients had a worse prognosis. The study also showed some other indirect causal relationships, which were relatively insignificant compared to the above findings.
The results of taxonomies for other combinations of risk factors that provided at least one statistical difference (p < 0.05) or borderline statistical significance (p < 0.1) in terms of clinical responses (TNM classification) are given in Table 6.
Table 6.
Three variables of a taxonomic set of clinical features that predicted statistically different (p < 0.05) clinical responses.
The analysis of the data for the additional taxonomic sets showed that only two of them reflected causal relationships that were considered reliable due to the number of positive clinical responses. Therefore, 84 combinations of clinical features were analysed. Only 20 (2 + 18) taxonomic combinations reflected a causal relationship with the clinical responses, such as stages (T, N, M). Based on the taxonomic analysis, the most important prognostic factors for patients in terms of oncological classification were BMI, lymphocyte, platelet and haemoglobin levels. The taxonomic relationships identified here supplement the knowledge about causal patterns obtained from ordinal logistic regression, in which these factors were expressed through odds ratios, providing a structural and nonlinear perspective on the prognostic value of these factors.
4. Discussion
This study analysed systemic inflammatory, haematological, and anthropometric parameters in patients with newly diagnosed colorectal cancer (CRC), with particular emphasis on sex-related differences and tumour stage. Male patients demonstrated consistently higher neutrophil and leukocyte counts, elevated CEA concentrations, and higher NLR values, which may reflect both a more pronounced inflammatory response and potentially more aggressive tumour biology. These observations are in line with previous studies suggesting sex-specific biological and clinical patterns in CRC progression, where obesity and elevated BMI show a stronger correlation with CRC incidence in men compared with women [32,33,34,35,36].
Multivariable logistic regression confirmed that PLR and CEA levels were positively correlated with more advanced TNM stages, while higher haemoglobin concentration was inversely associated with tumour progression. These findings reinforce the prognostic value of accessible and cost-effective systemic inflammatory markers. Such markers, particularly PLR and NLR, have been consistently linked to adverse outcomes in CRC [37,38] and other malignancies, including lung [39,40], gastric [41,42], and ovarian cancers [43,44], as well as in patients undergoing immunotherapy [45].
The novel contribution of this study lies in the introduction of a taxonomy-based classification system that integrates inflammatory, haematological, and anthropometric data. Using a multivariate clustering approach, three distinct clinical-biochemical taxa (Taxonomy I) were identified, each with a unique systemic and metabolic profile. Patients classified as Type 3 exhibited the most unfavourable characteristics: low BMI and haemoglobin levels, high platelet counts, elevated CEA and PLR values, and advanced tumour stage (TNM IIIC). This group likely represents a cachectic-inflammatory phenotype associated with poor prognosis. In contrast, Type 1 patients had higher BMI, lower inflammatory markers, and less advanced disease, while Type 2 was characterised by elevated neutrophil and leukocyte counts, indicating a systemic inflammatory response.
A secondary classification (Taxonomy II) further stratified the cohort into four groups. Type 2 patients in this model displayed the most favourable profiles, with high lymphocyte and haemoglobin levels, low NLR and PLR values, and less advanced clinical staging. These taxonomic patterns suggest that inflammatory and metabolic markers can be used not only as prognostic indicators but also as a basis for stratifying patients into biologically and clinically meaningful subgroups.
Our results are consistent with the growing body of evidence linking systemic inflammation and nutritional status to cancer prognosis. For instance, Petrelli et al. [34] reported that while obesity is generally associated with decreased overall and cancer-specific survival, it may paradoxically confer a survival advantage in certain malignancies such as lung, kidney, and melanoma. Similarly, Harmantepe et al. [46] proposed combining clinical, inflammatory, and histopathological indicators in obstructive CRC to refine survival prediction. Our taxonomy reflects this complexity by integrating body composition with immunological and haematological parameters.
The prognostic significance of PLR and NLR has been well documented across tumour types. Zhou et al. [47] demonstrated that high PLR was associated with worse outcomes in immunotherapy-treated lung cancer, while Wang et al. [48] reported a similar association with clinical outcomes in surgically treated lung cancer patients. Unlike previous studies focusing on isolated markers, our taxonomy combines these parameters into multidimensional and clinically interpretable subtypes. Importantly, systemic inflammatory markers such as NLR and PLR have also been linked to surgical outcomes, including anastomotic leakage in both general and elderly CRC populations [49,50,51,52,53,54].
The generalisability of this taxonomic framework to other malignancies appears promising and warrants further investigation. Although the present analysis focuses on CRC, stratification of patients based on routinely collected inflammatory, haematological, and anthropometric data could also enhance prognostic precision in other oncological settings.
Nevertheless, several limitations should be acknowledged. The double-centre, cross-sectional design without survival data precludes direct conclusions regarding long-term outcomes. Further multicentre prospective studies are required to validate these taxonomies, integrate molecular tumour characteristics, and assess their predictive value for treatment response and survival.
5. Conclusions
Colorectal cancer patients can be systematically classified into clinical-biochemical taxonomies that reflect distinct inflammatory and nutritional states, which correspond to differences in tumour advancement. Given their accessibility and low cost, the proposed taxonomy-based approach has potential for implementation in routine clinical practice and may support risk-adapted management strategies in CRC and, potentially, other malignancies. Such a taxonomy may facilitate early identification of high-risk patients, particularly those with a cachectic-inflammatory phenotype, who could benefit from intensified surveillance or adjunctive nutritional and anti-inflammatory interventions. Future prospective multicentre studies are needed to validate these findings, determine their predictive value for survival and treatment response, and explore integration with molecular and genomic tumour characteristics to further refine patient stratification.
Author Contributions
Conceptualization, D.W. and M.N.; methodology, D.W., M.N., Z.L. and W.K.; software, M.N., K.S.-M., B.B., M.B.-L. and Ł.M.; validation, M.N., Ł.M. and Z.L.; formal analysis, D.W., M.M.-W. and Z.L.; investigation, D.W., B.B., M.B.-L. and K.S.-M.; writing—original draft preparation, D.W., M.M.-W., Ł.M. and M.K.; writing—review and editing, D.W., M.M.-W., M.K. and Z.L.; visualization, M.N., K.S.-M., M.B.-L. and W.K.; supervision, D.W. and M.M.-W. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the Medical University of Silesia in Katowice, Poland, grant number BNW-1-2024/4.
Institutional Review Board Statement
The study was conducted in accordance with the Declaration of Helsinki and approved by the Bioethics Committee of the Medical University of Silesia in Katowice, Poland (application no. KNW/0022/KB/74/I/19, 21 March 2019).
Informed Consent Statement
Informed consent was obtained from all subjects involved in the study.
Data Availability Statement
Data may be made available from the corresponding author upon reasonable request.
Acknowledgments
Artificial intelligence (AI)-assisted (ChatGPT 5.1) technologies were used to improve the clarity, and style of the manuscript. The authors reviewed and approved all content and take full responsibility for the final version of the work.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Xi, Y.; Xu, P. Global colorectal cancer burden in 2020 and projections to 2040. Transl. Oncol. 2021, 14, 101174. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Santucci, C.; Mignozzi, S.; Malvezzi, M.; Boffetta, P.; Collatuzzo, G.; Levi, F.; La Vecchia, C.; Negri, E. European cancer mortality predictions for the year 2024 with focus on colorectal cancer. Ann. Oncol. 2024, 35, 308–316. [Google Scholar] [CrossRef] [PubMed]
- Shevchenko, I.; Serban, D.; Simion, L.; Motofei, I.; Cristea, B.M.; Dumitrescu, D.; Tudor, C.; Dascalu, A.M.; Serboiu, C.; Tribus, L.C.; et al. Clinical Significance of Blood Cell-Derived Inflammation Markers in Assessing Potential Early and Late Postoperative Complications in Patients with Colorectal Cancer: A Systematic Review. J. Clin. Med. 2025, 14, 2529. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Shalapour, S.; Karin, M. Immunity, inflammation, and cancer: An eternal fight between good and evil. J. Clin. Investig. 2015, 125, 3347–3355. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Burgos-Molina, A.M.; Téllez Santana, T.; Redondo, M.; Bravo Romero, M.J. The Crucial Role of Inflammation and the Immune System in Colorectal Cancer Carcinogenesis: A Comprehensive Perspective. Int. J. Mol. Sci. 2024, 25, 6188. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Qu, D.; Shen, L.; Liu, S.; Li, H.; Ma, Y.; Zhang, R.; Wu, K.; Yao, L.; Li, J.; Zhang, J. Chronic inflammation confers to the metabolic reprogramming associated with tumorigenesis of colorectal cancer. Cancer Biol. Ther. 2017, 18, 237–244. [Google Scholar] [CrossRef] [PubMed]
- Li, M.X.; Liu, X.M.; Zhang, X.F.; Zhang, J.F.; Wang, W.L.; Zhu, Y.; Dong, J.; Cheng, J.W.; Liu, Z.W.; Ma, L.; et al. Prognostic role of neutrophil-to-lymphocyte ratio in colorectal cancer: A systematic review and meta-analysis. Int. J. Cancer 2014, 134, 2403–2413. [Google Scholar] [CrossRef] [PubMed]
- Dimitriou, N.; Felekouras, E.; Karavokyros, I.; Alexandrou, A.; Pikoulis, E.; Griniatsos, J. Neutrophils to lymphocytes ratio as a useful prognosticator for stage II colorectal cancer patients. BMC Cancer 2018, 18, 1202. [Google Scholar] [CrossRef]
- Kim, J.H.; Lee, J.Y.; Kim, H.K.; Lee, J.W.; Jung, S.G.; Jung, K.; Kim, S.E.; Moon, W.; Park, M.I.; Park, S.J. Prognostic significance of the neutrophil-to-lymphocyte ratio and platelet-to-lymphocyte ratio in patients with stage III and IV colorectal cancer. World J. Gastroenterol. 2017, 23, 505–515. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Turri, G.; Caligola, S.; Ugel, S.; Conti, C.; Zenuni, S.; Barresi, V.; Ruzzenente, A.; Lippi, G.; Scarpa, A.; Bronte, V.; et al. Pre-diagnostic prognostic value of leukocytes count and neutrophil-to-lymphocyte ratio in patients who develop colorectal cancer. Front. Oncol. 2023, 13, 1148197. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Zou, Z.Y.; Liu, H.L.; Ning, N.; Li, S.Y.; Du, X.H.; Li, R. Clinical significance of pre-operative neutrophil lymphocyte ratio and platelet lymphocyte ratio as prognostic factors for patients with colorectal cancer. Oncol. Lett. 2016, 11, 2241–2248. [Google Scholar] [CrossRef] [PubMed]
- Chen, J.H.; Zhai, E.T.; Yuan, Y.J.; Wu, K.M.; Xu, J.B.; Peng, J.J.; Chen, C.Q.; He, Y.L.; Cai, S.R. Systemic immune-inflammation index for predicting prognosis of colorectal cancer. World J. Gastroenterol. 2017, 23, 6261–6272. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Li, Y.; Luo, Y.; Ran, Y.; Lu, F.; Qin, Y. Biomarkers of inflammation and colorectal cancer risk. Front. Oncol. 2025, 15, 1514009. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- OECD/European Commission. EU Country Cancer Profile: Poland 2025; OECD Publishing: Paris, France, 2025. [Google Scholar] [CrossRef]
- Choręza, P.; Owczarek, A.J.; Kruk, W.; Chudek, J. The epidemiology of the most frequent cancers in Poland in 2015–2021 and the impact of the COVID-19 pandemic on cancer incidence. Arch. Public Health 2024, 82, 49. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Wojciechowska, U.; Barańska, K.; Miklewska, M.; Didkowska, J.A. Cancer incidence and mortality in Poland in 2020. Nowotw. J. Oncol. 2023, 73, 129–145. [Google Scholar] [CrossRef]
- Available online: https://onkologia.org.pl/pl (accessed on 25 August 2025).
- Brenner, H.; Chen, C. The colorectal cancer epidemic: Challenges and opportunities for primary, secondary and tertiary prevention. Br. J. Cancer 2018, 119, 785–792. [Google Scholar] [CrossRef]
- Available online: https://www.jwatch.org/na49197/2019/05/28/young-onset-colorectal-cancer-disease-affluence (accessed on 3 August 2025).
- Nycz, M.; Kropieniewicz, J.; Suliga, K.; Zawadzki, T.; Opiłka, M.N.; Lorenc, Z. Peripheral blood inflammatory parameters in patients with colorectal cancer. Nowa Med. 2019. [Google Scholar] [CrossRef]
- Fuchs, J.; Schellerer, V.S.; Brunner, M.; Geppert, C.I.; Grützmann, R.; Weber, K.; Merkel, S. The impact of body mass index on prognosis in patients with colon carcinoma. Int. J. Color. Dis. 2022, 37, 1107–1117. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Mandic, M.; Li, H.; Safizadeh, F.; Niedermaier, T.; Hoffmeister, M.; Brenner, H. Is the association of overweight and obesity with colorectal cancer underestimated? An umbrella review of systematic reviews and meta-analyses. Eur. J. Epidemiol. 2023, 38, 135–144, Erratum in Eur. J. Epidemiol. 2024, 39, 231. https://doi.org/10.1007/s10654-023-01092-3.. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Karnofsky, D.A.; Burchenal, J.H. The Clinical Evaluation of Chemotherapeutic Agents in Cancer. In Evaluation of Chemotherapeutic Agents; MacLeod, C.M., Ed.; Columbia University Press: New York, NY, USA, 1949; p. 196. [Google Scholar]
- Rosen, R.D.; Sapra, A. TNM classification. In StatPearls [Internet]; StatPearls Publishing: Treasure Island, FL, USA, 2022. Available online: https://www.ncbi.nlm.nih.gov/books/NBK553187/ (accessed on 25 August 2025).
- Banias, L.; Jung, I.; Chiciudean, R.; Gurzu, S. From Dukes-MAC Staging System to Molecular Classification: Evolving Concepts in Colorectal Cancer. Int. J. Mol. Sci. 2022, 23, 9455. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- R Core Team. R: A Language and Environment for Statistical Computing, Version 4.0.0.; R Foundation for Statistical Computing: Vienna, Austria, 2020.
- Maechler, M.; Rousseeuw, P.; Struyf, A.; Hubert, M.; Hornik, K.; Studer, M.; Roudier, P. Package ‘Cluster’, Version 2.1.0.; Finding groups in data; CRAN: Vienna, Austria, 2019.
- Schober, P.; Vetter, T.R. Logistic Regression in Medical Research. Anesth. Analg. 2021, 132, 365–366. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Szumilas, M. Explaining odds ratios. J. Can. Acad. Child Adolesc. Psychiatry 2010, 19, 227–229, Erratum in J. Can. Acad. Child Adolesc. Psychiatry 2015, 24, 58. [Google Scholar] [PubMed] [PubMed Central]
- Everitt, B. Cluster Analysis; Wiley: Chichester, UK, 2011. [Google Scholar]
- Venables, W.; Ripley, B. Modern Applied Statistics with S, 4th ed.; Springer: New York, NY, USA, 2002. [Google Scholar]
- Sarker, I.H. Deep Learning: A Comprehensive Overview on Techniques, Taxonomy, Applications and Research Directions. SN Comput. Sci. 2021, 2, 420. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Marczewski, E.; Steinhaus, H. On a certain distance of sets and the corresponding distance of functions. Colloq. Math. 1958, 6, 319–327. Available online: http://matwbn.icm.edu.pl/ksiazki/cm/cm6/cm6141.pdf (accessed on 25 August 2025).
- Campbell, P.T.; Lin, Y.; Bien, S.A.; Figueiredo, J.C.; Harrison, T.A.; Guinter, M.A.; Berndt, S.I.; Brenner, H.; Chan, A.T.; Chang-Claude, J.; et al. Association of Body Mass Index with Colorectal Cancer Risk by Genome-Wide Variants. J. Natl. Cancer Inst. 2021, 113, 38–47. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Petrelli, F.; Cortellini, A.; Indini, A.; Tomasello, G.; Ghidini, M.; Nigro, O.; Salati, M.; Dottorini, L.; Iaculli, A.; Varricchio, A.; et al. Association of Obesity with Survival Outcomes in Patients with Cancer: A Systematic Review and Meta-analysis. JAMA Netw. Open 2021, 4, e213520. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- González-Flores, E.; Garcia-Carbonero, R.; Élez, E.; Redondo-Cerezo, E.; Safont, M.J.; Vera García, R. Gender and sex differences in colorectal cancer screening, diagnosis and treatment. Clin. Transl. Oncol. 2025, 27, 2825–2837. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Chen, X.; Yang, M.; Zhao, W.; Tu, J.; Liu, Q.; Yuan, X. Mendelian randomization unraveled: Gender-specific insights into obesity-related phenotypes and colorectal cancer susceptibility. Front. Endocrinol. 2024, 15, 1322253. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Popovici, D.; Stanisav, C.; Saftescu, S.; Negru, S.; Dragomir, R.; Ciurescu, D.; Diaconescu, R. Exploring the Influence of Age, Gender and Body Mass Index on Colorectal Cancer Location. Medicina 2023, 59, 1399. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Hernandez-Ainsa, M.; Velamazan, R.; Lanas, A.; Carrera-Lasfuentes, P.; Piazuelo, E. Blood-Cell-Based Inflammatory Markers as a Useful Tool for Early Diagnosis in Colorectal Cancer. Front. Med. 2022, 9, 843074. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Tez, M. Inflammatory and nutritional markers in colorectal cancer: Implications for prognosis and treatment. World J. Clin. Oncol. 2024, 15, 1264–1268. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Gaur, P.; Bhattacharya, S.; Kant, S.; Kushwaha, R.; Garg, R.; Singh, G.; Pandey, S.; Sharma, S. Association of inflammatory biomarkers with lung cancer in North Indian population. Afr. Health Sci. 2019, 19, 2147–2155. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Şahin, F.; Aslan, A.F. Relationship between Inflammatory and Biological Markers and Lung Cancer. J. Clin. Med. 2018, 7, 160. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Tan, S.; Zheng, Q.; Zhang, W.; Zhou, M.; Xia, C.; Feng, W. Prognostic value of inflammatory markers NLR, PLR, and LMR in gastric cancer patients treated with immune checkpoint inhibitors: A meta-analysis and systematic review. Front. Immunol. 2024, 15, 1408700. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Kim, M.R.; Kim, A.S.; Choi, H.I.; Jung, J.H.; Park, J.Y.; Ko, H.J. Inflammatory markers for predicting overall survival in gastric cancer patients: A systematic review and meta-analysis. PLoS ONE 2020, 15, e0236445. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Bizzarri, N.; D’Indinosante, M.; Marchetti, C.; Tudisco, R.; Turchiano, F.; Scambia, G.; Fagotti, A. The prognostic role of systemic inflammatory markers in apparent early-stage ovarian cancer. Int. J. Clin. Oncol. 2023, 28, 314–320. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Savant, S.S.; Sriramkumar, S.; O’Hagan, H.M. The Role of Inflammation and Inflammatory Mediators in the Development, Progression, Metastasis, and Chemoresistance of Epithelial Ovarian Cancer. Cancers 2018, 10, 251. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Midik, M.M.; Gunenc, D.; Acar, P.F.; Karaca, B.S. Prognostic Value of Blood-Based Inflammatory Markers in Cancer Patients Receiving Immune Checkpoint Inhibitors. Cancers 2024, 17, 37. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Harmantepe, A.T.; Şentürk, A. Survival predictors in obstructive colorectal cancer: A combined clinical, inflammatory, and histopathological approach. In Vivo 2025, 39, 2419–2428. [Google Scholar] [CrossRef] [PubMed]
- Zhou, K.; Cao, J.; Lin, H.; Liang, L.; Shen, Z.; Wang, L.; Peng, Z.; Mei, J. Prognostic role of the platelet to lymphocyte ratio (PLR) in the clinical outcomes of patients with advanced lung cancer receiving immunotherapy: A systematic review and meta-analysis. Front. Oncol. 2022, 12, 962173. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Wang, J.; Li, H.; Xu, R.; Lu, T.; Zhao, J.; Zhang, P.; Qu, L.; Zhang, S.; Guo, J.; Zhang, L. The MLR, NLR, PLR and D-dimer are associated with clinical outcome in lung cancer patients treated with surgery. BMC Pulm. Med. 2022, 22, 104. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Sun, J.; Yang, R.; Wu, H.; Li, L.; Gu, Y. Prognostic value of preoperative combined with postoperative systemic immune-inflammation index for disease-free survival after radical rectal cancer surgery: A retrospective cohort study. Transl. Cancer Res. 2024, 13, 371–380. [Google Scholar] [CrossRef] [PubMed]
- Ioannidis, A.; Tzikos, G.; Smprini, A.; Menni, A.-E.; Shrewsbury, A.; Stavrou, G.; Paramythiotis, D.; Michalopoulos, A.; Kotzampassi, K. Negative and positive predictors of anastomotic leakage in colorectal cancer patients—The case of neutrophil-to-lymphocyte ratio. Diagnostics 2024, 14, 1806. [Google Scholar] [CrossRef] [PubMed]
- Xu, N.; Zhang, J.-X.; Zhang, J.-J.; Huang, Z.; Mao, L.-C.; Zhang, Z.-Y.; Jin, W.-D. The prognostic value of the neutrophil-to-lymphocyte ratio (NLR) and platelet-to-lymphocyte ratio (PLR) in colorectal cancer and colorectal anastomotic leakage patients: A retrospective study. BMC Surg. 2025, 25, 57. [Google Scholar] [CrossRef]
- Wang, C.-Y.; Li, X.-L.; Ma, X.-L.; Yang, X.-F.; Liu, Y.-Y.; Yu, Y.-J. Preoperative neutrophil-to-lymphocyte ratio predicts symptomatic anastomotic leakage in elderly colon cancer patients: Multicenter propensity score-matched analysis. World J. Gastrointest. Surg. 2024, 16, 438–450. [Google Scholar] [CrossRef] [PubMed]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).