1. Introduction
Lung cancer remains a major global health challenge, ranking as the second most commonly diagnosed malignancy and the leading cause of cancer-related mortality. In 2020 alone, it accounted for an estimated 2.2 million newly diagnosed cases and 1.8 million deaths worldwide, underscoring its substantial contribution to the global burden of disease [
1,
2]. Both incidence and mortality vary markedly across geographical regions and between sexes, reflecting differences in exposure to risk factors, healthcare access and early detection practices.
In Eastern Europe, lung cancer outcomes are particularly poor due to high smoking prevalence, delayed diagnosis, and limited access to molecular or imaging-based testing. Consequently, most patients are diagnosed at advanced stages, and survival remains significantly lower than in Western Europe. These disparities highlight the need for simple, accessible, and reproducible prognostic tools that could support individualized management even in resource-limited healthcare systems [
3].
Routine laboratory parameters—such as hemoglobin (Hb), white blood cell (WBC) and platelet counts (PLT), C-reactive protein (CRP), fibrinogen and D-dimer—offer indirect but clinically relevant evidence of these mechanisms and are available in everyday practice [
4,
5,
6]. Low Hb and albumin levels are established indicators of cancer-associated cachexia, while elevated CRP, fibrinogen, and D-dimer mirror inflammatory and thrombotic activity within the tumor microenvironment. Their ubiquity and low cost make them attractive candidates for pragmatic prognostic modeling, particularly in low- and middle-income contexts [
7].
Functional performance, most commonly assessed using the Eastern Cooperative Oncology Group (ECOG) score, remains a cornerstone in survival prediction for lung cancer patients. Poor performance status (ECOG ≥ 2) is associated with a markedly reduced ability to tolerate aggressive systemic therapy, with median survival rarely exceeding 6–12 months in large observational cohorts [
8,
9].
However, current prognostic models often depend on advanced molecular profiling or imaging biomarkers which are not routinely available in many healthcare systems. Few studies have systematically evaluated combinations of accessible laboratory parameters and clinical indicators within a unified prognostic framework, and even fewer have done so in Eastern European populations, where late diagnosis and restricted access to novel therapies remain prevalent [
10,
11,
12,
13].
The present study aimed to evaluate the prognostic significance of four routinely measured hematological parameters—Hb, CRP, fibrinogen and D-dimer—together with ECOG performance status in patients with lung cancer treated at a regional oncology center in Eastern Europe. Although prior evidence regarding CRP and D-dimer has been inconsistent, both markers were included due to their biological plausibility and clinical availability. Their assessment alongside TNM staging was intended to determine their potential additive value in refining prognostic accuracy. By integrating these inexpensive, reproducible variables into a composite prognostic model, we sought to provide a pragmatic and regionally adaptable framework for early mortality risk stratification in lung cancer.
An internal validation process was undertaken to assess the predictive accuracy of the model, with the ultimate goal of supporting timely, evidence-based therapeutic decision-making, particularly in healthcare settings with limited access to advanced diagnostic resources.
2. Materials and Methods
2.1. Study Design and Setting
This retrospective, observational, single-center cohort study was conducted at the Constanta Clinical Hospital for Pneumophthisiology in Romania, a regional referral center for respiratory diseases. This study included patients diagnosed and/or treated between January 2023 and June 2025. All data were collected and analyzed retrospectively from existing medical records, with a predefined cut-off date of 1 June 2025. Consequently, survival analysis was performed on complete, verified data available up to that date, ensuring that no prospective follow-up or ongoing cases were included.
2.2. Study Population
From an initial pool of 602 patients with histologically confirmed lung cancer (including both non-small cell lung cancer [NSCLC] and small cell lung cancer [SCLC]), only those with complete clinical, laboratory and follow-up data—including a documented date of death—were eligible for inclusion in the survival analysis. The final study cohort comprised 304 individuals.
Patients were considered eligible for the present study if they fulfilled all of the following conditions:
Histological confirmation of primary lung cancer, either NSCLC or SCLC, as defined by the latest World Health Organization (WHO) classification of thoracic tumors.
Adult age group, defined as 18 years or older at the time of initial diagnosis.
Comprehensive and verifiable clinical documentation, including complete demographic data, medical history, oncological staging, and follow-up information, with explicit confirmation of survival status at the last contact.
Baseline hematological evaluation performed within 14 days after histological diagnosis, encompassing all parameters of interest for this study (Hb, WBC, PLT count, CRP, fibrinogen, erythrocyte sedimentation rate—ESR, 30/60 min, and D-dimer).
These criteria were established to ensure the accuracy, consistency and comparability of data across the study cohort, thereby strengthening the internal validity of the survival analyses.
Patients were excluded from this study if they met any of the following conditions:
Presence of secondary lung malignancies, defined as pulmonary metastases originating from extrapulmonary primary tumors.
Incomplete or insufficient clinical documentation, including missing key variables such as survival status, baseline hematological parameters, or ECOG performance score.
Concurrent medical conditions with the potential to confound laboratory results, such as active hematological disorders, ongoing acute infections, or chronic inflammatory diseases unrelated to lung cancer.
History of other active malignancies diagnosed within the previous five years, except non-melanoma skin cancers or in situ cervical carcinoma, to avoid the influence of competing cancer-related mortality.
Baseline hematological evaluation was performed outside the predefined time window of 14 days from histological diagnosis, ensuring temporal consistency between diagnostic confirmation and laboratory assessment.
These exclusion criteria were applied to reduce confounding factors, enhance the reliability of prognostic evaluations, and maintain methodological rigor throughout this study.
2.3. Data Collection and Variables
Demographic, clinical and laboratory data were retrieved from the hospital’s electronic medical records and oncology registry. The following variables were analyzed:
Hematological parameters: Hb, WBC count, PLT count, CRP, ESR, 30/60 min, fibrinogen, and D-dimer. Although serum albumin is a well-established prognostic biomarker in lung cancer, it was not included in the multivariable analysis because baseline albumin levels were missing for approximately one-third of the cohort. To minimize data loss and potential selection bias, only hematological parameters with complete baseline values were retained.
Functional status: ECOG performance status (scale 0–5).
Comorbidities: chronic obstructive pulmonary disease (COPD), cardiovascular disease, diabetes mellitus, and other significant chronic illnesses.
Outcome: Overall survival (OS), defined as the time from histological diagnosis to death from any cause or last follow-up (censored for patients alive at the study endpoint).
2.4. Statistical Analysis
Data analysis was performed using IBM SPSS Statistics version 25, with figures generated in Microsoft Office Excel and Word 2024. Categorical variables were expressed as counts and percentages and compared using Fisher’s exact test. Continuous variables were expressed as medians with interquartile ranges (IQR) or means with standard deviations, depending on distribution; normality was assessed using the Shapiro–Wilk test. Between-group comparisons for non-normally distributed variables were performed using the Mann–Whitney U test.
Receiver operating characteristic (ROC) curves were constructed for significant continuous predictors, with the area under the curve (AUC) and 95% confidence intervals (CI) calculated. Cut-off values were determined using Youden’s index, with corresponding sensitivity and specificity reported. Kaplan–Meier survival curves were plotted, and differences between groups were assessed using the log-rank test.
Cox proportional hazards regression analysis was applied to identify independent predictors of mortality. Variables significant in univariable analysis (p < 0.05) were entered into a multivariable model using forward stepwise selection (Wald method).
Multivariable Cox proportional hazards models were constructed using a stepwise forward selection procedure, retaining only variables with p < 0.05 and variance inflation factor (VIF) < 2 to minimize multicollinearity and overfitting. Cluster analysis was performed as an exploratory technique to identify homogeneous subgroups within the cohort. Hierarchical clustering was first applied using Ward’s method to estimate the optimal number of clusters based on the dendrogram structure and silhouette coefficients. K-means clustering was subsequently used to refine these groupings using continuous hematological and clinical variables. Latent class analysis was not adopted because it relies on categorical variables and requires larger sample sizes for model stability.
A two-tailed p-value < 0.05 was considered statistically significant for all analyses.
2.5. Ethics Statement
This study was approved by the Institutional Ethics Committee of the Constanta Clinical Hospital for Pneumophthisiology (Approval No. 6744/02.07.2024) and was conducted in accordance with the Declaration of Helsinki. Informed consent was obtained from all participants for the use of anonymized medical data for research purposes.
3. Results
3.1. Patient Characteristics
A total of 304 patients with histologically confirmed lung cancer were included in this study. According to the table, 68.4% of the patients were men, with a median age of 67 years (IQR = 62–72), living in an urban environment (59.9%). Most of the patients were active smokers (52%) or former smokers (31.2%), with a median pack-years value of 31 (IQR = 20–45). 36.8% of the patients had job exposure. ECOG performance status at diagnosis ranged from 0 to 3, with a median score of 1.
Detailed demographic and baseline clinical characteristics are provided in
Table 1.
3.2. Tumor Location
In terms of anatomical distribution, tumors were most frequently located in the right lung (54.6%), particularly in the upper lobe (33.2%). The left lung accounted for 45.4% of cases, with the upper lobe also being the predominant site (27.6%). Bilateral involvement was rare. The detailed distribution of tumor sites is shown in
Table 2.
3.3. Radiological Appearance
Radiological assessment revealed that the most common presentation was a pulmonary mass (64.5%). Patients who had pulmonary nodes were more frequently associated with survival (16.5% vs. 6.1%,
p = 0.022). These imaging characteristics are summarized in
Table 3.
3.4. TNM Classification and Survival
With respect to tumor staging, most patients presented with advanced disease, with T4 tumors in 61.2% of cases and distant metastases in 57.1%. Early-stage disease was less common but conferred a clear survival advantage; for example, stage IB accounted for 6.6% of survivors compared with only 0.9% among deceased patients, whereas stage IVC was strongly associated with mortality (9.9% vs. 2.2%,
p = 0.005). The complete stage-specific distribution of survivors and deceased patients according to the TNM classification is provided in
Supplementary Table S1.
Nodal involvement further stratified outcomes: patients with N0 disease showed significantly better survival (37.4% vs. 18.0%, p = 0.006), while N2 disease was the most frequent nodal stage (44.4%). Metastatic status also had a major prognostic impact. Patients without metastases were more likely to survive (50.6% vs. 37.3%, p = 0.010), whereas combined pulmonary and extrapulmonary metastases were strongly associated with death (26.9% vs. 12.4%).
Kaplan–Meier analysis confirmed these associations, showing that patients with stage ≤IIC had nearly double the median overall survival compared with those at stage ≥IIIA (
p = 0.001). Survival curves according to TNM stage and metastatic status are illustrated in
Figure 1 and detailed distributions are provided in
Table 4. Detailed Kaplan–Meier survival statistics for tumor aspect, T4 stage, nodal and metastatic status are presented in
Supplementary Table S2.
3.5. Hematological and Inflammatory Parameters in Survivors vs. Deceased Patients
Median values of hematological and inflammatory markers are presented in
Table 5. Graphical comparisons of hemoglobin, leukocyte, platelet and fibrinogen levels according to survival status are illustrated in
Supplementary Figure S1.
Compared with survivors, deceased patients exhibited significantly lower Hb levels and mean corpuscular volume, as well as higher WBC and PLT counts and fibrinogen concentrations (all p < 0.05). MCHC also showed a slight but statistically significant increase in deceased patients. In contrast, ESR, CRP, and D-dimer did not differ significantly between groups.
These findings indicate that anemia, leukocytosis, thrombocytosis and elevated fibrinogen levels are more strongly associated with poor prognosis, whereas inflammatory and coagulation markers such as CRP and D-dimer have limited discriminative value in this cohort. Corresponding ROC curve analyses for mortality prediction using hematological parameters are summarized in
Supplementary Table S3.
Kaplan–Meier survival analysis demonstrated that anemia (Hb < 11.85 g/dL) and leukocytosis (WBC > 11.71 × 10
9/L) were strongly associated with shorter overall survival (both
p < 0.001). Thrombocytosis (PLT > 358.5 × 10
9/L) and hyperfibrinogenemia (>4.06 g/L) were also significantly correlated with poorer outcomes (
p = 0.021 and
p = 0.032, respectively). By contrast, CRP and D-dimer did not show significant associations with survival. Representative survival results and curves for Hb and WBC count are presented in
Table 6 and
Figure 2.
3.6. Cox Regression Analysis
In the univariate Cox regression analysis, several parameters were associated with increased mortality risk, including advanced tumor stage, nodal involvement, presence of metastases, anemia, leukocytosis, thrombocytosis and elevated fibrinogen (all p < 0.05). However, in the multivariable model, only three factors remained independent predictors of mortality: T4 tumor stage (HR 1.47, 95% C.I.: 1.08–2.01, p = 0.015), anemia defined as Hb < 11.85 g/dL (HR 1.70, 95% C.I.: 1.27–2.27, p < 0.001), and leukocytosis defined as WBC > 11.71 × 109/L (HR 1.54, 95% C.I.: 1.15–2.05, p = 0.004).
These results emphasize that, beyond TNM staging, routine hematological markers such as Hb and WBC count provide independent prognostic information in lung cancer. The detailed results of the univariate and multivariable analyses are shown in
Table 7.
3.7. Cluster Analysis
Cluster analysis integrating tumor stage and hematological parameters identified two distinct patient subgroups with significantly different clinical profiles. Cluster 1 included predominantly patients with less advanced disease, while Cluster 2 was defined by a high prevalence of T4 tumors (87.1% vs. 40.9%,
p < 0.001), anemia (58.3% vs. 15.8%,
p < 0.001), and leukocytosis (80.6% vs. 11.7%,
p < 0.001). Patients assigned to Cluster 2 exhibited a more aggressive disease pattern and were associated with significantly poorer survival outcomes compared with Cluster 1. Detailed cluster characteristics and distances between final cluster centers are provided in
Supplementary Table S4. The clinical distribution of these defining variables is presented in
Table 8, while the graphical representation of cluster assignment is shown in
Figure 3.
Using a univariable Cox-proportional hazard regression model, with type of cluster as an independent variable, shows that patients in Cluster 2 have an increased risk of death by 2.334 times (95% C.I. = 1.764–3.089) than patients in Cluster 1.
4. Discussions
4.1. Summary of Main Findings
In this retrospective cohort of 304 patients with lung cancer, we evaluated the prognostic role of routine hematological markers in the context of TNM classification and ECOG performance status. Our main findings can be summarized as follows. First, tumor stage remained the strongest determinant of survival, with patients diagnosed at stage ≤IIC having nearly double the median overall survival compared with those at stage ≥IIIA. Nodal involvement and the presence of distant metastases were likewise associated with significantly poorer outcomes, confirming the well-established prognostic gradient across TNM categories.
Second, among the hematological parameters, anemia (Hb < 11.85 g/dL) and leukocytosis (WBC > 11.71 × 109/L) were identified as independent predictors of mortality in the multivariable Cox model, together with T4 tumor stage. PLT count and fibrinogen showed prognostic value in univariate and Kaplan–Meier analyses but did not retain significance after adjustment for other variables, while CRP and D-dimer were not significantly associated with survival.
Third, the application of cluster analysis integrating tumor stage with hematological parameters revealed two distinct subgroups with markedly different prognoses. Patients in Cluster 2, characterized by advanced disease (T4), anemia, and leukocytosis, had more than a two-fold higher risk of death compared with those in Cluster 1.
Taken together, these results emphasize that, beyond conventional TNM staging, simple and routinely available hematological markers can refine prognostic assessment. The integration of such variables through cluster analysis may help capture clinically relevant heterogeneity and provide a pragmatic framework for individualized risk stratification in lung cancer.
Although several hematological markers showed significant associations with survival in univariate analyses, only three retained independent prognostic value in the multivariable Cox model. This likely reflects the moderate sample size and the interdependence between inflammatory and tumor-related variables. The stepwise approach was applied to ensure model robustness and to prevent overfitting.
4.2. Comparison with Previous Literature
Numerous studies have confirmed the prognostic significance of D-dimer, fibrinogen and CRP in lung cancer, with abnormal levels consistently associated with reduced overall survival [
8,
14,
15].
4.2.1. Survival According to TNM Stages
The study results revealed marked differences in survival according to tumor stage. Patients diagnosed at early stages (IA, IB, IIA, IIB) had a higher probability of survival, being more frequently alive at the time of analysis (e.g., survivors 6.6% in IB vs. 0.9% among deceased). In contrast, advanced stages, particularly IVC (9.9% vs. 2.2%,
p = 0.005), were strongly associated with higher mortality. Kaplan–Meier analysis confirmed that median survival for patients in stages ≤IIC was nearly double compared with those in ≥IIIA, emphasizing the importance of early detection. These findings align with population-based data, such as SEER and European registries, which show that 5-year survival exceeds 60% for patients diagnosed at stage I but drops below 10% for those at stage IV [
16].
Our findings regarding the poor survival outcomes of patients with stage IV disease are consistent with the recent analysis by Ayoade et al. (2025), which documented meaningful survival gains in stage IV NSCLC across sociodemographic strata. Using population-based data, the authors demonstrated that although stage IV remains associated with dismal overall survival, advances in systemic therapy—particularly immunotherapy and targeted approaches—have led to incremental improvements in survival over the past decade. Importantly, these benefits were not equally distributed, with disparities observed by age, race and socioeconomic status, highlighting the persistent inequities in access to novel therapies [
17]. This supports the idea that prognostic models based on easily accessible biomarkers, like those analyzed in our study, hold significant value in environments where molecular testing and expensive therapies are not readily available.
4.2.2. Nodal Status and Metastases
Regarding lymph node involvement, patients with N0 had significantly better survival (37.4% vs. 18%, p = 0.006) compared to those with N2/N3 disease, where the risk of death was markedly increased. The presence of metastases was a significant negative prognostic factor: patients without metastases had a survival rate of 50.6% compared to 37.3% in those with dissemination, while combined pulmonary and extrapulmonary metastases were associated with the poorest outcomes (26.9% vs. 12.4%, p = 0.010).
This distribution is consistent with findings from multicenter European and Asian studies, where metastatic status is considered the strongest predictor of overall survival [
18].
4.2.3. Cluster Analysis
The application of cluster analysis in our cohort enabled the stratification of patients into two distinct prognostic subgroups. Cluster 1 was characterized by less advanced stages (T1–T3), absence of anemia, and absence of leukocytosis and was associated with a more favorable prognosis. In contrast, Cluster 2 consisted predominantly of patients with T4 tumors, frequently presenting anemia and leukocytosis, and demonstrated a 2.3-fold higher risk of death (HR = 2.334, 95% CI: 1.764–3.089). Comparable findings have been reported in external cohorts. Wang et al. (2023) showed that the integration of hematological and inflammatory markers into cluster analysis identified subgroups with distinct therapeutic responses and survival trajectories in SCLC [
19]. Similarly, a real-world analysis published by Li et al. (2024) confirmed that combining clinical and biological data through clustering allows more refined risk stratification in extensive-stage SCLC, identifying cohorts with median survival differences of up to 8 months [
20].
These findings highlight the added prognostic value of unsupervised learning techniques such as clustering, which can detect clinically relevant heterogeneity beyond TNM staging. By integrating simple laboratory markers with tumor characteristics, cluster-based approaches provide a pragmatic framework for risk-adapted treatment strategies and patient counseling in lung cancer.
4.2.4. D-Dimer
D-dimer represents a fragment produced during the dissolution of blood clots, reflecting ongoing activation of both coagulation and fibrinolytic pathways [
21].
Prospective and retrospective studies consistently support its prognostic relevance. Ay et al. (2012), in the extensive Cancer and Thrombosis Study (CATS), including 1178 patients with various malignancies, demonstrated that elevated baseline D-dimer levels independently predicted reduced overall survival, with a hazard ratio (HR) of approximately 1.5 per doubling of the value [
8].
A meta-analysis by Ma et al. (2021), encompassing 28 studies with a total of 8452 lung cancer patients, demonstrated that elevated plasma D-dimer levels were significantly associated with reduced overall survival (HR = 1.742, 95% CI: 1.542–1.969,
p < 0.001) and progression-free survival (HR = 1.385, 95% CI: 1.169–1.641,
p = 0.003) compared with lower levels. Subgroup analyses indicated that tumor localization, detection methodology, and disease stage influenced the strength of this association. These results provide robust evidence supporting high plasma D-dimer as an independent negative prognostic biomarker in lung cancer [
22].
In surgical cohorts, Li et al. (2022) found that lower preoperative D-dimer levels were associated with significantly longer survival in stage I–II NSCLC after complete resection, indicating prognostic relevance even in early-stage disease. In a separate retrospective study of 277 patients with advanced NSCLC, the same group stratified patients by pretreatment plasma D-dimer levels—normal (≤0.5 µg/mL) versus elevated (>0.5 µg/mL)—and demonstrated that high D-dimer was independently associated with shorter progression-free survival (PFS) and overall survival (OS) (
p < 0.01). After propensity score matching, elevated D-dimer remained strongly linked to poorer outcomes, with median PFS of 6.4 versus 11.5 months (HR = 1.70, 95% CI: 1.25–2.37,
p < 0.001) and median OS of 12.7 versus 30.4 months (HR = 2.29, 95% CI: 1.54–3.41,
p < 0.001). Collectively, these findings underscore the prognostic utility of D-dimer across both early- and advanced-stage NSCLC [
23].
However, in our cohort, D-dimer did not remain an independent prognostic significance after multivariable adjustment, suggesting that its apparent effect may be confounded by tumor stage and overall biological status. While D-dimer reflects cancer-related hypercoagulability, its effectiveness as an independent prognostic biomarker should be interpreted with caution and needs further validation through larger, multicenter studies.
The discrepancy between our results and those of previous large-scale studies may reflect methodological and cohort-specific differences. Unlike meta-analyses that included heterogeneous populations and early-stage disease, our cohort predominantly consisted of patients with advanced or metastatic tumors, in whom D-dimer elevation is almost universal and therefore less discriminative. In addition, differences in assay standardization and cutoff definitions may have reduced the statistical sensitivity to detect subtle prognostic gradients.
4.2.5. Fibrinogen
Fibrinogen has been widely investigated as a prognostic biomarker in lung cancer, with consistent evidence linking elevated levels to adverse survival outcomes. Zhong et al. (2018), in a meta-analysis of 16 studies including 6881 patients, reported that high plasma fibrinogen was significantly associated with worse overall survival (HR = 1.38, 95% CI: 1.22–1.55,
p < 0.001) and shorter disease-free or progression-free survival (HR = 1.29, 95% CI: 1.01–1.65,
p = 0.042). The association persisted in the NSCLC subgroup, where elevated fibrinogen predicted poorer OS (HR = 1.32, 95% CI: 1.14–1.53,
p < 0.001) [
15]. Similarly, Zhang et al. (2019) analyzed 20 studies involving 6494 patients and confirmed the adverse prognostic impact of high fibrinogen, with pooled HRs of 1.44 for OS, 1.49 for PFS, and 1.69 for DFS. Moreover, elevated fibrinogen correlated with aggressive tumor features, including lymph node metastasis and advanced stage [
24].
The prognostic value of fibrinogen has also been demonstrated in specific clinical contexts. Sinn et al. (2022) evaluated patients with stage III/N2 NSCLC treated with neoadjuvant therapy and found that post-treatment fibrinogen > 400 mg/dL was associated with significantly shorter OS (28.2 vs. 60.9 months; HR ≈ 1.78,
p = 0.048). Interestingly, a decline in fibrinogen following therapy independently predicted better survival, suggesting that changes in fibrinogen may reflect treatment response [
25]. In early-stage disease, Mitsui et al. (2022) reported that high preoperative fibrinogen was linked to worse survival even after complete resection and was associated with pathological features of tumor aggressiveness, such as lymphovascular invasion [
26].
Our results align with this body of evidence, showing that elevated fibrinogen levels were associated with early mortality in lung cancer patients, though not independently in multivariable analysis. Taken together, these findings highlight fibrinogen’s role not only as a marker of systemic inflammation and hypercoagulability but also as a surrogate for underlying tumor biology. Its routine availability and low cost further support its consideration in prognostic models, particularly in settings where access to molecular or advanced imaging biomarkers is limited.
4.2.6. CRP
Multiple studies have established a strong association between elevated CRP levels and poor survival outcomes in lung cancer.
In a bi-center retrospective study of patients with NSCLC treated with PD-1/PD-L1 inhibitors, Riedl et al. (2020) reported that elevated baseline CRP was an independent predictor of poor prognosis, being significantly associated with reduced progression-free survival (PFS) and overall survival (OS), with hazard ratios per doubling of CRP of 1.37 for PFS and 1.42 for OS [
27].
Regarding SCLC, Stensvold et al. (2021) analyzed a Danish registry including 923 patients. They found that pretreatment CRP levels above the clinical threshold (>8 mg/L) were significantly associated with worse overall survival (adjusted HR = 1.25, 95% CI: 1.08–1.46) [
14].
More recent evidence, such as that provided by Onodera et al. (2023), suggests that elevated CRP levels prior to initiation of immune checkpoint inhibitor (ICI) therapy are linked not only to poorer prognosis but also to a higher incidence of immune-related adverse events (irAEs). High CRP was significantly associated with shorter survival, with median PFS of 2.2 versus 3.3 months and median OS of 8.9 versus 39.1 months [
28].
Our findings are in partial agreement with these observations, as elevated CRP (>20 mg/L) was associated with early mortality in our cohort. This association likely reflects the impact of a systemic pro-tumorigenic inflammatory milieu, which contributes to angiogenesis, immune evasion and metastatic spread. However, in contrast to the consistent results reported in the literature, CRP did not remain an independent predictor in our multivariable model. This indicates that its clinical utility should be assessed alongside other parameters such as Hb, WBC and tumor T4, which more accurately capture the independent prognostic signal in our cohort.
The lack of independent prognostic value of CRP in our multivariable model contrasts with several studies reporting its significance. This inconsistency may be explained by variations in patient selection, disease stage, and cutoff thresholds across studies. As most individuals in our cohort presented with late-stage disease, the uniformly elevated inflammatory status might have diminished the discriminatory power of CRP. Furthermore, the moderate sample size could have limited our ability to detect weaker associations.
4.2.7. Integration of Hematological Biomarkers with ECOG Performance Status
Only a limited number of studies have explicitly combined inflammatory or coagulation-related biomarkers with ECOG performance status into a single prognostic framework.
A relevant example is provided by a 2023 retrospective study of patients with SCLC treated with etoposide–cisplatin–based chemotherapy. The authors assessed risk factors for febrile neutropenia (FN) and identified ECOG performance status, together with nutritional status and inflammatory markers, as independent predictors of FN risk (
p < 0.001 for the overall model;
p = 0.019 for ECOG-PS). This integrative approach illustrates the value of combining biological, clinical, and functional parameters for patient risk stratification [
29].
Similarly, in our study, the incorporation of ECOG performance status with routine hematological biomarkers provided a comprehensive prognostic framework, capturing both tumor-related biological activity and patient functional reserve.
4.3. Limitations
This study has several important limitations. First, its retrospective and single-center design limits the generalizability of the findings, as the patient population and management strategies may not fully reflect those in other institutions or regions. This design allowed comprehensive data collection and consistent laboratory testing. Future multicenter prospective studies are required to validate the reproducibility and external applicability of these findings.
Another important limitation is the lack of external validation. Although internal validation was performed through ROC, Kaplan–Meier, and multivariable Cox analyses, the predictive performance of the proposed model requires confirmation in independent, multicenter cohorts. Future studies with larger and more diverse populations are needed to verify its generalizability and clinical utility.
Furthermore, the analysis included only complete cases, which may have introduced selection bias and potentially led to unrepresentative estimates of prognostic associations. Survival analysis was restricted to patients for whom complete mortality data, including the exact date of death, were available. As a result, only 304 of the 602 eligible cases were included in the survival assessment, which may further limit applicability to the broader lung cancer population. The exclusion of nearly half of the screened population—mainly due to missing laboratory or follow-up information—may have introduced a selection bias favoring cases with more comprehensive clinical documentation. To mitigate this concern, baseline characteristics such as age, sex, and histological subtype were compared between included and excluded patients, revealing no substantial differences. Nevertheless, this limitation should be acknowledged when generalizing the study findings to the broader lung cancer population.
Treatment-related variables, including surgery, chemotherapy, and immunotherapy, were not included in the analysis because these data were incomplete in the registry and outside the primary focus of this study. Our objective was to assess the prognostic impact of routinely available hematological and clinical parameters at diagnosis, independent of subsequent therapeutic interventions. Nevertheless, we acknowledge that treatment heterogeneity may have influenced survival outcomes, and future prospective studies should address this aspect.
Another limitation is the absence of imaging-based or molecular biomarkers, such as radiomics features or genomic alterations, which have been shown to enhance the accuracy of prognostic models in lung cancer. The inclusion of such parameters, along with external validation in independent, multicenter cohorts, would strengthen the robustness and clinical applicability of the proposed model. Future research should aim to integrate routine hematological parameters with advanced molecular and imaging biomarkers to develop comprehensive, multimodal prognostic tools capable of informing individualized treatment strategies.
4.4. Clinical Implications and Future Directions
The prognostic model developed in this study, integrating four routinely measured hematological biomarkers with ECOG performance status, offers a pragmatic and cost-effective tool for survival risk stratification in lung cancer. All included variables are inexpensive, widely available and part of standard clinical workflows, which makes the model particularly relevant in resource-limited healthcare systems where access to molecular testing, advanced imaging, or targeted therapies is restricted.
By combining objective laboratory measurements with functional status, the model captures both tumor-related biological activity and the patient’s physiological reserve—two complementary dimensions that critically influence treatment tolerance and survival outcomes. In clinical settings, such an approach could support early identification of high-risk patients, guiding treatment decisions toward tailored strategies, such as prioritizing supportive care, closer follow-up or targeted interventions to improve functional capacity.
Furthermore, the simplicity of the model facilitates its integration into routine oncology practice without the need for specialized equipment or expertise, enabling its potential adoption in both tertiary cancer centers and peripheral hospitals. With external validation in diverse populations, this approach could contribute to more equitable prognostic assessment and better allocation of healthcare resources in lung cancer management.
The integration of routine hematological markers into existing prognostic systems such as TNM staging and ECOG performance status may enhance their predictive accuracy. While TNM classification provides an anatomical assessment of disease burden, parameters such as Hb and WBC count reflect the patient’s systemic and inflammatory status, offering additional biological context. Incorporating these readily available variables could help identify high-risk patients within the same TNM stage, guiding follow-up intensity and treatment decisions. This combined approach may be particularly valuable in healthcare settings with limited access to molecular or imaging biomarkers.
5. Conclusions
In this retrospective analysis, we evaluated the prognostic value of routine hematological markers (Hb, CRP, fibrinogen, D-dimer) integrated with ECOG performance status in patients with lung cancer. The results demonstrated that anemia (Hb < 11.85 g/dL) and leukocytosis (WBC > 11.71 × 109/L), together with the presence of tumor T4, were independent predictors of mortality. CRP, fibrinogen and D-dimer showed significant associations in univariate and Kaplan–Meier analyses, but not in the multivariate model, suggesting that other clinico-biological factors influence their prognostic value.
Integrating hematological markers with ECOG performance status remains a practical and cost-effective strategy for risk stratification, especially in resource-limited healthcare systems. Prospective multicenter studies are needed to validate these results externally and to explore multimodal models that also include molecular biomarkers or advanced imaging parameters.
This highlights the importance of validating simple, inexpensive prognostic markers in real-world settings, while acknowledging their limitations compared with molecular or imaging biomarkers.