1. Introduction
High-grade glioma (HGG) is the most common primary malignant brain tumor, with an incidence of 3–5 cases per 100,000 people [
1,
2]. According to the World Health Organization (WHO) 2021 classification, HGGs include several entities, such as IDH-mutant grade III and grade IV astrocytomas, as well as IDH-mutant and 1p/19q-codeleted oligodendrogliomas [
2]. A maximal safe surgical resection, along with systemic therapy and radiation, is the mainstay of treatment [
3,
4]. Despite the application of multimodal therapy regimens and emerging approaches such as targeted therapy, immunotherapy, and tumor-treating fields, the prognosis remains poor [
4]. Median survival for grade III gliomas ranges from 24 to 72 months, while for grade IV gliomas, it is only 14 to 16 months [
5,
6,
7].
Perioperative factors, including volative anesthetic agents, may influence oncological outcomes by modulating host immunity [
6,
7]. Although this has been explored in colorectal and prostate cancers, its relevance in HGG remains unclear and underexplored. Given the poor prognosis of HGG, it is crucial to investigate all potentially modifiable factors, including anesthetic technique. Clinical and preclinical studies suggest that the type of anesthesia may influence patient outcomes with HGGs. Xu et al. found that propofol exposure in glioma cell cultures inhibited proliferation and invasion and induced cell apoptosis [
8]. Shi et al. found that sevoflurane promoted glioma stem cell growth [
9]. Huang et al. found that patients undergoing glioblastoma (GB) surgery with total intravenous anesthesia (TIVA) had improved OS and fewer postoperative complications compared to those who received inhalational anesthesia (INHA) [
3]. On the other hand, other studies have reported conflicting results. A retrospective study showed no significant difference in OS or PFS between patients undergoing surgery under TIVA versus INHA [
1]. Similarly, Schmoch et al. found no difference in survival outcomes in patients with IDH-wildtype GBM between the two anesthetic groups [
5].
The choice of anesthetic agent during glioma surgery may have implications extending beyond perioperative management, potentially influencing tumor biology and patient outcomes [
8,
9]. Preclinical studies have demonstrated that propofol exerts antitumor effects by inhibiting glioma cell proliferation, invasion, and migration, primarily through mechanisms such as upregulation of microRNA-218 and suppression of hypoxia-inducible factor 1-alpha (HIF-1α) and matrix metalloproteinase-2 (MMP-2) activity [
10,
11,
12]. Conversely, volatile anesthetics like sevoflurane have been associated with enhanced glioma stem cell expansion and reduced apoptosis, possibly promoting tumor aggressiveness via activation of hypoxia-inducible factors [
13]. Additionally, inhalational agents differ from intravenous anesthetics in their modulation of neuroinflammation and immune responses, influencing oxidative stress and cytokine release [
14]. These mechanistic differences suggest that anesthetic choice may affect glioma progression and patient survival, underscoring the need for comprehensive evaluation of clinical outcomes associated with different anesthesia techniques.
Despite these findings, most of the available studies are limited by small sample sizes, methodological heterogeneity, and retrospective study designs. To date, no conclusive evidence has established whether anesthetic choice impacts long-term outcomes in HGG patients, which has resulted in a notable gap in the existing literature. To bridge this gap, we performed a systematic review and meta-analysis to compare the impact of total intravenous anesthesia (TIVA) versus inhalational anesthesia (INHA) on overall survival (OS) and progression-free survival (PFS) in patients with high-grade gliomas (HGGs). By integrating evidence from multiple studies, our analysis offers a comprehensive assessment of how anesthetic technique may influence OS and PFS in this patient population. To our knowledge, this is the first meta-analysis specifically addressing this association. The results are intended to support future clinical guidelines and underscore the potential of anesthetic choice as a modifiable factor in optimizing oncologic outcomes for individuals with HGGs.
4. Discussion
The potential influence of anesthetic technique on long-term oncological outcomes following cancer surgery remains a subject of intense debate and investigation. For patients undergoing resection of gliomas, particularly high-grade gliomas like glioblastoma, where prognosis is often poor despite multimodal therapy, any modifiable factor that could impact survival or recurrence is of significant clinical interest. In this systematic review and meta-analysis of 5 studies and 827 patients, we evaluated the effect of TIVA versus INHA on OS and PFS in HGG patients. The main findings from the pooled analyses were as follows: (1) There were no statistically significant differences between the groups regarding OS or PFS; (2) In terms of mOS or mPFS, there were also no statistically significant differences between the groups; (3) Single-arm meta-analyses of mOS and mPFS for TIVA and INHA also showed no statistically significant subgroup differences. Our review highlights the lack of comprehensive data on the impact of anesthetic technique on long-term outcomes in HGG patients. This meta-analysis aimed to synthesize the available evidence comparing TIVA, typically propofol-based, with INHA regarding OS and PFS in this patient population.
The principal findings of this analysis, encompassing data from 827 patients (406 receiving TIVA, 421 receiving INHA) across five retrospective studies, indicate no statistically significant difference between the two anesthetic techniques for either primary endpoint. Specifically, the pooled HR for OS was 0.77 (95% CI [0.58; 1.02]; p = 0.07), and for PFS, the HR was 0.88 (95% CI [0.70; 1.10]; p = 0.27). While the point estimate for OS suggested a potential 23% reduction in the hazard of death associated with TIVA, the result did not achieve conventional statistical significance, and the confidence interval included the possibility of no difference. The PFS results more clearly indicated equivalence between the groups. However, these summary estimates must be interpreted with considerable caution due to the substantial heterogeneity observed, particularly for OS (I2 = 67%), and moderate heterogeneity for PFS (I2 = 51%), alongside indications of publication bias. Notably, subgroup analysis by tumor grade revealed a statistically significant survival benefit associated with TIVA in grade IV tumors (HR 0.70; 95% CI [0.51–0.96]; p = 0.03). However, no significant effect was observed in the mixed grade III–IV subgroup. Although the test for subgroup differences did not reach significance (p = 0.0669), this trend suggests that tumor grade may influence anesthetic effect and highlights the need for stratified analysis in future studies.
The pooled analysis of OS yielded an HR of 0.77 in favor of propofol-based TIVA versus INHA, with a
p-value of 0.07, just shy of conventional significance. However, inconsistency across studies was high (I
2 = 67%), indicating that the true effect likely varies from one setting or patient population to another. Such heterogeneity means that the pooled HR may not accurately reflect a uniform benefit of TIVA for all glioma patients. This is further supported by the grade-specific analysis, which demonstrated a significant benefit of TIVA in grade IV patients, but not in mixed-grade populations—pointing to potential biological or clinical differences between tumor grades that may mediate anesthetic effects. A Baujat plot pinpointed Dong et al. (2019) [
1] as a major driver of both the overall effect and heterogeneity. In their cohort of high-grade glioma patients, median OS and PFS did not differ overall between propofol TIVA and sevoflurane INHA (both medians 18 months). Yet, when patients were stratified by preoperative Karnofsky Performance Status (KPS), those with poorer function (KPS < 80) experienced significantly longer OS with propofol (15 vs. 11 months) and a lower risk of death, whereas no difference emerged in the higher-function subgroup [
1]. This context-dependent finding—absent from other trials—likely inflated the apparent benefit of TIVA in the pooled estimate. Heterogeneity was further compounded by the study of Huang et al. (2021), which compared propofol TIVA to desflurane-based INHA in glioblastoma surgery and found a marked OS benefit for TIVA (matched HR 0.51; 95% CI 0.29–0.89;
p = 0.011) both before and after propensity matching [
3]. In contrast, Grau et al. (2020) and Schmoch et al. (2021)—each comparing TIVA to sevoflurane INHA in similar glioblastoma populations—reported no significant OS differences, and Kumaria et al. (2024) likewise observed equivalent survival between propofol/remifentanil TIVA and sevoflurane INHA groups [
2,
4,
5]. These contradictory outcomes underscore that the specific inhalational agent matters: desflurane and sevoflurane differ in their biochemical properties and potential tumor-modulating effects, so grouping all volatile agents into one category injects clinical and biological variability into the meta-analysis.
Beyond heterogeneity, the funnel plot for OS was markedly asymmetric, suggesting the presence of publication bias. Specifically, smaller studies showing no difference—or even favoring INHA—may have gone unpublished or remained inaccessible, skewing the available literature toward positive findings for TIVA. This selective publication may inflate the apparent benefit of TIVA in the pooled analysis and contribute to the non-significant trend observed (HR 0.77; p = 0.07). Such bias threatens the validity of the conclusions by systematically distorting effect estimates away from the null, especially when fewer studies are available. The clinical implication is that the observed trend favoring TIVA should be interpreted with caution, as the true effect may be closer to null than suggested by the current data. Likewise, subgroup analysis by risk of bias did not substantially alter the overall effect (I2 = 63%), indicating that study quality alone does not account for the observed variability. The single-arm meta-analysis estimated median OS at 20.93 months for TIVA and 19.28 months for INHA (p = 0.71), but extreme heterogeneity within each arm (I2 = 88% for TIVA, I2 = 90% for INHA) renders these values illustrative at best. Together, the potential presence of publication bias and underlying clinical heterogeneity underscore the need for well-powered prospective studies with pre-registered protocols that include both positive and null results.
The PFS found an HR of 0.88 (95% CI 0.70–1.10;
p = 0.27), indicating no statistically significant difference between propofol-based TIVA and volatile INHA. The confidence interval includes 1.0 comfortably, suggesting equivalence in the risk of disease progression or death when data are aggregated. Moderate statistical heterogeneity was observed (I
2 = 51%), implying that roughly half of the variation in the study results is due to real differences rather than chance. A Baujat plot identified Huang et al. (2021) [
3] and Schmoch et al. (2021) [
5] as the most influential contributors to this heterogeneity. In particular, Huang et al. reported a significant reduction in postoperative tumor recurrence for TIVA versus desflurane INHA (matched HR 0.60; 95% CI 0.37–0.97;
p = 0.040) [
3]. This contrasts sharply with trials comparing TIVA to sevoflurane. Similarly, Grau et al. (2020) compared propofol TIVA to predominantly sevoflurane INHA and found virtually identical median recurrence-free survival (RFS): 8.4 months in the TIVA group versus 8.0 months in the INHA group [
2]. Dong et al. (2019) similarly reported no significant difference in median PFS (10 months for TIVA vs. 11 months for sevoflurane) [
1]. Schmoch et al. (2021) also detected no difference in the time to progression between propofol TIVA and sevoflurane INHA [
5]. The clear benefit seen with desflurane in Huang et al. thus appears to drive much of the heterogeneity when pooled with sevoflurane comparisons.
Assessment of publication bias via the funnel plot showed mild asymmetry, suggesting that smaller or negative studies may be underrepresented, though the bias is less pronounced than for OS. LOO sensitivity analyses confirmed that the pooled HR for PFS remained stable when any single study was omitted, indicating that no one trial unduly skewed the overall estimate. A subgroup analysis using MD rather than HR likewise showed no significant effect: MD 0.49 months (95% CI –0.74 to 1.72; p = 0.43) with zero heterogeneity (I2 = 0%). However, this metric examines a different aspect of the data and does not eliminate the variability seen in HR analyses. The single-arm meta-analyses estimated a pooled median PFS of 10.08 months for the TIVA cohorts and 9.28 months for the INHA cohorts (p = 0.75). However, within-arm heterogeneity was extremely high (I2 = 93% for TIVA, I2 = 89% for INHA), making these median values illustrative at best rather than definitive.
This meta-analysis of retrospective studies found no statistically significant difference between propofol-based TIVA and INHA for either OS or PFS in glioma resection. For OS, there was a non-significant trend favoring TIVA (HR 0.77;
p = 0.07) accompanied by high heterogeneity (I
2 = 67%) and funnel-plot asymmetry, suggesting publication bias [
1,
3]. PFS results were more clearly equivalent (HR 0.88;
p = 0.27) with moderate heterogeneity (I
2 = 51%), and leave-one-out sensitivity analyses confirmed the robustness of the null finding [
2,
5].
The divergence of Huang et al. (2021) [
3]—who compared TIVA to desflurane and found significant OS and PFS benefits (matched OS HR 0.51;
p = 0.011; matched PFS HR 0.60;
p = 0.040)—from other studies using sevoflurane suggests that the choice of volatile agent drives much of the heterogeneity and may account for any apparent TIVA advantage [
3]. Dong et al. (2019) further demonstrated that TIVA may benefit patients with poorer preoperative function (KPS < 80) even when compared to sevoflurane (median OS 15 vs. 11 months) [
1]. No survival differences were seen in comparisons of TIVA versus sevoflurane by Grau et al. (2020), Schmoch et al. (2021), or Kumaria et al. (2024) [
2,
4,
5]. The high heterogeneity, potential publication bias, and retrospective design limit the strength of conclusions, though operative efficiency benefits with TIVA (shorter operative times: 104 vs. 129 min) may still inform anesthetic choice in practice [
4].
Subgroup analyses by tumor grade added further nuance to our findings. While the overall pooled analysis showed no significant difference in OS between TIVA and INHA, patients with grade IV gliomas experienced a statistically significant OS benefit with TIVA. This effect was not observed in mixed grade III–IV cohorts, and the test for subgroup differences did not reach significance (p = 0.0669). Thus, this result should be interpreted with caution, as the observed effect in grade IV tumors may reflect within-group variability rather than a true differential effect across tumor grades. Nonetheless, the finding supports the hypothesis that the benefit of TIVA may be confined to higher-grade tumors, possibly due to differences in tumor biology or treatment responsiveness. Future prospective studies should incorporate stratified analyses by tumor grade to validate this effect.
The present meta-analysis combines data from 827 glioma resection patients and employs rigorous methods—heterogeneity assessment, publication-bias evaluation, and LOO sensitivity analyses—to address a focused clinical question. However, its findings are tempered by several key limitations:
All included studies were retrospective in design, which introduces inherent methodological limitations, including selection bias, unmeasured confounding, and lack of standardization across cohorts. One important source of bias is confounding by indication, as the choice of anesthetic technique may have been influenced by patient-specific factors (e.g., comorbidities, functional status), tumor characteristics (e.g., location, grade), or institutional preferences and protocols. These influencing factors are often not fully reported or controlled for in retrospective analyses. Even in studies that used propensity score matching to balance baseline characteristics, residual confounding cannot be ruled out, particularly for variables not included in the matching process. This limitation is fundamental and restricts the ability to draw causal inferences between anesthetic modality and survival outcomes. As such, our findings should be interpreted as exploratory and hypothesis-generating rather than definitive. Future prospective, randomized studies are needed to establish causality and isolate the effect of anesthetic choice from underlying confounders.
An important limitation of this meta-analysis is the aggregation of different volatile anesthetics, which may have distinct pharmacologic and biologic effects. For instance, sevoflurane and desflurane differ in their mechanisms of action, neuroinflammatory profiles, and reported impacts on tumor biology. Pooling these agents under a single INHA group may contribute to the observed statistical heterogeneity and could mask agent-specific effects. In particular, the substantial heterogeneity observed for OS (I2 = 67%) raises concerns about the appropriateness of combining these agents statistically. This heterogeneity may compromise the validity of the pooled estimates and limit the generalizability of the findings to clinical decision-making. Future meta-analyses and prospective trials should perform stratified analyses by specific volatile agents to delineate their individual impact on glioma outcomes and guide more tailored anesthetic strategies.
In addition to methodological concerns, there is growing evidence that different anesthetic agents may exert distinct biological effects on glioma cells and the tumor microenvironment. Preclinical studies suggest that propofol may inhibit glioma cell proliferation, invasion, and migration and promote apoptosis through mechanisms such as the upregulation of microRNA-218 and suppression of HIF-1α and MMP-2 activity [
8,
11]. In contrast, sevoflurane has been shown to enhance glioma stem cell expansion by activating hypoxia-inducible factors and to suppress apoptosis, thereby potentially promoting tumor aggressiveness [
9,
12]. Additionally, volatile agents such as sevoflurane and desflurane differ in their neuroinflammatory and immunomodulatory profiles, with evidence indicating that they modulate oxidative stress, cytokine release, and immune cell activity in distinct ways [
13,
14]. These mechanistic differences may underlie the variable clinical outcomes observed in studies comparing total intravenous anesthesia with inhalational anesthesia and highlight the importance of agent-specific investigation in future research. Clarifying these biologic effects could provide critical insight into how anesthetic choice may influence glioma progression and survival.
Incomplete adjustment for prognostic factors—such as MGMT methylation, IDH status, histological subtype, and extent of resection—limits the ability to explore subgroup effects or perform fully adjusted analyses. Moreover, while tumor grade was examined as a subgroup, the lack of consistent reporting and stratification across studies limited our ability to perform more nuanced, grade-specific analyses. Nevertheless, the observed OS benefit in grade IV tumors warrants further exploration in stratified or grade-specific trials. These unaccounted prognostic variables may confound the association between anesthetic technique and survival outcomes, thereby limiting the strength of causal inferences and the validity of conclusions regarding independent anesthetic effects.
Evidence of publication bias, particularly for OS, suggests that the pooled HR may overestimate any true effect, with the real difference likely closer to null. Although the funnel plot for PFS showed only mild asymmetry, the potential for selective reporting cannot be excluded and may still influence effect estimates. This underscores the need for comprehensive reporting of neutral findings in future studies.
These limitations underscore the need for prospective randomized trials stratified by inhalational agent and accounting for key tumor and patient characteristics. We sought to overcome these limitations by applying LOO sensitivity analyses, as well as Baujat and funnel plot analyses, alongside subgroup analyses stratified by risk of bias and tumor grade, and secondary analyses with median OS and PFS values, as well as single-arm meta-analyses using pooled medians for each anesthesia type.