1. Introduction
Immunotherapies have demonstrated durable survival benefits in a small fraction of patients with recurrent and metastatic head and neck squamous cell carcinoma (HNSCC) [
1,
2]. A major challenge with their widespread utility in HNSCC has been an inability to reliably predict response. One factor contributing to this limitation is an incomplete understanding of the composition and roles of immune cell subtypes in the HNSCC tumor microenvironment (TME) [
3,
4]. Immunotherapies targeting tumor infiltrating lymphocytes (TIL) have spurred investigations into the proportions and functional states of T lymphocytes, specifically [
5,
6,
7]. Accordingly, some aspects of TIL in HNSCC have been elucidated, including the importance of cytotoxic and regulatory T cells and the concept of T cell exhaustion. However, the roles of rare immune cell subsets remain less clear.
With the large quantity of publicly available bulk transcriptomic data, several recent studies have utilized computational deconvolution efforts to explore immune cell subtypes [
8,
9,
10,
11,
12,
13]. However, despite describing similar methodologies for deconvolution, these studies derived disparate conclusions regarding the prognostic associations of the immune infiltrate, overall [
8,
9,
10], as well as specific immune cell subtypes [
8,
11,
12]. Most importantly, existing studies have not integrated novel data on immune cell subtypes with demographic, clinicopathologic, or genomic features, including those with known clinical utility, to validate conclusions or to place them in an appropriate clinical context.
Here, we utilize the cell-type identification by estimating relative subsets of RNA transcripts (CIBERSORT) algorithm and the LM22 signature matrix [
14] to deconvolve HNSCC bulk RNA-sequencing data from The Cancer Genome Atlas (TCGA) and integrate it with clinicopathologic and genomic data in a multivariable survival analysis to examine the independent associations of immune cell subtypes with survival in HPV-negative HNSCC.
2. Materials and Methods
Bulk RNA-Seq data and clinical information of HNSCC tumors from TCGA. Bulk RNA-sequencing data of HNSCC tumors were obtained from The Cancer Genome Atlas (TCGA) database (
https://portal.gdc.cancer.gov/, dbGaP study accession phs000178, accessed on 1 September 2019). A total of 566 cases were selected using the following filtering criteria: primary site = head and neck, disease type = squamous cell neoplasms, sample type = primary tumor, experimental strategy = RNA-seq, and workflow type = HTseq (High Throughput sequencing) counts or FPKM (Fragments Per Kilobase Million). To maintain the biological uniformity of the cohort, HPV-positive oropharynx HNSCC patients were eliminated, with a filtered dataset of 517 cases. Clinical information for patients with HNSCC, including age, gender, smoking status, T stage, N stage, perineural invasion, lymphovascular invasion, and survival in months, was accessed on 1 September 2019.
CIBERSORT deconvolution analysis. We utilized the CIBERSORT online tool implementation [
14] to infer the proportions of 22 immune cell subtypes from bulk RNA-seq data. The HTseq-FPKM normalized expression data for 517 HPV-negative HNSCC cases from TCGA and the LM22 signature matrix were used as the inputs for CIBERSORT, as previously described [
14]. The LM22 signature matrix contains transcriptional signatures for 22 immune cell subtypes, including naïve B cells, memory B cells, plasma cells, CD8 T cells, naïve CD4 T cells, memory resting CD4 T cells, memory activated CD4 T cells, follicular helper T cells, regulatory T cells, γδ T cells, resting natural killer (NK) cells, activated NK cells, monocytes, M0 macrophages, M1 macrophages, M2 macrophages, resting dendritic cells, activated dendritic cells, resting mast cells, activated mast cells, eosinophils, and neutrophils. A CIBERSORT
p-value cutoff of 0.05 was used to filter out cases with an unreliable deconvolution, leaving a final dataset of 317 HPV-negative HNSCC cases. This analysis provided sample-level fractions of 22 individual immune cell subtypes. Average cell type fractions across samples were computed, as well as Pearson correlations between cell types.
Survival analysis. A univariate Cox proportional hazards model was used to assess the impact of 22 immune cell proportions, as computed by CIBERSORT, on disease-free survival (DFS). Disease-free survival was defined as recurrence or death, whichever was first. For this analysis, tumors were categorized as either subtypehigh (top quartile) or subtypelow (bottom three quartiles) for each immune cell subtype. For cell types that were detected in less than 25% of tumors, those tumors in which the cell type was detected (subtype = 1) were compared with those in which it was not detected (subtype = 0). Kaplan−Meier survival curves were generated for DFS when stratified by immune cell subtypes found to be significant in the univariate analysis, as well as other immune cell subtypes of potential prognostic significance. Immune cell subtypes with significant associations in the univariate analysis were included in the multivariable Cox proportional hazards model, along with additional demographic, clinical, pathologic, and genomic features, including age (<60 vs. 60+), smoking status (never/former vs. current), T stage (T1-T2 vs. T3–T4), N stage (N0–N1 vs. N2–N3), subsite (oral cavity, larynx, or HPV-negative oropharynx), and TCGA expression subtype (basal, atypical, classical, or mesenchymal). Parameters with significant missing values, including perineural invasion (PNI) and lymphovascular invasion (LVI), were eliminated from the multivariable analysis due to concern for critical reduction in sample size. To compare systematic differences in clinicopathologic parameters between patients with and without γδ T cells, chi-squared tests were performed. All statistical analyses were performed using R (4.2.0) and a p-value cutoff of 0.05.
Differential gene expression and pathway analysis. Gene expression was used to perform pairwise differential analysis between groups of samples. A Wilcoxon rank-sum test was performed to determine the differential expressed genes (DEG). The p-value was adjusted using the Benjamini−Hochberg procedure, and genes were considered significant with an adjusted p-value < 0.05. For each comparison, we obtained the top DEGs ranked by the highest fold change that were significantly different between the comparison groups (FDR < 0.05). We used ConsensusPathDB-human for gene set over-representation analysis.
3. Results
CIBERSORT deconvolution provides proportions of 22 immune cell subtypes. To infer the proportions of common and rare immune cell subtypes in HNSCC, we utilized the CIBERSORT algorithm and the LM22 immune cell signature matrix [
14] to deconvolve bulk TCGA RNA-seq data for 517 HPV-negative HNSCC tumors (
Figure 1A). Samples without reliable deconvolution (
p > 0.05) were removed, and the final filtered dataset contained 317 tumors, including 226 oral cavity, 70 larynx, and 21 oropharynx HPV-negative tumors. Immune cell proportions varied widely across cell types and tumors. The most common immune cell types were M0 macrophages (25.6 ± 0.9%), M2 macrophages (13.0 ± 0.4%), and memory resting CD4 T cells (10.4 ± 0.4%), while the least common immune cell types were eosinophils (0.1 ± 0.0%), γδ T cells (0.1 ± 0.0%), and naïve CD4 T cells (0.1 ± 0.0%) (
Figure 1B). To ensure that immune cell types were not highly correlated and, instead, represented independent features of the tumor immune microenvironment, we assessed Pearson correlations between each pair of immune cell types and found no significant correlations in our dataset (
Figure 1C), thus supporting the validity of the 22-cell-type classification used.
Univariate analysis reveals rare T cell subtypes associated with survival. To assess the association of immune cell subtypes with DFS, we first performed a univariate analysis using a Cox proportional hazards model (
Supplementary Table S1). In this analysis, only two immune cell subtypes showed a significant association with survival: memory activated CD4 T cells (
p = 0.02) and γδ T cells (
p < 0.01). Kaplan−Meier curves for these two cell types, as well as other cell types of interest within HNSCC, were then generated (
Figure 2). Interestingly, while a higher proportion of infiltrated CD4 memory T cells was associated with better DFS (
Figure 2A), a non-zero fraction of γδ T cells was associated with a poorer DFS (
Figure 2B). By contrast, proportions of CD8 T cells (
Figure 2C), regulatory T cells (
Figure 2D), activated M1 macrophages (
Figure 2E), and activated NK cells (
Figure 2F) had no association with DFS in our cohort.
Multivariable analysis confirms independent association ofγδ T cells with survival. To determine whether immune cell subtypes were independently associated with DFS in our cohort, we then developed a rigorous multivariable Cox proportional hazards model, in which significant cell types from the univariate analysis were included, along with additional demographic, pathologic, and genomic features. Complete case analysis was performed, with a total of 287 patients included. As expected, higher N stage was highly significantly associated with poorer DFS (HR 1.70, 95% CI [1.18–2.45],
p < 0.005,
Table 1). Interestingly, the only other feature that was significantly associated with DFS was the presence of γδ T cells, which was associated with poorer DFS (HR 2.13, 95% CI [1.10–4.13],
p < 0.05,
Table 1). There was no association of age, smoking status, TCGA molecular subtype, anatomic site, T stage, or proportion of memory activated CD4 T cells with survival (
Table 1). We then compared demographic and clinicopathologic features of patients with (n = 14) and without (n = 303) γδ T cells and found no significant associations with gender, smoking status, T stage, N stage, PNI, LVI, anatomic subsite, or TCGA subtype (
Supplementary Table S2), confirming that the presence of γδ T cells is a truly independent negative predictor of survival.
Differential expression and pathway analysis fail to reveal a clear transcriptional signature associated with γδ T cell infiltration. Finally, to explore potential mechanisms behind the poor DFS associated with the presence of γδ T cells, we examined differential gene expression among patients with and without γδ T cells. Interestingly, several genes were significantly upregulated in patients without γδ T cells, but no genes upregulated in patients with γδ T cells (
Supplementary Figure S1A). We then conducted a pathway analysis to determine if there were any coherent patterns in differential gene expression and found increased expression of pathways related to gene expression and transcription in tumors with γδ T cells (
Supplementary Figure S1B and Table S3).
4. Discussion
The advent of immunotherapy as a novel treatment modality has sparked a major interest in understanding the roles of immune cell subtypes in solid tumors. Here, we performed an immune cell deconvolution of TCGA bulk RNA-seq data for 517 untreated HPV-negative HNSCC tumors and uncovered a novel independent association of γδ T cells with poor DFS. Our study provides a rich and comprehensive analysis of the tumor immune microenvironment composition in HNSCC, which should be a valuable resource for future studies. Importantly, unlike prior studies, we combined immune cell deconvolution with clinicopathologic and genomic features in a multivariable analysis. These analyses revealed γδ T cells and N stage as independently associated with survival, uncovering a novel potential role for γδ T cells in HNSCC biology, while affirming the established role of nodal stage as an important adverse pathologic feature in HPV-negative HNSCC. Together, these findings represent a critical observation likely to inform future studies, as well as HNSCC prognostication and therapeutics.
γδ T cells are a subgroup of T lymphocytes with T cell receptors with gamma and delta chains [
15]. γδ T cells are much less common than αβ T cells but recognize antigens in a non-MHC restricted fashion and have a high secretory capacity for cytokines, suggesting potential for antitumor efficacy [
15]. Accordingly, γδ T cells have been proposed to be associated with improved outcomes in many solid tumors, sparking interest in the development of immunotherapies targeting this rare subset [
16]. However, γδ T cells also secrete pro-angiogenic IL-17 and promote proliferation of myeloid derived suppressor cells, and thereby may have potential pro-tumorigenic functions [
15]. Thus, while γδ T cells isolated from peripheral blood are cytotoxic to oral tumor cells in vitro [
17] and γδ T cells, as estimated by TRDC, TRGC1, and TRGC2 expression in bulk RNA-seq data, were associated with improved prognosis in HNSCC [
18], Bas et al. found an increased fraction of peripheral blood γδ T cells in patients with HNSCC relative to healthy controls, as well as higher fractions in patients with recurrent disease [
19]. These conflicting results may be related to difficulty in accurately quantifying γδ T cells by gene expression or flow cytometry due to overlap in expression with NK cells and CD8 T cells [
20]. They also underscore the importance of specifically understanding the prognostic role of γδ T cells in HNSCC, including any potential role in facilitating the progression from dysplasia to carcinoma.
In the present study, we found an independent association of γδ T cells with poor prognosis in HPV-negative HNSCC. Our use of CIBERSORT with the LM22 signature matrix enabled us to distinguish immune cell subtypes with overlap in expression and to accurately quantify rare immune cell subsets. Interestingly, only 14 patients (4.4%) had non-zero fractions of γδ T cells, underscoring the rarity of these cells. It has been estimated that approximately 1–10% of peripheral blood CD3 positive cells are γδ T cells [
16,
19]; in our dataset, γδ T cells comprised < 0.1% of all immune cells and 0.3% of all T cells. This discrepancy may be related to overestimation of γδ T cells when measured by flow cytometry or gene expression, or to differences in fraction between peripheral blood and tumor tissue.
The association of γδ T cells with decreased DFS was maintained in the multivariable analysis, which accounted for numerous other demographic, clinical, pathologic, and genomic factors, indicating that γδ T cells are an independent predictor of survival. Unlike prior studies, our work specifically incorporated these other well-established predictors into our model, thereby more robustly querying the role of γδ T cells in patient outcomes. Importantly, the adverse significance of nodal stage was also maintained in this analysis, further validating the clinical utility of the findings related to γδ T cells. Finally, this relationship was confirmed by the similarities in patient demographics and tumor features when comparing patients with and without γδ T cells.
The present work has several limitations. First, although the use of LM22 to deconvolve immune cell subtypes with overlapping expression is a strength of the present work, it is also a limitation, in that our ability to deconvolve cellular subsets is limited to the 22 cell types present in LM22. Accordingly, we were unable to deconvolve myeloid derived suppressor cells, previously proposed as a mechanism by which γδ T cells may be pro-tumorigenic [
15]. With the lack of consensus in the literature on the significance of γδ T cells, further exploration into potential pro-tumorigenic mechanisms in head and neck cancer is critical. Moreover, 200 patients were lost due to insufficient deconvolution, thus limiting the sample size used in the final analyses. Second, given the rarity of γδ T cells, these cells were only detected in 14 patients in our cohort. As a result, these analyses may thus more accurately be seen as hypothesis-generating, and there is a need to reproduce these results in an independent cohort to support clinical utility. In addition, given the predominance of oral cavity subsite tumors in TCGA, these results warrant validation in larger cohorts of larynx and oropharynx SCC patients to determine whether these results extend to other subsites. Moreover, with the number of clinical and immune cell variables investigated, even in a cohort as large as ours and with the use of a multivariable analysis, the possibility of a false positive result remains, underscoring the need for prospective validation. Third, as treatment paradigms of patients included in TCGA may vary and are not detailed explicitly, cross-institutional and subsite-related differences may drive differences in survival across cohorts. Fourth, as differential expression analysis (
Supplementary Figure S1) revealed differential activation of pathways related to gene expression and transcription in tumors’ γδ T cells, the mechanism by which γδ T cells are related to poor prognosis in HNSCC remains unclear. Finally, given missing data on pathologic factors such as PNI, LVI, ENE, and surgical margin status in TCGA, we were not able to include these factors in the multivariable analysis. Still, our findings underscore the importance of better characterizing the functional role of γδ T cells in HNSCC and understanding whether the presence of γδ T cells may predict the need for alternative therapy or the utility and type of immunotherapy, which are important topics for future investigation.
5. Conclusions
CIBERSORT deconvolution of TCGA HNSCC RNA-seq data using the LM22 matrix facilitated reliable deconvolution of 22 immune cell subtypes in 517 tumors, providing a rich and comprehensive encyclopedia of the HNSCC tumor immune microenvironment. Importantly, multivariable survival analysis indicated that, in addition to nodal status, γδ T cells are predictive of poor DFS, even when accounting for a variety of well-established clinicopathologic and genomic features of known significance, including age, smoking status, expression subtype, anatomic subsite, and T stage. Differential gene expression analysis across tumors with and without γδ T cells revealed upregulation of pathways related to gene expression and transcription in tumors with γδ T cells, but the mechanisms behind this novel survival association remain unclear. Still, our findings underscore the need to better understand the role of this rare immune cell subtype in HNSCC and whether the presence of γδ T cells may predict the need for adjuvant therapy.