Article

Prediction of Conserved HLA Class I and Class II Epitopes from SARS-CoV-2 Licensed Vaccines Supports T-Cell Cross-Protection against SARS-CoV-1

Centro Nacional de Microbiología, Instituto de Salud Carlos III, 28220 Majadahonda, Spain
Biomedicines 2022, 10(7), 1622; https://doi.org/10.3390/biomedicines10071622
Submission received: 5 May 2022 / Revised: 28 June 2022 / Accepted: 5 July 2022 / Published: 7 July 2022
(This article belongs to the Special Issue Emerging Issues in COVID and T Cells)

Abstract

Heterologous immunity-inducing vaccines against different pathogens are necessary to deal with new pandemics. In this study, the possible impact of COVID-19 licensed formulations on the cytotoxic and helper cellular immune responses against SARS-CoV-1 is analyzed for the 567 and 41 most abundant HLA class I and II alleles, respectively. Computational prediction showed that most of these 608 alleles, which cover >90% of the human population, contain enough conserved T-cell epitopes among SARS-CoV-1 and SARS-CoV-2 spike proteins. In addition, the vast majority of these predicted peptides were defined as epitopes recognized by CD4+ or CD8+ T lymphocytes, showing a very high correlation between the bioinformatics prediction and the experimental assays. These data suggest that both cytotoxic and helper cellular immune protection elicited by the currently licensed COVID-19 vaccines should be effective against SARS-CoV-1 infection. Lastly, this study has potential implications for public health against current and future pandemics, given that the SARS-CoV-1 vaccines in the pipeline since the early 21st century could generate similar cross-protection against COVID-19.

1. Introduction

SARS-CoV-2, the etiologic agent of COVID-19, has caused a devastating pandemic resulting in more than 481 million confirmed cases and 6 million deaths worldwide to date (https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports accessed on 1 May 2022). Thus, vaccine prophylaxis against COVID-19 and future pandemics is a key issue in the current highly connected and globalized world. However, although the COVID-19 vaccines were developed in record time, saving the lives of millions of people, could prophylaxis against this pandemic have been developed faster?
Robust activation of the different components of adaptive immunity (neutralizing antibodies, helper CD4+ T lymphocytes, and cytotoxic CD8+ T lymphocytes) is key against SARS-CoV-2 natural infection and for the protective immune response after vaccination [1]. The specific interaction of the CD8+ or CD4+ T lymphocyte receptors with short viral peptides bound to human leukocyte antigen (HLA) class I or II molecules triggers the activity of these T cells, and also initiates, regulates, or suppresses other components of adaptive immune responses [2]. In the absence of correct HLA class I- and II-restricted T-cell recognition, neither cellular nor humoral immune responses are activated efficiently and, thus, the infectious virus could spread through the organism with fatal results for the host. This highly complex set of immune events can be altered, or even suppressed, by single changes in the virus epitope sequences that lead to a complete loss of antigen recognition by either CD4+ or CD8+ T cells, as was previously described for influenza [3], HCV [4], HIV [5], LCMV [6], SIV [5], and even coronaviruses [7] and SARS-CoV-2 [8]. This extremely low tolerance to amino acid changes in antigen recognition can render ineffective the lymphocytes previously activated by the administration of vaccines if new variants of the virus emerge with multiple changes [9]. In this context, it would seem unlikely that a vaccine developed against a certain virus could generate a cross-response against another related virus.
All licensed formulations against SARS-CoV-2 are based on the original D614 spike protein sequence of the Wuhan-1 wild-type strain. The main difference is the type of vaccine platform used: messenger RNA, non-replicating viral vector, inactivated SARS-CoV-2, or protein subunit. The main concerns with all these vaccines are their cost and the specialized infrastructure they need; thus, vaccines adapted to low- and lower-middle-income countries would be necessary. Moreover, SARS-CoV-2 is included in the subgenus Sarbecovirus together with its highly homologous relative SARS-CoV-1. The spike proteins from these related viruses differ in 304 amino acids. These changes are not randomly distributed throughout the entire spike protein sequence and, thus, the possible cross-recognition of cytotoxic and/or helper immune responses by conserved epitopes among sarbecoviruses remains open. In this study, I approach this aspect by focusing on the HLA class I- and II-restricted epitopes conserved among SARS-CoV-1 and SARS-CoV-2 spike proteins. Although a significant loss of HLA class I- and II-restricted epitopes derived from SARS-CoV-2 vaccines was observed, a relevant number of epitopes remained conserved among the sarbecovirus spike proteins, which could allow the currently licensed vaccines to elicit global cytotoxic and helper responses against SARS-CoV-1. Additionally, the SARS-CoV-1 vaccines in the pipeline since the early 21st century could have generated cross-protection against SARS-CoV-2 infection.

2. Materials and Methods

2.1. Selection of Antigenic Proteins

The spike protein of the SARS-CoV-2 reference proteome (Wuhan-1, RefSeq: NC_045512.2) was initially selected. In addition, the modifications of the spike protein added to Moderna mRNA-1273 (ModeRNA Therapeutics, Cambridge, MA, USA), Pfizer BNT162b2 (Pfizer, Inc., New York, NY, USA) and Janssen Ad26.COV2.S (Janssen Pharmaceutica, Beerse, Belgium) vaccines were also included. The spike proteins of the following SARS-CoV-1 proteomes were also selected: Beijing-01, RefSeq: AY278488.2; Beijing-02, RefSeq: AY278487; Beijing-03, RefSeq: AY278490; and Beijing-04, RefSeq: AY279354.

2.2. Selection of HLA Class I and II Alleles

HLA class I alleles that share anchor residues of the 551 alleles included in the twelve HLA class I supertypes (A01, A0103, A0124, A02, A03, A24, B07, B08, B27, B44, B58, and B62) [10] and the 16 most frequent HLA-C alleles were selected. In addition, 41 alleles included in the ten HLA class II supertypes (DR1, DR52, DR53, DP1, DP3, DQ2, DQ4, DQ5, DQ7, and DQ8) were also included in the study [11,12]. These HLA class I and II supertypes cover >90% and >95% of the world population, respectively, regardless of ethnicity [10,11,12].

2.3. HLA Class I Epitope Prediction

Non-redundant HLA class I epitopes of 8–12 residues from the spike protein of SARS-CoV-2 (including the modifications of the spike protein added to the Moderna mRNA-1273, Pfizer BNT162b2, and Janssen Ad26.COV2.S vaccines) were predicted using the latest versions of the universal and neural-network-based NetMHCpan EL and BA algorithms. These bioinformatics tools have been reported to outperform other methods to date [13] and are recommended by the central Immune Epitope Database and Analysis Resource [14]. First, the peptides considered “Strong Binders” (rank ≤ 0.5) by NetMHCpan EL 4.1 [13] for HLA class I ligands were selected. For redundant epitopes, that is, those sharing the same binding core for the same allele, only the one with the highest score was considered per allele. As the NetMHCpan EL 4.1 algorithm was trained only with mass spectrometry elution (EL) data, non-redundant epitopes were further verified through the NetMHCpan BA 4.1 [13] algorithm, which includes binding affinity (BA) data; thus, the combination of both bioinformatics tools yields the most accurate results. These verified non-redundant epitopes did not match any of those predicted for a random sequence with the same length and residue composition as the reference SARS-CoV-2 spike protein, generated with the EXPASY RandSeq tool (https://web.expasy.org/randseq/ accessed on 12 May 2022). Predictions were restricted to alleles that share anchor residues of the 567 HLA-A, -B, and -C alleles previously selected. To further test the specificity and sensitivity of the NetMHCpan EL 4.1 algorithm, all Pro and Arg/Gln residues were substituted by Ala; this yielded no epitopes for the HLA-B*07:02 and HLA-B*27:05 alleles, respectively, as these amino acids are their respective anchor motif residues.
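The selection rule described above (keep “Strong Binders” with rank ≤ 0.5, then retain a single top-scoring peptide per binding core and allele) can be illustrated with a minimal Python sketch. The `Prediction` record, its field names, and the toy peptides are hypothetical and introduced only for illustration; they are not the output format of NetMHCpan nor the scripts used in this study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Prediction:
    """Hypothetical record for one predicted peptide-allele pair."""
    allele: str    # e.g. "HLA-A*02:01"
    peptide: str   # candidate ligand, 8-12 residues
    core: str      # binding core reported by the predictor
    score: float   # eluted-ligand score (higher is better)
    rank: float    # percentile rank (lower is better)


STRONG_BINDER_RANK = 0.5  # threshold stated in the text


def select_non_redundant(predictions):
    """Keep strong binders, then one top-scoring peptide per (allele, core)."""
    best = {}
    for p in predictions:
        if p.rank > STRONG_BINDER_RANK:
            continue  # not a strong binder
        key = (p.allele, p.core)
        if key not in best or p.score > best[key].score:
            best[key] = p
    return sorted(best.values(), key=lambda p: (p.allele, -p.score))


# Toy input (invented peptides, not taken from the spike protein).
example = [
    Prediction("HLA-A*02:01", "AAAAAAAAL", "AAAAAAAAL", 0.90, 0.10),
    Prediction("HLA-A*02:01", "AAAAAAAALK", "AAAAAAAAL", 0.70, 0.30),  # same core
    Prediction("HLA-A*02:01", "GGGGGGGGV", "GGGGGGGGV", 0.20, 1.50),   # weak rank
    Prediction("HLA-B*07:02", "APAAAAAAL", "APAAAAAAL", 0.80, 0.20),
]
selected = select_non_redundant(example)
```

In this toy input, the second peptide is dropped because it shares a binding core with a higher-scoring peptide for the same allele, and the third is dropped because its rank exceeds the strong-binder threshold.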

2.4. HLA Class II Epitope Prediction

Similar to HLA class I, non-redundant HLA class II epitopes of 12–18 residues were predicted using NetMHCIIpan EL 4.0 [13]. Only the “Strong Binders” (rank ≤ 0.5) were considered with this algorithm, and these were later verified through the NetMHCIIpan BA 4.0 [13] algorithm. As for HLA class I, these verified non-redundant HLA class II epitopes did not match any of those predicted for a random sequence with the same length and residue composition as the reference SARS-CoV-2 spike protein, generated with the EXPASY RandSeq tool (https://web.expasy.org/randseq/, accessed on 15 May 2022). Predictions were restricted to alleles that share anchor residues of the 41 HLA-DR, -DP, and -DQ alleles previously selected.

2.5. Other Analyses

A theoretical SARS-CoV-2 spike protein with 304 random changes, the number of changes among spike proteins from SARS-CoV-1 and SARS-CoV-2, was generated. Progressive random position datasets were produced by the selection of numbers between 1 and 1273, the spike protein residue length, with the Perl rand function. Iterative selection was carried out on the remaining non-selected positions after completing 304 positions. The data on experimentally detected epitopes were obtained from the Immune Epitope Database and Analysis Resource (IEDB; http://www.iedb.org/ accessed on 15 May 2022) [14]. A search in IEDB of the predicted epitopes for the most abundant HLA class I and II alleles in the population was carried out. A positive response in an activation and/or cytokine secretion T-cell assay was manually confirmed in the original article describing each epitope. Data on population coverage by HLA class I and II molecules were obtained from IEDB (http://tools.iedb.org/population/; accessed on 15 February 2022) [14].
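The random-change control described above can be sketched as follows. This is a minimal Python illustration, not the Perl code used in the study: the seeded generator, the toy sequence, the helper names, and the rule that each selected residue is replaced by a different, randomly chosen amino acid are all assumptions, since the text specifies only the number of positions (304) and the sequence length (1273).

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
SPIKE_LENGTH = 1273  # spike protein residue length, from the text
N_CHANGES = 304      # changes between the two spike proteins, from the text


def pick_positions(length, n_changes, rng, excluded=frozenset()):
    """Draw n_changes distinct 1-based positions, skipping excluded ones."""
    available = [i for i in range(1, length + 1) if i not in excluded]
    return set(rng.sample(available, n_changes))


def mutate(sequence, positions, rng):
    """Replace each selected residue by a different random amino acid (assumed rule)."""
    residues = list(sequence)
    for pos in positions:
        original = residues[pos - 1]
        residues[pos - 1] = rng.choice([a for a in AMINO_ACIDS if a != original])
    return "".join(residues)


def surviving(epitopes, positions):
    """Keep (start, peptide) epitopes that contain no changed position."""
    return [(start, pep) for start, pep in epitopes
            if not any(start <= pos < start + len(pep) for pos in positions)]


rng = random.Random(0)  # fixed seed so the sketch is reproducible
toy_spike = "".join(rng.choice(AMINO_ACIDS) for _ in range(SPIKE_LENGTH))

# First dataset of 304 positions, then a second one drawn only from
# the positions not selected so far (the iterative selection in the text).
first = pick_positions(SPIKE_LENGTH, N_CHANGES, rng)
second = pick_positions(SPIKE_LENGTH, N_CHANGES, rng, excluded=first)

mutant = mutate(toy_spike, first, rng)
n_diff = sum(a != b for a, b in zip(toy_spike, mutant))
```

An epitope is counted as conserved in this sketch only if none of its residues falls on a changed position, which is the same criterion that applies when comparing the two natural spike sequences.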

3. Results and Discussion

The extraordinary polymorphism of HLA class I and II molecules, with more than 24,000 and 8000 alleles identified to date, respectively, greatly hinders the experimental study of the cellular immune response at the human population level. However, many of the HLA class I and II molecules identified have been grouped first in families, later in superfamilies, and finally in twelve and ten canonical HLA class I and II supertypes, respectively. These supertypes share strong similarities at the peptide-ligand specificity level and include 551 HLA-A and -B class I alleles and another 41 HLA class II alleles. In addition, these twelve and ten HLA class I and II supertypes cover >90% [10] and >95% [11,12] of the world population regardless of ethnicity, respectively. Thus, the use of supertypes significantly reduces data complexity and facilitates the computational analysis of herd immunity.
Therefore, a first bioinformatics prediction of the theoretical epitopes from the SARS-CoV-2 spike protein, the viral protein included in internationally licensed vaccines, was carried out as previously described [15,16]. Each of the 551 HLA class I alleles associated with the twelve canonical HLA-A and -B class I supertypes was analyzed (Figure 1). The predicted ligands for HLA-A class I molecules ranged from nearly 40 epitopes per allele in supertype A24 to fewer than 10 epitopes per allele for several HLA-A supertypes (Figure 1A and Supplemental Table S1). Similarly, a predictive analysis of the impact of the changes described in the SARS-CoV-1 spike protein on the HLA-A class I supertypes was carried out. As the changes are not randomly distributed throughout the spike protein sequences of sarbecoviruses, a significant but not total loss of HLA-A-restricted epitopes derived from SARS-CoV-2 vaccines was detected (Figure 1A and Table 1, Table 2 and Table S1). Strikingly, however, for all HLA-A supertypes, the number of HLA-A-restricted epitopes conserved among the spike proteins from both sarbecoviruses differed significantly from that obtained with 304 random mutations generated over the SARS-CoV-2 spike protein sequence (Figure 1A and Table 1). In addition, up to 147 HLA-A class I alleles from all supertypes, except for A0103, retained more than four predicted epitopes conserved among the spike proteins from both SARS-CoVs (Table 3). For example, the HLA-A*26:02 allele from the A01 supertype could bind nine conserved epitopes among both spike proteins (Supplemental Table S1). Additionally, seven HLA-A alleles (A*29:01, A*29:02, A*29:06, A*29:09, A*29:10, A*29:11, and A*29:12) from the A0124 supertype retained seven unchanged epitopes (Supplemental Table S1).
More strikingly, 31 different HLA-A alleles of the A02 supertype, including HLA-A*02:01, the most prevalent HLA allele in humans, retained more than 10 conserved epitopes on the SARS-CoV-1 spike protein (Table 3 and Table S1). In addition, another five HLA alleles from the A24 supertype retained more than 10 conserved epitopes among the SARS-CoV spike proteins (Table 3 and Table S1).
Similar to the HLA-A locus, a predictive analysis of the impact of the changes described in the SARS-CoV-1 spike protein on the HLA class I molecules from the HLA-B supertypes was also carried out. As for the HLA-A class I alleles, the changes in the spike protein from SARS-CoV-1 generated a significant, but not total, loss of HLA-B-restricted epitopes derived from SARS-CoV-2 vaccines (Figure 1B and Table 1, Table 2 and Table S1). Again, all the HLA-B supertypes, except B07, showed a statistically significant difference in the number of HLA-B-restricted epitopes conserved among the spike proteins from both SARS-CoVs versus 304 random mutations generated over the SARS-CoV-2 spike protein sequence (Figure 1B and Table 1). The lack of statistical difference in the B07 supertype between the conserved SARS-CoV-1 epitopes and random changes was due to the multiple HLA alleles with no or very few conserved epitopes among the spike proteins from SARS-CoVs. However, for 14 alleles included in this supertype, more than four epitopes conserved among SARS-CoV-1 and SARS-CoV-2 spike proteins were predicted (Table 3). For example, the HLA-B*35:21 and -B*35:32 alleles could each bind seven conserved epitopes among both spike proteins, and HLA-B*35:11 another eight (Supplemental Table S1). Moreover, the HLA-B*35:35 and -B*35:41 alleles from the B07 supertype retained 12 unchanged epitopes among both sarbecoviruses (Table 3 and Table S1). Similarly, another 90 HLA-B alleles from the B27, B44, B58, and B62 supertypes retained more than four predicted epitopes conserved among the spike proteins from SARS-CoV-1 and SARS-CoV-2; of these alleles, 9 even exceeded 10 conserved epitopes per HLA-B class I molecule (Table 3). In total, 251 (46%) and 48 (9%) of the HLA-A and -B class I alleles analyzed could bind ≥ 4 or ≥ 10 conserved epitopes among both spike proteins, respectively (Table 3).
Although supertypes have not been defined in HLA-C, 16 alleles from this locus cover >95% of the world population regardless of ethnicity. Thus, similar to the HLA-A and -B loci, a predictive analysis of the impact of the changes described in the SARS-CoV-1 spike protein on the SARS-CoV-2 vaccines for these 16 HLA-C class I molecules was carried out. Again, the changes in the spike protein from SARS-CoV-1 generated a significant, but not total, loss of HLA-C-restricted epitopes derived from SARS-CoV-2 vaccines (Figure 1B and Table 1, Table 2 and Table S2). The number of these epitopes conserved among both SARS-CoVs showed a statistically significant difference versus the 304 random mutations generated over the SARS-CoV-2 spike protein sequence (Figure 1B and Table 1). All HLA-C alleles analyzed retained more than four conserved epitopes among SARS-CoV spike proteins (Table 3 and Table S2). In addition, 3 of these 16 HLA class I molecules (HLA-C*02:02, HLA-C*03:03, and HLA-C*03:04) retained more than 10 conserved epitopes (Table 3 and Table S2).
In summary, 267 (47%) and 51 (9%) of the 567 HLA-A, -B, and -C class I alleles analyzed could bind ≥4 or ≥10 conserved epitopes among both spike proteins, respectively (Table 3).
Additionally, a predictive analysis of the impact of the changes described in the SARS-CoV-1 spike protein sequence on the HLA-DR, -DP, and -DQ class II supertypes was also carried out. As Figure 2 and Table 2 and Table S3 show, the changes described in the SARS-CoV-1 spike protein versus the vaccine sequence generated a significant, but not total, loss of HLA class II-restricted epitopes conserved in both strains for the three DR, two DP, and five DQ supertypes analyzed, similar to the HLA class I-restricted epitopes. The number of these epitopes conserved among both SARS-CoVs showed a statistically significant difference versus the 304 random mutations generated over the SARS-CoV-2 spike protein sequence for all HLA class II supertypes, except for DR53 (Figure 2 and Table 1). In this supertype, the lack of statistical difference between the conserved SARS-CoV-1 epitopes and random changes was due to the large dispersion among HLA-DR53 alleles, some with no or very few conserved epitopes among the spike proteins from SARS-CoVs and others with up to eight conserved epitopes (Figure 2A and Supplemental Table S3). In addition, up to 26 HLA class II alleles from all supertypes, except for DQ2, retained more than four predicted epitopes conserved among the spike proteins from both SARS-CoVs. These alleles were 63% of the total of HLA class II molecules analyzed (Table 3).
The frequencies of the 608 HLA class I and II molecules analyzed in this study range from low prevalence to more than 20% of the human population for some very frequent alleles such as HLA-A*02:01, -A*24:02, or -C*07:02. Thus, to estimate the percentage of the human population who might have a sufficient cellular immune response against SARS-CoV-1 with the currently licensed SARS-CoV-2 vaccines, Table 4 shows the HLA class I alleles with a world population coverage >1% that could bind more than four epitopes conserved in SARS-CoV-1. Seven HLA-A class I alleles, HLA-A*02:01, -A*03:01, -A*11:01, -A*23:01, -A*24:02, -A*29:02, and -A*68:02, cover 80.8% of the human population, regardless of ethnicity (Table 4). Similarly, the other 10 HLA-B and 16 HLA-C alleles with more than four epitopes conserved among SARS-CoV-1 and SARS-CoV-2 cover 40.0% and >95% of the world population, respectively (Table 4). More interestingly, the five HLA class I alleles that could bind more than 10 epitopes conserved in SARS-CoV-1, HLA-A*02:01, -B*15:03, -C*02:02, -C*03:03, and -C*03:04, cover 57% of the human population. Similarly, the HLA class II alleles with a world population coverage > 1% that could bind more than four epitopes conserved in SARS-CoV-1 are indicated in Table 5. Thus, five HLA-DR class II alleles, DRB1*07:01, DRB1*09:01, DRB1*16:02, DRB1*13:02, and DRB4*01:01, cover 49.2% of the human population regardless of ethnicity (Table 5). In the other two HLA class II loci, the most frequent 5 HLA-DP and 14 HLA-DQ alleles cover 94.6% and 86.3% of the world population, respectively (Table 5). In summary, the currently licensed vaccines against SARS-CoV-2 could generate enough conserved epitopes in SARS-CoV-1 to trigger a complete cellular immune response restricted by the most frequent HLA class I and class II alleles expressed by the human population.
Additionally, the HLA genes are closely linked in the genome and, thus, a set of HLA-A, -B, -C, -DR, -DP, and -DQ genes, called an HLA haplotype, is inherited in a Mendelian fashion from each parent. Therefore, the number of conserved epitopes among the SARS-CoV-1 and SARS-CoV-2 spike protein sequences predicted for all HLA class I or II alleles was analyzed by HLA locus. Figure 3 shows an average of five, three, and seven conserved epitopes for the HLA-A, -B, and -C loci, respectively, and another four conserved epitopes for each HLA class II locus. Thus, on average, 15 conserved HLA class I epitopes and another 12 HLA class II epitopes could be associated with each individual HLA haplotype (Figure 3), and the different HLA molecules in a homozygous individual would present these 27 conserved epitopes. However, as fewer than 15% of humans are homozygous for HLA [17], the currently licensed vaccines against SARS-CoV-2 could generate an average of 30 HLA class I epitopes and another 24 HLA class II epitopes conserved in SARS-CoV-1 for more than 85% of the human population. This striking relative abundance of conserved epitopes among sarbecovirus spike proteins arises because, in contrast to random mutation, the viral protein sequence cannot change freely at every position. By comparison, 304 random mutational changes virtually cover the entire protein sequence, destroying almost all epitopes, as indicated in Figure 1 and Figure 2. In contrast, 27 segments that include between 9 and 111 consecutive residues are conserved among sarbecovirus spike proteins, which accumulated 304 changes. Thus, up to 579 amino acids of SARS-CoV-1 can be used by the immune system to generate HLA-restricted epitopes conserved with SARS-CoV-2. In contrast, the spike protein from MERS-CoV, another member of the betacoronavirus genus related to sarbecoviruses, presents only 371 conserved residues with SARS-CoV-2.
Additionally, these residues are in practice randomly distributed throughout the protein sequence; thus, there are no conserved epitopes among MERS-CoV and SARS-CoV-2 for any HLA class I and II allele.
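The contrast drawn above between clustered and scattered conservation can be made concrete by listing the runs of consecutive identical residues in a pairwise alignment. The Python sketch below is only an illustration under stated assumptions: the function name, the use of "-" as the gap character, and the toy sequences are hypothetical, and the default minimum length of 9 simply mirrors the shortest conserved segment reported in the text. The alignment procedure actually used in the study is not described here.

```python
def conserved_segments(aligned_a, aligned_b, min_length=9):
    """Return (start, end, length) of identical runs, as 1-based alignment columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    segments = []
    run_start = None  # 0-based column where the current identical run began
    for i, (a, b) in enumerate(zip(aligned_a, aligned_b)):
        same = a == b and a != "-"  # gap columns never count as conserved
        if same and run_start is None:
            run_start = i
        elif not same and run_start is not None:
            if i - run_start >= min_length:
                segments.append((run_start + 1, i, i - run_start))
            run_start = None
    # Close a run that extends to the end of the alignment.
    if run_start is not None and len(aligned_a) - run_start >= min_length:
        segments.append((run_start + 1, len(aligned_a), len(aligned_a) - run_start))
    return segments


# Toy alignment (invented sequences) differing only at column 11.
seq_a = "MKTAYIAKQRQISFVKSHFSRQ"
seq_b = "MKTAYIAKQRAISFVKSHFSRQ"
segments = conserved_segments(seq_a, seq_b)
conserved_residues = sum(length for _, _, length in segments)
```

When changes are scattered evenly, few runs reach the minimum length and little sequence remains available for conserved epitopes; when changes cluster, long runs survive, which is the situation described for the sarbecovirus spike proteins.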
Finally, in this study, the viral epitopes were computationally predicted and, therefore, experimental confirmation is needed. In this pandemic context, the number of experimentally detected HLA-restricted epitopes from SARS-CoV-2 included in the IEDB database is continuously increasing [14]. Obviously, the most abundant HLA-A and -B alleles in the population are also the most extensively studied. For this reason, a search in the IEDB database of the epitopes conserved among both coronaviruses was carried out for those HLA-A and -B alleles with a world population coverage >5% and with ≥4 predicted epitopes conserved among sarbecoviruses. As shown in Table 6, the vast majority of these predicted epitopes were defined as epitopes recognized by CD8+ T cells. The coincidence percentage between the predicted and experimentally detected epitopes ranged from 71% in HLA-A*11:01 to 100% for HLA-A*02:01 and -A*23:01 (Table 6). Among these five HLA-A alleles, which cover 77.8% of the world population, 89% of the predicted epitopes were functionally detected (Table 6). Additionally, in the four HLA-B alleles analyzed, which cover 28.4% of the world population, 78% of the predicted epitopes are included in the IEDB database as confirmed epitopes (Table 6). Overall, 50 of the 59 predicted epitopes associated with HLA-A and -B class I molecules (83.3%) are defined in the IEDB database as targets recognized by CD8+ T cells (Table 6). Similarly, a search in the IEDB database of the epitopes conserved among both coronaviruses was carried out for those HLA-DR class II alleles with ≥4 predicted epitopes conserved among sarbecoviruses. Only 1 of these 27 predicted epitopes, which was associated with HLA-DRB4*01:01, was not included in the IEDB database as a target of CD4+ T cells (Table 7). In addition, 11 of the 12 predicted epitopes associated with the three most frequent HLA-DP alleles were experimentally detected as targets of CD4+ T cells (Table 7).
Lastly, all predicted epitopes associated with the six most frequent HLA-DQ alleles were included in the IEDB database as targets of CD4+ T cells (Table 7). Overall, 97% of the predicted epitopes associated with these 14 HLA class II alleles were experimentally identified as targets recognized by CD4+ T cells (Table 7). These results indicate a very high correlation between the bioinformatics prediction and the experimental assays, at least for conserved epitopes among coronavirus spike proteins, and validate the methodological approach. Importantly, those predicted epitopes that are not currently included in the IEDB database may not have been tested and could be detected as epitopes recognized by T cells in future studies.
More importantly, the current study has potential implications for public health against current and future pandemics. In addition to the humoral response, if many epitopes associated with multiple and frequent HLA class I and II molecules are conserved among sarbecoviruses, the currently licensed vaccines against COVID-19 should be effective against SARS-CoV-1 infection. Similarly, SARS-CoV-1 vaccines could generate cross-protection against SARS-CoV-2 infection. In addition, for years, it has been known that different SARS-CoV-1 spike-protein-based vaccines elicit potent immune responses and protective effects in preclinical models [18,19,20], and even in phase I human studies ([21] and Clinicaltrial.gov: NCT00533741 and NCT01376765). However, the first COVID-19 vaccine, the “Pfizer-BioNTech COVID-19 Vaccine”, received full FDA approval only in August 2021, after its emergency use authorization in December 2020. Thus, if the SARS-CoV-1 vaccines in the pipeline had been moved into more advanced phases of clinical trials while COVID-19-specific vaccines were beginning to be developed, then perhaps they could have been available to prevent part of the hundreds of thousands of deaths caused by COVID-19 in 2020. In this context, and in line with the data presented above, a very recent study using a mouse model has shown that SARS-CoV-1 vaccination induces cross-reactive antibodies and T cells against SARS-CoV-2 and protects against a SARS-CoV-2 challenge [22]. Thus, the use of bioinformatics analyses such as the one developed in the present study, together with a similar exploration of cross-reactive humoral responses, could be a useful rapid response strategy to face future pandemics with the vaccine tools available at that time.
Finally, a computational analysis such as the one carried out in the present study can be extended to analyze the influence of virus molecular evolution on the cellular immune response. Therefore, HLA-restricted epitopes of the different virus variants emerging over time in different countries can be analyzed. These studies are very relevant because the effect of emerging virus variants on vaccine efficacy is of critical importance, and assessing the potential impact of mutations that could facilitate escape from the cellular immune response would make it possible to evaluate suitable or optimized vaccine candidates. These studies have recently been carried out in our laboratory with all the relevant SARS-CoV-2 strains up to the Omicron variant of concern, showing that most of the HLA class I and II alleles, which cover >90% of the population, contain enough HLA-restricted epitopes without escape mutations [15,16]. These data previously published by our laboratory indicated that the cellular immune protection elicited by the currently licensed vaccines was not affected by emerging SARS-CoV-2 variants [15,16].

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/biomedicines10071622/s1, Table S1: Predicted ligands for HLA-A and -B class I molecules; Table S2: Predicted ligands for HLA-C class I molecules; Table S3: Predicted ligands for HLA class II molecules.

Funding

This work was supported by the Spanish Ministry of Science and “Acción Estratégica en Salud” MPY 388/18.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data are included in Supplementary Tables S1–S3.

Conflicts of Interest

The author declares no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Sahin, U.; Muik, A.; Derhovanessian, E.; Vogler, I.; Kranz, L.M.; Vormehr, M.; Baum, A.; Pascal, K.; Quandt, J.; Maurus, D.; et al. COVID-19 vaccine BNT162b1 elicits human antibody and TH1 T cell responses. Nature 2020, 586, 594–599.
  2. Shastri, N.; Schwab, S.; Serwold, T. Producing nature’s gene-chips: The generation of peptides for display by MHC class I molecules. Annu. Rev. Immunol. 2002, 20, 463–493.
  3. Valkenburg, S.A.; Quiñones-Parra, S.; Gras, S.; Komadina, N.; McVernon, J.; Wang, Z.; Halim, H.; Iannello, P.; Cole, C.; Laurie, K.; et al. Acute emergence and reversion of influenza A virus quasispecies within CD8+ T cell antigenic peptides. Nat. Commun. 2013, 4, 2663.
  4. Bowen, D.G.; Walker, C.M. Mutational escape from CD8+ T cell immunity: HCV evolution, from chimpanzees to man. J. Exp. Med. 2005, 201, 1709–1714.
  5. Goulder, P.J.R.; Watkins, D.I. HIV and SIV CTL escape: Implications for vaccine design. Nat. Rev. Immunol. 2004, 4, 630–640.
  6. Achour, A.; Michaelsson, J.; Harris, R.A.; Odeberg, J.; Grufman, P.; Sandberg, J.K.; Levitsky, V.; Karre, K.; Sandalova, T.; Schneider, G. A structural basis for LCMV immune evasion: Subversion of H-2D(b) and H-2K(b) presentation of gp33 revealed by comparative crystal structure analyses. Immunity 2002, 17, 757–768.
  7. Butler, N.S.; Theodossis, A.; Webb, A.I.; Dunstone, M.A.; Nastovska, R.; Ramarathinam, S.; Rossjohn, J.; Purcell, A.; Perlman, S. Structural and biological basis of CTL escape in coronavirus-infected mice. J. Immunol. 2008, 180, 3926–3937.
  8. de Silva, T.I.; Liu, G.; Lindsey, B.B.; Dong, D.; Moore, S.C.; Hsu, N.S.; Shah, D.; Wellington, D.; Mentzer, A.J.; Angyal, A.; et al. The impact of viral mutations on recognition by SARS-CoV-2 specific T cells. iScience 2021, 24, 103353.
  9. Borrow, P.; Shaw, G.M. Cytotoxic T-lymphocyte escape viral variants: How important are they in viral evasion of immune clearance in vivo? Immunol. Rev. 1998, 164, 37–51.
  10. Sidney, J.; Peters, B.; Frahm, N.; Brander, C.; Sette, A. HLA class I supertypes: A revised and updated classification. BMC Immunol. 2008, 9, 1.
  11. Greenbaum, J.; Sidney, J.; Chung, J.; Brander, C.; Peters, B.; Sette, A. Functional classification of class II human leukocyte antigen (HLA) molecules reveals seven different supertypes and a surprising degree of repertoire sharing across supertypes. Immunogenetics 2011, 63, 325–335.
  12. Shen, W.J.; Zhang, X.; Zhang, S.; Liu, C.; Cui, W. The Utility of Supertype Clustering in Prediction for Class II MHC-Peptide Binding. Molecules 2018, 23, 3034.
  13. Reynisson, B.; Alvarez, B.; Paul, S.; Peters, B.; Nielsen, M. NetMHCpan-4.1 and NetMHCIIpan-4.0: Improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data. Nucleic Acids Res. 2020, 48, W449–W454.
  14. Vita, R.; Mahajan, S.; Overton, J.A.; Dhanda, S.K.; Martini, S.; Cantrell, J.R.; Wheeler, D.K.; Sette, A.; Peters, B. The Immune Epitope Database (IEDB): 2018 update. Nucleic Acids Res. 2019, 47, D339–D343.
  15. Martin-Galiano, A.J.; Diez-Fuertes, F.; McConnell, M.J.; Lopez, D. Predicted Epitope Abundance Supports Vaccine-Induced Cytotoxic Protection Against SARS-CoV-2 Variants of Concern. Front. Immunol. 2021, 12, 732693.
  16. Lopez, D. Predicted HLA Class I and Class II Epitopes From Licensed Vaccines Are Largely Conserved in New SARS-CoV-2 Omicron Variant of Concern. Front. Immunol. 2022, 13, 832889.
  17. Maiers, M.; Gragert, L.; Klitz, W. High-resolution HLA alleles and haplotypes in the United States population. Hum. Immunol. 2007, 68, 779–788.
  18. Bisht, H.; Roberts, A.; Vogel, L.; Bukreyev, A.; Collins, P.L.; Murphy, B.R.; Subbarao, K.; Moss, B. Severe acute respiratory syndrome coronavirus spike protein expressed by attenuated vaccinia virus protectively immunizes mice. Proc. Natl. Acad. Sci. USA 2004, 101, 6641–6646.
  19. He, Y.; Li, J.; Heck, S.; Lustigman, S.; Jiang, S. Antigenic and immunogenic characterization of recombinant baculovirus-expressed severe acute respiratory syndrome coronavirus spike protein: Implication for vaccine design. J. Virol. 2006, 80, 5757–5767.
  20. Li, J.; Ulitzky, L.; Silberstein, E.; Taylor, D.R.; Viscidi, R. Immunogenicity and protection efficacy of monomeric and trimeric recombinant SARS coronavirus spike protein subunit vaccine candidates. Viral Immunol. 2013, 26, 126–132.
  21. Martin, J.E.; Louder, M.K.; Holman, L.A.; Gordon, I.J.; Enama, M.E.; Larkin, B.D.; Andrews, C.A.; Vogel, L.; Koup, R.A.; Roederer, M.; et al. A SARS DNA vaccine induces neutralizing antibody and cellular immune responses in healthy adults in a Phase I clinical trial. Vaccine 2008, 26, 6338–6343.
  22. Dangi, T.; Palacio, N.; Sanchez, S.; Park, M.; Class, J.; Visvabharathy, L.; Ciucci, T.; Koralnik, I.J.; Richner, J.M.; Penaloza-MacMaster, P. Cross-protective immunity following coronavirus vaccination and coronavirus infection. J. Clin. Investig. 2021, 131, e151969. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Average number of epitopes in the SARS-CoV-2 spike protein sequence predicted for HLA class I alleles, grouped into the 12 HLA class I supertypes and the 16 most frequent HLA-C alleles, and their conservation in SARS-CoV-1. The median value is indicated. Box limits indicate the interquartile range. Whiskers extend to the maximal and minimal values. Each panel shows the number of epitopes predicted in the SARS-CoV-2 spike protein (black), those conserved in SARS-CoV-1 (blue), and those obtained with 304 random mutations generated over the SARS-CoV-2 spike protein sequence (red). Panel (A): the 6 HLA-A supertypes. Panel (B): the 6 HLA-B supertypes and the 16 most frequent HLA-C alleles.
Figure 2. Average number of epitopes in the SARS-CoV-2 spike protein sequence predicted for HLA class II alleles, grouped into the 10 HLA class II supertypes, and their conservation in SARS-CoV-1. The median value is indicated. Box limits indicate the interquartile range. Whiskers extend to the maximal and minimal values. Each panel shows the number of epitopes predicted in the SARS-CoV-2 spike protein (black), those conserved in SARS-CoV-1 (blue), and those obtained with 304 random mutations generated over the SARS-CoV-2 spike protein sequence (red). Panel (A): the 3 HLA-DR and 2 HLA-DP supertypes. Panel (B): the 5 HLA-DQ supertypes.
Figure 3. Average number of conserved epitopes among sarbecovirus spike protein sequences predicted for each HLA class I and II locus. The median value is indicated. Box limits indicate the interquartile range. Whiskers are adjusted to maximal and minimal values.
Table 1. Statistical significance of the number of predicted HLA class I and class II epitopes.
Supertype | SARS1/SARS2 p-Value | SARS1/304 Random Mutation p-Value
A01 | <0.0001 | 0.0026
A0103 | 0.024 | 0.025
A0124 | <0.0001 | 0.0005
A02 | <0.0001 | <0.0001
A03 | <0.0001 | 0.0064
A24 | <0.0001 | <0.0001
B07 | <0.0001 | n.s. a
B08 | 0.002 | 0.04
B27 | <0.0001 | 0.01
B44 | <0.0001 | <0.0001
B58 | <0.0001 | 0.007
B62 | <0.0001 | 0.002
HLA-C | <0.0001 | <0.0001
DR1 | <0.0001 | 0.0003
DR52 | 0.0044 | 0.0082
DR53 | n.s. | n.s.
DP1 | <0.0001 | 0.02
DP3 | 0.04 | 0.04
DQ2 | 0.01 | 0.01
DQ4 | 0.0011 | 0.0001
DQ5 | 0.022 | 0.048
DQ7 | <0.0001 | <0.0001
DQ8 | 0.0034 | 0.01
a n.s.: not significant.
Table 2. Percentage of predicted HLA class I and class II epitopes conserved among sarbecoviruses’ spike proteins.
Supertype | % of Conserved Epitopes
A01 | 19
A0103 | 16
A0124 | 24
A02 | 32
A03 | 17
A24 | 19
B07 | 19
B08 | 30
B27 | 33
B44 | 44
B58 | 19
B62 | 20
HLA-C | 25
DR1 | 24
DR52 | 29
DR53 | 40
DP1 | 20
DP3 | 35
DQ2 | 22
DQ4 | 38
DQ5 | 32
DQ7 | 40
DQ8 | 37
Table 3. Number of HLA alleles with more than 4 or 10 predicted epitopes conserved among sarbecoviruses’ spike proteins.
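HLA Superfamily | HLA Alleles with ≥4 Epitopes Conserved | HLA Alleles with ≥10 Epitopes Conserved
A01 | 12 | 0
A0103 | 0 | 0
A0124 | 7 | 0
A02 | 53 | 31
A03 | 44 | 0
A24 | 31 | 5
B07 | 14 | 2
B08 | 0 | 0
B27 | 22 | 7
B44 | 44 | 1
B58 | 7 | 1
B62 | 17 | 0
HLA-C | 16 | 3
No. of HLA class I alleles | 267 (47%) | 51
DR1 | 3 | 0
DR52 | 2 | 0
DR53 | 3 | 0
DP1 | 4 | 0
DP3 | 2 | 0
DQ2 | 0 | 0
DQ4 | 3 | 0
DQ5 | 2 | 0
DQ7 | 5 | 0
DQ8 | 2 | 0
No. of HLA class II alleles | 26 (63%) | 0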
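The percentages in the two totals rows of Table 3 appear to be the allele counts divided by the numbers of alleles analyzed that are given in the abstract (567 HLA class I and 41 HLA class II alleles). A minimal Python sketch of that arithmetic follows; the function name `percent_of_alleles` is hypothetical and is used only for illustration.

```python
def percent_of_alleles(n_with_epitopes: int, n_total: int) -> int:
    """Share of the analyzed alleles, rounded to the nearest whole percent."""
    return round(100 * n_with_epitopes / n_total)


# Alleles with >=4 conserved epitopes (Table 3) over alleles analyzed (abstract).
class_i = percent_of_alleles(267, 567)
class_ii = percent_of_alleles(26, 41)
print(class_i, class_ii)  # 47 63
```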
Table 4. Predicted epitopes conserved in SARS-CoV-1 and % population coverage in the most frequent HLA class I alleles.
HLA Class I Allele | Supertype | Epitopes Conserved in SARS-CoV-1 | % Population Coverage a
A*29:02 | A0124 | 7 | 3.9
A*02:01 | A02 | 10 | 39.1
A*68:02 | A02 | 7 | 2.5
A*03:01 | A03 | 6 | 16.8
A*11:01 | A03 | 7 | 15.5
A*23:01 | A24 | 6 | 5.4
A*24:02 | A24 | 7 | 21.4
7 HLA-A alleles |  |  | 80.8
B*35:01 | B07 | 6 | 8.4
B*15:03 | B27 | 11 | 1.3
B*27:05 | B27 | 8 | 4.8
B*39:02 | B27 | 7 | 1.0
B*40:01 | B44 | 9 | 7.8
B*40:02 | B44 | 8 | 3.5
B*40:06 | B44 | 4 | 1.1
B*44:02 | B44 | 4 | 7.6
B*44:03 | B44 | 4 | 6.7
B*45:01 | B44 | 5 | 1.3
10 HLA-B alleles |  |  | 40.0
17 HLA-A and -B alleles |  |  | 86.1
C*01:02 |  | 4 | 10.5
C*02:02 |  | 11 | 9.5
C*03:03 |  | 12 | 8.1
C*03:04 |  | 12 | 12.8
C*04:01 |  | 8 | 20.0
C*05:01 |  | 5 | 7.9
C*06:02 |  | 6 | 15.5
C*07:01 |  | 5 | 19.4
C*07:02 |  | 6 | 21.5
C*08:01 |  | 6 | 4.6
C*08:02 |  | 6 | 4.2
C*12:03 |  | 6 | 10.3
C*14:02 |  | 6 | 3.0
C*15:02 |  | 8 | 4.4
C*16:01 |  | 5 | 4.7
C*17:01 |  | 8 | 3.3
16 HLA-C alleles |  |  | >95
33 HLA class I alleles |  |  | >95
a Only HLA class I molecules with a world population coverage > 1% were included.
Table 5. Predicted epitopes conserved in SARS-CoV-1 and % population coverage in the most frequent HLA class II alleles.
HLA Class II Allele | Supertype | Epitopes Conserved in SARS-CoV-1 | % Population Coverage a
DRB1*07:01 | DR1 | 4 | 18.2
DRB1*09:01 | DR1 | 5 | 6.4
DRB1*16:02 | DR1 | 4 | 2.0
DRB1*13:02 | DR52 | 6 | 6.7
DRB4*01:01 | DR53 | 8 | 41.8
5 HLA-DR alleles |  |  | 49.2
HLA-DPA1*01:03-DPB1*02:01 | DP1 | 4 | 76.4
HLA-DPA1*01:03-DPB1*04:01 | DP1 | 4 | 79.2
HLA-DPA1*01:03-DPB1*06:01 | DP1 | 4 | 70.2
HLA-DPA1*03:01-DPB1*04:02 | DP1 | 4 | 27.5
HLA-DPA1*02:01-DPB1*14:01 | DP3 | 6 | 32.8
5 HLA-DP alleles |  |  | 94.6
HLA-DQA1*02:01-DQB1*04:02 | DQ4 | 6 | 24.7
HLA-DQA1*03:03-DQB1*04:02 | DQ4 | 6 | 16.0
HLA-DQA1*05:01-DQB1*04:02 | DQ4 | 4 | 41.0
HLA-DQA1*06:01-DQB1*04:02 | DQ4 | 5 | 13.2
HLA-DQA1*01:02-DQB1*05:02 | DQ5 | 4 | 30.8
HLA-DQA1*01:04-DQB1*05:03 | DQ5 | 5 | 13.8
HLA-DQA1*01:03-DQB1*06:03 | DQ7 | 4 | 19.1
HLA-DQA1*02:01-DQB1*03:01 | DQ7 | 5 | 44.4
HLA-DQA1*02:01-DQB1*03:03 | DQ7 | 5 | 25.3
HLA-DQA1*05:01-DQB1*03:01 | DQ7 | 5 | 41.5
HLA-DQA1*05:01-DQB1*03:02 | DQ7 | 5 | 46.8
HLA-DQA1*05:01-DQB1*03:03 | DQ7 | 5 | 41.5
HLA-DQA1*03:01-DQB1*03:02 | DQ8 | 5 | 40.2
HLA-DQA1*04:01-DQB1*04:02 | DQ8 | 5 | 17.6
14 HLA-DQ alleles |  |  | 86.3
24 HLA class II alleles |  |  | >95
a Only HLA class II molecules with a world population coverage > 1% were included.
Table 6. Predicted and experimentally detected HLA class I epitopes conserved among sarbecoviruses.
HLA Class I Allele | Predicted Conserved Epitopes a | Experimentally Confirmed Conserved Epitopes b | % Experimental versus Predicted | % Population Coverage c
A*02:01 | 10 | 10 | 100 | 39.1
A*03:01 | 6 | 5 | 83 | 16.8
A*11:01 | 7 | 5 | 71 | 15.5
A*23:01 | 6 | 6 | 100 | 5.4
A*24:02 | 7 | 6 | 86 | 21.4
5 HLA-A alleles | 36 | 32 | 89 | 77.8
B*35:01 | 6 | 5 | 83 | 8.4
B*40:01 | 9 | 7 | 78 | 7.8
B*44:02 | 4 | 3 | 75 | 7.6
B*44:03 | 4 | 3 | 75 | 6.7
4 HLA-B alleles | 23 | 18 | 78 | 28.4
9 HLA-A and -B alleles | 59 | 50 | 85 | 83.3
a From this study (Table 4). b Positive for activation and/or cytokine secretion T-cell assays obtained from the IEDB database. c Only HLA-A and -B class I molecules with a world population coverage >5% were included.
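The "% Experimental versus Predicted" column of Table 6 can be recomputed from the two count columns. The sketch below assumes the percentage is the confirmed count divided by the predicted count, rounded to a whole number, and that each subtotal row pools the counts of its alleles; the names `TABLE6`, `confirmed_percent`, and `pooled` are hypothetical and used only for illustration.

```python
# (predicted, experimentally confirmed) epitope counts copied from Table 6.
TABLE6 = {
    "A*02:01": (10, 10),
    "A*03:01": (6, 5),
    "A*11:01": (7, 5),
    "A*23:01": (6, 6),
    "A*24:02": (7, 6),
    "B*35:01": (6, 5),
    "B*40:01": (9, 7),
    "B*44:02": (4, 3),
    "B*44:03": (4, 3),
}


def confirmed_percent(predicted: int, confirmed: int) -> int:
    """Confirmed epitopes as a whole-number percentage of predicted ones."""
    return round(100 * confirmed / predicted)


def pooled(prefix: str = "") -> tuple[int, int, int]:
    """Sum the counts over alleles whose name starts with `prefix`."""
    rows = [counts for allele, counts in TABLE6.items() if allele.startswith(prefix)]
    predicted = sum(p for p, _ in rows)
    confirmed = sum(c for _, c in rows)
    return predicted, confirmed, confirmed_percent(predicted, confirmed)


print(pooled("A*"))  # (36, 32, 89)
print(pooled("B*"))  # (23, 18, 78)
print(pooled())      # (59, 50, 85)
```

Under these assumptions the pooled values match the three subtotal rows of Table 6.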
Table 7. Predicted and experimentally detected HLA class II epitopes conserved among sarbecoviruses.
HLA Class II Allele | Predicted Conserved Epitopes a | Experimentally Confirmed Conserved Epitopes b | % Experimental versus Predicted | % Population Coverage c
DRB1*07:01 | 4 | 4 | 100 | 18.2
DRB1*09:01 | 5 | 5 | 100 | 6.4
DRB1*16:02 | 4 | 4 | 100 | 2.0
DRB1*13:02 | 6 | 6 | 100 | 6.7
DRB4*01:01 | 8 | 7 | 88 | 41.8
5 HLA-DR alleles | 27 | 26 | 96 | 49.2
HLA-DPA1*01:03-DPB1*02:01 | 4 | 3 | 75 | 76.4
HLA-DPA1*01:03-DPB1*04:01 | 4 | 4 | 100 | 79.2
HLA-DPA1*01:03-DPB1*06:01 | 4 | 4 | 100 | 70.2
3 HLA-DP alleles | 12 | 11 | 92 | 89.8
HLA-DQA1*05:01-DQB1*04:02 | 4 | 4 | 100 | 41.0
HLA-DQA1*02:01-DQB1*03:01 | 5 | 5 | 100 | 44.4
HLA-DQA1*05:01-DQB1*03:01 | 5 | 5 | 100 | 41.5
HLA-DQA1*05:01-DQB1*03:02 | 5 | 5 | 100 | 46.8
HLA-DQA1*05:01-DQB1*03:03 | 5 | 5 | 100 | 41.5
HLA-DQA1*03:01-DQB1*03:02 | 5 | 5 | 100 | 40.2
6 HLA-DQ alleles | 29 | 29 | 100 | 87.3
14 HLA class II alleles | 68 | 66 | 97 | >95
a From this study (Table 5). b Positive for activation and/or cytokine secretion T-cell assays obtained from the IEDB database. c All HLA-DR alleles with ≥4 predicted epitopes conserved among sarbecoviruses and the HLA-DP and -DQ class II molecules with a world population coverage >40% were included.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

López, D. Prediction of Conserved HLA Class I and Class II Epitopes from SARS-CoV-2 Licensed Vaccines Supports T-Cell Cross-Protection against SARS-CoV-1. Biomedicines 2022, 10, 1622. https://doi.org/10.3390/biomedicines10071622