Next Article in Journal
Estimating the Impact of Influenza Vaccination on Acute and ICU Hospital Bed Usage in an Influenza Season under Endemic COVID-19 in the US
Next Article in Special Issue
Immunoinformatics Approach to Design a Multi-Epitope Nanovaccine against Leishmania Parasite: Elicitation of Cellular Immune Responses
Previous Article in Journal
Chitosan-Based Nanomaterial as Immune Adjuvant and Delivery Carrier for Vaccines
Previous Article in Special Issue
IntegralVac: A Machine Learning-Based Comprehensive Multivalent Epitope Vaccine Design Method
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Immunopeptidomic Analysis of BoLA-I and BoLA-DR Presented Peptides from Theileria parva Infected Cells

1
The Roslin Institute, The Royal (Dick) School of Veterinary Science, The University of Edinburgh, Edinburgh EH25 9RG, UK
2
The Jenner Institute, Nuffield Department of Medicine, The University of Oxford, Oxford OX3 7BN, UK
3
Ribeirão Preto College of Nursing, University of São Paulo, Av Bandeirantes, Ribeirão Preto 3900, Brazil
4
Department of Health Technology, Technical University of Denmark, DK-2800 Copenhagen, Denmark
5
Laboratory of Experimental Immunology, Department of Immunology and Microbiology, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2200 Copenhagen, Denmark
6
Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, San Martín CP1650, Argentina
*
Author to whom correspondence should be addressed.
Vaccines 2022, 10(11), 1907; https://doi.org/10.3390/vaccines10111907
Submission received: 2 October 2022 / Revised: 3 November 2022 / Accepted: 4 November 2022 / Published: 11 November 2022
(This article belongs to the Special Issue Reverse Vaccinology and Vaccine Antigens)

Abstract

:
The apicomplexan parasite Theileria parva is the causative agent of East Coast fever, usually a fatal disease for cattle, which is prevalent in large areas of eastern, central, and southern Africa. Protective immunity against T. parva is mediated by CD8+ T cells, with CD4+ T-cells thought to be important in facilitating the full maturation and development of the CD8+ T-cell response. T. parva has a large proteome, with >4000 protein-coding genes, making T-cell antigen identification using conventional screening approaches laborious and expensive. To date, only a limited number of T-cell antigens have been described. Novel approaches for identifying candidate antigens for T. parva are required to replace and/or complement those currently employed. In this study, we report on the use of immunopeptidomics to study the repertoire of T. parva peptides presented by both BoLA-I and BoLA-DR molecules on infected cells. The study reports on peptides identified from the analysis of 13 BoLA-I and 6 BoLA-DR datasets covering a range of different BoLA genotypes. This represents the most comprehensive immunopeptidomic dataset available for any eukaryotic pathogen to date. Examination of the immunopeptidome data suggested the presence of a large number of coprecipitated and non-MHC-binding peptides. As part of the work, a pipeline to curate the datasets to remove these peptides was developed and used to generate a final list of 74 BoLA-I and 15 BoLA-DR-presented peptides. Together, the data demonstrated the utility of immunopeptidomics as a method to identify novel T-cell antigens for T. parva and the importance of careful curation and the application of high-quality immunoinformatics to parse the data generated.

1. Introduction

A major challenge to the development of novel vaccines for complex intracellular pathogens is the identification of relevant T-cell antigens. One example of this is Theileria parva, the causative agent of East Coast fever (ECF), a highly pathogenic disease for cattle that is prevalent in large areas of eastern, central, and southern Africa. ECF is estimated to kill ~1 million cattle a year and inflict an annual economic cost of up to 600 million USD [1]; as a major proportion of this burden is borne by smallholder farmers, ECF poses a major threat to the livelihoods and food security of some of the poorest communities in the world. T. parva is transmitted by the brown-eared tick (Rhipicephalus appendiculatus), which deposits sporozoites into the skin of the cattle host while taking a blood meal. The sporozoites rapidly invade host lymphocytes and, once within cells, transition to a schizont form. Schizont-infected cells undergo transformation and are induced into a phase of sustained proliferation, during which infection with the parasite is maintained in the ‘daughter’ cells; dissemination of these cells and the host’s inflammatory response ultimately lead to the clinical symptoms of the disease, and frequently the death of the infected animal.
Currently, ECF control options are limited to the intensive use of acaricides, a single theilericidal chemotherapeutic agent, and an ‘infection-and-treatment’ (ITM) form of immunisation, which are used either alone or in combination. However, none of these options is optimal or sustainable in the longer term due to, among other reasons, increasing acaricide resistance, the expense and poor efficacy of the theilericidal agent, and the logistical issues of deploying a live vaccine, which requires an extensive liquid nitrogen cold chain, in rural Africa. Thus, alternative forms of control are actively being sought. Predominant among these is the development of novel subunit vaccines. Studies on the immune responses generated by ITM immunisation have demonstrated that protection is associated with major histocompatibility complex class I (MHCI)-restricted CD8+ T-cell responses [2,3,4,5]. Thus, CD8+ T-cell antigens are a major target for the development of efficacious subunit vaccines. The role of CD4+ T-cells in promoting the induction and maintenance of functionally mature CD8+ T-cells suggests that antigens presented by BoLA class II molecules (BoLA-II) will also be required for an effective CD8+ T-cell-targeting vaccine [6,7]. Selection of CD8+/CD4+ T-cell antigens for inclusion in a vaccine will require consideration of both the diversity of the MHCI and the MHCII genotypes in the target cattle populations and the antigenic variability between different T. parva strains [8,9,10,11]. Consequently, it is likely that the repertoire of antigens needed for an effective subunit vaccine will be complex and will need to be minimised by careful selection, if the production of the vaccine is to be practicable.
Intensive efforts applying conventional antigen-screening approaches (using T. parva cDNA and/or peptide libraries) to identify candidate antigens recognised by T. parva-specific T-cells from immune animals have resulted in the identification of a limited number of CD8+ T-cell antigens (Tp1-11 and Tp32-35) from the Muguga genome-reference strain of T. parva [12,13]. Recent work has also identified 26 antigens containing BoLA-DR and/or BoLA-DQ epitopes recognised by T. parva Muguga-specific CD4+ T-cells [13]. However, for many cattle MHC genotypes, there are still no identified T. parva antigens, and due to the diversity of both the bovine MHC and the parasite variants present ‘in the field’; these epitopes would be anticipated to only provide protection to a proportion of the cattle populations and only against a subset of T. parva strains. The expression of >4000 proteins by T. parva schizonts [14] places severe logistical limitations on the capacity for such approaches to screen the full T. parva proteome across multiple strains and MHC haplotypes that would be required to generate a comprehensive list of antigens. Such efforts are further impeded by the marked immunodominance that is characteristic of the CD8+ T-cell responses against T. parva [15], which narrowly restricts the number of epitopes that elicit detectable T-cell responses in individual animals following natural infection or ITM immunisation. Many of the immunodominant epitopes exhibit high levels of strain polymorphism, limiting their potential utility as future candidate antigens [11]. As such, there is a need to complement ongoing antigen identification efforts with alternative strategies.
Recent advances in nanoflow ultra-high performance liquid chromatography coupled to mass-spectrometry (LC-MS) used in peptidomics have significantly increased the sensitivity, dynamic range, and mass accuracy of the technique, such that the analysis of peptides is orders of magnitude superior to that possible a decade ago. Application of this technique in peptide-MHC (pMHC) elution studies has greatly enhanced our understanding of the repertoire of MHC-bound peptides (‘immunopeptidome’) under steady-state conditions and following neoplastic transformation (for just a few recent examples, see references [16,17,18]). Sequencing of MHC-eluted peptides by LC-MS has also been used to examine the pathogen-derived peptides presented by MHC molecules for a number of infectious agents, including the Mumps virus, HIV, Toxoplasma, Plasmodium, Mycobacterium, and Leishmania [19,20,21,22,23,24,25]. Data from these studies have identified a number of proteins that could be used to generate immune responses when administered as a component of an experimental vaccine.
Herein, we report on an immunopeptidomic analysis of T. parva-derived peptides presented by infected cells and demonstrate how these mass-spectrometry data, combined with immuno-informatics, can be used as an alternative approach to formulate a list of new T. parva candidate antigens. To the authors’ knowledge, this is the first time immunopeptidomics has been used to study a bovine pathogen, and the first time the immunopeptidome of a protozoan pathogen has been studied in the context of multiple MHCI and MHCII haplotypes. We identified a total of 74 and 15 unique BoLA-I and BoLA-DR-restricted peptides—demonstrating the capacity to use this approach to rapidly characterise the immunopeptidome of T. parva-infected cells and so identify novel candidate CD8+ and CD4+ T-cell antigens.

2. Materials and Methods

2.1. BoLA-Defined Cell Lines

Cell lines were established from animals for which BoLA-I and BoLA-DR genotypes had been determined by a combination of conventional Sanger-based sequencing of amplicons generated using MHCI-allele-specific PCR [26], Sanger-based typing of BoLA-DR [27], and MiSeq-based BoLA-I and BoLA-DR sequencing [28]. Theileria parva-infected B- and T-cell lines were generated and characterised as part of previous studies and maintained using well-established protocols [29]. The cell lines had all been generated from Holstein-Friesian animals from the University of Edinburgh Herd and had been used under a license granted under the UK Animal (Scientific Procedures) Act 1986.

2.2. Peptide-BoLA-I and Peptide-BoLA-DR Complex Purification

Cells were harvested whilst in a log growth phase, and trypan blue staining was used to verify that >95% of the cells were viable at the point of harvest. Cells were washed twice with ice-cold PBS and then lysed in buffer (1% IGEPAL, 15 mM TRIS pH 8.0, 300 mM NaCl, and a complete protease inhibitor (Roche, Welwyn Garden City, UK)) at a density of 2 × 108 cells/mL for 1 min, diluted with PBS 1:1 and solubilized for 45 min at 4 °C. Lysates were cleared with two-step centrifugation at 500× g for 15 min, followed by 15,000× g for 45 min at 4 °C. pBoLA-I complexes were captured directly from the lysate using a pan-specific anti-BoLA-I antibody (ILA-88) covalently conjugated to protein A sepharose immunoresin (Amintra, Expedeon, Cambridge, UK) at a concentration of 5 mg/mL. BoLA-DR complexes were captured from the lysate, following a preliminary removal of pBoLA-I complexes (using BoLA-I capture as described above), using a pan-specific anti-BoLA-DR antibody (ILA-21) conjugated to protein A sepharose immunoresin at a concentration of 5 mg/mL. Captured pBoLA-I and pBoLA-DR complexes were washed sequentially using buffers of 50 mM Tris buffer, pH 8.0 containing 150 mM NaCl, then 400 mM NaCl, and finally 0 mM NaCl, prior to elution of the BoLA-bound peptides, BoLA protein chains, and β2M in 10% acetic acid and stored as described previously [30].

2.3. High Performance Liquid Chromatography (HPLC) Fractionation

Affinity column-eluted material was resuspended in 120 μL loading buffer (0.1% formic acid, 1% acetonitrile in water) and loaded onto a 4.6 × 50 mm ProSwiftTM RP-1S column (Thermo Scientific, Waltham, MA, USA) for reverse-phase chromatography on an Ultimate 3000 HPLC system (Thermo Scientific). Elution was performed using a 0.5 mL/min flow rate for over 5 min on a gradient of 2–35% buffer B (0.1% formic acid in acetonitrile) in buffer A (0.1% formic acid). Eluted fractions were collected from 1 to 8.5 min, for 30 s each. Protein detection was performed at 280 nm. Even and odd eluted fractions were pooled together, vacuum, dried, and stored at −80 °C until use.

2.4. LC-MS2 Analysis

Samples were suspended in a 20 μL loading buffer and analysed on an Ultimate 3000 nano UPLC system online coupled to either an Orbitrap FusionTm TribridTm Mass Spectrometer or a Q Exactive™ HF-X Hybrid Quadrupole-Orbitrap™ Mass Spectrometer (Thermo Scientific). Peptides were separated on a 75 μm × 50 cm PepMap C18 column using a 1 or 2 h linear gradient from 2–5% buffer A to 35% buffer B at a flow rate of 250 nL/min (approx. 600 bar at 40 °C). Peptides were introduced into the mass spectrometer using a nano Easy Spray source (Thermo Scientific) at 2000 V. The ion transfer tube temperature was set to either 305 °C (Fusion Lumos), or 250 °C (HF-X). Subsequent isolation and higher-energy C-trap dissociation (HCD) were induced on the 20 most abundant ions per full MS scan with an accumulation time of 128 ms and an isolation width of 1.2 Da (Fusion Lumos) or 1.6 Da (HF-X). All fragmented precursor ions were actively excluded from repeated selection for 8 s (Fusion) or 15 s (HF-X). The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository [31] with the dataset identifiers PXD008151 and PXD024053.

2.5. MS Data Analysis

The sequence interpretations of mass spectrometry spectra were performed using a database containing all bovine UniProt entries (total of 41,610 entries) and 4084 entries for the T. parva Muguga proteome [14]. The spectral interpretation was performed using the novo-assisted database search with PEAKS v8.5 or 10 (Bioinformatics Solutions), in ‘no enzyme’ mode, with mass tolerances of 5 ppm for precursor ions and 0.03 Da for fragment ions. The data were further searched against 313 in-build peptide modifications. To be included in the downstream analyses, T. parva peptides had to meet the following criteria: (1) a peptide-spectrum matching score (−10 lgP) of >20, (2) no predicted post-translational modifications, (3) a minimum of 2 amino acids difference from any bovine peptide sequence, and (4) for BoLA-DR peptides, predicted to not be a BoLA-I-binding peptide (the latter due to evidence of coprecipitation of BoLA-I peptides in the BoLA-DR eluted datasets [32]). Prediction of the MHC binding capacity of T. parva peptides was conducted using NetMHCpan-4.1 [33] for BoLA-I, and NetBoLA-IIpan [32] for BoLA-DR eluted peptides. Following the default parameters, peptides with a percent rank predicted score of >20% were considered to be nonbinders, whilst peptides with scores <2% and <5% were considered to be binders for BoLA-I and BoLA-DR, respectively. Coprecipitants were identified as overlapping peptides present in multiple cell-lines carrying nonhomologous BoLA-I/BoLA-DR molecules, where the lowest percent rank prediction score in at least one sample was >20% (i.e., predicted to be a nonbinder).

2.6. IFNG ELISPOT

IFNG ELISPOT was conducted using a standard format. In brief, a capture anti-IFNG monoclonal antibody (CC330—Biorad, Watford, UK) was bound to a pre-wetted PVF membrane multiscreen plate (Millipore, Watford, UK). Autologous T. annulata cells (used as antigen-presented cells −2 × 104 cells per well) were loaded with peptides at a concentration of 5 μg/mL for 2 h. At 37 °C before CD8+ T-cells were added at a density of 1 × 104 cells/well. Plates were incubated at 37 °C for 20 h before washing and identification of IFNG-producing cells by the addition of a biotinylated detection anti-IFNG antibody (CC302b—produced in-house) and development by sequential use of Vectastain (Vector Laboratories, Burlingame, CA, USA) and AEC substrate (Calbiochem, Watford, UK) solutions. Analysis of spot-forming units (SFU) was completed using an AID automated ELISpot reader (AID, Strassberg, Germany).

2.7. In Vitro Measurement of Peptide-BoLA-I Binding

The extracellular domains (positions 1–275, i.e., truncated at the transmembrane region) of BoLA-I heavy chain molecules had previously been produced as recombinant proteins and used to measure the affinity of peptide-BoLA-I interactions using human beta-2-microglubulin as the light chain component [34,35]. A previously described assay measuring peptide-human MHCI dissociation rate at 37 °C was adapted to these BoLA-I molecules [36]. Briefly, this assay used the dissociation of the invariant beta-2-microglubulin as a proxy to measure the dissociation of the peptides offered to the BoLA-I. The dissociation at 37 °C of 125I-radiolabeled beta-2-microglubulin was monitored in real-time by a high-throughput scintillation proximity assay, and the half-life of the dissociation was determined. Peptides that failed to bind to BoLA-I molecules did not register a half-life period; the length of the half-life was used to infer the stability and binding strength of individual peptide-BoLA-I complexes.

3. Results

3.1. Identification of BoLA-I Associated Peptides Derived from T. parva

To assess the ability to detect T. parva-derived BoLA-I-associated peptides using an immunopeptidomics approach, BoLA-I-associated peptides were purified with three T. parva-infected cell lines (TP), and the peptide fractions were analysed by LC-MS. Each TP was homozygous for a different MHC haplotype, which expressed 1, 2, or 4 BoLA-I genes (in contrast to humans and mice, there is a variable number of MHCI loci expressed in different BoLA-I haplotypes, ranging from 1 to 4 [5,37]; 641TP (A18: 6:01301), 1011TP (A10: 2:01201, 3:00201), and 2229TP (A14: 2:02501, 4:02401, 1:02301, 6:04001). From these cell lines, a total of 7672, 6961, and 6871 peptide sequences were identified, respectively (Figure 1A). Due to the high degree of homology between the sections of the bovine and T. parva genomes, a blast search of all peptide sequences putatively derived from T. parva against the bovine proteome was completed. To remove peptides potentially derived from bovine protein, variants from further analysis sequences that were less than two amino acids (a.a.) different from a matching sequence in the bovine proteome were excluded. After filtering, 25 (0.32%), 18 (0.25%), and 25 (0.36%) peptide sequences were identified as being unambiguously derived from T. parva proteins in the 1011TP, 2229TP, and 641TP cell lines, respectively (total number and number of unique T. parva peptides identified were 68 and 62, respectively—Table 1, Figure 1A).
To verify the accuracy of the MS spectral sequence annotation of the identified T. parva peptides, we employed a spectral matching approach, performing analysis of synthetic peptides under identical LC-MS conditions for a subset of the peptides (n = 33). The spectra obtained from the synthetic peptides matched those measured for the majority of the peptides analysed (n = 31, 93.9%), confirming their correct identification (Table 1, Figure 1B).
The capacity of the identified peptides to bind to the BoLA-I molecules expressed in the cell lines was predicted using the NetMHCpan4.1 algorithm [38], with peptides achieving a percent predicted rank binding score of <2% considered to be BoLA-I binders (Table 1). Only 33.8% of the T. parva peptides were predicted to be BoLA-I binders, in contrast to the bovine-derived peptides from the same samples, where ~90% of peptides were predicted to be capable of binding to a BoLA-I molecule expressed in the respective samples (Figure 1C).
Notably, unlike the bovine-derived peptides, the T. parva peptides did not exhibit a Gaussian (‘normal’) length distribution, and the low percentage of T. parva peptides predicted to be BoLA-I binders appeared to be due, in part, to the presence of a substantial fraction of peptides (n = 32, 47%) that were longer than the canonical 8–12 a.a. length of MHCI-binding peptides (Figure 1D). Only 9.4% of ≥ 13 mer peptides (n = 3/32) were predicted to be BoLA-I binders, whereas 55.6% (n = 20/36) of the 8–12 mer peptides had a predicted rank binding score of <2% (Table 1). This overrepresentation of longer peptides in the T. parva-derived peptidome suggested either a specific property for parasite peptides associated with MHC-I complexes, as has previously been observed for Toxoplasma gondii peptides [20], or a high proportion of coprecipitating peptides in the pathogen-derived fraction of the peptidome. The derivation of 13/32 (~40%) of ≥13 mer T. parva peptides from a single 28 a.a. region of one protein (hypothetical protein TpM_02g00758549–577), some of which were present in multiple samples despite the disparity in the BoLA-I molecules expressed by the three cell lines, was suggestive that a substantial proportion of the longer peptides were coprecipitants rather than peptides eluted from the peptide-binding groove of the purified BoLA-I molecules.
To confirm the capacity of the T. parva peptides to bind to BoLA-I molecules in vitro, binding assays were performed on a subset (n = 19) of peptides (Table 2, Figure 1E). This included representative peptides from across a range of NetMHCpan4.1 predictive scores (0.01–22.8%) and at least one allele from each MHC haplotype. For 12/13 of the peptides predicted to be BoLA-I binders, the binding assay confirmed binding; the exception was NSFVTDTFEKL, which had the poorest ranking score of the predicted binders (rank 1.68%, 3*00201). Conversely, three peptides with a rank of >2% bound to the relevant MHCI alleles in vitro, RLFNFATKRI (rank 2.32%), and SLKSALIDTLI (rank 2.47%) bound to 6*01301 and YGDYGEFDRKTK (rank 13.4%) bound to 2*01201; however, all three exhibited weaker binding than peptides that were predicted binders. The three peptides with the poorest rank scores (all rank >14%) failed to exhibit any binding on the assay. A notable feature of the results was the high level of correlation between the predicted percent rank binding scores and the quantitative results observed in the in vitro assay, with the data from the in vitro analysis, therefore, corroborating the BoLA-I binding predictions from NetMHCpan4.1. Interestingly, the in vitro binding assays demonstrated that three peptides identified as 2*01201-binders from the BoLA-A10 sample (1011TP) had the capacity to bind to 6*01301 (A18); however, the level of binding was very low, being generally >10 fold lower than the weakest predicted 6*01301 binding peptide. This may reflect the similarity of the peptide binding motifs of the 2*01201 and 6*01301 alleles [38].
ELISPOT assays using established in vitro autologous T. parva-specific CD8+ T-cell lines were conducted for the same subset of peptides that had been validated by spectral matching. These CD8+ T-cell lines produced potent responses against the positive control immunodominant epitopes (Tp2: TpMuguga_01g0005649–59; KSSHGMGKVGK, Tp9: TpMuguga_02g0089567–75: AKFPGMKKS, and Tp1: TpMuguga_03g00849214–224: VGYPKVKEEML for BoLA-A10, A14, and A18, respectively). Responses were also detected against one of the peptides identified from the pMHCI-elution dataset, TpMuguga_02g0089567–81, which contained the previously identified Tp9 epitope recognised by BoLA-A14+ animals (data not shown), however, no responses were elicited by any of the other peptides tested.
Thus, from this primary set of samples, a total of 68 T. parva peptides were identified, of which 23 were predicted to be BoLA-I binders. Data from spectral matching, BoLA-I binding prediction, and in vitro binding assays confirmed the identity of the pMHCI-eluted peptides and their capacity to bind to the relevant BoLA-I molecules; however, ELISPOT data indicated that only 1/33 of the assayed peptides were recognised by T. parva-specific CD8+ T-cells derived from an ITM-immunised donor.

3.2. Immunopeptidome Analysis of Additional T. parva-Infected Cell Lines

Having verified the capacity of LC-MS analysis of BoLA-I-eluted peptides to identify BoLA-I-presented T. parva-derived peptides, a second sample set comprising 10 TP cell lines was studied. This included four cell lines, which between them expressed four additional BoLA-I haplotypes: 2824TP (A19: 2:01601, 6:01402), 5350TP (A20: 2:02601, 3:02701), 2408TP (A15: 1:00901, 2:02501, 4:02401), and 2123TP (A12/A15: −2:00801, 1:01901 1:00901, 2:02501, 4:02401); independent replicate samples derived from cell lines 641TP, 2824TP, 2123TP, 5350TP (denoted by the suffix ‘_rpt’), and duplicate samples from an additional BoLA-A10 cell line (5072TP).
The data generated from the second sample set had a similar profile to that obtained from the preliminary set of samples. A range of between 5333 and 12119 total peptides (average = 8487) and 6–107 T. parva peptides (average = 46) were identified in each sample (Figure 2A); thus, the average percentage of T. parva peptides in the data was 0.53% (a summary of all of the data is provided in Supplementary Data 1.1). The total number of T. parva peptides identified and number of unique sequences identified in all 10 samples combined were 456 and 294, respectively. Details of all T. parva peptides identified in the second sample set are provided in Supplementary Data 2.1. As in the preliminary dataset, the T. parva peptides exhibited an anomalous length distribution, with only 53% being 8–12 mers and 44.9% being ≥13 mers (1.8% of T. parva peptides were 7 mers, n = 8), whilst >90% of the bovine peptides were of the canonical 8–12 a.a. length, suggesting the anomalous profile was parasite-specific (Figure 2B). Similarly, the percent rank binding prediction results were similar to the first dataset, with a high proportion of bovine derived 8–12-mer peptides predicted to be binders (for all samples combined = 91%, range in samples = 86–96%), whilst only 57% (range in samples = 20–100%) of the T. parva-derived 8–12-mer peptides were predicted to be BoLA-I binders (Figure 2C).
Together, the data indicate that BoLA-I immunopeptidomic analysis of Theileria parva-infected cell lines generated a consistent data profile that comprised subsets of peptides of canonical length, of which ~50% were predicted to be MHCI-binders, and peptides of anomalous lengths that contained few MHCI-binders.

3.3. Exclusion of Putative Coprecipitating Parasite Proteins and Application of Immunoinformatics Provides a Refined List of Putative BoLA-I-eluted T. parva Peptides

Examination of the collated data derived from the 13 samples demonstrated that ribosomal proteins, histones, and TpM_02g00758 were dominant sources for the 524 T. parva-peptides identified, accounting for 27.1%, 19.3%, and 16.2% of the peptide repertoire, respectively. The majority of peptides derived from TpM_02g00758 were of an anomalous length (average length = 17.5 a.a., with 67/85 of the peptides being ≥13 a.a. in length), were predominantly overlapping peptides originating from a small 30 a.a. region (79/85 peptides (92.9%) derived from TpM_02g00758547–577), and had a poor percent rank binding prediction score (median = 93.1%, only one peptide had a predicted percent ranking binding score below the 2% threshold).
Peptides from other proteins exhibited similar, but less pronounced, characteristics—for example, ribosomal protein S28-B 40s (TpMuguga_03g004281–14) and histone H2A variant 1 (TpMuguga_02g00611), as shown in Figure 3. The recurrence of peptides from localised regions of a small subset of proteins in multiple samples of nonsimilar BoLA-I haplotypes, which generally exhibited poor percent rank binding scores and anomalous peptide lengths, supports the designation of these peptides as co-precipitants rather than genuine MHC-binders.
Based on this, the dataset was refined by removing overlapping peptides identified in multiple samples expressing disparate BoLA-I haplotypes (for this purpose, BoLA-A14 and BoLA-A15, which express common BoLA alleles, were grouped together) where one or more of the peptides was not predicted to be a BoLA-I binder (defined as a rank-predicted binding score >20%). When applied to the combined dataset, this refining process removed 55.2% of the T. parva peptides (n = 289/524—Supplementary Data 3). The removed peptides had an average length of 14.7 amino acids and a median rank prediction score of 66.3%, with only 20 (6%) having a rank prediction score of <2%. In contrast, the remaining 235 peptides had an average length of 11.6 amino acids, a median rank prediction score of 1.43%, and the number of peptides with a percent rank prediction score of <2% was 128 (55.4%)—thus, the removal of the coprecipitants had a profound effect on the dataset, leading to a substantial improvement of the predicted percent rank score and making the enrichment for genuine BoLA-I binders in the dataset evident (Figure 4).
These putative coprecipitant peptides were derived from 25 proteins. These included TpM_02g00758, 12 ribosomal proteins, and 3 histones, which together were the source of 88.9% of the coprecipitant peptides (n = 257/289). Based on the high representation of these proteins in the coprecipitation pool, it was decided to remove all peptides derived from these classes of proteins. This resulted in the removal of an additional 75 peptides, so that a total of 364 peptides were excluded; this peptide set had an average length of 14 a.a., a median predicted-rank score of 22.8%, and 68 peptides (18.7%) of the peptides had a rank prediction score of <2%. The retained peptide dataset consisted of 160 peptides, with an average peptide length of 11.6 a.a., a median rank predicted binding score of 2.03%, and 80 peptides (50%) that were predicted to be binders. Thus, the removal of all peptides derived from ribosomes, histones, and TpM_02g00758 caused a slight deterioration in the statistics of the retained peptide set but was considered a good compromise to decrease the retention of possible coprecipitant artefacts. As a final step to refine the peptide dataset, an immunoinformatics filter was used, and all remaining peptides that had a predicted percent rank binding score of >2% (i.e., not predicted MHCI-binders) were removed. This left a final dataset of 80 peptides, which, after consolidation of overlapping, nesting, and duplicate identifications, resulted in 74 unique peptides from 68 proteins (Table 3).

3.4. Analysis of the Reproducibility of the Identified T. parva BoLA-I Immunopeptidomes

In the final dataset, the average number of T. parva peptides identified per sample was approximately six, suggesting that only a small subset of the BoLA-I-presented T. parva-peptides had been identified. To evaluate what effect this had on the reproducibility of the T. parva peptidomes described, we examined the overlap of peptides identified in cell lines that had been subjected to duplicate analysis of independent samples (technical duplicates for 641TP, 2824TP, and 5350TP, respectively) and in the triplicate datasets from TP cell lines expressing BoLA-A10 and BoLA-A15 haplotypes (comprising the 1011TP/5072TP and 2123TP/2408TP samples, respectively).
For the BoLA-A18, A10, and A15 groups, there was partial, but limited, overlap between replicate samples; in contrast, for the BoLA-A19 and BoLA-A20 groups, there was no overlap between the samples (Figure 5). As a summary statistic, the percentage of T. parva peptides identified in replicate samples was 7.3%; in comparison, the overlap between the bovine peptidomes from the same samples was greater, with 47.1% of bovine peptides identified in replicate samples. The low level of overlap observed between replicate T. parva immunopeptidomes is most likely a consequence of the low number of T. parva peptides identified (notably in the BoLA-A19 and BoLA-A20 groups, only one peptide was identified in one of the replicate samples); however, the identification of a subset of T. parva peptides in replicate samples indicates that the immunopeptidomes described in this study are at least partially reproducible, and higher resolution studies, yielding greater depth of peptide repertoire coverage, would likely produce datasets exhibiting greater reproducibility.

3.5. Analysis of T. parva Peptides Presented by BoLA-DR

We sought to expand the immunopeptidiomic analysis to bovine MHCII molecules. Cattle express two BoLA-II isotypes—DR and DQ. The peptide-binding groove of MHCII molecules is formed by a combination of the coexpressed α and β chains that form the MHCII heterodimer. Both BoLA-DQA and DQB loci exhibit polymorphism and are duplicated in some BoLA-haplotypes [39], whereas the BoLA-DRA locus is monomorphic and there is only a single function and expressed BoLA-DRB locus [40,41]. Consequently, immunopeptidomic analysis of BoLA-DR was considered less complex, and we undertook an analysis of the peptides eluted from BoLA-DR molecules of six T. parva-infected cell lines: 2123TP (BoLA-DR 15:01/11:01), 2824TP (BoLA-DR 16:01), 5072TP (BoLA-DR 10:01), 641TP (BoLA-DR 20:02), 495TP (BoLA-DR 10:01/11:01), and 5350TP (BoLA-DR 12:01), which between them expressed six different BoLA-DR molecules (Supplementary Data 1.2). The total number of peptides identified in each sample ranged from 4592 to 8547 peptides (average = 6738, Figure 6A). After filtering sequences with close homology to the bovine proteome, a range of 58–151 peptides (average = 101) were identified as being unambiguously derived from T. parva, representing 1.5% of the total peptides identified (total number and number of unique T. parva peptides identified were 607 and 326, respectively; Supplementary Datas 1 and 2). The average length of T. parva peptides was slightly shorter (15.0 a.a.) than the bovine peptides (15.7 a.a) (Figure 6B) and adhered less to a classic Gaussian distribution. The proportion of peptides that were predicted to be binders (i.e., had a percent rank predicted binding score of <5% when using NetBoLAIIpan; the threshold used for BoLA-DR binding) for 13–21-mer bovine peptides were consistently high, ranging from 82 to 84% (average = 83%). In contrast, the proportion of T. parva-derived 13–21-mer peptides that were predicted to be BoLA-DR binders was much lower, ranging from 5 to 22% (average = 12%; Figure 6C).
Similar to the BoLA-I data, a notable feature of the T. parva peptides in the BoLA-DR dataset was the dominant representation of peptides from a small subset of proteins. This included TpMuguga_02g00758, from which peptides were identified in all six samples and which accounted for a total of 118 peptides (19.4% of all T. parva peptides in the BoLA-DR dataset). As with the BoLA-I data, the peptides from these proteins identified in different samples were often clustered in specific regions, were overlapping, and predominantly had poor predicted percent rank binding scores, indicative of coprecipitating peptides. Application of the same process as described for the BoLA-I data to identify putative coprecipitants suggested that a substantial majority of the T. parva peptides (82.9%, n = 503/608—Supplementary Data 3) were coprecipitants. These peptides were derived from 36 individual proteins, of which 25 were either ribosomal or histone proteins. A comparison of those proteins that were identified as the sources of coprecipitant peptides in the BoLA-I and BoLA-DR datasets showed a high level of convergence (15 proteins common to both) and a correlation in the number of peptides that individual proteins contributed to the BoLA-I and BoLA-DR datasets (Figure 7A and Supplementary Data 3).
The removal of coprecipitated peptides had a limited impact on the average predicted rank percent binding score of the combined BoLA-DR peptide dataset (46.3% vs. 46.9%), however, the distribution of the retained peptides showed a clear bimodal pattern with peaks of peptides with a rank percent prediction score of <5% and >95%; in contrast, the profile of the coprecipitated T. parva peptides showed no evidence of selection of predicted BoLA-DR binders, with only a dominant peak for peptides with a predicted rank percent binding score of >95%—Figure 7B. Although removal of the coprecipitants provided an enhanced dataset, the majority of the peptides retained in the dataset were not predicted to be BoLA-DR binders; 79/104 (76%) of the peptides had a predicted rank binding score of >5%. As with the BoLA-I data, all peptides derived from TpM_02g00758, ribosomal proteins, and histones were removed (34 peptides with a median rank predicted binding score of 46.95, and only two peptides were predicted to be BoLA-DR binders), and the default threshold (i.e., a percent rank-predicted binding score of <5%) used to predict MHC binding was applied to generate a final list of BoLA-DR presented T. parva peptides. After consolidation of the nested peptides, this list included 15 peptides, each derived from a different T. parva protein (Table 4).

3.6. Comparison of T. parva Peptidome Data with Previously Identified T. parva Antigens

Recent work applying conventional antigen-screening techniques to identify CD4+ and CD8+ T-cell epitopes with a peptide library covering 502 T. parva proteins from the reference Muguga strain [13] has expanded the number of known T-cell antigens to 36; twenty CD4+ T-cell antigens, 10 CD8+ T-cell antigens, and six antigens containing epitopes for both CD4+ and CD8+ T-cells [12,13]. The library included peptides covering 19 out of the 105 proteins that were identified as sources of BoLA-I and/or BoLA-DR presented peptides in this study. Although none of the peptides from these 19 proteins matched experimentally mapped CD8+ or CD4+ T-cell epitopes, five of the proteins have been identified as T-cell antigens (Table 5). This includes TpMuguga_02g00123 (Tp32—DEAD/DEAH box helicase) and TpMuguga_02g00895 (Tp9), which have been shown to contain epitopes for both CD8+ and CD4+ T-cells. Thus, 26.3% of the proteins identified as sources of MHCI/MHCII presented peptides from immunopeptidomics that have been included in the recent conventional antigen-screening study [13] have been demonstrated to contain recognised epitopes. In contrast, only 7.5% of the proteins selected for inclusion in that study were shown to contain epitopes. This suggests that, although only one of the peptides identified in this study has been validated as containing an epitope, immunopeptidomics could be used to preferentially select proteins that are sources of CD4+/CD8+ T cell antigens.

4. Discussion

Identification of T-cell antigens for inclusion in vaccines against complex pathogens remains challenging. The potential for immunopeptidomics to address this critical obstacle in the development of novel subunit vaccines against a wide range of pathogens, especially nonviral pathogens with complex proteomes [42], is receiving greater attention. This is of especial relevance to intracellular eukaryotic pathogens, such as Plasmodium, where integration of data from immunopeptidomics with data from other antigen-identification approaches has been advocated as the most efficient way in which to identify antigens with potential for vaccination for pre-erythrocytic malaria vaccines [43].
Antigen-identification persists as a constraint on the development of novel T. parva subunit vaccines with the capacity to induce T-cell responses. Conventional screening approaches have yielded a number of T. parva antigens for both CD4+ and CD8+ T-cells [12,13,44] on some MHC backgrounds, and in a recent study, use of a purely ‘immuno-informatic’ approach to identifying T-cell antigens from Theileria has been attempted [45]. In this study, we provide the first description of MHC-peptide elution studies being used to investigate the immunopeptidome of T. parva-infected cells to define peptides presented by BoLA-I and BoLA-DR molecules.
Although the large proteome and high parasite diversity are disadvantages in conventional antigen-screening approaches, the biology of T. parva has inherent advantages with regards to the application of immunopeptidomics. Firstly, it is easy to generate and maintain rapidly and indefinitely proliferating T. parva-infected cells in vitro—enabling the accumulation of sufficient cells (>1 × 109 infected cells), with an infection rate of ~100%, from the target host species. In malaria, the application of immunopeptidomics has been hindered by the low level of hepatocyte infection (<10%) that can be achieved [43]. The second feature of T. parva is that the in vitro infected cells express high levels of both BoLA-I and BoLA-II and are of the same phenotype as the cells infected in vivo (i.e., T and B lymphocytes). The capacity of these T. parva-infected cells to recall antigen-specific T-cells from recovered animals without the need for supplementary APCs confirms that the peptides presented in the context of BoLA-I/BoLA-II by T. parva-infected cells are those that stimulate T-cell responses [3,29]. As such, cells infected in vitro with T. parva are highly appropriate for studies to apply immunopeptidomics and avoid the potential complication of cell types selected for analysis that was observed in recent Chlamydia studies, where dendritic and epithelial cells were found to have discordant and nonoverlapping immunopeptidomes [46].
In this study, we exploited the features of T. parva to analyse the immunopeptidomes from a total of nine different cell lines representing a range of BoLA-I haplotypes and BoLA-DR molecules. This gave our study a unique structure compared to immunopeptidomic studies that have been reported for other eukaryotic parasites (Leishmania, Plasmodium, and Toxoplasma), where data have been generated for either MHCI or MHCII and from only single samples [19,20,21]. This approach was primarily driven by two factors. Firstly, as cattle are not a ‘model’ species, there was, at the outset of these studies, very limited information on the peptide binding motifs of bovine MHC molecules—the inclusion of multiple BoLA-I/BoLA-DR genotypes allowed us to define their peptide-binding motifs [32,38,47], which will find applications in enhancing ‘immuno-informatic’ based studies in cattle (and was also pivotal in subsequently refining the data presented herein). Secondly, the ultimate aim of the study was the exploration of the feasibility of using immunopeptidome analysis as an alternative and/or complementary approach contributing to CD4+/CD8+ T-cell candidate antigen identification for a T. parva vaccine that will be used in outbred cattle populations. As such, there was an interest in generating data for both BoLA-I and BoLA-II molecules for a range of different genotypes. In this study, we purposely focused on MHC haplotypes present in Holstein-Friesian cattle. This was primarily because these high-yielding dairy cattle and their crosses with indigenous cattle continue to increase in numbers in regions of Africa where ECF occurs. These high-value animals are considered critical to improving agricultural production and meeting the growing local demand for dairy products. As an ‘exotic’ breed that shows minimal tolerance to T. parva infections, these animals are particularly susceptible to ECF and are, therefore, the primary target for any vaccine against T. parva. Secondly, most previous antigen-identification work has primarily focused on animals expressing MHC haplotypes present in Holstein-Friesian animals. Thirdly, at the moment there is insufficient data on the diversity and frequency of MHC genotypes in cattle breeds indigenous to T. parva-endemic regions on which to base the selection of samples to subject to immunopeptidomic analysis in these populations. This fundamental gap in our knowledge is currently being addressed by several groups through the development and application of high-throughput sequencing approaches for exploring MHC diversity in a range of cattle populations [28,48,49].
A consistent feature of the BoLA-I and BoLA-DR T. parva-derived peptide datasets obtained during this study was the predominance of peptides that were of noncanonical length and/or predicted to be non-MHC binders. Equivalent anomalous peptides were not a feature of the bovine peptidomes, suggesting that these anomalous peptides were specific to the pathogen rather than a technical fault in the protocol. Non-genuine MHC binders that co-precipitate with pMHC molecules in elution studies using equivalent protocols have been documented before (e.g., in HIV-1 [50]). However, the extent to which putative co-precipitants dominated the T. parva datasets was remarkable; a total of 55.2% and 82.9% of the peptides identified from the BoLA-I and BoLA-DR datasets were defined as co-precipitants in this study. The co-precipitant peptides came from a small subset of proteins, which showed a high level of overlap between the BoLA-I and BoLA-DR datasets, suggesting that co-precipitating peptides did not represent random ‘noise’ but rather a feature of specific T. parva proteins. TpMuguga_02g00758, a hypothetical protein of 640 amino acids, was the largest contributor of co-precipitating peptides to both datasets (29.5% and 23.3% in the BoLA-I and BoLA-DR datasets, respectively). The second and third most common proteins acting as sources of co-precipitants in both datasets were histones (contributing a total of 28.5% and 24.5% of co-precipitants in MHCI and DR, respectively), whilst the most commonly represented family of proteins were ribosomal proteins (11 and 21 proteins contributing 25% and 39.4% of coprecipitating peptides in the BoLA-I and BoLA-DR datasets, respectively). Together, TpMuguga_02g00758, histones, and ribosomes constituted >80% of the co-precipitants identified in both datasets.
Relatively few immunopeptidomic studies have been conducted on parasite-infected cells [19,20,21], so there is little comparative data to assess whether this magnitude of coprecipitating peptides is a feature common to intra-cellular parasites or specific to Theileria (only eukaryotes express both ribosomes and histones, preventing direct comparison with data from viral or bacterial studies). Coprecipitants were not specifically examined in any of the other parasite immunopeptidomic studies. However, direct comparison between the studies would perhaps be confounded by technical differences; in the Toxoplasma gondii study [20], a monoallelic secreted MHCI model was used and in the Plasmodium study [21], DCs were loaded by incubation with parasitized RBCs, which may, through different mechanisms, change the quantity and/or source of any coprecipitants observed. The high level of coprecipitants observed for T. parva may reflect fundamental biological differences between it and the other parasites for which immunopeptidomic studies have been completed. Foremost of these is the location of T. parva schizonts free within the host cell cytoplasm (Plasmodium, Toxoplasma, and Leishmania reside within vacuoles). This feature, combined with the active secretion of Theileria proteins into the host cell cytoplasm [51,52,53] and the close interaction between the parasite and various components of the host cell’s architecture (e.g., attachment to the host cell’s mitotic spindle during cell division [54] and the formation of various host-pathogen complexes on the schizont surface [53], may afford greater opportunities for T. parva proteins to non-specifically bind to MHC molecules and/or integrate into elements of the host cell’s structures that are subsequently co-precipitated during the elution protocol. Notably, the TpMuguga_02g00758 gene product has a signal peptide, indicating that it is likely to be secreted into the host-cell cytoplasm. However, absence of any other annotated features of this protein precludes any inferences about its biological function and how this might relate to it being the major co-precipitant in both the BoLA-I and BoLA-DR datasets. A second pertinent feature of T. parva biology is the transformation of the host cell and rapid proliferation of both the host cell and parasite. To sustain the DNA and protein production that proliferation requires, histones and ribosomes are likely to be subject to high levels of expression and turn-over which may contribute to their overrepresentation as coprecipitants. At present there is no proteome of the schizont stage of T. parva, however transcriptomic analyses of T. parva schizonts show that several histone and ribosomal proteins are amongst the most abundantly expressed proteins [55]. Comparison of these data with proteomic analysis of schizonts of the closely related T. annulata has shown a high level of concordance—suggesting the transcriptome is quantitatively representative of the T. parva schizont proteome [56].
After removal of co-precipitants, both the BoLA-I and BoLA-DR datasets showed bias for peptides that were predicted to be strong MHC-binders (Figure 4 and Figure 7B), demonstrating preferential selection of MHC-binding peptides. Within these datasets there remained a high number of peptides derived from TpMuguga_02g00758, ribosomes and histones. Based on the high frequency at which these proteins were the source of co-precipitating peptides we decided to exclude all peptides from these proteins. Whilst this did not enhance the parameters by which the quality of the datasets were measured, it removed classes of proteins that were evidently sources of co-precipitating proteins and so compromised as potential candidate vaccine antigens. Following removal of these peptides, the BoLA-I and -DR datasets were reduced substantially from 524 and 607 to 160 and 70 respectively; however, within these there remained a substantial proportion of peptides that were predicted to be non-MHC binders. It is possible that these peptides may also be co-precipitants; however, there was no clear rationale in the current dataset for defining them as such and so they were excluded from the final peptide selections solely on the basis of default thresholds of predicted rank percent scores used to define BoLA-I (<2%) and BoLA-DR (<5%) binding. The high correlation seen between the predicted % rank binding scores and the results from the in vitro binding assay (Table 2), the very high proportion of bovine derived ligands with low rank scores (indicating BoLA binding) [38] and the observation that most known CD4+ and CD8+ T-cell epitopes from T. parva proteins have predicted % rank binding scores within the respective 5% and 2% thresholds [32,38], support this approach to rationalising the final peptide lists. The availability of well-defined peptide-binding motifs for these bovine MHC molecules was critical in curation of the datasets and attests to the importance of high quality immuno-informatic data to support future immunopeptidomic studies.
Recognition by T-cells from immune animals has been used as a standard component of assays attempting to ‘validate’ peptides identified from immunopeptidomic studies. On assaying a subset of the T. parva peptides identified from the first 3 BoLA-I samples only 1 out of 33 of the peptides (3.3%) was recognised. Prima facie, this lack of recognition was disappointing. However, it is similar to results from other immunopeptidomic studies (e.g., [57,58]) and in the context of immunodominance, which is a characteristic feature of many CD8+ T-cell responses [59], is to be anticipated. Due to factors such as TCR repertoire and APC competition [60], although a broad range of peptides may be presented by MHC molecules (and so reported in the immunopeptidome), immunodominance causes a detectable T-cell response against only a small subset of the presented peptides. Previous work by our group has confirmed that T. parva-specific CD8+ T-cell responses are subject to immunodominance, with up to 78% of the responses in BoLA-A10 and A18 homozygous animals being directed against single peptides [15]. The failure of the peptides identified in the immunopeptidome to be recognised by T-cells from immune animals does not discredit them as vaccine candidates. Studies in a range of pathogens including Trypanosoma cruzi [61], Mycobacterium tuberculosis [62,63] and various viruses [64,65,66] has shown that subdominant/cryptic epitopes can confer protection, which in some cases is better than that afforded by immunodominant epitopes. Furthermore, it has been shown in numerous studies [61,67,68,69,70,71] across a similar range of pathogens, that induction of T-cell responses in the absence of immunodominant antigens (as can be engineered in subunit vaccines) can successfully elicit responses against epitopes that are normally ‘cryptic’/subdominant following natural infection (which, in the context of T. parva, ITM immunisation essentially is). Thus, with regards to using immunopeptidomics to identify novel vaccine candidate antigens, the concept of using recognition of peptides by T-cells from immune animals to ‘validate’ peptides is conceptually flawed. To discount peptides not recognised by T-cells of immune cattle would mitigate against a major benefit of immunopeptidomics, the capacity to identify a large repertoire of MHC-presented peptides, amongst which may be potential epitopes that cannot be identified from T-cell screening but which potentially have the capacity to confer protection when utilised in vaccines. Data from recent studies of human melanoma [57] and African Swine Fever Virus [58] have confirmed T-cells recognising peptides identified through immunopeptidomics but which were ‘cryptic’ (i.e., not recognised by T-cells from the affected individuals/infected pigs) can successfully be induced by vaccination with these peptides. In future studies we plan to generate a multi-epitope vaccine construct including a selection of the peptides identified herein to directly assess their potential as vaccine candidates (see below). Encouragingly, evidence from a recent, unrelated study implies that it will be possible to induce T-cell responses against T. parva epitopes that are not recognised following natural infection. Following Adenovirus/MVA heterologous prime-boost immunisation of 4 calves with T. parva antigens (Tp2, Tp9 and Tp10) all 4 animals expressing either BoLA-DRB3*11:01 and/or 10:01 generated CD4+ T-cell responses specific for the Tp10 (TpMuguga_04g00772) antigen, which had not been previously observed as a CD4+ T-cell antigen in ITM-immunised animals bearing these or other BoLA-DR alleles [13]. These Tp10-specific CD4+ T-cells were capable of recognising T. parva-infected cells and, as far as could be determined, were functionally similar to CD4+ T-cells against immunodominant epitopes induced by ITM.
The T. parva cell lines chosen for the initial immunopeptidomic analysis were selected partly because immunodominant CD8+ T-cell epitopes restricted by MHCI proteins expressed by these cells (BoLA-A10+, BoLA-A18+ and BoLA-A14+) had already been defined. However, only the BoLA-A14-restricted TpMuguga_02g0089567–75 epitope was identified in the immunopeptidome (as a nested peptide within a 15-mer). This most likely was a reflection that the immunopeptidomes characterised in this study were only partial. In the final dataset the average number of T. parva unique peptides identified per BoLA-I and BoLA-DR sample was only ~6 and ~2.5 respectively. Examination of replicate samples demonstrated very limited overlap in the BoLA-I immunopeptidomes described (Figure 5—only 7.3% of the peptides were identified in multiple samples)—confirming that only a fraction of the T. parva peptidome was being identified from any individual sample at the depth of data generated in this study. A more intensive immunopeptidome analysis would be expected to enable a more complete characterisation of T. parva immunopeptidomes, which would potentially include the identification of the known T. parva CD4+ and CD8+ epitopes. For reasons given above, we elected in this study to prioritise the inclusion of multiple BoLA genotypes. However, future work complementing this with ‘deep’ immunopeptidomic profiling for a limited number of samples would be beneficial.
The ultimate aim of the study was to assess the feasibility of immunopeptidomics to identify candidate antigens to include in novel subunit vaccines. At the end of the analysis, a total of 74 and 15 unique peptides with predicted high binding capacities for BoLA-I and BoLA-DR molecules were identified. This information can be used in multiple ways to inform candidate antigen selection. For example, as described in the final results section, immunopeptidomic data could increase the efficiency of conventional antigen screening by focusing on T. parva proteins that are evidently contributing to the immunopeptidome. This may be of particular value as other parameters that have been evaluated (e.g., transcript abundance, presence of signal peptides) have not proved reliable for targeting antigen screening [13]. The simplest and most direct application of the data would be to include the identified peptides in a vaccine delivery platform (e.g., an adenoviral-vectored vaccine) and administer this to animals of the appropriate BoLA genotypes to assess immunogenicity and/or protective efficacy. However, this approach would only partially utilise the information made available from the immunopeptidomic data. The well-documented antigenic diversity of T. parva [8,9,10,11] and the polymorphism of bovine MHC in cattle in T. parva-endemic areas (Vasoya et al. submitted) preclude the direct application of immunopeptidomic studies to identify specific epitopes for all the potential permutations of MHC genotype and parasite strain. A more practicable approach would be to integrate this and future immunopeptidome data identifying proteins preferentially accessing the MHC processing pathways with data analysing the host immunogenetic and pathogen genetic diversity. The repertoire of bovine BoLA genotypes in cattle populations in T. parva-endemic areas is now being analysed using high-throughput sequencing approaches (Vasoya et al. submitted), and work has begun to develop improved immuno-informatics algorithms for cattle [32,38,47] enabling accurate prediction of binding motifs of BoLA-I and BoLA-DR molecules. Simultaneously, transcriptomic and genomic sequencing of multiple T. parva strains [8,14] has recently been undertaken, enabling the strain diversity of T. parva proteins to be assessed. Together, the outputs of these ‘omics’ technologies could be utilised to identify candidate T. parva antigens presented on BoLA-I/BoLA-II (immunopeptidomics) and evaluate them for their degree of conservation (genomics/transcriptomics of T. parva strains) and their content of peptides (immunoinformatics) that can be presented on multiple BoLA-I and BoLA-DR genotypes (MHC repertoire analyses). The complexity of both the host MHC and pathogen strain diversity will undoubtedly make identification of a comprehensive set of candidate antigens immensely challenging, but this approach will make the best use of the data being generated from a suite of ‘omics technologies and provide a rational approach to identifying a ‘minimal’ set of antigens that would enable the induction of CD8+ and CD4+ T-cells from animals bearing a range of different MHC genotypes against diverse T. parva strains.
In this study, we have provided the first exploration of the immunopeptidome of T. parva-infected cells. The study encompasses data from multiple BoLA-I and BoLA-DR genotypes, identifying 74 and 15 unique BoLA-I and BoLA-DR binding peptides, and as such forms probably the most comprehensive immunopeptidomic analysis of any eukaryotic pathogen to date. However, there is clear evidence that the characterisation of the immunopeptidome completed herein is only partial and further studies are required to gain a more comprehensive understanding of the T. parva BoLA-I and BoLA-II immunopeptidomes. Based on the dataset obtained in this study, we have developed a simplified curation system that can be applied in future T. parva immunopeptidomic studies to effectively remove coprecipitating peptides that can dominate eluted peptide datasets and restrict the data to peptides that are predicted to be genuine MHC-binders. Due to the high levels of diversity in both T. parva and bovine MHC genotypes, we propose that the information derived from immunopeptidomics can most productively be used in the development of novel vaccines, not as a means to identify individual epitopes that are specific to a particular combination of T. parva strain and bovine MHC allele, but rather as part of an integrated strategy that aims to identify a repertoire of proteins that can potentially elicit CD4+ and CD8+ T-cell responses across a breadth of cattle MHC genotypes and that are able to provide protection against the spectrum of T. parva strains present in endemic areas. Such an approach will build on the synergy of the multiple new technologies that are being used to study T. parva and bovine immunogenetics and offer new opportunities to tackle the conundrum of T-cell antigen identification, which remains an obstacle to developing novel vaccines against this important disease.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/vaccines10111907/s1. Supplementary Data 1: Summary statistics of the T. parva peptides eluted from samples subjected to BoLA-I (Supplementary Data 1.1) and BoLA-DR (Supplementary Data 1.2) elution in this study. For each sample summary data are provided for both the uncorrected T. parva peptidome (i.e., prior to removal of co-precipitants) and for the corrected T. parva peptidome (i.e., after removal of co-precipitants). The data includes the number of T. parva peptides, the percentage of the total peptidome (i.e., the combined number of bovine and T. parva derived peptides) that the number of T. parva peptides represents, the average (median) predicted % rank score (%RPS) of the T. parva peptides, the number and percentage of the T. parva peptides that were considered to be predicted binders (i.e., had a %RPS <2% for BoLA-I and <5% for BoLA-DR) and the average length of the T. parva peptides. Supplementary Data 2: T. parva peptides derived by BoLA-I elution from (Supplementary Data 2.1) 2824TP, 5350TP, 2408TP, 2123TP, 5350TP_rpt, 2123TP_rpt, 641TP_rpt, 2824TP_rpt, 5072TP, 5072TP_rpt and BoLA-DR elution from (Supplementary Data 2.2) 2123TP_DR, 2824TP_DR, 495TP_DR, 5350TP_DR, 641TP_DR and 5072TP_DR. In Supplementary Data 2.1 for each peptide the amino acid sequence (column A), -10lgP score (B), peptide length (C), sample (D), number of spectra (E), MHC allele to which the peptide has the highest % rank binding score (F), core peptide sequence (G), % rank prediction score (H), accession number of the source protein (I), description of the source protein (J) and the peptide location within the source protein (K and L) are shown. (Equivalent data for 641TP, 1011TP and 2229TP samples are provided in Table 1). In Supplementary Data 2.2 the same data is provided but in column I is shown the context of the peptides, with subsequent data shifted by 1 column. Supplementary Data 3: Analysis of putative co-precipitants in the T. parva peptide datasets obtained from BoLA-I (BoLA-I dataset) and BoLA-DR (BoLA-DR dataset) elutions and a summary of the proteins that are sources of co-precipitants (Co-precipitating proteins). Co-precipitants were defined as overlapping peptides identified in multiple samples expressing non-matching MHC genotypes where one of more of the peptides were predicted to be non-MHC binders (defined as a % rank predicted score of >20%). Peptides designated as putative co-precipitants are identified in column M as YES; peptides removed as they were derived from either ribosomal proteins, histones or TpM_02g00758 are identified in column N as YES. Worksheets ‘MHCI dataset’ and ‘DR dataset’ provide the information from the combined BoLA-I and DR datasets respectively. The ‘Co-precipitating Proteins’ spreadsheet provides details of the proteins identified as the source of co-precipitating peptides. The number of peptides from each protein that were removed as putative co-precipitatants is shown.

Author Contributions

T.C., N.T., A.H., M.N. and I.M. conceived the study. T.C., T.S., C.V. and A.F. prepared samples and conducted laboratory work; A.N. and N.T. completed the mass spectrometry work; B.R. and M.N. conducted the immuno-informatic analyses; S.B. completed the in vitro MHC binding analysis. T.C., S.B., I.M., M.N. and N.T. contributed to the preparation of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by funds from multiple sources, including DfiD and the Bill and Melinda Gates Foundation through the ‘Improved vaccines for the control of East Coast Fever in cattle in Africa’ grant (OPP1078791); the BBSRC through the ISP award made to the Roslin Institute (BBS/E/D/20002174); a small grant award from the Roslin Foundation; and a FAPESP scholarship awarded to AF (2014/11010-9, 2017/21401-4, and 2018/23579-8). The findings and conclusions contained within are those of the authors and do not necessarily reflect the positions or policies of the Bill and Melinda Gates Foundation, the UK Government, or any of the other funders.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Acknowledgments

We gratefully acknowledge the supporting work from other members of the BMGF-funded ‘Improved vaccines for the control of East Coast Fever in cattle in Africa’ consortium—a collection of laboratories that together made great advances in our understanding of the patho-biology of, and immune responses against, Theileria parva.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. East Coast Fever. Available online: https://www.galvmed.org/livestock-and-diseases/livestock-diseases/east-coast-fever/ (accessed on 30 September 2022).
  2. Goddeeris, B.M.; Morrison, W.I.; Toye, P.G.; Bishop, R. Strain specificity of bovine Theileria parva-specific cytotoxic T cells is determined by the phenotype of the restricting class I MHC. Immunology 1990, 69, 38–44. [Google Scholar] [PubMed]
  3. Goddeeris, B.M.; Morrison, W.I.; Teale, A.J. Generation of bovine cytotoxic cell lines, specific for cells infected with the protozoan parasite Theileria parva and restricted by products of the major histocompatibility complex. Eur. J. Immunol. 1986, 16, 1243–1249. [Google Scholar] [CrossRef] [PubMed]
  4. McKeever, D.J.; Taracha, E.L.; Innes, E.L.; MacHugh, N.D.; Awino, E.; Goddeeris, B.M.; Morrison, W.I. Adoptive transfer of immunity to Theileria parva in the CD8+ fraction of responding efferent lymph. Proc. Natl. Acad. Sci. USA 1994, 91, 1959–1963. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Taracha, E.L.; Goddeeris, B.; Teale, A.J.; Kemp, S.J.; Morrison, W.I. Parasite strain specificity of bovine cytotoxic T cell responses to Theileria parva is determined primarily by immunodominance. J. Immunol. 1995, 155, 4854–4860. [Google Scholar]
  6. Christensen, J.P.; Marker, O.; Thomsen, A.R. The role of CD4+ T cells in cell-mediated immunity to LCMV: Studies in MHC class I and class II deficient mice. Scand. J. Immunol. 1994, 40, 373–382. [Google Scholar] [CrossRef]
  7. Thomsen, A.R.; Johansen, J.; Marker, O.; Christensen, J. Exhaustion of CTL memory and recrudescence of viremia in lymphocytic choriomeningitis virus-infected MHC class II-deficient mice and B cell-deficient mice. J. Immunol. 1996, 157, 3074–3080. [Google Scholar]
  8. Palmateer, N.C.; Tretina, K.; Orvis, J.; Ifeonu, O.O.; Crabtree, J.; Drabék, E.; Pelle, R.; Awino, E.; Gotia, H.T.; Munro, J.B.; et al. Capture-based enrichment of Theileria parva DNA enables full genome assembly of first buffalo-derived strain and reveals exceptional intra-specific genetic diversity. PLoS Negl. Trop. Dis. 2020, 14, e0008781. [Google Scholar] [CrossRef]
  9. Atuhaire, D.K.; Muleya, W.; Mbao, V.; Bazarusanga, T.; Gafarasi, I.; Salt, J.; Namangala, B.; Musoke, A.J. Sequence diversity of cytotoxic T cell antigens and satellite marker analysis of Theileria parva informs the immunization against East Coast fever in Rwanda. Parasites Vectors 2020, 13, 452. [Google Scholar] [CrossRef]
  10. Hemmink, J.D.; Sitt, T.; Pelle, R.; De Klerk-Lorist, L.-M.; Shiels, B.; Toye, P.G.; Morrison, W.I.; Weir, W. Ancient diversity and geographical sub-structuring in African buffalo Theileria parva populations revealed through metagenetic analysis of antigen-encoding loci. Int. J. Parasitol. 2018, 48, 287–296. [Google Scholar] [CrossRef]
  11. Pelle, R.; Graham, S.P.; Njahira, M.N.; Osaso, J.; Saya, R.M.; Odongo, D.O.; Toye, P.G.; Spooner, P.R.; Musoke, A.J.; Mwangi, D.M.; et al. Two Theileria parva CD8 T cell antigen genes are more variable in buffalo than cattle parasites, but differ in pattern of sequence diversity. PLoS ONE 2011, 6, e19015. [Google Scholar] [CrossRef] [Green Version]
  12. Graham, S.P.; Pellé, R.; Honda, Y.; Mwangi, D.M.; Tonukari, N.J.; Yamage, M.; Glew, E.J.; de Villiers, E.P.; Shah, T.; Bishop, R.; et al. Theileria parva candidate vaccine antigens recognized by immune bovine cytotoxic T lymphocytes. Proc. Natl. Acad. Sci. USA 2006, 103, 3286–3291. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Morrison, W.I.; Aguado, A.; Sheldrake, T.A.; Palmateer, N.C.; Ifeonu, O.O.; Tretina, K.; Parsons, K.; Fenoy, E.; Connelley, T.; Nielsen, M.; et al. CD4 T Cell Responses to Theileria parva in Immune Cattle Recognize a Diverse Set of Parasite Antigens Presented on the Surface of Infected Lymphoblasts. J. Immunol. 2021, 207, 1965–1977. [Google Scholar] [CrossRef] [PubMed]
  14. Tretina, K.; Pelle, R.; Orvis, J.; Gotia, H.T.; Ifeonu, O.O.; Kumari, P.; Palmateer, N.C.; Iqbal, S.B.A.; Fry, L.M.; Nene, V.M.; et al. Re-annotation of the Theileria parva genome refines 53% of the proteome and uncovers essential components of N-glycosylation, a conserved pathway in many organisms. BMC Genom. 2020, 21, 279. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. MacHugh, N.D.; Connelley, T.; Graham, S.; Pelle, R.; Formisano, P.; Taracha, E.L.; Ellis, S.A.; McKeever, D.J.; Burrells, A.; Morrison, W.I. CD8+ T-cell responses to Theileria parva are preferentially directed to a single dominant antigen: Implications for parasite strain-specific immunity. Eur. J. Immunol. 2009, 39, 2459–2469. [Google Scholar] [CrossRef] [Green Version]
  16. Han, K.-C.; Park, D.; Ju, S.; Lee, Y.E.; Heo, S.-H.; Kim, Y.-A.; Lee, J.E.; Lee, Y.; Park, K.H.; Park, S.-H.; et al. Streamlined selection of cancer antigens for vaccine development through integrative multi-omics and high-content cell imaging. Sci. Rep. 2020, 10, 5885. [Google Scholar] [CrossRef] [Green Version]
  17. Newey, A.; Griffiths, B.; Michaux, J.; Pak, H.S.; Stevenson, B.J.; Woolston, A.; Semiannikova, M.; Spain, G.; Barber, L.J.; Matthews, N.; et al. Immunopeptidomics of colorectal cancer organoids reveals a sparse HLA class I neoantigen landscape and no increase in neoantigens with interferon or MEK-inhibitor treatment. J. Immunother. Cancer 2019, 7, 309. [Google Scholar] [CrossRef]
  18. Teck, A.T.; Urban, S.; Quass, P.; Nelde, A.; Schuster, H.; Letsch, A.; Busse, A.; Walz, J.S.; Keilholz, U.; Ochsenreither, S. Cancer testis antigen Cyclin A1 harbors several HLA-A*02:01-restricted T cell epitopes, which are presented and recognized in vivo. Cancer Immunol. Immunother. 2020, 69, 1217–1227. [Google Scholar] [CrossRef]
  19. Mou, Z.; Li, J.; Boussoffara, T.; Kishi, H.; Hamana, H.; Ezzati, P.; Hu, C.; Yi, W.; Liu, D.; Khadem, F.; et al. Identification of broadly conserved cross-species protective Leishmania antigen and its responding CD4+ T cells. Sci. Transl. Med. 2015, 7, 310ra167. [Google Scholar] [CrossRef]
  20. McMurtrey, C.; Trolle, T.; Sansom, T.; Remesh, S.G.; Kaever, T.; Bardet, W.; Jackson, K.; McLeod, R.; Sette, A.; Nielsen, M.; et al. Toxoplasma gondii peptide ligands open the gate of the HLA class I binding groove. eLife 2016, 5, e12556. [Google Scholar] [CrossRef]
  21. Draheim, M.; Wlodarczyk, M.F.; Crozat, K.; Saliou, J.-M.; Alayi, T.D.; Tomavo, S.; Hassan, A.; Salvioni, A.; Demarta-Gatsi, C.; Sidney, J.; et al. Profiling MHC II immunopeptidome of blood-stage malaria reveals that cDC1 control the functionality of parasite-specific CD4 T cells. EMBO Mol. Med. 2017, 9, 1605–1621. [Google Scholar] [CrossRef] [Green Version]
  22. De Wit, J.; Emmelot, M.E.; Meiring, H.; Brink, J.A.M.V.G.-V.D.; Els, C.A.C.M.V.; Kaaijk, P. Identification of Naturally Processed Mumps Virus Epitopes by Mass Spectrometry: Confirmation of Multiple CD8+ T-Cell Responses in Mumps Patients. J. Infect. Dis. 2020, 221, 474–482. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Ramarathinam, S.H.; Gras, S.; Alcantara, S.; Yeung, A.W.; Mifsud, N.A.; Sonza, S.; Illing, P.T.; Glaros, E.N.; Center, R.J.; Thomas, S.R.; et al. Identification of Native and Posttranslationally Modified HLA-B*57:01-Restricted HIV Envelope Derived Epitopes Using Immunoproteomics. Proteomics 2018, 18, 1700253. [Google Scholar] [CrossRef]
  24. Ternette, N.; Yang, H.; Partridge, T.; Llano, A.; Cedeño, S.; Fischer, R.; Charles, P.D.; Dudek, N.L.; Mothe, B.; Crespo, M.; et al. Defining the HLA class I-associated viral antigen repertoire from HIV-1-infected human cells. Eur. J. Immunol. 2016, 46, 60–69. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Bettencourt, P.; Müller, J.; Nicastri, A.; Cantillon, D.; Madhavan, M.; Charles, P.D.; Fotso, C.B.; Wittenberg, R.; Bull, N.; Pinpathomrat, N.; et al. Identification of antigens presented by MHC for vaccines against tuberculosis. NPJ Vaccines 2020, 5, 2. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Ellis, S.A.; Staines, K.A.; Stear, M.J.; Hensen, E.J.; Morrison, W.I. DNA typing for BoLA class I using sequence-specific primers (PCR-SSP). Eur. J. Immunogenet. 1998, 25, 365–370. [Google Scholar] [CrossRef]
  27. Baxter, R.; Hastings, N.; Law, A.; Glass, E.J. A rapid and robust sequence-based genotyping method for BoLA-DRB3 alleles in large numbers of heterozygous cattle. Anim. Genet. 2008, 39, 561–563. [Google Scholar] [CrossRef]
  28. Vasoya, D.; Law, A.; Motta, P.; Yu, M.; Muwonge, A.; Cook, E.; Li, X.; Bryson, K.; MacCallam, A.; Sitt, T.; et al. Rapid identification of bovine MHCI haplotypes in genetically divergent cattle populations using next-generation sequencing. Immunogenetics 2016, 68, 765–781. [Google Scholar] [CrossRef] [Green Version]
  29. Goddeeris, B.M.; Morrison, W.I. Techniques for the generation, cloning, and characterization of bovine cytotoxic T cells specific for the protozoan Theileria parva. J. Tissue Cult. Methods 1988, 11, 101–110. [Google Scholar] [CrossRef]
  30. Purcell, A.W.; Ramarathinam, S.H.; Ternette, N. Mass spectrometry–based identification of MHC-bound peptides for immunopeptidomics. Nat. Protoc. 2019, 14, 1687–1707. [Google Scholar] [CrossRef]
  31. Perez-Riverol, Y.; Csordas, A.; Bai, J.; Bernal-Llinares, M.; Hewapathirana, S.; Kundu, D.J.; Inuganti, A.; Griss, J.; Mayer, G.; Eisenacher, M.; et al. The PRIDE database and related tools and resources in 2019: Improving support for quantification data. Nucleic Acids Res. 2019, 47, D442–D450. [Google Scholar] [CrossRef]
  32. Fisch, A.; Reynisson, B.; Benedictus, L.; Nicastri, A.; Vasoya, D.; Morrison, W.I.; Buus, S.; Ferreira, B.R.; de Miranda Santos, I.K.; Ternette, N.; et al. Integral Use of Immunopeptidomics and Immunoinformatics for the Characterization of Antigen Presentation and Rational Identification of BoLA-DR-Presented Peptides and Epitopes. J. Immunol. 2021, 206, 2489–2497. [Google Scholar] [CrossRef] [PubMed]
  33. Reynisson, B.; Alvarez, B.; Paul, S.; Peters, B.; Nielsen, M. NetMHCpan-4.1 and NetMHCIIpan-4.0: Improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data. Nucleic Acids Res. 2020, 48, W449–W454. [Google Scholar] [CrossRef] [PubMed]
  34. Svitek, N.; Hansen, A.M.; Steinaa, L.; Saya, R.; Awino, E.; Nielsen, M.; Buus, S.; Nene, V. Use of “one-pot, mix-and-read” peptide-MHC class I tetramers and predictive algorithms to improve detection of cytotoxic T lymphocyte responses in cattle. Vet. Res. 2014, 45, 50. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Hansen, A.M.; Rasmussen, M.; Svitek, N.; Harndahl, M.; Golde, W.T.; Barlow, J.; Nene, V.; Buus, S.; Nielsen, M. Characterization of binding specificities of bovine leucocyte class I molecules: Impacts for rational epitope discovery. Immunogenetics 2014, 66, 705–718. [Google Scholar] [CrossRef] [Green Version]
  36. Harndahl, M.; Rasmussen, M.; Roder, G.; Buus, S. Real-time, high-throughput measurements of peptide–MHC-I dissociation using a scintillation proximity assay. J. Immunol. Methods 2011, 374, 5–12. [Google Scholar] [CrossRef] [Green Version]
  37. Codner, G.F.; Stear, M.J.; Reeve, R.; Matthews, L.; Ellis, S.A. Selective forces shaping diversity in the class I region of the major histocompatibility complex in dairy cattle. Anim. Genet. 2012, 43, 239–249. [Google Scholar] [CrossRef]
  38. Nielsen, M.; Connelley, T.; Ternette, N. Improved Prediction of Bovine Leucocyte Antigens (BoLA) Presented Ligands by Use of Mass-Spectrometry-Determined Ligand and in Vitro Binding Data. J. Proteome Res. 2018, 17, 559–567. [Google Scholar] [CrossRef] [Green Version]
  39. Andersson, L.; Rask, L. Characterization of the MHC class II region in cattle. The number of DQ genes varies between haplotypes. Immunogenetics 1988, 27, 110–120. [Google Scholar] [CrossRef]
  40. Zhou, H.; Hickford, J.; Fang, Q.; Byun, S. Short Communication: Identification of Allelic Variation at the Bovine DRA Locus by Polymerase Chain Reaction-Single Strand Conformational Polymorphism. J. Dairy Sci. 2007, 90, 1943–1946. [Google Scholar] [CrossRef] [Green Version]
  41. Burke, M.G.; Stone, R.T.; Muggli-Cockett, N.E. Nucleotide sequence and northern analysis of a bovine major histocompatibility class II DR beta-like cDNA. Anim. Genet. 1991, 22, 343–352. [Google Scholar] [CrossRef]
  42. Mayer, R.L.; Impens, F. Immunopeptidomics for next-generation bacterial vaccine development. Trends Microbiol. 2021, 29, 1034–1045. [Google Scholar] [CrossRef]
  43. Bettencourt, P. Current Challenges in the Identification of Pre-Erythrocytic Malaria Vaccine Candidate Antigens. Front. Immunol. 2020, 11, 190. [Google Scholar] [CrossRef] [PubMed]
  44. Graham, S.P.; Pellé, R.; Yamage, M.; Mwangi, D.M.; Honda, Y.; Mwakubambanya, R.S.; de Villiers, E.P.; Abuya, E.; Awino, E.; Gachanja, J.; et al. Characterization of the fine specificity of bovine CD8 T-cell responses to defined antigens from the protozoan parasite Theileria parva. Infect. Immun. 2008, 76, 685–694. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Kar, P.P.; Srivastava, A. Immuno-informatics Analysis to Identify Novel Vaccine Candidates and Design of a Multi-Epitope Based Vaccine Candidate Against Theileria parasites. Front. Immunol. 2018, 9, 2213. [Google Scholar] [CrossRef]
  46. Karunakaran, K.P.; Yu, H.; Jiang, X.; Chan, Q.W.T.; Foster, L.J.; Johnson, R.M.; Brunham, R.C. Discordance in the Epithelial Cell-Dendritic Cell Major Histocompatibility Complex Class II Immunoproteome: Implications for Chlamydia Vaccine Development. J. Infect. Dis. 2020, 221, 841–850. [Google Scholar] [CrossRef]
  47. Alvarez, B.; Reynisson, B.; Barra, C.; Buus, S.; Ternette, N.; Connelley, T.; Andreatta, M.; Nielsen, M. NNAlign_MA.; MHC Peptidome Deconvolution for Accurate MHC Binding Motif Characterization and Improved T-cell Epitope Predictions. Mol. Cell. Proteom. 2019, 18, 2459–2477. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  48. Obara, I.; Nielsen, M.; Jeschek, M.; Nijhof, A.; Mazzoni, C.J.; Svitek, N.; Steinaa, L.; Awino, E.; Olds, C.; Jabbar, A.; et al. Sequence diversity between class I MHC loci of African native and introduced Bos taurus cattle in Theileria parva endemic regions: In silico peptide binding prediction identifies distinct functional clusters. Immunogenetics 2016, 68, 339–352. [Google Scholar] [CrossRef] [Green Version]
  49. Vasoya, D.; Oliveira, P.S.; Muriel, L.A.; Tzelos, T.; Vrettou, C.; Morrison, W.I.; Santos, I.K.F.M.; Connelley, T. High throughput analysis of MHC-I and MHC-DR diversity of Brazilian cattle populations. HLA 2021, 98, 93–113. [Google Scholar] [CrossRef] [PubMed]
  50. Partridge, T.; Nicastri, A.; Kliszczak, A.E.; Yindom, L.-M.; Kessler, B.; Ternette, N.; Borrow, P. Discrimination Between Human Leukocyte Antigen Class I-Bound and Co-Purified HIV-Derived Peptides in Immunopeptidomics Workflows. Front. Immunol. 2018, 9, 912. [Google Scholar] [CrossRef] [Green Version]
  51. Schneider, I.; Haller, D.; Kullmann, B.; Beyer, D.; Ahmed, J.S.; Seitzer, U. Identification, molecular characterization and subcellular localization of a Theileria annulata parasite protein secreted into the host cell cytoplasm. Parasitol. Res. 2007, 101, 1471–1482. [Google Scholar] [CrossRef]
  52. Tajeri, S.; Langsley, G. Theileria secretes proteins to subvert its host leukocyte. Biol. Cell 2021, 113, 220–233. [Google Scholar] [CrossRef] [PubMed]
  53. Woods, K.; Perry, C.; Brühlmann, F.; Olias, P. Theileria’s Strategies and Effector Mechanisms for Host Cell Transformation: From Invasion to Immortalization. Front. Cell Dev. Biol. 2021, 9, 662805. [Google Scholar] [CrossRef]
  54. Von Schubert, C.; Xue, G.; Schmuckli-Maurer, J.; Woods, K.L.; Nigg, E.A.; Dobbelaere, D.A.E. The Transforming Parasite Theileria Co-opts Host Cell Mitotic and Central Spindles to Persist in Continuously Dividing Cells. PLoS Biol. 2010, 8, e1000499. [Google Scholar] [CrossRef] [Green Version]
  55. Bishop, R.; Shah, T.; Pelle, R.; Hoyle, D.; Pearson, T.; Haines, L.; Brass, A.; Hulme, H.; Graham, S.P.; Taracha, E.L.; et al. Analysis of the transcriptome of the protozoan Theileria parva using MPSS reveals that the majority of genes are transcriptionally active in the schizont stage. Nucleic Acids Res. 2005, 33, 5503–5511. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  56. Witschi, M.; Xia, D.; Sanderson, S.; Baumgartner, M.; Wastling, J.; Dobbelaere, D. Proteomic analysis of the Theileria annulata schizont. Int. J. Parasitol. 2013, 43, 173–180. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  57. Wickström, S.L.; Lövgren, T.; Volkmar, M.; Reinhold, B.; Duke-Cohan, J.S.; Hartmann, L.; Rebmann, J.; Mueller, A.; Melief, J.; Maas, R.; et al. Cancer Neoepitopes for Immunotherapy: Discordance Between Tumor-Infiltrating T Cell Reactivity and Tumor MHC Peptidome Display. Front. Immunol. 2019, 10, 2766. [Google Scholar] [CrossRef]
  58. Bosch-Camós, L.; López, E.; Navas, M.J.; Pina-Pedrero, S.; Accensi, F.; Correa-Fiz, F.; Park, C.; Carrascal, M.; Domínguez, J.; Salas, M.L. Identification of Promiscuous African Swine Fever Virus T-Cell Determinants Using a Multiple Technical Approach. Vaccines 2021, 9, 29. [Google Scholar] [CrossRef]
  59. Yewdell, J.W.; Bennink, J.R. Immunodominance in major histocompatibility complex class I-restricted T lymphocyte responses. Annu. Rev. Immunol. 1999, 17, 51–88. [Google Scholar] [CrossRef]
  60. Kastenmuller, W.; Gasteiger, G.; Gronau, J.H.; Baier, R.; Ljapoci, R.; Busch, D.H.; Drexler, I. Cross-competition of CD8+ T cells shapes the immunodominance hierarchy during boost vaccination. J. Exp. Med. 2007, 204, 2187–2198. [Google Scholar] [CrossRef] [Green Version]
  61. Dominguez, M.; Silveira, E.L.V.; De Vasconcelos, J.R.C.; De Alencar, B.C.G.; Machado, A.V.; Bruña-Romero, O.; Gazzinelli, R.T.; Rodrigues, M.M. Subdominant/Cryptic CD8 T Cell Epitopes Contribute to Resistance against Experimental Infection with a Human Protozoan Parasite. PLoS ONE 2011, 6, e22011. [Google Scholar] [CrossRef] [Green Version]
  62. Aagaard, C.S.; Hoang, T.T.K.T.; Vingsbo-Lundberg, C.; Dietrich, J.; Andersen, P. Quality and Vaccine Efficacy of CD4+T Cell Responses Directed to Dominant and Subdominant Epitopes in ESAT-6 from Mycobacterium tuberculosis. J. Immunol. 2009, 183, 2659–2668. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  63. Olsen, A.W.; Hansen, P.R.; Holm, A.; Andersen, P. Efficient protection against Mycobacterium tuberculosis by vaccination with a single subdominant epitope from the ESAT-6 antigen. Eur. J. Immunol. 2000, 30, 1724–1732. [Google Scholar] [CrossRef]
  64. Frahm, N.; Kiepiela, P.; Adams, S.; Linde, C.H.; Hewitt, H.S.; Sango, K.; E Feeney, M.; Addo, M.; Lichterfeld, M.; Lahaie, M.P.; et al. Control of human immunodeficiency virus replication by cytotoxic T lymphocytes targeting subdominant epitopes. Nat. Immunol. 2006, 7, 173–178. [Google Scholar] [CrossRef]
  65. Holtappels, R.; Simon, C.O.; Munks, M.W.; Thomas, D.; Deegen, P.; Kuhnapfel, B.; Daubner, T.; Emde, S.F.; Podlech, J.; Grzimek, N.K.A.; et al. Subdominant CD8 T-Cell Epitopes Account for Protection against Cytomegalovirus Independent of Immunodomination. J. Virol. 2008, 82, 5781–5796. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  66. Liu, J.; Haddad, E.K.; Marceau, J.; Morabito, K.M.; Rao, S.S.; Filali-Mouhim, A.; Sekaly, R.P.; Graham, B.S. A Numerically Subdominant CD8 T Cell Response to Matrix Protein of Respiratory Syncytial Virus Controls Infection with Limited Immunopathology. PLoS Pathog. 2016, 12, e1005486. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  67. Im, E.-J.; Hong, J.P.; Roshorm, Y.; Bridgeman, A.; Létourneau, S.; Liljeström, P.; Potash, M.J.; Volsky, D.J.; McMichael, A.J.; Hanke, T. Protective Efficacy of Serially Up-Ranked Subdominant CD8+ T Cell Epitopes against Virus Challenges. PLoS Pathog. 2011, 7, e1002041. [Google Scholar] [CrossRef] [Green Version]
  68. Mok, H.; Lee, S.; Wright, D.W.; Crowe, J.E., Jr. Enhancement of the CD8+ T cell response to a subdominant epitope of respiratory syncytial virus by deletion of an immunodominant epitope. Vaccine 2008, 26, 4775–4782. [Google Scholar] [CrossRef] [Green Version]
  69. Riedl, P.; Wieland, A.; Lamberth, K.; Buus, S.; Lemonnier, F.; Reifenberg, K.; Reimann, J.; Schirmbeck, R. Elimination of immunodominant epitopes from multispecific DNA-based vaccines allows induction of CD8 T cells that have a striking antiviral potential. J. Immunol. 2009, 183, 370–380. [Google Scholar] [CrossRef] [Green Version]
  70. Wieland, A.; Riedl, P.; Reimann, J.; Schirmbeck, R. Silencing an immunodominant epitope of hepatitis B surface antigen reveals an alternative repertoire of CD8 T cell epitopes of this viral antigen. Vaccine 2009, 28, 114–119. [Google Scholar] [CrossRef]
  71. Woodworth, J.S.; Aagaard, C.S.; Hansen, P.R.; Cassidy, J.P.; Agger, E.M.; Andersen, P. Protective CD4 T Cells Targeting Cryptic Epitopes of Mycobacterium tuberculosis Resist Infection-Driven Terminal Differentiation. J. Immunol. 2014, 192, 3247–3258. [Google Scholar] [CrossRef]
Figure 1. Characteristics of BoLA-I-eluted peptides from 1011TP, 2229TP, and 641TP cell lines. (A) Number of B. taurus and T. parva peptide sequences identified in each sample. (B) Spectra obtained from the experiment (bottom), and synthetic library (top), respectively, for two example peptides that were mapped to the T. parva proteome. The most abundant, annotated b- (N-terminal) and y- (C-terminal) fragment ions are indicated together with the measured mass over charge ratio. (C) Binding predictions of B. taurus and T. parva 8–12 mer peptides stratified by their predicted BoLA-I allele of origin (peptides with a rank predicted binding score of <2% to a BoLA-I allele were considered to be binders). Peptides predicted to be MHC-binders are represented by coloured blocks, peptides predicted to be non-MHC binders are represented by grey blocks; the size of the blocks is proportional to the number of peptides in the respective datasets. (D) Length distributions of B. taurus and T. parva peptides were identified in each sample. The horizontal axis shows the length of peptides, and the vertical axis shows the proportion of the peptides identified for each length. (E) A summary of results from a subset of peptides assayed in an in vitro BoLA-I binding assay is presented in Table 2. For each BoLA allele and for all of the BoLA alleles combined (Total), the number of peptides for which the in vitro assay corroborated the in silico predicted capacity to bind to BoLA-I are shown as filled bars, whilst peptides for which the results from the in vitro binding did not support the in silico prediction are shown as hatched bars, as described in the legend.
Figure 1. Characteristics of BoLA-I-eluted peptides from 1011TP, 2229TP, and 641TP cell lines. (A) Number of B. taurus and T. parva peptide sequences identified in each sample. (B) Spectra obtained from the experiment (bottom), and synthetic library (top), respectively, for two example peptides that were mapped to the T. parva proteome. The most abundant, annotated b- (N-terminal) and y- (C-terminal) fragment ions are indicated together with the measured mass over charge ratio. (C) Binding predictions of B. taurus and T. parva 8–12 mer peptides stratified by their predicted BoLA-I allele of origin (peptides with a rank predicted binding score of <2% to a BoLA-I allele were considered to be binders). Peptides predicted to be MHC-binders are represented by coloured blocks, peptides predicted to be non-MHC binders are represented by grey blocks; the size of the blocks is proportional to the number of peptides in the respective datasets. (D) Length distributions of B. taurus and T. parva peptides were identified in each sample. The horizontal axis shows the length of peptides, and the vertical axis shows the proportion of the peptides identified for each length. (E) A summary of results from a subset of peptides assayed in an in vitro BoLA-I binding assay is presented in Table 2. For each BoLA allele and for all of the BoLA alleles combined (Total), the number of peptides for which the in vitro assay corroborated the in silico predicted capacity to bind to BoLA-I are shown as filled bars, whilst peptides for which the results from the in vitro binding did not support the in silico prediction are shown as hatched bars, as described in the legend.
Vaccines 10 01907 g001
Figure 2. Characteristics of BoLA-I-eluted peptides from 5072TP/5072TP_rpt (A10), 2123TP/2123TP_rpt (A12/A15), 2408TP (A15), 2824TP/2824TP_rpt (A19), 641TP_rpt (A18), and 5350TP/5350TP_rpt (A20) cell lines. (A) Number of B. taurus and T. parva peptide sequences identified in each sample. (B) Length distributions of B. taurus and T. parva peptides identified in each sample. The horizontal axis shows the length of peptides, and the vertical axis shows the proportion of the peptides identified for each length. For samples that had been analysed in two independent elutions, the data has been amalgamated into a single panel. (C) Binding predictions of B. taurus and T. parva 8–12-mer peptides stratified by their predicted BoLA-I allele of origin (peptides with a rank predicted binding score of <2% to a BoLA-I allele were considered to be binders). Peptides predicted to be MHC-binders are represented by coloured blocks, peptides predicted to be non-MHC binders are represented by grey blocks; the size of the blocks is proportional to the number of peptides in the respective datasets.
Figure 2. Characteristics of BoLA-I-eluted peptides from 5072TP/5072TP_rpt (A10), 2123TP/2123TP_rpt (A12/A15), 2408TP (A15), 2824TP/2824TP_rpt (A19), 641TP_rpt (A18), and 5350TP/5350TP_rpt (A20) cell lines. (A) Number of B. taurus and T. parva peptide sequences identified in each sample. (B) Length distributions of B. taurus and T. parva peptides identified in each sample. The horizontal axis shows the length of peptides, and the vertical axis shows the proportion of the peptides identified for each length. For samples that had been analysed in two independent elutions, the data has been amalgamated into a single panel. (C) Binding predictions of B. taurus and T. parva 8–12-mer peptides stratified by their predicted BoLA-I allele of origin (peptides with a rank predicted binding score of <2% to a BoLA-I allele were considered to be binders). Peptides predicted to be MHC-binders are represented by coloured blocks, peptides predicted to be non-MHC binders are represented by grey blocks; the size of the blocks is proportional to the number of peptides in the respective datasets.
Vaccines 10 01907 g002
Figure 3. Alignment of peptides identified from (A) TpMuguga_03g00428 (ribosomal protein S28-B40S) and (B) TpMuguga_02g00611 (histone H2A variant 1) in BoLA-I elution datasets. (A) The 19 peptides identified from 8 different samples are shown aligned against the parent protein. Fourteen of the peptides (from 6 different samples representing 4 different MHCI haplotypes) are found as overlapping peptides in the 1–14 region. The median rank-predicted binding score of these peptides is 43.8% (range = 2.96–64.55%), with none predicted to be BoLA-I binders. The mean length of the peptides was 13 a.a. (B) The 8 peptides identified from 4 different samples (representing 3 different MHCI haplotypes) are shown aligned against the parent protein. All 8 peptides are overlapping peptides in the 121–137 region. The median rank-predicted score of these peptides is 50.1% (range = 32.2–100%), with none predicted to be BoLA-I binders. None of the peptides were of the 8–12 a.a. length of canonical MHCI-binding peptides.
Figure 3. Alignment of peptides identified from (A) TpMuguga_03g00428 (ribosomal protein S28-B40S) and (B) TpMuguga_02g00611 (histone H2A variant 1) in BoLA-I elution datasets. (A) The 19 peptides identified from 8 different samples are shown aligned against the parent protein. Fourteen of the peptides (from 6 different samples representing 4 different MHCI haplotypes) are found as overlapping peptides in the 1–14 region. The median rank-predicted binding score of these peptides is 43.8% (range = 2.96–64.55%), with none predicted to be BoLA-I binders. The mean length of the peptides was 13 a.a. (B) The 8 peptides identified from 4 different samples (representing 3 different MHCI haplotypes) are shown aligned against the parent protein. All 8 peptides are overlapping peptides in the 121–137 region. The median rank-predicted score of these peptides is 50.1% (range = 32.2–100%), with none predicted to be BoLA-I binders. None of the peptides were of the 8–12 a.a. length of canonical MHCI-binding peptides.
Vaccines 10 01907 g003
Figure 4. The distribution of predicted percent rank binding scores for peptides considered to be co-precipitants and peptides retained in the combined BoLA-I dataset after removal of the coprecipitating peptides. Peptides with a predicted percent rank binding score of <2% are considered to be binders. A small number of short peptides (<8 amino acids long, n = 8) that did not receive a predicted percent rank binding score were ascribed a default value of 100%.
Figure 4. The distribution of predicted percent rank binding scores for peptides considered to be co-precipitants and peptides retained in the combined BoLA-I dataset after removal of the coprecipitating peptides. Peptides with a predicted percent rank binding score of <2% are considered to be binders. A small number of short peptides (<8 amino acids long, n = 8) that did not receive a predicted percent rank binding score were ascribed a default value of 100%.
Vaccines 10 01907 g004
Figure 5. Overlap between the peptidomes identified from duplicate analysis of BoLA-A18, BoLA-A19, and BoLA-A20 samples and from samples sharing the BoLA-A10 and BoLA-A15 haplotypes. Euler diagrams displaying the overlap in the T. parva (left) and total (right) peptidomes of duplicated samples or samples sharing BoLA-I haplotypes. The number of peptides that are unique to each sample and shared between samples is indicated. The peptides identified in replicate samples were: BoLA-A18—RMDDKSGGLL from TpMuguga_01g00736 (hypothetical protein) and GEFEKKYIPTL from TpMuguga_01g00757 (Ras family protein); BoLA-A10—AGVELDTQKKFL from TpMuguga_03g00858 (multiprotein-bridging factor 1c); BoLA-A15—FEYEFPINH from TpMuguga_01g00471 (bifunctional thioredoxin reductase/thioredoxin); and EEIAHVLHY from TpMuguga_01g02030 (hypothetical protein). Note that in the BoLA-A15 group, there were no T. parva peptides identified in sample 2123TP and so this sample is not represented in the T. parva euler diagram.
Figure 5. Overlap between the peptidomes identified from duplicate analysis of BoLA-A18, BoLA-A19, and BoLA-A20 samples and from samples sharing the BoLA-A10 and BoLA-A15 haplotypes. Euler diagrams displaying the overlap in the T. parva (left) and total (right) peptidomes of duplicated samples or samples sharing BoLA-I haplotypes. The number of peptides that are unique to each sample and shared between samples is indicated. The peptides identified in replicate samples were: BoLA-A18—RMDDKSGGLL from TpMuguga_01g00736 (hypothetical protein) and GEFEKKYIPTL from TpMuguga_01g00757 (Ras family protein); BoLA-A10—AGVELDTQKKFL from TpMuguga_03g00858 (multiprotein-bridging factor 1c); BoLA-A15—FEYEFPINH from TpMuguga_01g00471 (bifunctional thioredoxin reductase/thioredoxin); and EEIAHVLHY from TpMuguga_01g02030 (hypothetical protein). Note that in the BoLA-A15 group, there were no T. parva peptides identified in sample 2123TP and so this sample is not represented in the T. parva euler diagram.
Vaccines 10 01907 g005
Figure 6. Characteristics of BoLA-DR eluted peptides from 2123TP (BoLA-DR 15:01/11:01), 2824TP (BoLA-DR 16:01), 5072TP (BoLA-DR 10:01), 641TP (BoLA-DR 20:02), 495TP (BoLA-DR 10:01/11:01), and 5350TP (BoLA-DR 12:01) cell lines. (A) Number of B. taurus and T. parva peptide sequences identified in each sample. (B) Length distributions of B. taurus and T. parva peptides identified in each sample. The horizontal axis shows the length of peptides, and the vertical axis shows the proportion of the peptides identified for each length. (C) Binding predictions of B. taurus and T. parva 13–21-mer peptides stratified by their predicted BoLA-DR allele of origin (peptides with a rank-predicted binding score of <5% to a BoLA-DR allele were considered to be binders). Peptides predicted to be MHC-binders are represented by coloured blocks, peptides predicted to be non-MHC binders are represented by grey blocks; the size of the blocks is proportional to the number of peptides in the respective datasets.
Figure 6. Characteristics of BoLA-DR eluted peptides from 2123TP (BoLA-DR 15:01/11:01), 2824TP (BoLA-DR 16:01), 5072TP (BoLA-DR 10:01), 641TP (BoLA-DR 20:02), 495TP (BoLA-DR 10:01/11:01), and 5350TP (BoLA-DR 12:01) cell lines. (A) Number of B. taurus and T. parva peptide sequences identified in each sample. (B) Length distributions of B. taurus and T. parva peptides identified in each sample. The horizontal axis shows the length of peptides, and the vertical axis shows the proportion of the peptides identified for each length. (C) Binding predictions of B. taurus and T. parva 13–21-mer peptides stratified by their predicted BoLA-DR allele of origin (peptides with a rank-predicted binding score of <5% to a BoLA-DR allele were considered to be binders). Peptides predicted to be MHC-binders are represented by coloured blocks, peptides predicted to be non-MHC binders are represented by grey blocks; the size of the blocks is proportional to the number of peptides in the respective datasets.
Vaccines 10 01907 g006
Figure 7. Coprecipitating peptides in the BoLA-MHCI and BoLA-DR datasets. (A) A scatterplot showing the level of correlation between the proteins from which coprecipitating peptides were derived in the BoLA-I and BoLA-DR datasets. Each point represents a single protein, and the number of coprecipitating peptides from individual proteins in the BoLA-MHCI and BoLA-DR datasets is shown on the horizontal and vertical axis, respectively. The R2 value calculated from the Pearson’s coefficient of correlation is shown. (B) The distribution of percent rank prediction scores for peptides considered to be coprecipitants and peptides retained after removal of the coprecipitant peptides in the combined BoLA-DR dataset. Peptides with a percent rank prediction score of <5% are considered to be binders. A small number of short peptides (<9 amino acids long, n = 26) that did not receive a percent rank prediction score were ascribed a default value of 100%.
Figure 7. Coprecipitating peptides in the BoLA-MHCI and BoLA-DR datasets. (A) A scatterplot showing the level of correlation between the proteins from which coprecipitating peptides were derived in the BoLA-I and BoLA-DR datasets. Each point represents a single protein, and the number of coprecipitating peptides from individual proteins in the BoLA-MHCI and BoLA-DR datasets is shown on the horizontal and vertical axis, respectively. The R2 value calculated from the Pearson’s coefficient of correlation is shown. (B) The distribution of percent rank prediction scores for peptides considered to be coprecipitants and peptides retained after removal of the coprecipitant peptides in the combined BoLA-DR dataset. Peptides with a percent rank prediction score of <5% are considered to be binders. A small number of short peptides (<9 amino acids long, n = 26) that did not receive a percent rank prediction score were ascribed a default value of 100%.
Vaccines 10 01907 g007
Table 1. T. parva peptides derived from 1011TP, 2229TP, and 641TP cell lines. For each peptide, the amino acid sequence (column A), −10 lgP score (B), peptide length (C), sample (D), number of spectra (E), BoLA-I allele to which the peptide has the highest percent rank binding score (F), core peptide sequence (G), percent rank binding prediction score (H), accession number of the source protein (I), description of the source protein (J), the peptide location within the source protein (K and L), and the results of spectral matching of synthetic peptides (M—identified as either positive (matching), negative (not-matching), or ND (not tested)) are shown.
Table 1. T. parva peptides derived from 1011TP, 2229TP, and 641TP cell lines. For each peptide, the amino acid sequence (column A), −10 lgP score (B), peptide length (C), sample (D), number of spectra (E), BoLA-I allele to which the peptide has the highest percent rank binding score (F), core peptide sequence (G), percent rank binding prediction score (H), accession number of the source protein (I), description of the source protein (J), the peptide location within the source protein (K and L), and the results of spectral matching of synthetic peptides (M—identified as either positive (matching), negative (not-matching), or ND (not tested)) are shown.
Peptide−10lgPLengthSample#SpecAlleleIcore%RankAccessionProtein DescriptionPeptide Location StartPeptide Location StopSpectral Match
VANTKIEFPEI31.711641TP46:01301VANTKIEFPEI7.15TpMuguga_01g00263hypothetical protein23712381ND
FSVPNQVKAAKVEATIPSHLEKKVITNKKN73.1301011TP13:00201FSVPNQVKAAK92.00TpMuguga_01g0029360S ribosomal protein L38, putative5079ND
QAYQQKVDL21.392229TP12:02501QAYQQKVDL0.45TpMuguga_01g00386hypothetical protein15881596ND
SSISSSLLSVK23.7111011TP22:01201SSISSSLLSVK0.05TpMuguga_01g00421hypothetical protein, conserved365375ND
RMYGKGKGISSSSIP46.815641TP36:01301RMYGKGKGISSSSI19.53TpMuguga_01g0050240S ribosomal protein S13, putative317ND
SEKDSYLSSIKKLNL59.115641TP16:01301SEKDSYLSSIKKLNL3.93TpMuguga_01g00946ribosome biogenesis regulatory protein, putative221235ND
GKFKNPTKTHKMDEVSESSLQ28.6212229TP11:02301GKFKNPTKTH90.00TpMuguga_01g01108hypothetical protein222ND
GKFKNPTKTHKMDEVSESSLQQK54.9232229TP11:02301GKFKNPTKTH91.67TpMuguga_01g01108hypothetical protein224ND
LNIGRIELIDYI27.812641TP66:01301LNIGRIELIDYI26.37TpMuguga_01g01149hypothetical protein914925ND
RALFDDLHRY22.9102229TP14:02401RALFDDLHRY0.28TpMuguga_01g01198hypothetical protein4655ND
ILGFDFNKLGKI2812641TP26:01301ILGFDFNKLGKI7.89TpMuguga_02g00219hypothetical protein495506ND
SLQLKFAQGSDLPNL5115641TP16:01301SLQLKFAQGSDLPNL5.77TpMuguga_02g00321hypothetical protein188202ND
RVRKLCEYAI22.210641TP36:01301RVRKLCEYAI2.40TpMuguga_02g00476crooked neck protein, putative499508ND
ERLAALALYGDYGEFDRKTKEDSK64.6242229TP16:04001ERLAALALY95.00TpMuguga_02g00758hypothetical protein552575ND
ERLAALALYGDYGEFD6116641TP16:01301ERLAALALYGDYGEF100.0TpMuguga_02g00758hypothetical protein552567ND
DRKLFSTKRPSLSL20.4142229TP16:04001RKLFSTKRPSLSL24.46TpMuguga_02g0080360S ribosomal protein L18, putative178191ND
QPSYLSQAL26.791011TP13:00201QPSYLSQAL7.04TpMuguga_03g00202DNA polymerase alpha, putative159167ND
SKVDRVSL23.682229TP61:02301SKVDRVSL0.09TpMuguga_03g00507tRNA nucleotidyltransferase (putative)495502ND
SMKGKHELTL22.110641TP36:01301SMKGKHELTL0.01TpMuguga_03g00747ATP-dependent RNA helicase, putative22232232ND
SSIDVNVKL21.191011TP13:00201SSIDVNVKL0.09TpMuguga_03g02030N-terminal region of Chorein, a TM vesicle-mediated sorter family protein35103518ND
KQVVRDAMVEQDML3214641TP16:01301KQVVRDAMVEQDML1.58TpMuguga_03g02350RF-1 domain protein534547ND
LELIRARNEI23.610641TP16:01301LELIRARNEI6.82TpMuguga_04g0003126S proteasome aaa-ATPase subunit Rpt3, putative4756ND
SPDQPDQHHQPTPAAQP43.3171011TP13:00201SPDQPDQHHQPTPAAQP61.79TpMuguga_04g00051polymorphic immunodominant molecule228244ND
FRNEKDLGF29.19641TP76:01301FRNEKDLGF7.44TpMuguga_04g00227hypothetical protein14031411ND
VKKRVHKGKKKARSETYSTYIF28.5221011TP13:00201KARSETYSTYIF96.88TpMuguga_04g00404histone H2B-III, putative223ND
ENKLVEEALK34.410641TP26:01301ENKLVEEAL50.94TpMuguga_04g00505hypothetical protein99108ND
KLLYVLKPFI22.110641TP26:01301KLLYVLKPFI4.10TpMuguga_04g00611hypothetical protein234243ND
LSPIDILDVAGLVT21.1141011TP23:00201LSPIDILDVAGLVT61.30TpMuguga_04g00621DNA repair exonuclease, putative401414ND
IGSAIKDNPAFITL46.3142229TP26:04001IGSAIKDNPAFITL1.66TpMuguga_01g00188prohibitin, putative231244Positive
RMDDKSGGLL42.710641TP16:01301RMDDKSGGLL1.20TpMuguga_01g00736hypothetical protein2029Positive
GEFEKKYIPTL50.711641TP16:01301GEFEKKYIPTL0.15TpMuguga_01g00757GTP-binding nuclear protein ran, putative3040Positive
VQHIPVDDFSGLQTEVVANE63.5201011TP63:00201VQHIPVDDFSGLQTEV97.35TpMuguga_01g0092460S ribosomal protein L31, putative99118Positive
VQHIPVDDFSGLQTEVVANE57.3202229TP36:04001VQHIPVDDFSGL95.77TpMuguga_01g0092460S ribosomal protein L31, putative99118Positive
RQMQVKLNLP23.810641TP26:01301RQMQVKLNL0.79TpMuguga_01g0098040S ribosomal protein S26e, putative101110Positive
TQYERIKERL28.810641TP16:01301TQYERIKERL0.01TpMuguga_01g01210hypothetical protein211Positive
GQIQDVFKRL33.110641TP16:01301GQIQDVFKRL0.25TpMuguga_02g00123RNA helicase-1, putative189198Positive
SKDEHKKLY39.592229TP31:02301SKDEHKKLY0.07TpMuguga_02g00142hypothetical protein9199Positive
ATIIGFHK2881011TP12:01201ATIIGFHK0.11TpMuguga_02g0022240S ribosomal protein S29, putative4552Positive
SLKSALIDT21.59641TP16:01301SLKSALIDT22.80TpMuguga_02g00333translation initiation factor 6, putative239247Positive
SLKSALIDTLI25.511641TP46:01301SLKSALIDTLI2.47TpMuguga_02g00333translation initiation factor 6, putative239249Positive
EIKERLAALAL35.5111011TP53:00201EIKERLAALAL14.51TpMuguga_02g00758hypothetical protein549559Positive
EIKERLAALAL27112229TP26:04001EIKERLAALAL14.92TpMuguga_02g00758hypothetical protein549559Positive
EIKERLAALALYGDYGEFDRKT66.9221011TP33:00201EIKERLAALAL98.82TpMuguga_02g00758hypothetical protein549570Positive
EIKERLAALALYGDYGEFDRKT73.5222229TP36:04001EIKERLAALALALY95.00TpMuguga_02g00758hypothetical protein549570Positive
EIKERLAALALYGDYGEFDRKT72.222641TP46:01301EIKERLAALAL95.00TpMuguga_02g00758hypothetical protein549570Positive
ERLAALALYGDYGEFDRKT71.2191011TP33:00201RLAALALYGDYGEFDRK93.50TpMuguga_02g00758hypothetical protein552570Positive
ERLAALALYGDYGEFDRKT67.9192229TP56:04001ERLAALALYGDYGEFDRKT95.00TpMuguga_02g00758hypothetical protein552570Positive
ERLAALALYGDYGEFDRKT72.219641TP26:01301ERLAALALYGDYGEFDRKT100.00TpMuguga_02g00758hypothetical protein552570Positive
ERLAALALYGDYGEFDRKTK61.9202229TP16:04001RLAALALYGDYGEFDRKTK92.50TpMuguga_02g00758hypothetical protein552571Positive
ERLAALALYGDYGEFDRKTKE58.1212229TP16:04001ERLAALALY95.00TpMuguga_02g00758hypothetical protein552572Positive
YGDYGEFDRKT52.8111011TP33:00201YGDYGEFDRK16.16TpMuguga_02g00758hypothetical protein560570Positive
YGDYGEFDRKTK47.7121011TP22:01201YGDYGEFDRKTK13.40TpMuguga_02g00758hypothetical protein560571Positive
YGDYGEFDRKTKEDSK53.7161011TP32:01201YGDYGEFDRKTKEDSK36.34TpMuguga_02g00758hypothetical protein560575Positive
YGDYGEFDRKTKEDSK37.9162229TP16:04001YGDYGEFDRKTKEDSK81.67TpMuguga_02g00758hypothetical protein560575Positive
YGDYGEFDRKTKEDSKN36.3171011TP13:00201YGDYGEFDRKTKEDSK94.00TpMuguga_02g00758hypothetical protein560576Positive
AKFPGMKKSKGPKDK54.3152229TP31:02301AKFPGMKKSKGPKDK39.77TpMuguga_02g00895hypothetical protein6781Positive
FRDDLGSSFTSGYTK59.3151011TP22:01201FRDDLGSSFTSGYTK2.96TpMuguga_02g00895hypothetical protein4862Positive
RDDLGSSFTSGYTK59.9141011TP12:01201RDDLGSSFTSGYTK0.58TpMuguga_02g00895hypothetical protein4962Positive
SSFTSGYTK38.491011TP12:01201SSFTSGYTK0.01TpMuguga_02g00895hypothetical protein5462Positive
SSFTSGYTKQDLDAKFPGMK68.6201011TP52:01201SSFTSGYTK25.30TpMuguga_02g00895hypothetical protein5473Positive
KTAPVTGGVK20.4101011TP12:01201KTAPVTGGVK0.46TpMuguga_03g00152:
TpMuguga_04g00321
histone H3, putative2837Negative
KTAPVTGGVKK22.7111011TP42:01201KTAPVTGGVKK0.16TpMuguga_03g00152:
TpMuguga_04g00321
histone H3, putative2838Negative
SGWEGRTL32.182229TP16:04001SGWEGRTL0.21TpMuguga_03g0042840S ribosomal protein S28e, putative3340Positive
SGWEGRTLI30.891011TP13:00201SGWEGRTLI0.67TpMuguga_03g0042840S ribosomal protein S28e, putative3341Positive
ILRTIVQQL32.79641TP36:01301ILRTIVQQL0.10TpMuguga_03g0071640S ribosomal protein S19, putative121129Positive
NSFVTDTFEKL43.5111011TP13:00201NSFVTDTFEKL1.68TpMuguga_04g00404histone H2B-III, putative4454Positive
SETYSTYIFKVLK47.8131011TP22:01201SETYSTYIFKVLK3.65TpMuguga_04g00404histone H2B-III, putative1527Positive
RLFNFATKRI37.610641TP16:01301RLFNFATKRI2.32TpMuguga_04g00503hypothetical protein236245Positive
Table 2. Summary of in vitro determined peptide-MHCI dissociation rates (expressed as half-life of peptide-BoLA-I complexes at 37 °C in hours) for a subset of peptides identified from 1011TP, 2229TP, and 641TP cell lines. The dissociation rate of each peptide binding to BoLA-I alleles 1*02301, 2*01201, 3*00201, and 6*01301 was assayed by a scintillation proximity assay, essentially as previously described (see Materials and Methods). Peptides that managed to form a complex BoLA-I molecule and for which a half-life could be measured were considered to be BoLA-I binders. Scores from the in vitro assay that corroborated the in silico predicted ability of peptides to bind to BoLA-I molecules expressed in the cell lines from which they were identified are highlighted with a dark grey background. Scores that are discrepant with the in silico predictions are highlighted with a black background in white script. Three peptides that bound to 6*01301 (BoLA-A18) but were identified from 1011Tp (BoLA-A10) are shown in light grey script. The scores of negative controls (no peptide) and positive control peptides are shown (no peptide) and positive control peptides are shown.
Table 2. Summary of in vitro determined peptide-MHCI dissociation rates (expressed as half-life of peptide-BoLA-I complexes at 37 °C in hours) for a subset of peptides identified from 1011TP, 2229TP, and 641TP cell lines. The dissociation rate of each peptide binding to BoLA-I alleles 1*02301, 2*01201, 3*00201, and 6*01301 was assayed by a scintillation proximity assay, essentially as previously described (see Materials and Methods). Peptides that managed to form a complex BoLA-I molecule and for which a half-life could be measured were considered to be BoLA-I binders. Scores from the in vitro assay that corroborated the in silico predicted ability of peptides to bind to BoLA-I molecules expressed in the cell lines from which they were identified are highlighted with a dark grey background. Scores that are discrepant with the in silico predictions are highlighted with a black background in white script. Three peptides that bound to 6*01301 (BoLA-A18) but were identified from 1011Tp (BoLA-A10) are shown in light grey script. The scores of negative controls (no peptide) and positive control peptides are shown (no peptide) and positive control peptides are shown.
Peptide−10lgPPeptide LengthSampleAccessionAlleleCore%RankSpectralMatch1*023012*012013*002016*01301
TQYERIKERL28.8210641TPTpMuguga_01g012106:01301TQYERIKERL0.01positive0.000.000.0036.54
SSFTSGYTK38.3891011TPTpMuguga_02g008952:01201SSFTSGYTK0.01positive0.001.430.000.22
SKDEHKKLY39.4892229TPTpMuguga_02g001421:02301SKDEHKKLY0.07positive31.740.000.000.00
ILRTIVQQL32.719641TPTpMuguga_03g007166:01301ILRTIVQQL0.10positive0.000.000.0024.03
ATIIGFHK27.9881011TPTpMuguga_02g002222:01201ATIIGFHK0.11positive0.000.760.000.76
GEFEKKYIPTL50.6711641TPTpMuguga_01g007576:01301GEFEKKYIPTL0.15positive0.000.000.0024.56
KTAPVTGGVKK22.73111011TPTpMuguga_03g00152:TpMuguga_04g003212:01201KTAPVTGGVKK0.16negative0.000.320.000.48
GQIQDVFKRL33.1410641TPTpMuguga_02g001236:01301GQIQDVFKRL0.25positive0.000.000.0019.10
KTAPVTGGVK20.38101011TPTpMuguga_03g00152:TpMuguga_04g003212:01201KTAPVTGGVK0.46negative0.000.320.000.00
SGWEGRTLI30.7991011TPTpMuguga_03g004283:00201SGWEGRTLI0.67positive0.000.000.300.00
RQMQVKLNLP23.7610641TPTpMuguga_01g009806:01301RQMQVKLNL0.79positive0.000.000.0010.25
RMDDKSGGLL42.7210641TPTpMuguga_01g007366:01301RMDDKSGGLL1.20positive0.000.000.0011.75
NSFVTDTFEKL43.45111011TPTpMuguga_04g004043:00201NSFVTDTFEKL1.68positive0.000.000.000.00
RLFNFATKRI37.6310641TPTpMuguga_04g005036:01301RLFNFATKRI2.32positive0.000.000.006.76
SLKSALIDTLI25.4711641TPTpMuguga_02g003336:01301SLKSALIDTLI2.47positive0.000.000.007.89
YGDYGEFDRKTK47.74121011TPTpMuguga_02g007582:01201YGDYGEFDRKTK13.40positive0.000.100.000.00
EIKERLAALAL35.54111011TPTpMuguga_02g007583:00201EIKERLAALAL14.51positive0.000.000.000.00
YGDYGEFDRKT52.75111011TPTpMuguga_02g007583:00201YGDYGEFDRK16.16positive0.000.000.000.00
SLKSALIDT21.529641TPTpMuguga_02g003336:01301SLKSALIDT22.80positive0.000.000.000.00
Negative control (no peptide) 0.000.000.000.00
Positive control 3.32.004.507.30
Table 3. Predicted BoLA-I presented T. parva peptides identified in this study. For each peptide, the accession number and description of the protein from which it is derived are shown in columns A and B, and specific comments about particular proteins are given in column C. Columns D–H provide details about the individual peptides, including their sequence, length, the sample(s) they were identified in, the BoLA-MHCI allele predicted to present the peptide, and the predicted percent rank binding score.
Table 3. Predicted BoLA-I presented T. parva peptides identified in this study. For each peptide, the accession number and description of the protein from which it is derived are shown in columns A and B, and specific comments about particular proteins are given in column C. Columns D–H provide details about the individual peptides, including their sequence, length, the sample(s) they were identified in, the BoLA-MHCI allele predicted to present the peptide, and the predicted percent rank binding score.
AccessionProtein DescriptionCommentsPeptideLengthSampleBoLA_MHCI Allele%Rank
TpMuguga_01g00075hypothetical protein NQPKNVVEF95350TP_rpt3:027010.04
TpMuguga_01g00100Protein arginine N-methyltransferase 5 SEIDVKDVL92824TP_rpt6:014010.01
TpMuguga_01g00151Insulinase (Peptidase family M16) family protein SEVAVSAMGPL112824TP_rpt6:014011.87
TpMuguga_01g00176putative integral membrane protein VEDEAAYHVQL112824_TP2:016010.03
TpMuguga_01g00188Tp6—Prohibitin-2 IGSAIKDNPAFITL142229TP6:040011.66
TpMuguga_01g00235Eukaryotic translation initiation factor 3 subunit A AEKEIVELV92123TP_rpt1:019010.01
TpMuguga_01g00386DNA polymerase family A family protein QAYQQKVDL92229TP2:025010.45
TpMuguga_01g00421RWD domain proteinIdentifications in A10 and A18 samplesSSISSSLLSVK111011TP2:012010.05
TpMuguga_01g00421RWD domain proteinIdentifications in A10 and A18 samplesKLIWRFIRHL10641TP_rpt6:013010.29
TpMuguga_01g00461Cytokine-induced anti-apoptosis inhibitor 1, Fe-S biogenesis family protein GQVTKASFFSSL12641TP_rpt6:013010.23
TpMuguga_01g00471Bifunctional thioredoxin reductase/thioredoxinIdentification in three different A15 samplesFEYEFPINH92123TP_rpt/2408TP/2408_TP_rpt1:009010.03
TpMuguga_01g00566Brix domain protein NKKRPISIGF105350TP_rpt2:026010.13
TpMuguga_01g00736hypothetical proteinIdentification in duplicate A18 samplesRMDDKSGGLL10641TP6:013011.20
TpMuguga_01g00757Ras family proteinIdentification in duplicate A18 samplesGEFEKKYIPTL11641TP6:013010.15
TpMuguga_01g00808Ubiquitin elongating factor core family protein SKKDLFIQF95350TP_rpt3:027010.01
TpMuguga_01g00926Cofilin/tropomyosin-type actin-binding family protein VEDHDEVRGALA122824TP_rpt2:016011.95
TpMuguga_01g00934Heat shock cognate 90 kDa protein SQFVKYPIQL10641TP_rpt6:013010.00
TpMuguga_01g01188Translation initiation factor IF-2 NNPIGRVGF95350TP_rpt3:027010.10
TpMuguga_01g01198hypothetical protein RALFDDLHRY102229TP4:024010.28
TpMuguga_01g01207Pre-mRNA-splicing factor srp2 FGPINRIDF95350TP_rpt3:027010.90
TpMuguga_01g01210Mitochondrial carrier family protein TQYERIKERL10641TP6:013010.01
TpMuguga_01g02005Rab-GTPase-TBC domain protein KLNEQKILSL10641TP_rpt6:013010.08
TpMuguga_01g02030hypothetical proteinIdentifications in A12 and A15 samplesEEIAHVLHY92123TP_rpt1:01901/1:009010.39/1.00
TpMuguga_01g02175Sas10 C-terminal domain protein YLHEFHNFI92123TP_rpt2:025011.69
TpMuguga_02g00113DNA replication licensing factor MCM6 SSLLKLTNK9A10TP_rpt2:012010.01
TpMuguga_02g00123DEAD/DEAH box helicase GQIQDVFKRL10641TP6:013010.25
TpMuguga_02g00142Polyubiquitin SKDEHKKLY92229TP1:023010.07
TpMuguga_02g00248Nucleolar GTP-binding protein 1 HMFSGKRTL9641TP_rpt6:013010.05
TpMuguga_02g00488ATP synthase subunit D AEDFKSLVI92824TP_rpt2:016010.01
TpMuguga_02g00543putative integral membrane protein KLANSKNVSL10641TP_rpt6:013010.11
TpMuguga_02g0055123 kDa piroplasm membrane protein SKATDRLVV95350TP_rpt3:027010.22
TpMuguga_02g00613DEAD/DEAH box helicase AKKITELGF95350TP_rpt2:026010.06
TpMuguga_02g00703putative integral membrane proteinthree different identifications in one A20 sampleVKKLKESLL95350TP_rpt2:026010.02
TpMuguga_02g00703putative integral membrane proteinthree different identifications in one A20 sampleNKLGDPLTL95350TP_rpt3:027010.12
TpMuguga_02g00703putative integral membrane proteinthree different identifications in one A20 sampleYKPEGMEYPF105350TP_rpt3:027011.59
TpMuguga_02g00706hypothetical protein YQKNSNNPFM105350TP_rpt2:026011.11
TpMuguga_02g00718hypothetical protein GKNSVLLQV95350TP_rpt2:026010.15
TpMuguga_02g00723hypothetical protein RTFNDVSKRKH112408_TP1:009010.71
TpMuguga_02g00753chaperone protein DnaK TQVGIKVY82408_TP1:009010.15
TpMuguga_02g00895Tp9—Hypothetical proteintwo overlapping identifications in one A10 sample(RDDLG)SSFTSGYTK9/141011TP2:012010.01/0.58
TpMuguga_02g00896hypothetical protein AQGDPVFL82408_TP2:025010.16
TpMuguga_03g00253hypothetical proteinIdentifications in A10, A15 and A20 samplesLQSEVFPNY92408_TP1:009010.11
TpMuguga_03g00253hypothetical proteinIdentifications in A10, A15, and A20 samplesFNFSESKLTF105350_TP3:027011.38
TpMuguga_03g00253hypothetical proteinIdentifications in A10, A15, and A20 samplesLNTSIGGSL9A10_TP3:002011.00
TpMuguga_03g00257hypothetical protein SQNNRSEMSNL11641TP_rpt6:013010.07
TpMuguga_03g00330hypothetical protein SSMRDALNPPPTH132408_TP1:009011.16
TpMuguga_03g00388hypothetical protein SQKRKNKPL9641TP_rpt6:013010.01
TpMuguga_03g00469High mobility group protein homolog NHP1 AKKDPNAPKRAL125350TP_rpt2:026010.13
TpMuguga_03g00478DEAD/DEAH box helicase family protein YVGKAPTLW95350TP_rpt3:027011.32
TpMuguga_03g00507CCA tRNA nucleotidyltransferase mitochondrial SKVDRVSL82229TP1:023010.09
TpMuguga_03g00544putative integral membrane protein NVFPLILGK9A10_TP2:012010.08
TpMuguga_03g00577Cwf15/Cwc15 cell cycle control family protein SQQPPSFLNDAVRTDFH172408_TP1:009010.52
TpMuguga_03g00655N4—Hypothetical protein GVDVDQLLH92123TP_rpt1:009010.36
TpMuguga_03g00747DEAD/DEAH box helicase SMKGKHELTL10641TP6:013010.01
TpMuguga_03g00852Protein transport protein Sec61 subunit alphatwo different identifications in one A10 sampleSSMVMQLLAGSK12A10TP_rpt2:012011.27
TpMuguga_03g00852Protein transport protein Sec61 subunit alphatwo different identifications in one A10 sampleKGTEFEGALISL12A10TP_rpt3:002011.43
TpMuguga_03g00858Multiprotein-bridging factor 1cIdentifications in duplicate A10 samplesAGVELDTQKKFL12A10_TP3:002011.36
TpMuguga_03g00861p150—Hypothetical protein LGPILIYEDL10A10TP_rpt3:002010.10
TpMuguga_03g02030N-terminal region of Chorein, a TM vesicle-mediated sorter family protein SSIDVNVKL91011TP3:002010.09
TpMuguga_03g02350RF-1 domain protein KQVVRDAMVEQDML14641TP6:013011.58
TpMuguga_03g02680putative integral membrane protein FGIPLVTK8A10TP_rpt2:012010.38
TpMuguga_04g00144hypothetical protein NSNELKDIK9A10TP_rpt2:012011.60
TpMuguga_04g00206hypothetical protein IEDEVCKVI92824TP_rpt2:016010.09
TpMuguga_04g00229Pescadillo N-terminus family protein SLMPKKHKRLL11641TP_rpt6:013010.08
TpMuguga_04g00233Protein IWS1 homolog SQMSRNIESKH112123TP_rpt1:009010.03
TpMuguga_04g0036826S proteasome non-ATPase regulatory subunit 4 homolog IGLIASAKL9A10TP_rpt3:002010.27
TpMuguga_04g00439hypothetical protein NSQLRQKIRSM11641TP_rpt6:013011.09
TpMuguga_04g00484OST-HTH Associated domain protein SGINLGNVNSL11A10_TP3:002010.09
TpMuguga_04g00662putative integral membrane protein DKKALTVAL95350TP_rpt3:027010.01
TpMuguga_04g00754hypothetical protein SQFPRNPVDSLL12641TP_rpt6:013010.01
TpMuguga_04g00785Helicase C-terminal domain NNFNHSLL85350TP_rpt3:027010.28
TpMuguga_04g00790AP2-coincident C-terminal family protein SQSEEIEKYLH112408_TP1:009010.07
TpMuguga_04g02060hypothetical protein HQQDSQYYLTQH122408_TP1:009010.10
TpMuguga_04g02625Importin subunit alpha-6 KQLRNENEI9641TP_rpt6:013010.41
Table 4. Predicted BoLA-DR presented T. parva peptides identified in this study. For each peptide, the accession number and description of the protein from which it is derived are shown in columns A and B, and specific comments about particular proteins are given in column C. Column D–H provide details about the individual peptides, including its sequence, length, the sample it was identified in, the BoLA-DR allele predicted to present the peptide and the predicted percent rank binding score (%RPS).
Table 4. Predicted BoLA-DR presented T. parva peptides identified in this study. For each peptide, the accession number and description of the protein from which it is derived are shown in columns A and B, and specific comments about particular proteins are given in column C. Column D–H provide details about the individual peptides, including its sequence, length, the sample it was identified in, the BoLA-DR allele predicted to present the peptide and the predicted percent rank binding score (%RPS).
AccessionProtein DescriptionCommentsPeptideLengthSampleBoLA_DR Allele%RPS
TpMuguga_01g00016Vacuolar protein sorting/targeting protein 10 LDEKSYTILDTSEGAVI17641TP_DRDRB3_20020.23
TpMuguga_01g00324Tubulin/FtsZ family, GTPase domain proteinfour nested peptides in one sample(EFQ)TNLVPYPRIHFML13–165072TP_DRDRB3_10010.1/0.10/0.15/0.34
TpMuguga_01g00552hypothetical protein RAKEYNFISKIVYRS15495TP_DRDRB3_10010.06
TpMuguga_01g00701Rhoptry-associated protein 1 (RAP-1) family proteintwo nested peptides in one sample(SE)VQHVVFSFLNDPYK14, 165350TP_DRDRB3_12010.01/0.03
TpMuguga_01g00937Cell division control protein 48 homolog Etwo nested peptides in one sample(RPG)RLDQLIYIPLPDLPAR16, 19641TP_DRDRB3_20020.00/0.01
TpMuguga_01g00972hypothetical protein LPVWEAVNDERVDEA152824TP_DRDRB3_16010.04
TpMuguga_01g00987P104 VAPKDTTLEYLKVFLNK17641TP_DRDRB3_20020.46
TpMuguga_01g01056p32—Merozoite Antigentwo nested peptides in one sampleSEVKFETYYDDVLFKGK(S)17, 185350TP_DRDRB3_12010.00/0.00
TpMuguga_01g01129hypothetical protein TKTVSFSNKISFHYF15495TP_DRDRB3_10014.02
TpMuguga_02g00253ATP synthase, Delta/Epsilon chain, beta-sandwich domain protein NKDLVFSLLSSHEALY16641TP_DRDRB3_20020.06
TpMuguga_02g0055123 kDa piroplasm membrane protein EDRLATYKPFTEDPSKKR185350TP_DRDRB3_12010.01
TpMuguga_02g00753chaperone protein DnaK DFDQRILNFLVDEFKK165350TP_DRDRB3_12010.12
TpMuguga_04g0039526S proteasome non-ATPase regulatory subunit 6 KQGDLLINRIQKLSRIIDM192123TP_DRDRB3_15014.22
TpMuguga_04g007192-oxoisovalerate dehydrogenase subunit alpha mitochondrialtwo nested peptides in one sample(E)NTKAYELLPGLFDDV15, 16641TP_DRDRB3_20020.01/0.06
TpMuguga_04g02435ARF guanine-nucleotide exchange factor GNL1two nested peptides in one sampleKEKDFLADITKELDESQ(S)17, 182824TP_DRDRB3_16010.11/0.84
Table 5. T. parva proteins that have been shown to contain epitopes recognised by either CD4+ and/or CD8+ T-cells and as sources of peptides identified in T. parva samples during pMHC elution of either BoLA-I or BoLA-DR. Identity of confirmed CD4+ and CD8+ T-cell epitopes come from Graham et al. (2006) and Morrison et al. (2021).
Table 5. T. parva proteins that have been shown to contain epitopes recognised by either CD4+ and/or CD8+ T-cells and as sources of peptides identified in T. parva samples during pMHC elution of either BoLA-I or BoLA-DR. Identity of confirmed CD4+ and CD8+ T-cell epitopes come from Graham et al. (2006) and Morrison et al. (2021).
AccessionProtein NameAntigens Recognised by T. parva-SpecificIdentified in Peptide Elution From
CD8CD4DRMHCI
TpMuguga_01g00188Tp6—Prohibitin-2CD8--MHCI
TpMuguga_02g00123Tp32—DEAD/DEAH box helicaseCD8CD4-MHCI
TpMuguga_02g00895Tp9—Hypothetical proteinCD8CD4-MHCI
TpMuguga_03g00655Tp13—Hypothetical protein-CD4-MHCI
TpMuguga_03g00861Tp20—Hypothetical protein-CD4-MHCI
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Connelley, T.; Nicastri, A.; Sheldrake, T.; Vrettou, C.; Fisch, A.; Reynisson, B.; Buus, S.; Hill, A.; Morrison, I.; Nielsen, M.; et al. Immunopeptidomic Analysis of BoLA-I and BoLA-DR Presented Peptides from Theileria parva Infected Cells. Vaccines 2022, 10, 1907. https://doi.org/10.3390/vaccines10111907

AMA Style

Connelley T, Nicastri A, Sheldrake T, Vrettou C, Fisch A, Reynisson B, Buus S, Hill A, Morrison I, Nielsen M, et al. Immunopeptidomic Analysis of BoLA-I and BoLA-DR Presented Peptides from Theileria parva Infected Cells. Vaccines. 2022; 10(11):1907. https://doi.org/10.3390/vaccines10111907

Chicago/Turabian Style

Connelley, Timothy, Annalisa Nicastri, Tara Sheldrake, Christina Vrettou, Andressa Fisch, Birkir Reynisson, Soren Buus, Adrian Hill, Ivan Morrison, Morten Nielsen, and et al. 2022. "Immunopeptidomic Analysis of BoLA-I and BoLA-DR Presented Peptides from Theileria parva Infected Cells" Vaccines 10, no. 11: 1907. https://doi.org/10.3390/vaccines10111907

APA Style

Connelley, T., Nicastri, A., Sheldrake, T., Vrettou, C., Fisch, A., Reynisson, B., Buus, S., Hill, A., Morrison, I., Nielsen, M., & Ternette, N. (2022). Immunopeptidomic Analysis of BoLA-I and BoLA-DR Presented Peptides from Theileria parva Infected Cells. Vaccines, 10(11), 1907. https://doi.org/10.3390/vaccines10111907

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop