Next Article in Journal
Large-Scale Screening of HCMV-Seropositive Blood Donors Indicates that HCMV Effectively Escapes from Antibodies by Cell-Associated Spread
Previous Article in Journal
Heartland Virus Epidemiology, Vector Association, and Disease Potential
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Proteogenomic Identification of a Novel Protein-Encoding Gene in Bovine Herpesvirus 1 That Is Expressed during Productive Infection

1
Department of Biochemistry, Molecular Biology, Entomology & Plant Pathology, Mississippi State University, 32 Creelman St., Starkville, MS 39762, USA
2
Department of Veterinary Pathobiology, Center for Veterinary Health Sciences, Oklahoma State University, Stillwater, OK 74078, USA
3
Department of Virology, Faculty of Veterinary Medicine, Benha University, Moshtohor 13736, Kaliobyia, Egypt
4
Department of Basic Sciences at the College of Veterinary Medicine, Mississippi State University, 240 Wise Center Drive, Starkville, MS 39762, USA
*
Author to whom correspondence should be addressed.
Viruses 2018, 10(9), 499; https://doi.org/10.3390/v10090499
Submission received: 19 July 2018 / Revised: 7 September 2018 / Accepted: 12 September 2018 / Published: 14 September 2018
(This article belongs to the Section Animal Viruses)

Abstract

:
Bovine herpesvirus 1 (BoHV-1) is one of several microbes that contributes to the development of the bovine respiratory disease (BRD) and can also induce abortions in cattle. As other alpha-herpesvirinae subfamily members, BoHV-1 efficiently replicates in many cell types and subsequently establishes a life-long latent infection in sensory neurons. BoHV-1 encodes more than 70 proteins that are expressed in a well-defined manner during productive infection. However, in silico open reading frame (ORF) prediction of the BoHV-1 genome suggests that the virus may encode more than one hundred proteins. In this study we used mass spectrometry followed by proteogenomic mapping to reveal the existence of 92 peptides that map to previously un-annotated regions of the viral genome. Twenty-one of the newly termed “intergenic peptides” were predicted to have a viable ORF around them. Twelve of these produced an mRNA transcript as demonstrated by strand-specific RT-PCR. We further characterized the 5′ and 3′ termini of one mRNA transcript, ORF-A, and detected a 55 kDa protein produced during active infection using a custom-synthesized antibody. We conclude that the coding potential of BoHV-1 is underestimated.

1. Introduction

Bovine herpesvirus type 1 (BoHV-1) is a linear, double stranded DNA virus composed of a host-derived lipid envelope and tegument proteins surrounding an icosahedral nucleocapsid. It is a member of the alphaherpesvirus subfamily within the Varicellovirus genus [1]. Three subtypes, BoHV-1.1, BoHV-1.2a, and BoHV-1.2b are recognized and are distinguished by their differences in endonuclease restriction sites [2,3]. BoHV-1.1 is commonly found in respiratory infections in feedlot cattle, while genital infections and abortions are more common in breeding cattle [4]. In fact, BoHV-1 is the most frequently diagnosed cause of viral abortion in North America. The subtype 2a typically causes balanopostitis and abortions and is not usually associated with respiratory disease, while subtype 2b can be associated with respiratory disease but not abortions [5,6].
BoHV-1 and other viruses such as bovine respiratory syncytial virus (BRSV) or bovine parainfluenza virus type 3 (bPI3V), are attributed to host immune suppression and a condition known as bovine respiratory disease (BRD) in which the attenuated immune response increases the risk of secondary infections of bacteria like Mannheimia haemolytica, Pasteurella multocida, or Histophilus somni, which can then lead to pneumonia [7,8,9]. A BoHV-1 entry protein encoded by the poliovirus receptor related 1 gene is a known marker of BRD susceptibility in Holstein calves, confirming that BoHV-1 is an important BRD cofactor [10].
Host immunosuppression by BoHV-1 can be partially attributed to the ability of the virus to reduce the number of CD4+ T lymphocytes, induce apoptosis in lymphoid tissues, and downregulate the number of MHC class I molecules [11,12]. In addition, BoHV-1 can establish multiple acute infections throughout life due to its ability to establish latency in the sensory neurons of the trigeminal ganglia until the virus is reactivated by stress or immune suppression [4,13]. Due to its ability to cause widespread infection, BoHV-1 has a persistent negative economic impact on the cattle industry [14,15]. Essential to creating an effective vaccine are the understanding of viral mechanisms necessary for productive infection and immune stimulation, and identification of viral and host proteins that participate in these processes.
BoHV-1 genome sequencing and structural annotation (demarcation of the boundaries of coding elements) was conducted internationally in 1996 by a collaboration of several groups that analyzed genome segments created by the Hind III restriction endonuclease [16,17,18,19,20]. When it was completely sequenced, the nearly 140 kb BoHV-1 genome, which is divided into a 104 kb unique long (UL) and a 10 kb unique short (US) region, was found to have 67 unique open reading frames (ORFs) and two ORF duplications, encoded on both strands in one of six different reading frames [21]. However, this sequence was essentially a chimera of multiple strains of the BoHV 1.1 and 1.2 subtype. The Cooper strain of the virus was sequenced in 2013, providing access to a full sequence for the subtype 1.1 [22]. Even though the differences between the BoHV-1.1 sequence and the (old) composite sequence were not extensive, the original annotation was not modified. Therefore, a more accurate knowledge of the viral coding capacity could be achieved through re-annotation of the BoHV-1.1 sequence.
Genome structural annotation is an integral part of genome sequencing efforts and is often accomplished by computational gene prediction based on comparative genomics. Computational gene prediction is performed on a finished genome sequence by searching databases for sequence similarities, or by analyzing multiple genomes at once to characterize similar elements [23,24]. Proteogenomic mapping (PGM) has emerged as a valuable complementary genome structural annotation method. PGM approach involves mass spectrometry-based identification of all expressed peptides from a biological sample, mapping the peptides to the genome sequence to identify genome coordinates to improve existing annotation and to identify novel protein-encoding ORFs that were missed by the computational prediction algorithms [25,26]. This experiment-based approach to genome structural annotation can capture context dependent expression of the genome and generate a more accurate genomic map, and there is increasing appreciation for the need to combine experimental genome annotation approaches with computational predictions for comprehensive annotation of genome sequences.
The proteogenomic mapping has been beneficial in annotating the genome of herpesviruses and identifying post-translational modifications of viral proteins [25,27,28,29,30]. They have been utilized in the case of many herpesviruses for annotation, characterization, and discovery of novel protein-coding genes. MS/MS has been essential for the characterization of proteins produced by Kaposi’s sarcoma herpesvirus (KSHV) coding capacity [27,31]. Proteomics has also been exceedingly useful in the protein characterization of non-human gammaherpesviruses such as rhesus monkey rhadinovirus, murid herpesvirus 4, and bovine herpesvirus 4 [30,32,33] Proteogenomic mapping made possible the identification of proteins produced by channel catfish virus during infection [25]. Many of the same methods have helped clarify the genomes of several avian herpesviruses [34].
In the present study, we used proteogenomic mapping to identify viral proteins expressed during infection. Our results confirmed the expression of 57 previously annotated viral proteins as well as identified many peptides that mapped to genomic coordinates of BoHV-1.1 Cooper strain with no annotated ORFs. We further characterized one of these novel genes named “ORF-A”, with respect to mRNA transcript boundaries and its RNA and protein expression. Our results suggest that the coding potential of BoHV-1 is underestimated in the current computational-based annotation.

2. Materials and Methods

2.1. Cells and Virus

Madin Darby Bovine Kidney (MDBK) and cells resistant to infection by the bovine viral diarrhea virus (CRIB-1) were cultured in Dulbecco’s modified Eagle’s medium (DMEM) supplemented with 5% (v/v) fetal calf serum, 100 U/mL penicillin and 100 μg/mL streptomycin. CRIB-1 cells are derived from MDBK cells [35]. Rabbit skin (RS) cells were grown in minimal essential medium (MEM) supplemented with 10% (v/v) FCS, penicillin (10 U/mL), and streptomycin (100 μg/mL). All cells were kept at 37 °C in a humidified incubator with 5% CO2 supply. The Bovine herpesvirus 1.1 (BoHV-1), Cooper isolate (GenBank Accession number JX898220.1) was used to infect cultured cells at multiplicity of infection (MOI) of 1 or 5.

2.2. Infected Whole Cell Sample Preparation

CRIB-1 cells were infected at a multiplicity of infection (MOI) of 5 and incubated at 4 °C for one hour to synchronize infection, before placing in humidified CO2 chamber at 37 °C. Infected samples were collected at 2, 8, and 16 h post infection (hpi). Cells were washed with cold phosphate-buffered saline (PBS), scraped and centrifuged for 30 min at 4 °C, and stored at −80 °C until processing for mass spectrometry. Whole cell extracts from uninfected and BoHV-1-infected samples (n = 3 for each) were solubilized in 10% SDS (to a final concentration of 4%) and treated with protease inhibitor cocktail (1/10 of final sample volume) (Sigma-Aldrich, Saint Louis, MO, USA). Samples were prepared for quantification with the Thermo Scientific Pierce BCA Protein Assay Kit (Fisher Scientific, Pittsburgh, PA, USA) and protein concentration was determined using a plate spectrophotometer (ThermoMax Microplate, Molecular Devices, Sunnyvale, CA, USA). Samples containing 100 µg protein were precipitated with methanol and chloroform (4:1), washed with methanol, spun and air-dried in a desiccator prior to tryptic digestion. Protein samples were treated with 8 M urea (RT for 30 min), reduced (0.005 M DTT at 65 °C for 10 min) and alkylated (0.01 M iodoacetamide at 37 °C for 30 min). Protein samples were diluted with water, pH was adjusted to 7.5 and were digested with molecular biology grade porcine trypsin (2 µg at 37 °C, overnight, 50:1 ratio of protein:trypsin, Promega Corporation, Madison, WI, USA). Tryptic peptides were cleaned up using a strong cation exchange (SCX) trap (Michrom BioResources Inc., Auburn, CA, USA), eluted in high salt buffer (0.005 M NaH2 PO4, 25% acetonitrile, 0.25 M KCl, pH 3) and dried. Dried tryptic peptides were desalted using a peptide macrotrap (Michrom BioResources Inc., Auburn, CA, USA), eluted in 0.1% triflouroacetic acid, 95% acetonitrile solution, and air-dried in a desiccator.

2.3. Liquid Chromatography-Tandem Mass Spectrometry

Spectral data were collected using an Orbitrap LTQ Velos mass spectrometer (Thermo Fisher Scientific, Waltham, MA, USA) linked with an UltiMate 3000 nano flow HPLC system (Thermo Fisher Scientific, Waltham, MA, USA). Two micrograms of protein tryptic digest were loaded on reversed phase fused silica Acclaim PepMap C18 column (75 µm × 150 mm, Thermo Fisher Scientific). A constant flow rate of 0.3 microliters per minute was used to separate and elute peptides with a 60-min long linear gradient of acetonitrile in 0.1% formic acid: 2–55% for 35 min, 95% for 10 min, 2% for 15 min. Peptides were detected by linear trap mass detector, in data dependent acquisition (DDA) mode with dynamic exclusion. One MS scan event (m/z range: 300–2000) was followed by seven tandem mass spectrometry (MS/MS) scans for the seven most intense ions detected in MS scan. Set parameters included: Normalized collision energy: 35%, AGC (automatic gain control) “on” with MSn Target 4 × 104, isolation width (m/z): 1.5, capillary temperature 170 °C, spray voltage 1.97 kV.
The raw files were searched using the SEQUEST algorithm of the Proteome Discoverer 1.1 software (Thermo Fisher Scientific) with given parameters: Lowest and highest charge: +1 and +3, respectively; minimum and maximum precursor mass: 300 and 6000 Da, respectively; minimum S/N ratio: 3; enzyme: trypsin; maximum missed cleavages: 2, FDR = 0.01; dynamic modifications: cysteine carbamidomethylation (+57.021), methionine oxidation (+15.995), methionine dioxidation (+31.990).

2.4. Proteogenomic Mapping

The proteogenomic target database was created by using the EMBOSS tool suite [36] to translate the entire BoHV-1.1 genome (Accession: JX898220.1) in all six reading frames, mask all stop codons, and split the resulting amino acid sequences into pseudo-proteins of length 500 overlapping by 50 amino acids. The decoy database was created by randomizing the sequences in the target database. The spectra were searched against both databases using X!tandem and OMSSA [37]. The X!tandem and OMSSA results were combined and filtered using the Proteomics Toolset (sf.net/projects/proteomicstools/). For each time point, the filter was optimized for a peptide FDR less than 5%. The peptides that passed filtering were aligned to the genome using tblastn [38]. Any alignment without 100% identity over the entire peptide was removed from further analysis. In addition, this list of peptides was blasted to reveal no matches against all host Bos Taurus proteins currently on NCBI. The bedtools tool suite was used to remove any alignment that overlapped the currently annotated ORFs, leaving only the intergenic peptides [39]. The genomic sequence corresponding to novel peptides was scanned up- and downstream in search of start and stop codons that constitute the boundaries of a potentially new gene [26]. In this study we define unique peptides as peptides mapping to a single location in the genome.

2.5. RNA Extraction, RT-PCR, and RACE

Messenger RNA (mRNA) was extracted from CRIB-1 cells that were mock infected or infected for 16 h with a MOI of 5 using the magnetic mRNA isolation kit (#S1550S, New England Biolabs, Ipswich, MA, USA). This was followed by DNase I treatment (#EN0521, Thermo Fischer Scientific), and the synthesis of complementary DNA (cDNA) via reverse transcriptase using the Invitrogen Superscript III module (#18080-044, Carlsbad, CA, USA) and gene specific primers produced by Eurofins (Louisville, KY, USA; see Table 1a). For the controls, cDNA was synthesized using random hexamer primers provided in the Invitrogen module. The strand-specific cDNA was used as a template for polymerase chain reaction (PCR) using the OneTaq module (#M0480L, New England Biolabs).
Rapid amplification of 3′ and 5′ cDNA ends (RACE) was performed using the Invitrogen GeneRacer Kit according to manufacturer’s instructions (L1502-01) and gene-specific primers (Eurofins; see Table 1b). Touchdown PCR was performed for RACE to increase the specificity of the transcript being amplified. To elucidate the 3′ end of the transcript, reverse primers designed in a stepwise manner were used in conjunction to the original “A” forward primer eventually capturing its full length. Transcription profiling was carried out at 4, 8, 12, 15, and 24 hpi via RT-PCR using reverse primer 9. Negative controls were conducted by using the DNase treated mRNA as a template instead of the cDNA to discard the possibility of genomic DNA contamination.

2.6. DNA Gel Electrophoresis, Purification, and Sequencing

PCR amplicons were visualized using a 1% agarose gel in TAE buffer (40 mM Tris (pH 7.6), 20 mM acetic acid, and 1 mM EDTA). Bands for the 5′ RACE and or other amplicons were excised with a clean scalpel and purified using the Monarch nucleic acid purification kit (#T1020, New England Biolabs). The purified DNA was quantified using NanoDrop, sent to Eurofins for sequencing, and compared to the full BoHV-1 Cooper sequence (accession: JX898220.1).

2.7. Generation of Peptide Specific Antiserum Directed against ORF-A

Two peptides present in ORF-A (RPARRGGHRRGEIGDA and DQPPKVAGRSTKHVGRGS) were used to generate peptide specific antiserum in rabbits (Pierce Biotechnology, Waltham, MA, USA). The antibody was generated by mixing both peptides in equal molar concentrations prior to injection of rabbits. All injections were performed with both peptides. The serum was collected, and peptide specific antibodies purified by affinity chromatography.

2.8. Western Blot Analysis

For Western blot studies, the designated cell line was infected with a MOI of 1. At different times after infection, cell cultures were washed with PBS and suspended in RIPA lysis buffer (50 mM Tris HCL (pH 8.0), 150 mM NaCL, 2 mM EDTA (pH 8.0), 1% NP-40, 0.5% sodium deoxycholate, 0.1% SDS) and one tablet of Complete Protease Inhibitor (Roche Molecular Biochemicals, Mannheim, Germany) in 10 mL of buffer. Cell lysate was incubated on ice for 30 min, sonicated, and then clarified by centrifugation at 13,000 RPM at 4 °C for 15 min. Protein concentrations were determined by the Bradford assay (Bio-Rad, Hercules, CA, USA). For SDS-PAGE, proteins were mixed with an equal amount of 2× sample loading buffer (62.5 mM Tris-HCl (pH 6.8), 2% SDS, 50 mM dithiothreitol, 0.1% bromophenol blue, 10% glycerol) and boiled for 5 min. Proteins were separated in an SDS–10% PAGE gel.
After electrophoresis, proteins were transferred onto a polyvinylidene difluoride membrane (Immobilon-P; Millipore, Burlington, MA, USA) and blocked for 1 h in 5% w/v nonfat dry milk with 1× Tris-buffered saline–0.1% Tween 20 (TBS-T). Membranes were then incubated with the designated primary antibody at 4 °C with gentle shaking overnight. The ORF-A specific antibody was diluted 1:1000 in the blocking solution. An antibody directed against tubulin was used as a loading control. After 45 min of washing with TBS-T, the blots were incubated with secondary antibodies (peroxidase-conjugated immunoglobulin G (Amersham Biosciences, Little Chalfont, UK), which was diluted 1:2000 in 5% nonfat milk in TBS-T for 1 h. Blots were washed 45 min with TBS-T and exposed to Amersham ECL reagents, and then autoradiography was performed. The secondary donkey anti-rabbit antibody (NA9340V, Chicago, IL, USA) was purchased from GE Healthcare.

3. Results

3.1. Proteogenomic Analysis of Infected Cells Identified Potential Novel Open Reading Frames

Guided by the hypothesis that the coding potential of BoHV-1 was underestimated, we conducted an ORF prediction analysis of the BoHV-1 genome using the FGENESV algorithm (www.softberry.com). This algorithm for viral gene prediction identified 116 potential ORFs in the 2012 sequence corresponding to the Cooper strain (JX898220.1 Bovine herpesvirus type 1.1 isolate NVSL challenge 97-11). The same analysis done using the 1996 multi-strain BoHV-1 sequence (NC_001847.1 Bovine herpesvirus 1) yielded 109 potential ORFs. Both these analyses revealed a significantly higher number of ORFs than the originally annotated 69 ORFs. In addition, our analysis revealed slight differences ORF boundaries for about 10 of the currently annotated ORFs (data not shown). Our computational analysis of both old and new BoHV-1 genomes suggested that the coding potential of the BoHV-1 genome is underestimated and constituted the starting point for exploring the genome with a new approach that incorporates experimental data into the re-annotation process.
After preparing infected cells for tandem mass spectrometry (MS/MS), the retrieved spectra were searched against a database consisting of the full genomic sequence of the BoHV-1 Cooper strain translated in all 6 reading frames. BoHV-1.1 proteogenomic mapping analysis identified a total of 2493 MS-MS spectra, 344 of them being unique (i.e., mapping to a single location in the whole BoHV-1.1 genome) and mapping to BoHV-1 currently annotated protein coding ORFs (Figure 1 and Figure 2; Table 2). In addition to these, other MS spectra mapped to viral genomic regions with no current annotation. We designated these intergenic spectra or intergenic peptides. We identified a total of 92 unique intergenic peptides (Figure 1 and Figure 2; Table 2). In order to determine if these peptides were encompassed within a potential ORF, the surrounding sequences were scanned up- and downstream in search of in-frame start and stop codons that could correspond to the putative new ORF. Out of the 92 unique intergenic peptides, 21 of them exhibited a potential ORF around them. Supplementary Figure S1 shows a diagram illustrating the proteogenomic mapping process and the work flow in this study. Peptides mapping to a region with no identifiable start and stop codons were excluded from further analysis. Table S1 summarizes the proteogenomic mapping data.

3.2. Validation of Intergenic Peptides with Surrounding Viable ORFs

To determine whether RNA transcripts were being synthesized from the 21 putative new ORFs (intergenic peptides embedded within a predicted ORF), we used strand specific RT-PCR (Figure 3). Custom primers are shown in Table 1. The use of strand specific reverse transcription was imperative in this procedure to avoid the misleading amplification of the genes located directly antisense from the predicted sequences on the reverse strand (see Figure 1). For the intergenic transcripts shown in Figure 3b, ORF-A is located antisense to the UL50 gene (encodes a dUTPase), and ORF-B is antisense to UL46 (encodes a viral tegument protein). A negative control without retrotranscription is also imperative as a control of no genomic DNA contamination, or random cDNA synthesis from transcripts derived from the opposite strand. In none of our experiments did we obtain amplification of RT(-) samples (Figures 3, 5, and 8). The cellular gene for actin and the viral bICP4 gene were used as positive controls to establish that samples were infected (+) or uninfected (-) (Figure 3a), while the successful retro-transcription and PCR amplification of two putative new transcripts, called ORF-A and ORF-B, at the predicted sizes is shown in Figure 3b. Of the 19 transcripts that were screened, 12 showed successful amplification. Two of the intergenic peptides were not screened by RT-PCR due to difficulties in primer design, since BoHV-1.1 has a 72% GC content. Importantly, amplification of putative new ORFs was never detected in mRNA derived from uninfected cells.

3.3. Characterization of 5′ Terminus of ORF-A via Rapid Amplification of cDNA Ends (RACE)

In order to obtain the full-length sequence of the predicted ORFs, RACE was used to amplify the 5′ extension of the transcript. This was achieved using a combination of primers that align with the predicted coding region and a specialized primer provided in the Invitrogen kit that binds to the 5′ cap structure of mRNA transcripts. Nested primers were then used with the initial PCR product as template for increased specificity. The final PCR result showed a significant amplicon around 300 kb for the 5′ RACE product (Figure 4). The amplicon was excised, sequenced and blasted to align with genomic sequences consistent with the predicted location of ORF-A, with its 5′ boundary at nucleotide 7947 on the forward strand of the BoHV-1.1 genome (Figure 6).

3.4. Characterization of 3′ Terminus of the A Transcript by Primer Walking

Characterizing the 3′ end of ORF-A transcript via RACE yielded suboptimal results, leading us to adopt an alternative strategy. Sixteen reverse primers were designed that progressively covered the transcript to its 3′ terminus. These strand-specific reverse primers were used for individual cDNA synthesis reactions to avoid retro-transcription of transcripts generated from the negative strand. After PCR, successful and increasingly longer amplicons were obtained (Figure 5). Primer 13 produced the longest amplicon, which was 1582 bp (Figure 5). Downstream of this location (primers 14–16) did not yield consistent amplification. Therefore, we predict the 3′ end of ORF-A transcript to fall after nucleotide 9529 but before nucleotide 9724 on the forward strand of the BoHV-1.1 genome (Figure 6). Amplicons were sequenced and blasted against the BoHV-1.1 genome to confirm the identity of the viral sequence.

3.5. The Novel ORF-A Expresses a 55 KDa Protein in Bovine Cells

To test whether ORF-A expresses a protein during infection, wild-type BoHV-1 was used to infect bovine kidney (MDBK) or rabbit skin cells (RS). At various times after infection, whole cell lysate was prepared, and Western blot analysis performed after separation of proteins on an SDS polyacrylamide gel. By 4 h after infection of MDBK cells, a viral specific protein migrating at approximately 55 kDa was detected (Figure 7; denoted by arrow). The size of the protein is consistent with a mRNA transcript of about 1582 bp (Figure 5). The intensity of this protein increased at later times after infection. Control uninfected cells (0 and 24 h after infection) were not specifically recognized by the peptide-specific antibody generated against ORF-A. Tubulin was included as a protein loading control. An additional viral-specific protein was detected that was migrating at approximately 140 kDa (Figure 7, denoted by closed circle). In RS cells, the peptide-specific ORF-A antiserum detected a viral specific 55 kDa protein at 16 and 24 h after infection: however, the 140 kDa viral protein was not readily detected. In general, BoHV-1 replication was slower, and the viral yields are lower in RS cells relative to MDBK cells. The purified peptide-specific antiserum from two different rabbits yielded similar results in Western blot studies in both cell lines (data not shown). In summary, this study provided evidence that ORF-A protein is expressed during productive infection.

3.6. ORF-A Transcription Profile is Consistent with Protein Synthesis

To observe if the transcript expression aligned with ORF-A protein expression, strand-specific RT-PCR was carried out with infected cells at similar time points to the protein expression experiment (Figure 8). The expression of ORF-A transcript was detectable at 4 hpi and fairly consistent throughout infection. Viral genes ribonucleotide reductase (RR) and glycoprotein C (gC) were also amplified as controls of early and late viral gene expression, respectively. GAPDH was used as a housekeeping control. Overall, the expression of RNA is in agreement with the observed protein expression (Figure 7).

4. Discussion

Mass spectrometry has allowed for numerous opportunities in the annotation of genomes in cases of not only eukaryotes, but also bacteria and viruses [26,40,41,42]. The original BoHV-1 structural annotation based on computational gene prediction methods is more than two decades old and originated from multiple strains or subtypes of the virus. Experimental approaches, such as proteogenomics adopted in this study allow for increased accuracy in the characterization of the protein content produced during viral infection, for instance to correct gene boundaries and identify novel ORFs. By working backwards from protein to RNA, we were able to amplify the transcripts that encode potential new proteins, providing further evidence for their existence and paving the way for opportunities to associate previously unaccounted viral functions to new protein(s). In the case of ORF-A, we were able to characterize the full transcript and verify the expression of ORF A protein product via Western blot.
The finding of 92 intergenic peptides was encouraging, and 21 of these peptides are embedded within a viable ORF. The remaining peptides may have no canonical start or stop in-frame codons in adjacent sequences. In other cases, peptides map to a complex area of viral genes in close proximity in the same strand but different frame. A likely explanation for the existence of these scattered but proximal peptides is splicing, ribosomal frame-shifting of certain viral mRNA during translation, or perhaps aberrant transcripts during late stages of infection. BoHV-1 exhibits complex alternative patterns of splicing in certain genes (LR gene, bICP0, bICP4 and bICP22, for example) [43,44,45]. As a result, one would not readily be able to predict an ORF in a given reading frame where an intergenic peptide mapped to. Alternatively, other initiation codons could be driving the synthesis of protein. Viruses have the potential to use alternative start codons and shift frames during transcription, which can increase the overall transcriptional capability [46,47]. Gene prediction algorithms scan for uninterrupted ORFs and search for traditional start and stop codons to predict potential ORFs, ignoring stop codon alternatives that have been noted in other organisms like UUG, CUU, or CCU [46,48,49]. In addition a plethora of codon usage alternatives have been documented in varied organisms, including several alternative start codons (https://www.ncbi.nlm.nih.gov/Taxonomy/Utils/wprintgc.cgi?mode=c). All these factors could confound gene prediction algorithms that missed these 21 ORFs.
It is also noteworthy that recent studies using deep sequencing of infected-cells transcriptomes have found a complex coding landscape for many herpesviruses [50], including long transcripts that were not known to exist. The transcriptome analysis of BoHV-1 would be a perfect future complement to our work and would elucidate some of the questions related to splicing and use of alternative start and stop codons.
Mapping the 5′ terminus of ORF-A transcript via RACE produced immediate and robust results indicating an unequivocal 5′ terminus for ORF-A transcript. However, RACE proved sub-optimal for mapping the 3′ terminus, leading us to use the primer walking strategy. This may be explained, in part, due to the exceptionally high GC content of the viral genome which interferes with PCR and may complicate primer design. In fact, two of the 21 intergenic peptides were not screened by strand specific RT-PCR because of difficulties in primer design. In the primer walking strategy, only one forward primer (located as upstream as possible) was used to avoid the amplification of downstream genes UL49 and UL49.5 that may or may not start transcription upstream, thus overlapping the ORF-A transcript. Therefore, we can only be certain of the existence of a transcript up to nucleotide 9529 (primer 13, Figure 5). The ORF-A transcript was successfully amplified with primers 1–3 and 5–13 (Table 1, Figure 5). Primer 13 yielded the longest amplicon of 1582 bp. If we assume the average molecular weight of an amino acid to be 110 Da, it can be predicted that a transcript of 1582 bp could direct the synthesis of a 50–55 kDa protein. This is an estimate, as we did not determine the transcription or the translation start sites of ORF-A. Further studies are needed to determine precise start and stop sites.
The viable ORF-A predicted around the MS/MS peptide was originally predicted to produce a protein of 15 kDa. After mapping the ORF-A transcript, we found that the transcript is at least 1582 nucleotides long (Figure 4 and Figure 5). Our result is consistent with the size of the ORF-A protein which is around 55 kDa as detected by Western blot analysis in both MDBK and RS cells (Figure 7). The original in silico-predicted ORF was smaller due to the existence of several stop codons that are in frame with the ORF-A MS/MS peptide. We know that splicing does not likely occur because the sequencing results show intact ampicons (Figure 6) that blasted back to the viral genome with no gaps (data not shown). While stop codon read-through is common in viral transcription [51,52,53], 9 in-frame stop codons may seem excessive, even if some of the stop codons localized to the 3′ untranslated region of the transcript. However, studies have shown potential for transcripts being produced by the readthrough of multiple stop codons in Drosophila melanogaster and Anopheles gambiae [54]. While the situation seems rare, this could be the case for ORF-A. Alternatively, if the ORF-A was smaller (with only a few readthrough events), the presence of post translational modifications such as phosphorylation or glycosylation on the protein could explain an apparently larger protein. Lastly, another common alternative in viral translation is ribosomal frameshifting [55,56]. However, this fact may not explain our results since stop codons also appear in the other two reading frames. Future work will likely elucidate the translational mechanism behind the synthesis of the ORF-A protein.
In conclusion, we have identified 21 intergenic peptides predicted to have a traditional ORF around them. The proof that 12 of them have an RNA associated is encouraging. In this study we further characterize one of these genes, which we have termed ORF-A. There are an additional 11 intergenic peptides that await discovery. Our results highlight the value of incorporating experimental data into gene discovery and annotation. By discovering and characterizing new genes, we expect to contribute to a more accurate BoHV-1 genome annotation, which could, in turn, provide insight to the field of vaccinology.

Supplementary Materials

The following are available online at https://www.mdpi.com/1999-4915/10/9/499/s1, Figure S1: Diagram representing the proteogenomic mapping process. Table S1: Genomic coordinates and abundance of viral peptides mapped by the proteogenomic tool.

Author Contributions

Conceptualization: F.M.; formal analysis: V.A.J., B.N., and F.M.; funding acquisition: F.M. and B.N.; investigation: C.J., B.N., and F.M.; methodology: V.A.J., K.A.B., and F.S.E.; project administration: F.M.; resources: F.M., B.N., and C.J.; software: V.A.J.; supervision: F.M.; validation: V.A.J.; visualization: V.A.J.; writing—original draft: V.A.J.; writing—review and editing: V.A.J., C.J., B.N., and F.M.

Funding

This research was supported by a grant from the USDA-NIFA Competitive Grants Program (Award # 2013-67016-21321) awarded to Florencia Meyer and in part by funding through the Mississippi Agriculture and Forestry Experimental Station Special Research Initiative (MAFES-SRI). K.A.B. and Victoria A. Jefferson were funded in part through the USDA-AFRI award stated above. The proteomics and mass spectrometry analysis were performed at the Institute for Genomics, Biocomputing and Biotechnology, Mississippi State University. The research in the lab of CJ was supported by a grant from the USDA-NIFA Competitive Grants Program (16-09370).

Acknowledgments

We wish to acknowledge Leslie A. Shack for mass spectrometry sample preparation, and the Institute for Genomics, Biocomputing and Biotechnology at Mississippi State University for the tandem mass spectrometry work and data analysis.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Thiry, J.; Keuser, V.; Muylkens, B.; Meurens, F.; Gogev, S.; Vanderplasschen, A.; Thiry, E. Ruminant alphaherpesviruses related to bovine herpesvirus 1. Vet. Res. 2006, 37, 169–190. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Metzler, A.E.; Schudel, A.A.; Engels, M. Bovine Herpesvirus 1: Molecular and antigenic characteristics of variant viruses isolated from calves with neurological disease. Arch. Virol. 1986, 87, 205–217. [Google Scholar] [CrossRef] [PubMed]
  3. Engels, M.; Giuliani, C.; Wild, P.; Beck, T.M.; Loepfe, E.; Wyler, R. The genome of bovine herpesvirus 1 (BHV-1) strains exhibiting a neuropathogenic potential compared to known BHV-1 strains by restriction site mapping and cross-hybridization. Virus Res. 1986, 6, 57–73. [Google Scholar] [CrossRef]
  4. Jones, C.; Chowdhury, S. Bovine herpesvirus type 1 (BHV-1) is an important cofactor in the bovine respiratory disease complex. Vet. Clin. N. Am. Food Anim. Pract. 2010, 26, 303–321. [Google Scholar] [CrossRef] [PubMed]
  5. D’Arce, R.C.; Almeida, R.S.; Silva, T.C.; Franco, A.C.; Spilki, F.; Roehe, P.M.; Arns, C.W. Restriction endonuclease and monoclonal antibody analysis of Brazilian isolates of bovine herpesviruses types 1 and 5. Vet. Microbiol. 2002, 88, 315–324. [Google Scholar] [CrossRef]
  6. Van Oirschot, J.T. Bovine herpesvirus 1 in semen of bulls and the risk of transmission: A brief review. Vet Q. 1995, 17, 29–33. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Jones, C.; Chowdhury, S. A review of the biology of bovine herpesvirus type 1 (BHV-1), its role as a cofactor in the bovine respiratory disease complex and development of improved vaccines. Anim. Health Res. Rev. 2007, 8, 187–205. [Google Scholar] [CrossRef] [PubMed]
  8. Kirchhoff, J.; Uhlenbruck, S.; Goris, K.; Keil, G.M.; Herrler, G. Three viruses of the bovine respiratory disease complex apply different strategies to initiate infection. Vet. Res. 2014, 45, 1–12. [Google Scholar] [CrossRef] [PubMed]
  9. Yates, W.D. A review of infectious bovine rhinotracheitis, shipping fever pneumonia and viral-bacterial synergism in respiratory disease of cattle. Can. J. Comp. Med. 1982, 46, 225–263. [Google Scholar] [PubMed]
  10. Neibergs, H.; Seabury, C.; Wojtowicz, A.J.; Wang, Z.; Scraggs, E.; Kiser, J.N.; Neupane, M.; Womack, J.E.; Van Eenennaam, A.; Hagevortm, G.R.; et al. Susceptibility loci revealed for bovine respiratory disease complex in pre-weaned holstein calves. BMC Genomics 2014, 15, 1–19. [Google Scholar] [CrossRef] [PubMed]
  11. Nataraj, C.; Eidmann, S.; Hariharan, M.J.; Sur, J.H.; Perry, G.A.; Srikumaran, S. Bovine Herpesvirus 1 downregulates the expression of bovine MHC class I molecules. Viral. Immunol. 1997, 10, 21–34. [Google Scholar] [CrossRef] [PubMed]
  12. Winkler, M.T.C.; Doster, A.; Jones, C. Bovine Herpesvirus 1 can infect CD4+ T lymphocytes and induce programmed cell death during acute infection of cattle. J. Virol. 1999, 73, 8657–8668. [Google Scholar] [PubMed]
  13. Jones, C. Herpes Simplex Virus Type 1 and Bovine Herpesvirus 1 Latency. Clin. Microbiol. Rev. 2003, 16, 79–95. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Turin, L.; Russo, S.; Poli, G. BHV-1: New Molecular Approaches to Control Common and Widespread Infection. Mol. Med. 1999, 5, 261–284. [Google Scholar] [CrossRef] [PubMed]
  15. Van Drunen Littel-van den Hurk, S. Rationale and perspectives on the success of vaccination against bovine herpesvirus-1. Vet. Microbiol. 2006, 113, 275–282. [Google Scholar] [CrossRef] [PubMed]
  16. Khattar, S.K.; van Drunen Littel-van den Hurk, S.; Babiuk, L.A.; Tikoo, S.K. Identification and transcriptional analysis of a 3′-coterminal gene cluster containing UL1, UL2, UL3, and UL3.5 open reading frames of bovine herpesvirus-1. Virology 1995, 213, 28–37. [Google Scholar] [CrossRef] [PubMed]
  17. Leung-Tack, P.; Audonnet, J.-C.; Riviere, M. The Complete DNA Sequence and the Genetic Organization of the Short Unique Region (US) of the Bovine Herpesvirus Type 1 (ST Strain). Virology 1994, 199, 409–421. [Google Scholar] [CrossRef] [PubMed]
  18. Pidone, C.L.; Galosi, C.M.; Echeverria, M.G.; Nosetto, E.O.; Etcheverrigaray, M.E. Restriction Endonuclease Analysis of BHV-1 and BHV-5 Strains Isolated in Argentina. Zentralbl. Veterinarmed. B 1999, 46, 453–456. [Google Scholar] [CrossRef] [PubMed]
  19. Simard, C.; Langlois, I.; Styger, D.; Vogt, B.; Vlcek, C.; Chalifour, A.; Trudel, M.; Schwyzer, M. Sequence Analysis of the UL39, UL38, and UL37 Homologues of Bovine Herpesvirus 1 and Expression Studies of UL40 and UL39, the Subunits of Ribonucleotide Reductase. Virology 1995, 212, 734–740. [Google Scholar] [CrossRef] [PubMed]
  20. Vlček, Č.; Beneš, V.; Lu, Z.; Kutish, G.F.; Pačes, V.; Rock, D.; Letchworth, G.J.; Schwyzer, M. Nucleotide Sequence Analysis of a 30-kb Region of the Bovine Herpesvirus 1 Genome Which Exhibits a Colinear Gene Arrangement with the UL21 to UL4 Genes of Herpes Simplex Virus. Virology 1995, 210, 100–108. [Google Scholar]
  21. Schwyzer, M.; Ackermann, M. Molecular virology of ruminant herpesviruses. Vet. Microbiol. 1996, 53, 17–29. [Google Scholar] [CrossRef]
  22. D’Offay, J.M.; Fulton, R.W.; Eberle, R. Complete genome sequence of the NVSL BoHV-1.1 Cooper reference strain. Arch. Virol. 2013, 158, 1109–1113. [Google Scholar] [CrossRef] [PubMed]
  23. Koonin, E.V.; Galperin, M.Y. Genome Annotation and Analysis. In Sequence-Evolution-Function: Computational Approaches in Comparative Genomics; Eugene, V.K., Michael, Y.G., Eds.; Kluwer Academic: Boston, MA, USA, 2003; pp. 193–226, ISBN-10: 1-40207-274-0. [Google Scholar]
  24. Gupta, N.; Benhamida, J.; Bhargava, V.; Goodman, D.; Kain, E.; Kerman, I.; Nguyen, N.; Ollikainen, N.; Rodriguez, J.; Wang, J.; et al. Comparative Proteogenomics: Combining Mass Spectrometry and Comparative Genomics to Analyze Multiple Genomes. Genome Res. 2008, 18, 1–16. [Google Scholar] [CrossRef] [PubMed]
  25. Kunec, D.; Nanduri, B.; Burgess, S.C. Experimental annotation of channel catfish virus by probabilistic proteogenomic mapping. Proteomics 2009, 9, 2634–2647. [Google Scholar] [CrossRef] [PubMed]
  26. Jaffe, J.D.; Berg, H.C.; Church, G.M. Proteogenomic mapping as a complementary method to perform genome annotation. Proteomics 2004, 4, 59–77. [Google Scholar] [CrossRef] [PubMed]
  27. Bechtel, J.T.; Winant, R.C.; Ganem, D. Host and Viral Proteins in the Virion of Kaposi’s Sarcoma-Associated Herpesvirus Host and Viral Proteins in the Virion of Kaposi’s Sarcoma-Associated Herpesvirus. Society 2005, 79, 4952–4964. [Google Scholar]
  28. Engel, E.A.; Song, R.; Koyuncu, O.O.; Enquist, L.W. Investigating the biology of alpha herpesviruses with MS-based proteomics. Proteomics 2015, 15, 1943–1956. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  29. Johannsen, E.; Luftig, M.; Chase, M.R.; Weicksel, S.; Cahir-McFarland, E.; Illanes, D.; Sarracino, D.; Kieff, E. Proteins of purified Epstein-Barr virus. Proc. Natl. Acad. Sci. USA 2004, 101, 16286–16291. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  30. Vidick, S.; Leroy, B.; Palmeira, L.; Machiels, B.; Mast, J.; François, S.; Wattiez, R.; Vanderplasschen, A.; Gillet, L. Proteomic characterization of murid herpesvirus 4 extracellular virions. PLoS ONE 2013, 8, e83842. [Google Scholar] [CrossRef] [PubMed]
  31. Zhu, F.X.; Chong, J.M.; Wu, L.; Yuan, Y. Virion Proteins of Kaposi’s Sarcoma-Associated Herpesvirus. 2005, 79, 800–811. J. virol. 2005, 79, 800–811. [Google Scholar] [CrossRef] [PubMed]
  32. Lété, C.; Palmeira, L.; Leroy, B.; Mast, J.; Machiels, B.; Wattiez, R.; Vanderplasschen, A.; Gillet, L. Proteomic Characterization of Bovine Herpesvirus 4 Extracellular Virions. J Virol. 2012, 86, 11567–11580. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. O’Connor, C.M.; Kedes, D.H. Mass Spectrometric Analyses of Purified Rhesus Monkey Rhadinovirus Reveal 33 Virion-Associated Proteins. J. Virol. 2006, 80, 1574–1583. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Kunec, D. Proteomics applied to avian herpesviruses. Avian Dis. 2013, 57, 351–359. [Google Scholar] [CrossRef] [PubMed]
  35. Flores, E.F.; Donis, R.O. Isolation of a Mutant MDBK Cell Line Resistant to Bovine Viral Diarrhea Virus Infection Due to a Block in Viral Entry. Virology 1995, 208, 565–575. [Google Scholar] [CrossRef] [PubMed]
  36. Rice, P.; Longden, I.; Bleasby, A. EMBOSS: The European molecular biology open software suite. Trends. Genet. 2000, 16, 276–277. [Google Scholar] [CrossRef]
  37. Geer, L.Y.; Markey, S.P.; Kowalak, J.A.; Wagner, L.; Xu, M.; Maynard, D.M.; Yang, X.; Shi, W.; Bryant, S.H. Open Mass Spectrometry Search Algorithm. J. Proteome Res. 2004, 3, 958–964. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Camacho, C.; Coulouris, G.; Avagyan, V.; Ma, N.; Papadopoulos, J.; Bealer, K.; Madden, T.L. BLAST+: Architecture and applications. BMC Bioinformatics 2009, 10, 421. [Google Scholar] [CrossRef] [PubMed]
  39. Quinlan, A.R.; Hall, I.M. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics. 2010, 26, 841–842. [Google Scholar] [CrossRef] [PubMed]
  40. Abd-Alla, A.M.M.; Kariithi, H.M.; Cousserans, F.; Parker, N.J.; İnce, İ.A.; Scully, E.D.; Boeren, S.; Geib, S.M.; Mekonnen, S.; Vlak, J.M.; et al. Comprehensive annotation of Glossina pallidipes salivary gland hypertrophy virus from Ethiopian tsetse flies: A proteogenomics approach. J. Gen. Virol. 2016, 97, 1010–1031. [Google Scholar] [CrossRef] [PubMed]
  41. Zhu, Y.; Engstrom, P.G.; Tellgren-Roth, C.; Baudo, C.D.; Kennell, J.C.; Sun, S.; Blake Billmyre, R.; Sch oder, M.S.; Andersson, A.; Holm, T.; et al. Proteogenomics produces comprehensive and highly accurate protein-coding gene annotation in a complete genome assembly of Malassezia sympodialis. Nucleic Acids Res. 2017, 45, 2629–2643. [Google Scholar] [PubMed]
  42. Zhu, Y.; Orre, L.M.; Johansson, H.J.; Huss, M.; Boekel, J.; Vesterlund, M.; Fernandez-Woodbridge, A.; Branca, R.M.M.; Lehtiö, J. Discovery of coding regions in the human genome by integrated proteogenomics analysis workflow. Nat. Commun. 2018, 9, 903. [Google Scholar] [CrossRef] [PubMed]
  43. Devireddy, L.R.; Jones, C. Alternative splicing of the latency-related transcript of bovine herpesvirus 1 yields RNAs containing unique open reading frames. J. Virol. 1998, 72, 7294–7301. [Google Scholar] [PubMed]
  44. Devireddy, L.R.; Zhang, Y.; Jones, C.J. Cloning and initial characterization of an alternatively spliced transcript encoded by the bovine herpes virus 1 latency-related gene. J. Neurovirol. 2003, 9, 612–622. [Google Scholar] [CrossRef] [PubMed]
  45. Wirth, U.V.; Vogt, B.; Schwyzer, M. The three major immediate-early transcripts of bovine herpesvirus 1 arise from two divergent and spliced transcription units. J. Virol. 1991, 65, 195–205. [Google Scholar] [PubMed]
  46. Firth, A.E.; Brierley, I. Non-canonical translation in RNA viruses. J. Gen. Virol. 2012, 93, 1385–1409. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Sasaki, J. Nakashima, N. Translation initiation at the CUU codon is mediated by the internal ribosome entry site of an insect picorna-like virus in vitro. J. Virol. 1999, 73, 1219–1226. [Google Scholar] [PubMed]
  48. Schneider, R.; Agol, V.I.; Andino, R.; Bayard, F.; Cavener, D.R.; Chappell, S.A.; Chen, J.J.; Darlix, J.L.; Dasgupta, A.; Donzé, O.; et al. New ways of initiating translation in eukaryotes? Mol. Cell Biol. 2001, 21, 8238–8246. [Google Scholar] [CrossRef] [PubMed]
  49. Chen, S.-J.; Lin, G.; Chang, K.-J.; Yeh, L.-S.; Wang, C.-C. Translational efficiency of a non-AUG initiation codon is significantly affected by its sequence context in yeast. J. Biol. Chem. 2008, 283, 3173–3180. [Google Scholar] [CrossRef] [PubMed]
  50. Tombácz, D.; Balázs, Z.; Csabai, Z.; Snyder, M.; Boldogkői, Z. Long-Read Sequencing Revealed an Extensive Transcript Complexity in Herpesviruses. Front. Genet. 2018, 9, 259. [Google Scholar] [CrossRef] [PubMed]
  51. Yoshinaka, Y.; Katoh, I.; Copeland, T.D.; Oroszlan, S. Murine leukemia virus protease is encoded by the gag-pol gene and is synthesized through suppression of an amber termination codon. Proc. Natl. Acad. Sci. USA 1985, 82, 1618–1622. [Google Scholar] [CrossRef] [PubMed]
  52. Feng, Y.X.; Levin, J.G.; Hatfield, D.L.; Schaefer, T.S.; Gorelick, R.J.; Rein, A. Suppression of UAA and UGA termination codons in mutant murine leukemia viruses. J. Virol. 1989, 63, 2870–2873. [Google Scholar] [PubMed]
  53. Dreher, T.W.; Miller, W.A. Translational control in positive strand RNA plant viruses. Virology 2006, 344, 185–197. [Google Scholar] [CrossRef] [PubMed]
  54. Jungreis, I.; Chan, C.S.; Waterhouse, R.M.; Fields, G.; Lin, M.F.; Kellis, M. Evolutionary Dynamics of Abundant Stop Codon Readthrough. Mol. Biol. Evol. 2016, 33, 3108–3132. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Jacks, T.; Varmus, H.E. Expression of the Rous sarcoma virus pol gene by ribosomal frameshifting. Science 1985, 230, 1237–1242. [Google Scholar] [CrossRef] [PubMed]
  56. Liljeqvist, J.-Å.; Svennerholm, B.; Bergström, T. Herpes Simplex Virus Type 2 Glycoprotein G-Negative Clinical Isolates Are Generated by Single Frameshift Mutations. J. Virol. 1999, 73, 9796–9802. [Google Scholar] [PubMed]
Figure 1. MS/MS peptides mapped to the BoHV-1 genome. The 134 kb genome is displayed in the six possible reading frames (six different tracks). Currently annotated viral ORFs are shown in green in their corresponding reading frame; gene names are shown on the outside of the circle. MS/MS spectra are shown both as a histogram in the center as well as mapped onto the corresponding reading frame. Spectra mapping to coding regions are shown in red, while intergenic spectra (refer to text for explanation) appear in blue. Twenty-one viable ORFs surrounding intergenic spectra are shown in yellow in the corresponding reading frame.
Figure 1. MS/MS peptides mapped to the BoHV-1 genome. The 134 kb genome is displayed in the six possible reading frames (six different tracks). Currently annotated viral ORFs are shown in green in their corresponding reading frame; gene names are shown on the outside of the circle. MS/MS spectra are shown both as a histogram in the center as well as mapped onto the corresponding reading frame. Spectra mapping to coding regions are shown in red, while intergenic spectra (refer to text for explanation) appear in blue. Twenty-one viable ORFs surrounding intergenic spectra are shown in yellow in the corresponding reading frame.
Viruses 10 00499 g001
Figure 2. Quantification of unique BoHV-1 MS/MS spectra. Number of MS/MS spectra mapping to annotated viral ORFs (red) vs un-annotated genomic regions (blue). Yellow represents the number of spectra that mapped to un-annotated genomic regions that also contain nearby stop and start codons. The term unique refers to any peptide that maps to a single location of the genome. The bar graph is additive and not absolute. Numbers 0, 41, and 57 atop each bar represent the number of known viral proteins successfully identified.
Figure 2. Quantification of unique BoHV-1 MS/MS spectra. Number of MS/MS spectra mapping to annotated viral ORFs (red) vs un-annotated genomic regions (blue). Yellow represents the number of spectra that mapped to un-annotated genomic regions that also contain nearby stop and start codons. The term unique refers to any peptide that maps to a single location of the genome. The bar graph is additive and not absolute. Numbers 0, 41, and 57 atop each bar represent the number of known viral proteins successfully identified.
Viruses 10 00499 g002
Figure 3. RT-PCR for two intergenic sequences. (a) PCR for cellular gene actin and viral gene ribonucleotide reductase (RR) on mock and cells infected for 16 h. (b) PCR amplification was carried out using primers for putative ORF-A and ORF-B for the same mock and infected cells. RT indicates the presence (+) or absence (-) of a retrotranscriptase step with mRNA as a template.
Figure 3. RT-PCR for two intergenic sequences. (a) PCR for cellular gene actin and viral gene ribonucleotide reductase (RR) on mock and cells infected for 16 h. (b) PCR amplification was carried out using primers for putative ORF-A and ORF-B for the same mock and infected cells. RT indicates the presence (+) or absence (-) of a retrotranscriptase step with mRNA as a template.
Viruses 10 00499 g003
Figure 4. 5′ RACE product for ORF-A. Infected cells were subjected to mRNA isolation followed by RACE protocol (Invitrogen GeneRacer) to map the 5′ terminus of the ORF A transcript. After the initial primers lead to poor PCR amplification (left), a nested PCR (right) was performed using the initial RACE product as a template. The 300 bp amplicon (black arrow) was excised and sent for sequencing.
Figure 4. 5′ RACE product for ORF-A. Infected cells were subjected to mRNA isolation followed by RACE protocol (Invitrogen GeneRacer) to map the 5′ terminus of the ORF A transcript. After the initial primers lead to poor PCR amplification (left), a nested PCR (right) was performed using the initial RACE product as a template. The 300 bp amplicon (black arrow) was excised and sent for sequencing.
Viruses 10 00499 g004
Figure 5. Primer walking for elucidation of ORF-A 3′ terminus. Strand-specific RT-PCR of BoHV-1.1-infected cell mRNA extracts was was performed on cDNA produced from BoHV-1 infected cells using 16 different reverse primers that anneal further down (3′) to the viral genome. Successful amplification allowed for capture of increasing lengths of the ORF-A transcript sequence. RT indicates the presence (+) or absence (-) of a retrotranscriptase step with mRNA as a template. (a) Amplicons were produced with primers 1–8 using a 1-min extension time during PCR. (b) Amplicons were produced using primers 9-16 with a 2-min extension time to account for the increase in amplicon length.
Figure 5. Primer walking for elucidation of ORF-A 3′ terminus. Strand-specific RT-PCR of BoHV-1.1-infected cell mRNA extracts was was performed on cDNA produced from BoHV-1 infected cells using 16 different reverse primers that anneal further down (3′) to the viral genome. Successful amplification allowed for capture of increasing lengths of the ORF-A transcript sequence. RT indicates the presence (+) or absence (-) of a retrotranscriptase step with mRNA as a template. (a) Amplicons were produced with primers 1–8 using a 1-min extension time during PCR. (b) Amplicons were produced using primers 9-16 with a 2-min extension time to account for the increase in amplicon length.
Viruses 10 00499 g005
Figure 6. ORF-A Transcript Sequence. Sequence for ORF-A as determined by 5′ RACE and 3′ primer walking. The sequenced 5′ RACE product is shaded light gray and the darker gray shading indicates the location of the forward primer used for the 3′ walking strategy. Yellow highlighted text shows the sequence that encodes the experimental peptide found via MS/MS. Potential canonical start and stop codons are indicated in green and red, respectively. Italic text is the region of the transcript that has been confirmed by sequencing. Single and double underlined text show the binding site for primers 13 and 14, respectively.
Figure 6. ORF-A Transcript Sequence. Sequence for ORF-A as determined by 5′ RACE and 3′ primer walking. The sequenced 5′ RACE product is shaded light gray and the darker gray shading indicates the location of the forward primer used for the 3′ walking strategy. Yellow highlighted text shows the sequence that encodes the experimental peptide found via MS/MS. Potential canonical start and stop codons are indicated in green and red, respectively. Italic text is the region of the transcript that has been confirmed by sequencing. Single and double underlined text show the binding site for primers 13 and 14, respectively.
Viruses 10 00499 g006
Figure 7. Immunoblot for ORF-A protein production. MDBK (panel a) or RS cells (panel b) were infected with BoHV-1 at the indicated times after infection (hours). Proteins were separated via SDS-PAGE gel (50 ug protein/lane). Western blots were probed using the ORF-A peptide specific antiserum. Tubulin was used as a protein loading control. These results are the average of three independent experiments. A protein around 55 kDa was detected in both infected cell types. Peptide-speific antisera from two rabbits yielded similar results in both cell lines. Data from only one antiserum is shown here.
Figure 7. Immunoblot for ORF-A protein production. MDBK (panel a) or RS cells (panel b) were infected with BoHV-1 at the indicated times after infection (hours). Proteins were separated via SDS-PAGE gel (50 ug protein/lane). Western blots were probed using the ORF-A peptide specific antiserum. Tubulin was used as a protein loading control. These results are the average of three independent experiments. A protein around 55 kDa was detected in both infected cell types. Peptide-speific antisera from two rabbits yielded similar results in both cell lines. Data from only one antiserum is shown here.
Viruses 10 00499 g007
Figure 8. ORF-A is transcribed consistently throughout infection. CRIB cells infected with BoHV-1 over five infection time points (in hours, indicated at the top), followed by RNA extraction and gene specific RT-PCR. ORF-A transcript expression at the top, along with early viral gene ribonucleotide reductase (RR), late viral gene glycoprotein C (gC) and cellular gene GAPDH. Mock, uninfected cells.
Figure 8. ORF-A is transcribed consistently throughout infection. CRIB cells infected with BoHV-1 over five infection time points (in hours, indicated at the top), followed by RNA extraction and gene specific RT-PCR. ORF-A transcript expression at the top, along with early viral gene ribonucleotide reductase (RR), late viral gene glycoprotein C (gC) and cellular gene GAPDH. Mock, uninfected cells.
Viruses 10 00499 g008
Table 1. (a) Table of primers used for PCR amplification. (b) Reverse primers used for strand-specific retrotranscription during the 3′ walk strategy outlined in the text and in Figure 5.
Table 1. (a) Table of primers used for PCR amplification. (b) Reverse primers used for strand-specific retrotranscription during the 3′ walk strategy outlined in the text and in Figure 5.
(a)
TargetForwardReverse
AATGTGCTTTGTTTTACAGCCCGGTACGTGCGGCTGCCCCGTAG
BATGGCCGCGATCATGTACGGGTCGCTGGCTGCTGTGGTTCATGGA
ActinCGTGACATTAAGGAGAAGCTGTGCCTCAGGAGGAGCAATGATCTTGAT
GAPDHCCATGGAGAAGGCTGGGGCAAAGTTGTCATGGATGACC
RRTTTTACGAGACCGAGTGCCCGACGAAAAGGTTGTGGGTGC
gCTGATCGCAGCTATTTTCGCCTTCTGGGCTACGAACAGCAG
5′ RACEprovided by InvitrogenCGTGGCGCAGCTGCTTCTGCTGG
5′ Nested RACEprovided by InvitrogenGCTGTCAACGATACGCTACGTAACG
(b)
Reverse Primer
1GCGAAACGTCACGCAGGA
2GCCCTACGTGTTTGGTCGAT
3GAGGACGCCGGGTACGATA
4ACGCTCGCGCGGCTAA
5GCTACTCGGGGATCTTGCG
6CCCGGCGGATATGCCATTA
7GTGAGCCCGACTCGCTG
8CGCAACGATGAGCGGCG
9CAGGGCCACGGCGACCATTA
10GCCGGGACCGCGAGCTT
11CGGGCCGAGTGCGAAAAAGG
12TTCGGAGGGCCTGTGGAACC
13CGGCGCGCAGCTCTTCGTAT
14GGTCGGCGGCAGCTC
15AGACGCAACAGCATTAGCACCGGGGG
16GACGGCCTCGCAAAAGATCGTTCGGT
Table 2. Summary of proteogenomic mapping results.
Table 2. Summary of proteogenomic mapping results.
2 hpi8 hpi16 hpiTotal b
Total viral peptides449819912493
Total annotated peptides039416622056
Unique a annotated peptides0132342344
Viral genes identified0415757
Total intergenic peptides4104329437
Uniquea intergenic peptides2317992
Intergenic peptides with viable ORF072121
a unique = mapping to a single location on the BoHV-1 genome. b correcting for peptides that appeared in more than one timepoint. hpi, hours poist infection.

Share and Cite

MDPI and ACS Style

Jefferson, V.A.; Barber, K.A.; El-mayet, F.S.; Jones, C.; Nanduri, B.; Meyer, F. Proteogenomic Identification of a Novel Protein-Encoding Gene in Bovine Herpesvirus 1 That Is Expressed during Productive Infection. Viruses 2018, 10, 499. https://doi.org/10.3390/v10090499

AMA Style

Jefferson VA, Barber KA, El-mayet FS, Jones C, Nanduri B, Meyer F. Proteogenomic Identification of a Novel Protein-Encoding Gene in Bovine Herpesvirus 1 That Is Expressed during Productive Infection. Viruses. 2018; 10(9):499. https://doi.org/10.3390/v10090499

Chicago/Turabian Style

Jefferson, Victoria A., Kaley A. Barber, Fouad S. El-mayet, Clinton Jones, Bindu Nanduri, and Florencia Meyer. 2018. "Proteogenomic Identification of a Novel Protein-Encoding Gene in Bovine Herpesvirus 1 That Is Expressed during Productive Infection" Viruses 10, no. 9: 499. https://doi.org/10.3390/v10090499

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop