Next Article in Journal
Analysis of Swine Conventional Dendritic Cells, DEC205+CD172a+/−CADM1+, from Blood and Spleen in Response to PRRSV and PEDV
Next Article in Special Issue
Emergence of Rare Bovine–Human Reassortant DS-1-Like Rotavirus A Strains with G8P[8] Genotype in Human Patients in the Czech Republic
Previous Article in Journal
The Roles of Ebola Virus Soluble Glycoprotein in Replication, Pathogenesis, and Countermeasure Development
Previous Article in Special Issue
Novel Viruses in Mosquitoes from Brazilian Pantanal
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The First Complete Genome Sequences of Hepatitis C Virus Subtype 2b from Latin America: Molecular Characterization and Phylogeographic Analysis

by
Natália Spitz
1,
José J. Barros
1,
Kycia M. do Ó
2,
Carlos E. Brandão-Mello
3 and
Natalia M. Araujo
1,*
1
Laboratory of Molecular Virology, Oswaldo Cruz Institute, FIOCRUZ, Rio de Janeiro RJ 21040-360, Brazil
2
Viral Hepatitis Advisory Committee of the Ministry of Health, Brasilia DF 70058-900, Brazil
3
Gaffrée & Guinle Universitary Hospital, Federal University of Rio de Janeiro State, UNIRIO, Rio de Janeiro RJ 20270-901, Brazil
*
Author to whom correspondence should be addressed.
Viruses 2019, 11(11), 1000; https://doi.org/10.3390/v11111000
Submission received: 16 September 2019 / Revised: 30 September 2019 / Accepted: 10 October 2019 / Published: 31 October 2019
(This article belongs to the Special Issue Emerging Viruses: Surveillance, Prevention, Evolution and Control)

Abstract

:
The hepatitis C virus (HCV) has remarkable genetic diversity and exists as eight genotypes (1 to 8) with distinct geographic distributions. No complete genome sequence of HCV subtype 2b (HCV-2b) is available from Latin American countries, and the factors underlying its emergence and spread within the continent remain unknown. The present study was conducted to determine the first full-length genomic sequences of HCV-2b isolates from Latin America and reconstruct the spatial and temporal diversification of this subtype in Brazil. Nearly complete HCV-2b genomes isolated from two Brazilian patients were obtained by direct sequencing of long PCR fragments and analyzed together with reference sequences using the Bayesian coalescent and phylogeographic framework approaches. The two HCV-2b genomes were 9318 nucleotides (nt) in length (nt 37–9354). Interestingly, the long RT-PCR technique was able to detect co-circulation of viral variants that contained an in-frame deletion of 2022 nt encompassing E1, E2, and p7 proteins. Spatiotemporal reconstruction analyses suggest that HCV-2b had a single introduction in Brazil during the early 1980s, displaying an epidemic history characterized by a low and virtually constant population size until the present time. These results coincide with epidemiological data in Brazil and may explain the low national prevalence of this subtype.

1. Introduction

The hepatitis C virus (HCV) is a single-stranded positive-sense RNA virus belonging to the Flaviviridae family and is the leading etiologic agent of chronic liver disease [1]. According to the WHO, an estimated 71 million people worldwide suffer from chronic hepatitis C infection, resulting in 399,000 deaths mainly attributable to cirrhosis and hepatocellular carcinoma [2]. The HCV genome, which is approximately 9600 nucleotides (nt) long, contains two short untranslated regions at each end (5’UTR and 3’UTR) along with a single open reading frame encoding three structural (core, E1, and E2) and seven non-structural (p7, NS2, NS3, NS4A, NS4B, NS5A, and NS5B) proteins.
The high rate of replication (up to 1012 particles produced per day) coupled with an error-prone mechanism results in the extremely high sequence diversity of HCV [3]. In particular, HCV variants characterized by large in-frame genomic deletions typically encompassing E1–NS2 regions have been detected in patient sera and liver tissues [4,5,6,7,8,9]. These subgenomic HCV species, believed to be generated from the full-length viral genome through polymerase errors, can be transpackaged into infectious virions in the presence of a wild-type virus [5]. So far, their biological roles are yet to be established.
Due to accumulating mutations during the natural course of history, HCV isolates are classified into eight genotypes (1 to 8) and numerous subtypes [10,11]. The official classification of a new HCV type minimally requires one complete genome sequence to differ from other sequences by at least 30% (genotype) or 15% (subtype) [11,12]. Despite the revolutionary progress recently made with direct-acting antivirals (DAAs) against HCV, identification of the HCV genotype/subtype remains an important requirement for effective treatment decisions and serves as a good predictor of treatment success [13,14]. Therefore, HCV whole-genome sequencing appears useful for the accurate determination of viral genotypes and resistance-associated substitutions throughout the DAA target genome regions (NS3, NS5A, and NS5B) as well as identifying recombinant or rare viral types [15,16,17].
The global distribution of HCV isolates is characterized by regional variations in genotype/subtype prevalence and potentially influenced by historical and contemporary trends in human migration. HCV subtypes 1a, 1b, 2a, 2b, 2c, and 3a have worldwide distribution and account for a large proportion of HCV infections. On the other hand, many of the other HCV subtypes are comparatively rare and circulate in more restricted geographical areas such as endemic strains from genotypes 1 and 2 in West Africa, 3 in South Asia, 4 in Central Africa and the Middle East, 5 in Southern Africa, 6 in South East Asia, 7 in Canada [18,19], and 8 in India [10].
While HCV genotype 1 is the most prevalent genotype in the Latin American region, genotype 2 isolates have been successful in terms of establishment and dissemination in different American countries [20,21]. Specifically, HCV subtype 2b (HCV-2b) has been detected throughout the Caribbean (Martinique) and Central (Mexico) and South (Brazil, Colombia and Venezuela) America, as determined based on the HCV sequences reported to the Los Alamos database (https://hcv.lanl.gov/content/sequence/HCV/ToolsOutline.html). However, no complete genome sequences of HCV-2b are available from these countries, limiting the contribution of Latin American isolates to phylogenetic and phylogeographic studies. Moreover, the evolutionary history of HCV-2b in Brazil has not been investigated. Accordingly, the main objectives of the current study were to determine the first full-length HCV-2b genomes from Latin America and reconstruct the spatial and temporal diversification of this subtype in Brazil.

2. Materials and Methods

2.1. Patients

HCV-2b isolates were obtained from two patients referred to the Gaffrée & Guinle University Hospital in Rio de Janeiro, Brazil. Patient 1 was a 59 year-old woman serologically positive for anti-HCV since 2018 with a METAVIR fibrosis score F0 (no fibrosis) and no prior history of HCV treatment. Patient 2 was an 81 year-old woman serologically positive for anti-HCV since 1999 with a METAVIR fibrosis score F4 (cirrhosis) and no current HCV treatment. This patient had a prior history of non-response to interferon alfa-2a and ribavirin combination therapy. The study protocol was approved by the Ethics Committee of Oswaldo Cruz Institute (no. 2.927.747) and informed consent was obtained from both patients.

2.2. Long Template RT-PCR and Genome Sequencing

The RT-PCR assay was optimized to amplify the nearly complete genome of HCV-2b via two long PCR fragments (fragment 1 (nt 37 to 5406) and fragment 2 (nt 4535 to 9354); Figure 1).
For each genomic fragment, total RNA was extracted from 200 μL serum using a High Pure Viral Nucleic Acid Kit (Roche Diagnostics, Mannheim, Germany) according to the manufacturer’s instructions. Eluted RNA was precipitated at −20 °C overnight with 0.10 volumes of 3 M sodium acetate and 2.5 volumes of absolute ethanol and resuspended in 9.5 μL diethyl pyrocarbonate (DEPC)-treated water. For reverse transcription of HCV RNA, the total volume of precipitated RNA (9.5 μL) was incubated with 0.5 mM dNTPs and 0.1 μM anti-sense primer HCV5423R (fragment 1) and HCV9373R (fragment 2) (Table 1) at 65 °C for 5 min and subsequently placed on ice for 3 min. Next, cDNA synthesis was conducted by adding 200 U SuperScript IV Reverse Transcriptase, 4 μL 5X SSIV Buffer, 5 mM DTT, and 2 U RNaseOUT RNase Inhibitor (Invitrogen, Carlsbad, CA, USA) to the mixture and incubating at 53 °C for 10 min. The reaction was inactivated at 80 °C for 10 min, followed by the addition of 2 U of RNase H (Invitrogen) at 37 °C for 20 min for RNA removal. Special care was taken during all the mixing steps to avoid shearing of the long RNA template.
Amplification of the two genomic fragments was performed using nested PCR. The first round of PCR was carried out in a 25 μL reaction volume containing 3 μL cDNA, 0.2 μM outer primers HCV17S and HCV5423R (fragment 1), HCV4328S and HCV9373R (fragment 2) (Table 1), 2.5 μL 10X High Fidelity PCR Buffer (Invitrogen), 2 mM MgSO4, 0.2 mM dNTPs, 1 μL dimethyl sulfoxide (DMSO) (Life Technologies, Carlsbad, CA, USA), and 0.5 U Platinum Taq DNA Polymerase High Fidelity (Invitrogen). The second round of PCR was performed in a 50 μL reaction volume consisting of 2 μL of the first assay product, 0.2 μM inner primers HCV37S and HCV5406R (fragment 1) and HCV4535S and HCV9354R (fragment 2) (Table 1), 5 μL 10× High Fidelity PCR Buffer (Invitrogen), 2.0 mM MgSO4, 0.2 mM dNTPs, 2 μL DMSO (Life Technologies), and 1 U Platinum Taq DNA Polymerase High Fidelity (Invitrogen). Thermal conditions for both PCR assays included an initial denaturation step at 94 °C for 2 min, 35 cycles at 94 °C for 30 s, 55 °C for 30 s, and 68 °C for 5 min, and a final extension step at 68 °C for 10 min. PCR products were resolved on 1% agarose gel and purified with the Wizard SV Gel and PCR Clean-Up System (Promega, Madison, WI, USA). HCV full-length genome sequences were determined via direct sequencing using a BigDye Terminator Kit v3.1 (Applied Biosystems, Foster City, CA, USA) and a set of 17 specific primers distributed along the HCV genome (Table 1). Sequencing reactions were analyzed on an ABI3730xl automated sequencer (Applied Biosystems).

2.3. Phylogenetic and Genetic Analyses

Multiple sequence alignment was performed using the MUSCLE program and subjected to Maximum Likelihood (ML) phylogenetic analysis. ML phylogenetic trees were inferred with the online version of the PhyML program [22] under the GTR + I + G nucleotide substitution model selected with SMS (Smart Model Selection in PhyML) [23]. A heuristic tree search was performed with the aid of the SPR branch-swapping algorithm and the reliability of phylogeny estimated with the approximate likelihood-ratio test [24] based on a Shimodaira–Hasegawa-like procedure (SH-aLRT). To identify possible recombination, bootscan analyses of full-length sequences were performed in the SimPlot software program version 3.5.1 [25] with a sliding window size of 200 bp and step size of 20 bp increment. Potential DAA resistance-associated substitutions among the NS3, NS5A, and NS5B genomic regions were investigated using the Geno2pheno[HCV] interpretation system [26]. HCV reference isolate HC-J6CH (genotype 2; GenBank accession number NC_009823) was used for numbering the nucleotides, amino acids, and deletions [27]. Mean genetic distances between groups of full-length HCV subtype 2b sequences from different geographical regions were estimated via Kimura 2-parameter analysis.

2.4. HCV-2b Datasets

Two datasets containing HCV-2b sequences retrieved from public databases (Los Alamos and GenBank) were constructed. Full listings of accession numbers and sampling locations are provided in Table S1. Dataset 1 comprised 181 partial NS5B sequences (353 nt, nucleotide positions 8694–9046) with known country of origin as well as the two genomes sequenced in this study. As sampling dates were not available for all sequences included in dataset 1, a smaller dataset (dataset 2) specifically comprising isolates with known collection dates (n = 134) was created. Sequences were aligned using MUSCLE software [28]. The phylogenetic signal of the aligned nucleotide sequences from datasets 1 and 2 was examined using a test of substitution saturation [29] implemented in the DAMBE7 program [30].

2.5. Molecular Clock Analysis

The substitution rate (nucleotide substitutions per site per year, s/s/y) and time of the most recent common ancestor tMRCA (years) were inferred based on sequences sampled at different time-points (dataset 2) using a Bayesian Markov Chain Monte Carlo (MCMC) approach implemented in BEAST v1.10.4 [31] along with BEAGLE v3.1 to improve run time [32]. The temporal structure of the dataset was assessed by conducting regression of root-to-tip genetic distances against year of sampling using TempEst v1.5 [33]. Analyses were performed using the GTR + I + G nucleotide substitution model, which was the optimal model selected using the JModelTest [34,35] and the Bayesian Skyline coalescent tree prior [36]. The most suitable clock model was selected after running the analysis separately using strict and relaxed (uncorrelated lognormal) clocks. MCMC was run for 400 million generations with sampling every 40,000 generations. The effective sample size (ESS) value for each parameter was >100, indicating sufficient mixing of the Markov chain.

2.6. Bayesian Phylogeographic Analyses

Spatial reconstruction was achieved by applying a reversible discrete Bayesian phylogeographic model [37] using dataset 1. The Bayesian stochastic search variable selection (BSSVS) model was implemented, which allows a zero diffusion rate with a positive prior probability. Uncertainty of parameter estimates was assessed after excluding the initial 10% of the run by calculating the 95% Highest Probability Density (HPD) values using the TRACER v1.7.1 program. Maximum clade credibility (MCC) trees were summarized from the posterior distribution of trees with TreeAnnotator and visualized with FigTree v1.4.3.

3. Results and Discussion

Here, we successfully optimized the RT-PCR assay to amplify long fragments of the HCV-2b genome. HCV whole-genome sequencing is technically challenging and most assays described to date use several overlapping amplicons [17], which is time-consuming and increases selective bias due to the use of multiple primers. With the methodology described in this study, near-full-length genome sequences of HCV-2b were obtained by direct sequencing of only two overlapping amplicons (Figure 1).
The complete genome of isolate PAT1 (from patient 1) consisted of 9318 nt excluding the polypyrimidine tract, with a G + C content of 56.2% harboring the 10 HCV genomic regions. The complete PAT1 sequence has been deposited in the GenBank database under the accession number MN385563. Bootscan analysis reflected no evidence of recombination between different HCV genotypes (Figure 2a) or HCV-2 subtypes (Figure 2b). Moreover, no potential DAA resistance-associated substitutions were detected in PAT1.
To establish whether our RT-PCR assay could amplify other HCV-2b isolates, serum sample from a second chronic hepatitis C patient (patient 2) was included for study. Unexpectedly, a deletion of 2022 nt (genome positions 964–2985) encompassing E1, E2, p7, and the 5’ end of NS2 genomic regions was observed in one of the amplicons. A PCR primer pair (2bDel1 and HCV3007R) flanking the deleted region was used to investigate co-infection with wild-type isolates (Table 1). Using this strategy, we were able to amplify the deleted genomic region and obtain the complete genome of the PAT2 isolate. Patient 2 was an 81 year-old cirrhotic patient, corroborating earlier findings of an association of HCV large in-frame deletion mutants with patient age and increased necroinflammatory activity in the liver [5]. Previous reports have demonstrated a prevalence of these mutants in 19–26% of infected patients and proposed a potential role in viral persistence [5,6]. PAT2 contained 9318 nt excluding the polypyrimidine tract, with a G + C content of 55.8% harboring the 10 HCV genomic regions. The complete genome sequence of PAT2 has been deposited in the GenBank database under the accession number MN385564. Bootscan analysis revealed no evidence of recombination between different HCV genotypes (Figure 2c) or HCV-2 subtypes (Figure 2d). No potential DAA resistance-associated substitution was detected in PAT2.
Phylogenetic reconstructions based on complete genome and partial NS5B sequences of representative HCV genotypes 1 to 8 led to the classification of PAT1 and PAT2 isolates as HCV-2b (Figure 3a,b). Interestingly, PAT1 and PAT2 clustered in a well-supported monophyletic clade (aLRT = 0.84) together with all Brazilian NS5B HCV-2b sequences (Figure 3b). The mean genetic distance between our two Brazilian full-length HCV-2b genome sequences and 105 HCV-2b genomes from other regions worldwide was additionally calculated (Table 2). Comparable values (0.079–0.107) were obtained among all geographical groups analyzed (Australia, Brazil, China, Denmark, France, Japan, and the USA), indicating that HCV-2b is genetically similar among global isolates (Table 2). In fact, HCV epidemic subtypes such as HCV-2b are characterized by low genetic diversity, high prevalence, and a global distribution. On the other hand, endemic strains are more spatially restricted but harbor greater genetic diversity due to low transmission rates and their centuries-long persistence in geographically restricted areas [38,39].
For a more comprehensive analysis of the time and epicenter of diversification of HCV-2b in Brazil, a Bayesian MCMC analysis was conducted on dataset 1 (181 partial NS5B sequences from databanks and two from this study). Datasets 1 and 2 showed the appropriate phylogenetic signal for consistent phylogenetic and molecular clock inferences, since no substitution saturation was detected using either the test developed by Xia et al. (2003) [29] (Table S2) or transition and transversion versus divergence graphics (Supplemental Figure S1). Dataset 2 had a temporal structure, as revealed by the positive correlation coefficient between genetic divergence and time (Supplemental Figure S2), indicating that the timescale of HCV could be directly estimated from sampling dates of selected sequences. The analysis under the strict molecular clock model showed no convergence, even after combining two independent runs with 400 million states. Under the relaxed clock model, the estimated mean substitution rate of HCV-2b NS5B was 1.06 × 10−3 (95% HPD: 4.87 × 10−4–1.66 × 10−3) s/s/y, similar to previously estimated NS5B gene substitution rates [40]. The coefficient of rate variation for HCV-2b was significantly higher than zero, supporting the use of the relaxed molecular clock model.
Data from spatiotemporal reconstruction analysis showed that HCV-2b Brazilian sequences grouped into a highly supported monophyletic cluster (posterior probability (PP) = 0.89), indicating a single introduction of this subtype into the country. The Netherlands was the most probable source of HCV-2b introduction into Brazil (PSP = 0.31), which may have occurred during the early 1980s (tMRCA = 1982; 95% HPD: 1976–1990) (Figure 4). Notably, HCV subtypes 2a and 2b account for 10% of chronic HCV infections in the Netherlands [41]. The increase in parenteral routes of transmission such as contaminated blood products, transfusion, injecting drug use, and invasive medical procedures during the 20th century resulted in global dispersal of the current epidemic subtypes including HCV-2b [38,39,42]. Interestingly, the Bayesian skyline plot of Brazilian HCV-2b showed a low and virtually constant population size lasting until the present time (Figure 5). Expansion of HCV-2b in Brazil might have been hampered by the measures applied to prevent parenteral transmission of other viral agents such as HBV and HIV during the 1980s, followed by the introduction of anti-HCV screening tests in blood banks in 1993. These results corroborate previous reports [43] that the growth rate of the major HCV subtypes 1a, 1b, and 3a in Brazil has decreased since 1980–1995, suggesting that expansion of HCV may have been effectively controlled. Furthermore, according to Lampe et al. (2010) [43], the main Brazilian HCV subtypes potentially started to circulate within the country after 1940, with multiple independent introductions of each subtype into the population. Conversely, our findings suggest a single and more recent introduction of HCV-2b in Brazil, which might explain the lower prevalence of this subtype countrywide [44].
In conclusion, the long RT-PCR assay developed in this study facilitated the determination of the first complete genome sequences of HCV-2b from Latin America, increasing the contribution of Latin American isolates to ongoing phylogenetic and phylogeographic studies. Moreover, HCV-2b displayed a different epidemic history from the major Brazilian subtypes characterized by a relatively late and unique introduction into the country during the early 1980s. Elucidation of the history of the spread of HCV and associated epidemiological processes should be helpful in improving targeted HCV prevention and screening campaigns, and ultimately, the development of novel strategies to eliminate HCV.

Supplementary Materials

The following are available online at https://www.mdpi.com/1999-4915/11/11/1000/s1. Figure S1: Transition and transversion versus divergence graphics performed for nucleotide sequence alignments. The transitions (s, blue) and transversions (v, green) are plotted against genetic distance according to the general time reversible (GTR) model. (a) Dataset 1. (b) Dataset 2. Figure S2: Root-to-tip regression analysis of temporal structure. Plot of the root-to-tip genetic distance against sampling time is shown for phylogeny estimated from the alignment. Each dot represents a sequence. R2 value is given as an indicator of the degree to which evolution has been clock-like. Table S1: GenBank accession numbers of the 181 partial NS5B HCV-2b sequences used in the Bayesian phylogeographic analysis. Table S2: Index of substitution rate (Iss) and critical index of substitution rate (Iss.c) of the NS5B sequence datasets used for Bayesian analysis.

Author Contributions

Conceptualization, N.S., J.J.B., and N.M.A.; Methodology, N.S. and J.J.B.; Formal Analysis, N.M.A.; Investigation, N.S., J.J.B., and N.M.A.; Resources, K.M.d.Ó., C.E.B.-M., and N.M.A.; Funding Acquisition, N.M.A.; Writing—Original Draft Preparation, N.M.A.; Writing—Review and Editing, N.S., J.J.B., K.M.d.Ó., C.E.B.-M., and N.M.A.

Funding

This research was funded by the Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq), grant number 428676/2018-9.

Acknowledgments

The authors thank Nathalia Alves Araujo de Almeida for collecting the patient data, and the DNA Sequencing Platform (RPT01A-VPPCB/FIOCRUZ) of the Oswaldo Cruz Foundation for technical support.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Webster, D.P.; Klenerman, P.; Dusheiko, G.M. Hepatitis C. Lancet 2015, 385, 1124–1135. [Google Scholar] [CrossRef] [Green Version]
  2. WHO. Hepatitis C; Key Facts; World Health Organization: Geneva, Switzerland; Available online: https://www.who.int/en/news-room/fact-sheets/detail/hepatitis-c (accessed on 14 February 2019).
  3. Neumann, A.U.; Lam, N.P.; Dahari, H.; Gretch, D.R.; Wiley, T.E.; Layden, T.J.; Perelson, A.S. Hepatitis C viral dynamics in vivo and the antiviral efficacy of interferon-alpha therapy. Science 1998, 282, 103–107. [Google Scholar] [CrossRef] [PubMed]
  4. Bernardin, F.; Stramer, S.L.; Rehermann, B.; Page-Shafer, K.; Cooper, S.; Bangsberg, D.R.; Hahn, J.; Tobler, L.; Busch, M.; Delwart, E. High levels of subgenomic HCV plasma RNA in immunosilent infections. Virology 2007, 365, 446–456. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  5. Cheroni, C.; Donnici, L.; Aghemo, A.; Balistreri, F.; Bianco, A.; Zanoni, V.; Pagani, M.; Soffredini, R.; D’Ambrosio, R.; Rumi, M.G.; et al. Hepatitis C Virus Deletion Mutants Are Found in Individuals Chronically Infected with Genotype 1 Hepatitis C Virus in Association with Age, High Viral Load and Liver Inflammatory Activity. PLoS ONE 2015, 10, e0138546. [Google Scholar] [CrossRef]
  6. Karamichali, E.; Chihab, H.; Kakkanas, A.; Marchio, A.; Karamitros, T.; Pogka, V.; Varaklioti, A.; Kalliaropoulos, A.; Martinez-Gonzales, B.; Foka, P.; et al. HCV Defective Genomes Promote Persistent Infection by Modulating the Viral Life Cycle. Front. Microbiol. 2018, 9, 2942. [Google Scholar] [CrossRef]
  7. Li, Q.; Tong, Y.; Xu, Y.; Niu, J.; Zhong, J. Genetic Analysis of Serum-Derived Defective Hepatitis C Virus Genomes Revealed Novel Viral cis Elements for Virus Replication and Assembly. J. Virol. 2018, 92. [Google Scholar] [CrossRef]
  8. Noppornpanth, S.; Smits, S.L.; Lien, T.X.; Poovorawan, Y.; Osterhaus, A.D.; Haagmans, B.L. Characterization of hepatitis C virus deletion mutants circulating in chronically infected patients. J. Virol. 2007, 81, 12496–12503. [Google Scholar] [CrossRef]
  9. Yagi, S.; Mori, K.; Tanaka, E.; Matsumoto, A.; Sunaga, F.; Kiyosawa, K.; Yamaguchi, K. Identification of novel HCV subgenome replicating persistently in chronic active hepatitis C patients. J. Med. Virol. 2005, 77, 399–413. [Google Scholar] [CrossRef]
  10. Borgia, S.M.; Hedskog, C.; Parhy, B.; Hyland, R.H.; Stamm, L.M.; Brainard, D.M.; Subramanian, M.G.; McHutchison, J.G.; Mo, H.; Svarovskaia, E.; et al. Identification of a Novel Hepatitis C Virus Genotype From Punjab, India: Expanding Classification of Hepatitis C Virus Into 8 Genotypes. J. Infect. Dis. 2018, 218, 1722–1729. [Google Scholar] [CrossRef] [Green Version]
  11. Smith, D.B.; Bukh, J.; Kuiken, C.; Muerhoff, A.S.; Rice, C.M.; Stapleton, J.T.; Simmonds, P. Expanded classification of hepatitis C virus into 7 genotypes and 67 subtypes: Updated criteria and genotype assignment web resource. Hepatology 2014, 59, 318–327. [Google Scholar] [CrossRef]
  12. Simmonds, P.; Bukh, J.; Combet, C.; Deleage, G.; Enomoto, N.; Feinstone, S.; Halfon, P.; Inchauspe, G.; Kuiken, C.; Maertens, G.; et al. Consensus proposals for a unified system of nomenclature of hepatitis C virus genotypes. Hepatology 2005, 42, 962–973. [Google Scholar] [CrossRef] [PubMed]
  13. European Association for the Study of the Liver. EASL Recommendations on Treatment of Hepatitis C 2018. J. Hepatol. 2018, 69, 461–511. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Gotte, M.; Feld, J.J. Direct-acting antiviral agents for hepatitis C: Structural and mechanistic insights. Nat. Rev. Gastroenterol. Hepatol. 2016, 13, 338–351. [Google Scholar] [CrossRef] [PubMed]
  15. Jacka, B.; Lamoury, F.; Simmonds, P.; Dore, G.J.; Grebely, J.; Applegate, T. Sequencing of the Hepatitis C Virus: A Systematic Review. PLoS ONE 2013, 8, e67073. [Google Scholar] [CrossRef] [PubMed]
  16. Poljak, M. Next-Generation Sequencing: A Diagnostic One-Stop Shop for Hepatitis C? J. Clin. Microbiol. 2016, 54, 2427–2430. [Google Scholar] [CrossRef] [PubMed]
  17. Tremeaux, P.; Caporossi, A.; Thelu, M.A.; Blum, M.; Leroy, V.; Morand, P.; Larrat, S. Hepatitis C virus whole genome sequencing: Current methods/issues and future challenges. Crit. Rev. Clin. Lab. Sci. 2016, 53, 341–351. [Google Scholar] [CrossRef] [PubMed]
  18. Messina, J.P.; Humphreys, I.; Flaxman, A.; Brown, A.; Cooke, G.S.; Pybus, O.G.; Barnes, E. Global distribution and prevalence of hepatitis C virus genotypes. Hepatology 2015, 61, 77–87. [Google Scholar] [CrossRef]
  19. Smith, D.B.; Pathirana, S.; Davidson, F.; Lawlor, E.; Power, J.; Yap, P.L.; Simmonds, P. The origin of hepatitis C virus genotypes. J. Gen. Virol. 1997, 78 Pt 2, 321–328. [Google Scholar] [CrossRef]
  20. Cristina, J. Genetic diversity and evolution of hepatitis C virus in the Latin American region. J. Clin. Virol. 2005, 34 (Suppl. S2), S1–S7. [Google Scholar] [CrossRef]
  21. Kershenobich, D.; Razavi, H.A.; Sanchez-Avila, J.F.; Bessone, F.; Coelho, H.S.; Dagher, L.; Goncales, F.L.; Quiroz, J.F.; Rodriguez-Perez, F.; Rosado, B.; et al. Trends and projections of hepatitis C virus epidemiology in Latin America. Liver Int. 2011, 31 (Suppl. S2), 18–29. [Google Scholar] [CrossRef]
  22. Guindon, S.; Dufayard, J.F.; Lefort, V.; Anisimova, M.; Hordijk, W.; Gascuel, O. New algorithms and methods to estimate maximum-likelihood phylogenies: Assessing the performance of PhyML 3.0. Syst. Biol. 2010, 59, 307–321. [Google Scholar] [CrossRef] [PubMed]
  23. Lefort, V.; Longueville, J.E.; Gascuel, O. SMS: Smart Model Selection in PhyML. Mol. Biol. Evol. 2017, 34, 2422–2424. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Anisimova, M.; Gascuel, O. Approximate likelihood-ratio test for branches: A fast, accurate, and powerful alternative. Syst. Biol. 2006, 55, 539–552. [Google Scholar] [CrossRef]
  25. Lole, K.S.; Bollinger, R.C.; Paranjape, R.S.; Gadkari, D.; Kulkarni, S.S.; Novak, N.G.; Ingersoll, R.; Sheppard, H.W.; Ray, S.C. Full-length human immunodeficiency virus type 1 genomes from subtype C-infected seroconverters in India, with evidence of intersubtype recombination. J. Virol. 1999, 73, 152–160. [Google Scholar]
  26. Kalaghatgi, P.; Sikorski, A.M.; Knops, E.; Rupp, D.; Sierra, S.; Heger, E.; Neumann-Fraune, M.; Beggel, B.; Walker, A.; Timm, J.; et al. Geno2pheno[HCV]-A Web-based Interpretation System to Support Hepatitis C Treatment Decisions in the Era of Direct-Acting Antiviral Agents. PLoS ONE 2016, 11, e0155869. [Google Scholar] [CrossRef]
  27. Yanagi, M.; Purcell, R.H.; Emerson, S.U.; Bukh, J. Hepatitis C virus: An infectious molecular clone of a second major genotype (2a) and lack of viability of intertypic 1a and 2a chimeras. Virology 1999, 262, 250–263. [Google Scholar] [CrossRef] [PubMed]
  28. Edgar, R.C. MUSCLE: A multiple sequence alignment method with reduced time and space complexity. BMC Bioinform. 2004, 5, 113. [Google Scholar] [CrossRef] [PubMed]
  29. Xia, X.; Xie, Z.; Salemi, M.; Chen, L.; Wang, Y. An index of substitution saturation and its application. Mol. Phylogenet. Evol. 2003, 26, 1–7. [Google Scholar] [CrossRef]
  30. Xia, X. DAMBE7: New and Improved Tools for Data Analysis in Molecular Biology and Evolution. Mol. Biol. Evol. 2018, 35, 1550–1552. [Google Scholar] [CrossRef] [Green Version]
  31. Suchard, M.A.; Lemey, P.; Baele, G.; Ayres, D.L.; Drummond, A.J.; Rambaut, A. Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10. Virus Evol. 2018, 4, vey016. [Google Scholar] [CrossRef] [Green Version]
  32. Suchard, M.A.; Rambaut, A. Many-core algorithms for statistical phylogenetics. Bioinformatics 2009, 25, 1370–1376. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Rambaut, A.; Lam, T.T.; Max Carvalho, L.; Pybus, O.G. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol. 2016, 2, vew007. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Darriba, D.; Taboada, G.L.; Doallo, R.; Posada, D. jModelTest 2: More models, new heuristics and parallel computing. Nat. Methods 2012, 9, 772. [Google Scholar] [CrossRef] [PubMed]
  35. Guindon, S.; Gascuel, O. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. 2003, 52, 696–704. [Google Scholar] [CrossRef]
  36. Drummond, A.J.; Rambaut, A.; Shapiro, B.; Pybus, O.G. Bayesian coalescent inference of past population dynamics from molecular sequences. Mol. Biol. Evol. 2005, 22, 1185–1192. [Google Scholar] [CrossRef]
  37. Lemey, P.; Rambaut, A.; Drummond, A.J.; Suchard, M.A. Bayesian phylogeography finds its roots. PLoS Comput. Biol. 2009, 5, e1000520. [Google Scholar] [CrossRef]
  38. Pybus, O.G.; Charleston, M.A.; Gupta, S.; Rambaut, A.; Holmes, E.C.; Harvey, P.H. The epidemic behavior of the hepatitis C virus. Science 2001, 292, 2323–2325. [Google Scholar] [CrossRef]
  39. Simmonds, P. Genetic diversity and evolution of hepatitis C virus-15 years on. J. Gen. Virol. 2004, 85, 3173–3188. [Google Scholar] [CrossRef]
  40. Magiorkinis, G.; Magiorkinis, E.; Paraskevis, D.; Ho, S.Y.; Shapiro, B.; Pybus, O.G.; Allain, J.P.; Hatzakis, A. The global spread of hepatitis C virus 1a and 1b: A phylodynamic and phylogeographic analysis. PLoS Med. 2009, 6, e1000198. [Google Scholar] [CrossRef]
  41. Van de Laar, T.J.; Koppelman, M.H.; van der Bij, A.K.; Zaaijer, H.L.; Cuijpers, H.T.; van der Poel, C.L.; Coutinho, R.A.; Bruisten, S.M. Diversity and origin of hepatitis C virus infection among unpaid blood donors in the Netherlands. Transfusion 2006, 46, 1719–1728. [Google Scholar] [CrossRef]
  42. Markov, P.V.; van de Laar, T.J.; Thomas, X.V.; Aronson, S.J.; Weegink, C.J.; van den Berk, G.E.; Prins, M.; Pybus, O.G.; Schinkel, J. Colonial history and contemporary transmission shape the genetic diversity of hepatitis C virus genotype 2 in Amsterdam. J. Virol. 2012, 86, 7677–7687. [Google Scholar] [CrossRef] [PubMed]
  43. Lampe, E.; Espirito-Santo, M.P.; Martins, R.M.; Bello, G. Epidemic history of Hepatitis C virus in Brazil. Infect. Genet. Evol. 2010, 10, 886–895. [Google Scholar] [CrossRef] [PubMed]
  44. Lampe, E.; Lewis-Ximenez, L.; Espirito-Santo, M.P.; Delvaux, N.M.; Pereira, S.A.; Peres-da-Silva, A.; Martins, R.M.; Soares, M.A.; Santos, A.F.; Vidal, L.L.; et al. Genetic diversity of HCV in Brazil. Antivir. Ther. 2013, 18, 435–444. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Generalized approach to amplify the full-length HCV genome.
Figure 1. Generalized approach to amplify the full-length HCV genome.
Viruses 11 01000 g001
Figure 2. Analysis of potential recombination events of the PAT1 and PAT2 isolates. (a) BootScan plot of the PAT1 and HCV genotypes. (b) BootScan plot of the PAT1 and HCV-2 subtypes. (c) BootScan plot of the PAT2 and HCV genotypes. (d) BootScan plot of the PAT2 and HCV-2 subtypes. The parameters used for analysis are shown at the bottom of the figures. All analyses were performed within a window of 200 bp and step size of 20 bp under the Kimura 2-parameter model.
Figure 2. Analysis of potential recombination events of the PAT1 and PAT2 isolates. (a) BootScan plot of the PAT1 and HCV genotypes. (b) BootScan plot of the PAT1 and HCV-2 subtypes. (c) BootScan plot of the PAT2 and HCV genotypes. (d) BootScan plot of the PAT2 and HCV-2 subtypes. The parameters used for analysis are shown at the bottom of the figures. All analyses were performed within a window of 200 bp and step size of 20 bp under the Kimura 2-parameter model.
Viruses 11 01000 g002
Figure 3. Phylogenetic analysis of HCV sequences. Brazilian HCV sequences are depicted in green. The sequences generated in this study are highlighted with the symbol ●. Reference sequences are indicated by the accession number followed by subtype. The numbers in branches indicate statistical support (aLRT value). (a) Maximum Likelihood (ML) phylogenetic tree of HCV complete genome sequences. (b) ML phylogenetic tree of HCV NS5B sequences.
Figure 3. Phylogenetic analysis of HCV sequences. Brazilian HCV sequences are depicted in green. The sequences generated in this study are highlighted with the symbol ●. Reference sequences are indicated by the accession number followed by subtype. The numbers in branches indicate statistical support (aLRT value). (a) Maximum Likelihood (ML) phylogenetic tree of HCV complete genome sequences. (b) ML phylogenetic tree of HCV NS5B sequences.
Viruses 11 01000 g003
Figure 4. Bayesian maximum clade credibility tree of HCV-2b NS5B sequences. Branches are colored according to the potential locations of the parental node (colored legends in the figure). The scale at the bottom of the tree represents the years before the last sampling time. The numbers on the internal nodes represent posterior probabilities (pp). The tree was automatically rooted under the assumption of a relaxed molecular clock.
Figure 4. Bayesian maximum clade credibility tree of HCV-2b NS5B sequences. Branches are colored according to the potential locations of the parental node (colored legends in the figure). The scale at the bottom of the tree represents the years before the last sampling time. The numbers on the internal nodes represent posterior probabilities (pp). The tree was automatically rooted under the assumption of a relaxed molecular clock.
Viruses 11 01000 g004
Figure 5. Bayesian skyline plot showing the epidemic history of Brazilian HCV-2b. Median (solid line) and upper and lower 95% HPD (solid area) estimates of the effective population size (Y-axis; log10 scale) through time (X-axis; calendar years) are presented in the graph.
Figure 5. Bayesian skyline plot showing the epidemic history of Brazilian HCV-2b. Median (solid line) and upper and lower 95% HPD (solid area) estimates of the effective population size (Y-axis; log10 scale) through time (X-axis; calendar years) are presented in the graph.
Viruses 11 01000 g005
Table 1. Primers used for amplification and sequencing of the HCV genome.
Table 1. Primers used for amplification and sequencing of the HCV genome.
PrimerPrimer Sequence 5′–3′SensePosition a
RT-nested-PCR primers
HCV17SGGCGACACTCCGCCATGAATCACTforward17–40
HCV37SCACTCCCCTGTGAGGAACTACTGTCTTCACGforward37–67
2bDel1CTCSAACARCAGCATYACYTGGCforward961–983
HCV3007RCACHAGGCGTGGGTGBAGAATGreverse3007–2986
HCV4328SATCCTTGGCATTGGAACRGTCCTYGACCforward4328–4355
HCV4535SCATTCAAAGAAGAAGTGCGAYGAGCTforward4535–4560
HCV5406RCGGCCDATGATGGAAAYGCAGCCreverse5406–5384
HCV5423RGATCATTCAGGTGTADGCGGCCreverse5423–5402
HCV9354RCTGTGAWADATGTCGCCCCCGreverse9354–9334
HCV9373RGGGTCGGGCATGCGACACGCTGTGAWADATGTCreverse9373–9341
Sequencing primers
HCV37SCACTCCCCTGTGAGGAACTACTGTCTTCACGforward37–67
S7AGACCGTGCACCATGAGCACforward329–348
A5TACGCCGGGGGTCAKTRGGGCCCCAreverse683–659
HCV944STACGCCACYAATGATTGCTCforward944–963
Del1597CTGGCACATAAATCGGACCGforward1597–1616
HCV3007SCATTCTVCACCCACGCCTDGTGforward3007–2986
HCV3920SGCCAARTCYATTGACTTCATCCCforward3920–3942
HCV4356RTGGTCRAGGACYGTTCCRATGCCreverse4356–4334
HCV4535CATTCAAAGAAGAAGTGCGAYGAGCTforward4535–4560
HCV5285SATCGCCACGTGCATGCARGCTforward5285–5305
HCV5406RCGGCCDATGATGGAAAYGCAGCCreverse5406–5384
HCV6339RGACAGCCAGTTYTTRAAGTCTGreverse6339–6318
HCV6984STGAAGGCYACCTGYACCACYCAforward6984–7005
HCV7059RTCRCCYCCCATGAAVAGRTTreverse7059–7040
PR4GCNGARTAYCTVGTCATAGCCTCreverse8709–8687
HCV9354RCTGTGAWADATGTCGCCCCCGreverse9354–9334
HCV9140SCTTGGAGCGCCTCCCCTYAGforward9140–9159
a With reference to numbering nucleotides of HCV-2a (HC-J6CH).
Table 2. Mean genetic distances between groups of full-length HCV subtype 2b sequences from different geographical regions.
Table 2. Mean genetic distances between groups of full-length HCV subtype 2b sequences from different geographical regions.
BrazilFranceUSAJapanAustraliaChinaDenmark
Brazil 0.0030.0020.0030.0030.0030.003
France0.090 0.0020.0030.0020.0040.004
USA0.1000.097 0.0020.0020.0030.002
Japan0.1060.1020.098 0.0020.0030.002
Australia0.1030.1010.0990.101 0.0030.003
China0.1050.1020.1010.0980.101 0.004
Denmark0.1070.1010.0950.0790.1000.097
The gray values represent the standard deviation.

Share and Cite

MDPI and ACS Style

Spitz, N.; Barros, J.J.; do Ó, K.M.; Brandão-Mello, C.E.; Araujo, N.M. The First Complete Genome Sequences of Hepatitis C Virus Subtype 2b from Latin America: Molecular Characterization and Phylogeographic Analysis. Viruses 2019, 11, 1000. https://doi.org/10.3390/v11111000

AMA Style

Spitz N, Barros JJ, do Ó KM, Brandão-Mello CE, Araujo NM. The First Complete Genome Sequences of Hepatitis C Virus Subtype 2b from Latin America: Molecular Characterization and Phylogeographic Analysis. Viruses. 2019; 11(11):1000. https://doi.org/10.3390/v11111000

Chicago/Turabian Style

Spitz, Natália, José J. Barros, Kycia M. do Ó, Carlos E. Brandão-Mello, and Natalia M. Araujo. 2019. "The First Complete Genome Sequences of Hepatitis C Virus Subtype 2b from Latin America: Molecular Characterization and Phylogeographic Analysis" Viruses 11, no. 11: 1000. https://doi.org/10.3390/v11111000

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop