Next Article in Journal
Rapid and Sensitive Detection of Salmonella via Immunomagnetic Separation and Nanoparticle-Enhanced SPR
Previous Article in Journal
Rising Threats of MRSA and Carbapenem-Resistant Acinetobacter in Residential Care Homes for the Elderly During COVID-19 in Hong Kong
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Molecular Epidemiology of Hepatitis B Virus and Its Resistance-Associated Mutations in the Polymerase Gene in the Americas

by
Itzel A. Ruvalcaba
1,
Carlos Daniel Diaz-Palomera
2,
Adrián Alejandro Silva-Ríos
1,
José Francisco Muñoz-Valle
3 and
Oliver Viera-Segura
3,*
1
Licenciatura en Médico Cirujano y Partero, Centro Universitario de Ciencias de la Salud, Universidad de Guadalajara, Guadalajara 44340, Mexico
2
Laboratorio de Investigación en Cáncer e Infecciones, Departamento de Microbiología y Patología, Centro Universitario de Ciencias de la Salud, Universidad de Guadalajara, Guadalajara 44340, Mexico
3
Instituto en Investigación en Ciencias Biomédicas, Centro Universitario de Ciencias de la Salud, Universidad de Guadalajara, Guadalajara 44340, Mexico
*
Author to whom correspondence should be addressed.
Microorganisms 2025, 13(8), 1913; https://doi.org/10.3390/microorganisms13081913
Submission received: 5 July 2025 / Revised: 11 August 2025 / Accepted: 13 August 2025 / Published: 16 August 2025
(This article belongs to the Section Virology)

Abstract

The hepatitis B virus (HBV) is a DNA virus of major public health concern whose error-prone polymerase has driven the emergence of ten distinct genotypes and a multitude of resistance-associated mutations (RAMs). Herein, we conducted a retrospective observational study analyzing 8152 hepatitis B virus (HBV) sequences from 27 regions across the Americas, retrieved from GenBank, to construct a database and examine associations among HBV genotypes/subtypes, geographic distribution, resistance-associated mutations (RAMs), and resistance to nucleos(t)ide analogs (NAs) used in the treatment of chronic infection. Following phylogenetic analysis, mutations at clinically relevant sites in the reverse transcriptase domain were identified and classified by resistance to NAs. Genotypes A (21.1% A2 and 14.7% A1) and D predominated across the retrieved database, whereas genotypes E, G, H, and I each accounted for fewer than 3% of the sequences. Among the sequences in the database, 10.6% harbored RAMs, with genotypes G, A, and H predominating in this category. The most frequently observed RAM was L180M + M204V/I, which is associated with resistance to LMV, ETV, and TBV, whereas resistance to ADV and TDF remained rare. Genotypes G and A2 were significantly associated with a higher likelihood of harboring multiple RAMs (as evaluated by logistic regression), along with an increased risk of resistance to LMV, ETV, and TBV; the opposite was true for subtype A1. Notably, genotypes H and B5 were associated with an elevated risk of TDF resistance. A comprehensive understanding of RAMs and circulating genotypes in the Americas is essential for identifying high-risk populations and establishing geographically targeted therapeutic strategies.

1. Introduction

The World Health Organization (WHO) recognizes hepatitis B as a major public health problem, responsible for approximately 5 million infections in the Americas, with only a 4.4% treatment coverage and a 20,000 annual mortality rate [1]. The hepatitis B virus (HBV) belongs to the Hepadnaviridae family and can be transmitted either horizontally or vertically, with perinatal transmission being the predominant route in endemic countries [2]. Following infection, HBV undergoes an incubation period of one to four months, during which approximately 1/3rd of individuals develop symptoms, and 0.5% experience fulminant hepatic failure [3]. After initial acute exposure, the risk of chronicity is 90% in neonates, 30% in children infected at 1–4 years of age, and <5% in those infected in adulthood [1].
HBV is an enveloped icosahedral virus with a partially double-stranded, relaxed circular DNA (rcDNA) genome, approximately 3.2 kb in length, generated through reverse transcription [4]. The structure consists of four overlapping open reading frames (ORFs) from which viral proteins required for replication and pathogenesis are produced: pre-core/core (pre-C/C), DNA polymerase (P), transcriptional co-activator (X), and surface proteins (pre-S/S). Mutations in these regions can result in immune escape (C and S), drug resistance (P), oncogenicity (X and S), vaccine failure (S), and occult infection (S) [5]. Due to its limited proofreading capacity and error-prone polymerase, HBV has a mutation rate of 10−4 to 10−5 substitutions/site/year, 10-fold higher than that of other DNA viruses, and close to those of RNA viruses, which present mutation rates from 10−4 to 10−6 [5]. This contributes to extensive viral diversity, with quasispecies, subtypes, and genotypes continually emerging. Currently, 10 HBV genotypes (A–J) and over 40 subtypes have been identified, differentiated by nucleotide divergence rates of >7.5% and 4–7.5%, respectively [6].
Current evidence suggests that the most recent common ancestor of all HBV lineages existed between 20,000 and 12,000 years ago, facilitating the global distribution of early genotypes (A and D) [7]. Genotype A includes eight subtypes, predominantly found in Europe, South Africa, and Brazil. Genotype D comprises twelve subtypes, with a geographic distribution centered in the Mediterranean, Russia, and India [6]. Genotype G has been described as an atypical genotype, whose re-emergence after millennia of low persistence is commonly associated with the human immunodeficiency virus (HIV) pandemic [8]. The modern strain of genotype H likely originated in México, subsequently spreading to Nicaragua in the 1960s and, later, to more distant countries [9]. Genotype F can be divided into six subtypes (F1a–c, F2a–b, and F3–6), all with different areas of concentration in Central and South America. Genotype I arose from a recombination event between genotypes C, G, and A and accounts for only 0.32% of the global burden of chronic infections, mostly centered in East Asia [10]. Genotype E is highly localized in West Africa but is estimated to account for 20% of total chronic infections [11]. Genotype J is a recombinant of genotype C4 (typically in Australian Aborigines) and HBV from gibbons/orangutans, possibly stemming from the island of Borneo during World War II [11,12]. Genotypes B (B1–10) and C (C1–17) are primarily concentrated in Asia [13]; however, genotype B is also found in the Arctic Circle, and genotype C in the Pacific islands [6]. It is important to note that these genotypic differences may lead to variations in immune escape potential, oncogenicity, and responses to treatment, depending on the prevailing selective pressures [14].
The ability of HBV to develop resistance-associated mutations (RAMs) that reduce susceptibility to antiviral drugs used in treatment regimens is well documented. This propensity for resistance development is significantly influenced by HBV’s high substitution rate, which, coupled with an exceptionally high replication rate (peaking at 1013 virions per day), makes the daily generation of all possible single-nucleotide mutations and most double mutations mathematically probable [15]. Consequently, a single infected individual can harbor a diverse population of genetic variants known as quasispecies. While RAMs can exist as pre-treatment quasispecies within an individual, they can become dominant under the selective pressure imposed by antiviral therapy. In HBV, these mutations are found in the reverse transcriptase (RT) domain and can be classified as primary, compensatory, putative, and pre-treatment mutations. Within these categories, 42 RAMs have been identified in the HBV polymerase [16].
The primary objectives of antiviral treatment are to prevent progression to cirrhosis and hepatocellular carcinoma. The recommended first-line therapy consists of oral nucleos(t)ide analogs (NAs) with a high genetic barrier and a low rate of resistance associated with long-term use. Depending on their activation mechanism, these can be categorized into nucleotide analogs (adefovir and tenofovir) and nucleoside analogs (lamivudine, telbivudine, and entecavir), which function by causing DNA chain termination. Long-term treatments for chronic HBV infection may become ineffective due to the emergence of viral mutations conferring drug resistance [17,18,19]. Existing evidence indicates that resistance to NAs increases with duration of use, with older antivirals showing higher rates of resistance and cross-resistance after prolonged administration [20,21,22,23,24]. Also, unlike the hepatitis C virus, for which we already have a cure, the cccDNA from HBV can integrate into the host nucleus, resisting antivirals [25]. Genotypes and subtypes may vary in their disease presentation, outcome severity, and resistance patterns to NAs. Molecular epidemiological studies focusing on the burden of RAMs could support the development of public health interventions and guide early therapeutic adjustments in patients at high risk for resistance.

2. Methods

2.1. Study Design

This retrospective observational study aimed to examine the relationship between the HBV genotype (and their subtypes), geographic origin, specific mutations (S106C, H126Y, N236T, L269I, D134E, A181T/V, A194T, S202C/G/I, V173L, M204V/I, L269I + H126Y, L180M + M204V/I, L180M + M204V/I + V173L, and L180M + M204V/I + I169T), and resistance to five nucleos(t)ide analogs (NAs) approved for the treatment of chronic infection: lamivudine (LMV), adefovir (ADV), entecavir (ETV), telbivudine (TBV), and tenofovir (TDF and TAF) (Figure 1). Emtricitabine (FTC) was not included in this study as it is not yet FDA-approved for HBV treatment, with reports of severe exacerbations upon discontinuation in HIV co-infected patients [26].
The database includes 8152 HBV sequences and corresponding metadata (e.g., genotype, year of isolation, geographic origin, mutations, and resistance) obtained from the public NCBI GenBank database (https://www.ncbi.nlm.nih.gov/genbank/ [accessed on 30 March 2023]). Information on comorbidities or prior antiviral treatment of the patients from whom viral sequencing samples were obtained was extracted from associated publications. Sequences were retrieved between January and March 2023 using the keywords “hepatitis B virus” with the [organism] specification, followed by the Boolean operator “AND” and the corresponding country name ([country]). Clone sequences and those whose chronological or geographical information was incomplete or foreign to the American continent were excluded. Additionally, those sequences with an incomplete or absent reverse transcriptase domain were removed from the database.

2.2. Sequence Classification and Phylogenetic Analysis

The sequences were classified into three different groups:
  • Cluster 1: Genotype and subtype identified in GenBank or associated publication (n = 3358).
  • Cluster 2: Genotype provided; subtype not reported (n = 4071).
  • Cluster 3: Neither genotype nor subtype reported (n = 723).
All three groups were further divided by country and region of genomic coverage (complete or partial). The 306 resulting groups were subjected to multiple alignment using the ClustalW v.2.1 program on the Galaxy platform (https://usegalaxy.org/ [accessed on 30 March 2023]). Alignments were manually verified, and in cases requiring correction, MEGA X v.10.2.6 (Pennsylvania, PA, USA) and AliView v.1.28 (Uppsala, Sweden) were used.
Cluster 1 was subjected to confirmatory phylogenetic analyses. Of these, 84 sequences were reclassified according to our findings (Supplementary Figure S1). Cluster 2 underwent phylogenetic analysis to determine subtypes (Supplementary Figure S2), while cluster 3 was analyzed to determine both genotype (Supplementary Figure S3) and subtype (Supplementary Figure S2). A total of 202 reference sequences were used for the genotyping and subgenotyping (Supplementary Figure S4 and Table S1), encompassing known HBV genotypes A-J and their subtypes, where applicable. For consistency and due to the controversial classification of genotypes A and B [6,27,28], we employed the extended classification scheme, avoiding the grouping of A3, A4, A5, A7, and B3, B5, B7, B8, and B9. In our analyses, subtypes D3 and D6 were grouped due to the phylogenetic similarities displayed and the lack of clarity in their separation while classifying the sequences [29].
Phylogenetic trees were constructed using the maximum-likelihood (ML) algorithm with the Tamura-Nei model and 1000 bootstrap replicates, implemented in MEGA X. Trees were exported in “.nwk” format and subsequently visualized, analyzed, and recorded using FigTree v.1.4.4. The classification of sequences according to genotype, country, and their alignment to the HBV complete genome reference sequence NC_003977.2 can be found in Supplementary Table S2. Out of all the sequences analyzed, three could not be accurately genotyped, and four could not be subgenotyped.

2.3. Analysis of Resistance-Associated Mutations

The coding sequence (CDS) of each entry was used for translation. Amino acid translation was performed using the HBV polymerase gene, with EDWGPC (in the 336aa position) as the starter and IHTAEL (in the 719aa position) as the termination marker. These positions cover the RT domain of the polymerase gene. This information was used to build a database containing amino acid data for the analysis of clinically relevant sites in the RT domain, specifically positions 106, 126, 134, 169, 173, 180, 181, 194, 202, 204, 236, and 269. The resistance-associated mutations were classified according to their variants, combinations, and specific functions (classical/primary, compensatory, and putative). Of the mutations at the 12 sites, six were in different combinations with each other, as certain primary mutations often pair up with compensatory ones. Associations between mutations and resistance to specific NAs were determined based on previously established evidence.

2.4. Statistical Analysis

The figures in this article were created using Lucidchart website (https://www.lucidchart.com), Excel v.2503, PowerPoint v.2503, and RStudio v.4.3.1 (“Beagle Scouts”). The risk of resistance to NAs and mutations according to the genotypes and subtypes was calculated using univariate logistic regression. Those with an HIV co-infection and treatment prior to this study were also analyzed as risk factors.
Genotype frequencies were organized by country, and distribution patterns were assessed using Hierarchical Cluster Analysis, specifically the Ward method. The resulting dendrogram (Supplementary Figure S4) was used to structure the international grouping. The analyses were carried out with the IBM SPSS v.27 software.

3. Results

3.1. Geographical Distribution

A total of 8152 sequences were included, originating from 24 countries, 1 U.S. state (Alaska), and 2 territories (Martinique and Greenland) within the Americas. Although the United States contributed the highest number of sequences (2431), South America was the region with the largest overall contribution, primarily due to Brazil and Argentina. While South and North America contributed a comparable number of sequences, Central America and the Caribbean were underrepresented, with only 748 HBV sequences in the database (Figure 2).
Regarding the global circulation of genotypes, genotype A was found in 19 of the 27 territories studied, and genotype D in 16. Genotype H, present in only six countries, exhibited a markedly higher prevalence in México (65%) and Nicaragua (33%) compared with the other American nations (<1%). Genotype G was also identified in only six countries; in five of them, its prevalence was below 4%, whereas in México, it reached 14%.

3.2. Molecular Epidemiology

The 8152 analyzed sequences exhibited a heterogeneous distribution, predominantly comprising genotypes A, C, and D. Genotypes E, G, H, and I were each detected at a frequency below 3% (Figure 3). Overall, the predominant subtypes in the database were A2 (21.1%), A1 (14.7%), C2 (7.8%), D3/D6 (6.9%), and C1 (6.3%).
Furthermore, genotypes were grouped based on their frequency of occurrence and visualized using a hierarchical dendrogram. The analysis revealed a tripartite division of countries according to their epidemiological genotype composition (Figure 4). The orange group was characterized by a high prevalence of genotype F (greater than 60%) and included 11 countries from Central and South America. The blue group was the least represented and comprised countries with a wide diversity of genotypes (e.g., the United States and Canada), regardless of the dominant type. The gray group displayed a predominance of either genotype A or genotype D.

3.3. Resistance-Associated Mutations

Sequences containing one or more resistance-associated mutations (RAMs) (n = 864) were isolated to construct a figure illustrating the relationship between genotype, specific mutations, RAMs, and resistance to treatments. Among all sequences, 5454 exhibited at least one mutation, and 863 were found to carry a documented resistance-associated mutation. The frequency distribution of genotypes, RAMs, and associated resistance to NAs was visualized using a coded Sankey plot (Figure 5). In contrast with genotype A, in which 93% of sequences harbored mutations, 59% of genotype F sequences lacked clinically relevant mutations. The proportion of sequences with RAMs within each genotype generally ranged from 2% to 7%, with notable exceptions: genotype G (31.0%), genotype A (17.6%), and genotype H (12.9%).
Notably, the L180M + M204V/I combination was present in 467 sequences (54.1%) of those with RAMs and was associated with resistance to ETV, LMV, and TBV. Given that the majority of resistance-associated sequences exhibited resistance to multiple drugs (94.5%), resistance to LMV was observed in 840 sequences (97.3%), TBV in 816 (94.6%), ETV in 217 (25.1%), ADV in 29 (3.4%), and tenofovir in 8 (0.1%). Resistance to ETV was consistently accompanied by resistance to both LMV and TBV, while resistance to TBV was always accompanied by resistance to LMV.
The relationship between subgenotypes and resistance-associated mutations (RAMs) was analyzed (Table 1), revealing that 30 subgenotypes were associated with increased or decreased odds of one or more RAMs. Half of these subgenotypes were associated with increased odds for the L269I mutation, including B2, B3, B5, B6, C1, D1, D2, D3/D6, D4, F2a, F2b, F3, F4, F5, and H. Subgenotype A2 was associated with increased odds for five mutations: H126Y (OR = 12.22; 95% CI: 10.78–13.85), V173L (OR = 5.97; 95% CI: 2.89–12.32), L269I + H126Y (OR = 4.35; 95% CI: 3.84–4.93), L180M + M204V/I (OR = 4.77; 95% CI: 3.94–5.77), and L180M + M204V/I + V173L (OR = 15.80; 95% CI: 11.20–22.28). Similarly, genotype G was associated with increased odds for five mutations: H126Y (OR = 1.81; 95% CI: 1.27–2.59), A181T/V (OR = 9.85; 95% CI: 3.36–28.85), M204V/I (OR = 9.56; 95% CI: 5.91–15.49), L269I + H126Y (OR = 11.94; 95% CI: 8.36–17.07), and L180M + M204V/I + V173L (OR = 2.33; 95% CI: 1.13–4.81). In contrast, subgenotype A1 was associated with significantly decreased odds against five mutations: S106C (OR = 0.25; 95% CI: 0.09–0.67), M204V/I (OR = 0.36; 95% CI: 0.19–0.68), L269I (OR = 0.18; 95% CI: 0.14–0.22), L180M + M204V/I (OR = 0.36; 95% CI: 0.19–0.68), and L180M + M204V/I + V173L (OR = 0.26; 95% CI: 0.13–0.51).
The relationship between HBV subgenotypes and increased odds of resistance to specific nucleos(t)ide analogs (NAs) was analyzed (Table 2). Most significant associations indicated decreased odds rather than increased odds. Subgenotype A2 was identified as being associated with significantly increased odds for resistance to entecavir (ETV) (OR = 12.04; 95% CI: 8.82–16.43), lamivudine (LMV) (OR = 5.45; 95% CI: 4.70–6.32), and telbivudine (TBV) (OR = 5.12; 95% CI: 4.41–5.95). In contrast, subgenotype A1 was consistently associated with decreased odds for resistance to these same drugs: ETV (OR = 0.30; 95% CI: 0.17–0.56), LMV (OR = 0.46; 95% CI: 0.36–0.60), and TBV (OR = 0.48; 95% CI: 0.37–0.62). Notably, genotype H (OR = 13.87; 95% CI: 1.69–114.03) and subgenotype B5 (OR = 87.24; 95% CI: 16.94–449.14) were associated with strongly increased odds for resistance to Tenofovir. Additionally, genotype G was associated with increased odds of resistance to four out of the five NAs evaluated: adefovir (ADV), ETV, LMV, and TBV.

3.4. Resistance in Prior Treatments and Co-Infection

The relationship between prior or ongoing treatment with lamivudine (LMV) and tenofovir (TDF) and subsequent resistance to specific nucleos(t)ide analogs (NAs) was analyzed (Figure 6). A total of 83 patient sequences were associated with previous or current use of LMV, and 17 with previous or current use of TDF. Prior treatment with LMV was associated with increased odds for the following mutations: L180M + M204V/I + V173L (OR = 7.32; 95% CI: 3.98–13.45), L180M + M204V/I (OR = 20.98; 95% CI: 13.34–33.01), and H126Y (OR = 6.70; 95% CI: 4.27–10.52). Similarly, prior treatment with TDF was associated with increased odds for L180M+M204V/I (OR = 14.17; 95% CI: 5.20–38.57) and H126Y (OR = 4.16; 95% CI: 1.60–10.81).
Additionally, the relationship between HIV co-infection (present in 428 sequences) and resistance to nucleos(t)ide analogs (NAs) was analyzed (Figure 7), but no statistically significant associations were found (p ≥ 0.05).

4. Discussion

The molecular epidemiological findings in this study reinforce previous reports on the geographical distribution of HBV genotypes while offering a more comprehensive view of the epidemiological landscape in the Americas. As expected, and consistent with global patterns [6], genotypes A and D were the most widely distributed, present in over half of the countries analyzed. Although subtype A1 has been associated with South America and A2 with North America, due to differing historical routes of spread (the transatlantic slave trade vs. European migration) and modes of transmission (perinatal vs. parenteral/sexual) [6,30]. Our data revealed an inverse pattern in countries such as Argentina, Venezuela, Cuba, and Canada. Similarly, while subtype C1 predominates in Southeast Asia and C2 in East Asia [31], elevated frequencies observed in countries with significant Asian immigration (e.g., the USA, Canada, and Panama) may reflect increasing globalization and shifts in the genetic landscape [32,33].
The regional genotypic profiles also highlighted the influence of endemic strains. As anticipated, genotype F—particularly subtype F1b—was more prevalent in South America, while genotype H showed markedly higher frequencies in México and Nicaragua, likely due to its evolutionary origin and adaptation to local populations [9,34]. These differences between the south of the continent and México can be dated to around 8140 years ago, where genotypes F and H suffered a speciation event. It also predicted that these genotypes started their diversification around 4551 and 2070 years ago, information that is supported by the fact that genotype H was found in native populations [34]. Although historically concentrated in México and considered endemic [8], genotype G is now believed to represent the re-emergence of an ancient Eurasian strain [7]. Its low prevalence in most countries may be underestimated due to its frequent occurrence as a co-infection with other genotypes, complicating diagnosis. Despite its modest representation, genotype G warrants attention due to its high mutation frequency and established link with HIV co-infection, both of which increase the likelihood of RAM development [14].
As expected, genotype J was absent from the dataset, as it has only been documented in a single Japanese individual [12]. Nevertheless, genotypes not typically endemic to the continent were observed. For example, subtype I2—usually restricted to Southeast Asia [35,36]—was detected in a Canadian sample, likely originating from a Vietnamese immigrant [37]. Genotype E, though not dominant in any country, was detected in moderate frequencies in areas with African ancestry (e.g., Haiti and Martinique) or migration (e.g., USA and Canada), consistent with its origin in West Africa [38].
Among the 11 codons analyzed in the reverse transcriptase (RT) domain, several mutation combinations demonstrated that while single mutations are more common, they are less effective in conferring resistance. Notably, clinically relevant mutations can arise spontaneously, independent of antiviral exposure [17], and phylogenetic analyses have traced M204V and L180M mutations back to the mid-20th century [39]. Their early emergence likely accounts for their high prevalence in this study (53.5% and 44.3% of RAM-positive sequences, respectively). The combined M204I/V and L180M mutations were found in 2.7% of sequences, aligning with global estimates [16,39,40]. One sequence harboring the A181T/V mutation—associated with ADV resistance [14,16,41]—was collected 15 years before the approval of the first HBV treatment [42], highlighting how high mutation rates can yield clinically relevant variants even in the absence of selective drug pressure.
Of all sequences with RAMs, 94.5% conferred resistance to more than one antiviral agent, typically LMV, TBV, and ETV. These findings confirm that the HBV mutation pool continues to evolve in response to the selective pressures exerted by successive antiviral therapies. The risk of resistance increases with prolonged NA exposure [20,23,24], and cross-resistance—particularly following LMV treatment—is well documented [21,22]. In our study, all sequences resistant to ETV also displayed resistance to LMV and TBV, while all TBV-resistant sequences were also LMV-resistant.
Consistent with global prevalence estimates, resistance to tenofovir was detected in only 0.1% of sequences. However, subtypes B5 and H were associated with the A194T mutation, which confers resistance to TDF. Although both had the highest rates of TDF- and ETV-resistant sequences, the endemic nature and genetic adaptation of genotype H to the Mexican population [34] complicate the interpretation of its association analysis. While LMV and TBV showed the highest number of significant associations across subtypes (13 and 14, respectively), this could be attributed to their high representation in the database.
Despite A194T’s known association with TDF resistance, this drug typically requires specific mutational patterns to lose efficacy [39]. Two previously reported quadruple-mutation combinations may contribute to resistance development: CYEI (S106C + H126Y + D134E + L269I) and MLVV (L180M + T184L + A200V + M204V) [43]. Although L269I alone does not confer resistance, it was associated with an increased odds ratio in 15 subtypes. Moreover, three subtypes (A1, A2, and G) were linked to its combination with S106C and H126Y (CYI). The L180M + M204V/I mutation, associated with resistance to LMV, TBV, and ETV, was present in 54.1% of resistant sequences. These combinations highlight the need for further research on mutation synergy and its impact on resistance.
Genotype G emerged as the genotype with the highest association with five mutations (H126Y, A181T/V, M204V/I, L269I + H126Y, and L180M + M204V/I + V173L) and resistance to four of the five NAs analyzed (ADV, ETV, LMV, and TBV). This aligns with the prior literature linking genotype G to an increased number of RAMs and NA resistance [14,44]. Given its strong association with HIV risk groups (e.g., intravenous drug users and men who have sex with men) [45,46,47,48,49], extensive NA use in these populations may exert pre-infection selective pressure, promoting the emergence of RAMs with cross-resistance to HIV and HBV therapies. Conversely, genotype E had the lowest RAM prevalence, consistent with its regional distribution in areas with minimal antiviral coverage (0.2%) and limited public health funding [1], which reduces the selective pressure for RAM development.
Intra-genotypic differences were also evident. Subtype A2 was associated with resistance to LMV, ETV, and TBV, while subtype A1 was protective against resistance to the same drugs. This contrast, previously reported in the literature [14], may reflect their differing geographic and historical contexts [6,30]. Subtype A1’s distribution in countries with limited treatment access likely contributes to reduced selective pressure, while A2’s higher replication capacity [30] may increase its likelihood of developing RAMs.
Our findings also support established evidence that prior exposure to antiviral therapy increases the risk of resistance [20]. Sequences from patients previously treated with LMV and TDF showed increased odds of carrying mutations (L180M, M204V/I, H126Y, and/or V173L) associated with resistance to LMV, TBV, and ETV.
Although the number of sequences obtained from each region (North, Central, and South America) was proportional to its population [50], some countries were disproportionately represented (e.g., Cuba in the Caribbean and Argentina in South America), while others were underrepresented (e.g., Guatemala in Central America and Ecuador in South America). This may result in a skewed perception of regional genotypic distribution. Countries like the USA, with abundant sequencing data, provide a clearer epidemiological picture, while nations like Guatemala or Ecuador require more representative sampling. These disparities highlight the need for enhanced molecular surveillance in underrepresented areas.
A key distinction between this study and broader global analyses [39] is the focus on genotypes more prevalent in the Americas (e.g., H, G, and F), which are often underrepresented in global datasets due to the dominance of European (A and D) and Asian (B and C) genotypes. This information supports the adaptation of treatment strategies based on geographic profiles, even in settings where routine genotyping is unavailable. In highly heterogeneous countries such as Canada, the United States, and Suriname, national-level surveillance may be insufficient, and regional assessments may be necessary. In resource-limited settings, the local molecular epidemiology may guide drug prioritization and resistance risk management. Although the 2024 WHO guidelines recommend TDF as the first-line agent due to its high resistance barrier, often in combination with LMV or FTC when monotherapy is not feasible [2], the availability of these NAs remains limited in many low-resource countries. Furthermore, an inherent limitation of our study is the lack of complete clinical meta-information associated with publicly available HBV sequences. Specifically, we were unable to determine whether the sequences were obtained before or after antiviral therapy. This limitation prevents an analysis that directly compares the frequency of pre- and post-treatment mutations, a critical point for discerning between mutations emerging due to drug selection pressure and pre-existing polymorphisms.
While progress has been made in managing HBV across the region, emerging therapies in early development stages [51], further efforts are needed to address drug resistance effectively. Understanding the molecular epidemiology of HBV across the continent is crucial for estimating clinical implications, particularly in light of established genotype–resistance associations. A comprehensive genotypic analysis of resistance mutations can inform public health policies and improve treatment strategies. Such profiling would enable healthcare professionals to identify patient groups at higher risk for resistance and tailor therapy accordingly.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/microorganisms13081913/s1, Figure S1: Report subgenotyping (confirming reported); Figure S2: Report subgenotyping (all samples); Figure S3: Report broad subgenotyping; Table S1: Reference table [6,14,29,41,49,52,53,54,55,56,57,58,59,60,61,62,63,64]; Figure S4: Dendogram; Table S2: Sequence Classification according to genotype.

Author Contributions

Conceptualization: I.A.R. and O.V.-S.; methodology: O.V.-S. and J.F.M.-V.; software: I.A.R.; validation: C.D.D.-P. and A.A.S.-R.; formal Analysis: I.A.R. and O.V.-S.; data curation: I.A.R. and J.F.M.-V.; writing—original draft: I.A.R. and O.V.-S.; writing—review and editing: C.D.D.-P. and J.F.M.-V.; supervision: O.V.-S. and J.F.M.-V.; project administration: O.V.-S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

This study was approved by Comités de Investigación, Ética en Investigación y de Bioseuridad, CUCS, Universidad de Guadalajara (approval code: CI-04723; approved on 28 June 2023).

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Materials. Further inquiries can be directed to the corresponding author.

Acknowledgments

The authors thank all the researchers who made this study possible by sharing the information from their studies in the public NCBI repositories.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Global Hepatitis Report 2024: Action for Access in Low- and Middle-Income Countries. Available online: https://iris.who.int/handle/10665/376461 (accessed on 11 August 2025).
  2. Guidelines for the Prevention, Diagnosis, Care and Treatment for People with Chronic Hepatitis B Infection. Available online: https://iris.who.int/handle/10665/376353 (accessed on 11 August 2025).
  3. Campos-Valdez, M.; Monroy-Ramírez, H.C.; Armendáriz-Borunda, J.; Sánchez-Orozco, L.V. Molecular Mechanisms during Hepatitis B Infection and the Effects of the Virus Variability. Viruses 2021, 13, 1167. [Google Scholar] [CrossRef]
  4. Tsukuda, S.; Watashi, K. Hepatitis B virus biology and life cycle. Antivir. Res. 2020, 182, 104925. [Google Scholar] [CrossRef]
  5. Zhang, Z.H.; Wu, C.C.; Chen, X.W.; Li, X.; Li, J.; Lu, M.J. Genetic variation of hepatitis B virus and its significance for pathogenesis. World J. Gastroenterol. 2016, 22, 126–144. [Google Scholar] [CrossRef]
  6. Chen, J.; Li, L.; Yin, Q.; Shen, T. A review of epidemiology and clinical relevance of Hepatitis B virus genotypes and subgenotypes. Clin. Res. Hepatol. Gastroenterol. 2023, 47, 102180. [Google Scholar] [CrossRef] [PubMed]
  7. Kocher, A.; Papac, L.; Barquera, R.; Key, F.M.; Spyrou, M.A.; Hübler, R.; Rohrlach, A.B.; Aron, F.; Stahl, R.; Wissgott, A.; et al. Ten millennia of hepatitis B virus evolution. Science 2021, 374, 182–188. [Google Scholar] [CrossRef] [PubMed]
  8. Araujo, N.M.; Osiowy, C. Hepatitis B Virus Genotype G: The Odd Cousin of the Family. Front. Microbiol. 2022, 13, 872766. [Google Scholar] [CrossRef] [PubMed]
  9. Wolf, J.M.; Pereira, V.R.Z.B.; Simon, D.; Lunge, V.R. Evolutionary history of hepatitis B virus genotype H. J. Med. Virol. 2021, 93, 4004–4009. [Google Scholar] [CrossRef]
  10. Marchio, A.; Sitbounlang, P.; Deharo, E.; Paboriboune, P.; Pineau, P. Concealed for a Long Time on the Marches of Empires: Hepatitis B Virus Genotype I. Microorganisms 2023, 11, 2204. [Google Scholar] [CrossRef]
  11. Toyé, R.M.; Loureiro, C.L.; Jaspe, R.C.; Zoulim, F.; Pujol, F.H.; Chemin, I. The Hepatitis B Virus Genotypes E to J: The Overlooked Genotypes. Microorganisms 2023, 11, 1908. [Google Scholar] [CrossRef]
  12. Tatematsu, K.; Tanaka, Y.; Kurbanov, F.; Sugauchi, F.; Mano, S.; Maeshiro, T.; Nakayoshi, T.; Wakuta, M.; Miyakawa, Y.; Mizokami, M. A genetic variant of hepatitis B virus divergent from known human and ape genotypes isolated from a Japanese patient and provisionally assigned to new genotype J. J. Virol. 2009, 83, 10538–10547. [Google Scholar] [CrossRef]
  13. Sun, B.; Valtueña, A.A.; Kocher, A.; Gao, S.; Li, C.; Fu, S.; Zhang, F.; Ma, P.; Yang, X.; Qiu, Y.; et al. Origin and dispersal history of Hepatitis B virus in Eastern Eurasia. Nat. Commun. 2024, 15, 2951. [Google Scholar] [CrossRef]
  14. Araujo, N.M.; Teles, S.A.; Spitz, N. Comprehensive Analysis of Clinically Significant Hepatitis B Virus Mutations in Relation to Genotype, Subgenotype and Geographic Region. Front. Microbiol. 2020, 11, 616023. [Google Scholar] [CrossRef] [PubMed]
  15. Whalley, S.A.; Murray, J.M.; Brown, D.; Webster, G.J.; Emery, V.C.; Dusheiko, G.M.; Perelson, A.S. Kinetics of acute hepatitis B virus infection in humans. J. Exp. Med. 2001, 193, 847–854. [Google Scholar] [CrossRef]
  16. Zhang, X.; Chen, X.; Wei, M.; Zhang, C.; Xu, T.; Liu, L.; Xu, Z. Potential resistant mutations within HBV reverse transcriptase sequences in nucleos(t)ide analogues-experienced patients with hepatitis B virus infection. Sci. Rep. 2019, 9, 8078. [Google Scholar] [CrossRef]
  17. Irwin, K.K.; Renzette, N.; Kowalik, T.F.; Jensen, J.D. Antiviral drug resistance as an adaptive process. Virus Evol. 2016, 2, vew014. [Google Scholar] [CrossRef] [PubMed]
  18. Boulard, Y.; Bressanelli, S. Recapitulating Trafficking of Nucleosides Into the Active Site of Polymerases of RNA Viruses: The Challenge and the Prize. Front. Med. Technol. 2021, 3, 705875. [Google Scholar] [CrossRef] [PubMed]
  19. Thames, J.E.; Seley-Radtke, K.L. Comparison of the Old and New-Novel Mechanisms of Action for Anti-coronavirus Nucleoside Analogues. Chimia 2022, 76, 409–417. [Google Scholar] [CrossRef]
  20. Tacke, F.; Kroy, D.C. Treatment for hepatitis B in patients with drug resistance. Ann. Transl. Med. 2016, 4, 334. [Google Scholar] [CrossRef]
  21. Hepatitis B Virus: Overview of Management-UpToDate. Available online: https://www.uptodate.com/contents/hepatitis-b-virus-overview-of-management (accessed on 11 August 2025).
  22. Kimberlin, D.W. 295—Antiviral Agents. In Principles and Practice of Pediatric Infectious Diseases, 6th ed.; Long, S.S., Ed.; Elsevier: Philadelphia, PA, USA, 2023; pp. 1583–1598.e6. Available online: https://www.sciencedirect.com/science/article/pii/B9780323756082002950 (accessed on 11 August 2025).
  23. Laoi, B.N.; Herra, C.; Norris, S.; Crowley, B. First report of genotypic resistance to adefovir in chronic HBV in the Republic of Ireland. J. Antimicrob. Chemother. 2006, 57, 1009–1010. [Google Scholar] [CrossRef]
  24. Quirós-Santana, C. Telbivudina. Acta Médica Costarric. 2008, 50, 55. [Google Scholar]
  25. Xia, Y.; Guo, H. Hepatitis B virus cccDNA: Formation, regulation and therapeutic potential. Antivir. Res. 2020, 180, 104824. [Google Scholar] [CrossRef]
  26. FDALabel. Available online: https://nctr-crs.fda.gov/fdalabel/ui/spl-summaries/criteria/391290 (accessed on 11 August 2025).
  27. Pourkarim, M.R.; Amini-Bavil-Olyaee, S.; Lemey, P.; Maes, P.; Van Ranst, M. HBV subgenotype misclassification expands quasi-subgenotype A3. Clin. Microbiol. Infect. 2011, 17, 947–949. [Google Scholar] [CrossRef] [PubMed]
  28. Meldal, B.H.M.; Bon, A.H.; Prati, D.; Ayob, Y.; Allain, J.P. Diversity of hepatitis B virus infecting Malaysian candidate blood donors is driven by viral and host factors. J. Viral Hepat. 2011, 18, 91–101. [Google Scholar] [CrossRef] [PubMed]
  29. Sant’Anna, T.B.; Araujo, N.M. Hepatitis B Virus Genotype D: An Overview of Molecular Epidemiology, Evolutionary History, and Clinical Characteristics. Microorganisms 2023, 11, 1101. [Google Scholar] [CrossRef]
  30. Zhang, M.; Mouzannar, K.; Zhang, Z.; Teraoka, Y.; Piotrowski, J.; Ishida, Y.; Tateno-Mukaidani, C.; Saito, T.; Abe-Chayama, H.; Chayama, K.; et al. Hepatitis B virus genotypes A1 and A2 have distinct replication phenotypes due to polymorphisms in the HBx gene. PLoS Pathog. 2025, 21, e1012803. [Google Scholar] [CrossRef] [PubMed]
  31. Li, T.; Chen, H.; Sun, L.; Liu, J. Dating the origin and dispersal of global hepatitis B virus genotype C in humans. Drug Discov. Ther. 2022, 16, 85–92. [Google Scholar] [CrossRef]
  32. O’neil, K.; Hamilton, K.; Papademetriou, D. Migration in the Americas. 2005 Sep. Available online: https://www.iom.int/sites/g/files/tmzbdl486/files/jahia/webdav/site/myjahiasite/shared/shared/mainsite/policy_and_research/gcim/rs/RS1.pdf (accessed on 1 May 2025).
  33. OECD. International Migration in the Americas. 2015. Available online: https://www.oecd.org/en/publications/international-migration-in-the-americas_e5d3aa63-en.html (accessed on 11 August 2025).
  34. Jose-Abrego, A.; Roman, S.; Laguna-Meraz, S.; Rebello-Pinho, J.R.; Justo Arevalo, S.; Panduro, A. Tracing the evolutionary history of hepatitis B virus genotype H endemic to Mexico. Front. Microbiol. 2023, 14, 1180931. [Google Scholar] [CrossRef]
  35. Holzmayer, V.; Hance, R.; Defechereux, P.; Grant, R.; Kuhns, M.C.; Cloherty, G.; Rodgers, M.A. Identification of hepatitis B virus genotype I in Thailand. J. Med. Virol. 2019, 91, 717–721. [Google Scholar] [CrossRef]
  36. Shen, T.; Yan, X.M.; Liu, H.; Zhang, B.X.; Li, L.; Zhang, J.P.; Wang, J.L.; Xiao, C.J. Genotype I of hepatitis B virus was found in east Xishuangbanna, China and molecular dynamics of HBV/I. J. Viral Hepat. 2015, 22, 37–45. [Google Scholar] [CrossRef]
  37. Osiowy, C.; Kaita, K.; Solar, K.; Mendoza, K. Molecular characterization of hepatitis B virus and a 9-year clinical profile in a patient infected with genotype I. J. Med. Virol. 2010, 82, 942–948. [Google Scholar] [CrossRef]
  38. Andernach, I.E.; Hunewald, O.E.; Muller, C.P. Bayesian inference of the evolution of HBV/E. PLoS ONE 2013, 8, e81690. [Google Scholar] [CrossRef]
  39. Mokaya, J.; Vasylyeva, T.I.; Barnes, E.; Ansari, M.A.; Pybus, O.G.; Matthews, P.C. Global prevalence and phylogeny of hepatitis B virus (HBV) drug and vaccine resistance mutations. J. Viral Hepat. 2021, 28, 1110–1120. [Google Scholar] [CrossRef]
  40. Nguyen, T.K.; Van Le, D. Resistant mutations within the hepatitis B virus reverse transcriptase sequence in treatment failure patients with chronic HBV infection in Vietnam. J. Glob. Antimicrob. Resist. 2023, 33, 35–41. [Google Scholar] [CrossRef] [PubMed]
  41. Jose-Abrego, A.; Roman, S.; Rebello Pinho, J.R.; Gomes-Gouvêa, M.S.; Panduro, A. High Frequency of Antiviral Resistance Mutations in HBV Genotypes A2 and H: Multidrug Resistance Strains in Mexico. J. Clin. Transl. Hepatol. 2023, 11, 1023–1034. [Google Scholar] [CrossRef] [PubMed]
  42. Bassit, L.; Ono, S.K.; Schinazi, R.F. Moving Fast Toward Hepatitis B Virus Elimination. Adv. Exp. Med. Biol. 2021, 1322, 115–138. [Google Scholar] [PubMed]
  43. Liu, T.; Sun, Q.; Gu, J.; Cen, S.; Zhang, Q. Characterization of the tenofovir resistance-associated mutations in the hepatitis B virus isolates across genotypes A to D. Antivir. Res. 2022, 203, 105348. [Google Scholar] [CrossRef]
  44. Lada, O.; Gervais, A.; Branger, M.; Peytavin, G.; Roquebert, B.; Collin, G.; Fraqueiro, G.; Moucari, R.; Leclerc, L.; Martinot-Peignoux, M.; et al. Quasispecies analysis and in vitro susceptibility of HBV strains isolated from HIV-HBV-coinfected patients with delayed response to tenofovir. Antivir. Ther. 2012, 17, 61–70. [Google Scholar] [CrossRef]
  45. Sánchez, L.V.; Tanaka, Y.; Maldonado, M.; Mizokami, M.; Panduro, A. Difference of hepatitis B virus genotype distribution in two groups of mexican patients with different risk factors: High prevalence of genotype H and G. Intervirology 2007, 50, 9–15. [Google Scholar] [CrossRef]
  46. Osiowy, C.; Gordon, D.; Borlang, J.; Giles, E.; Villeneuve, J.P. Hepatitis B virus genotype G epidemiology and co-infection with genotype A in Canada. J. Gen. Virol. 2008, 89 Pt 12, 3009–3015. [Google Scholar] [CrossRef]
  47. Cornelissen, M.; Zorgdrager, F.; Bruisten, S.M.; Bakker, M.; Berkhout, B.; van der Kuyl, A.C. Widespread hepatitis B virus genotype G (HBV-G) infection during the early years of the HIV epidemic in the Netherlands among men who have sex with men. BMC Infect. Dis. 2016, 16, 268. [Google Scholar] [CrossRef]
  48. Araujo, N.M.; Araujo, O.C.; Silva, E.M.; Villela-Nogueira, C.A.; Nabuco, L.C.; Parana, R.; Bessone, F.; Gomes, S.A.; Trepo, C.; Kay, A. Identification of novel recombinants of hepatitis B virus genotypes F and G in human immunodeficiency virus-positive patients from Argentina and Brazil. J. Gen. Virol. 2013, 94 Pt 1, 150–158. [Google Scholar] [CrossRef]
  49. Jose-Abrego, A.; Roman, S.; Rebello Pinho, J.R.; de Castro, V.F.D.; Panduro, A. Hepatitis B Virus (HBV) Genotype Mixtures, Viral Load, and Liver Damage in HBV Patients Co-infected With Human Immunodeficiency Virus. Front. Microbiol. 2021, 12, 640889. [Google Scholar] [CrossRef] [PubMed]
  50. World Bank Open Data. Available online: https://data.worldbank.org (accessed on 11 August 2025).
  51. Pierra Rouviere, C.; Dousson, C.B.; Tavis, J.E. HBV replication inhibitors. Antivir. Res. 2020, 179, 104815. [Google Scholar] [CrossRef] [PubMed]
  52. McNaughton, A.L.; Revill, P.A.; Littlejohn, M.; Matthews, P.C.; Azim Ansari, M. Analysis of genomic-length HBV sequences to determine genotype and subgenotype reference sequences. J. Gen. Virol. 2020, 101, 271–283. [Google Scholar] [CrossRef]
  53. Mojsiejczuk, L.; Torres, C.; Flichman, D.; Campos, R.H. Long-term evolution of hepatitis B virus genotype F: Strong association between viral diversification and the prehistoric settlement of Central and South America. J. Viral. Hepat. 2020, 27, 620–630. [Google Scholar] [CrossRef]
  54. Yin, Y.; He, K.; Wu, B.; Xu, M.; Du, L.; Liu, W.; Liao, P.; Liu, Y.; He, M. A systematic genotype and subgenotype re-ranking of hepatitis B virus under a novel classification standard. Heliyon 2019, 5, e02556. [Google Scholar] [CrossRef] [PubMed]
  55. Wang, X.Y.; Harrison, T.J.; He, X.; Chen, Q.Y.; Li, G.J.; Liu, M.H.; Li, H.; Yang, J.Y.; Fang, Z.L. The prevalence of mutations in the major hydrophilic region of the surface antigen of hepatitis B virus varies with subgenotype. Epidemiol. Infect. 2015, 143, 3572–3582. [Google Scholar] [CrossRef]
  56. Chen, L.; Shi, Y.; Yang, W.; Zhang, Y.; Xie, Q.; Li, Y.; Li, X.; Li, J.; Zhang, Z. Differences in Cpg Island Distribution Between Subgenotypes of the Hepatitis B Virus Genotype. Med. Sci. Monit. 2018, 24, 6781–6794. [Google Scholar] [CrossRef]
  57. Thijssen, M.; Trovão, N.S.; Mina, T.; Maes, P.; Pourkarim, M.R. Novel hepatitis B virus subgenotype A8 and quasi-subgenotype D12 in African–Belgian chronic carriers. Int. J. Infect. Dis. 2020, 93, 98–101. [Google Scholar] [CrossRef]
  58. Gionda, P.O.; Gomes-Gouvea, M.; Malta Fde, M.; Sebe, P.; Salles, A.P.M.; Francisco Rdos, S.; José-Abrego, A.; Roman, S.; Panduro, A.; Pinho, J.R.R. Analysis of the complete genome of HBV genotypes F and H found in Brazil and Mexico using the next generation sequencing method. Ann. Hepatol. 2022, 27 (Suppl. 1), 100569. [Google Scholar] [CrossRef]
  59. Zhu, H.L.; Wang, C.T.; Xia, J.B.; Li, X.; Zhang, Z.H. Establishment of reference sequences of hepatitis B virus genotype C subgenotypes. Genet. Mol. Res. 2015, 14, 16521–16534. [Google Scholar] [CrossRef] [PubMed]
  60. Jia, H.H.; Chen, Q.Y.; Jiang, Z.H.; Wang, X.Y.; Zhang, W.J.; He, X.; Harrison, T.J.; Jackson, J.B.; Fang, Z.-L. A novel subgenotype I3 of hepatitis B virus in Guangxi, China: A 15-year follow-up study. Virus Genes 2023, 59, 359–369. [Google Scholar] [CrossRef] [PubMed]
  61. Cai, Q.; Zhu, H.; Zhang, Y.; Li, X.; Zhang, Z. Hepatitis B virus genotype A: Design of reference sequences for sub-genotypes. Virus Genes 2016, 52, 325–333. [Google Scholar] [CrossRef]
  62. McNaughton, A.L.; D’Arienzo, V.; Ansari, M.A.; Lumley, S.F.; Littlejohn, M.; Revill, P.; McKeating, J.A.; Matthews, P.C. Insights from Deep Sequencing of the HBV Genome-Unique, Tiny, and Misunderstood. Gastroenterology 2019, 156, 384–399. [Google Scholar] [CrossRef]
  63. National Center for Biotechnology Information. NCBI. 2004 Genotyping. Available online: https://www.ncbi.nlm.nih.gov/projects/genotyping/formpage.cgi (accessed on 22 March 2024).
  64. Bell, T.G.; Yousif, M.; Kramvis, A. Bioinformatic curation and alignment of genotyped hepatitis B virus (HBV) sequence data from the GenBank public database. Springerplus 2016, 5, 1896. [Google Scholar] [CrossRef] [PubMed]
Figure 1. This study analyzed 8152 HBV sequences from the GenBank database to investigate associations among genotypes, mutations, and resistance to the main nucleos(t)ide analogs (NAs) used in the treatment of chronic HBV infection. The sequences were classified into three clusters based on genotype information availability, followed by phylogenetic analyses to determine genotypes and subtypes. Statistical analyses were performed to evaluate resistance-associated risk factors and genotype distribution across countries. ADV, adefovir; CDS, coding region; ETV, entecavir; LMV, lamivudine; ML, maximum likelihood; NAs, nucleo(s)tide analogs; RT, reverse transcriptase; TBV, telbivudine; TN93, Tamura-Nei 1993.
Figure 1. This study analyzed 8152 HBV sequences from the GenBank database to investigate associations among genotypes, mutations, and resistance to the main nucleos(t)ide analogs (NAs) used in the treatment of chronic HBV infection. The sequences were classified into three clusters based on genotype information availability, followed by phylogenetic analyses to determine genotypes and subtypes. Statistical analyses were performed to evaluate resistance-associated risk factors and genotype distribution across countries. ADV, adefovir; CDS, coding region; ETV, entecavir; LMV, lamivudine; ML, maximum likelihood; NAs, nucleo(s)tide analogs; RT, reverse transcriptase; TBV, telbivudine; TN93, Tamura-Nei 1993.
Microorganisms 13 01913 g001
Figure 2. The number of sequences included in the database, classified by region and by country, state, or territory. Alaska was presented separately from the rest of the United States due to the inherent differences in the genetic composition of its population.
Figure 2. The number of sequences included in the database, classified by region and by country, state, or territory. Alaska was presented separately from the rest of the United States due to the inherent differences in the genetic composition of its population.
Microorganisms 13 01913 g002
Figure 3. The number of sequences corresponding to each genotype and subtype included in the database. A total of 45 genotypes/subtypes were reported across the Americas. The most frequent subtypes were A2 (n = 1722), A1 (n = 1201), C2 (n = 634), D3/D6 (n = 561), C1 (n = 515), B2 (n = 459), and F1b (n = 425). Two sequences identified as natural recombinants (A/G and A/D) were excluded from the figure. Genotype J was not detected, as expected.
Figure 3. The number of sequences corresponding to each genotype and subtype included in the database. A total of 45 genotypes/subtypes were reported across the Americas. The most frequent subtypes were A2 (n = 1722), A1 (n = 1201), C2 (n = 634), D3/D6 (n = 561), C1 (n = 515), B2 (n = 459), and F1b (n = 425). Two sequences identified as natural recombinants (A/G and A/D) were excluded from the figure. Genotype J was not detected, as expected.
Microorganisms 13 01913 g003
Figure 4. Geographic distribution of HBV subtypes across the Americas, classified by country of origin and region: (a) North America, (b) Central America and the Caribbean, and (c) South America. The map displays the percentage frequencies of subtypes (represented as pie charts) in each of the 27 areas included in the database. Countries marked in blue exhibited a heterogeneous genotype distribution, with varying proportions of genotypes A. Countries and territories in gray showed a prominent presence of either genotype A or D. Countries highlighted in orange had a predominance of genotype F.
Figure 4. Geographic distribution of HBV subtypes across the Americas, classified by country of origin and region: (a) North America, (b) Central America and the Caribbean, and (c) South America. The map displays the percentage frequencies of subtypes (represented as pie charts) in each of the 27 areas included in the database. Countries marked in blue exhibited a heterogeneous genotype distribution, with varying proportions of genotypes A. Countries and territories in gray showed a prominent presence of either genotype A or D. Countries highlighted in orange had a predominance of genotype F.
Microorganisms 13 01913 g004aMicroorganisms 13 01913 g004bMicroorganisms 13 01913 g004c
Figure 5. A Sankey plot illustrating the relationship between genotype, mutations, and resistance to nucleos(t)ide analogs (NAs). The figure displays the frequency of genotypes reported in the total dataset (n = 8147), the subset with mutations (n = 5454), and, among those, the sequences harboring resistance-associated mutations (RAMs) (n = 863). For the 863 sequences with RAMs, the distribution of genotypes (A–H), specific RAMs, and resistance to nucleotide analogs (tenofovir and adefovir) and nucleoside analogs (telbivudine, lamivudine, and entecavir) were evaluated. Genotypes G and E, along with genotype A, exhibited the highest proportion of sequences with mutations (100% and 99%, respectively). However, only 16% of sequences with mutations contained RAMs. Genotype A accounted for 60% of all treatment-resistant sequences, representing 92% of all sequences with the H126Y mutation and 63% of those with the L180M + M204V/I mutation. The L180M + M204V/I and H126Y mutations together accounted for more than half of the observed RAMs, while resistance to TBV and LMV was the most frequently reported. Sequences with undetermined genotypes (NA) and recombinants were excluded from the figure.
Figure 5. A Sankey plot illustrating the relationship between genotype, mutations, and resistance to nucleos(t)ide analogs (NAs). The figure displays the frequency of genotypes reported in the total dataset (n = 8147), the subset with mutations (n = 5454), and, among those, the sequences harboring resistance-associated mutations (RAMs) (n = 863). For the 863 sequences with RAMs, the distribution of genotypes (A–H), specific RAMs, and resistance to nucleotide analogs (tenofovir and adefovir) and nucleoside analogs (telbivudine, lamivudine, and entecavir) were evaluated. Genotypes G and E, along with genotype A, exhibited the highest proportion of sequences with mutations (100% and 99%, respectively). However, only 16% of sequences with mutations contained RAMs. Genotype A accounted for 60% of all treatment-resistant sequences, representing 92% of all sequences with the H126Y mutation and 63% of those with the L180M + M204V/I mutation. The L180M + M204V/I and H126Y mutations together accounted for more than half of the observed RAMs, while resistance to TBV and LMV was the most frequently reported. Sequences with undetermined genotypes (NA) and recombinants were excluded from the figure.
Microorganisms 13 01913 g005
Figure 6. Odds ratios and 95% confidence intervals for the association between prior treatment with lamivudine (LMV) or tenofovir (TDF) and the subsequent presence of resistance-associated mutations (L180M, M204V/I, H126Y, and/or V173L).
Figure 6. Odds ratios and 95% confidence intervals for the association between prior treatment with lamivudine (LMV) or tenofovir (TDF) and the subsequent presence of resistance-associated mutations (L180M, M204V/I, H126Y, and/or V173L).
Microorganisms 13 01913 g006
Figure 7. Odds ratios and 95% confidence intervals for resistance to nucleos(t)ide analogs (NAs) in patients with HIV co-infection.
Figure 7. Odds ratios and 95% confidence intervals for resistance to nucleos(t)ide analogs (NAs) in patients with HIV co-infection.
Microorganisms 13 01913 g007
Table 1. Odds ratios and 95% confidence intervals for significant associations between HBV subtypes and specific resistance-associated mutations (RAMs). Subgenotypes highlighted in pink were identified with increased odds of a mutation, while those in green showed decreased odds. Subgenotypes A2 and genotype G were both associated with significantly increased odds for five mutations, while subgenotypes C1 and H were associated with four. Conversely, subgenotypes D3/D6 and C2 had decreased odds for four mutations each, and A1 had high odds for five. No significant associations were found between any subgenotype and the mutation L180M + M204V/I + I169T; this mutation was, therefore, excluded from the table. NS, not significant (p ≥ 0.05); -, no sequences with the specific combination of subgenotype and mutation.
Table 1. Odds ratios and 95% confidence intervals for significant associations between HBV subtypes and specific resistance-associated mutations (RAMs). Subgenotypes highlighted in pink were identified with increased odds of a mutation, while those in green showed decreased odds. Subgenotypes A2 and genotype G were both associated with significantly increased odds for five mutations, while subgenotypes C1 and H were associated with four. Conversely, subgenotypes D3/D6 and C2 had decreased odds for four mutations each, and A1 had high odds for five. No significant associations were found between any subgenotype and the mutation L180M + M204V/I + I169T; this mutation was, therefore, excluded from the table. NS, not significant (p ≥ 0.05); -, no sequences with the specific combination of subgenotype and mutation.
SubtypeS106CH126YD134EV173LA181T/VA194TS202C/G/IM204V/IN236TL269IL269I + H126YL180M + M204V/IL180M + M204V/I + V173L
A10.25 (0.09–0.67)3.55 (3.11–4.04)NSNSNSNSNS0.36 (0.19–0.68)-0.18 (0.14–0.22)4.64 (4.05–5.31)0.36 (0.19–0.68)0.26 (0.13–0.51)
A2-12.22 (10.78–13.85)NS5.97 (2.89–12.32)--NSNSNS0.02 (0.01–0.03)4.35 (3.84–4.93)4.77 (3.94–5.77)15.80 (11.20–22.28)
A5-NS--------117.24 (15.82–868.9)--
B2----NSNS-NSNS6.37 (5.20–7.79)-0.21 (0.09–0.47)-
B3---------8.77 (3.5–22.0)---
B4-0.01 (0.002–0.09)NS-18.12 (8.34–39.39)-NS7.67 (5.21–11.31)-NS-0.26 (0.11–0.64)-
B571.34 (34.6–147.07)----87.24 (16.94–449.13)-NS-6.38 (3.03–13.42)---
B6----NS----14. 36 (7.91–26.08)---
B10-----------16.49 (1.03–264.04)-
C12.99 (1.74–5.15)0.01 (0.004–0.06)5.18 (2.86–9.37)NSNSNS-2.03 (1.26–3.27)-3.37 (2.82–4.04)-0.28 (0.014–0.54)-
C2NS0.05 (0.02–0.09)2.15 (1.05–4.39)NS---NS-0.27 (0.21–0.36)0.01 (0.004–0.06)NS0.06 (0.008–0.41)
C5-NS15.74 (3.57–69.42)----NS-NS---
D1--NS-4.92 (1.15–20.99)--NS30.60 (2.76–339.6)1.58 (1.11–2.27)-NS-
D2---NS--NSNS-4.90 (3.89–6.16)-2.81 (2.02–3.91)NS
D3/D6NS0.02 (0.006–0.06)NSNS--3.89 (1.28–11.85)NS-4.34 (3.64–5.18)0.02 (0.004–0.07)0.34 (0.19–0.61)0.13 (0.03–0.53)
D4-0.02 (0.002–0.11)---NS---3.28 (2.56–4.2)---
D9---NSNS--NS-NS-NS4.41 (1.73–11.24)
D12------14.89 (1.92–115.28)--0.18 (0.04–0.74)-3.69 (1.52–8.99)-
E-2.86 (2.14–3.82)-----NS-0.01 (0.002–0.1)6.97 (5.22–9.31)0.25 (0.08–0.79)-
F1b7.03 (4.43–11.14)---NS-NS--NS0.01 (0.002–0.08)NSNS
F1c3.74 (1.79–7.82)--------0.19 (0.11–0.33)---
F2aNS0.03 (0.004–0.18)-------3.66 (2.66–5.05)-NS-
F2b---------8.26 (1.67–40.98)-5.51 (1.11–27.35)-
F37.23 (3.67–14.25)--NS---NS-5.38 (3.77–7.69)-NS-
F44.39 (2.37–8.14)-----NS--1.90 (1.48–2.44)-0.19 (0.06–0.58)NS
F5---------5.50 (1.007–30.07)---
F6------23.87 (3.03–187.99)--NS-NS-
G-1.81 (1.27–2.59)--9.85 (3.36–28.85)--9.56 (5.91–15.49)--11.94 (8.36–17.07)NS2.33 (1.13–4.81)
HNS--6.76 (1.59–28.8)NS13.87 (1.69–114.03)12.27 (2.78–54.24)NS-6.27 (3.94–9.99)0.06 (0.008–0.43)NSNS
I-NS--------50.48 (6.46–394.69)--
Table 2. Odds ratios and 95% confidence intervals for significant associations between HBV subgenotypes and resistance to specific nucleos(t)ide analogs (NAs). Subgenotypes highlighted in pink were identified as risk factors for resistance, while those in green were classified as protective factors against resistance to the corresponding NA. Lamivudine and telbivudine showed the highest number of significant associations, with 13 and 14 subgenotypes, respectively. In contrast, resistance to tenofovir was significantly associated only with subgenotypes B5 and H. NS, not significant (p ≥ 0.05); -, no sequences with the specific combination of subgenotype and mutation.
Table 2. Odds ratios and 95% confidence intervals for significant associations between HBV subgenotypes and resistance to specific nucleos(t)ide analogs (NAs). Subgenotypes highlighted in pink were identified as risk factors for resistance, while those in green were classified as protective factors against resistance to the corresponding NA. Lamivudine and telbivudine showed the highest number of significant associations, with 13 and 14 subgenotypes, respectively. In contrast, resistance to tenofovir was significantly associated only with subgenotypes B5 and H. NS, not significant (p ≥ 0.05); -, no sequences with the specific combination of subgenotype and mutation.
SubtypeAdefovirEntecavirLamivudineTelbivudineTenofovir
A1NS0.30 (0.17–0.56)0.46 (0.36–0.60)0.48 (0.37–0.62)NS
A20.13 (0.02–0.98)12.04 (8.82–16.43)5.45 (4.70–6.32)5.12 (4.41–5.95)-
B2NS-0.18 (0.10–0.35)0.19 (0.10–0.36)NS
B416.10 (7.54–34.40)0.12 (0.02–0.82)NS1.40 (1–1.96)-
B5--NSNS87.24 (16.94–449.14)
C1NS-0.50 (0.34–0.74)0.52 (0.36–0.76)NS
C2-0.05 (0.007–0.38)0.52 (0.37–0.73)0.54 (0.38–0.75)-
D14.55 (1.07–19.35)-NSNS-
D2-NS1.86 (1.38–2.51)1.93 (1.43–2.59)-
D3/D6-0.38 (0.17–0.86)0.35 (0.23–0.54)0.37 (0.24–0.56)-
D9NS4.14 (1.63–10.52)NSNS-
E--0.18 (0.07–0.48)0.18 (0.07–0.50)-
F1bNSNS0.55 (0.37–0.83)0.57 (0.38–0.86)-
F2a--0.34 (0.15–0.78)0.36 (0.16–0.81)-
F3--0.40 (0.18–0.90)0.41 (0.18–0.93)-
F4-NS0.13 (0.05–0.35)0.14 (0.05–0.36)-
G9.06 (3.11–26.37)2.18 (1.05–4.50)3.56 (2.46–5.14)3.68 (2.54–5.32)-
HNSNSNSNS13.87 (1.69–114.03)
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ruvalcaba, I.A.; Diaz-Palomera, C.D.; Silva-Ríos, A.A.; Muñoz-Valle, J.F.; Viera-Segura, O. The Molecular Epidemiology of Hepatitis B Virus and Its Resistance-Associated Mutations in the Polymerase Gene in the Americas. Microorganisms 2025, 13, 1913. https://doi.org/10.3390/microorganisms13081913

AMA Style

Ruvalcaba IA, Diaz-Palomera CD, Silva-Ríos AA, Muñoz-Valle JF, Viera-Segura O. The Molecular Epidemiology of Hepatitis B Virus and Its Resistance-Associated Mutations in the Polymerase Gene in the Americas. Microorganisms. 2025; 13(8):1913. https://doi.org/10.3390/microorganisms13081913

Chicago/Turabian Style

Ruvalcaba, Itzel A., Carlos Daniel Diaz-Palomera, Adrián Alejandro Silva-Ríos, José Francisco Muñoz-Valle, and Oliver Viera-Segura. 2025. "The Molecular Epidemiology of Hepatitis B Virus and Its Resistance-Associated Mutations in the Polymerase Gene in the Americas" Microorganisms 13, no. 8: 1913. https://doi.org/10.3390/microorganisms13081913

APA Style

Ruvalcaba, I. A., Diaz-Palomera, C. D., Silva-Ríos, A. A., Muñoz-Valle, J. F., & Viera-Segura, O. (2025). The Molecular Epidemiology of Hepatitis B Virus and Its Resistance-Associated Mutations in the Polymerase Gene in the Americas. Microorganisms, 13(8), 1913. https://doi.org/10.3390/microorganisms13081913

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop