Next Article in Journal
Effect of Coriander Plants on Human Emotions, Brain Electrophysiology, and Salivary Secretion
Next Article in Special Issue
Genomic Analysis of 18th-Century Kazakh Individuals and Their Oral Microbiome
Previous Article in Journal
Digital versus Traditional Workflow for Immediate Loading in Single-Implant Restoration: A Randomized Clinical Trial
Previous Article in Special Issue
An Alignment-Independent Approach for the Study of Viral Sequence Diversity at Any Given Rank of Taxonomy Lineage
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Phylogeography and Re-Evaluation of Evolutionary Rate of Powassan Virus Using Complete Genome Data

by
Artem N. Bondaryuk
1,2,*,
Tatiana E. Peretolchina
2,
Elena V. Romanova
2,
Anzhelika V. Yudinceva
2,
Evgeny I. Andaev
1 and
Yurij S. Bukin
2
1
Laboratory of Natural Focal Viral Infections, Irkutsk Antiplague Research Institute of Siberia and the Far East, Irkutsk 664047, Russia
2
Department of Genosystematics, Limnological Institute, Siberian Branch of the Russian Academy of Sciences, Irkutsk 664033, Russia
*
Author to whom correspondence should be addressed.
Biology 2021, 10(12), 1282; https://doi.org/10.3390/biology10121282
Submission received: 29 October 2021 / Revised: 22 November 2021 / Accepted: 3 December 2021 / Published: 6 December 2021

Abstract

:

Simple Summary

The evolution of human pathogenic viruses is one of the pressing problems of modern biology and directly relevant to public health. Many important aspects of virus evolution (e.g., evolutionary rate, population size, and migration history) are ‘hidden’ from the naked eye of a researcher. Modern bioinformatics methods make it possible to evaluate and visualize such evolutionary particularities of viruses. In this paper, we reconstructed the migration history and estimated the evolutionary rate of one of the most dangerous neuroinvasive and neurotropic tick-borne flaviviruses—Powassan virus (POWV)—distributed in North America and the Far East of Russia. Using the dates obtained, we hypothesized that the divergence of the most recent common ancestor of POWV into two independent genetic lineages most likely occurred because of the melting of glaciers that began at 11.72 Kya in the Holocene due to the climate warming-caused flooding of the isthmus between Eurasia and North America.

Abstract

In this paper, we revealed the genetic structure and migration history of the Powassan virus (POWV) reconstructed based on 25 complete genomes available in NCBI and ViPR databases (accessed in June 2021). The usage of this data set allowed us to perform a more precise assessment of the evolutionary rate of this virus. In addition, we proposed a simple Bayesian technique for the evaluation and visualization of ‘temporal signal dynamics’ along the phylogenetic tree. We showed that the evolutionary rate value of POWV is 3.3 × 10−5 nucleotide substitution per site per year (95% HPD, 2.0 × 10−5–4.7 × 10−5), which is lower than values reported in the previous studies. Divergence of the most recent common ancestor (MRCA) of POWV into two independent genetic lineages most likely occurred in the period between 2600 and 6030 years ago. We assume that the divergence of the virus lineages happened due to the melting of glaciers about 12,000 years ago, which led to the disappearance of the Bering Land Bridge between Eurasia and North America (the modern Alaskan territory) and spatial division of the viral areal into two parts. Genomic data provide evidence of the virus migrations between two continents. The mean migration rate detected from the Far East of Russia to North America was one event per 1750 years. The migration to the opposite direction occurred approximately once per 475 years.

1. Introduction

Powassan virus (POWV) was isolated for the first time in September 1958 in Canada (Powassan, Ontario) from a child with fatal encephalitis [1].
POWV is a tick-borne flavivirus transmitted by Ixodes spp. (e.g., I. marxi, I. scapularis, I. cookie, I. persulcatus, etc.) [2], which is spread across North America (NA) and the Russian Far East (RFE), and causes a fatal neuroinvasive disease in humans [3]. Approximately 10% of POWV encephalitis cases are fatal, and 50% cause severe and long-lasting neurological sequelae [4]. The mortality of the most other neuroinvasive tick-borne flaviviruses is lower. For instance, the European subtype of tick-borne encephalitis virus causes about 1–2% of fatal cases, and the louping-ill virus has only one fatal case registered [5]. Among the variants of POWV in NA, the subtype “Deer tick virus” (DTV) is distinguished based on genetic characteristics, the area distribution, and the spectrum of vectors [3].
The genome length of the POWV is approximately 11,000 nt. The largest part of the sequence (10,248 nt.) encodes a single polyprotein (3416 a.a.), which is cleaved co- and post-translationally into three structural proteins (C, prM, and E) and seven non-structural proteins (NS1, NS2A, NS2B, NS3, NS4A, NS4B, and NS5) [6].
The evolutionary rate of POWV and time of the most recent common ancestor (tMRCA) were assessed in previous studies using different approaches based on Bayesian inference [7,8]. However, in the phylogenetic studies performed in the early 2010s, researchers often did not evaluate the temporal signal (a sufficient number of genetic changes between sampling times) [9] that particularly reduces the validity of the analysis, especially when incomplete genomic sequences and a narrow sampling window relative to total tree length are used [10].
Another pitfall of that king of analysis is that Bayesian phylogenetic software (e.g., BEAST) can generate nodes with high posterior support even when the data does not have a significant temporal signal. Therefore, the results of the previous studies of the POWV evolutionary rate and tMRCA estimation need more accurate assessment.
Other interesting aspects of POWV molecular epidemiology are the phylogeographic relationships between virus isolates. In previous genetic studies, researchers hypothesized that POWV could be introduced into the RFE from NA [11,12]. However, such studies used relatively small data sets maintained either partial gene sequences or a small number of complete genome sequences. A significant sequence data accumulation and the development of algorithms of phylogenetic analysis in recent years allow researchers to obtain more robust results. In particular, a new approach of structured coalescent approximation implemented in the MASCOT package [13] of the BEAST 2 [14] allows the assessment of phylogeographic relationships between virus populations with the dating of migration events by the times of the strains’ isolation. This method can be helpful for estimation in detail of the POWV distribution in the areas of its natural foci. As of June 10, 2020, 26 complete genomes of POWV with known isolation dates (1958–2019) are available in NCBI and ViPR. This number of complete genomes should be sufficient for dating the main events of the viral evolution in the phylogeographic analysis.
In this study, we aim to estimate temporal signal in the set of the POWV complete genomes, re-evaluate the evolutionary rate of the virus and reveal the phylogeographic relationships between POWV isolates. We also suggest a new simple Bayesian technique for the evaluation and visualization of ‘temporal signal dynamics’ along the phylogenetic tree. This technique would be an especially useful tool for phylogenetic analysis in cases when a sampling window is small relative to tree length that leads to underestimating the temporal signal of a whole tree even when tMRCA of young clades have enough signal to be calculated reliably.

2. Materials and Methods

2.1. Genomic Data Sampling

Nucleotide sequences of a POWV polyprotein gene (10,248 nt) were downloaded from NCBI and ViPR [15] databases (accessed in June 2021. In total, 26 sequences were used in the analysis. The dates of their isolation were taken from the literature and covered the period from 1958 to 2019 (Table 1).
Eight sequences were isolated in the RFE (Russian Far East) and the others were from NA (North America) (one from Canada and 17 from the USA). The RFE and NA groups were used as ‘states’ in the phylogeographic analysis.

2.2. Codon Saturation Measuring and Evolutionary Model Selection

Index of substitution saturation (Iss) in a set of aligned nucleotide sequences was calculated in DAMBE [16] and compared with critical Iss value (referred to as Iss.c [17]).
The best-fit substitution model was chosen according to the BIC score estimated by ModelFinder [18] implemented in IQTREE v1.6.12 [19].
The selection between strict and relaxed molecular clock was made based on the value of the coefficient of substitution rate variation (CV) obtained in a preliminary BEAST v2.6.3 run [14]. A total of 200 million MCMC generations were run, where every 5000th generation was sampled, and the first 10% of generations were discarded as burn-in. The effective sample size (ESS) for all parameters exceeded 200. A relaxed clock model with an uncorrelated lognormal distribution (UCLD) of substitution rates and constant population size prior were used. If 95% highest posterior density (HPD) of CV maintains a zero value, the strict molecular clock is used for the data set [20].
The consensus tree and the branch rate heterogeneity were visualized in FigTree.

2.3. Assessment of Temporal Structure of Heterochronous Sequences

The temporal signal in the data set was evaluated using the Bayesian model test—BETS (Bayesian Evaluation of Temporal Signal) where times of virus isolation were used as one of the model priors [10].
To perform BETS the marginal likelihood values for heterochronous and isochronous models (Mhet—a model with dated tree tips in BEAUti and Miso—a model without tips dating) should be calculated by any appropriate method. We evaluated marginal likelihood values using Path Sampling (PS) in BEAST 2 and ranked them according to their log Bayes factors (log BFs). Log BFs were interpreted according to the guidelines of Duchene, et al., (2020) [10], where a difference in log BF between models more than 5 indicates ‘very strong’ support.
We performed PS analysis both for the model with the correct sampling times (heterochronous data) and for the model without sampling times (isochronous tree). For both models, we used a strict clock and constant size model as a coalescent tree prior. For each model we set three PS runs with 200 steps per run. A total of 10 million MCMC iterations were set as pre-burning step, and 500,000 iterations were set for each step. The sufficient lengths of MCMC chains inside PS step and number of steps were defined by comparing the log marginal likelihood values obtained in different runs. If the difference among log marginal likelihood values in the runs was not significant, the parameters acknowledged to be sufficient for convergence. For more detail about prior settings for BETS, see Table S1.
Additionally, we performed a permutation test with 20 different random permuted between sequences isolation dates. For each of the 20 permutations, a BEAST analysis was performed with the selected parameters of the evolution model and molecular clock. In these analyses, tree root height and evolutionary rate values were determined. The results obtained were then compared with the analysis with correct isolation dates. If the root height and evolutionary rate values do not differ from the analysis with correct isolation dates, it indicates the absence of the temporal signal in the genomic dataset studied.
We additionally performed a permutation test to detect the temporal signal in the data set. We carried out 20 different random permutations between sequences isolation dates. For each permutation, BEAST analysis was performed with the selected parameters of the evolution model and molecular clock. If the root height and evolutionary rate values obtained in the permutation test do not differ from the results of the analysis using correct viral isolation dates, it means the absence of the temporal signal in the genomic dataset studied.

2.4. Temporal Signal Dynamics

Considering a narrow sampling window and relatively ancient root age, theoretically, the signal can be absent near the root and appear closer to younger clades [10].
To illustrate possible temporal signal dynamics along the POWV phylogenetic tree, we sampled tMRCA values from the prior and posterior distributions for the root and the first five clades of the tree were designated as A, B, C, D, and E clades (Figure 1).
The step-by-step tutorial prepared by us with a complete description of the temporal signal dynamics analysis scheme is available from: https://doi.org/10.6084/m9.figshare.16782484 (accessed on 6 December 2021).

2.5. Phylogeographic Analysis

The phylogeographic structure of POWV was studied using a structure coalescent approach and MASCOT package v1.2.2 [13] implemented in BEAST 2. The MASCOT was chosen as a tool capable of performing a robust analysis under biased sampling [21].
The number of MCMC generations for the MASCOT analysis was 100 million, where each 5000th entry was sampled (the total number of samples was 20,000).

3. Results

3.1. Assessment of Codon Substitution Saturation and the Best-Fit Substitution Model

Analysis of POWV polyprotein gene set in DAMBE [16] did not reveal substitution saturation both in the first and second codon positions (Iss = 0,10 and Iss.c = 0.82 at p-value = 0.00), and in the third codon position (Iss = 0.25 and Iss.c = 0.80 at p-value = 0.00), (Table S2), indicating this set appropriate for phylogenetic study.
The GTR + I was chosen as the best substitution model according to the BIC score value (see Table S3) and used for further BEAST analysis.

3.2. Molecular Clock Choosing

The preliminary run (GTR + I + UCLD + Constant size model) using 26 POWV sequences set revealed an unusually high value of the coefficient of variation (95% HPD, 0.25–1.47), considering that slowly evolving tick-borne flaviviruses do not have a high-rate variation among tree branches, as it was shown, for instance, for tick-borne encephalitis and louping-ill virus by Uzcátegui, et al., (2012) [22].
A branch of ‘ctb30’ strain on the consensus tree has an extraordinarily high evolutionary rate (1.6 × 10−4 substitutions per site per year) in comparison with the mean rate across the whole tree (Figure S1).
Such a high evolutionary rate value may be explained either with the longer passage history of the ‘ctb30’ strain (three times in suckling mice and two times in tissue culture) or by the errors in sequencing. Thus ‘ctb30′ strain was excluded from the analysis.
A new run of 25 sequence set led to the narrowing of 95% HPD (0–0.39), where the majority of values distributed around zero and a zero value was the lowest boundary of 95% HPD (Figure S2). Thus, the strict molecular clock model was chosen for further analysis of temporal signal assessment in this data set.

3.3. Comparing Prior and Posterior Distributions. Temporal Structure and Temporal Signal Dynamics

The BETS approach yielded very strong support for heterochronous over isochronous models (the lowest log BFs for SChet in the case of SChet over SCiso in all three runs were 30, see Table 2). This result indicates that the data unequivocally contain temporal signal.
The temporal signal near the tree root was indicated by the non-overlapping 95% quantile distributions of the root age values obtained for prior and posterior data (Figure 2, root).
In clades A and B (Figure 2, clades A, B) the 95% quantile width of prior and posterior distribution were overlapped, but the posterior distribution array was shifted from prior, which indicates the minor impact of temporal signal on tMRCAs in these clades.
Comparison of prior and posterior for tMRCA values of clades C, D, and E revealed that the temporal signal becomes stronger closer to the tips: posterior densities were much narrower than prior (Figure 2, clades C, D, and E).

3.4. Permutation Test

Permutation test showed that in 18 out of 20 runs, 95% HPD intervals of estimated evolutionary rates and tree heights did not overlap with intervals of values obtained considering the real isolation dates of viruses (Figure 3a). A similar result was shown for the root height values, where 95% HPD intervals of only two permuted runs were overlapped with values estimated from the real isolation dates (Figure 3b). Generally, it is consistent with the BETS results.

3.5. Evolutionary Rate Estimates

The mean evolutionary rate of a POWV polyprotein gene estimated by SC was 3.3 × 10−5 nucleotide substitutions per site per year (95% HPD, 2.0 × 10−5–4.7 × 10−5).

3.6. Divergence Dating

We specify here the dates of the root and nodes divergence as years dated back from the year of the most recent sample isolation (2019).
The tree root diverged in 4170 (95% HPD, 2600–6030). The mean divergence times for the clades A and B were 1160 (95% HPD, 760–1750), and 1100 years (95% HPD, 721–1619), respectively (Figure 1).
According to the analysis, the time of the root divergence is 4170 years (95% HPD, 2600–6030). The mean divergence times for the clades A and B are 1160 years (95% HPD, 760–1750), and 1100 years (95% HPD, 721–1619), respectively (Figure 1).
TMRCAs of the clades C, D, and E, are 135 years (95% HPD, 87–200), 97 years (95% HPD, 62–143), and 71 years (95% HPD, 62–87), respectively.

3.7. Phylogeography

We obtained an unequivocal assessment of the root state using MASCOT. The structured coalescent inference revealed similar probability values for both territories of the POWV spread (56% and 44% for NA and RFE, respectively; Figure 1). The estimation of the clade B node state was more reliable (62% and 38% for RFE and NA, respectively; Figure 1); however, it remained ambiguous. Other tree nodes have state probability of one.
The relative genetic diversity (Ne × τ) estimated for NA POWV was about 622, which is significantly higher than the appropriate value for RFE POWV (≈25). The longer internode patristic distances between the NA POWV isolates of clade A in comparison with that of clade B supports the data about the increased genetic diversity of the first one.
The migration of POWV occurs in both directions. The mean rate for RFE to NA migration is one event per 1750 years, and migrations in the opposite direction occur once per 475 years, respectively.

4. Discussion

The evolutionary rate of POWV (95% HPD, 2.0 × 10−5–4.7 × 10−5 substitutions per site per year) estimated in this work was much lower than values reported in the previous studies, based on the sequences of single genes or their fragments. Pesko, et al., (2010) [7] estimated the rates of accumulation of nucleotide substitutions in the genome region encoding proteins E and NS-5 to be 2.2 × 10−4 and 3.9 × 10−5 substitutions per site per year, respectively. Subbotina and Loktev (2012) [8] showed that rate of nucleotide substitutions per site per year for the E protein was 1.4 × 10−4 and for NS-5 it was 5.4 × 10−5. Thus, the previous values of the evolutionary rates for region encoding protein E were almost 10 times higher than our estimates for the coding part of the POWV genome. For the fragment encoding NS-5 protein, the rate estimates were close or slightly overlapped the upper limit of the confidence interval we obtained. Such an increased evolutionary rate in previous studies could have been obtained due to the absence of a preliminary evaluation of temporal signal, and the effect of priors, specified in BEAUti (the program used to design the analysis and generate the BEAST 2 project file).
The evolutionary rate of POWV estimated in this study is compatible with that of TBEV and LIV (1.0 × 10−5–2.2 × 10−5 for TBEV and 5.7 × 10−6–3.9 × 10−5 for LIV [22,23,24]). Tick-borne flaviviruses, such as POWV, TBEV, and LIV, are known as slowly evolving RNA-viruses. Many other RNA-viruses have one or two orders of magnitude higher substitution rates. For instance, the substitution rate in measles virus is 3.6 × 10−4–9.0 × 10−4 [25]; in Hepacivirus C virus is 9.0 × 10−4 [26]; Zika virus is 6.2 × 10−4–8.2 × 10−4 [27]; HIV-1 is 1 × 10−3–1.7 × 10−3 [28]; and SARS-CoV-2 is 1.3 × 10−3–2.0 × 10−3 [29].
The possible reasons for the lower evolutionary rate of tick-borne flaviviruses (POWV, LIV, and TBEV) in comparison with mosquito-borne flaviviruses (e.g., dengue virus, 8.9 × 10−4 substitutions per site per year [30]), and other RNA-viruses without arthropod vectors (e.g., influenza virus and HIV-1) may be the features of the tick vector biology. Tick–host interaction is a two-host system which imposed constraining effect on the virus evolution. Moreover, ticks have an extended life-cycle (3–5 years), as compared with mosquitoes, and accumulate the virus in the relatively low titers which, thereby, reduces the viral evolutionary rate [31].
Clade E, comprising all isolates from FER and one isolate from Canada, was recognized as the youngest virus group (95% HPD, 62–87). Moreover, tMRCA of the clade E was assessed most accurately as it has the narrowest 95% HPD. The chronology of the occurrence between clades D vs. C and D vs. E remains vague, because of the overlap of 95% HPD intervals (Figure S3).
The previous study of Leonova, et al., (2009) [12] proposed several hypotheses of POWV transmission from Canada to the Russian Far East. The hypothesis of the human-caused transmission of TBEV in the 20th century (the parental node of clade E—95% HPD, 61–82 years ago) by the transition of military cargo, troops, animals, etc., was admitted as more likely, than one of natural transmission by the wild birds. However, according to our estimations, the POWV split into the NA and FER genetic lineages, which happened in the period between 2600 and 6030 years ago (the mean value is 4170 years ago, the root state reconstruction), has the natural reason, such as global climatic changes. The ambiguity in the evaluation of the root state could be explained by the POWV MRCA origination at the border area in Beringia.
The geological studies showed that the Bering Land Bridge connected Eurasia (its FER part, in particular) and North America in the Pleistocene during the last glaciation from 11.72 to 30 thousand years ago [32,33]. So, Beringia could be a migration route for people, animals, and plants from Eurasia to North America and vice versa. The evidence of those migrations was obtained in population genetic studies of different animal species [34,35,36]. MRCA of the modern POWV could exist on the territory of RFE, NA, and Beringia as a single viral population. The main vectors of POWV are Ixodes cookie and I. scapularis ticks, which parasitize on deer and small rodents [37], which in turn were abundant in RFE, NA, and Beringia during the last Ice Age. The melting of glaciers began at 11.72 Kya in the Holocene due to the climate warming caused flooding of the isthmus between Eurasia and North America [38], and the gene flow between the two POWV groups decreased radically. From that time, the formation of two genetic lines of the virus started and was completed about 4000 years ago. However, the genetic contacts between RFE and NA groups of POWV did not entirely quit. Rare virus transmission events have occurred from 4000 years ago to the present. According to data of Ogden, et al., (2008) [39], I. scapularis can parasitize on migratory birds. Thus, the migratory birds carrying infected ticks may provide gene flows between POWV from RFE and NA.
The results of the analysis of migratory events between the RFE and NA POWV groups reported in our study are not entirely reliable, and a more detailed study requires a significant increase in the sample size of the genomes both in NA and RFE. However, our work results lead to an unambiguous conclusion about the presence of such recent migration events between the two POWV groups.
Some studies of viruses as well as organisms of other groups showed that in many cases there is the possibility of underestimating the age of the root of the phylogenetic tree using molecular clocks due to the effect of slowing down the rate of the molecular clock in the tree root area [40,41] and/or fixing adaptive mutations during the speciation and the development of a new ecological niche [42]. Wertheim, et al., (2013) [42], using the origin of corona viruses as an example, showed that when viruses change their hosts, a distortion of the molecular clock occurs with an underestimation of the age of ancient evolutionary events. In our case, when POWV separated from the closest ancestors, it assimilated a new ecological niche with a change in the host spectrum. Therefore, the divergence time of the POWV lineages 4000 years ago may be underestimated. So, the evolutionary events caused the formation of two POWV clades on the territory of RFE, NA, and Beringia may have occurred at 11.72 Kya.
The phylogeographic structure of POWV isolates showed in our study corroborates the conclusions of Heinze, et al., (2012) [11], where the glaciation is hypothesized as the main factor that contributed to the divergence POWV. The authors [11], also considered the POWV movement from Old World to New World, and strongly suggested southern Asia be a source of POWV. Our results make it possible to assume that the POWV ancestor most likely entered in its ancient area (FER, Beringia, and NA) from the Old World—the region of the origin of flaviviruses [11]. Therein, this ancestor, adapting both to the local conditions and spectrum of hosts (ticks and warm-blooded animals), evolved into POWV. Thus, we confirm the findings of Heinze, et al., (2012) [11].
The larger values of relative genetic diversity (≈615) of the NA POWV population in comparison with RFE POWV (≈25; Figure 1) imply better conditions for virus replication and spread in the NA territory. This may be explained by the absence of competition from the other tick-borne flaviviruses (e.g., TBEV distributed in Asia and Europe). POWV occupied the free local vectors (I. cookie and I. scapularis) and spread across the northern part of NA. In the RFE region, POWV theoretically can compete with TBEV via a cross-immune response in warm-blooded animals. Experiments have shown [43,44,45] that a wide range of flaviviruses, including POWV and TBEV, have partial cross-neutralization of viral particles by antibodies. Thus, if a warm-blooded animal has ever encountered TBEV and received an immune response, it can break the cycle of transmission of the POWV in the tick–warm-blooded animal–tick transmission chain. Therefore, POWV apparently loses in the competition to the more abundant TBEV in the given territory that leads to decreasing of POWV genetic diversity. On the other hand, the possible effect of a small sample size for POWV in the RFE territory, which can randomly distort the index of genetic diversity, should not be excluded. A more detailed answer to this question requires the collection of additional experimental materials.
The MASCOT revealed asymmetric migration rates between states, with more intense migration from NA to RFE direction. It is important to note that we assessed the divergence of the ancestral POWV population since the Beringia flood, and did not cover more ancient periods of viral population history. Thus, there is no contradiction between the conclusions in our work and that obtained by Heinze, et al., (2012) [11] about southern Asia being the source of tick-borne flaviviruses.

5. Conclusions

The mean evolutionary rate inferred for POWV based on Bayesian analysis of complete genome data was 3.3 × 10−5 nucleotide substitutions per site per year (95% HPD, 2.0–4.7), which is comparable with the estimates of substitution rate of other tick-borne flaviviruses such as TBEV and LIV, and significantly lower than the rate estimates yielded previously. The analysis of temporal dynamics showed the presence of a temporal signal in all branches of the POWV phylogenetic tree with the strongest data impact on the tMRCA evaluation of three clades (C, D, and E) (95% HPD intervals 87–200, 62–143, 62–87, respectively).
The MRCA of modern POWV split into two independent genetic lineages between 2600 and 6030 years ago as a result of the Beringia flood about 11.72 thousand years ago.
The absence of competition from other tick-borne flaviviruses for the free local vectors (I. cookie and I. scapularis) in NA had a positive impact on the higher genetic level of the genetic diversity of NA POWV (≈615) comparing with the RFE virus population (≈25). However, the difference in relative genetic diversity between NA and RFE populations of POWV may be caused by the lack of sequence data on the territory of Russia. Thus, in case of further RFE POWV isolation and genome sequencing, the accuracy of phylogeographic analysis can be increased.

Supplementary Materials

The following are available online at https://www.mdpi.com/article/10.3390/biology10121282/s1, Figure S1: Evolutionary rate heterogeneity, Figure S2: Comparison the coefficient of rate variation values, Figure S3: Divergence time of tMRCAs of the three main clades, Table S1: BETS priors for the different BEAST projects, Table S2: Codon saturation analysis results, Table S3: ModelFinder’s output, “POWV_Mascot.xml”—the BEAST xml-project (MASCOT package v.1.2.2 required), “POWV.fasta”—the fasta alignment used in the phylogeography analysis.

Author Contributions

Conceptualization, A.N.B. and Y.S.B.; methodology, A.N.B.; formal analysis, A.N.B.; writing—original draft preparation, A.N.B.; writing—review and editing, A.N.B., T.E.P., E.V.R., A.V.Y., E.I.A., and Y.S.B.; visualization, A.N.B. and A.V.Y.; supervision, E.I.A.; project administration, Y.S.B. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Ministry of Science and Higher Education of Russian Federation under projects: No. 0279-2021-0010.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data supporting reported results (nucleotide data sets, BEAST projects) are available from: https://doi.org/10.6084/m9.figshare.16782484 (accessed on 6 December 2021).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mc, L.D.; Donohue, W.L. Powassan virus: Isolation of virus from a fatal case of encephalitis. Can. Med. Assoc. J. 1959, 80, 708–711. [Google Scholar]
  2. Shi, J.; Hu, Z.; Deng, F.; Shen, S. Tick-Borne Viruses. Virol. Sin. 2018, 33, 21–43. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Kuno, G.; Artsob, H.; Karabatsos, N.; Tsuchiya, K.R.; Chang, G.J. Genomic sequencing of deer tick virus and phylogeny of powassan-related viruses of North America. Am. J. Trop. Med. Hyg. 2001, 65, 671–676. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Ebel, G.D. Update on Powassan virus: Emergence of a North American tick-borne flavivirus. Annu. Rev. Entomol. 2010, 55, 95–110. [Google Scholar] [CrossRef]
  5. Williams, H.; Thorburn, H. Serum antibodies to louping-ill virus. Scott. Med. J. 1962, 7, 353–355. [Google Scholar] [CrossRef]
  6. Mandl, C.W.; Holzmann, H.; Kunz, C.; Heinz, F.X. Complete genomic sequence of Powassan virus: Evaluation of genetic elements in tick-borne versus mosquito-borne flaviviruses. Virology 1993, 194, 173–184. [Google Scholar] [CrossRef]
  7. Pesko, K.N.; Torres-Perez, F.; Hjelle, B.L.; Ebel, G.D. Molecular epidemiology of Powassan virus in North America. J. Gen. Virol. 2010, 91, 2698–2705. [Google Scholar] [CrossRef]
  8. Subbotina, E.L.; Loktev, V.B. Molecular evolution of the tick-borne encephalitis and Powassan viruses. Mol. Biol. 2012, 46, 82–92. [Google Scholar] [CrossRef]
  9. Rambaut, A.; Lam, T.T.; Max Carvalho, L.; Pybus, O.G. Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen). Virus Evol. 2016, 2, vew007. [Google Scholar] [CrossRef] [Green Version]
  10. Duchene, S.; Lemey, P.; Stadler, T.; Ho, S.Y.W.; Duchene, D.A.; Dhanasekaran, V.; Baele, G. Bayesian Evaluation of Temporal Signal in Measurably Evolving Populations. Mol. Biol. Evol. 2020, 37, 3363–3379. [Google Scholar] [CrossRef]
  11. Heinze, D.M.; Gould, E.A.; Forrester, N.L. Revisiting the clinal concept of evolution and dispersal for the tick-borne flaviviruses by using phylogenetic and biogeographic analyses. J. Virol. 2012, 86, 8663–8671. [Google Scholar] [CrossRef] [Green Version]
  12. Leonova, G.N.; Kondratov, I.G.; Ternovoi, V.A.; Romanova, E.V.; Protopopova, E.V.; Chausov, E.V.; Pavlenko, E.V.; Ryabchikova, E.I.; Belikov, S.I.; Loktev, V.B. Characterization of Powassan viruses from Far Eastern Russia. Arch. Virol. 2009, 154, 811–820. [Google Scholar] [CrossRef]
  13. Muller, N.F.; Rasmussen, D.; Stadler, T. MASCOT: Parameter and state inference under the marginal structured coalescent approximation. Bioinformatics 2018, 34, 3843–3848. [Google Scholar] [CrossRef] [Green Version]
  14. Bouckaert, R.; Vaughan, T.G.; Barido-Sottani, J.; Duchene, S.; Fourment, M.; Gavryushkina, A.; Heled, J.; Jones, G.; Kuhnert, D.; De Maio, N.; et al. BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis. PLoS Comput. Biol. 2019, 15, e1006650. [Google Scholar] [CrossRef] [Green Version]
  15. Pickett, B.E.; Sadat, E.L.; Zhang, Y.; Noronha, J.M.; Squires, R.B.; Hunt, V.; Liu, M.; Kumar, S.; Zaremba, S.; Gu, Z.; et al. ViPR: An open bioinformatics database and analysis resource for virology research. Nucleic Acids Res. 2012, 40, D593–D598. [Google Scholar] [CrossRef]
  16. Xia, X. DAMBE7: New and Improved Tools for Data Analysis in Molecular Biology and Evolution. Mol. Biol. Evol. 2018, 35, 1550–1552. [Google Scholar] [CrossRef] [Green Version]
  17. Xia, X.; Xie, Z.; Salemi, M.; Chen, L.; Wang, Y. An index of substitution saturation and its application. Mol. Phylogenet. Evol. 2003, 26, 1–7. [Google Scholar] [CrossRef]
  18. Kalyaanamoorthy, S.; Minh, B.Q.; Wong, T.K.F.; von Haeseler, A.; Jermiin, L.S. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat. Methods 2017, 14, 587–589. [Google Scholar] [CrossRef] [Green Version]
  19. Nguyen, L.T.; Schmidt, H.A.; von Haeseler, A.; Minh, B.Q. IQ-TREE: A fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol. 2015, 32, 268–274. [Google Scholar] [CrossRef]
  20. Drummond, A.J.; Bouckaert, R.R. Bayesian Evolutionary Analysis with BEAST; Cambridge University Press: Cambridge, UK, 2015. [Google Scholar]
  21. Muller, N.F.; Rasmussen, D.A.; Stadler, T. The Structured Coalescent and Its Approximations. Mol. Biol. Evol. 2017, 34, 2970–2981. [Google Scholar] [CrossRef] [Green Version]
  22. Uzcátegui, N.Y.; Sironen, T.; Golovljova, I.; Jääskeläinen, A.E.; Välimaa, H.; Lundkvist, Å.; Plyusnin, A.; Vaheri, A.; Vapalahti, O. Rate of evolution and molecular epidemiology of tick-borne encephalitis virus in Europe, including two isolations from the same focus 44 years apart. J. Gen. Virol. 2012, 93, 786–796. [Google Scholar] [CrossRef]
  23. Adelshin, R.V.; Sidorova, E.A.; Bondaryuk, A.N.; Trukhina, A.; Sherbakov, D.Y.; White III, R.A.; Andaev, E.I.; Balakhonov, S.V. “886-84-like” tick-borne encephalitis virus strains: Intraspecific status elucidated by comparative genomics. Ticks Tick Borne Dis. 2019, 10, 1168–1172. [Google Scholar] [CrossRef]
  24. Clark, J.J.; Gilray, J.; Orton, R.J.; Baird, M.; Wilkie, G.; Filipe, A.D.S.; Johnson, N.; McInnes, C.J.; Kohl, A.; Biek, R. Population genomics of louping ill virus provide new insights into the evolution of tick-borne flaviviruses. PLoS Negl. Trop. Dis. 2020, 14, e0008133. [Google Scholar] [CrossRef]
  25. Furuse, Y.; Suzuki, A.; Oshitani, H. Origin of measles virus: Divergence from rinderpest virus between the 11th and 12th centuries. Virol. J. 2010, 7, 52. [Google Scholar] [CrossRef] [Green Version]
  26. Yuan, M.; Lu, T.; Li, C.; Lu, L. The evolutionary rates of HCV estimated with subtype 1a and 1b sequences over the ORF length and in different genomic regions. PLoS ONE 2013, 8, e64698. [Google Scholar] [CrossRef] [Green Version]
  27. Pettersson, J.H.; Bohlin, J.; Dupont-Rouzeyrol, M.; Brynildsrud, O.B.; Alfsnes, K.; Cao-Lormeau, V.M.; Gaunt, M.W.; Falconar, A.K.; de Lamballerie, X.; Eldholm, V.; et al. Re-visiting the evolution, dispersal and epidemiology of Zika virus in Asia. Emerg. Microbes Infect. 2018, 7, 79. [Google Scholar] [CrossRef]
  28. Worobey, M.; Watts, T.D.; McKay, R.A.; Suchard, M.A.; Granade, T.; Teuwen, D.E.; Koblin, B.A.; Heneine, W.; Lemey, P.; Jaffe, H.W. 1970s and ‘Patient 0’ HIV-1 genomes illuminate early HIV/AIDS history in North America. Nature 2016, 539, 98–101. [Google Scholar] [CrossRef] [Green Version]
  29. Boni, M.F.; Lemey, P.; Jiang, X.; Lam, T.T.; Perry, B.W.; Castoe, T.A.; Rambaut, A.; Robertson, D.L. Evolutionary origins of the SARS-CoV-2 sarbecovirus lineage responsible for the COVID-19 pandemic. Nat. Microbiol. 2020, 5, 1408–1417. [Google Scholar] [CrossRef]
  30. Wei, K.; Li, Y. Global evolutionary history and spatio-temporal dynamics of dengue virus type 2. Sci. Rep. 2017, 7, 45505. [Google Scholar] [CrossRef] [Green Version]
  31. McGuire, K.; Holmes, E.C.; Gao, G.F.; Reid, H.W.; Gould, E.A. Tracing the origins of louping ill virus by molecular phylogenetic analysis. J. Gen. Virol. 1998, 79 (Pt 5), 981–988. [Google Scholar] [CrossRef]
  32. Clark, P.U.; Dyke, A.S.; Shakun, J.D.; Carlson, A.E.; Clark, J.; Wohlfarth, B.; Mitrovica, J.X.; Hostetler, S.W.; McCabe, A.M. The Last Glacial Maximum. Science 2009, 325, 710–714. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Hopkins, D.M. Aspects of the paleogeography of Beringia during the late Pleistocene. In Paleoecology of Beringia; Academic Press: Cambridge, MA, USA, 1982; pp. 3–28. [Google Scholar]
  34. Meiri, M.; Lister, A.; Kosintsev, P.; Zazula, G.; Barnes, I. Population dynamics and range shifts of moose (Alces alces) during the Late Quaternary. J. Biogeogr. 2020, 47, 2223–2234. [Google Scholar] [CrossRef]
  35. Tamm, E.; Kivisild, T.; Reidla, M.; Metspalu, M.; Smith, D.G.; Mulligan, C.J.; Bravi, C.M.; Rickards, O.; Martinez-Labarga, C.; Khusnutdinova, E.K.; et al. Beringian standstill and spread of Native American founders. PLoS ONE 2007, 2, e829. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  36. Tedford, R.H.; Wang, X.; Taylor, B.E. Phylogenetic Systematics of the North American Fossil Caninae (Carnivora: Canidae). Bull. Am. Mus. Nat. Hist. 2009, 325, 1–218. [Google Scholar] [CrossRef]
  37. Drummond, R. Ticks and what You Can Do about Them; Wilderness Press: Berkeley, CA, USA, 2013; p. 80. [Google Scholar]
  38. Hu, A.; Meehl, G.A.; Otto-Bliesner, B.L.; Waelbroeck, C.; Han, W.; Loutre, M.-F.; Lambeck, K.; Mitrovica, J.X.; Rosenbloom, N. Influence of Bering Strait flow and North Atlantic circulation on glacial sea-level changes. Nat. Geosci. 2010, 3, 118–121. [Google Scholar] [CrossRef]
  39. Ogden, N.H.; Lindsay, L.R.; Hanincova, K.; Barker, I.K.; Bigras-Poulin, M.; Charron, D.F.; Heagy, A.; Francis, C.M.; O’Callaghan, C.J.; Schwartz, I.; et al. Role of migratory birds in introduction and range expansion of Ixodes scapularis ticks and of Borrelia burgdorferi and Anaplasma phagocytophilum in Canada. Appl. Environ. Microbiol. 2008, 74, 1780–1790. [Google Scholar] [CrossRef] [Green Version]
  40. Burridge, C.P.; Craw, D.; Fletcher, D.; Waters, J.M. Geological dates and molecular rates: Fish DNA sheds light on time dependency. Mol. Biol Evol 2008, 25, 624–633. [Google Scholar] [CrossRef] [Green Version]
  41. Ho, S.Y.; Larson, G. Molecular clocks: When times are a-changin’. Trends Genet. 2006, 22, 79–83. [Google Scholar] [CrossRef]
  42. Wertheim, J.O.; Chu, D.K.; Peiris, J.S.; Kosakovsky Pond, S.L.; Poon, L.L. A case for the ancient origin of coronaviruses. J. Virol. 2013, 87, 7039–7045. [Google Scholar] [CrossRef] [Green Version]
  43. Burgomaster, K.E.; Foreman, B.M.; Aleshnick, M.A.; Larman, B.C.; Gordon, D.N.; Maciejewski, S.; Morabito, K.M.; Ledgerwood, J.E.; Gaudinski, M.R.; Chen, G.L.; et al. Limited Flavivirus Cross-Reactive Antibody Responses Elicited by a Zika Virus Deoxyribonucleic Acid Vaccine Candidate in Humans. J. Infect. Dis. 2021, 224, 1550–1555. [Google Scholar] [CrossRef]
  44. Kemenesi, G.; Banyai, K. Tick-Borne Flaviviruses, with a Focus on Powassan Virus. Clin. Microbiol Rev. 2019, 32, e00106-17. [Google Scholar] [CrossRef] [Green Version]
  45. Rathore, A.P.S.; St John, A.L. Cross-Reactive Immunity Among Flaviviruses. Front. Immunol. 2020, 11, 334. [Google Scholar] [CrossRef] [Green Version]
Figure 1. The time scaled phylogenetic tree of POWV in NA and FER: The color of tree branches and nodes represent a state/territory (see legend in the top left corner). The change in colors is a virus migration event between the states/territories. Numbers to the left from main nodes are node state/territory posterior probability. Black asterisks show clades with posterior probability less than 0.8 (please, do not confuse with node state/territory posterior probability). In the other cases, posterior probability is more than 0.8. An insert is the relative genetic diversity (effective population size (Ne) × generation time (τ—time between two successful infections in the chain of transmission).
Figure 1. The time scaled phylogenetic tree of POWV in NA and FER: The color of tree branches and nodes represent a state/territory (see legend in the top left corner). The change in colors is a virus migration event between the states/territories. Numbers to the left from main nodes are node state/territory posterior probability. Black asterisks show clades with posterior probability less than 0.8 (please, do not confuse with node state/territory posterior probability). In the other cases, posterior probability is more than 0.8. An insert is the relative genetic diversity (effective population size (Ne) × generation time (τ—time between two successful infections in the chain of transmission).
Biology 10 01282 g001
Figure 2. The prior and posterior distributions of tMRCAs of root age and five main clades (clades (AE)). An insert within the root plot is boxplots of 95% quantile width of the tree root height.
Figure 2. The prior and posterior distributions of tMRCAs of root age and five main clades (clades (AE)). An insert within the root plot is boxplots of 95% quantile width of the tree root height.
Biology 10 01282 g002
Figure 3. Date permutation test results: The boxplots present 95% HPD of (a) evolutionary rate and (b) tree root height parameters. Analysis with correct isolation dates is shown as the orange bar. Horizontal dashed lines demonstrate overlapping areas of the analysis using correct dates and permuted replicates.
Figure 3. Date permutation test results: The boxplots present 95% HPD of (a) evolutionary rate and (b) tree root height parameters. Analysis with correct isolation dates is shown as the orange bar. Horizontal dashed lines demonstrate overlapping areas of the analysis using correct dates and permuted replicates.
Biology 10 01282 g003
Table 1. Sequences’ metadata.
Table 1. Sequences’ metadata.
GB_IDStrainCommentaryCountryYearHost
MF374486L_B Canada1958Human
KT224351LEIV_5530 Russia1970Ixodidae spp.
EU770575Spassk_9 Russia1975Dermacentor silvarum
MG652438Pow_24 Russia1975I. persulcatus
KT224350LEIV_3070Prm Russia1977Ixodidae spp.
EU670438Nadezdinsk_1991 Russia1991Human
HQ231415Ternay Russia2006Human
EU543649Partizansk/2006 Russia2006Human
HQ231414Ulysses Russia2006Human
HM44056364_7062 USA1964Tick
AF311056ctb30DTV subtype of POWVUSA1995-N/A-
HM440559NFS001 USA1996I. scapularis
HM440558wicf9901DTV subtype of POWVUSA1999I. scapularis
HM440561DTVWiB08DTV subtype of POWVUSA2008I. scapularis
HM440560DTVWiA08DTV subtype of POWVUSA2008I. scapularis
HM440562DTVWiC08DTV subtype of POWVUSA2008I. scapularis
KU886216P0375 USA2010I. scapularis
KJ746872LI_1 USA2013I. scapularis
MG647783RTS84 USA2016I. scapularis
MG647781RTS92 USA2016I. scapularis
MG647782RTS96 USA2016I. scapularis
MG647779RTS81 USA2016I. scapularis
MG647780RTS82 USA2016I. scapularis
MK309362MeW17_228 USA2017I. scapularis
MK104144MeC17_166 USA2017I. scapularis
MW001306D002 USA2019Human
Table 2. Log marginal likelihood values for heterochronous and isochronous models estimated by path sampling (PS) in BEAST 2.
Table 2. Log marginal likelihood values for heterochronous and isochronous models estimated by path sampling (PS) in BEAST 2.
ModelLog marginal Likelihood
PS Run 1PS Run 2PS Run 3
SChet 1−26,773.5−26,771.5−26,771.1
SCiso−26,808.8−26,808.5−26,807.6
1 The most appropriate model with the highest log marginal likelihood values.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Bondaryuk, A.N.; Peretolchina, T.E.; Romanova, E.V.; Yudinceva, A.V.; Andaev, E.I.; Bukin, Y.S. Phylogeography and Re-Evaluation of Evolutionary Rate of Powassan Virus Using Complete Genome Data. Biology 2021, 10, 1282. https://doi.org/10.3390/biology10121282

AMA Style

Bondaryuk AN, Peretolchina TE, Romanova EV, Yudinceva AV, Andaev EI, Bukin YS. Phylogeography and Re-Evaluation of Evolutionary Rate of Powassan Virus Using Complete Genome Data. Biology. 2021; 10(12):1282. https://doi.org/10.3390/biology10121282

Chicago/Turabian Style

Bondaryuk, Artem N., Tatiana E. Peretolchina, Elena V. Romanova, Anzhelika V. Yudinceva, Evgeny I. Andaev, and Yurij S. Bukin. 2021. "Phylogeography and Re-Evaluation of Evolutionary Rate of Powassan Virus Using Complete Genome Data" Biology 10, no. 12: 1282. https://doi.org/10.3390/biology10121282

APA Style

Bondaryuk, A. N., Peretolchina, T. E., Romanova, E. V., Yudinceva, A. V., Andaev, E. I., & Bukin, Y. S. (2021). Phylogeography and Re-Evaluation of Evolutionary Rate of Powassan Virus Using Complete Genome Data. Biology, 10(12), 1282. https://doi.org/10.3390/biology10121282

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop