Next Article in Journal
Seneca Valley Virus Degrades STING via PERK and ATF6-Mediated Reticulophagy
Next Article in Special Issue
A Managerial Approach towards Modeling the Different Strains of the COVID-19 Virus Based on the Spatial GeoCity Model
Previous Article in Journal
West Nile Virus, an Underdiagnosed Cause of Acute Fever of Unknown Origin and Neurological Disease among Hospitalized Patients in South Africa
Previous Article in Special Issue
Prevention and Control Are Not a Regional Matter: A Spatial Correlation and Molecular Linkage Analysis Based on Newly Reported HIV/AIDS Patients in 2021 in Jiangsu, China
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

HantaNet: A New MicrobeTrace Application for Hantavirus Classification, Genomic Surveillance, Epidemiology and Outbreak Investigations

1
Laboratory Branch, Division of HIV Prevention, Centers for Disease Control and Prevention, Atlanta, GA 30329, USA
2
Viral Special Pathogens Branch, Division of High Consequence Pathogens and Pathology, Centers for Disease Control and Prevention, Atlanta, GA 30329, USA
3
General Dynamics Information Technology, Atlanta, GA 30329, USA
*
Authors to whom correspondence should be addressed.
Viruses 2023, 15(11), 2208; https://doi.org/10.3390/v15112208
Submission received: 30 September 2023 / Revised: 27 October 2023 / Accepted: 27 October 2023 / Published: 2 November 2023
(This article belongs to the Special Issue Infectious Disease Epidemiology and Transmission Dynamics 2.0)

Abstract

:
Hantaviruses zoonotically infect humans worldwide with pathogenic consequences and are mainly spread by rodents that shed aerosolized virus particles in urine and feces. Bioinformatics methods for hantavirus diagnostics, genomic surveillance and epidemiology are currently lacking a comprehensive approach for data sharing, integration, visualization, analytics and reporting. With the possibility of hantavirus cases going undetected and spreading over international borders, a significant reporting delay can miss linked transmission events and impedes timely, targeted public health interventions. To overcome these challenges, we built HantaNet, a standalone visualization engine for hantavirus genomes that facilitates viral surveillance and classification for early outbreak detection and response. HantaNet is powered by MicrobeTrace, a browser-based multitool originally developed at the Centers for Disease Control and Prevention (CDC) to visualize HIV clusters and transmission networks. HantaNet integrates coding gene sequences and standardized metadata from hantavirus reference genomes into three separate gene modules for dashboard visualization of phylogenetic trees, viral strain clusters for classification, epidemiological networks and spatiotemporal analysis. We used 85 hantavirus reference datasets from GenBank to validate HantaNet as a classification and enhanced visualization tool, and as a public repository to download standardized sequence data and metadata for building analytic datasets. HantaNet is a model on how to deploy MicrobeTrace-specific tools to advance pathogen surveillance, epidemiology and public health globally.

1. Introduction

Hantaviruses are rodent-borne, negative-sense, tri-segmented RNA viruses that cause pulmonary and renal complications in infected humans [1,2,3]. Human infection typically occurs by contact with host reservoir excreta [2], but limited human-to-human transmission has been documented for the Andes virus (ANDV) species [4,5,6,7,8]. Hantaviruses are roughly organized into Old and New World designations based on viral genomics, geographic distribution and symptom manifestation [9,10,11,12,13]. Old World hantaviruses, like Seoul virus (SEOV), are typically distributed in Asia and Europe, with limited locations in the United States (US) and Canada, and cause hemorrhagic fever with renal syndrome (HFRS) [3,14,15,16]. Additional Old World hantaviruses causing HFRS include Hantaan, Dobrava-Belgrade, Puumala and Tula orthohantaviruses, which are distributed in Central/Eastern Europe, Asia and Europe [13,16,17]. New World hantaviruses cause hantavirus cardiopulmonary syndrome (HPS) and are predominantly found in the Americas [2,18,19], associated with Sin Nombre virus (SNV) species infections in the US [20,21,22,23], and ANDV in South America [7,24,25]. Across the Americas, additional New World hantaviruses causing human disease include Bayou, Black Creek Canal, Monongahela [26,27] and New York orthohantaviruses [28] in North America [11,13,17] and Choclo, Araraquara, Laguna Negra, Lechiguanas, Oran and Tunari orthohantaviruses in South America [11,17,29]. Additionally, hantaviruses have been discovered in Africa, but human infections have not been well documented yet [30].
From 1993 to 2021, 850 hantavirus cases were reported in the US [31]. These cases were laboratory-confirmed by the presence of hantavirus-reactive IgM, rising hantavirus-specific IgG titers, presence of hantavirus RNA, or hantavirus-reactive immunohistochemistry and symptoms consistent with hantavirus infection [32]. A few imported human cases of ANDV associated with travel to South America have also been reported in the US [4,32]. True HPS disease burden may be underestimated due to mild patient symptoms and unrecognized human and/or rodent host disease prevalence [2,19].
The tri-segmented hantavirus genome consists of the small (S), medium (M) and large (L) segments that encode nucleoprotein, glycoprotein and RNA polymerase, respectively [1,2]. Sequencing evidence from isolated hantavirus outbreaks demonstrates the utility of using hantaviral sequence information for molecular epidemiology. For example, Martinez et al. [6] used ANDV sequence data to demonstrate human-to-human ANDV transmission during a 2018–2019 outbreak in Argentina. Goodfellow et al. [33] and Kjemtrup et al. [23] demonstrated the geographic relatedness of human and rodent hantaviral sequences, and that molecular epidemiology can allow for predictive exposure risks as new hantaviral sequences are acquired. Centers for Disease Control and Prevention (CDC) collaborations with state public health laboratories and academia during hantavirus outbreaks have facilitated a broader picture of hantavirus diversity and spread [32,34,35]. However, there is a need for better coordination of nationwide hantavirus sampling in human and rodent hosts with real-time data integration, analytics and reporting tools to support surveillance, epidemiology and outbreak response. For instance, Kim et al. [36,37] highlighted the need for collaboration between health experts and bioinformaticians to improve hantavirus genomic epidemiology and surveillance, and suggested the use of advanced and accessible bioinformatic tools that allow for real-time data integration, visualization and analysis.
To fill this gap, we developed and validated a new MicrobeTrace-powered visualization tool called HantaNet, using published hantavirus sequence and contact tracing data [6,32,34]. MicrobeTrace is a secure web browser tool developed at the CDC [38] that remains operational without an internet connection and facilitates data integration and analysis with a suite of customizable views, including networks, phylogenetic trees, world maps, tables and charts that could be tailored as a dashboard view for real-time data analytics and reporting. MicrobeTrace has been widely used to integrate, visualize and analyze data from several pathogens, including HIV [39,40,41,42,43], hepatitis [44,45], tuberculosis [46] and SARS-CoV-2 [47,48,49]. We describe the expansion of MicrobeTrace as a pathogen-specific tool for genomic surveillance and variant classification. HantaNet is a free, secure and user-friendly web tool (https://cdcgov.github.io/HantaNet/ (accessed on 23 October 2023)) to integrate, visualize and analyze molecular and epidemiological data for rapid public health action.

2. Materials and Methods

2.1. Database Search and Standardization of Hantavirus Data for Use in HantaNet

We searched the National Center for Biotechnology Information (NCBI) GenBank database (https://www.ncbi.nlm.nih.gov/genbank/ (accessed on 28 June 2022)) for all hantavirus S, M and L nucleotide sequences collected prior to 2021 using an in-house Python script (https://github.com/evk3/hantavirus_US_distribution/blob/main/DL_Hanta_genomes.ipynb (accessed on 28 June 2022)). The S (1287 nucleotides (nt)), M (3423 nt) and L (6474 nt) coding sequences were aligned separately with the Multiple Alignment using Fast Fourier Transform (MAFFT) online tool [50,51] to generate FASTA alignments, which were manually inspected in BioEdit [52]. These preliminary FASTA alignments were loaded into MicrobeTrace to explore the data and estimate clustering genetic distance thresholds of Old and New World hantaviruses (the unreleased version is available at https://mossy426.github.io/HantaNet/ (accessed on 4 March 2022)). To build the hantavirus reference datasets, we searched the Bacterial and Viral Bioinformatics Resource Center (BV-BRC) database (https://www.bv-brc.org/ (accessed on 28 June 2022)) to retrieve curated S, M and L gene sequences associated with genomes representative of the US hantavirus strains. We loaded multiple sequence alignments with partial and full-length sequences to test their clustering in MicrobeTrace. This step was useful to perform a quality assessment of the reference alignments, and to exclude poor-quality and short-length (<400 nt) sequences from the reference datasets. We created a standardized and shareable CSV template file for hantaviral sequence and metadata submission to HantaNet (https://github.com/CDCgov/HantaNet/wiki/Sequence-and-Metadata-Submission-Form (accessed on 23 October 2023)) after data consolidation with Microsoft Excel and Tableau Desktop (version 2021.4.3) software tools.

2.2. Building Hantavirus Reference Gene Modules in MicrobeTrace for HantaNet

The open-source CDC MicrobeTrace GitHub repository (https://github.com/CDCgov/MicrobeTrace (accessed on 4 March 2022)) was cloned to create the HantaNet GitHub repository (https://github.com/CDCgov/HantaNet (accessed on 23 October 2023)). The three reference FASTA alignments and standardized metadata were initially loaded into three separate MicrobeTrace sessions to build the S, M and L visualization modules, respectively. These three modules were built using the Tamura-Nei 93 (TN93) nucleotide substitution model [53] implemented in MicrobeTrace to compute the pairwise genetic distances between the aligned sequences [54,55] and to empirically estimate the best genetic distance threshold for hantavirus strain clustering consistent with S, M and L gene segment phylogenies [10,32,56,57]. For example, if two different strains cluster together, the genetic distance cutoff can easily be decreased in HantaNet to remove the genetic links and separate the strains for determining the distance threshold. Once validated, the S, M and L modules were saved as three MicrobeTrace session files in the HantaNet GitHub repository where the application is hosted.

2.3. Validation of the HantaNet Tool

Any modern web browser can be used to open the HantaNet tool (https://cdcgov.github.io/HantaNet/ (accessed on 23 October 2023)) after accepting the end-user license agreement. Depending on the gene segment to be analyzed, the user first selects the pre-loaded reference module of interest to launch the default 2D network analysis view (see Video S1 published at https://www.youtube.com/watch?v=7j2KZ4tLStU (accessed on 23 October 2023) for a demonstration of the HantaNet tool). New hantavirus test sequences and metadata from US genomic surveillance provided by Whitmer et al. [32] were used to validate the HantaNet tool and confirm hantavirus clustering and classification with the reference dataset. Briefly, we downloaded the S, M and L reference datasets from HantaNet and aligned the new sequences to each reference alignment using the “Add fragmentary sequence(s) to existing alignment” option in MAFFT, which aligns the new sequences against the references, while keeping the reference alignment intact (refer to our protocol at https://github.com/CDCgov/HantaNet/wiki/Sequence-Alignment-Protocols (accessed on 4 October 2022)). This step ensured that the sequence alignments were standardized before loading into HantaNet for reproducible distance calculations and comparable results. We also added the new metadata to the standardized template, as described above. The FASTA alignments and metadata were loaded into HantaNet, and the specific genetic distance threshold previously determined for each gene segment was set up to cluster the sequences by hantavirus strain in the 2D network analysis view. The genetic relatedness and clustering of the hantavirus sequences in the 2D networks at the identified distance thresholds were confirmed in the phylogenetic tree view in HantaNet [58] by comparison with published hantavirus phylogenies [10,32,56,57]. We also built phylogenetic trees using the neighbor-joining method in MEGA X [59,60] and loaded the Newick tree output into HantaNet, which calculates the patristic distances from the Newick tree branch lengths to generate the network [61,62]. The new genetic links from the calculated patristic distances were visualized in HantaNet using the 2D network and the phylogenetic tree views.
We further validated HantaNet for building transmission networks by loading published sequence and contact tracing data from two different outbreak investigations [6,34]. In order to build and analyze the transmission networks in HantaNet, we applied the nearest neighbor (NN) algorithm to construct the minimum spanning tree (MST), which determines the shortest genetic distance path through the network [63,64]. The link width in the NN network was adjusted to be proportional to the genetic distance to easily visualize the genetic differences between sequences. The networks generated in HantaNet were compared to the published networks [6,34].

2.4. HantaNet Improvements and New Features

We built functionality in HantaNet to add new icons in the networks to represent the origin of sequences as a human, animal or viral isolate. These options are customizable, and more icons can be added per user preferences (please refer to our protocol at https://github.com/CDCgov/HantaNet#adding-custom-symbols-png-jpg-and-svg-to-be-used-in-the-github-web-application---collaborators-only (accessed on 4 October 2022)). We also added a new directional arrow feature in the network view to allow users to further characterize potential transmission directionality. This is an optional feature that can be turned on or off for unidirectional and bidirectional arrows to visualize the order of transmission events. We modified polygon labels to keep the text font case as entered in the metadata, which is useful for proper display of strain acronyms (e.g., SNV, SEOV) and geographic fields (e.g., country and state abbreviations). More view tabs can be added to the dashboards and are customizable. We also modified MST calculations for improved NN network display. We demonstrate how users can easily built interactive dashboards in HantaNet by integrating genetic, temporal and geographic data. The NN network can also be overlaid with map views and a timeline for spatiotemporal analysis.

3. Results

3.1. Validation of HantaNet for Classification of Hantaviruses

We built the HantaNet web tool by adapting MicrobeTrace to integrate curated S, M and L nucleotide sequence alignments and standardized metadata from US hantaviruses isolated from human and animal hosts (1984–2016). Table 1 contains the list of the 41 hantavirus genomes used to build the three HantaNet gene segment modules. The hantavirus genome sequences are publicly available in the GenBank database and the curated reference alignments can be downloaded from HantaNet (Figure 1).
The TN93 genetic distance threshold for clustering hantavirus species was empirically determined for each of the modules using known strain phylogenetic clustering and ranged from 12 to 20% genetic divergence (0.120–0.200 nucleotide substitutions per site) based on the loaded S, M and L reference alignments. The default TN93 distance cutoff selected in HantaNet was 0.127 (12.7%) for the S segment, 0.200 (20%) for the M segment and 0.144 (14.4%) for the L segment modules in HantaNet. Phylogenetic trees were also built in HantaNet and compared to published hantavirus phylogenies [10,32,56,57] to validate the clustering and genetic distances of 12 hantavirus species in each of the modules. The final S module (Figure S1) had representative sequences of the 12 hantavirus strains (ANDV, n = 2; BAYV, n = 3; BCCV, n = 2; BRV, n = 2; ELMCV, n = 2; ILV, n = 3; MGLV, n = 2; MULV, n = 2; NYV, n = 3; PHV, n = 4; SEOV, n = 19; SNV, n = 23). The M module (Figure 2) included 10 strains (ANDV, n = 2; BAYV, n = 2; BCCV, n = 1; BRV, n = 2; ELMCV, n = 2; MGLV, n = 1; NYV, n = 4; PHV, n = 2; SEOV, n = 22; SNV, n = 28), and the L module (Figure S2) had 8 strains (ANDV, n = 2; BAYV, n = 2; BCCV, n = 1; MGLV, n = 1; NYV, n = 1; PHV, n = 2; SEOV, n = 18; SNV, n = 16). The S, M and L networks were saved as separate MicrobeTrace sessions (Figure 1) in the cloned MicrobeTrace GitHub repository built for HantaNet (https://github.com/CDCgov/HantaNet (accessed on 23 October 2023)).

3.2. Genomic and Epidemiology Analyses of Hantaviruses Using HantaNet

We validated HantaNet using data available from three hantavirus studies [6,32,34].
Case Study 1. HantaNet was validated with 21 genome sequences of hantaviruses isolated from human hosts during US genomic surveillance (1999–2021). At the 0.2 genetic distance cutoff (Figure 2), the clustering of the 21 new M gene sequences (yellow nodes) in the 2D network revealed that they are SNV (n = 18), MGLV (n = 2) and NYV (n = 1) hantavirus strains. When the genetic distance cutoff was decreased to 0.127 in HantaNet (Figure 3A), two SNV subgroups were observed, SNV1 (n = 3) and SNV2 (n = 8). Our phylogenetic analysis supported the network clustering in HantaNet (Figure 2 and Figure S3), and our results were consistent with the hantavirus phylogenetic clustering reported by Whitmer et al. [32]. Genetic distance and clustering differences were also observed among the S, M and L gene segments, as discussed by Whitmer et al. [32].
We performed spatiotemporal analysis to show the geographic location of the new M sequences compared to the reference dataset. First, we applied the NN algorithm, which finds the shortest genetic distance route in a network, reducing the inferred genetic links from 426 to 52 (Figure 3A). We retrieved the calculated genetic distances of the closest SNV sequences from the table view in HantaNet (Table 2) to confirm their SNV classification, as reported by Whitmer et al. [32]. To incorporate geographic data, the MST was overlaid on the map, using the map view in HantaNet to visualize the geographic distribution of SNV1 and SNV2 variants (Figure 3B).
Case Study 2. HantaNet was used to plot the genetic and contact tracing data from the 2018-2019 ANDV outbreak among 34 persons in Argentina reported by Martinez et al. [6], and the transmission network was inferred (Figure 4). The first reported case of the ANDV outbreak occurred in early November 2018 in Chubut Province, Argentina. The HantaNet transmission network reveals three major transmission events, as previously described [6] and represented by large human icons (ARG/1, ARG/2, ARG/9). According to the report, the first transmission event in November 2018 occurred when Patient 1 (ARG/1) was symptomatic and attended a party, where five other persons became infected (ARG/2-ARG/6). Patient 2 (ARG/2) was the spouse of Patient 9 (ARG/9), who also became infected together with five other persons (ARG/7-ARG/8, ARG/11-ARG/13) during the second transmission event in December 2018. The third transmission event in January 2019 occurred after Patient 2 died (ARG/2). Ten additional persons (ARG/15-ARG/19, ARG/22-ARG/24, ARG27, ARG/30) became infected while attending the wake of Patient 2 (ARG/2) and being exposed to Patient 9 (ARG/9), who was symptomatic.
Case Study 3. The transmission network for the 2017 SEOV outbreak at US and Canada facilities (home-based ratteries, local pet stores and homes) by Knust et al. [34] was inferred using HantaNet (Figure 5). This outbreak involved 31 facilities in 11 states and 2 confirmed facilities in Canada. Three major clusters of confirmed SEOV cases were observed in Illinois (n = 12), Wisconsin (n = 5) and Utah (n = 3), as shown in the inferred transmission network (Figure 5). Pet rats were commonly exchanged between US facilities, and confirmed rat movement is shown with black directional arrows. Twenty-eight facilities confirmed infected rats, and three facilities were associated with human infections (facilities 26, 30, 31). The collected viral sequences were highly similar to SEOV isolated from the English Cherwell clade, suggesting a possible importation to the US and Canada from the UK or Europe [34]. The network shows likely rat movement between Canadian facilities (nodes A and B) and several US facilities (nodes 9, 10, 12, 17 and 28), indicating potential transmission events. Unconfirmed rat movement is shown with orange directional arrows.

4. Discussion

MicrobeTrace was originally developed as a browser-based web application for secure, local HIV surveillance data analysis for cluster detection and outbreak response [40]. Its use has expanded for the study of other pathogen outbreaks [38,46,47,48,49]. During the 2022–2023 global mpox outbreak, the MicrobeTrace team at the CDC built a local Nextstrain [65] and MicrobeTrace pipeline for continued US mpox genomic surveillance and to support secure cluster analysis of HIV–mpox coinfections [66]. The browser-based compatibility of MicrobeTrace offers the advantage of analyzing sensitive data locally without transmitting information across the internet. We developed HantaNet (https://cdcgov.github.io/HantaNet/ (accessed on 23 October 2023)) as a use case for the rapid deployment of MicrobeTrace as a free and secure, pathogen-specific platform for data integration, analysis, visualization and data sharing. We show that MicrobeTrace is easily adaptable to study specific pathogens and new outbreaks and offers the flexibility to add new features and tool modifications based on user preferences. We validated HantaNet as a hantavirus classification and genomic epidemiology tool using genomic surveillance sequences provided by Whitmer et al. [32] and published sequence and contact tracing data [6,34] to create different visualizations, including networks, phylogenetic trees, maps and dashboards. During this process, we developed enhancements to HantaNet that have since been deployed in MicrobeTrace. For outbreak investigations, users can empirically determine the genetic distance threshold used for detection of transmission chains and clusters for a specific hantavirus involved in an outbreak by using a slider bar in the settings menu. HantaNet improves transmission network visualizations by the addition of icons and directional arrows. Like MicrobeTrace, HantaNet displays individual genetic and epidemiologic links as colored solid lines. Moreover, dashed lines represent links supported by both genetic and epidemiologic data, and the new directional arrow feature in the 2D network view shows the order of transmission events, which simplifies and improves the network visualization and comprehension of transmission during an investigation.
We validated the genetic relatedness for classification of hantaviruses in HantaNet by building genetic networks and phylogenetic trees, which were consistent with published hantavirus gene phylogenies [10,32,56,57]. While phylogenetic trees and networks are two different visualizations of the same data, transmission networks and clustering are generally both easier to understand and to explore than trees because network diagrams are intuitive, and the distance cutoff in HantaNet can easily be changed within a session. For example, we showed that hantaviral strain clusters are resolved with different S, M and L genetic distance thresholds. Genetic networks are also useful to simplify visualization of gene reassortment [67,68,69,70], which is a mechanism for the evolution of viruses with segmented genomes [71,72,73,74]. While visualization of gene reassortment is mostly performed with phylogenetic and hybrid plot methods [75,76], these tools require programming skills and are more time-consuming and less efficient when compared to genetic network building tools like MicrobeTrace [38]. New MicrobeTrace-powered tools, like HantaNet, could be deployed for the analysis and visualization of segmented genomes for comparison with existing methods to detect gene reassortment in other pathogens.
We showed that building and visualizing networks in HantaNet is a fast and reliable method for the study of contact tracing and sequencing data from outbreak investigations. We also demonstrated that HantaNet could be used for hantavirus genomic surveillance with the rapid creation of dashboards useful to further explore the spatiotemporal relationships of strain variants and potential human exposure hotspots based on viral sequence similarity. HantaNet and other MicrobeTrace-related applications could be modified for phylogeographic studies by integrating Markov chain (discrete or continuous trait models) and coalescent models for ancestral reconstruction [77], commonly used in the Bayesian Evolutionary Analysis Sampling Trees (BEAST) method [78,79,80] and Nextstrain platform [65,81,82] for phylogenetic inference. Bayesian inference methods can be time-consuming, but more efficient approaches to build time-scaled phylogenies like TreeTime are available as open-source code [83] and could be integrated into HantaNet and MicrobeTrace. Phylogeography could be useful to study hantavirus outbreaks and report changes that occur in host–virus distribution during genomic surveillance, including county-level data to determine if hantavirus strains are confined by state and national boundaries.
We designed HantaNet to be a single, unified platform for hantavirus data standardization, integration, visualization, analytics and sharing. While there are limited hantavirus sequences currently available in public databases like GenBank, we envision that HantaNet will provide a framework for rapid hantavirus data submission to improve data sharing and the availability of sequences in public repositories. HantaNet could also serve as a repository for rapid sequence and metadata sharing among the CDC, state public health laboratories and academic partners. To encourage timely hantaviral data analysis and sharing, the HantaNet GitHub project page provides instructions on how to use the tool (https://github.com/CDCgov/HantaNet/wiki (accessed on 23 October 2023)) and to submit new sequence and metadata to HantaNet using the standardized submission form (https://github.com/CDCgov/HantaNet/wiki/Sequence-and-Metadata-Submission-Form (accessed on 23 October 2023)). Also, users interested in adding new reference alignments to include other hantaviral strains can fork the current HantaNet GitHub repository to replace the S, M and L MicrobeTrace session files with the new FASTA alignments and standardized metadata. HantaNet users will benefit from using these resources to easily download standardized hantavirus metadata and sequence alignments to build their datasets for rapid data visualization, analysis and reporting during outbreak investigations.
In summary, HantaNet is a model on how to develop secure and freely accessible MicrobeTrace-powered tools to improve public health investigations. These pathogen-specific tools could also be deployed in less developed countries to include the study of neglected diseases. Deployment and use of MicrobeTrace applications will help overcome socioeconomical and geographical barriers to the advancement of public health infrastructure worldwide.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/v15112208/s1. Figure S1: HantaNet dashboard built with the 2D network and unrooted phylogenetic tree views in the small (S, nucleoprotein) reference gene module; Figure S2: HantaNet dashboard built with the 2D network and unrooted phylogenetic tree views in the large (L, RNA polymerase) reference gene module; Figure S3: HantaNet dashboard built with the 2D network and unrooted phylogenetic tree views in the medium (M, glycoprotein) reference gene segment module; Video S1: HantaNet tool demonstration video.

Author Contributions

Conceptualization, R.C., S.L.M.W. and E.M.C.; Data curation, R.C.; Formal analysis, R.C., S.L.M.W., E.M. and R.K.; Investigation, S.L.M.W., E.T. and M.M.; Methodology, R.C.; Project administration, R.C. and S.L.M.W.; Software, E.M., E.M.C., R.K. and K.W.C.; Supervision, J.D.K. and W.M.S.; Validation, R.C. and S.L.M.W.; Visualization, R.C., E.M. and R.K.; Writing—original draft, R.C., S.L.M.W., E.M. and E.S.; Writing—review and editing, R.C., S.L.M.W., E.M., E.M.C., R.K., E.T., M.M., K.W.C., E.S., A.S., J.M.M., J.D.K. and W.M.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the U.S. Department of Health and Human Services.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The sequence data used to build the HantaNet modules are publicly available in the GenBank database and can be downloaded at https://cdcgov.github.io/HantaNet/ (accessed on 23 October 2023). Additional data presented in this study are available upon request from the corresponding authors.

Acknowledgments

We are thankful to our colleagues in the Division of HIV Prevention, the Division of High Consequence Pathogens and Pathology and our user base in public health and academia for providing valuable feedback and features for HantaNet. Thanks to Anna Niewiadomska and the BV-BRC for helpful discussions on hantavirus data curation. The findings and conclusions in this article are those of the authors and do not necessarily represent the views of the CDC. The use of trade names and commercial sources is for identification only and does not imply endorsement by the CDC.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Jonsson, C.B.; Figueiredo, L.T.; Vapalahti, O. A Global Perspective on Hantavirus Ecology, Epidemiology, and Disease. Clin. Microbiol. Rev. 2010, 23, 412–441. [Google Scholar] [CrossRef] [PubMed]
  2. Singh, P.; Talwar, P.; Palaniyandi, R. Hantavirus Pulmonary Syndrome (HPS): A Concise Review Based on Current Knowledge and Emerging Concept. J. Appl. Pharm. Sci. 2014, 4, 122–130. [Google Scholar]
  3. Avšič-Županc, T.; Saksida, A.; Korva, M. Hantavirus Infections. Clin. Microbiol. Infect. 2019, 21s, e6–e16. [Google Scholar] [CrossRef] [PubMed]
  4. Kofman, A.; Eggers, P.; Kjemtrup, A.; Hall, R.; Brown, S.M.; Morales-Betoulle, M.; Graziano, J.; Zufan, S.E.; Whitmer, S.L.M.; Cannon, D.L.; et al. Notes from the Field: Contact Tracing Investigation after First Case of Andes Virus in the United States—Delaware, February 2018. MMWR Morb. Mortal. Wkly. Rep. 2018, 67, 1162–1163. [Google Scholar] [CrossRef] [PubMed]
  5. Alonso, D.O.; Pérez-Sautu, U.; Bellomo, C.M.; Prieto, K.; Iglesias, A.; Coelho, R.; Periolo, N.; Domenech, I.; Talmon, G.; Hansen, R.; et al. Person-to-Person Transmission of Andes Virus in Hantavirus Pulmonary Syndrome, Argentina, 2014. Emerg. Infect. Dis. 2020, 26, 756–759. [Google Scholar] [CrossRef] [PubMed]
  6. Martínez, V.P.; Di Paola, N.; Alonso, D.O.; Pérez-Sautu, U.; Bellomo, C.M.; Iglesias, A.A.; Coelho, R.M.; López, B.; Periolo, N.; Larson, P.A.; et al. “Super-Spreaders” and Person-to-Person Transmission of Andes Virus in Argentina. N. Engl. J. Med. 2020, 383, 2230–2241. [Google Scholar] [CrossRef] [PubMed]
  7. Toledo, J.; Haby, M.M.; Reveiz, L.; Sosa Leon, L.; Angerami, R.; Aldighieri, S. Evidence for Human-to-Human Transmission of Hantavirus: A Systematic Review. J. Infect. Dis. 2022, 226, 1362–1371. [Google Scholar] [CrossRef] [PubMed]
  8. Bellomo, C.M.; Alonso, D.O.; Pérez-Sautu, U.; Prieto, K.; Kehl, S.; Coelho, R.M.; Periolo, N.; Di Paola, N.; Ferressini-Gerpe, N.; Kuhn, J.H.; et al. Andes Virus Genome Mutations That Are Likely Associated with Animal Model Attenuation and Human Person-to-Person Transmission. mSphere 2023, 8, e0001823. [Google Scholar] [CrossRef]
  9. Ermonval, M.; Baychelier, F.; Tordo, N. What Do We Know about How Hantaviruses Interact with Their Different Hosts? Viruses 2016, 8, 223. [Google Scholar] [CrossRef]
  10. Rissanen, I.; Stass, R.; Zeltiña, A.; Li, S.; Hepojoki, J.M.; Harlos, K.; Gilbert, R.J.C.; Huiskonen, J.T.; Bowden, T.A. Structural Transitions of the Conserved and Metastable Hantaviral Glycoprotein Envelope. J. Virol. 2017, 91, e00378-17. [Google Scholar] [CrossRef]
  11. Laenen, L.; Vergote, V.; Calisher, C.H.; Klempa, B.; Klingström, J.; Kuhn, J.H.; Maes, P. Hantaviridae: Current Classification and Future Perspectives. Viruses 2019, 11, 788. [Google Scholar] [CrossRef]
  12. Parvate, A.; Williams, E.P.; Taylor, M.K.; Chu, Y.K.; Lanman, J.; Saphire, E.O.; Jonsson, C.B. Diverse Morphology and Structural Features of Old and New World Hantaviruses. Viruses 2019, 11, 862. [Google Scholar] [CrossRef] [PubMed]
  13. Abudurexiti, A.; Adkins, S.; Alioto, D.; Alkhovsky, S.V.; Avšič-Županc, T.; Ballinger, M.J.; Bente, D.A.; Beer, M.; Bergeron, É.; Blair, C.D.; et al. Taxonomy of the Order Bunyavirales: Update 2019. Arch. Virol. 2019, 164, 1949–1965. [Google Scholar] [CrossRef] [PubMed]
  14. Vapalahti, O.; Mustonen, J.; Lundkvist, A.; Henttonen, H.; Plyusnin, A.; Vaheri, A. Hantavirus Infections in Europe. Lancet Infect. Dis. 2003, 3, 653–661. [Google Scholar] [CrossRef] [PubMed]
  15. Vaheri, A.; Henttonen, H.; Voutilainen, L.; Mustonen, J.; Sironen, T.; Vapalahti, O. Hantavirus Infections in Europe and Their Impact on Public Health. Rev. Med. Virol. 2013, 23, 35–49. [Google Scholar] [CrossRef] [PubMed]
  16. Tariq, M.; Kim, D.M. Hemorrhagic Fever with Renal Syndrome: Literature Review, Epidemiology, Clinical Picture and Pathogenesis. Infect. Chemother. 2022, 54, 1–19. [Google Scholar] [CrossRef] [PubMed]
  17. Klena, J.D.; Chiang, C.F.; Whitmer, S.M.; Wang, Y.F.; Shieh, W.J. Hantaviruses, 12th ed.; American Society of Microbiology: Washington, DC, USA, 2023. [Google Scholar]
  18. Hjelle, B.; Torres-Pérez, F. Hantaviruses in the Americas and Their Role as Emerging Pathogens. Viruses 2010, 2, 2559–2586. [Google Scholar] [CrossRef] [PubMed]
  19. Thorp, L.; Fullerton, L.; Whitesell, A.; Dehority, W. Hantavirus Pulmonary Syndrome: 1993–2018. Pediatrics 2023, 151, e2022059352. [Google Scholar] [CrossRef] [PubMed]
  20. Chizhikov, V.E.; Spiropoulou, C.F.; Morzunov, S.P.; Monroe, M.C.; Peters, C.J.; Nichol, S.T. Complete Genetic Characterization and Analysis of Isolation of Sin Nombre Virus. J. Virol. 1995, 69, 8132–8136. [Google Scholar] [CrossRef]
  21. Huang, C.; Campbell, W.P.; Means, R.; Ackman, D.M. Hantavirus S RNA Sequence from a Fatal Case of HPS in New York. J. Med. Virol. 1996, 50, 5–8. [Google Scholar] [CrossRef]
  22. Bagamian, K.H.; Towner, J.S.; Mills, J.N.; Kuenzi, A.J. Increased Detection of Sin Nombre Hantavirus RNA in Antibody-Positive Deer Mice from Montana, USA: Evidence of Male Bias in RNA Viremia. Viruses 2013, 5, 2320–2328. [Google Scholar] [CrossRef] [PubMed]
  23. Kjemtrup, A.M.; Messenger, S.; Meza, A.M.; Feiszli, T.; Yoshimizu, M.H.; Padgett, K.; Singh, S. New Exposure Location for Hantavirus Pulmonary Syndrome Case, California, USA, 2018. Emerg. Infect. Dis. 2019, 25, 1962–1964. [Google Scholar] [CrossRef] [PubMed]
  24. Martinez, V.P.; Bellomo, C.M.; Cacace, M.L.; Suarez, P.; Bogni, L.; Padula, P.J. Hantavirus Pulmonary Syndrome in Argentina, 1995–2008. Emerg. Infect. Dis. 2010, 16, 1853–1860. [Google Scholar] [CrossRef] [PubMed]
  25. Iglesias, A.A.; Bellomo, C.M.; Martínez, V.P. Hantavirus Pulmonary Syndrome in Buenos Aires, 2009–2014. Medicina 2016, 76, 1–9. [Google Scholar] [PubMed]
  26. Rhodes, L.V., 3rd; Huang, C.; Sanchez, A.J.; Nichol, S.T.; Zaki, S.R.; Ksiazek, T.G.; Humphreys, J.G.; Freeman, J.J.; Knecht, K.R. Hantavirus Pulmonary Syndrome Associated with Monongahela Virus, Pennsylvania. Emerg. Infect. Dis. 2000, 6, 616–621. [Google Scholar] [CrossRef] [PubMed]
  27. Albariño, C.G.; Wiggleton Guerrero, L.; Chakrabarti, A.K.; Rollin, P.E.; Nichol, S.T. Complete Genome Sequences of Monongahela Hantavirus from Pennsylvania, USA. Microbiol. Resour. Announc. 2018, 7, e00928-18. [Google Scholar] [CrossRef] [PubMed]
  28. Fernando, R.; Capone, D.; Elrich, S.; Mantovani, R.; Quarles, L., 3rd; D’Amato, A.; Lowe, N.; Malhotra, A.; Khoo, T.; Zufan, S.; et al. Infection with New York Orthohantavirus and Associated Respiratory Failure and Multiple Cerebral Complications. Emerg. Infect. Dis. 2019, 25, 1241–1243. [Google Scholar] [CrossRef] [PubMed]
  29. Doyle, T.J.; Bryan, R.T.; Peters, C.J. Viral Hemorrhagic Fevers and Hantavirus Infections in the Americas. Infect. Dis. Clin. North Am. 1998, 12, 95–110. [Google Scholar] [CrossRef]
  30. Witkowski, P.T.; Klempa, B.; Ithete, N.L.; Auste, B.; Mfune, J.K.; Hoveka, J.; Matthee, S.; Preiser, W.; Kruger, D.H. Hantaviruses in Africa. Virus Res. 2014, 187, 34–42. [Google Scholar] [CrossRef]
  31. Reported Cases of Hantavirus Disease. Available online: https://www.cdc.gov/hantavirus/surveillance/index.html (accessed on 25 April 2023).
  32. Whitmer, S.L.M.; Whitesell, A.; Mobley, M.; Talundzic, E.; Shedroff, E.; Cossaboom, C.M.; Messenger, S.; Deldari, M.; Bhatnagar, J.; Estetter, L.; et al. Human Orthohantavirus Disease Prevalence and Genotype Distribution in the U.S., 2008–2020. Lancet ID, 2023; in submission. [Google Scholar]
  33. Goodfellow, S.M.; Nofchissey, R.A.; Schwalm, K.C.; Cook, J.A.; Dunnum, J.L.; Guo, Y.; Ye, C.; Mertz, G.J.; Chandran, K.; Harkins, M.; et al. Tracing Transmission of Sin Nombre Virus and Discovery of Infection in Multiple Rodent Species. J. Virol. 2021, 95, e0153421. [Google Scholar] [CrossRef] [PubMed]
  34. Knust, B.; Brown, S.; de St. Maurice, A.; Whitmer, S.; Koske, S.E.; Ervin, E.; Patel, K.; Graziano, J.; Morales-Betoulle, M.E.; House, J.; et al. Seoul Virus Infection and Spread in United States Home-Based Ratteries: Rat and Human Testing Results from a Multistate Outbreak Investigation. J. Infect. Dis. 2020, 222, 1311–1319. [Google Scholar] [PubMed]
  35. Hecht, G.; Dale, A.P.; Ruberto, I.; Adame, G.; Close, R.; Snyder, S.J.; Pink, K.; Lemmon, N.; Rudolfo, J.; Madsen, M.; et al. Detection of Hantavirus During the COVID-19 Pandemic, Arizona, USA, 2020. Emerg. Infect. Dis. 2023, 29, 1663–1667. [Google Scholar] [CrossRef] [PubMed]
  36. Kim, W.K.; No, J.S.; Lee, D.; Jung, J.; Park, H.; Yi, Y.; Kim, J.A.; Lee, S.H.; Kim, Y.; Park, S.; et al. Active Targeted Surveillance to Identify Sites of Emergence of Hantavirus. Clin. Infect. Dis. 2020, 70, 464–473. [Google Scholar] [CrossRef] [PubMed]
  37. Kim, W.K.; Cho, S.; Lee, S.H.; No, J.S.; Lee, G.Y.; Park, K.; Lee, D.; Jeong, S.T.; Song, J.W. Genomic Epidemiology and Active Surveillance to Investigate Outbreaks of Hantaviruses. Front. Cell. Infect. Microbiol. 2020, 10, 532388. [Google Scholar] [CrossRef] [PubMed]
  38. Campbell, E.M.; Boyles, A.; Shankar, A.; Kim, J.; Knyazev, S.; Cintron, R.; Switzer, W.M. Microbetrace: Retooling Molecular Epidemiology for Rapid Public Health Response. PLoS Comput. Biol. 2021, 17, e1009300. [Google Scholar] [CrossRef] [PubMed]
  39. Bradley, H.; Hogan, V.; Agnew-Brune, C.; Armstrong, J.; Broussard, D.; Buchacz, K.; Burton, K.; Cope, S.; Dawson, E.; De La Garza, G.; et al. Increased Hiv Diagnoses in West Virginia Counties Highly Vulnerable to Rapid Hiv Dissemination through Injection Drug Use: A cautionary Tale. Ann. Epidemiol. 2019, 34, 12–17. [Google Scholar] [CrossRef] [PubMed]
  40. Campbell, E.M.; Patala, A.; Shankar, A.; Li, J.F.; Johnson, J.A.; Westheimer, E.; Gay, C.L.; Cohen, S.E.; Switzer, W.M.; Peters, P.J. Phylodynamic Analysis Complements Partner Services by Identifying Acute and Unreported Hiv Transmission. Viruses 2020, 12, 145. [Google Scholar] [CrossRef]
  41. Brenner, B.G.; Ibanescu, R.I.; Osman, N.; Cuadra-Foy, E.; Oliveira, M.; Chaillon, A.; Stephens, D.; Hardy, I.; Routy, J.P.; Thomas, R.; et al. The Role of Phylogenetics in Unravelling Patterns of Hiv Transmission Towards Epidemic Control: The Quebec Experience (2002–2020). Viruses 2021, 13, 1643. [Google Scholar] [CrossRef]
  42. Rich, S.N.; Prosperi, M.C.F.; Dellicour, S.; Vrancken, B.; Cook, R.L.; Spencer, E.C.; Salemi, M.; Mavian, C. Molecular Epidemiology of Hiv-1 Subtype B Infection across Florida Reveals Few Large Superclusters with Metropolitan Origin. Microbiol. Spectr. 2022, 10, e0188922. [Google Scholar] [CrossRef]
  43. Alexiev, I.; Shankar, A.; Pan, Y.; Grigorova, L.; Partsuneva, A.; Dimitrova, R.; Gancheva, A.; Kostadinova, A.; Elenkov, I.; Yancheva, N.; et al. Transmitted Hiv Drug Resistance in Bulgaria Occurs in Clusters of Individuals from Different Transmission Groups and Various Subtypes (2012–2020). Viruses 2023, 15, 941. [Google Scholar] [CrossRef] [PubMed]
  44. Longmire, A.G.; Sims, S.; Rytsareva, I.; Campo, D.S.; Skums, P.; Dimitrova, Z.; Ramachandran, S.; Medrzycki, M.; Thai, H.; Ganova-Raeva, L.; et al. Ghost: Global Hepatitis Outbreak and Surveillance Technology. BMC Genom. 2017, 18 (Suppl. 10), 916. [Google Scholar] [CrossRef] [PubMed]
  45. Rao, J.R.; Rafique, S.; Ali, A.; Rehman, G.; Ilyas, M.; Idrees, M. Next-Generation Sequencing Studies on the E1-Hvr1 Region of Hepatitis C Virus (Hcv) from Non-High-Risk Hcv Patients Living in Punjab and Khyber Pakhtunkhwa, Pakistan. Arch. Virol. 2021, 166, 3049–3059. [Google Scholar] [CrossRef] [PubMed]
  46. Labuda, S.M.; McDaniel, C.J.; Talwar, A.; Braumuller, A.; Parker, S.; McGaha, S.; Blissett, C.; Wortham, J.M.; Mukasa, L.; Stewart, R.J. Tuberculosis Outbreak Associated with Delayed Diagnosis and Long Infectious Periods in Rural Arkansas, 2010–2018. Public Health Rep. 2022, 137, 94–101. [Google Scholar] [CrossRef] [PubMed]
  47. Houwaart, T.; Belhaj, S.; Tawalbeh, E.; Nagels, D.; Fröhlich, Y.; Finzer, P.; Ciruela, P.; Sabrià, A.; Herrero, M.; Andrés, C.; et al. Integrated Genomic Surveillance Enables Tracing of Person-to-Person SARS-CoV-2 Transmission Chains During Community Transmission and Reveals Extensive Onward Transmission of Travel-Imported Infections, Germany, June to July 2021. Eurosurveillance 2022, 27, 2101089. [Google Scholar] [CrossRef] [PubMed]
  48. Yaglom, H.D.; Gebhardt, M.; Pfeiffer, A.; Ormsby, M.E.; Jasso-Selles, D.E.; Lemmer, D.; Folkerts, M.L.; French, C.; Maurer, M.; Bowers, J.R.; et al. Applying Genomic Epidemiology to Characterize a COVID-19 Outbreak in a Developmentally Disabled Adult Group Home Setting, Arizona. Front. Public Health 2021, 9, 668214. [Google Scholar] [CrossRef] [PubMed]
  49. Jayroe, M.; Aguilar, D.R.; Porter, A.; Cima, M.; Chai, S.; Hayman, K. Transmission Analysis of COVID-19 Outbreaks Associated with Places of Worship, Arkansas, May 2020–December 2020. J. Relig. Health 2023, 62, 650–661. [Google Scholar] [CrossRef] [PubMed]
  50. Katoh, K.; Misawa, K.; Kuma, K.; Miyata, T. Mafft: A Novel Method for Rapid Multiple Sequence Alignment Based on Fast Fourier Transform. Nucleic Acids Res. 2002, 30, 3059–3066. [Google Scholar] [CrossRef]
  51. Katoh, K.; Rozewicki, J.; Yamada, K.D. Mafft Online Service: Multiple Sequence Alignment, Interactive Sequence Choice and Visualization. Brief. Bioinform. 2017, 20, 1160–1166. [Google Scholar] [CrossRef]
  52. Hall, T.A. Bioedit: A User-Friendly Biological Sequence Alignment Editor and Analysis Program for Windows 95/98/Nt. Nucleic Acids Symp. Ser. 1999, 41, 95–98. [Google Scholar]
  53. Tamura, K.; Nei, M. Estimation of the Number of Nucleotide Substitutions in the Control Region of Mitochondrial DNA in Humans and Chimpanzees. Mol. Biol. Evol. 1993, 10, 512–526. [Google Scholar] [PubMed]
  54. Seqruler: Gui for Calculating TN93 between Set of Sequences. Available online: https://github.com/Sergey-Knyazev/SeqRuler (accessed on 3 February 2023).
  55. Seqruler: Gui for Calculating TN93 or SNP Distances for a Set of Sequences. Available online: https://github.com/CDCGov/SeqRuler (accessed on 25 October 2023).
  56. Monroe, M.C.; Morzunov, S.P.; Johnson, A.M.; Bowen, M.D.; Artsob, H.; Yates, T.; Peters, C.J.; Rollin, P.E.; Ksiazek, T.G.; Nichol, S.T. Genetic Diversity and Distribution of Peromyscus-Borne Hantaviruses in North America. Emerg. Infect. Dis. 1999, 5, 75–86. [Google Scholar] [CrossRef] [PubMed]
  57. Ramsden, C.; Holmes, E.C.; Charleston, M.A. Hantavirus Evolution in Relation to Its Rodent and Insectivore Hosts: No Evidence for Codivergence. Mol. Biol. Evol. 2009, 26, 143–153. [Google Scholar] [CrossRef] [PubMed]
  58. Tidytree: Uncompromisingly Flexible Phylogenetic Trees in Javascript. Available online: https://github.com/CDCgov/TidyTree (accessed on 25 October 2023).
  59. Kumar, S.; Tamura, K.; Nei, M. Mega: Molecular Evolutionary Genetics Analysis Software for Microcomputers. Comput. Appl. Biosci. 1994, 10, 189–191. [Google Scholar] [CrossRef] [PubMed]
  60. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. Mega X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef] [PubMed]
  61. Fourment, M.; Gibbs, M.J. Patristic: A Program for Calculating Patristic Distances and Graphically Comparing the Components of Genetic Change. BMC Evol. Biol. 2006, 6, 1. [Google Scholar] [CrossRef] [PubMed]
  62. Patristic: A Phylogenetics Toolkit for Javascript. Available online: https://github.com/CDCgov/patristic (accessed on 25 October 2023).
  63. Kruskal, J.B. On the Shortest Spanning Subtree of a Graph and the Traveling Salesman Problem. Proc. Am. Math. Soc. 1956, 7, 48–50. [Google Scholar] [CrossRef]
  64. Emst: Epsilon Minimal Spanning Trees. Available online: https://github.com/Sergey-Knyazev/eMST (accessed on 3 February 2023).
  65. Hadfield, J.; Megill, C.; Bell, S.M.; Huddleston, J.; Potter, B.; Callender, C.; Sagulenko, P.; Bedford, T.; Neher, R.A. Nextstrain: Real-Time Tracking of Pathogen Evolution. Bioinformatics 2018, 34, 4121–4123. [Google Scholar] [CrossRef]
  66. Cintron, R.; Gigante, C.M.; Li, Y.; Kelly, R.; Caravas, J.; Switzer, W.M. Implementation of Nextstrain and Microbetrace Pipeline for Mpox Genomic Surveillance and Rapid Public Health Action. In Proceedings of the 2023 APHL/CDC Advanced Molecular Detection (AMD) Days, Atlanta, GA, USA, 12–14 September 2023. [Google Scholar]
  67. Van Iersel, L.; Kelk, S.; Rupp, R.; Huson, D. Phylogenetic Networks Do Not Need to Be Complex: Using Fewer Reticulations to Represent Conflicting Clusters. Bioinformatics 2010, 26, i124–i131. [Google Scholar] [CrossRef]
  68. Mardulyn, P. Trees and/or Networks to Display Intraspecific DNA Sequence Variation? Mol. Ecol. 2012, 21, 3385–3390. [Google Scholar] [CrossRef]
  69. Schliep, K.P.; Potts, A.J.; Morrison, D.A.; Grimm, G.W. Intertwining Phylogenetic Trees and Networks. Methods Ecol. Evol. 2017, 8, 1212–1220. [Google Scholar] [CrossRef]
  70. Blair, C.; Ané, C. Phylogenetic Trees and Networks Can Serve as Powerful and Complementary Approaches for Analysis of Genomic Data. Syst. Biol. 2020, 69, 593–601. [Google Scholar] [CrossRef] [PubMed]
  71. Vijaykrishna, D.; Mukerji, R.; Smith, G.J. RNA Virus Reassortment: An Evolutionary Mechanism for Host Jumps and Immune Evasion. PLoS Pathog. 2015, 11, e1004902. [Google Scholar] [CrossRef] [PubMed]
  72. McDonald, S.M.; Nelson, M.I.; Turner, P.E.; Patton, J.T. Reassortment in Segmented RNA Viruses: Mechanisms and Outcomes. Nat. Rev. Microbiol. 2016, 14, 448–460. [Google Scholar] [CrossRef] [PubMed]
  73. Klempa, B. Reassortment Events in the Evolution of Hantaviruses. Virus Genes. 2018, 54, 638–646. [Google Scholar] [CrossRef] [PubMed]
  74. Varsani, A.; Lefeuvre, P.; Roumagnac, P.; Martin, D. Notes on Recombination and Reassortment in Multipartite/Segmented Viruses. Curr. Opin. Virol. 2018, 33, 156–166. [Google Scholar] [CrossRef] [PubMed]
  75. Barrat-Charlaix, P.; Vaughan, T.G.; Neher, R.A. Treeknit: Inferring Ancestral Reassortment Graphs of Influenza Viruses. PLoS Comput. Biol. 2022, 18, e1010394. [Google Scholar] [CrossRef]
  76. Seqcombo. Available online: https://github.com/YuLab-SMU/seqcombo (accessed on 21 March 2023).
  77. Rasmussen, D.A.; Grünwald, N.J. Phylogeographic Approaches to Characterize the Emergence of Plant Pathogens. Phytopathology 2021, 111, 68–77. [Google Scholar] [CrossRef]
  78. Drummond, A.J.; Rambaut, A. Beast: Bayesian Evolutionary Analysis by Sampling Trees. BMC Evol. Biol. 2007, 7, 214. [Google Scholar] [CrossRef]
  79. Bouckaert, R.A.-O.; Vaughan, T.G.; Barido-Sottani, J.A.-O.; Duchêne, S.A.-O.; Fourment, M.A.-O.; Gavryushkina, A.; Heled, J.A.-O.; Jones, G.A.-O.; Kühnert, D.A.-O.X.; De Maio, N.A.-O.; et al. Beast 2.5: An Advanced Software Platform for Bayesian Evolutionary Analysis. PLoS Comput. Biol. 2019, 15, e1006650. [Google Scholar] [CrossRef]
  80. Fisher, A.A.-O.; Hassler, G.W.; Ji, X.; Baele, G.; Suchard, M.A.; Lemey, P. Scalable Bayesian Phylogenetics. Philos. Trans. R. Soc. Lond. B Biol. Sci. 2022, 377, 20210242. [Google Scholar] [CrossRef]
  81. Nguyen, L.T.; Schmidt, H.A.; von Haeseler, A.; Minh, B.Q. Iq-Tree: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies. Mol. Biol. Evol. 2015, 32, 268–274. [Google Scholar] [CrossRef]
  82. Sagulenko, P.; Puller, V.; Neher, R.A. Treetime: Maximum-Likelihood Phylodynamic Analysis. Virus Evol. 2018, 4, vex042. [Google Scholar] [CrossRef]
  83. Treetime. Available online: https://github.com/neherlab/treetime (accessed on 21 March 2023).
Figure 1. Building HantaNet reference modules for the small (S), medium (M) and large (L) gene segments. Sequence alignments and standardized metadata were integrated from a collection of hantavirus genomes available from GenBank and curated at the Bacterial and Viral Bioinformatics Resource Center (BV-BRC). The hantavirus S, M and L nucleotide sequences were aligned as described in the materials section. The reference alignments and metadata files were loaded and integrated as separate gene segment modules into MicrobeTrace to build the HantaNet tool for hantavirus classification, epidemiology and outbreak detection. Users can download the hantavirus reference datasets and use them to standardize their own laboratory and epidemiological data for visualization and analysis in HantaNet.
Figure 1. Building HantaNet reference modules for the small (S), medium (M) and large (L) gene segments. Sequence alignments and standardized metadata were integrated from a collection of hantavirus genomes available from GenBank and curated at the Bacterial and Viral Bioinformatics Resource Center (BV-BRC). The hantavirus S, M and L nucleotide sequences were aligned as described in the materials section. The reference alignments and metadata files were loaded and integrated as separate gene segment modules into MicrobeTrace to build the HantaNet tool for hantavirus classification, epidemiology and outbreak detection. Users can download the hantavirus reference datasets and use them to standardize their own laboratory and epidemiological data for visualization and analysis in HantaNet.
Viruses 15 02208 g001
Figure 2. Example of a HantaNet dashboard built with the 2D network and unrooted phylogenetic tree views in the medium (M, glycoprotein) reference gene segment module. After loading the curated US hantavirus genomic surveillance data into HantaNet, the TN93 nucleotide substitution model was applied to generate a 2D network (left). The vertical grey line in the phylogenetic tree view (right) represents the 0.2 substitutions per site genetic distance cutoff used to obtain hantavirus strain-specific clusters (n = 10; nine clusters of two or more sequences and one singleton). Nodes represent individual sequences and were colored based on hantavirus classification, as shown in the key table next to the phylogenetic tree (right) (see Table 1 for abbreviations). Nodes were shaped by host: animal (square), human (star) and lab or viral isolate (addition sign).
Figure 2. Example of a HantaNet dashboard built with the 2D network and unrooted phylogenetic tree views in the medium (M, glycoprotein) reference gene segment module. After loading the curated US hantavirus genomic surveillance data into HantaNet, the TN93 nucleotide substitution model was applied to generate a 2D network (left). The vertical grey line in the phylogenetic tree view (right) represents the 0.2 substitutions per site genetic distance cutoff used to obtain hantavirus strain-specific clusters (n = 10; nine clusters of two or more sequences and one singleton). Nodes represent individual sequences and were colored based on hantavirus classification, as shown in the key table next to the phylogenetic tree (right) (see Table 1 for abbreviations). Nodes were shaped by host: animal (square), human (star) and lab or viral isolate (addition sign).
Viruses 15 02208 g002
Figure 3. HantaNet can connect genetic networks into minimum spanning trees (MSTs) and geographic locations. (A) Comparison of the 2D network view generated with TN93 at a threshold of 0.127 nucleotide substitutions per site (top,left) and after applying the nearest neighbor (NN) algorithm (right). We analyzed 21 M gene segments of SNV collected in the US from 1999 to 2021. Two main groups were observed, SNV1 (purple polygon) and SNV2 (green polygon), consistent with findings from Whitmer et al. [32]. Reference SNV1 and SNV2 sequence nodes are labeled with the prefix R, SNV or CCV in the network. The yellow nodes represent new sequences that did not cluster with SNV1 or SNV2 and may represent novel strains. The NN algorithm was applied to find the MST or shortest genetic distance path in the network, which prunes more distant links between nodes, while retaining the connection between the two closest nodes. The genetic link width was pre-set to be proportional to the TN93 distance. (B) A dashboard built with the NN network (left) and map (right) views. The NN network was overlayed on the map to visualize virus geographic location and spread. The selected yellow nodes with orange borders (sequences 1, 2 and 4) represent examples of new sequences of interest (e.g., outbreak, genomic surveillance, new variants, etc.) to show their distribution in the map. Two reference SNV2 sequences (6 and 19) were also selected and displayed in the map (green nodes with orange borders). Individual sequences with associated state- and county-level metadata are shown in the map as small nodes with black borders. Three clusters of sequences were observed in the states of California (n = 2), New Mexico (n = 4) and Montana (n = 18), and are represented by the green or orange circles, respectively.
Figure 3. HantaNet can connect genetic networks into minimum spanning trees (MSTs) and geographic locations. (A) Comparison of the 2D network view generated with TN93 at a threshold of 0.127 nucleotide substitutions per site (top,left) and after applying the nearest neighbor (NN) algorithm (right). We analyzed 21 M gene segments of SNV collected in the US from 1999 to 2021. Two main groups were observed, SNV1 (purple polygon) and SNV2 (green polygon), consistent with findings from Whitmer et al. [32]. Reference SNV1 and SNV2 sequence nodes are labeled with the prefix R, SNV or CCV in the network. The yellow nodes represent new sequences that did not cluster with SNV1 or SNV2 and may represent novel strains. The NN algorithm was applied to find the MST or shortest genetic distance path in the network, which prunes more distant links between nodes, while retaining the connection between the two closest nodes. The genetic link width was pre-set to be proportional to the TN93 distance. (B) A dashboard built with the NN network (left) and map (right) views. The NN network was overlayed on the map to visualize virus geographic location and spread. The selected yellow nodes with orange borders (sequences 1, 2 and 4) represent examples of new sequences of interest (e.g., outbreak, genomic surveillance, new variants, etc.) to show their distribution in the map. Two reference SNV2 sequences (6 and 19) were also selected and displayed in the map (green nodes with orange borders). Individual sequences with associated state- and county-level metadata are shown in the map as small nodes with black borders. Three clusters of sequences were observed in the states of California (n = 2), New Mexico (n = 4) and Montana (n = 18), and are represented by the green or orange circles, respectively.
Viruses 15 02208 g003
Figure 4. Transmission network of the 2018–2019 Andes virus (ANDV) outbreak in Argentina using available S gene sequences and contact tracing data [6] inferred with HantaNet, using a genetic distance cutoff of zero nucleotide substitutions per site (352 genetic links are hidden). The nodes were customized with human icons to represent patients and sized by the number of secondary transmissions. Blue lines show epidemiologic links, which display directional arrows where confirmed transmission events occurred. Dashed lines show both genetic and epidemiologic linkage providing stronger support for the transmission event. The index case was patient ARG/1, shown on the upper left of the network. Virus collection dates are shown with gray vertical lines in the network layout.
Figure 4. Transmission network of the 2018–2019 Andes virus (ANDV) outbreak in Argentina using available S gene sequences and contact tracing data [6] inferred with HantaNet, using a genetic distance cutoff of zero nucleotide substitutions per site (352 genetic links are hidden). The nodes were customized with human icons to represent patients and sized by the number of secondary transmissions. Blue lines show epidemiologic links, which display directional arrows where confirmed transmission events occurred. Dashed lines show both genetic and epidemiologic linkage providing stronger support for the transmission event. The index case was patient ARG/1, shown on the upper left of the network. Virus collection dates are shown with gray vertical lines in the network layout.
Viruses 15 02208 g004
Figure 5. Transmission network of Seoul virus (SEOV)-infected rats in home-based ratteries, local pet stores and homes during the 2017 multistate outbreak in the United States (US) and Canada [34]. Public SEOV sequences and confirmed facility data (laboratory-confirmed SEOV infections in people or rats) were loaded into HantaNet to build the transmission network visualization. Nodes were colored by location, represented by circles with facility numbers (US, n = 31) or letters (Canada, n = 2). Polygons represent clusters of facilities in US states with the greatest number of confirmed cases (Illinois, purple; Wisconsin, green; Utah, blue). Nodes were labeled with the sample collection dates (M/D/YY) and represent SEOV sequence availability. Links show unidirectional or bidirectional arrows, representing confirmed (black links) or unconfirmed (orange links) movement of rats between facilities. A total of 17 human cases were identified at the following US facilities: 26 (purple node), 30 (orange node), and 31 (blue node) in Illinois, Tennessee and Utah, respectively. Node X (purple circle) shows that a facility in Illinois had an infected pet rat detected after the outbreak in June 2018.
Figure 5. Transmission network of Seoul virus (SEOV)-infected rats in home-based ratteries, local pet stores and homes during the 2017 multistate outbreak in the United States (US) and Canada [34]. Public SEOV sequences and confirmed facility data (laboratory-confirmed SEOV infections in people or rats) were loaded into HantaNet to build the transmission network visualization. Nodes were colored by location, represented by circles with facility numbers (US, n = 31) or letters (Canada, n = 2). Polygons represent clusters of facilities in US states with the greatest number of confirmed cases (Illinois, purple; Wisconsin, green; Utah, blue). Nodes were labeled with the sample collection dates (M/D/YY) and represent SEOV sequence availability. Links show unidirectional or bidirectional arrows, representing confirmed (black links) or unconfirmed (orange links) movement of rats between facilities. A total of 17 human cases were identified at the following US facilities: 26 (purple node), 30 (orange node), and 31 (blue node) in Illinois, Tennessee and Utah, respectively. Node X (purple circle) shows that a facility in Illinois had an infected pet rat detected after the outbreak in June 2018.
Viruses 15 02208 g005
Table 1. Hantavirus reference sequences used to build the HantaNet gene segment modules.
Table 1. Hantavirus reference sequences used to build the HantaNet gene segment modules.
Classification aVirus Name bVirus ID cCollection Date dHost eIsolation Country fState gSegment hAccession ID i
NDVAndes virusANDV_00011997Rice ratChileUnknownS
M
L
AF291702 AF291703 AF291704
ANDV_00022016HumanSwitzerlandUnknownS
M
L
KY659432 KY604962 KY659431
BAYVBayou virusBAYV_00011993HumanUnited States of AmericaLouisianaS
M
L36929 L36930
BAYV_0002 Viral isolate S
M
L
GQ200820 GQ244521 GQ244526
BAYV_0003 Viral isolate S
L
NC_038298 NC_038299
BCCVBlack Creek Canal virusBCCV_00011994Cotton ratUnited States of AmericaFloridaS
M
L39949 L39950
BCCV_0002 Viral isolate SNC_043075
BRVBlue River virusBRV_00012005Deer miceUnited States of AmericaMissouriSDQ090888
BRV_00022005Deer miceUnited States of AmericaMissouriSDQ090890
BRV_00031997Deer miceUnited States of AmericaIndianaMAF030551
BRV_00041997Deer miceUnited States of AmericaOklahomaMAF030552
ELMCVEl Moro Canyon virusELMCV_00011994Harvest miceUnited States of AmericaCaliforniaS
M
U11427 U26828
ELMCV_0002 Viral isolate S
M
NC_038423 NC_038424
ILVIsla Vista virusILV_00011994VoleUnited States of AmericaCaliforniaSU19302
ILV_00021995VoleUnited States of AmericaCaliforniaSU31534
ILV_00031995Deer miceUnited States of AmericaCaliforniaSU31535
MGLVMonongahela virusMGLV_00011985Deer miceUnited States of AmericaWest VirginiaSU32591
MGLV_00021997HumanUnited States of AmericaPennsylvaniaS
M
L
MH539867 MH539866 MH539865
MULVMuleshoe virusMULV_00011995Cotton ratUnited States of AmericaTexasSU54575
MULV_00021999Cotton ratUnited States of AmericaTexasSKX066124
NYVNew York virusNYV_00011994Deer miceUnited States of AmericaNew YorkS
M
L
MG717391 MG717392 MG717393
NYV_00021994HumanUnited States of AmericaRhode IslandS
M
U09488 U36801
PHVProspect Hill virusPHV_0001 Viral isolate S
M
L
NC_038938 NC_038940 NC_038939
SEOVSeoul virusSEOV_00011993RatSouth KoreaUnknownS
M
L
NC_005236 S47716 X56492
SEOV_00022013RatUnited States of AmericaNew YorkS
M
KJ950866 KJ950862
SEOV_00032002RatUnited States of AmericaMarylandS
M
L
KT897726 KT897725 KT897724
SEOV_0004 Viral isolate S
M
L
KU204960 KU204959 KU204958
SEOV_00052013RatUnited States of AmericaNew YorkS
M
KJ950867 KJ950863
SEOV_00062013RatUnited States of AmericaNew YorkS
M
KJ950868 KJ950864
SEOV_00072013RatUnited States of AmericaNew YorkS
M
KJ950869 KJ950865
SNVSin Nombre virusSNV_00011993HumanUnited States of AmericaNew MexicoS
M
L
NC_005216 NC_005215 L37901
SNV_00021993Deer miceUnited States of AmericaNew MexicoS
M
L
L37904 L37903 L37902
SNV_00032009Deer miceUnited States of AmericaMontanaS
M
JQ690281 JQ690283
SNV_00041993HumanUnited States of AmericaNew MexicoS
M
L25784 L25783
SNV_0005 Viral isolate S
M
L
KF537003 KF537002 KF537001
SNV_0006 Viral isolate S
M
L
KF537006 KF537005 KF537004
SNV_00072008Deer miceUnited States of AmericaMontanaS
M
JQ690276 JQ690279
SNV_00082008Deer miceUnited States of AmericaMontanaS
M
JQ690277 JQ690280
SNV_00092009Deer miceUnited States of AmericaMontanaS
M
JQ690282 JQ690284
Convict Creek virusCCV_00011993Deer miceUnited States of AmericaCaliforniaS
M
L
L33683 L33474 AF425256
CCV_00021995Deer miceUnited States of AmericaCaliforniaS
M
L33816 L33684
a Hantavirus strain abbreviation. b Virus strain name entered as standard full name. c Unique virus or specimen identifier assigned manually. d Collection dates of original viruses are not included for viral isolates. e Common name of the natural host of the organism from which the sample was obtained. Viral isolates include viruses propagated in cell culture or laboratory animals, and NCBI curated non-redundant sequences (RefSeq). f Country where virus specimen was collected. Country is not included for viral isolates. g State or province where virus specimen was collected. State is not included for viral isolates. h Refers to Small (S), Medium (M) or Large (L) gene segments. i GenBank unique gene identifier.
Table 2. Sin Nombre virus (SNV) M segment pairwise genetic distances (substitutions/site) calculated using the TN93 nucleotide substitution model.
Table 2. Sin Nombre virus (SNV) M segment pairwise genetic distances (substitutions/site) calculated using the TN93 nucleotide substitution model.
SNV GroupPairwise Genetic Distance
SNV10–0.05714
SNV1-like0.07099–0.11967
SNV20.00029–0.06914
SNV2-like0.12026–0.12295
Other0.00693–0.03564
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cintron, R.; Whitmer, S.L.M.; Moscoso, E.; Campbell, E.M.; Kelly, R.; Talundzic, E.; Mobley, M.; Chiu, K.W.; Shedroff, E.; Shankar, A.; et al. HantaNet: A New MicrobeTrace Application for Hantavirus Classification, Genomic Surveillance, Epidemiology and Outbreak Investigations. Viruses 2023, 15, 2208. https://doi.org/10.3390/v15112208

AMA Style

Cintron R, Whitmer SLM, Moscoso E, Campbell EM, Kelly R, Talundzic E, Mobley M, Chiu KW, Shedroff E, Shankar A, et al. HantaNet: A New MicrobeTrace Application for Hantavirus Classification, Genomic Surveillance, Epidemiology and Outbreak Investigations. Viruses. 2023; 15(11):2208. https://doi.org/10.3390/v15112208

Chicago/Turabian Style

Cintron, Roxana, Shannon L. M. Whitmer, Evan Moscoso, Ellsworth M. Campbell, Reagan Kelly, Emir Talundzic, Melissa Mobley, Kuo Wei Chiu, Elizabeth Shedroff, Anupama Shankar, and et al. 2023. "HantaNet: A New MicrobeTrace Application for Hantavirus Classification, Genomic Surveillance, Epidemiology and Outbreak Investigations" Viruses 15, no. 11: 2208. https://doi.org/10.3390/v15112208

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop