Next Article in Journal
Viral Infections in Type 2 Diabetes: A Dangerous Liaison
Previous Article in Journal
“Big Events” and HIV Transmission Dynamics: Estimating Time Since HIV Infection from Deep Sequencing Data Among Sex Workers and Their Clients in Dnipro, Ukraine
Previous Article in Special Issue
Tracking the Spatial and Functional Dispersion of Vaccine-Related Canine Distemper Virus Genotypes: Insights from a Global Scoping Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Global Phylogenetic Analysis of the CDV Hemagglutinin Gene Reveals Positive Selection on Key Receptor-Binding Sites

by
Tuba Çiğdem Oğuzoğlu
1,*,† and
B. Taylan Koç
2,*,†
1
Department of Virology, Faculty of Veterinary Medicine, Ankara University, 06110 Ankara, Turkey
2
Department of Virology, Faculty of Veterinary Medicine, Aydın Adnan Menderes University, 09110 Aydın, Turkey
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Viruses 2025, 17(9), 1149; https://doi.org/10.3390/v17091149
Submission received: 17 June 2025 / Revised: 14 August 2025 / Accepted: 18 August 2025 / Published: 22 August 2025
(This article belongs to the Special Issue Canine Distemper Virus)

Abstract

Canine distemper virus (CDV) is a multi-host morbillivirus whose evolution and host-switching capacity are largely determined by its hemagglutinin (H) gene. To reconsider the molecular evolution of this critical gene, we performed comprehensive phylogenetic, selection, and structural analyses on a curated dataset of 68 representative global H gene sequences. Our phylogenetic reconstruction confirmed the segregation of sequences into distinct, geographically associated lineages. To provide stronger evidence for viral adaptation, we performed a site-specific selection analysis, which identified 15 amino acid sites in the H protein undergoing significant episodic positive selection. Crucially, the majority of the known SLAM and Nectin-4 receptor-binding residues were found to be among these positively selected sites. We further contextualized these findings by mapping the sites onto a 3D homology model of the H protein, which confirmed their location on the exposed surfaces of the receptor-binding domain. This compilation provides quantitative evidence that the key functional regions of the H protein are direct targets for adaptive evolution, which has significant implications for understanding host tropism and the ongoing challenge of vaccine mismatch.

1. Introduction

Canine distemper virus (CDV) is a significant pathogen responsible for severe, multisystemic infections in domestic dogs of all ages. This disease typically presents with gastrointestinal and respiratory signs; however, neurological signs can also develop as a distinct clinical manifestation. In adult dogs, a specific neurological syndrome associated with persistent CDV infection is termed “old dog encephalitis” [1,2,3,4].
CDV infects an extensive range of hosts. It extends beyond the dog family (Canidae) to include numerous other carnivore families such as bears (Ursidae), cats (Felidae), weasels (Mustelidae), and raccoons (Procyonidae), as well as marine mammals like seals (Phocidae) and sea lions (Otariidae). Notably, the virus has also caused severe disease in non-carnivores, including marmots and specific primates (rhesus and cynomolgus macaques). Extensive research demonstrates that CDV undergoes dynamic genetic processes, which explains its ability to infect an extensive range of hosts [5,6]. In this context, scientific investigations over the last few decades have predominantly focused on characterizing the molecular structure of this virus and tracking its genetic substitutions to better understand its adaptability [7,8,9,10].
CDV is a pathogen classified within the genus Mobillivirus of the family Paramyxoviridae, and it exhibits a close antigenic relationship and genetic similarity with other prominent members of its genus, including measles virus (MV), rinderpest virus (RPV), and peste des petits ruminants virus (PPRV). The viral genome is composed of a non-segmented, negative-sense, single-stranded ribonucleic acid (RNA) molecule, approximately 15 kb in length [11,12].
The genetic material of CDV directs the synthesis of eight proteins, six of which are structural proteins that each fulfill a particular function in the lifecycle of CDV and its capacity to cause infection. Notably, the nucleocapsid (N), hemagglutinin (H), and fusion (F) proteins, with N being helically structured within the viral envelope, represent key targets for the host’s immune defenses; the H and F proteins are also instrumental in the initial stages of infection, specifically viral attachment to and fusion with host cells [13]. The H and F proteins form the surface spikes of the virion’s envelope, and the H protein mediates binding to host cell receptors like SLAM (CD150) or Nectin 4 (PVRL4) [14].
The H gene is recognized for exhibiting the most significant genetic diversity among the CDV structural genes. Consequently, its protein product, the H protein, shows a level of amino acid variation that surpasses that of any other protein within the Morbillivirus genus, potentially accounting for CDV’s broader spectrum of susceptible host species compared to its viral relatives [15]. Alterations within the H gene sequence have been correlated with the virus’s ability to be transmitted between different animal species and with its degree of pathogenicity (Table 1). For example, amino acid substitutions at specific sites within the H protein, such as position 530 and a change from Tyrosine to Histidine at position 549, are associated with enhanced viral virulence [16,17].
This pronounced genetic heterogeneity makes the H gene a preferred marker for the molecular typing of CDV isolates and a valuable tool for examining the evolutionary connections among various strains [21]. CDV is known to form distinct lineages with specific geographic distributions. These lineages are defined and classified based on the phylogenetic clustering of their H gene sequences. The exact number of lineages can vary between studies, partly due to the genetic region analyzed; for instance, studies using the complete H gene have identified eighteen lineages, while other investigations that have focused on partial H gene sequences have documented as many as twenty-two. Although these lineages often show geographical preferences, some degree of co-circulation occurs [12,22]. Recently recognized lineages are typically designated by their geographical point of discovery (continent or subcontinent), often appended with a numerical identifier or a distinctive name like “Rockborn-like” CDV [12]. The array of currently known lineages includes America-1, America-2, North America-3, South America/North America-4, America-5, Canada-1, Canada- 2, Asia-1, Asia-2, Asia-3, Asia-4, Asia-5, Asia-6, Europe Wildlife, Arctic-like, Africa-1, Africa-2, Southern Africa, Europe-1/South America-1, South America-2, South America-3, and Rockborn-like [5,23,24,25,26,27,28]. The characterization and distinction of new lineages rely on variations in the DNA and protein sequences of the H gene. Furthermore, the considerable evolutionary divergence observed in certain CDV lineages from Africa and Asia [29,30] points to genetic drift within the H gene as a driving force for viral evolution and the emergence of new lineages [31]. This continuous accumulation of genetic changes allows the virus to explore new antigenic space and adapt to the diverse receptor landscapes of different carnivore hosts. Understanding this fundamental evolutionary mechanism is therefore critical for tracking the global spread of CDV and anticipating its potential to cross species barriers.
We therefore intended to compile and reanalyze publicly available CDV sequences using recently released bioinformatic tools in this review. Our objective was to reveal the molecular dynamics of certain residues in the H gene by contextualizing the results of phylogenetic and structural analyses.

2. Materials and Methods

2.1. Metadata Collection and Initial Processing

Complete H gene sequences of CDV were retrieved from the National Center for Biotechnology Information (NCBI) GenBank database. The full list of the 692 accession numbers used as the initial dataset is provided in Appendix A Table A1. Full GenBank records for these accessions were retrieved using NCBI’s Entrez Direct (E-utilities) efetch command.
A custom Bash script was employed to process these records. The primary inclusion criterion was a sequence length between 1700 and 1900 base pairs (bp), and a filter was designed to select complete or near-complete H gene coding sequences while excluding partial fragments.
This process generated an initial FASTA file of filtered H gene sequences and a corresponding metadata file of tab-separated values (TSVs) (Appendix A Table A2).

2.2. Representative Sequence Selection

To reduce dataset redundancy and computational load for downstream analyses, a representative set of sequences was selected from the file. This was performed using MMseqs (version 2.0) [32]. Sequences were clustered using the easy-cluster workflow with a sequence identity threshold of 0.98 (98%) and a coverage threshold of 0.8, using coverage mode 0 (coverage of query and target).
The longest sequence from each cluster was typically chosen by MMseqs as the representative, resulting in a FASTA file of 68 representative H gene sequences.

2.3. Multiple Sequence Alignment (MSA)

The 68 representative H gene nucleotide sequences were aligned using Clustal Omega (version 1.2) with default parameters [33]. The resulting alignment was saved in FASTA format.

2.4. Selection Pressure Analysis

To identify amino acid sites under positive selection, a codon-based analysis was performed on the multiple sequence alignment of the 68 representative H gene nucleotide sequences. We used the Mixed Effects Model of Evolution (MEME) method, implemented on the Datamonkey web server, which is part of the HyPhy suite [34]. MEME is designed to detect sites evolving under episodic positive selection. The analysis was run using the universal genetic code, and sites with a p-value 0.1 were considered to be under significant diversifying selection.

2.5. Homology Modeling and Structural Visualization

To contextualize the locations of positively selected sites, 3D homology models of the H protein were generated for one representative sequence from each of the 14 major lineages identified in our dataset. Protein sequences were submitted to the SWISS-MODEL web server for automated homology modeling [35]. The resulting PDB files were visualized, analyzed, and rendered using UCSF ChimeraX [36]. The mmaker command in ChimeraX was used to superimpose the 14 models to assess structural conservation and variation.

2.6. B-Cell Epitope Conservancy Analysis

To computationally assess the potential for antigenic differences between viral lineages, a linear B-cell epitope conservancy analysis was performed. The amino acid sequences of the H protein from representative strains of the vaccine-like America-1 lineage, the Arctic-like lineage, and the European lineage were submitted to the BepiPred-3.0 web server [37]. This tool predicts the location of linear B-cell epitopes based on protein sequences. The resulting epitope score profiles were compared to identify conserved and variable antigenic regions.

2.7. Phylogenetic Tree Inference

A Maximum Likelihood (ML) phylogenetic tree was constructed from the aligned representative H gene nucleotide sequences using IQ-TREE (version 3.0.1) [38]. The best-fit nucleotide substitution model was automatically selected by IQ-TREE using ModelFinder Plus (MFP). Branch support was assessed using 1000 replicates of the Ultrafast Bootstrap Approximation (UFBoot). The final ML tree with support values was output in Newick format.

2.8. Phylogenetic Network Analysis

To explore potential conflicting phylogenetic signals and reticulate evolutionary patterns, a phylogenetic network was constructed using the Neighbor-Net algorithm, as implemented in SplitsTree6 (version 6.0) [39]. The aligned representative H gene sequences were used as input, and distances were typically calculated using the p-distance method within SplitsTree.

2.9. Phylogenetic Tree Visualization and Annotation

The Maximum Likelihood phylogenetic tree was visualized and annotated using iTOL (Interactive Tree Of Life, v6, https://itol.embl.de/ (accessed on 27 May 2025) [40]. The tree was annotated with obtained metadata.

3. Results

3.1. Phylogenetic Relationships and Lineage Diversity of CDV H Genes

Our initial dataset comprised 692 complete H gene sequences from the GenBank database. Following a redundancy reduction step, a final set of 68 representative sequences was selected for analysis. To elucidate the evolutionary relationships among these sequences, an ML phylogenetic tree was constructed (Figure 1). The analysis revealed a clear and well-supported segregation of the sequences into distinct clades, which correspond to identified CDV lineages. The major clades identified in the dataset included the Europe/South America-1, Asia-1, Southern African, and North/South America-4 lineages, among others. The topology of the tree is characterized by deep, well-supported branches (Ultrafast Bootstrap > 95 % ) that separate the major lineages, which is indicative of substantial genetic divergence between them. Furthermore, the tree highlights the clustering of sequences based on both their geographic origin and, in several instances, their host species, providing a clear map of the global distribution and diversity of these viral lineages.

3.2. Variation at Key Receptor-Binding Sites of the Hemagglutinin Protein

To investigate the molecular basis of H gene variability, the translated protein sequences were aligned, and key amino acid positions known to be involved in binding to the SLAM and Nectin-4 host cell receptors were examined (Figure 1). Significant variation was observed at these critical sites across the different lineages. Notably, at position 530, which is a crucial determinant for SLAM receptor interaction, at least five different amino acids (aspartic acid, D; glycine, G; aspargine, N; serine, S; and histidine, H) were observed among the representative strains. Similarly, position 549, another key site for host tropism, showed variation between histidine (H) and tyrosine (Y). Other receptor-binding sites, including positions 478, 519, 528, and 537, also displayed amino acid substitutions between lineages. This molecular diversity at functional receptor-binding sites underscores the strong selective pressures acting on the H protein and provides a potential mechanism for the observed differences in host range and antigenicity among lineages.

3.3. Phylogenetic Network Analysis Reveals Conflicting Signals

To explore evolutionary relationships that may not be adequately represented by a strictly bifurcating tree, a phylogenetic network was constructed using the Neighbor-Net algorithm (Figure 2). The resulting network confirmed the major groupings identified in the ML tree, with distinct clusters of related sequences corresponding to the major lineages; however, the network also revealed significant reticulation, visible as dense, web-like structures, particularly in the central part of the network connecting the major lineages. This reticulation indicates the presence of conflicting phylogenetic signals in the dataset. Such signals can arise from processes such as rapid diversification, homoplasy (convergent evolution of amino acid states), or potential recombination events during the evolutionary history of the H gene. The network structure suggests that while the major lineages are clearly distinct, their deep evolutionary relationships are complex and cannot be perfectly resolved into a simple branching tree, highlighting the dynamic and nonlinear nature of CDV H gene evolution.

3.4. Identification of Positively Selected Sites in the H Gene

To provide quantitative evidence for adaptive evolution, a selection pressure analysis was performed to identify specific sites in the H gene that are evolving under positive selection. The MEME analysis identified 15 codon sites undergoing significant episodic diversifying selection (p value 0.1 ) (Table 2). A considerable finding was that five of these positively selected sites (478, 528, 530, 537, and 549) are known critical residues located within the SLAM and Nectin-4 receptor-binding domains of the H protein. This overlap provides strong evidence that these functionally important regions are key targets of ongoing adaptive evolution.

3.5. Structural Mapping of Positively Selected Sites

To visualize the location of the sites under positive selection in a functional context, we generated and analyzed homology models of the H protein. The overall protein fold, which forms a homodimer/tetramer, was found to be highly conserved across all 14 modeled lineages. The 15 positively selected sites were mapped onto the surface of a representative H protein monomer (Figure 3). The visualization confirms that these sites, particularly the critical receptor-binding residues, are located on the exposed outer surfaces of the protein’s head domain. A superimposed view of the key receptor-binding sites across all 14 lineage models reveals subtle but potentially significant differences in the size, charge, and orientation of amino acid side chains at these functionally critical positions (Figure 3B), providing a structural basis for potential differences in receptor interactions.

3.6. Predicted Antigenic Differences Between Vaccine and Field Strains

To investigate the molecular basis for potential vaccine mismatch, we compared the predicted B-cell epitope profiles of the H protein from the vaccine-like America-1 lineage with two representative field strains (Arctic-like and European lineages) (Figure 4). While the overall antigenic profiles showed broad conservation across the protein, several regions exhibited notable differences in their predicted epitope scores. For instance, a region around site 400 showed a significantly higher predicted antigenicity in the field strains compared to the vaccine strain. Furthermore, subtle variations in epitope scores were observed in the functionally critical receptor-binding region (sites 530–550). The overlap of these variable antigenic regions with sites found to be under positive selection suggests that evolutionary pressure is actively shaping the antigenic surface of the H protein, providing a potential mechanism for immune evasion in vaccinated populations.

4. Discussion

This comprehensive bioinformatic analysis of 68 representative CDV H gene sequences provides a clear snapshot of the virus’s ongoing global evolution and highlights several key themes critical for veterinary medicine and wildlife conservation. Our findings, derived from both Maximum Likelihood phylogeny and phylogenetic network analysis, reveal the extensive genetic diversity of CDV and underscore the complex interplay between viral lineage, geographic distribution, and host range.

4.1. Global Dispersal and Co-Circulation of CDV Lineages

The phylogenetic tree (Figure 1) and the Neighbor-Net analysis (Figure 2) both successfully segregated the representative strains into distinct, well-supported clades corresponding to established CDV lineages. Our analysis visualizes the cosmopolitan nature of certain lineages, such as Europe/South America-1, which was identified in samples from South America (Brazil, Chile, Uruguay) and Europe (Italy, Germany). This broad distribution supports the hypothesis of frequent inter-continental viral traffic, likely facilitated by the global movement of domestic dogs [41,42,43]. In contrast, other lineages like Southern African appear more geographically constrained, though the presence of multiple sub-clades within this lineage in a single country (South Africa) points to complex local evolution and circulation among a diverse array of wildlife hosts, including lions (Panthera leo), spotted hyenas (Crocuta crocuta), and African wild dogs (Lycaon pictus) [44,45,46]. A significant finding is the co-circulation of multiple distinct lineages within the same geographic region. For instance, Italy was found to harbor strains from the Arctic-like, European Wildlife, and Europe/South America-1 lineages. This demonstrates that viral populations in a given area are not static but are shaped by repeated, independent introduction events, likely from different sources or hosts. The Neighbor-Net analysis (Figure 2) further supports this complexity, where the dense, web-like reticulations connecting the major lineage clusters suggest a history of rapid diversification and potentially conflicting phylogenetic signals that a simple tree cannot fully capture [45,47,48,49].

4.2. The Wildlife–Domestic Interface, Host Adaptation, and H Gene’s Role

A central theme emerging from our results is the critical role of the H gene in host adaptation, particularly at the wildlife–domestic animal interface. The data clearly show that major lineages are not restricted to domestic dogs. The Southern African lineage was found in a variety of wild carnivores [45], the European Wildlife lineage was identified in foxes (Vulpes vulpes) and martens (Martes martes) [50,51], and the Asia-1 lineage, while common in dogs, was also found in a raccoon dog (Nyctereutes procyonoides) and a rhesus macaque (Macaca mulatta) [20,23,44,52,53]. This supports the hypothesis that wild carnivores act as significant wildlife hosts and potential reservoirs, maintaining viral diversity and facilitating spillover events into domestic dog populations and across different wildlife species. The molecular basis for this host plasticity is likely rooted in the high variability of the H gene, particularly at receptor-binding sites. Our alignment analysis (Figure 1) highlights extensive amino acid substitutions at key positions like 530 and 549, which are known to mediate binding to the SLAM and Nectin-4 receptors and are critical for determining host tropism. The diversity at these sites (e.g., at least five different amino acids at position 530) across lineages likely reflects ongoing adaptation to the specific receptor variants of different host species [19,21,25,44,54]. This continuous evolution allows the virus to explore new ecological niches, posing a persistent threat to a wide range of carnivores, including endangered species like the Iberian lynx (Lynx pardinus) and Amur tiger (Panthera tigris) [55,56]. This dynamic highlights the potential for transmission in both directions, from domestic dogs to wildlife (spillover) and from wildlife reservoirs back to dogs (spillback), which complicates disease control efforts. Thus, high vaccination coverage in domestic dogs is crucial, and strategies such as wildlife vaccination or vaccinated dog buffer zones around conservation areas should be considered to mitigate spillover risks.
The domains on the H gene that bind to the SLAM and Nectin-4 receptors act as the key factors driving host and lineage shifts and are illustrated in Figure 1. Specifically, these regions involve amino acids at positions 519, 530, and 549 for SLAM binding and at 478, 479, 537, and 539 for Nectin-4 binding. While recent research has identified substitutions at other loci, the significance of these aforementioned regions remains undiminished. Notably, Figure 1 and Table 2 indicate a higher frequency of substitutions within the SLAM-binding amino acids. A plausible explanation for this observation is the role of SLAM as a cellular entry receptor, which would impose a more intense selective pressure on the virus to adapt [57]. In addition to host switching, it is also most likely to emerge as the most significant factor influencing lineage differentiation. Our findings of positive selection on the H gene align with known polymorphisms in carnivore SLAM receptors. Residue 530 (glycine in canids; tyrosine or histidine in felids or mustelids) and residue 549 strongly influence receptor-binding specificity. Positive selection at these sites provides evolutionary evidence of CDV adaptation to host receptor diversity, highlighting the molecular basis of its host-switching capacity [43,58,59].
Our structural modeling also contextualizes these findings. The 3D model (Figure 3) indicates that these positively selected sites are located on the exposed outer surfaces of the protein head domain, consistent with their role in host–receptor interaction and potential immune evasion. Furthermore, the comparative modeling of 14 distinct lineages (Figure 3B) revealed that while the overall protein fold is highly conserved, the specific amino acid substitutions at these key sites alter the local size, charge, and chemistry of the receptor-binding interface. For example, the variation at site 530 between a small glycine (common in vaccine-like strains) and a larger, charged aspartic acid (common in Asia-1) provides a structural hypothesis for how different lineages adapt to the specific receptor variants of different host species [19].

4.3. Implications for Vaccine Efficacy

A crucial point of discussion arising from our phylogenetic analysis is the significant genetic divergence between currently circulating field strains and the vaccine strains. The vast majority of commercial CDV vaccines are derived from the historical America-1 lineage (e.g., Onderstepoort strain) [24,45]. Our results show that the representative strains from major circulating lineages like Asia-1, Europe/South America-1, and Southern African form distinct clades that are phylogenetically distant from the America-1 lineage. This genetic gap, particularly in the highly antigenic H gene, is a leading hypothesis for observed vaccine failures and outbreaks in vaccinated dog populations [44,60].
B-cell epitope conservancy analysis (Figure 4) also provides computational evidence to support this “vaccine mismatch” hypothesis. The analysis revealed notable differences in the predicted epitope profiles between the vaccine strain and the representative field strains (Arctic-like and Europe), which have been substantially detected throughout Europe [61,62]. The location of these antigenic differences within regions under positive selection suggests that the virus is actively evolving to evade the host immune response [2,10,25].
Our analysis revealed several substitutions in critical receptor-binding residues of the H protein, particularly at positions 530 and 549, that diverge significantly from those in the America-1 vaccine strain. Due to their critical role in SLAM receptor engagement and their location within positively selected B-cell epitope regions [17,19,63,64], these changes could reduce vaccine-induced protection and enable cross-species transmission [24,44]. Structural modeling further indicated that variation in side-chain properties at these sites may alter receptor-binding interfaces [8,16]. This factor could contribute to the emergence of immune-evading variants in regions with co-circulating divergent lineages, such as Europe and Asia [24,31]. These findings underscore the urgent necessity for vaccines that incorporate antigens from contemporary field strains, as well as the need for enhanced genomic surveillance to detect variants with reduced neutralization potential.
Our study has a limitation in that the dataset of 68 representative sequences might not be sufficient to reliably estimate substitution rates or root-to-tip divergence. Larger datasets could yield more accurate results. Nevertheless, our findings encourage future studies to conduct these analyses.

5. Conclusions

Extensive research has established the importance of CDV and its geographical distribution patterns. The considerable diversity observed among CDV lineages is primarily attributed to their specific geographical origins.
The clear phylogenetic and geographic structuring of lineages, coupled with extensive host switching and significant molecular variation at key functional sites, highlights the complex challenges in controlling canine distemper. The deep genetic divergence between circulating field strains and current vaccine lineages is a point of major concern that warrants further investigation and may necessitate a re-evaluation of global vaccination strategies. A “One Health” approach, integrating surveillance of domestic dogs, wildlife, and the environment, is essential to effectively mitigate the threat posed by this formidable multi-host virus.
Moving forward, effective control of CDV will require proactive measures that extend beyond current vaccination practices. We recommend the development of regionalized or updated vaccine formulations that incorporate antigens from prevalent local lineages, particularly in regions where multiple divergent strains co-circulate. Expanded genomic surveillance is urgently needed in underrepresented areas such as Africa and Southeast Asia to identify emerging variants before they spread widely. Finally, experimental validation of our structural predictions, including receptor-binding and neutralization assays, will be essential to confirm the functional consequences of the observed amino acid substitutions and to guide evidence-based vaccine updates.

Author Contributions

Conceptualization, T.Ç.O. and B.T.K.; methodology, B.T.K.; software, B.T.K.; validation, T.Ç.O. and B.T.K.; formal analysis, T.Ç.O. and B.T.K.; data curation, T.Ç.O. and B.T.K.; writing—original draft preparation, T.Ç.O. and B.T.K.; writing—review and editing, T.Ç.O. and B.T.K.; visualization, T.Ç.O. and B.T.K.; supervision, T.Ç.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data presented in the study were obtained from GenBank, and further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Supplementary Metadata

Table A1. List of 692 GenBank accession numbers for the complete H gene sequences of canine distemper virus (CDV) used as the initial dataset for this study.
Table A1. List of 692 GenBank accession numbers for the complete H gene sequences of canine distemper virus (CDV) used as the initial dataset for this study.
GenBank Accession Numbers
PQ625782.1PQ611763.1PQ241483.1PQ241484.1
PQ241485.1PQ241486.1PQ241487.1PQ241488.1
PQ241489.1PQ241490.1PQ241491.1PQ241492.1
PQ241493.1PQ241494.1PQ241495.1PQ241496.1
PQ241497.1PQ241498.1PQ241499.1PQ241500.1
PQ241501.1PQ241502.1PP977570.1PP894823.1
PP850218.1PP850219.1PP850220.1PP850221.1
PP493270.1PP493271.1PP493272.1PP493273.1
PP316238.1PP316239.1OR612054.1OR612056.1
OR243301.1OR123691.1OR123692.1OQ448914.1
OQ448915.1OP546520.1OP546521.1OP546522.1
OP546523.1OP546524.1OP546525.1OP546526.1
OP546527.1OP546528.1OP546529.1ON458033.1
OL795421.1OL795422.1OL795423.1OL795424.1
OL795425.1OL795426.1OL795427.1OL795428.1
OL795429.1OL795430.1ON929298.1ON929300.1
ON093253.1MW581018.1MW581019.1MW581020.1
MW581021.1MW581022.1MW581023.1MW581024.1
MW581025.1MW581026.1MW015089.1MW015090.1
MT350712.1MT350713.1MT350714.1MT350715.1
MT350716.1MT350717.1MW036777.1MW036781.1
MW036782.1MW036787.1MW036798.1MW036800.1
MW036808.1MW036817.1MW036820.1MT292106.1
MN044670.1MN044671.1MN044672.1MN044673.1
MN044674.1MN044675.1MN044676.1MN044679.1
MN044681.1MN044684.1MN044685.1MN044686.1
MN044687.1MN044688.1MN044693.1MN044694.1
MN044698.1MN044699.1MN044700.1MT066171.1
LC498611.1MK275197.1MK275198.1MK275199.1
MK275200.1MK275201.1MK275202.1MK275203.1
MK275204.1MK689164.1MK689165.1MK689166.1
MG952927.1MG952928.1MG952929.1LC426017.1
MF663673.1MF663674.1MG922458.1MG922459.1
MG922460.1MF467738.1MF467739.1MF467740.1
MF467741.1MF467742.1MF467743.1MF467744.1
MF467745.1MF467746.1MF467747.1KX708721.1
KX708722.1KX708723.1KX708724.1KX708725.1
KX708726.1KX708727.1KX708728.1KX708729.1
KX708730.1KX708731.1KX708732.1MF926598.1
MF926605.1MF926606.1MF926607.1MF926608.1
MF926609.1MF926610.1MF926611.1MG827088.1
MG827089.1MG827090.1KY774571.1KY774572.1
KY774573.1KY774574.1KY774575.1KY774576.1
KY774577.1KY774578.1KY350199.1KY350200.1
KY350201.1KY350202.1KY350203.1KY350204.1
KY350205.1KY350206.1KY350207.1KY350208.1
KY350209.1KY350210.1KY350211.1KY350212.1
KY350213.1KX897609.1KX897610.1KX897611.1
KX897612.1MF964178.1MF964179.1MF964180.1
MF964181.1MF964182.1MF964183.1MF964184.1
MF964185.1MF964186.1MF964187.1MF964190.1
KX943319.1KX943320.1KX943321.1KX943322.1
KX943323.1KX943324.1KX943325.1KX943326.1
MF100798.1MF100799.1MF100800.1MF100801.1
MF100802.1MF100803.1MF100804.1MF100805.1
MF100806.1MF100807.1MF100808.1MF100809.1
MF100810.1MF100811.1KU052888.1KU052889.1
KU052890.1KU052891.1KU052892.1KU052893.1
KU052894.1KU052895.1KU052896.1KU052897.1
KX371584.1KX371585.1KX371586.1KX371587.1
KX371588.1KU342688.1KU342692.1LC159583.1
LC159584.1LC159585.1LC159586.1KU847961.1
KU552082.1KU521343.1KU521344.1KU521345.1
KU521346.1KU983518.1KU983519.1KU983520.1
KU983521.1KU983522.1KU983523.1KU030831.1
KU030832.1KU030833.1KT266736.1KT119347.1
KR185351.1KT341044.1KT341045.1KT341046.1
KT341047.1KT341048.1KP872502.1KR002657.1
KR002658.1KR002659.1KR002660.1KR002661.1
KR074995.1LC007974.1LC007975.1LC007976.1
KP127955.1KP127956.1KP127957.1KP127958.1
KP127959.1KP127960.1KP127961.1KP127962.1
KP127963.1KP127964.1KP127965.1KP127966.1
KP127967.1KM923900.1KJ600773.1KJ600774.1
KJ600775.1KJ600776.1KM386683.1KM115532.1
KM115533.1KM115534.1KM115535.1KM115536.1
KC916714.1KC916715.1KC916716.1KC916717.1
KJ865300.1KJ728666.1KJ728667.1KJ685550.1
KF835411.1KF835412.1KF835413.1KF835414.1
KF835415.1KF835416.1KF835417.1KF835418.1
KF835419.1KF835420.1KF835421.1KF835422.1
KF835423.1KF835424.1KF835425.1KJ579444.1
KJ160517.1KJ437593.1KJ437594.1KJ437595.1
KJ437596.1KJ489381.1KJ489382.1KJ489383.1
KF430337.1KF430338.1KF430339.1KF430340.1
KF430341.1KF430342.1KF430343.1KF430344.1
KF430345.1KF430346.1KF430347.1KF430348.1
KF430349.1KF430350.1KF430351.1KF430352.1
KF430353.1KF430354.1KF430355.1KF430356.1
KF430357.1KF430358.1KF430359.1KF430360.1
KF430361.1KF430362.1KF430363.1KF430364.1
KF430365.1KF430366.1KF430367.1KF430368.1
KF430369.1KF430370.1KF430371.1KF430372.1
KF430373.1KF430374.1KF430375.1KF430376.1
KF430377.1KF880677.1KF880678.1KF880679.1
KF880680.1KF880681.1KF880682.1KF880683.1
KF880684.1KF767692.1JX844218.1JX844219.1
JX844220.1JX844221.1KF006819.1KF006820.1
KC966928.1KC966929.1JQ319389.1JQ319391.1
JQ319392.1JQ319393.1JQ319395.1JQ319396.1
JQ319397.1JQ319398.1JQ319399.1JQ327707.1
JQ327708.1JN008893.1JN008894.1JN008895.1
JN008896.1JN008897.1JN008898.1JN008899.1
JN008900.1JN008901.1JN008902.1JN008903.1
JN008904.1JN008905.1JN008906.1JN008907.1
KC667065.1KC667067.1KC590510.1KC494688.1
AB475098.1KC257463.1KC257464.1JQ966309.1
JX912958.2JX912959.2JX912960.2JX912961.2
JX912963.2JX912964.2JX912965.2JX912966.2
JX912968.2JX912971.2JX912975.2JX912976.2
JX912978.2JF810106.1JF810107.1JF810108.1
JF810109.1JF810110.1JF810111.1JN812975.1
JN812976.1JX276746.1JQ732170.1JQ732171.1
JQ732172.1JQ732173.1JQ732174.1JQ732175.1
JQ732176.1JQ732177.1JN836734.1JN836735.1
JN836736.1JN836737.1JN622007.1JN381191.1
JN215473.1JN215474.1JN215475.1JN215476.1
JN215477.1AB619774.1AB619775.1JF343962.1
JN153019.1JN153020.1JN153021.1JN153022.1
JN153023.1JN153024.1JN153025.1AB605890.1
AB605891.1HQ657209.1HQ850147.1HM443704.1
HM443706.1HM443707.1HM443708.1HM443709.1
HM443710.1HM443711.1HM443712.1HM443714.1
HM443715.1HM443716.1HM443717.1HM443718.1
HM443719.1HM443720.1HM443721.1HM443722.1
HM443723.1HM443724.1HM443725.1HM443726.1
HQ007902.1GU810819.1HQ128599.1HQ128600.1
HQ128601.1AB606410.1HM623891.1HM623893.1
HM623895.1HQ403645.1HM448829.1HM448830.1
HM448831.1HM448832.1HM448833.1HM448834.1
HM749644.1GU001863.1GU001864.1HM120874.1
HM165273.1FJ461693.1FJ461694.1FJ461695.1
FJ461696.1FJ461697.1FJ461698.1FJ461699.1
FJ461700.1FJ461701.1FJ461702.1FJ461703.1
FJ461704.1FJ461705.1FJ461706.1FJ461707.1
FJ461708.1FJ461709.1FJ461710.1FJ461711.1
FJ461712.1FJ461713.1FJ461714.1FJ461715.1
FJ461716.1FJ461717.1FJ461718.1FJ461719.1
FJ461720.1FJ461721.1FJ461722.1FJ461723.1
FJ461724.1GQ332530.1GQ332531.1GQ332532.1
GQ332533.1GQ332534.1GQ332535.1GQ214369.2
GQ214373.2GQ214374.2GQ214376.2GQ214378.2
GQ214380.2GQ214384.2FJ848530.1FJ848531.1
FJ848532.1FJ848533.1FJ848534.1FJ848535.1
FJ848536.1FJ851450.1FJ851451.1FJ851452.1
FJ851453.1FJ851454.1FJ851455.1FJ851456.1
FJ851457.1FJ851458.1FJ810213.1FJ810214.1
FJ810215.1FJ694852.1FJ694853.1FJ705230.1
FJ705231.1FJ705232.1FJ705233.1FJ705234.1
FJ705235.1FJ705236.1FJ705237.1FJ705238.1
FJ705239.1FJ605455.1FJ535063.1EU934233.1
EU296481.1EU296482.1EU296483.1EU296484.1
EU296485.1EU296486.1EU296487.1EU296488.1
EU296489.1EU296490.1EU296491.1EU296492.1
EU296493.1EU296494.1FJ423604.1FJ423605.1
FJ423606.1FJ423607.1FJ423608.1FJ423609.1
FJ392651.1FJ392652.1FJ409464.1FJ405223.1
FJ405224.1EU716072.1EU716073.1EU716074.1
EU716075.1FJ011004.1FJ011005.1FJ040791.1
EU684265.1EU564812.1EU564813.1EU379560.1
EU325720.1EU325721.1EU325722.1EU325723.1
EU325724.1EU325725.1EU325726.1EU325727.1
EU325728.1EU325729.1EU325730.1EU325731.1
EU252148.1EU252149.1AM903376.1EU143737.1
DQ889177.1DQ889178.1DQ889179.1DQ889180.1
DQ889181.1DQ889182.1DQ889183.1DQ889184.1
DQ889185.1DQ889186.1DQ889187.1DQ889188.1
DQ889189.1DQ903854.1EF445051.1EF445052.1
EF445053.1EF445054.1EF042818.1DQ922630.1
DQ226087.1DQ226088.1DQ228166.1DQ494317.1
DQ494318.1DQ494319.1DQ191175.1AY548109.1
AY548110.1AY548111.1AY498692.1AY465925.1
AY438597.1AB040766.1AB040767.1AB040768.1
Table A2. Metadata for the 68 representative canine distemper virus (CDV) H gene sequences used in the phylogenetic and network analyses. The table includes GenBank accession number, collection year, country of origin, host species, and assigned lineage.
Table A2. Metadata for the 68 representative canine distemper virus (CDV) H gene sequences used in the phylogenetic and network analyses. The table includes GenBank accession number, collection year, country of origin, host species, and assigned lineage.
AccessionYearCountryHostLineage
LC498611.12017JapanVulpes zerdaRockborn-like
MK275203.12018ChinaCanis sp.Asia-1
MG952928.12014ArgentinaCanis lupus familiarisEuropean Wildlife
LC426017.12018JapanCanis lupus familiarisAsia-2
MF663673.12016ItalyDogArctic-like
MF467739.12016South AfricaAfrican wild dogSouthern African
MF467741.12016South AfricaAfrican wild dogSouthern African
MF467742.12016South AfricaAfrican wild dogSouthern African
MF467743.12016South AfricaAfrican wild dogSouthern African
MF467745.12015South AfricaLionSouthern African
MF467746.12015South AfricaLionSouthern African
KX708721.12013RussiaMartes zibellinaArctic-like
KY774571.12016ChinaDogAsia-4
KY350201.12016ChinaDogAsia-1
KX897612.12015ChinaTigerAsia-1
MF964185.12016IndiaDogAsia-5
KU052888.12014ChileDogEurope/South America-1
KX371587.12016ChinaDogAsia-1
KU552082.11982ChinaDogArctic-like
KU983518.12014ChinaDogAsia-1
KT266736.12014MexicoCanineAmerica-3
LC007974.12010JapanVulpes vulpesAsia-1
KM923900.12008ChinaMonkeyAsia-1
JN008893.12011GreeceDogEurope/South America-1
JN008905.12011GreeceVulpes vulpesEurope/South America-1
KC667065.12012ChinaDogAsia-1
JX912959.22012BrazilDogEurope/South America-1
JX912960.22012BrazilDogEurope/South America-1
JX912975.22012BrazilDogEurope/South America-1
JQ732170.12012BrazilDogEurope/South America-1
JN836734.12009USAMartes pennantiAmerica-3
JN215473.12007UruguayDogEurope/South America-1
AB619775.12009JapanRacoon DogAsia-1
JN153019.12007GermanyRacoonEuropean Wildlife
EU296484.12006TaiwanCanineAsia-1
EU296491.12006TaiwanCanineAsia-1
EU296494.12007TaiwanCanineAsia-1
AM903376.12006IndiaDogAmerica-1
DQ889189.12006HungaryStone martenEuropean Wildlife
DQ922630.12006ChinaDogAsia-1
DQ228166.12006HungaryDogEuropean Wildlife
AY465925.12004USARaccoonAmerica-2
AY438597.12004USADogAmerica-2
PQ625782.12021IranDogArctic-like
PQ611763.12021IranDogArctic-like
PQ241483.12023VietnamDogAsia-4
PQ241498.12018VietnamDogAsia-1
PP850219.12023ChileDogNorth/South America-4
PP493270.12022ChileDogEurope/South America-1
OR612056.12022USADomestic FerretArctic-like
OR123691.12016NepalPanthera pardusAsia-5
OR123692.12021NepalPanthera pardusAsia-5
OQ448915.12010ItalyFerretEurope/South America-1
OP546520.12021ItalyFoxEuropean Wildlife
OL795430.12016GermanyRed foxEurope/South America-1
MW581018.12015USAGrey sealAsia-2
MT350712.12018PeruCanis lupus familiarisNorth/South America-4
MW036800.12019ItalyFoxEurope/South America-1
KC916717.11994TanzaniaSpotted hyenaAfrica-2
KJ728666.12013ChinaRaccoon dogAsia-1
KF835411.12011ColombiaDogNorth/South America-4
KF835422.12012ColombiaDogNorth/South America-4
KJ489381.12012ChinaDogAsia-4
KF430372.12007DenmarkNeovison visonEurope
GU001863.12005SpainLynx pardinusEurope/South America-1
FJ461693.12007South AfricaDogSouthern African
FJ461694.12007South AfricaDogSouthern African
GQ214374.22006AustriaBadgerEuropean Wildlife

References

  1. Koç, B.T.; Akkutay-Yoldar, Z.; Oğuzoğlu, T. New members to Arctic-like lineage of canine distemper virus from Turkey. Comp. Immunol. Microbiol. Infect. Dis. 2021, 78, 101678. [Google Scholar] [CrossRef] [PubMed]
  2. Wang, H.; Guo, H.; Hein, V.G.; Xu, Y.; Yu, S.; Wang, X. The evolutionary dynamics history of canine distemper virus through analysis of the hemagglutinin gene during 1930–2020. Eur. J. Wildl. Res. 2023, 69, 56. [Google Scholar] [CrossRef]
  3. Glišić, D.; Kuručki, M.; Ćirović, D.; Šolaja, S.; Mirčeta, J.; Milićević, V. Molecular analysis of canine distemper virus H gene in the golden jackal (Canis aureus) population from Serbia. BMC Vet. Res. 2024, 20, 426. [Google Scholar] [CrossRef]
  4. Khosravi, M.; Gharib, F.Z.; bakhshi, A. Phylogenetic and molecular analysis of hemagglutinin gene and Fsp-coding region of canine distemper virus: Insight into novel vaccine development. Comp. Immunol. Microbiol. Infect. Dis. 2025, 117, 102292. [Google Scholar] [CrossRef]
  5. Giacinti, J.A.; Pearl, D.L.; Ojkic, D.; Campbell, G.D.; Jardine, C.M. Genetic characterization of canine distemper virus from wild and domestic animal submissions to diagnostic facilities in Canada. Prev. Vet. Med. 2022, 198, 105535. [Google Scholar] [CrossRef]
  6. Rivera-Martínez, A.; Rodríguez-Alarcón, C.A.; Adame-Gallegos, J.R.; Laredo-Tiscareño, S.V.; de Luna-Santillana, E.d.J.; Hernández-Triana, L.M.; Garza-Hernández, J.A. Canine Distemper Virus: Origins, Mutations, Diagnosis, and Epidemiology in Mexico. Life 2024, 14, 1002. [Google Scholar] [CrossRef]
  7. Martinez-Gutierrez, M.; Ruiz-Saenz, J. Diversity of susceptible hosts in canine distemper virus infection: A systematic review and data synthesis. BMC Vet. Res. 2016, 12, 78. [Google Scholar] [CrossRef] [PubMed]
  8. McCarthy, A.J.; Shaw, M.A.; Goodman, S.J. Pathogen evolution and disease emergence in carnivores. Proc. Biol. Sci. 2007, 274, 3165–3174. [Google Scholar] [CrossRef] [PubMed]
  9. Oğuzoğlu, T.Ç.; Yücel-Tenekeci, G.; Çalışkan, M.; Bozkurt, M.F.; De Massis, F. Detection and characterization of Distemper virus in a mink (Neovison vison) in Turkey. Vet. Ital. 2018, 54, 79–85. [Google Scholar] [CrossRef]
  10. Rendon-Marin, S.; da Fontoura Budaszewski, R.; Canal, C.W.; Ruiz-Saenz, J. Tropism and molecular pathogenesis of canine distemper virus. Virol. J. 2019, 16, 30. [Google Scholar] [CrossRef]
  11. Martella, V.; Elia, G.; Buonavoglia, C. Canine Distemper Virus. Vet. Clin. N. Am. Small Anim. Pract. 2008, 38, 787–797. [Google Scholar] [CrossRef] [PubMed]
  12. Wilkes, R.P. Canine Distemper Virus in Endangered Species: Species Jump, Clinical Variations, and Vaccination. Pathogens 2022, 12, 57. [Google Scholar] [CrossRef] [PubMed]
  13. von Messling, V.; Zimmer, G.; Herrler, G.; Haas, L.; Cattaneo, R. The Hemagglutinin of Canine Distemper Virus Determines Tropism and Cytopathogenicity. J. Virol. 2001, 75, 6418–6427. [Google Scholar] [CrossRef]
  14. Noyce, R.S.; Delpeut, S.; Richardson, C.D. Dog nectin-4 is an epithelial cell receptor for canine distemper virus that facilitates virus entry and syncytia formation. Virology 2013, 436, 210–220. [Google Scholar] [CrossRef]
  15. Pomeroy, L.W.; Bjørnstad, O.N.; Holmes, E.C. The Evolutionary and Epidemiological Dynamics of the Paramyxoviridae. J. Mol. Evol. 2008, 66, 98–106. [Google Scholar] [CrossRef]
  16. Weckworth, J.K.; Davis, B.W.; Roelke-Parker, M.E.; Wilkes, R.P.; Packer, C.; Eblate, E.; Schwartz, M.K.; Mills, L.S. Identifying Candidate Genetic Markers of CDV Cross-Species Pathogenicity in African Lions. Pathogens 2020, 9, 872. [Google Scholar] [CrossRef] [PubMed]
  17. Zipperle, L.; Langedijk, J.P.M.; Örvell, C.; Vandevelde, M.; Zurbriggen, A.; Plattet, P. Identification of Key Residues in Virulent Canine Distemper Virus Hemagglutinin That Control CD150/SLAM-Binding Activity. J. Virol. 2010, 84, 9618–9624. [Google Scholar] [CrossRef] [PubMed]
  18. Sawatsky, B.; Wong, X.X.; Hinkelmann, S.; Cattaneo, R.; von Messling, V. Canine Distemper Virus Epithelial Cell Infection Is Required for Clinical Disease but Not for Immunosuppression. J. Virol. 2012, 86, 3658–3666. [Google Scholar] [CrossRef]
  19. Nikolin, V.M.; Wibbelt, G.; Michler, F.U.F.; Wolf, P.; East, M.L. Susceptibility of carnivore hosts to strains of canine distemper virus from distinct genetic lineages. Vet. Microbiol. 2012, 156, 45–53. [Google Scholar] [CrossRef]
  20. Nikolin, V.M.; Olarte-Castillo, X.A.; Osterrieder, N.; Hofer, H.; Dubovi, E.; Mazzoni, C.J.; Brunner, E.; Goller, K.V.; Fyumagwa, R.D.; Moehlman, P.D.; et al. Canine distemper virus in the Serengeti ecosystem: Molecular adaptation to different carnivore species. Mol. Ecol. 2016, 26, 2111–2130. [Google Scholar] [CrossRef]
  21. Demeter, Z.; Lakatos, B.; Palade, E.A.; Kozma, T.; Forgách, P.; Rusvai, M. Genetic diversity of Hungarian canine distemper virus strains. Vet. Microbiol. 2007, 122, 258–269. [Google Scholar] [CrossRef]
  22. Karki, M.; Rajak, K.K.; Singh, R.P. Canine morbillivirus (CDV): A review on current status, emergence and the diagnostics. VirusDisease 2022, 33, 309–321. [Google Scholar] [CrossRef]
  23. Anis, E.; Needle, D.B.; Stevens, B.; Yan, L.; Wilkes, R.P. Genetic Characteristics of Canine Distemper Viruses Circulating in Wildlife in The United States. J. Zoo Wildl. Med. 2020, 50, 790. [Google Scholar] [CrossRef] [PubMed]
  24. Bhatt, M.; Rajak, K.K.; Chakravarti, S.; Yadav, A.K.; Kumar, A.; Gupta, V.; Chander, V.; Mathesh, K.; Chandramohan, S.; Sharma, A.K.; et al. Phylogenetic analysis of haemagglutinin gene deciphering a new genetically distinct lineage of canine distemper virus circulating among domestic dogs in India. Transbound. Emerg. Dis. 2019, 66, 1252–1267. [Google Scholar] [CrossRef] [PubMed]
  25. Duque-Valencia, J.; Sarute, N.; Olarte-Castillo, X.A.; Ruíz-Sáenz, J. Evolution and Interspecies Transmission of Canine Distemper Virus—An Outlook of the Diverse Evolutionary Landscapes of a Multi-Host Virus. Viruses 2019, 11, 582. [Google Scholar] [CrossRef]
  26. Espinal, M.A.; Díaz, F.J.; Ruiz-Saenz, J. Phylogenetic evidence of a new canine distemper virus lineage among domestic dogs in Colombia, South America. Vet. Microbiol. 2014, 172, 168–176. [Google Scholar] [CrossRef] [PubMed]
  27. Saltık, H.S.; Atlı, K. Approaches to identify canine distemper virus with neurological symptoms on the basis of molecular characterization of hemagglutinin and fusion genes. Virus Genes 2023, 59, 591–603. [Google Scholar] [CrossRef]
  28. Sarute, N.; Delgado, M.V.; Carrau, L.; Benech, A.; Francia, L.; Pérez, R.; Panzera, Y. First Genome Sequence of a Canine Distemper Virus Strain from South America. Genome Announc. 2014, 2, 10-1128. [Google Scholar] [CrossRef]
  29. Woma, T.Y.; van Vuuren, M.; Bosman, A.M.; Quan, M.; Oosthuizen, M. Phylogenetic analysis of the haemagglutinin gene of current wild-type canine distemper viruses from South Africa: Lineage Africa. Vet. Microbiol. 2010, 143, 126–132. [Google Scholar] [CrossRef]
  30. Zhao, J.J.; Yan, X.J.; Chai, X.L.; Martella, V.; Luo, G.L.; Zhang, H.L.; Gao, H.; Liu, Y.X.; Bai, X.; Zhang, L.; et al. Phylogenetic analysis of the haemagglutinin gene of canine distemper virus strains detected from breeding foxes, raccoon dogs and minks in China. Vet. Microbiol. 2010, 140, 34–42. [Google Scholar] [CrossRef]
  31. Martella, V.; Cirone, F.; Elia, G.; Lorusso, E.; Decaro, N.; Campolo, M.; Desario, C.; Lucente, M.; Bellacicco, A.; Blixenkrone-Møller, M.; et al. Heterogeneity within the hemagglutinin genes of canine distemper virus (CDV) strains detected in Italy. Vet. Microbiol. 2006, 116, 301–309. [Google Scholar] [CrossRef]
  32. Steinegger, M.; Söding, J. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nat. Biotechnol. 2017, 35, 1026–1028. [Google Scholar] [CrossRef]
  33. Sievers, F.; Higgins, D.G. The Clustal Omega Multiple Alignment Package. In Multiple Sequence Alignment; Springer: New York, NY, USA, 2020; pp. 3–16. [Google Scholar] [CrossRef]
  34. Murrell, B.; Wertheim, J.O.; Moola, S.; Weighill, T.; Scheffler, K.; Kosakovsky Pond, S.L. Detecting Individual Sites Subject to Episodic Diversifying Selection. PLoS Genet. 2012, 8, e1002764. [Google Scholar] [CrossRef] [PubMed]
  35. Meng, E.C.; Goddard, T.D.; Pettersen, E.F.; Couch, G.S.; Pearson, Z.J.; Morris, J.H.; Ferrin, T.E. UCSF ChimeraX: Tools for structure building and analysis. Protein Sci. 2023, 32, e4792. [Google Scholar] [CrossRef] [PubMed]
  36. Waterhouse, A.; Bertoni, M.; Bienert, S.; Studer, G.; Tauriello, G.; Gumienny, R.; Heer, F.T.; de Beer, T.A.; Rempfer, C.; Bordoli, L.; et al. SWISS-MODEL: Homology modelling of protein structures and complexes. Nucleic Acids Res. 2018, 46, W296–W303. [Google Scholar] [CrossRef] [PubMed]
  37. Clifford, J.N.; Høie, M.H.; Deleuran, S.; Peters, B.; Nielsen, M.; Marcatili, P. BepiPred-3.0: Improved B-cell epitope prediction using protein language models. Protein Sci. 2022, 31, e4497. [Google Scholar] [CrossRef]
  38. Minh, B.Q.; Schmidt, H.A.; Chernomor, O.; Schrempf, D.; Woodhams, M.D.; Von Haeseler, A.; Lanfear, R. IQ-TREE 2: New Models and Efficient Methods for Phylogenetic Inference in the Genomic Era. Mol. Biol. Evol. 2020, 37, 1530–1534. [Google Scholar] [CrossRef]
  39. Huson, D.H.; Bryant, D. The SplitsTree App: Interactive analysis and visualization using phylogenetic trees and networks. Nat. Methods 2024, 21, 1773–1774. [Google Scholar] [CrossRef] [PubMed]
  40. Letunic, I.; Bork, P. Interactive Tree Of Life (iTOL) v5: An online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021, 49, W293–W296. [Google Scholar] [CrossRef]
  41. Rendon-Marin, S.; Martinez-Gutierrez, M.; Suarez, J.A.; Ruiz-Saenz, J. Canine Distemper Virus (CDV) Transit Through the Americas: Need to Assess the Impact of CDV Infection on Species Conservation. Front. Microbiol. 2020, 11, 810. [Google Scholar] [CrossRef]
  42. Rios-Usuga, C.; Ortiz-Pineda, M.C.; Aguirre-Catolico, S.D.; Quiroz, V.H.; Ruiz-Saenz, J. Concurrent Circulation of Canine Distemper Virus (South America-4 Lineage) at the Wild–Domestic Canid Interface in Aburrá Valley, Colombia. Viruses 2025, 17, 649. [Google Scholar] [CrossRef]
  43. Wipf, A.; Perez-Cutillas, P.; Ortega, N.; Huertas-López, A.; Martínez-Carrasco, C.; Candela, M.G. Geographical Distribution of Carnivore Hosts and Genotypes of Canine Distemper Virus (CDV) Worldwide: A Scoping Review and Spatial Meta-Analysis. Transbound. Emerg. Dis. 2025, 2025, 6632068. [Google Scholar] [CrossRef] [PubMed]
  44. Feng, N.; Liu, Y.; Wang, J.; Xu, W.; Li, T.; Wang, T.; Wang, L.; Yu, Y.; Wang, H.; Zhao, Y.; et al. Canine distemper virus isolated from a monkey efficiently replicates on Vero cells expressing non-human primate SLAM receptors but not human SLAM receptor. BMC Vet. Res. 2016, 12, 160. [Google Scholar] [CrossRef] [PubMed]
  45. Loots, A.K.; Mokgokong, P.S.; Mitchell, E.; Venter, E.H.; Kotze, A.; Dalton, D.L. Phylogenetic analysis of canine distemper virus in South African wildlife. PLoS ONE 2018, 13, e0199993. [Google Scholar] [CrossRef]
  46. Nguyen, D.V.; Suzuki, J.; Minami, S.; Yonemitsu, K.; Nagata, N.; Kuwata, R.; Shimoda, H.; Vu, C.K.; Truong, T.Q.; Maeda, K. Isolation and phylogenetic analysis of canine distemper virus among domestic dogs in Vietnam. J. Vet. Med. Sci. 2017, 79, 123–127. [Google Scholar] [CrossRef]
  47. Mira, F.; Purpari, G.; Di Bella, S.; Vicari, D.; Schirò, G.; Di Marco, P.; Macaluso, G.; Battilani, M.; Guercio, A. Update on canine distemper virus (CDV) strains of Arctic-like lineage detected in dogs in Italy. Vet. Ital. 2018, 54, 225–236. [Google Scholar] [CrossRef]
  48. Li, W.; Cai, C.; Xue, M.; Xu, G.; Wang, X.; Zhang, A.; Han, L. Phylogenetic analysis of canine distemper viruses isolated from vaccinated dogs in Wuhan. J. Vet. Med. Sci. 2018, 80, 1688–1690. [Google Scholar] [CrossRef]
  49. Trogu, T.; Canziani, S.; Salvato, S.; Bianchi, A.; Bertoletti, I.; Gibelli, L.R.; Alborali, G.L.; Barbieri, I.; Gaffuri, A.; Sala, G.; et al. Canine Distemper Outbreaks in Wild Carnivores in Northern Italy. Viruses 2021, 13, 99. [Google Scholar] [CrossRef] [PubMed]
  50. Kličková, E.; Černíková, L.; Dumondin, A.; Bártová, E.; Budíková, M.; Sedlák, K. Canine Distemper Virus in Wild Carnivore Populations from the Czech Republic (2012–2020): Occurrence, Geographical Distribution, and Phylogenetic Analysis. Life 2022, 12, 289. [Google Scholar] [CrossRef]
  51. Terio, K.A.; Craft, M.E. Canine Distemper Virus (CDV) in Another Big Cat: Should CDV Be Renamed Carnivore Distemper Virus? mBio 2013, 4, e00702-13. [Google Scholar] [CrossRef]
  52. Origgi, F.C.; Plattet, P.; Sattler, U.; Robert, N.; Casaubon, J.; Mavrot, F.; Pewsner, M.; Wu, N.; Giovannini, S.; Oevermann, A.; et al. Emergence of Canine Distemper Virus Strains With Modified Molecular Signature and Enhanced Neuronal Tropism Leading to High Mortality in Wild Carnivores. Vet. Pathol. 2012, 49, 913–929. [Google Scholar] [CrossRef]
  53. Pardo, I.D.R.; Johnson, G.C.; Kleiboeker, S.B. Phylogenetic Characterization of Canine Distemper Viruses Detected in Naturally Infected Dogs in North America. J. Clin. Microbiol. 2005, 43, 5009–5017. [Google Scholar] [CrossRef]
  54. Iwatsuki, K.; Shin, Y.S.; Miyashita, N.; Hirayama, N.; Gemma, T.; Mikami, T.; Mori, T.; Kai, C.; Yoshida, E. Molecular and phylogenetic analyses of the haemagglutinin (H) proteins of field isolates of canine distemper virus from naturally infected dogs. J. Gen. Virol. 1997, 78, 373–380. [Google Scholar] [CrossRef]
  55. Meli, M.L.; Simmler, P.; Cattori, V.; Martínez, F.; Vargas, A.; Palomares, F.; López-Bao, J.V.; Simón, M.A.; López, G.; León-Vizcaino, L.; et al. Importance of canine distemper virus (CDV) infection in free-ranging Iberian lynxes (Lynx pardinus). Vet. Microbiol. 2010, 146, 132–137. [Google Scholar] [CrossRef] [PubMed]
  56. Gilbert, M.; Sulikhan, N.; Uphyrkina, O.; Goncharuk, M.; Kerley, L.; Castro, E.H.; Reeve, R.; Seimon, T.; McAloose, D.; Seryodkin, I.V.; et al. Distemper, extinction, and vaccination of the Amur tiger. Proc. Natl. Acad. Sci. USA 2020, 117, 31954–31962. [Google Scholar] [CrossRef]
  57. Tatsuo, H.; Ono, N.; Yanagi, Y. Morbilliviruses Use Signaling Lymphocyte Activation Molecules (CD150) as Cellular Receptors. J. Virol. 2001, 75, 5842–5850. [Google Scholar] [CrossRef] [PubMed]
  58. Ohishi, K.; Suzuki, R.; Maeda, T.; Tsuda, M.; Abe, E.; Yoshida, T.; Endo, Y.; Okamura, M.; Nagamine, T.; Yamamoto, H.; et al. Recent Host Range Expansion of Canine Distemper Virus and Variation in its Receptor, The Signaling Lymphocyte Activation Molecule, in Carnivores. J. Wildl. Dis. 2014, 50, 596–606. [Google Scholar] [CrossRef]
  59. Yamamoto, Y.; Nakano, S.; Seki, F.; Shigeta, Y.; Ito, S.; Tokiwa, H.; Takeda, M. Computational Analysis Reveals a Critical Point Mutation in the N-Terminal Region of the Signaling Lymphocytic Activation Molecule Responsible for the Cross-Species Infection with Canine Distemper Virus. Molecules 2021, 26, 1262. [Google Scholar] [CrossRef]
  60. Abirami, M.; Srinivas, M.V.; Vasu, J.; Antony, P.X.; Thanislass, J.; Muthaiah, M.; Mukhopadhyay, H.K. Genotyping of Canine Distemper Virus Lineage in Clinically Infected Dogs in Puducherry, Southern India. Microbiol. Res. J. Int. 2020, 30, 17–30. [Google Scholar] [CrossRef]
  61. Alfano, F.; Lucibelli, M.G.; D’Alessio, N.; Auriemma, C.; Rea, S.; Sgroi, G.; Lucente, M.S.; Pellegrini, F.; Diakoudi, G.; De Carlo, E.; et al. Detection of canine distemper virus in wildlife in Italy (2022–2024). Front. Vet. Sci. 2025, 12, 1527550. [Google Scholar] [CrossRef]
  62. Lanszki, Z.; Toth, G.E.; Schütz, E.; Zeghbib, S.; Rusvai, M.; Jakab, F.; Kemenesi, G. Complete genomic sequencing of canine distemper virus with nanopore technology during an epizootic event. Sci. Rep. 2022, 12, 4116. [Google Scholar] [CrossRef] [PubMed]
  63. Wang, F.X.; Liao, P.; Wang, G.S.; Zhang, S.Q.; Liang, J.W.; Tian, F.L.; Zhao, J.J.; Cheng, S.P.; Wen, Y.J. Analysis of Amino Acid 530 and 549 of Hemagglutin Gene in Two Canine Distemper Virus Strains. Br. Microbiol. Res. J. 2015, 8, 367–378. [Google Scholar] [CrossRef]
  64. Sawatsky, B.; Cattaneo, R.; von Messling, V. Canine Distemper Virus Spread and Transmission to Naive Ferrets: Selective Pressure on Signaling Lymphocyte Activation Molecule-Dependent Entry. J. Virol. 2018, 92, 10-1128. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Phylogenetic tree of 68 representative canine distemper virus (CDV) H gene sequences, constructed using the Maximum Likelihood method. The panel on the right shows multiple sequence alignment of the translated H proteins, highlighting key amino acid positions involved in binding to SLAM and Nectin-4 host receptors. Major lineages and variations at these functional sites are visible.
Figure 1. Phylogenetic tree of 68 representative canine distemper virus (CDV) H gene sequences, constructed using the Maximum Likelihood method. The panel on the right shows multiple sequence alignment of the translated H proteins, highlighting key amino acid positions involved in binding to SLAM and Nectin-4 host receptors. Major lineages and variations at these functional sites are visible.
Viruses 17 01149 g001
Figure 2. Phylogenetic network of the 68 representative CDV H gene sequences, generated using the Neighbor-Net algorithm in SplitsTree. The network shows major clusters corresponding to the main lineages. The web-like reticulations in the center of the network indicate the presence of conflicting phylogenetic signals within the dataset, highlighting the complex evolutionary history of the H gene.
Figure 2. Phylogenetic network of the 68 representative CDV H gene sequences, generated using the Neighbor-Net algorithm in SplitsTree. The network shows major clusters corresponding to the main lineages. The web-like reticulations in the center of the network indicate the presence of conflicting phylogenetic signals within the dataset, highlighting the complex evolutionary history of the H gene.
Viruses 17 01149 g002
Figure 3. Structural mapping of positively selected sites on the canine distemper virus (CDV) hemagglutinin (H) protein. (A) A representative homology model (Arctic-like lineage, PQ611763.1) of the H protein monomer. All 15 sites under positive selection are shown as spheres. Key SLAM receptor-binding sites are highlighted in blue, and Nectin-4 sites are in orange. Positively selected sites with no currently known receptor-binding function are shown in red. (B) A superimposed, zoomed-in view of the key receptor-binding region from all 14 modeled lineages, demonstrating the high structural conservation of the protein backbone but variation in the side chains (shown in ball-and-stick representation) at these critical positions.
Figure 3. Structural mapping of positively selected sites on the canine distemper virus (CDV) hemagglutinin (H) protein. (A) A representative homology model (Arctic-like lineage, PQ611763.1) of the H protein monomer. All 15 sites under positive selection are shown as spheres. Key SLAM receptor-binding sites are highlighted in blue, and Nectin-4 sites are in orange. Positively selected sites with no currently known receptor-binding function are shown in red. (B) A superimposed, zoomed-in view of the key receptor-binding region from all 14 modeled lineages, demonstrating the high structural conservation of the protein backbone but variation in the side chains (shown in ball-and-stick representation) at these critical positions.
Viruses 17 01149 g003
Figure 4. Comparative B-cell epitope prediction for the CDV H protein. The plots show the predicted linear B-cell epitope scores (Y-axis) along the amino acid sequence position (X-axis) for representative sequences from (A) the vaccine-like America-1 lineage, (B) the Arctic-like lineage, and (C) the European lineage. Peaks above the dashed threshold line ( score > 0.15 ), which is the default cutoff recommended by the BepiPred-3.0 tool for identifying likely epitopes, indicate predicted antigenic regions. This epitope conservation analysis reveals both conserved and variable antigenic regions between the vaccine and field strains.
Figure 4. Comparative B-cell epitope prediction for the CDV H protein. The plots show the predicted linear B-cell epitope scores (Y-axis) along the amino acid sequence position (X-axis) for representative sequences from (A) the vaccine-like America-1 lineage, (B) the Arctic-like lineage, and (C) the European lineage. Peaks above the dashed threshold line ( score > 0.15 ), which is the default cutoff recommended by the BepiPred-3.0 tool for identifying likely epitopes, indicate predicted antigenic regions. This epitope conservation analysis reveals both conserved and variable antigenic regions between the vaccine and field strains.
Viruses 17 01149 g004
Table 1. Key amino acid residues in the canine distemper virus (CDV) hemagglutinin (H) protein involved in receptor binding.
Table 1. Key amino acid residues in the canine distemper virus (CDV) hemagglutinin (H) protein involved in receptor binding.
ResidueAssociatedReported Functional EffectKey
PositionReceptorReferences
478Nectin-4Critical for Nectin-4 binding and viral entry into epithelial cells.[18]
479Nectin-4Contributes to Nectin-4 interaction.[18]
519SLAMPart of the SLAM binding interface.[19]
530SLAMMajor determinant of host range and SLAM binding affinity.[19]
537Nectin-4Critical for Nectin-4 binding; substitutions can affect viral entry.[18,19]
539Nectin-4Part of the Nectin-4 binding interface.[18,19]
549SLAMKey determinant of host tropism and SLAM binding efficiency.[8,19,20]
Table 2. Amino acid sites under episodic positive selection in the CDV H gene, as identified by MEME analysis (p  0.1 ). Sites with known receptor-binding functions are highlighted in bold.
Table 2. Amino acid sites under episodic positive selection in the CDV H gene, as identified by MEME analysis (p  0.1 ). Sites with known receptor-binding functions are highlighted in bold.
Codon Sitep-Value (MEME)Known Function/Location
280.088-
700.068-
1690.054-
2800.024-
2980.088-
3000.015-
3950.054-
4510.088-
4600.054-
4780.054Nectin-4 Receptor Binding
5280.001SLAM Receptor Binding
530<0.001SLAM Receptor Binding
5370.015Nectin-4 Receptor Binding
5400.054SLAM Receptor Binding
549<0.001SLAM Receptor Binding
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Oğuzoğlu, T.Ç.; Koç, B.T. Global Phylogenetic Analysis of the CDV Hemagglutinin Gene Reveals Positive Selection on Key Receptor-Binding Sites. Viruses 2025, 17, 1149. https://doi.org/10.3390/v17091149

AMA Style

Oğuzoğlu TÇ, Koç BT. Global Phylogenetic Analysis of the CDV Hemagglutinin Gene Reveals Positive Selection on Key Receptor-Binding Sites. Viruses. 2025; 17(9):1149. https://doi.org/10.3390/v17091149

Chicago/Turabian Style

Oğuzoğlu, Tuba Çiğdem, and B. Taylan Koç. 2025. "Global Phylogenetic Analysis of the CDV Hemagglutinin Gene Reveals Positive Selection on Key Receptor-Binding Sites" Viruses 17, no. 9: 1149. https://doi.org/10.3390/v17091149

APA Style

Oğuzoğlu, T. Ç., & Koç, B. T. (2025). Global Phylogenetic Analysis of the CDV Hemagglutinin Gene Reveals Positive Selection on Key Receptor-Binding Sites. Viruses, 17(9), 1149. https://doi.org/10.3390/v17091149

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop