Next Article in Journal
Production of Novel Thermostable Esterases from Thermus thermophilus Strain ET-1 in Escherichia coli and Thermus thermophilus HB27 Using the Bifunctional Expression System pTGT-1 and Characterization of the Recombinant Enzymes
Previous Article in Journal
Challenges in Balancing Hemostasis and Thrombosis in Therapy Tailoring for Hemophilia: A Narrative Review
Previous Article in Special Issue
A Nasal Spray Combining Camostat with a Natural Polysaccharide for the Prevention of Viral Infection via Nasal Mucosal Barrier Formation and Entry Inhibition
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Insights into Genomic Dynamics and Plasticity in the Monkeypox Virus from the 2022 Outbreak

1
Department of Infectious-Tropical Diseases and Microbiology, IRCCS Sacro Cuore Don Calabria Hospital, Negrar di Valpolicella, 37024 Verona, Italy
2
Infections and Cystic Fibrosis Unit, Division of Immunology, Transplantation and Infectious Diseases, IRCCS San Raffaele Scientific Institute, 20123 Milano, Italy
3
Vita-Salute San Raffaele University, 20123 Milano, Italy
4
PhD National Programme in One Health Approaches to Infectious Diseases and Life Science Research, Department of Public Health, Experimental and Forensic Medicine, University of Pavia, 27100 Pavia, Italy
5
GM Lab, Department of Surgical Sciences, Dentistry, Gynaecology and Paediatrics, University of Verona, 37134 Verona, Italy
6
Division of Infectious Diseases, Department of Diagnostic and Public Health, University of Verona, 37129 Verona, Italy
7
Virology Unit, Department of Medical Biotechnologies, University of Siena, 53100 Siena, Italy
8
Department of Clinical and Experimental Sciences, University of Brescia, 25121 Brescia, Italy
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Mol. Sci. 2026, 27(3), 1371; https://doi.org/10.3390/ijms27031371
Submission received: 4 December 2025 / Revised: 20 January 2026 / Accepted: 21 January 2026 / Published: 29 January 2026
(This article belongs to the Special Issue Viral Biology: Infection and Pathology, Diagnosis and Treatment)

Abstract

The 2022 global mpox outbreak represented a turning point in the Monkeypox virus (MPXV) epidemiology, highlighting the incredible capability of this virus to adapt to different conditions, also in a non-endemic context. To investigate the genomic dynamics of MPXV 2022 strains, we performed whole-genome sequencing of 40 clinical samples from 16 Italian patients across multiple anatomical sites and timepoints between May and December 2022. Combining single-nucleotide analysis with detailed investigation of short tandem repeats (STRs), we explored inter- and intra-host viral dynamics. We identified 19 STR loci located near or within genes involved in immune modulation and virion morphogenesis. While most STRs remained stable across patients, a subset displayed locus- or matrix-specific variation. Among these, STR-VII—embedded within the coding sequence of OPG153, an envelope-associated protein implicated in viral attachment—showed a 12-nucleotide in-frame deletion, resulting in the loss of four aspartic acid residues (Δ4D variant). Structural modeling indicated that this deletion slightly alters a disordered acidic loop without affecting the global fold, potentially modulating surface charge and immune recognition. Integrating STR profiling into genomic surveillance may enhance resolution in outbreak reconstruction and reveal subtle adaptive processes underlying poxvirus–host interaction and immune escape.

1. Introduction

The 2022 global outbreak of mpox caused by the Monkeypox virus (MPXV) marked a turning point in the disease’s epidemiology. Previously considered a rare zoonosis, endemic to Central and West Africa, MPXV expanded rapidly into new geographies and transmission networks.
Over the last few years, MPXV has been described as the most significant species for humans among the Orthopoxvirus genus. Following variola virus (VARV) eradication, concern was raised that MPXV might fill the vacant epidemiological niche previously occupied by VARV [1,2,3]. Since 2017, the incidence of mpox in non-endemic areas had increased, and from May 2022 until August 2025, over 150,000 confirmed cases were reported across more than 137 countries, most of which had no previous history of MPXV transmission [4,5], driven predominantly by human-to-human spread within networks of close or sexual contact [3,6].
The MPXV genome is a double-stranded DNA molecule, featuring two identical oppositely oriented regions of  ~6400 bp in length, known as inverted terminal repeats (ITRs) [6,7]. These genomic regions contain genes assumed to be involved in immune evasion [8,9]; conversely, genes essential for viral replication are located in the highly conserved core region [6,7].
Phylogenetically, MPXV comprises two major clades [10]: clade I (with its subclades Ia and the new Ib, which emerged in 2023 in the South Kivu province), historically restricted to animal populations in Central Africa; and clade II (with its subclades IIa and IIb), restricted to West Africa [11,12]. On 14 August 2024, a public health emergency of international scale was declared due to the increase in mpox cases caused by clade I in the Democratic Republic of Congo and its spread to neighboring countries [5]. Clade Ia was primarily associated with zoonotic transmission from animal reservoirs, but emerging evidence showed that human-to-human transmission has been sustained. In contrast, epidemiological and sequencing information showed that clade Ib MPXV was predominantly associated with human-to-human transmission [5].
The 2022 outbreak was attributed to clade IIb, lineage B.1 [13], and genomic surveillance revealed a surprisingly high number of mutations, including over 40 single-nucleotide substitutions compared to the closest 2018–2019 clade IIb strains. Many of these were APOBEC3-mediated (GA > AA and TC > TT), suggesting ongoing adaptive evolution within human hosts [6].
In the terminal regions of poxviruses’ genomes, gene gain and loss are key drivers of their evolution and adaptation to the hosts [14,15]. Studies have shown that deletions were linked to increased human-to-human transmission [16], and gene copy number variation has been identified as a factor able to modulate viral fitness [17]. Moreover, characterized MPXV genomes from the 2022 outbreak have chimeric sequence architectures (mosaicisms), copy-number variation in tandem repeats, and significant linkage disequilibrium among SNPs, findings that strongly suggest that natural recombination acts alongside mutation and selection in shaping MPXV genomic diversity and its capacity for adaptation to new hosts [18].
Viral genomes often contain low-complexity regions (LCRs), defined as stretches of biased or repetitive sequence with limited nucleotide diversity. These regions frequently harbor short tandem repeats (STRs), i.e., short nucleotide motifs repeated in direct succession, whose copy number can vary among genomes. STRs have been described across a broad range of viruses [19,20,21,22,23], where they have been linked to genome plasticity, regulation of gene expression, host adaptation, and even differences in virulence or immune evasion. Despite growing evidence of tandem-repeat variability, STRs have not yet been fully characterized in MPXV. STR enrichment in low-complexity domains of MPXV suggests they could likewise contribute to viral diversity via mechanisms such as modulation of expression, structural genome plasticity, or genome length variation. A recent study showed that STRs are widespread and highly variable across strains, notably concentrated in the ITRs, with copy-number differences even among closely related isolates [6]. Such features suggest STRs may function as evolutionary “tuning elements,” fostering phenotypic variability in surface proteins and immune modulators with potential consequences for viral fitness, transmission, and immune escape [6].
In this study, we investigated the genomic dynamics of single-nucleotide polymorphisms (SNPs), STRs, and low-complexity regions in MPXV genomes collected during the 2022 outbreak in Italy. We applied a longitudinal design, analyzing clinical specimens collected at multiple timepoints, from diagnosis through follow-up, across different biological matrices.

2. Results

2.1. QC of Sequencing Data, Read Alignment, and Hybrid Assembly

Sequencing yielded an average of 2’029’939 Illumina and 273’023 ONT reads. Illumina sequencing generated between 101,826 and 9,409,984 total reads per sample, while ONT sequencing generated between 942 and 1’398’497 total reads per sample. Sequencing metrics, QC, and coverage against the RefSeq NC_063383.1 are reported in Supplementary Materials, Table S1. To better recapitulate the number of repeated sequences and to increase the accuracy of the STR analysis, hybrid genome assemblies were built for each sample by combining short and long reads. The hybrid assemblies generated on average 4 contigs belonging to MPXV, covering 97% of the NC_063383.1 reference sequence.

2.1.1. Single Nucleotide Polymorphisms

To explore patterns of viral mutation across samples, we conducted two complementary analyses: (i) exploration of novel nucleotide substitutions across patients, aimed at identifying potential convergences; and (ii) an intra-patient comparison of distinct biological matrices, to assess intra-host variability. Notably, P6, P10, P11, and P12 shared a pattern of five novel mutations (C101983T, G28334A, C162816T, C170573T, and C72215T), suggesting close phylogenetic relatedness since all samples belong to the B.1.7 lineage. Similarly, patients P3 and P4 shared three unique substitutions (G37152A, C184829T, and C152445T). We observed intra-host variability in the number of novel SNPs across different sample types, despite all matrices belonging to the same individual and timepoint. In P3, the skin lesion showed a higher number of novel mutations, followed by the pharyngeal swab and the saliva, which exhibited a reduced SNP profile. Specifically, mutations such as G37152A were consistently detected across all matrices, with the genital lesion carrying the G150394A mutation univocally in the pharyngeal sample. Similarly, in P4, while most matrices carried the full set of novel SNPs, the abdominal skin lesion additionally carried C1092T, G171341T, and G171341T. Figure 1 shows the localization of the identified SNPs along the MPXV genome. Detailed results are reported in Supplementary Materials, Table S2.

2.1.2. Characterization of STRs in MPXV Genomes

We systematically analyzed STRs using Tandem Repeat Finder. A total of 19 STRs (STR-I to STR-XIX) were identified (Table 1). The 19 STR regions were detected across all samples, and varied in length from 9 bp to 144 bp, with repeat units ranging from mononucleotide stretches (e.g., STR-I, STR-III, STR-IV, STR-V, STR-VI, STR-XI, and STR-XIX) to more composite motifs, such as STR-IX and STR-X. The number of repeats spanned from 3.7 (STR-XIII) to 25 (STR-VI), with different levels of conservation across samples. Interestingly, STR-I and STR-XIX, as well as STR-II and STR-XVIII, were located in the ITRs, where they were present as identical copies in reverse-complementary form. Nine STRs were located in proximity to annotated MPXV genes (OPGs), either upstream, downstream, or within ORFs. Most intragenic STRs (e.g., STR-III, -IV, -V, -VIII, -XIII, -XIV, and -XVI) were highly conserved across patients and timepoints.
By contrast, STR-X, although stable in copy number, displayed changes in repeat architecture (e.g., 2a + 3b to 2b + a + 3b configurations), highlighting a high structural plasticity at these loci. Non-intragenic loci, such as STR-II, STR-XV, STR-XVII, and STR-XIX, were the most variable, showing larger fluctuations in repeat copy number. The identified STRs and their corresponding repeat numbers are displayed in Figure 1. A comprehensive overview of the STR loci is provided in Figure 2; the loci displaying simple repeat motifs are reported in Figure 2a, while the loci in which we detected rearrangements in the architecture are presented in Figure 2b.
Intra-Host STRs Variation Across Matrices and Timepoints
We investigated the distribution of the 19 identified STRs across the 40 sequenced matrices. Overall, STR profiles were highly conserved across patients and sample types. A large number of STR loci, including STR-III, STR-IV, STR-V, STR-VIII, STR-XII, STR-XIII, STR-XIV, and STR-XVI, were unaltered across all samples, and STR-VI showed only minor fluctuations (23–25 repeats) in a subset of matrices. Some other STR loci presented a more variable pattern across samples. For instance, P3 exhibited copy-number variation without locus loss, with a marked reduction in STR-XV in skin lesions (14, 12, and 9 repeats vs. the prevalent 22) and a slight decrease in STR-II in the saliva (8 vs. 9 in other matrices). Notably, STR-XIX was absent in the two skin lesion samples (P3_SL_2_I and P3_SL_I), whereas it was present in saliva and pharyngeal swab.
P4 retained all STR loci across matrices, but consistently displayed lower repeat counts at specific sites compared with the predominant pattern in other patients: STR-I = 13 (vs. 14), STR-IX = 9 (vs. 10) in all P4 matrices, and STR-XIX = 13 (vs. 14). Additional matrix-specific differences included STR-VII = 15 in P4_SL_I (vs. 19 elsewhere) and STR-XV = 21 in P4_SL_2_I (vs. 22 in the other P4 samples).
With the exception of P3 and P4, only sporadic changes were observed: P7_SLG_I showed STR-X = 4 (vs. 5 in all other samples), P10_SL_I had STR-XV = 19, P4_SL_2_I and P13_SL_I had STR-XV = 21, P2_SLG_I showed STR-XI = 9 (vs. 7 elsewhere), and P5_R_I had STR-II = 7. Several loci also exhibited modest dispersion across patients/timepoints, notably STR-XVII (range 5.6–16) and STR-XVIII (range 7–9).
For the composite STRs (i.e., STR-IX and STR-X), beyond the number of repeated sequences changes, we notice that they segregated into two different block configurations. STR-IX is composed by [ATATTTT]n + [ATTTT]n; we defined the block [ATATTTT] as ‘block a, [a]’ and [ATTTT] as ‘block b, [b]’. The majority of analyzed samples accounted for a total of 10 repeats, composed as 7[a] + 3[b]. In all P4 matrices, we identified a total of 9 repeats, coherently with alternative architectures, but with the loss of one block a, resulting in 6[a] + 3[b] (Figure 2b).
Meanwhile, STR-X, defined as [GATATGATGGATATGAT]n (block a) + [GGATATGAT]n (block b), at a first glance, seemed stably conserved across samples, except in one case, which lost one repeat (P7_SLG_I). In 38 out of 40 analyzed samples, the architecture of the STR was 2[a] + 3[b], with a 2[a] + 2[b] variant in P7_SLG_I, resulting in a total of 4 repeats. Interestingly, in P5_SL_I we noticed a motif rearrangement, leading to 2[b]+ [a] +3[b] (Figure 2b). Beyond STR-IX and STR-X, sporadic variability was observed in STR-XV, STR-XVII, and STR-XIX, suggesting that most STR changes are patient- or matrix-specific rather than lineage-associated. Across longitudinal samples from the same patient, STR differences were observed in a limited number of loci, while the majority of STRs showed stable repeat copy numbers.
STRs and In Silico Protein Implications
In total, 9 out of the 19 STRs identified were located within annotated genes or partially overlapping the corresponding coding sequences; STR-X and -XIII were positioned in intergenic regions, outside the coding sequence. Analyzing the potential effects of STR variation on the coding sequence, no changes were observed in STR-III, -IV, -V, -XIV, and -XVI when compared with the annotated protein (Supplementary Materials, Figure S1). STR-X represents one of the most intriguing regions. Although located immediately upstream of the OPG176 coding sequence, its repeat block composition (e.g., 2a + 3b vs. 2b + a + 3b) showed rearrangements across samples (P5_SL_I and P7_SLG_I), despite overall conservation in repeat number. This structural variability may not directly alter the encoded protein sequence but could impact transcriptional or translational regulation. In contrast, STR-XIII, located within OPG190 (encoding the secreted interferon α/β decoy receptor B19R), remained fully conserved across all samples, consistent with strong functional constraints on this essential immunomodulatory gene [8].
Conversely, the skin lesion sample from P4 at timepoint I (P4_SL_I) showed a lower copy number of STR-VII, which is located within the coding sequence of OPG153. Specifically, we identified a 12-nucleotide in-frame deletion that resulted in the loss of four aspartic acid residues in a poly-Asp tract. The OPG153 gene (GeneID: 72551547) spans 1530 bp and encodes for the envelope protein OPG153 (Uniprot ID: A0A7H0DNC4), a surface-associated viral factor implicated in attachment and fusion suppression [9,25,26]. The STR-VII motif corresponds to a low-complexity, negatively charged region described in UniProt as a compositional bias domain. To evaluate potential structural consequences of the Δ4D deletion, both the wild-type and mutant protein sequences were modeled ab initio using AlphaFold2. Structural alignment of the two models revealed a high overlap, with the deletion confined to a disordered loop within the poly-Asp tract (Figure 3). The shortened acidic loop of the Δ4D variant displayed a slightly more compact conformation, while the global fold remained unchanged. These observations suggest that the deletion likely alters local electrostatic properties and loop flexibility.

2.1.3. Phylogenetic Analysis and STR-Based Clustering

Phylogenetic reconstruction based on whole-genome sequences clustered all Italian samples within clade IIb, in agreement with global surveillance data from the 2022–2023 outbreak. Of the total 40 sequences, 27 clustered into lineage B.1, 11 clustered into sublineage B.1.7, 1 into B.1.5, and 1 into B.1.15 (Figure 4. The tree revealed short branch lengths, indicating limited nucleotide divergence among circulating strains. Minor intra-lineage clustering patterns were detected, notably among P6, P10, P11, and P12, all belonging to sublineage B.1.7, consistent with their shared SNP profiles. P3 and P4 formed a divergent branch supported by private substitutions, in line with the SNP-based variant analysis. To complement the SNP-based phylogeny, we constructed a hierarchical clustering using the STR copy number matrix (Figure 5). The STR dendrogram showed an overall structure consistent with the SNP-based topology, supporting a high degree of genomic conservation across patients. However, subtle rearrangements in clustering order were observed. For instance, P3 and P4, which cluster together in the ML phylogeny, appeared more distant in the STR-based dendrogram due to variation in STR-VII (OPG153), STR-IX, and STR-X, while P6, P10, P11, and P12 remained closely grouped, mirroring their shared STR configuration. Notably, the dendrogram did not reveal segregation by matrix type or sampling timepoint. Overall, STR-based clustering partially recapitulates SNP-based phylogeny but provides additional resolution at the intra-lineage level. The discordance between the two approaches suggests that tandem-repeat variation evolves independently of point mutations, potentially reflecting rapid, reversible changes driven by polymerase slippage or recombination. These findings support the hypothesis that STRs can act as fast-evolving molecular markers, complementing SNP analyses for high-resolution tracking of MPXV genetic diversity.

3. Discussion

In this study, we provide a comprehensive view of the genomic plasticity of MPXV in a cohort of 16 Italian patients during the 2022 outbreak, by integrating SNP-based and STR analyses, and evaluating intra-host variability across timepoints and different anatomical matrices. Our work combines hybrid assemblies reconstruction, with repeat profiling and mutational analysis, revealing interesting features potentially involved in MPXV evolution and adaptation.
The SNP-based phylogenetic reconstruction confirmed the co-circulation of multiple B.1-derived sublineages, including B.1.7, B.1.5, and B.1.15, consistent with broader genomic surveillance [6,27]. However, our novel mutation analysis unveiled unexpected convergences between distinct patients. For instance, five novel mutations (C101983T, G28334A, C162816T, C170573T, and C72215T) were shared among four unrelated patients (P10, P11, P12, and P6), all belonging to lineage B.1.7. Similarly, the shared substitutions between P3 and P4 (G37152A, C184829T, and C152445T) could strengthen the hypothesis of co-circulating, under-sampled variants [28]. Functional annotation of these novel variants indicated that several mapped into or near genes potentially involved in viral fitness or host interaction. For example, OPG002 encodes CrmB, a TNF-binding protein that inhibits inflammatory responses [29]; OPG105 contains ankyrin repeats often associated with inhibition of NF-κB signaling [8]. Some of these SNPs are not yet reported in GISAID or GenBank lineage-defining mutation databases [30], suggesting ongoing intra-host diversification. These subtle differences highlight the importance of sampling multiple anatomical sites to capture the full scope of intra-host viral diversity [28] and capture long-term evolutionary processes, providing a stable phylogenetic reference. While SNPs provide a stable framework for lineage assignment, they do not fully capture the intra-host variability.
Importantly, STR profiling revealed a set of 19 conserved STRs (STR-I to STR-XIX), with high stability across most patients and matrices. STRs in genes involved in host immune interactions (e.g., OPG015, OPG204) showed minimal variability, supporting their potential function as key structural regulators [31].
When stratifying the STR profiles by timepoint (I, II, III), no systematic temporal variability was observed. The distribution of repeated sequences copy numbers remained stable across all timepoints, with the few observed differences attributable to inter-sample rather than longitudinal variation. These alterations did not correlate with coverage issues and were independent of lineage, pointing to intra-host viral dynamics. Our results indicate that intragenic STRs are overall under evolutionary constraint, remaining stable in repeat copy number. It should be acknowledged that, as not all anatomical matrices and timepoints were available for each patient, the heterogeneous longitudinal sampling may have potentially limited the detection of subtle temporal trends.
Nevertheless, STR-IX and STR-X represent notable exceptions, as they exhibit structural reorganization of repeat motifs despite numerical stability. In contrast, variability is concentrated in non-intragenic loci (e.g., STR-XV, STR-XVII, and STR-XIX), which may accumulate changes more freely and contribute to inter-patient differences, in accordance with previous findings on repeat instability at specific MPXV genome sites [32]. These results support the view that STRs may act as “evolutionary tuning knobs,” modulating viral gene expression or protein structure in response to host pressures [16,32]. The association of STRs with genes, such as OPG001 (a chemokine-binding protein) [32], OPG044 (a Bcl-2–like immune modulator) [33,34], OPG104 (an entry/fusion protein) [25,35], and OPG180 (a replication-associated protein subject to APOBEC3-like pressure) [36], suggests that STR instability may intersect with functionally important loci, potentially influencing immune evasion, viral entry efficiency, and replication fidelity. The conservation of most intragenic STRs likely reflects strong functional constraints, as repeat variation within coding regions may impact protein length, domain organization, or local folding. In this context, the architectural changes we observed in STR-IX and STR-X are of particular interest, since they may influence local protein conformation despite numerical stability in repeat copy number.
A particularly interesting case was represented by STR-VII, located within the coding region of OPG153. The skin lesion sample from P4 at timepoint I (P4_SL_I) showed a 12-nucleotide in-frame deletion corresponding to the loss of four aspartic acid residues (Δ4D) in a polyaspartic acid tract. The affected motif is located in a low-complexity, acidic region annotated in UniProt as a compositional bias domain. AlphaFold2-based structural modeling revealed that both wild-type and Δ4D proteins share a highly similar global fold, with the deletion confined to a flexible loop within the poly-Asp region (Figure 3), exhibiting a shorter and more compact loop. Though the functional implications cannot be assessed in this study, these results provide a framework for future investigations aimed at linking repeat architecture to protein structure and viral fitness. A recent work [37] identified broadly neutralizing antibodies targeting OPG153, highlighting its central role in poxvirus immune recognition. While our analyses did not reveal SNP-based variability in this gene, the structural variation observed in its STR region may influence antigenicity. These findings, although derived from in silico modeling and remain hypothesis-generating, further emphasize the importance of the integration of structural and immunological perspectives in future investigations.
The loci, OPG176 and OPG190, exemplify opposite selective regimes acting on STRs. For OPG176, STR-X was located immediately upstream of the coding sequence and maintained a stable overall repeat number, but displayed block-level rearrangements (e.g., 2a + 3b vs. 2b + a + 3b). This architecture plasticity within a non-coding context is consistent with a potential regulatory role, possibly modulating mRNA secondary structure, ribosome accessibility, or translation initiation efficiency (Figure 1). Given that OPG176 encodes a putative component of the entry/fusion complex involved in membrane fusion and virion morphogenesis, even subtle alterations in its expression timing or abundance could influence virion assembly or cell tropism. [25,26] The conservation of overall repeat length despite changes in motif organization suggests potential effects operating at the transcriptional level.
In contrast, STR-XVI, located within OPG190 (encoding the secreted interferon α/β decoy receptor B19R) [8], remained completely conserved across all samples. Maintaining the structural and quantitative integrity of B19R is likely critical for effective antagonism of host interferon responses, leaving little room for repeat drift or rearrangement. OPG176 and OPG190 represent two extremes of STR functional behavior: one permissive and regulatory, the other highly constrained and structurally essential. Taken together, our results suggest that intra-host MPXV genomic plasticity is present in a limited number of loci.
In the context of MPXV genome analysis, it is important to consider the intrinsic complexity of MPXV infections during the 2022 outbreak, during which the presence of different viral subpopulations has been increasingly reported [25,26]. These observations suggest that a single infection may not always be represented by a unique and uniform viral population, particularly when multiple samples are collected from the same individual over time or across different anatomical sites. Within this framework, the differences observed across longitudinal and multi-site samples in our study can be interpreted as part of a broader landscape of genomic variability, which may influence the stability and interpretability of the observed data.
Together, these findings suggest the interplay between SNP mutations and STR variability in shaping MPXV genomic structure and driving its evolution. While SNPs reflect cumulative mutation processes and lineage divergence, STRs appear more sensitive to intra-host forces, capturing short-term dynamics and/or anatomical compartmentalization. Most STR loci and SNP profiles were indistinguishable among patients sampled during the summer period (May–August), consistent with close epidemiological linkage. In contrast, patients sampled later in the year (P3, P4, September–December) carried additional private SNPs and showed slightly more heterogeneity at selected STR loci. This pattern may reflect temporal divergence or independent introductions, although further analyses would be required to confirm this hypothesis. The perspective of a combination of SNP and STR variability reveals that, even in a DNA virus with relatively low mutation rates, MPXV could exhibit notable genomic flexibility [38].
Monitoring both SNPs and STRs may improve molecular epidemiology efforts from a public health standpoint. STR configurations could be key complementary markers for phenotypic traits definition, such as immune escape. The current results show, for the first time, the presence of matrix- and patient-dependent SNP and STR patterns within single individuals, supporting a scenario of intra-host viral microevolution. Such a phenomenon, while typically associated with chronic RNA virus infections, may also occur in Orthopoxvirus infections and contribute to phenotypic or transmission potential variability [6,39,40].
In conclusion, our longitudinal, multi-matrix study offers an interesting glimpse into the intra-host evolution of MPXV. However, late autumn cases reveal that the viral genome can undergo localized changes, potentially influenced by tissue-specific pressures or immune compartmentalization. These include not only reductions in repeat copy number but also alternative motif architectures in composite STRs (e.g., STR-IX and STR-X). These dynamic features need further investigation in larger cohorts and may have implications for transmission fitness and pathogenesis. These findings pave the way for future studies aimed at integrating genomic observations, such as matrix-specific STR patterns and novel SNP profiles, with detailed clinical data, including disease severity, symptom localization, and immune response. This integrative approach may help clarify whether intra-host genomic variability correlates with phenotypic manifestations and tissue tropism, ultimately improving our understanding of MPXV pathogenesis and transmission dynamics.

4. Materials and Methods

Clinical specimens were collected from patients presenting with skin lesions and clinical manifestations suggestive of mpox infection. Samples were initially submitted for diagnostic purposes to the regional referral laboratory at IRCCS Sacro Cuore Don Calabria Hospital, Negrar di Valpolicella (Verona, Veneto Region, Italy).
A total of 16 patients were confirmed to have mpox by qPCR (Monkeypox Virus Real-Time PCR Kit, BioPerfectus, Taizhou City, China), with cycle threshold (Ct) values ≤ 28. Samples yielding these Ct values were selected for full-genome sequencing.
Following diagnosis, all 16 confirmed patients were enrolled in a longitudinal follow-up protocol. Weekly samples were collected from multiple anatomical sites, including skin lesions, saliva, urine, and pharyngeal swabs, starting from the time of diagnosis and continuing until viral clearance, defined by two consecutive negative qPCR results. A total of 4 patients completed two follow-up visits, and 1 patient completed three visits.
In total, 40 clinical samples from 16 patients were included in the sequencing analysis. All patients were male and received a diagnosis of mpox between May and November2022. Patients’ demographic characteristics and the types of collected specimens are summarized in Table 2.
All samples analyzed in this study were derived from routine clinical diagnostics and were retrospectively included. Although additional follow-up specimens were available for some patients, only samples meeting predefined qPCR Ct thresholds were suitable for whole-genome sequencing and were therefore included. As a consequence, the number of longitudinal and multi-site samples varied across patients.

4.1. DNA Extraction, Library Preparation, and Sequencing

Viral DNA was extracted from 200 µL of each matrix and eluted in 60 µL using an EZ1 Advanced XL instrument and EZ1 DSP Virus Kit (Qiagen, Hilden, Germany) according to the manufacturer’s instructions. DNA concentration was determined using the Qubit dsDNA HS assay kit (Thermofisher, Waltham, MA, USA). For short-read sequencing, a target enrichment based on hybrid capture was carried out using the Illumina RNA prep with an enrichment kit and the VSP panel v1 (Illumina, San Diego, CA, USA). The quality of libraries was assessed using the genomic ScreenTape kit and the 4200 TapeStation System (Agilent, Santa Clara, CA, USA). Illumina libraries were prepared according to the manufacturer’s instructions and run on a NextSeq1000 using the P1 300-cycle Illumina flow cell to perform 2 × 150 paired-end sequencing. For long-read Oxford Nanopore Technology (ONT) sequencing, enrichment was performed by generating 2,5 kbp-amplicons24. Shortly, two separate reactions were set up per sample as follows: 6.88 μL nuclease-free water, 0.625 μL primers pool 1 or 2 (100 μM), 12.5 μL Q5 High-Fidelity 2X master mix (New England Biolabs, Ipswich, MA, USA), and 5 μL of sample DNA. The amplification reaction was set up with the following parameters: 98 °C for 1 min, then 35 cycles of 98 °C for 10 s, 65 °C for 30 s, 69 °C for 100 s, and a final extension at 72 °C for 2 min. Separate reactions were then pooled for each sample, and 200 fmol of DNA was used to prepare sequencing libraries with the Native Barcoding Sequencing Kit (SQK-NBD114.24 Oxford Nanopore Technologies, Oxford, UK) as per the manufacturer’s instructions. ONT libraries were run on a MinION Mk1d instrument using FLO-MIN114 (Oxford Nanopore Technologies, Oxford, UK) flow cells to a target of 100× coverage per sample.

4.2. Bioinformatic Analysis

Read QC was performed with fastp v0.23.4 and fastqc v0.12.1. Human reads were removed using Kraken2 v2.1.3 [41]. Reads were aligned to the MPXV reference genome (NC_063383.1) using BWA-MEM v2.2.1 [42]; variants were called with bcftools v1.20 [43], filtering for Q > 30, mapQ > 50, DP > 20.

4.2.1. Hybrid Assembly

In order to explore the STR variability, hybrid assemblies were generated using Unicycler v0.5.1 [44], combining Illumina and ONT reads, oriented with MUMmer/Nucmer v4.0.1 [45], and finally, the oriented contigs were assembled into a single scaffold using a homemade script. This script concatenates the aligned contigs while accounting for gaps within and between them, allowing for their accurate identification and distinction. The correspondence between the experimental ID (reported in this manuscript) and the GISAID IDs (EPI_IDs) is provided in Supplementary Materials, Table S3.

4.2.2. STR Analysis

Short tandem repeats were identified using Tandem Repeat Finder v.4.09 [46] (parameters “2 7 7 80 10 50 2000 -ngs”), retaining loci with entropy > 1.8, match ≥ 80%, and non-overlapping regions. Partial repeats were also considered. Tandem Repeat Finder detects repeated patterns without requiring prior specification, but it does not detect single-nucleotide repeats (i.e., homopolymers). A custom R script using a cut-off of at least 9 nucleotides was systematically identified to detect the homopolymer repeats. Then, the identified STRs were visually inspected in the raw reads.
All loci were manually curated.

4.2.3. Global Dataset, Phylogeny, and STR-Based Clustering Analysis

Complete (>196 kb) MPXV genomes (n = 2108, as of 21 August 2025) were downloaded from GISAID EpiPox [30], excluding low-coverage sequences (>5% Ns).
To investigate the phylogenetic evolution, a Maximum Likelihood (ML) phylogenetic tree was reconstructed using a dataset including our samples and the complete sequences available from the GISAID repository, for a total of 2148 records. Multiple alignment was performed with MAFFT v7.523 [47], and ML phylogeny was inferred using IQ-TREE v2.2.2.7 [48] and visualized in iTOL v6.9.1 [49]. To investigate STR similarities between samples, we performed a hierarchical clustering using the number of STRs belonging to each sample matrix (Euclidean distance, R packages ape v.5.8.1, dendextend v.1.19.1, and ggtree v.3.16.3) [50,51,52].

4.2.4. Protein Translation and In Silico Structural Prediction

Genomic sequences containing STRs located within protein-coding regions were translated into amino acid sequences using the EMBOSS v.6.6.0.0 [53] executed from the command line. Translation was performed according to the standard genetic code, preserving the annotated reading frame and strand orientation. The resulting protein was compared to its corresponding wild-type reference, derived from UniProt [54] annotations of the same gene. When sequence alterations were detected relative to the reference, the affected open reading frames were subjected to three-dimensional structure prediction using AlphaFold2 [55] (ab initio modeling). In particular, the OPG153 gene (GeneID: 72551547) and its Δ4D variant were modeled independently. Structural visualization and overlay were carried out using Chimera tool v.1.19 [56].

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms27031371/s1.

Author Contributions

M.D., C.C. and C.P. conceived the study. M.D. designed the study. M.D. and S.M. performed the experiments. A.M. performing the molecular assays. S.A. and M.G.C. were responsible for sample collection. A.A., N.R., F.G.G. and E.T. were responsible for patients’ enrollment and clinical evaluation. M.D., E.L., S.M., L.V., G.M. and D.L. performed formal data analyses. All authors interpreted and analyzed data. M.D. and C.P. drafted the manuscript. All authors revised the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by EU funding within the MUR PNRR Extended Partnership initiative on Emerging Infectious Diseases (project no. PE00000007, INF-ACT). Michela Deiana was a recipient of an Early-Career Award (ECA-INF-ACT 2024). The work was also supported by the Italian Ministry of Health “Fondi Ricerca Corrente—L1P3” to IRCCS Sacro Cuore Don Calabria Hospital.

Institutional Review Board Statement

All patients signed an informed consent form. The study received ethical clearance from the Ethical Committee of Verona and Rovigo provinces (Prot. n. 6232, 30 January 2023). Samples used in this study were stored at −80 °C in the “Tropica Biobank—(bbmri-eric:ID:IT_1605519998080235)” of the IRCCS Sacro Cuore Don Calabria Hospital until use.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The sequences have been deposited in the GISAID repository. The correspondence between the experimental ID (reported in this manuscript) and the GISAID IDs (EPI_IDs) is provided in Supplementary Materials, Table S3. The raw data supporting the conclusions of this article will be made available by the authors on request.

Acknowledgments

We gratefully acknowledge all data contributors, i.e., the authors and their originating laboratories responsible for obtaining the specimens, and their submitting laboratories for generating the genetic sequence and metadata and sharing via the GISAID Initiative, on which this research is based.

Conflicts of Interest

The authors declare that the research was conducted in the absence of any commercial or financial conflicts of interest.

References

  1. Reynolds, M.G.; Carroll, D.S.; Karem, K.L. Factors Affecting the Likelihood of Monkeypox’s Emergence and Spread in the Post-Smallpox Era. Curr. Opin. Virol. 2012, 2, 335–343. [Google Scholar] [CrossRef] [PubMed]
  2. Lloyd-Smith, J.O. Vacated Niches, Competitive Release and the Community Ecology of Pathogen Eradication. Philos. Trans. R. Soc. B Biol. Sci. 2013, 24, 20120150. [Google Scholar] [CrossRef] [PubMed]
  3. World Health Organization Global Commission for the Certification of Smallpox Eradication. The Achievement of Global Eradication of Smallpox: Final Report of the Global Commission for the Certification of Smallpox Eradication, Geneva, December 1979; WHO: Geneva, Switzerland, 1979. [Google Scholar]
  4. Naga, N.G.; Nawar, E.A.; Mobarak, A.A.; Faramawy, A.G.; Al-Kordy, H.M.H. Monkeypox: A Re-Emergent Virus with Global Health Implications—A Comprehensive Review. Trop. Dis. Travel. Med. Vaccines 2025, 11, 2. [Google Scholar] [CrossRef] [PubMed]
  5. 2022-25 Mpox Outbreak: Global Trends. Available online: https://Worldhealthorg.Shinyapps.Io/Mpx_global/ (accessed on 1 September 2025).
  6. Isidro, J.; Borges, V.; Pinto, M.; Sobral, D.; Santos, J.D.; Nunes, A.; Mixão, V.; Ferreira, R.; Santos, D.; Duarte, S.; et al. Phylogenomic Characterization and Signs of Microevolution in the 2022 Multi-Country Outbreak of Monkeypox Virus. Nat. Med. 2022, 28, 1569–1572. [Google Scholar] [CrossRef]
  7. Shchelkunov, S.N.; Totmenin, A.V.; Safronov, P.F.; Mikheev, M.V.; Gutorov, V.V.; Ryazankina, O.I.; Petrov, N.A.; Babkin, I.V.; Uvarova, E.A.; Sandakhchiev, L.S.; et al. Moss B Analysis of the Monkeypox Virus Genome. Virology 2002, 297, 172–194. [Google Scholar] [CrossRef]
  8. Alcami, A.; Koszinowski, U.H. Viral Mechanisms of Immune Evasion. Trends Microbiol. 2000, 8, 410–418. [Google Scholar] [CrossRef]
  9. Moss, B.; Shisler, J.L. Immunology 101 at Poxvirus U: Immune Evasion Genes. Semin. Immunol. 2001, 13, 59–66. [Google Scholar] [CrossRef]
  10. Likos, A.M.; Sammons, S.A.; Olson, V.A.; Frace, A.M.; Li, Y.; Olsen-Rasmussen, M.; Davidson, W.; Galloway, R.; Khristova, M.L.; Reynolds, M.G.; et al. A Tale of Two Clades: Monkeypox Viruses. J. Gen. Virol. 2005, 86, 2661–2672. [Google Scholar] [CrossRef]
  11. Doty, J.B.; Malekani, J.M.; Kalemba, L.S.N.; Stanley, W.T.; Monroe, B.P.; Nakazawa, Y.U.; Mauldin, M.R.; Bakambana, T.L.; Tobit Liyandja, L.D.; Braden, Z.H.; et al. Assessing Monkeypox Virus Prevalence in Small Mammals at the Human–Animal Interface in the Democratic Republic of the Congo. Viruses 2017, 9, 283. [Google Scholar] [CrossRef]
  12. Nakazawa, Y.; Mauldin, M.R.; Emerson, G.L.; Reynolds, M.G.; Lash, R.R.; Gao, J.; Zhao, H.; Li, Y.; Muyembe, J.-J.; Kingebeni, P.M.; et al. A Phylogeographic Investigation of African Monkeypox. Viruses 2015, 7, 2168–2184. [Google Scholar] [CrossRef]
  13. WHO Monkeypox Fact Sheet. Available online: https://www.who.int/news-room/fact-sheets/detail/monkeypox (accessed on 1 September 2025).
  14. Esteban, D.J.; Hutchinson, A.P. Genes in the Terminal Regions of Orthopoxvirus Genomes Experience Adaptive Molecular Evolution. BMC Genom. 2011, 12, 261. [Google Scholar] [CrossRef]
  15. Hughes, A.L.; Friedman, R. Poxvirus Genome Evolution by Gene Gain and Loss. Mol. Phylogenetics Evol. 2005, 35, 186–195. [Google Scholar] [CrossRef]
  16. Kugelman, J.R.; Johnston, S.C.; Mulembakani, P.M.; Kisalu, N.; Lee, M.S.; Koroleva, G.; McCarthy, S.E.; Gestole, M.C.; Wolfe, N.D.; Fair, J.N.; et al. Genomic Variability and Tandem Repeats in Variola Virus and Monkeypox Virus. Emerg. Infect. Dis. 2014, 20, 232–239. [Google Scholar]
  17. Elde, N.C.; Child, S.J.; Eickbush, M.T.; Kitzman, J.O.; Rogers, K.S.; Shendure, J.; Geballe, A.P.; Malik, H.S. Poxviruses Deploy Genomic Accordions to Adapt Rapidly against Host Antiviral Defenses. Cell 2012, 150, 831–841. [Google Scholar] [CrossRef]
  18. Yeh, T.Y.; Hsieh, Z.Y.; Feehley, M.C.; Feehley, P.J.; Contreras, G.P.; Su, Y.C.; Hsieh, S.-L.; Lewis, D.A. Recombination Shapes the 2022 Monkeypox (Mpox) Outbreak. Med 2022, 3, 824–826. [Google Scholar] [CrossRef] [PubMed]
  19. Kumari, N.; Yadav, V.K.; Singh, H. Identification and Analysis of Microsatellites in Coronaviridae. Virus Res. 2024, 336, 199328. [Google Scholar] [CrossRef]
  20. Siddiqe, R.; Ghosh, A. Genome-wide in silico identification and characterization of Simple Sequence Repeats in diverse completed SARS-CoV-2 genomes. Gene Rep. 2021, 23, 101020. [Google Scholar] [CrossRef] [PubMed]
  21. Méndez-Tenorio, F.J.; Rojas-López, A.; Arias, C.F. Microsatellite Repeats in the Genome of Caliciviruses. J. Virol. 2002, 76, 10638–10643. [Google Scholar]
  22. Burrel, S.; Ait-Arkoub, Z.; Voujon, D.; Deback, C.; Abrao, E.P.; Agut, H.; Boutolleau, D. Molecular characterization of herpes simplex virus 2 strains by analysis of microsatellite polymorphism. J. Clin. Microbiol. 2013, 51, 3616–3623. [Google Scholar] [CrossRef]
  23. Sahu, B.P.; Majee, P.; Singh, R.R.; Nayak, D. Comparative analysis, distribution, and characterization of microsatellites in Orf virus genome. Sci. Rep. 2020, 10, 13852. [Google Scholar] [CrossRef]
  24. Monzón, S.; Varona, S.; Negredo, A.; Vidal-Freire, S.; Patiño-Galindo, J.A.; Ferressini-Gerpe, N.; Zaballos, A.; Orviz, E.; Ayerdi, O.; Muñoz-Gómez, A.; et al. Monkeypox Virus Genomic Accordion Strategies. Nat. Commun. 2024, 15, 3059. [Google Scholar] [CrossRef] [PubMed]
  25. Laliberte, J.-P.; Moss, B. Membrane Fusion by Poxvirus Entry/Fusion Complex. Proc. Natl. Acad. Sci. USA 2014, 111, E2834–E2843. [Google Scholar]
  26. Satheshkumar, P.S.; Chavre, J.; Moss, B. Role of the vaccinia virus O3 protein in cell entry can be fulfilled by its Sequence flexible transmembrane domain. Virology 2013, 444, 148–157. [Google Scholar] [CrossRef] [PubMed]
  27. Deiana, M.; Lavezzari, D.; Mori, A.; Accordini, S.; Pomari, E.; Piubelli, C.; Malagò, S.; Cordioli, M.; Ronzoni, N.; Angheben, A.; et al. Exploring Viral Genome Profile in Mpox Patients during the 2022 Outbreak, in a North-Eastern Centre of Italy. Viruses 2024, 16, 726. [Google Scholar] [CrossRef]
  28. Luna, N.; Ramírez, A.L.; Muñoz, M.; Ballesteros, N.; Patiño, L.H.; Castañeda, S.A.; Bonilla-Aldana, D.K.; Paniz-Mondolfi, A.; Ramírez, J.D. Phylogenomic analysis of the monkeypox virus (MPXV) 2022 outbreak: Emergence of a novel viral lineage? Travel Med. Infect Dis. 2022, 49, 102402. [Google Scholar] [CrossRef]
  29. Alakunle, E.; Kolawole, D.; Diaz-Cánova, D.; Alele, F.; Adegboye, O.; Moens, U.; Okeke, M.I. A Comprehensive Review of Monkeypox Virus and Mpox Characteristics. Front. Cell. Infect. Microbiol. 2024, 14, 1360586. [Google Scholar] [CrossRef]
  30. GISAID EpiPox GISAID EpiPox. Available online: https://gisaid.org// (accessed on 21 August 2025).
  31. Yadav, P.; Devasurmutt, Y.; Tatu, U. Phylogenomic and Structural Analysis of the Monkeypox Virus Shows Evolution towards Increased Stability. Viruses 2022, 31, 127. [Google Scholar] [CrossRef]
  32. Delamonica, B.; Davalos, L.; Larijani, M.; Anthony, S.J.; Liu, J.; MacCarthy, T. Evolutionary potential of the monkeypox genome arising from interactions with human APOBEC3 enzymes. Virus Evol. 2023, 2, vead047. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
  33. Lum, F.-M.; Torres-Ruesta, A.; Tay, M.Z.; Lin, R.T.P.; Lye, D.C.; Rénia, L.; Ng, L.F.P. Monkeypox: Disease Epidemiology, Host Immunity and Clinical Interventions. Nat. Rev. Immunol. 2022, 22, 597–613. [Google Scholar] [CrossRef]
  34. Molteni, C.; Forni, D.; Cagliani, R.; Arrigoni, F.; Pozzoli, U.; De Gioia, L.; Sironi, M. Selective Events at Individual Sites Underlie the Evolution of Monkeypox Virus Clades. Virus Evol. 2023, 9, vead031. [Google Scholar] [CrossRef]
  35. Moss, B. Poxvirus Entry and Membrane Fusion. Virology 2006, 344, 48–54. [Google Scholar] [CrossRef] [PubMed]
  36. Yu, J.; Zhang, X.; Liu, J.; Xiang, L.; Huang, S.; Xie, X.; Fang, L.; Lin, Y.; Zhang, M.; Wang, L.; et al. Phylogeny and molecular evolution of the first local monkeypox virus cluster in Guangdong Province, China. Nat. Commun. 2023, 14, 8241. [Google Scholar] [CrossRef] [PubMed]
  37. Paciello, I.; Rundlet, E.J.; Zhou, L.; Realini, G.; Perrone, F.; Pierleoni, G.; Troisi, M.; Postal, J.; Guivel-Benhassine, F.; Porrot, F.; et al. Antigen-agnostic identification of poxvirus broadly neutralizing antibodies targeting OPG153. Sci Transl Med. 2025, 10, eaeb3840. [Google Scholar] [CrossRef] [PubMed]
  38. Akther, S.; Su, M.; Wang, J.C.; Amin, H.; Taki, F.; De La Cruz, N.; Chowdhury, M.; Clabby, T.; Kopping, E.; Ruiz, V.E.; et al. Genomic Epidemiology of Mpox Virus during the 2022 Outbreak in New York City. Nat. Commun. 2025, 16, 8354. [Google Scholar] [CrossRef]
  39. O’Toole, Á.; Neher, R.A.; Ndodo, N.; Borges, V.; Gannon, B.; Gomes, J.P.; Groves, N.; King, D.J.; Maloney, D.; Lemey, P.; et al. Putative APOBEC3 deaminase editing in MPXV as evidence for sustained human transmission since at least 2016. bioRxiv 2023. [Google Scholar] [CrossRef]
  40. Brito, A.F.; Baele, G.; Nahata, K.D.; Grubaugh, N.D.; Pinney, J.W. Intrahost speciations and host switches played an important role in the evolution of herpesviruses. Virus Evol. 2021, 7, veab025. [Google Scholar] [CrossRef]
  41. Wood, D.E.; Salzberg, S.L. Kraken: Ultrafast Metagenomic Sequence Classification Using Exact Alignments. Genome Biol. 2014, 15, R46. [Google Scholar] [CrossRef]
  42. Jung, Y.; Han, D. BWA-MEME: BWA-MEM Emulated with a Machine Learning Approach. Bioinformatics 2022, 38, 2404–2413. [Google Scholar] [CrossRef]
  43. Genovese, G.; Rockweiler, N.B.; Gorman, B.R.; Bigdeli, T.B.; Pato, M.T.; Pato, C.N.; Ichihara, K.; McCarroll, S.A. BCFtools/Liftover: An Accurate and Comprehensive Tool to Convert Genetic Variants across Genome Assemblies. Bioinformatics 2024, 40, btae038. [Google Scholar] [CrossRef]
  44. Wick, R.R.; Judd, L.M.; Gorrie, C.L.; Holt, K.E. Unicycler: Resolving Bacterial Genome Assemblies from Short and Long Sequencing Reads. PLoS. Comput. Biol. 2017, 13, e1005595. [Google Scholar] [CrossRef]
  45. Marçais, G.; Delcher, A.L.; Phillippy, A.M.; Coston, R.; Salzberg, S.L.; Zimin, A. MUMmer4: A Fast and Versatile Genome Alignment System. PLoS. Comput. Biol. 2018, 14, e1005944. [Google Scholar] [CrossRef] [PubMed]
  46. Benson, G. Tandem Repeats Finder: A Program to Analyze DNA Sequences. Nucleic Acids. Res. 1999, 27, 573–580. [Google Scholar] [CrossRef] [PubMed]
  47. Katoh, K. MAFFT Version 5: Improvement in Accuracy of Multiple Sequence Alignment. Nucleic Acids. Res. 2005, 33, 511–518. [Google Scholar] [CrossRef] [PubMed]
  48. Nguyen, L.-T.; Schmidt, H.A.; von Haeseler, A.; Minh, B.Q. IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies. Mol. Biol. Evol. 2015, 32, 268–274. [Google Scholar] [CrossRef]
  49. Letunic, I.; Bork, P. Interactive Tree of Life (iTOL) v6: Recent Updates to the Phylogenetic Tree Display and Annotation Tool. Nucleic Acids. Res. 2024, 52, W78–W82. [Google Scholar] [CrossRef]
  50. Paradis, E.; Claude, J.; Strimmer, K. APE: Analyses of Phylogenetics and Evolution in R Language. Bioinformatics 2004, 20, 289–290. [Google Scholar] [CrossRef]
  51. Galili, T. Dendextend: An R Package for Visualizing, Adjusting and Comparing Trees of Hierarchical Clustering. Bioinformatics 2015, 31, 3718–3720. [Google Scholar] [CrossRef]
  52. Yu, G.; Smith, D.K.; Zhu, H.; Guan, Y.; Lam, T.T.-Y. Ggtree: An r Package for Visualization and Annotation of Phylogenetic Trees with Their Covariates and Other Associated Data. Methods Ecol. Evol. 2017, 8, 28–36. [Google Scholar] [CrossRef]
  53. The EMBOSS Project. The EMBOSS Website 2024. Available online: https://www.ebi.ac.uk/jdispatcher/emboss (accessed on 21 August 2025).
  54. The UniProt Consortium. UniProt: The Universal Protein Knowledgebase in 2025. Nucleic Acids. Res. 2025, 53, D609–D617. [Google Scholar] [CrossRef]
  55. Bertoni, D.; Tsenkov, M.; Magana, P.; Nair, S.; Pidruchna, I.; Querino, L.A.M.; Midlik, A.; Paramval, U.; Lawal, D.; Tanweer, A.; et al. AlphaFold Protein Structure Database 2025: A redesigned interface and updated structural coverage. Nucleic Acids Res. 2026, 54, D358–D362. [Google Scholar] [CrossRef]
  56. Pettersen, E.F.; Goddard, T.D.; Huang, C.C.; Couch, G.S.; Greenblatt, D.M.; Meng, E.C.; Ferrin, T.E. UCSF Chimera—A Visualization System for Exploratory Research and Analysis. J. Comput. Chem. 2004, 25, 1605–1612. [Google Scholar] [CrossRef]
Figure 1. Genomic organization of the MPXV genome, highlighting the distribution of SNPs and STRs across functional gene categories. A visual representation of the fully annotated MPXV assembly genomes (based on the NC_063383 genome annotation). Starting from the outer ring, MPXV genes are displayed and color-coded according to their predicted function: host modulation genes (dark green), surface protein genes (orange), assembly/budding genes (pink), replication/transcription genes (light green), and genes of unknown function (purple). Moving inward, the next track marks the positions of STRs, located either within genes (intragenic) or nearby (upstream or downstream). The following ring shows the number of repeat units for each STR (see scale bar). The innermost circle indicates the annotated SNPs across the genome. The double black lines delimit the ITRs of the MPXV genome.
Figure 1. Genomic organization of the MPXV genome, highlighting the distribution of SNPs and STRs across functional gene categories. A visual representation of the fully annotated MPXV assembly genomes (based on the NC_063383 genome annotation). Starting from the outer ring, MPXV genes are displayed and color-coded according to their predicted function: host modulation genes (dark green), surface protein genes (orange), assembly/budding genes (pink), replication/transcription genes (light green), and genes of unknown function (purple). Moving inward, the next track marks the positions of STRs, located either within genes (intragenic) or nearby (upstream or downstream). The following ring shows the number of repeat units for each STR (see scale bar). The innermost circle indicates the annotated SNPs across the genome. The double black lines delimit the ITRs of the MPXV genome.
Ijms 27 01371 g001
Figure 2. Comprehensive overview of STR copy number and structural variability across all samples. (a) Heatmap showing the distribution and the repeat number of the simple repeat motifs. The y-axis lists the samples, identified by sample id, while the x-axis shows the STRs. Color intensity reflects the STR copy number, as indicated by the scale bar. On the right, the lineage of the sequences in analysis is presented. (b) STR loci displaying rearrangements in repeat architecture (e.g., STR-IX and STR-X) are illustrated together with the distribution of the observed configurations.
Figure 2. Comprehensive overview of STR copy number and structural variability across all samples. (a) Heatmap showing the distribution and the repeat number of the simple repeat motifs. The y-axis lists the samples, identified by sample id, while the x-axis shows the STRs. Color intensity reflects the STR copy number, as indicated by the scale bar. On the right, the lineage of the sequences in analysis is presented. (b) STR loci displaying rearrangements in repeat architecture (e.g., STR-IX and STR-X) are illustrated together with the distribution of the observed configurations.
Ijms 27 01371 g002
Figure 3. Structural comparison of OPG153 wild-type and Δ4D variant. (A) Predicted three-dimensional model of wild-type OPG153 (blue) showing the poly-aspartic region (STR-VII, green). (B) Model of the Δ4D variant (yellow), in which the poly-Asp tract is shortened due to the deletion of four aspartic acid residues. (C) Overlay of wild-type and Δ4D models highlighting the contraction of the STR-VII acidic loop. (D) Close-up view of the poly-Asp region, with the deleted residues indicated in red (ΔAsp). The deletion does not disrupt the overall fold but shortens the acidic loop and slightly alters local charge distribution and flexibility.
Figure 3. Structural comparison of OPG153 wild-type and Δ4D variant. (A) Predicted three-dimensional model of wild-type OPG153 (blue) showing the poly-aspartic region (STR-VII, green). (B) Model of the Δ4D variant (yellow), in which the poly-Asp tract is shortened due to the deletion of four aspartic acid residues. (C) Overlay of wild-type and Δ4D models highlighting the contraction of the STR-VII acidic loop. (D) Close-up view of the poly-Asp region, with the deleted residues indicated in red (ΔAsp). The deletion does not disrupt the overall fold but shortens the acidic loop and slightly alters local charge distribution and flexibility.
Ijms 27 01371 g003
Figure 4. Phylogenetic tree. Circle tips represent samples, colored by lineage (B.1–C.1). Branch lengths reflect genetic distance, shown by the concentric rings; scale bar 100 substitutions. The circles represent the samples used for the tree construction; the triangles represent the collapsed branches. Labels highlight the samples analyzed in this study.
Figure 4. Phylogenetic tree. Circle tips represent samples, colored by lineage (B.1–C.1). Branch lengths reflect genetic distance, shown by the concentric rings; scale bar 100 substitutions. The circles represent the samples used for the tree construction; the triangles represent the collapsed branches. Labels highlight the samples analyzed in this study.
Ijms 27 01371 g004
Figure 5. Hierarchical clustering of STR profiles. The dendrogram is based on Euclidean distances, which reflect the dissimilarity between STR repeat copy numbers. Shorter distances indicate higher similarity in STR patterns between samples, while longer distances reflect greater divergence. The clustering is derived from repeat copy numbers and does not represent nucleotide sequence relationships.
Figure 5. Hierarchical clustering of STR profiles. The dendrogram is based on Euclidean distances, which reflect the dissimilarity between STR repeat copy numbers. Shorter distances indicate higher similarity in STR patterns between samples, while longer distances reflect greater divergence. The clustering is derived from repeat copy numbers and does not represent nucleotide sequence relationships.
Ijms 27 01371 g005
Table 1. Comparison of STRs identified in this study with those reported by Monzòn et al. (2024) [24]. The table summarizes the genomic location, repeat unit, repeat counts, and functional annotation for each STR identified in MPXV genomes from this study. The second column indicates the corresponding STRs described by Monzòn et al. (2024) [24]. Entries marked with an asterisk (*) denote STRs that overlap with a slightly different sequence composition than those reported in Palacios et al. NA indicates not available information.
Table 1. Comparison of STRs identified in this study with those reported by Monzòn et al. (2024) [24]. The table summarizes the genomic location, repeat unit, repeat counts, and functional annotation for each STR identified in MPXV genomes from this study. The second column indicates the corresponding STRs described by Monzòn et al. (2024) [24]. Entries marked with an asterisk (*) denote STRs that overlap with a slightly different sequence composition than those reported in Palacios et al. NA indicates not available information.
NameCorrespondence with Monzòn et al. [24]StartEndPattern# Max RepStr length (bp)# Samples Sharing the STRNearest GeneRelative Position to the GeneDistance in bpFunctional Insight
STR-ILCR10595611[T]141440/40OPG001Upstream−224Chemokine-binding protein
STR-IILCR1 *46774801[TAACTAACTTATGACT]914440/40OPG015Downstream37Ankyrin domains and immune evasion (TNF mimic)
STR-IIILCR1228,52728,537[A]9940/40OPG044IntrageneNABcl-2-like protein
STR-IVLCR1376,08276,090[T]9940/40OPG097IntrageneNAUnknown
STR-VLCR1480,84480,825[T]9940/40OPG104IntrageneNAUnknown
STR-VILCR5133,074133,098[T]252540/40OPG151Upstream−657Unknown
STR-VIILCR7136,498136,555[ATC]n + TATGAT + [ATC]n195740/40OPG153IntrageneNAKey attachment/egress protein
STR-VIIILCR15140,092140,131[ATAACAATT]4.43940/40OPG159IntrageneNAUnknown
STR-IXLCR8146,835146,920[ATATTTT]n + [ATTTT]n106440/40OPG170Upstream−57Unknown
STR-XLCR9 *150,544150,621[GATATGATGGATATGAT]n + [GGATATGAT]n56240/40OPG176IntrageneNABcl-2-like protein
STR-XILCR16152,661152,566[A]7940/40OPG180Upstream−31Unknown
STR-XIILCR171631,69163,207[TAAC]62440/40OPG188Upstream−63Unknown
STR-XIIILCR18166,070166,109[AATAATT]3.73940/40OPG190IntrageneNAIFNα/β binding protein homolog
STR-XIVLCR19 *169,709169,761[CAGATA]8.85240/40OPG197IntrageneNAUnknown
STR-XVLCR2 *173,252173,295[AT]224340/40OPG200Upstream−654Unknown
STR-XVILCR21174,506174,538[GATGAA]4.83240/40OPG204IntrageneNAInterferon decoy receptor; immune modulation
STR-XVIILCR3 *179,055179,245[ATATACATT]1614440/40OPG208Downstream34Serine protease inhibitor (anti-apoptotic, potential virulence)
STR-XVIIILCR4192,382192,534[AGTCATAAGTTAGTTA]914440/40OPG015Upstream−16Ankyrin domains and immune evasion (TNF mimic)
STR-XIXLCR11 *196,608196,624[A]141438/40OPG001Downstream233Chemokine-binding protein
Table 2. Patients’ general information. The table reports the details of each sample matrix, the collection date, the travel history, and the qPCR Ct. NA indicates not available information.
Table 2. Patients’ general information. The table reports the details of each sample matrix, the collection date, the travel history, and the qPCR Ct. NA indicates not available information.
P.MatrixTimepointSample_IdCollection_DateTravel_HistoryqPCR
P1Skin lesion swabIP1_SL_I25 May2022Mallorca21.00
P1Skin lesion swabIIP1_SL_1_II03 June 2022Mallorca21.51
P1Skin lesion swabIIIP1_SL_2_III07 June 2022Mallorca21.00
P2Skin lesion swab (genital)IP2_SLG_I18 July 2022Spain18.96
P2urineIP2_U_I18 July 2022Spain25.91
P2Rectal swabIP2_R_I18 July 2022Spain18.52
P2Skin lesion swab (genital)IP2_SLG_2_I18 July 2022Spain17.07
P2Skin lesion swab (genital)IIP2_SLG_II21 July 2022Spain20.96
P2Rectal swabIIP2_R_II21 July 2022Spain20.63
P3Skin lesion swabIP3_SL_I18 October 2022None14.70
P3Pharyngeal swabIP3_P_I18 October 2022None16.01
P3SalivaIP3_S_I18 October 2022None20.06
P3Skin lesion swab (head)IP3_SL_2_I18 October2022None18.99
P4Skin lesion swab (abdomen)IP4_SL_I02 November 2022Bosnia-Herzegovina16.70
P4Skin lesion swab (hand)IP4_SL_2_I02 November 2022Bosnia-Herzegovina14.40
P4Skin lesion swab (head)IP4_SL_3_I02 November 2022Bosnia-Herzegovina14.00
P4Pharyngeal swabIP4_P_I02 November 2022Bosnia-Herzegovina23.10
P4UrineIP4_U_I02 November 2022Bosnia-Herzegovina25.80
P4Skin lesion swab (leg)IIP4_SL_II02 November 2022Bosnia-Herzegovina25.30
P4UrineIIP4_U_II02 November 2022Bosnia-Herzegovina25.76
P5Skin lesion swabIP5_SL_I23 June 2022Spain22.19
P5Rectal swabIP5_R_I23 June 2022Spain18.78
P6Skin lesion swabIP6_SL_I21 July 2022None26.65
P7Skin lesion swab (genital)IP7_SLG_I01 July 2022None21.02
P8Skin lesion swab (genital)IP8_SLG_I15 July 2022None19.83
P8Skin lesion swab (genital)IP8_SLG_2_I15 July 2022None15.05
P8Skin lesion swab (genital)IIP8_SLG_II21 July 2022None19.07
P8UrineIIP8_SLG_2_II21 July 2022None25.10
P9Skin lesion swabIP9_SL_I01 July 2022USA26.67
P9Skin lesion swabIIP9_SL_2_II01 July 2022USA22.74
P10Skin lesion swabIP10_SL_I26 July2022None24.05
P10Skin lesion swabIIP10_SL_II29 July 2022None19.05
P11Skin lesion swab (abdomen)IP11_SL_I05 August 2022None20.17
P11Skin lesion swab (arm)IP11_SL_2_I05 August 2022None19.32
P12Skin lesion swab (head)IP12_SL_I18 August 2022None18.06
P12Rectal swabIP12_R_I18 August 2022None25.25
P13Skin lesion swabIP13_SL_I19September 2022None21.31
P15Pharyngeal swabIP15_P_I08 July 2022None27.03
P16Skin lesion swabIP16_SL_INANANA
P19Skin lesion swabIP19_SL_INANANA
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Deiana, M.; Locatelli, E.; Veschetti, L.; Malagò, S.; Mori, A.; Lavezzari, D.; Accordini, S.; Ronzoni, N.; Angheben, A.; Malerba, G.; et al. Insights into Genomic Dynamics and Plasticity in the Monkeypox Virus from the 2022 Outbreak. Int. J. Mol. Sci. 2026, 27, 1371. https://doi.org/10.3390/ijms27031371

AMA Style

Deiana M, Locatelli E, Veschetti L, Malagò S, Mori A, Lavezzari D, Accordini S, Ronzoni N, Angheben A, Malerba G, et al. Insights into Genomic Dynamics and Plasticity in the Monkeypox Virus from the 2022 Outbreak. International Journal of Molecular Sciences. 2026; 27(3):1371. https://doi.org/10.3390/ijms27031371

Chicago/Turabian Style

Deiana, Michela, Elena Locatelli, Laura Veschetti, Simone Malagò, Antonio Mori, Denise Lavezzari, Silvia Accordini, Niccolò Ronzoni, Andrea Angheben, Giovanni Malerba, and et al. 2026. "Insights into Genomic Dynamics and Plasticity in the Monkeypox Virus from the 2022 Outbreak" International Journal of Molecular Sciences 27, no. 3: 1371. https://doi.org/10.3390/ijms27031371

APA Style

Deiana, M., Locatelli, E., Veschetti, L., Malagò, S., Mori, A., Lavezzari, D., Accordini, S., Ronzoni, N., Angheben, A., Malerba, G., Tacconelli, E., Cusi, M. G., Gobbi, F. G., Piubelli, C., & Castilletti, C. (2026). Insights into Genomic Dynamics and Plasticity in the Monkeypox Virus from the 2022 Outbreak. International Journal of Molecular Sciences, 27(3), 1371. https://doi.org/10.3390/ijms27031371

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop