Abstract
In Saudi Arabia, cucurbit crops such as zucchini (Cucurbita pepo) and snake gourd (Trichosanthes cucumerina) are major vegetables and key dietary components, yet their associated viral threats remain poorly understood. We surveyed symptomatic cucurbit samples from greenhouses and open fields in the Al-Ahsa and Qatif regions. The detection methods employed included PCR, RCA, and Illumina NGS. Based on nucleotide sequence comparisons and maximum-likelihood phylogenetic analysis, we identified three viruses, i.e., TYLCV, WmCSV, and ToLCPalV, present as both single and mixed infections. Sequence analyses revealed a novel strain, TYLCV-Hasa, representing a distinct lineage of TYLCV. Analysis revealed that recombination occurred solely in the DNA-A components of the identified viruses, while DNA-B segments showed no evidence of recombination. Notably, no DNA satellites were detected, suggesting cucurbits may act as independent reservoirs of begomovirus diversity. These results provide a comprehensive genomic insight into cucurbit-infecting begomoviruses in Eastern Saudi Arabia. The discovery of TYLCV-Hasa and evidence of recombination raise concerns about the emergence of novel viral variants that could pose risks to cucurbit cultivation. The results establish a foundation for advanced molecular surveillance and breeding strategies, contributing to improved food security and supporting Saudi Arabia’s Vision 2030 goals for sustainable agriculture.
1. Introduction
Members of the family Geminiviridae form the most extensive and diverse assemblage of ssDNA plant viruses, characterized by their distinctive geminate (twin) icosahedral particles enclosing circular single-stranded DNA genomes. These viruses pose a significant threat to agriculture in tropical and subtropical regions due to their broad host range, which spans both monocotyledonous and dicotyledonous crops, leading to substantial yield losses [1,2]. The family currently comprises 14 recognized genera. Begomovirus is the most species-rich genus, containing over 500 classified species, which are phylogenetically separated into Old World (OW) and New World (NW) evolutionary lineages [3]. Begomoviruses typically have one of two genome types: a bipartite structure consisting of two components, DNA-A and DNA-B (each approximately 2.6 kb), or a monopartite structure containing a single DNA-A component (about 2.7–2.8 kb). Their genome organization is strongly influenced by geography, with bipartite begomoviruses prevalent in the NW, while monopartite forms dominate the OW [4]. The DNA-A component of Old World begomoviruses is known to encode a basic set of six ORFs [5]. Specifically, the virion-sense strand expresses CP and AV2/V2, while the complementary strand expresses Rep, TrAP, REn, and C4. Notably, some recent studies on monopartite species have reported the existence of extra ORFs like V3 and C5–C7 [6,7]. Meanwhile, DNA-B encodes two key proteins: BV1, the nuclear shuttle protein (NSP), and BC1, the movement protein (MP), both of which are essential for viral transport within and between host cells [8]. Both DNA components share a ~200-nucleotide common region (CR) containing the replication origin and a conserved nonanucleotide motif (TAATATTAC), both of which are crucial for both replication and bidirectional transcription. In the OW, it is common for monopartite begomoviruses to be found in association with ssDNA-satellites, i.e., betasatellites, alphasatellites, and deltasatellites, which are known to influence viral pathogenicity [9,10]. For example, betasatellite-encoded βC1 is widely recognized as the primary contributor to symptom development [11]. However, a recently described protein βV1 also enhances virulence [12]. Moreover, the interaction of the virus with alphasatellites and deltasatellites introduces additional layers of complexity to the disease mechanism and the plant’s host response [13].
The global dissemination of begomoviruses is driven by the polyphagous whitefly vector (Bemisia tabaci), the trade in infected plant material, the viruses’ high adaptability through recombination, and mixed infections [14,15]. Although local begomovirus populations often retain genetic distinctiveness, their evolution can occur through localized adaptation or parallel diversification [16]. The transboundary dissemination of these viruses is an increasingly documented phenomenon. For instance, Tomato yellow leaf curl virus (TYLCV) has progressively expanded westward into Africa [17], while Squash leaf curl virus has been detected throughout the Middle East [18]. Similarly, cotton leaf curl disease has shown an eastward spread reaching China [19]. Moreover, Watermelon chlorotic stunt virus (WmCSV), originally confined to the Old World, has now been reported in both the United States and Mexico [20,21]. Another closely related virus, tomato leaf curl Palampur virus (ToLCPalV), first described in India, has subsequently been identified across several Middle Eastern countries, including Iran, Iraq, Oman, and Saudi Arabia [22,23,24]. Several regional studies also highlighted distinct evolutionary patterns for TYLCV in the Middle East. Hosseinzadeh, et al. [25] proposed Iran as a center of TYLCV diversification, reinforced by a recent genetic diversity analysis by Akbar, et al. [26], suggesting that many TYLCV variants circulating in the Arabian Peninsula likely originated from, or are closely linked to, Iranian lineages.
Both monopartite and bipartite begomoviruses are documented in Saudi Arabia. Key bipartite species include WmCSV, ToLCPalV, and tomato leaf curl Sudan virus (ToLCSDV). The main monopartite begomoviruses impacting crops are TYLCV and cotton leaf curl Gezira virus (CLCuGeV) [27,28,29,30]. Notably, ToLCPalV has been recently reported to cause mixed infections in tomato and cucurbit crops within the Al Ahsa region [22,31].
Among susceptible hosts, zucchini (Cucurbita pepo L.) and snake gourd (Trichosanthes cucumerina L.) are widely grown cucurbits with high economic importance. Globally, zucchini contributes to approximately 23.4 million tonnes of annual squash production (https://www.futuremarketinsights.com/reports/squash-market; accessed on 15 May 2025). In Saudi Arabia, zucchini cultivation has expanded under greenhouse-based systems, with total vegetable output in regions like Qassim exceeding 115,000 tonnes annually [32]. Although less commercially prominent than zucchini, snake gourd is also cultivated across tropical Asia and the Middle East and remains a key dietary and regional cucurbit crop [33]. Both hosts are vulnerable to begomoviruses, which cause characteristic symptoms like chlorotic mosaic, interveinal yellowing, curling, vein thickening, and stunting, ultimately reducing yield and marketable quality, especially under high whitefly infestations [28,30]. While the number of begomovirus reports in the Arabian Peninsula is increasing, the full extent of the genetic diversity of the circulating viral genomes remains poorly understood because molecular surveillance has primarily relied on traditional, primer-dependent methods [27,34]. These approaches are often constrained by primer selectivity and the dominance of specific viral populations, making it difficult to accurately assess the viral diversity in infected hosts. Although cucurbits have substantial agricultural importance, their viral pathogens in Saudi Arabia remain poorly understood, and the identity, diversity, and genome organization of begomoviruses infecting zucchini and snake gourd remain largely uncharacterized. This highlights the need for a comprehensive investigation of begomovirus infections in cucurbits using next-generation sequencing (NGS).
In this study, we aimed to detect, identify, and molecularly characterize begomovirus strains infecting zucchini and snake gourd in the Eastern regions of Saudi Arabia using an NGS approach. Field surveys in Al-Ahsa revealed symptomatic plants from both open fields and greenhouse environments. While preliminary PCR diagnostics confirmed the presence of begomovirus, and NGS revealed mixed infections involving WmCSV, TYLCV, and ToLCPalV, occurring as either single or co-infections. These findings highlight the complex viral landscape affecting cucurbit production in the region. By integrating PCR-based diagnostics, rolling circle amplification (RCA), NGS, and phylogenetic analysis, this study provides new insights into begomovirus diversity in zucchini and snake gourd and establishes a basis for improved disease surveillance and management strategies in Saudi agriculture.
2. Materials and Methods
2.1. Collection of Plant Samples and Preliminary Begomovirus Screening
Leaf tissues exhibiting characteristic begomovirus-associated symptoms, including interveinal yellow mottle/mosaic, vein banding or clearing, upward leaf curling, and deformation of young foliage with reduced leaf size and mild stunting (Figure 1A, zucchini; B, snake gourd), were collected from 13 zucchini and six snake gourd plants. The samples were gathered from eight different greenhouse and field plots in the Al-Hofuf and Qatif regions of Saudi Arabia (Supplementary Table S1). Immediately after collection, samples were snap-frozen in liquid nitrogen and subsequently held at −80 °C until DNA extraction.
Figure 1.
Virus-like symptoms observed on infected cucurbit plants. (A) Symptoms on zucchini (Cucurbita pepo) include yellow mosaic, vein banding and clearing, leaf rugosity, blistering, and cupping, as well as reduced leaf size and stunting. (B) Symptoms on snake gourd (Trichosanthes cucumerina) include bright interveinal yellow mosaic, puckering, and marginal curling, with the most severe deformation observed on young leaves.
Genomic DNA was extracted using a commercial silica-based kit (Qiagen, Redwood City, CA, USA) following the manufacturer’s protocol. Initial virus screening was conducted through PCR using the universal degenerate primers AC1048/AV494 [35], which are designed to target a ~550 bp region of the begomoviral CP gene. Subsequently, the amplified DNA fragments were purified with the GeneJet PCR Purification Kit (ThermoFisher Scientific, Waltham, MA, USA) and were sequenced bidirectionally at the Macrogen facility in Korea.
2.2. Full-Length Genome Recovery by Rolling Circle Amplification and Next-Generation Sequencing
To obtain complete begomovirus genome sequences, RCA was performed on DNA extracted from all positive samples using Φ29 DNA polymerase (GE Healthcare, Chicago, IL, USA). RCA-enriched products were purified from four zucchini and two snake gourd samples and processed for NGS (Table 1). For six representative samples, libraries were prepared with Nextera XT and sequenced as paired-end reads (2 × 300 bp) on an Illumina MiSeq platform at Macrogen (Seoul, Republic of Korea). This approach is designed to capture circular DNA viruses but does not detect RNA viruses. The protocol used was consistent with previously published reports [22,31].
Table 1.
Molecular detection and Illumina MiSeq whole-genome sequencing of begomoviruses in zucchini and snake gourd samples.
2.3. Genome Assembly and Bioinformatic Analysis of NGS Data
Raw reads generated in FASTQ format were initially evaluated for quality using FastQC v0.11.8 [36]. Raw reads were quality-filtered with Trimmomatic v 0.39 to remove adapters and low-quality bases, and the post-trim metrics were inspected in FastQC to verify improvement [37]. We used BWA-MEM2 v2.2.1 [38] to map the cleaned sequence data to a set of begomovirus reference genomes, including TYLCV, and the respective DNA-A and DNA-B segments for both ToLCPalV and WmCSV. Resulting SAM files were converted, sorted, and indexed as BAM with SAMtools v1.9, and Picard was used to harmonize read-group annotations. Consensus genomes were derived with SAMtools mpileup and iVar v1.3 [39], applying a minimum depth cutoff of 20× for base acceptance.
2.4. PCR Validation of Viral Genome Components
To verify the viral sequences derived from NGS, RCA products from infected samples were diluted (1:10) and re-amplified thereafter, using component-specific primers as described previously [31]. The resulting amplicons were purified and subjected to a bidirectional Sanger sequencing approach at Macrogen Inc. (Seoul, Republic of Korea). Sequence identities were established by conducting BLASTn searches against the NCBI GenBank database (https://blast.ncbi.nlm.nih.gov/Blast.cgi; accessed on 1 April 2025).
2.5. Sequence Alignments and Identity Analysis
The pairwise nucleotide (nt) identities were computed for all query sequences, after which the most similar reference genomes were retrieved from GenBank for comparative evaluation. For each component (DNA-A and DNA-B), we built full-length multiple-sequence alignments with ClustalW in MEGA12 [40]. We subsequently calculated pairwise nt sequence identity percentages using the Sequence Demarcation Tool (SDT v1.2), adhering to the standard classification guidelines for geminiviruses [41]. Coding and intergenic regions were annotated with the NCBI ORFfinder tool (available at https://www.ncbi.nlm.nih.gov/orffinder/, accessed on 15 May 2025).
2.6. Phylogenetic Analysis
Phylogenetic relationships among begomovirus genomic components were reconstructed by maximum likelihood (ML) in MEGA v12. Model testing indicated that the General Time Reversible model with a gamma-distributed rate variation (GTR+I) best described the data, and this model was applied in subsequent analyses. Resulting trees were exported in Newick format and graphically refined using iTOL v6.5 and Adobe Illustrator CC 2025.
2.7. Detection of Recombination Events
To investigate recombination patterns, a dataset was compiled consisting of 100 complete begomovirus DNA-A and 100 DNA-B genomic component sequences, including those obtained in this study. Only sequences sharing >70% nt identity with the study isolates were included, following the criteria of Crespo-Bellido, et al. [42]. The datasets were exported as FASTA files after being aligned in MEGA12. Recombination events were identified using two programs, GARD and RDP5 v5.0 [43]. We utilized the seven algorithms contained in the RDP5 suite and enforced a strict validation criterion: only events supported by at least three separate methods were accepted. A Bonferroni-adjusted p-value of 0.05 was applied to assess significance, and default settings were used for all program runs.
2.8. Genetic Diversity, Neutrality, and Population Differentiation Analyses
To complement the genomic characterization of cucurbit-associated begomoviruses obtained through high-throughput sequencing, we performed a targeted regional population-genetic analysis. This analysis aimed to assess whether Iranian (IR) and Arabian-Red Sea (AR) begomovirus populations exhibit measurable differences in nt diversity and demographic signatures. This analysis focused specifically on WmCSV and ToLCPalV because (i) these viruses were prevalent in cucurbits in Eastern Saudi Arabia, and (ii) detailed population-genetic studies already exist for TYLCV in Iran and the Arabian Peninsula [25,26]. Therefore, TYLCV was not re-evaluated in this study.
2.8.1. Dataset Definition and Population Delineation
Full-length DNA-A and DNA-B sequences of ToLCPalV and WmCSV were retrieved from GenBank and supplemented with Saudi Arabian sequences generated in this study. To avoid sampling biases from regions with disproportionate sequencing efforts and to maintain a focus on cross-border viral movement relevant to Eastern Saudi Arabia, the dataset was restricted to two geographically defined populations, i.e., Pop-1 (IR): Iran; and Pop-2 (AR): Arabian-Red Sea corridor (Saudi Arabia, Oman, Iraq, Yemen, Kuwait, Sudan). We acknowledge that the Pop-1 dataset comprises fewer sequences and represents historical data compared to the contemporary and larger Pop-2 dataset. While this disparity is a common limitation in retrospective analyses, the comparison provides valuable insights into potential demographic shifts over time.
Partial genomes, sequences containing >3 ambiguous nts, redundant duplicates, or recombinant fragments supported by ≥3 RDP5 algorithms (Bonferroni-corrected p ≤ 0.05) were removed from the dataset. Separate alignments were constructed for each virus and genome component (DNA-A and DNA-B).
2.8.2. Multiple Sequence Alignment and Recombination Screening
The sequences were aligned in MEGA12 using the MUSCLE algorithm and manually inspected to maintain codon structure. The best-fit nt substitution model for each dataset was selected according to the Bayesian Information Criterion (BIC). Recombination screening was performed using RDP, GENECONV, BootScan, MaxChi, Chimaera, SiScan, and 3Seq under default parameters. Recombinant sequences or breakpoints identified by at least three methods were removed from downstream analysis to prevent inflation of diversity statistics.
2.8.3. Genetic Diversity and Neutrality Tests
Genetic diversity indices were estimated in DnaSP v6.12, including the number of polymorphic sites (S), total mutations (Eta), number of haplotypes (h), haplotype diversity (Hd), nt diversity (π), average nucleotide differences (k), and Watterson’s estimator (θw). Neutrality tests, including Tajima’s D (TD) and Fu & Li’s D (FLD), were calculated using 10,000 coalescent simulations to test for departures from mutation-drift equilibrium. Gaps and missing sites were treated by pairwise deletion. Analyses were performed separately for DNA-A and DNA-B components of both viral species.
2.8.4. Population Differentiation and Gene Flow Statistics
To assess the degree of genetic differentiation between Pop-1 and Pop-2, we calculated differentiation indices, including KST and the nearest-neighbor statistics (Snn), using DNASP v6.12. Additionally, the pairwise fixation index (FST) and Analysis of Molecular Variance (AMOVA) were performed using Arlequin v3.5.2.2. The significance of the FST values was determined using a non-parametric permutation test with 1000 replicates. AMOVA was used to partition the total molecular variance into components attributable to differences among populations and within populations.
3. Results
3.1. Next-Generation Sequencing and Data Analysis
During a survey of eight greenhouse and field plots in the Al-Ahsa region (including Al-Hofuf and Qatif municipalities), Saudi Arabia, we observed that approximately 40–60% of zucchini plants and 20–30% of snake gourd plants displayed characteristic begomovirus-like symptoms with associated whitefly infestations. Symptoms in zucchini were severe, including bright yellow mosaic/mottle with vein-clearing and banding, laminar rugosity/blistering, upward cupping and curling, apical leaf clustering, and a marked reduction in leaf size and plant stature (Figure 1A). In snake gourd, symptoms were milder, predominating on young leaves as interveinal yellow mosaic patches with marginal curling/puckering and limited stunting (Figure 1B). These profiles are consistent with typical begomovirus infections in cucurbit hosts.
We confirmed begomovirus infection in 12 zucchini and two snake gourd samples collected from Al-Hofuf and Qatif using primers targeting the core CP gene. Subsequent Sanger sequencing of the amplicons revealed strong sequence identity with the ToLCOMV and WmCSV DNA-A components. Based on these initial findings, four zucchini samples (SqH1, SqH42, SqSK7, and SqA2) and two snake gourd samples (SqH33 and SqA8my) were selected for NGS analysis using the Illumina high-throughput sequencing platform. From these six samples, we generated six DNA libraries, producing 2,380,606, 2,299,758, 2,567,202, 2,262,856, 2,728,541, and 2,422,677 raw paired reads per sample, which were assembled into full-length begomovirus genomes, including both bipartite and monopartite species. The sequences recovered during this study have been deposited into the NCBI GenBank database, each receiving a unique accession number (Table 1). Notably, no DNA-satellites were detected in any of the samples analyzed.
3.2. Deciphering Begomovirus Genomes Through Comparative Sequence Analysis
Pairwise sequence identity analysis using SDT demonstrated that most of the samples were infected with a single begomovirus species. This included the zucchini samples (SqH1, SqH41, and SqSK7) and the snake gourd samples (SqH33 and SqA8my). However, the zucchini sample SqA2 from the Qatif region was distinctive in containing a mixed infection of begomoviruses. An examination of the full-length sequences SHT1, SHT2, and SQT1 showed mutual identity ranging from 93.8 to 98.8% (Figure 2A). When compared to GenBank entries, SHT1 (from SqH1 zucchini) and SQT1 (from SqSK7 zucchini) showed nt sequence identities in the range of 98.0 to 99.7% with three existing TYLCV isolates from Saudi Arabia, which originated from cucumber (OR865128) and Malva (OL416210 and OL416216). In contrast, these two sequences showed less than 94% nt identity with other TYLCV isolates. Finally, the sequence SHT2 from the snake gourd sample (SqH33) showed it was most closely related (95.8% nt sequence identity) to the “Boushehr” strain isolate (GU076454) from Iran [44], as shown in Figure 2A. In the ML phylogenetic analysis of the full-length begomovirus sequences, the three isolates from this study showed distinct evolutionary relationships (Figure 3A). The isolate SHT2 clustered most closely with the Iranian “Boushehr” strain of TYLCV (GU076454), along with other TYLCV isolates from Sudan and Iran, suggesting an introduction from outside the region. In contrast, isolates SHT1 and SQT1 grouped with previously reported Saudi isolates, indicating local circulation and possible in-country diversification. Based on nt sequence comparisons and the ICTV criteria for begomovirus species demarcation (>92%) and strain demarcation (>94%) [45], we propose a strain-level group, TYLCV-Hasa, comprising SHT1, SQT1, and three previously reported Saudi TYLCV genomes (OR865128, OL416210, and OL416216). We denote the focal genome as TYLCV-Hasa [SA:Zuc:18].

Figure 2.
Sequence Demarcation Tool (SDT) heat map of pairwise nucleotide identity among complete begomovirus genomes. (A) DNA-A for WmCSV, ToLCPalV, and TYLCV. (B) DNA-B for WmCSV and ToLCPalV. Sequences were aligned with MUSCLE, and identities computed with SDT. The color scale spans 70–100%, with blue indicating lower identity and red indicating higher identity. The white text, boxed with a black background and red asterisks, marks the Saudi isolates identified in this study.
Figure 3.
Phylogenetic analysis of begomovirus genomes. (A) DNA-A maximum-likelihood (ML) tree including TYLCV, ToLCPalV, and WmCSV. Two ToLCPalV DNA-A clades are indicated, and the Saudi TYLCV isolates group together within the TYLCV lineage. Support values are shown as filled circles, and the scale bar represents nt substitutions per site (0.1). (B) DNA-B ML tree including ToLCPalV and WmCSV. Two ToLCPalV DNA-B clades are indicated, and Saudi WmCSV genomes cluster within the regional WmCSV group. Support values are shown as filled circles, and the scale bar represents substitutions per site (1). Accession numbers are provided next to each taxon. The names of the sequences identified in this study are boxed in black and highlighted with red asterisks.
The sequences SHWA1, SQWA1, and SQWA2 shared mutual nt sequence identities of 94.3–99.8%. The sequences SHWA1 from the zucchini sample (SqH41) and SQWA1 from the snake gourd sample (SqA8my) were 98.7% and 98.5% identical, respectively, to the WmCSV isolate (PP320240) from snake gourd in Saudi Arabia [30]. The SQWA2 sequence, obtained from the zucchini sample (SqA2), exhibited a maximum nt sequence identity of 96.0% with WmCSV isolates previously reported from both cucumber and snake gourd in Saudi Arabia [31]. In the resulting phylogenetic dendrogram, these three isolates were placed within the WmCSV clade, showing close relationships to other Saudi Arabian WmCSV isolates and further supporting their endemic status.
The sequences SHWB1, SQWB1, and SQWB2 demonstrated a high degree of mutual nt sequence identity, ranging from 95.7 to 100%. Specifically, SHWB1 (from zucchini sample SqH41) and SQWB1 (from snake gourd sample SqA8my) shared their highest nt identity (96.3–96.7%) with a WmCSV isolate from Iran (AJ245653) [46], as illustrated in Figure 2B. Conversely, sequence SQWB2 (from zucchini sample SqA2) showed a 98.8% nt sequence identity with a WmCSV isolate previously reported in Saudi Arabian cucumber (OR865138) [31]. Our phylogenetic analysis confirmed that these isolates belong to the WmCSV DNA-B clade, where they formed a robustly supported sub-lineage alongside other regional and Saudi Arabian isolates. The tight grouping of these isolates, supported by high bootstrap values, indicates their divergence from a recent common ancestor and highlights their localized circulation within the cucurbit-growing regions of Saudi Arabia.
The nt sequence identity of SQPA1 from the zucchini sample (SqA2) was computed as the highest (97.6%) with the ToLCPalV DNA-A isolate (OR865137) [31]. Isolate SQPA1 (PV645676) was positioned within clade-2 of the ToLCPalV clade in the phylogenetic dendrogram, grouping with sequences from both Saudi Arabia and other Middle Eastern countries, which implies its widespread regional circulation (Figure 3A). The sequence SQPB1 showed a very high (99.7%) nt sequence identity match with a previously reported DNA-B component (OR865145) of the ToLCPalV isolate identified from cucumber in Saudi Arabia [31]. In the DNA-B phylogenetic tree, isolate SQPB1 grouped within clade-2 of ToLCPalV DNA-B, showing close phylogenetic affinity with other Saudi Arabian isolates such as OR865146 and OR865145. This cluster was supported by strong bootstrap values, indicating a robust evolutionary relationship. The placement of PV645671 within clade-2, together with Middle Eastern isolates, suggests regional adaptation and potential cross-border virus movement, consistent with the dynamic epidemiology of begomoviruses in cucurbit hosts.
Detection of each begomovirus component in the corresponding zucchini and snake gourd samples was confirmed with previously reported, component-specific primer sets [31].
3.3. Exploring Potential Recombination Events
Recombination was first examined using the automated GARD tool (Datamonkey online source, https://www.datamonkey.org/, accessed on 20 July 2025). This analysis identified both strong and subtle recombination signals exclusively within the begomoviral DNA-A sequences (Figure 4A). Conversely, neither the GARD results nor the subsequent RDP analysis detected any significant recombination signals in the DNA-B component of the identified begomovirus genomes.
Figure 4.
Recombination analyses of begomovirus DNA-A genomes. (A) GARD analysis of the DNA-A multiple-sequence alignment. The best-fit model identifies phylogenetically supported breakpoint regions along the genome; dots mark inferred breakpoints, and the complete genome illustration is shown for orientation. RDP-based recombination analysis of (B) TYLCV isolates SHT1, SHT2, and SQT1; (C) ToLCPalV DNA-A isolate SQPA1; and (D) WmCSV DNA-A isolate SQWA2. Circular and linear genomic illustrations are shown for the recombinant isolates with recombinant fragments, putative major/minor parents are indicated next to each fragment, and breakpoint intervals are given in parentheses. RDP suite methods (RDP, GENECONV, Bootscan, MaxChi, Chimaera, SiScan, and 3Seq) were used for panels, B-D and events were accepted when supported by ≥3 methods under Bonferroni-corrected p < 0.05. Colors denote coding regions and recombinant fragments as depicted.
Comprehensive recombination screening with the RDP suite revealed multiple, well-supported events across the analyzed begomovirus isolates (Table 2, Figure 4). For SHT1, two independent recombination breakpoints were identified.
Table 2.
Recombination events inferred for the identified begomovirus genomes using the RDP suite.
The first spanned nt positions 97–1958, with TYLCV (GU076446) acting as the major parental sequence and TYLCV_SQT1 as the minor contributor (p = 5.84 × 10−28). The second region, covering positions 1959 and 2112, involved TYLCV (DQ631892) and TYLCV (HG941641) as the respective major and minor parents (p = 2.75 × 10−12). For SQT1, three distinct recombination signals were detected. The first occurred between positions 1172 and 1273, derived from TYLCV (EF423426) as the major parent and WmCSV_SQWA2 as the minor sequence. A second event, between 1941 and 2093, also involved TYLCV (DQ631892) and TYLCV (HG941641) as the putative parents. A third event extended across positions 2661 and 2717, where TYLCV (KJ830842) served as the major parent and an unidentified TYLCV isolate (ON756218) as the minor parent, supported by a highly significant probability (p = 3.30 × 10−12). In SHT2, two recombination sites were identified. The first, between positions 1172 and 1273, showed TYLCV (EF423426) as the major parent and WmCSV_SQWA2 as the minor parent. The second event, spanning 2105 and 2752, was formed between TYLCV_SQT1 and an uncharacterized TYLCV isolate (JQ928348), both exhibiting strong statistical support (p = 7.96 × 10−11 and p = 1.15 × 10−69, respectively).
For SQPA1, a single recombination breakpoint was recorded between positions 1158–1241, with TYLCV (GU076446) identified as the major parent and TYLCV_SQT1 as the minor parental sequence (p = 5.61 × 10−12). Finally, SQWA2 demonstrated three independent recombination regions: one extending from 160 to 276 (between WmCSV OK058529 and TYLCV_SQT1), a second between 344 and 506 (between WmCSV KJ958912 and TYLCV_SQT1), and a third between 1718 and 1854 (between WmCSV PP320241 and TYLCV OR865126). All three events were consistently confirmed by multiple detection algorithms, with p-values ranging from 4.97 × 10−16 to 3.35 × 10−19.
3.4. Genetic Diversity Analysis of Local Begomovirus Populations
The diversity analyses showed clear differences between Pop-1 and Pop-2 for both ToLCPalV and WmCSV (Table 3, Figure 5). For ToLCPalV DNA-A, Pop-1 displayed moderate diversity (π = 0.009; S = 204), while Pop-2 showed higher diversity (π = 0.014; S = 210). Haplotype diversity was high in both groups (Hd = 0.96–1.00). Watterson’s estimator (θw) exceeded π in both populations, indicating an excess of low-frequency variants (Figure 5); this gap was largest in Pop-1 and corresponded to the strongly negative neutrality values (Tajima’s D = −2.26; Fu & Li’s D = −3.47).
Table 3.
Estimation of genetic diversity and neutrality test indices for TYLCV, ToLCPalV and WmCSV DNA-A and DNA-B in Pop-1 and Pop-2 datasets.
Figure 5.
Comparative analysis of genetic diversity indices for ToLCPalV and WmCSV genomic components. (A) Genetic diversity profiles for the Iranian population (Pop-1), representing the historical diversification center. (B) Genetic diversity profiles for the Arabian-Red Sea population (Pop-2), representing the zone of viral expansion. The line graphs illustrate the values of nucleotide diversity (π) and Watterson’s estimator (θw) calculated for the DNA-A and DNA-B components of each virus. The divergence between θw and π indicates the presence of low-frequency variants and demographic expansion.
A similar trend was observed for ToLCPalV DNA-B, where Pop-2 exhibited greater polymorphism (S = 379) and nt diversity (π = 0.040) compared with Pop-1 (S = 287; π = 0.026). Again, θw was higher than π in both populations (Table 3; Figure 5), consistent with the mildly negative neutrality statistics in Pop-1 and near-neutral values in Pop-2.
For WmCSV DNA-A, Pop-1 showed relatively low diversity (π = 0.010; S = 76), whereas Pop-2 displayed substantially higher variation (π = 0.019; S = 262). This pattern was most pronounced in WmCSV DNA-B, where Pop-2 had the highest diversity across all datasets (π = 0.044; S = 510). In all WmCSV datasets, θw remained greater than π, producing consistently negative but moderate neutrality values. Overall, the Arabian–Red Sea population showed higher nt diversity across both viruses, while the Iranian population exhibited stronger departures from neutrality, reflecting distinct demographic histories. It should be noted that the lower diversity observed in Pop-1 may be partially influenced by the smaller sample size and older collection dates relative to Pop-2.
3.5. Population Differentiation Analysis
The genetic structure of ToLCPalV and WmCSV populations was further investigated through differentiation indices and AMOVA (Table 4). All viral components exhibited statistically significant genetic differentiation between Pop-1 and Pop-2 (p < 0.05). The highest level of differentiation was observed in ToLCPalV DNA-A (FST = 0.178, p < 0.001), supported by a high Snn value of 0.907 and a KST of 0.046. ToLCPalV DNA-B also showed highly significant differentiation (FST = 0.106, p < 0.001), with an even higher Snn value of 0.931, suggesting very distinct haplotype compositions between the two regions.
Table 4.
Genetic differentiation estimates and Analysis of Molecular Variance (AMOVA) between Pop-1 and Pop-2 of ToLCPalV and WmCSV.
For WmCSV, DNA-A showed moderate but significant differentiation (FST = 0.118, p = 0.006), with a high Snn (0.967) indicating strong population subdivision. WmCSV DNA-B displayed the lowest differentiation among the analyzed components (FST = 0.093, p = 0.031), though the result remained statistically significant.
AMOVA results were consistent with the fixation indices, revealing that while most of the genetic variation exists within populations (82.25–90.67%), a significant proportion of the total variation is attributed to differences among populations (Table 4). Specifically, ToLCPalV DNA-A showed the highest variation among populations (17.75%), followed by WmcSV DNA-A (11.97%), ToLCPalV DNA-B (10.61%), and WmCSV DNA-B (9.33%).
4. Discussion
The global expansion and increasing incidence of begomoviruses pose significant threats to agricultural productivity [16], particularly in economically important crops like zucchini and snake gourd [28,30]. In Saudi Arabia, where cucurbit cultivation is expanding, understanding the genetic diversity and distribution of circulating begomoviruses is essential for designing effective and sustainable control measures. This study utilized an NGS approach to comprehensively characterize begomovirus strains infecting symptomatic zucchini and snake gourd plants from the Al-Ahsa region of Saudi Arabia. The results demonstrated the co-presence of TYLCV, WmCSV, and ToLCPalV, occurring either individually or in mixed infections within the same hosts.
Our results corroborate those of previous regional reports, which confirm the prevalence of both monopartite and bipartite begomoviruses in the Arabian Peninsula [22,27,28,30,31]. The observed disease incidence (40–60% in zucchini and 20–30% in snake gourd) and characteristic symptoms, such as interveinal chlorosis, yellow mosaic patterns, leaf curling, and stunting, are consistent with typical begomovirus infections in cucurbit hosts. Beyond begomoviruses, cucurbits in Saudi Arabia face significant threats from RNA viruses. Zucchini yellow mosaic virus (ZYMV), watermelon mosaic virus (WMV), and papaya ringspot virus (PRSV) are the dominant pathogens causing severe disease across the region [47,48]. In Al-Ahsa, cucumber mosaic virus (CMV) often complicates diagnosis [49], while cucurbit yellow stunting disorder virus (CYSDV) remains a persistent issue in western Saudi Arabia [50]. This complex viral landscape emphasizes the need for broad-spectrum surveillance strategies.
While begomoviruses have been reported previously in Saudi Arabia and neighboring countries, the novelty of this work lies in three key findings: (i) the identification of a distinct “Hasa” strain of TYLCV, which clusters with Saudi Arabian isolates but is divergent from other known strains; (ii) the detection of multiple recombination events involving TYLCV, WmCSV, and ToLCPalV, underscoring the dynamic nature of their evolution; and (iii) the simultaneous recovery of full-length genomes from both zucchini and snake gourd, which expands the known host range and confirms cucurbits as important reservoirs of begomovirus diversity.
Phylogenetic analysis revealed contrasting evolutionary trajectories and origins of the identified begomovirus isolates. Two TYLCV isolates (SHT1 and SQT1) grouped tightly with previously reported Saudi Arabian isolates, supporting their local diversification. We propose designating these as members of a new “Hasa” strain of TYLCV. Conversely, SHT2 clustered with the Iranian “Boushehr” strain, suggesting a regional introduction through cross-border virus movement, consistent with the role of trade and whiteflies in mediating long-distance dispersal [14,17]. The WmCSV DNA-A and DNA-B isolates (SHWA1, SQWA1, SQWA2, SHWB1, SQWB1, and SQWB2) grouped into a Saudi Arabian sub-lineage, confirming their endemic establishment and localized circulation within cucurbit-growing regions, reinforcing our previous findings [28,31]. The observed patterns are consistent with WmCSV becoming well-adapted and diversified over time, which highlights its successful and long-term presence in the region. In parallel, the ToLCPalV DNA-A and DNA-B isolates (SQPA1 and SQPB1) clustered within clade-2 alongside isolates from Saudi Arabia and the broader Middle East. This phylogenetic placement supports previous documentation regarding its wide regional dissemination [22,23,31,51]. ToLCPalV, which originated in India, has spread eastward into Pakistan and westward into the Middle East [24]. The recurrent discovery of ToLCPalV from Saudi Arabia suggests that it has been widening its distribution in this region. The detailed analysis by Sattar, et al. [24] indicated that ToLCPalV isolates from the Arabian Peninsula and the Indo-Pak subcontinent are genetically quite distinct. While they share a common ancestor, they have evolved independently, as is evident from the clustering of these isolates into two distinct clades (Figure 3A,B). Together, these results strengthen the idea that Saudi Arabia serves as a regional hub for the circulation and diversification of these begomoviruses.
Our recombination analysis provided compelling evidence of intra- and inter-species genetic exchanges. TYLCV isolates were found to have recombined with both ToLCPalV and WmCSV. Likewise, WmCSV isolate SQWA2 displayed three distinct recombination events involving various WmCSV and TYLCV isolates. Our findings highlight extensive recombination activity among Saudi Arabian cucurbit begomoviruses, driven by mixed infections and the polyphagous nature of the whitefly vector, B. tabaci. Recombination is a well-established driver of begomovirus evolution, enabling host adaptation, virulence shifts, and resistance-breaking [42,52]. The detection of recombination breakpoints within functional genomic regions underscores the likelihood of adaptive consequences for viral fitness and epidemiology. The absence of significant recombination signals in the DNA-B component, as observed in our study, is consistent with some previous reports and could be attributed to different evolutionary pressures or less frequent genetic exchange in this component [21,53]. However, the exact reasons for this differential recombination pattern warrant further investigation.
The contrasting diversity patterns between Iran and the Arabian–Red Sea region indicate distinct evolutionary histories among regional begomoviruses. For both WmCSV and ToLCPalV, the Arabian–Red Sea population showed higher nt diversity, especially in DNA-B, consistent with a genetically heterogeneous and expanding viral pool shaped by repeated introductions and extensive whitefly movement. In contrast, the Iranian population displayed stronger negative neutrality indices and larger θw–π gaps, reflecting an excess of rare variants and a more structured, long-standing viral reservoir. This pattern aligns with earlier work on TYLCV, where full-genome analyses identified greater genetic variability and recombination hotspots in southern Iranian isolates, leading Hosseinzadeh, et al. [25] to propose Iran as a center of TYLCV diversification; Akbar, et al. [26] further showed that Iranian isolates form the basal core of a monophyletic cluster shared with Oman, Saudi Arabia, Pakistan, and Kuwait, indicating historical spread from Iran into the Arabian Peninsula. Our findings also parallel the regional structure reported for ToLCPalV [24], reinforcing that Iranian populations likely represent older, more constrained lineages, whereas Arabian-Red Sea populations constitute more dynamic and admixed viral assemblages shaped by cross-border movement and agricultural connectivity. This regional population structure is statistically confirmed by our differentiation analysis (Table 4), which reveals significant FST values for all analyzed components. The relatively high FST (0.178) and among-population variation (17.75%) for ToLCPalV DNA-A suggest limited gene flow between Iranian and Arabian-Red Sea populations for this component, maintaining their distinct genetic identities. This finding aligns with recent phylogeographic analyses of ToLCPalV, which described genetically distinct lineages circulating in the Arabian Peninsula versus the Indo-Pak subcontinent [24]. In contrast, the lower differentiation observed for WmCSV DNA-B (FST = 0.093) might reflect a higher tolerance for reassortment or more recent shared ancestry. Despite these differences, the consistently high Snn values (>0.80) across all datasets strongly support the conclusion that the Iranian and Arabian-Red Sea viral populations are genetically distinct subdivisions rather than a single panmictic population. These results mirror evolutionary patterns previously reported for TYLCV, where Iranian isolates formed a basal, structured population distinct from the expanding lineages found in the Arabian Peninsula [25,26].
Interestingly, no DNA-satellites were detected by our RCA-NGS pipeline and satellite-directed PCR screens. This suggests that symptoms can be attributed to begomovirus components alone or that satellites were below our detection threshold or absent from the sampled tissues/time points. While the absence of satellites in OW begomoviruses is uncommon, it has been reported in certain agroecosystems where helper viruses are sufficiently competitive [13,54]. Recently, ToLCPalV was reported to infect zucchini in association with tomato leaf curl betasatellite (ToLCB) from Iran [55], and such associations have been documented for different begomoviruses and DNA-satellites [10,56]. In another study, however, Al-Waeli, et al. [57] could not find any association of DNA-satellites with ToLCPalV in Iraq. Their absence in the sequenced samples from our study could be due to methodological factors, such as sequencing depth or RCA bias. Further studies focusing on the presence and role of these satellite molecules in the context of Saudi Arabian cucurbit begomoviruses would be valuable.
This study demonstrates the practical utility of NGS for resolving the complex begomovirus populations in agricultural ecosystems. Since our objective was to assemble high-confidence consensus genomes for taxonomic identification rather than to analyze low-frequency intra-host variants (quasispecies), technical replicates were not employed. The high depth of coverage (>20×) and strict quality filtering ensured the reliability of the dominant viral sequences reported here. By moving beyond traditional primer-dependent methods, NGS provides an unbiased and comprehensive view of viral diversity, enabling the identification of both known and potentially novel viral strains and recombination events [34,58]. It is important to note that while RCA is highly effective for enriching circular DNA for discovery, it may introduce amplification bias and alter the relative proportions of viral components [59]. Therefore, our findings represent a qualitative identification of the circulating viral diversity rather than an absolute quantification of viral load. Our findings underscore the limitations of traditional methods. For example, our previous studies using conventional techniques on snake gourd found only WmCSV infection [30], whereas NGS approaches revealed complex mixed infections involving WmCSV, TYLCV, and ToLCPalV. Our current findings are consistent with earlier work showing that NGS is a successful method for identifying mixed infections of multiple begomoviruses and DNA-satellites in crops such as tomato and muskmelon [22] and cucumber [31]. The detection of multiple begomoviruses and recombinants in zucchini and snake gourd highlights a serious risk to Saudi Arabian cucurbit production. As a high-value greenhouse crop, zucchini is particularly susceptible, with high whitefly infestations creating a conducive environment for virus spread and mixed infections. This robust molecular characterization provides a critical foundation for developing targeted and sustainable disease management strategies, including the deployment of resistant varieties, vector control measures, and improved diagnostic tools.
5. Conclusions
The study employed NGS to resolve a complex begomovirus landscape affecting cucurbit production in Eastern Saudi Arabia, revealing the co-circulation of TYLCV, WmCSV, and ToLCPalV in both single and mixed infections. The genomic data facilitated the identification of a novel viral lineage, TYLCV-Hasa, suggesting that the region is experiencing active local diversification alongside the introduction of distinct lineages from neighboring territories.
Our analysis provides critical insights into the evolutionary mechanisms driving this diversity. We observed that recombination is a pervasive force restricted to the DNA-A components, facilitating intra- and inter-species genetic exchanges, while DNA-B components remain evolutionarily conserved. Furthermore, population genetic analysis highlighted a distinct contrast between the stable, structured viral populations of Iran and the dynamic, expanding populations within the Arabian-Red Sea corridor, identifying the latter as a hotspot for viral admixture and variation. Notably, these infections were established in zucchini and snake gourd without the aid of DNA-Satellites, confirming these hosts as independent reservoirs for begomovirus evolution.
Collectively, these findings demonstrate that traditional primer-based diagnostics are insufficient for capturing the full extent of the localized viral diversity. The discovery of recombinant variants and the TYLCV-Hasa strain underscores the urgent need for continuous genomic surveillance. Implementing these advanced molecular tools will be essential for developing resistant crop varieties and IPM strategies, thereby supporting sustainable agriculture and food security initiatives in the region.
Supplementary Materials
The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/v18010075/s1, Table S1: Field sampling, PCR screening, and NGS identification of cucurbit begomoviruses from Al-Ahsa and Qatif, Saudi Arabia.
Author Contributions
Conceptualization, M.N.S., M.I.A. and S.A.A.H.; methodology, M.N.S., S.M.E.-G. and M.I.A.; software, M.N.S.; validation, M.N.S., M.M. and S.M.E.-G.; formal analysis, M.N.S.; investigation, M.I.A.; resources, M.N.S. and A.A.A.-S.; data curation, M.N.S. and S.M.E.-G.; writing—original draft preparation, M.N.S. and M.M.; writing—review and editing, S.A.A.H. and A.A.A.-S.; supervision, M.N.S. and A.A.A.-S.; project administration, A.A.A.-S., M.N.S. and S.A.A.H.; funding acquisition, A.A.A.-S., M.N.S. and S.A.A.H. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by Deanship of Scientific Research, King Faisal University, Saudi Arabia, grant number KFU260065 and the Research Development and Innovation Authority (RDIA) through grant number (12877-KFU-R-2-1-SE-).
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
All the complete genomic sequences identified from this study are available on NCBI GenBank database under the accession numbers PV645670-77 and OL416207-09.
Acknowledgments
The authors are grateful to the anonymous reviewers and editors for their comments that helped improve the manuscript. The authors have reviewed and edited the output and take full responsibility for the content of this publication.
Conflicts of Interest
The authors declare no conflicts of interest.
Abbreviations
The following abbreviations are used in this manuscript:
| AR | Arabian-Red Sea |
| BC1 | Movement Protein (MP) |
| BIC | Bayesian Information Criterion |
| BV1 | Nuclear Shuttle Protein (NSP) |
| CLCuGeV | Cotton leaf curl Gezira virus |
| CP | Coat Protein |
| CR | Common Region |
| DNA | Deoxyriboncleic Acid |
| FLD | Fu & Li’s D |
| GARD | Genetic Algorithm for Recombination Detection |
| GTR+I | General Time Reversible model with a gamma-distributed rate variation |
| Hd | Haplotype diversity |
| ICTV | International Committee on Taxonomy of Viruses |
| IR | Iranian |
| ML | Maximum Likelihood |
| MP | Movement Protein |
| ND | Not Detected |
| NGS | Next-Generation Sequencing |
| NSP | Nuclear Shuttle Protein |
| nt | Nucleotide |
| NW | New World |
| ORF | Open Reading Fram |
| OW | Old World |
| PCR | Polymerase Chain Reaction |
| RCA | Rolling Circle Amplification |
| RDP | Recombination Detection Program |
| REn | Replication Enhancer protein |
| Rep | Replication-associated protein |
| S | Number of polymorphic sites |
| SDT | Sequence Demarcation Tool |
| ssDNA | Single-stranded DNA |
| TD | Tajima’s D |
| ToLCB | Tomato leaf curl betasatellite |
| ToLCOMV | Tomato leaf curl Oman virus |
| ToLCPalV | Tomato leaf curl Palampur virus |
| ToLCSDV | Tomato leaf curl Sudan virus |
| TrAP | Transcriptional Activator Protein |
| TYLCV | Tomato yellow leaf curl virus |
| WmCSV | Watermelon chlorotic stunt virus |
| θw | Watterson’s estimator |
| π | Nucleotide diversity |
References
- Zerbini, F.M.; Briddon, R.W.; Idris, A.; Martin, D.P.; Moriones, E.; Navas-Castillo, J.; Rivera-Bustamante, R.; Roumagnac, P.; Varsani, A.; Consortium, I.R. ICTV virus taxonomy profile: Geminiviridae. J. Gen. Virol. 2017, 98, 131. [Google Scholar] [CrossRef]
- Fiallo-Olivé, E.; Lett, J.-M.; Martin, D.P.; Roumagnac, P.; Varsani, A.; Zerbini, F.M.; Navas-Castillo, J. ICTV virus taxonomy profile: Geminiviridae 2021. J. Gen. Virol. 2021, 102, 001696. [Google Scholar] [CrossRef]
- Rubino, L.; Abrahamian, P.; An, W.; Aranda, M.A.; Ascencio-Ibañez, J.T.; Bejerman, N.; Blouin, A.G.; Candresse, T.; Canto, T.; Cao, M. Summary of taxonomy changes ratified by the International Committee on Taxonomy of Viruses from the Plant Viruses Subcommittee, 2025. J. Gen. Virol. 2025, 106, 002114. [Google Scholar] [CrossRef]
- Souza, J.O.; Melgarejo, T.A.; Vu, S.; Nakasu, E.Y.; Chen, L.-F.; Rojas, M.R.; Zerbini, F.M.; Inoue-Nagata, A.K.; Gilbertson, R.L. How to be a successful monopartite begomovirus in a bipartite-dominated world: Emergence and spread of tomato mottle leaf curl virus in Brazil. J. Virol. 2022, 96, e00725-22. [Google Scholar] [CrossRef]
- Hanley-Bowdoin, L.; Bejarano, E.R.; Robertson, D.; Mansoor, S. Geminiviruses: Masters at redirecting and reprogramming plant processes. Nat. Rev. Microbiol. 2013, 11, 777–788. [Google Scholar] [CrossRef] [PubMed]
- Gong, P.; Tan, H.; Zhao, S.; Li, H.; Liu, H.; Ma, Y.; Zhang, X.; Rong, J.; Fu, X.; Lozano-Duran, R.; et al. Geminiviruses encode additional small proteins with specific subcellular localizations and virulence function. Nat. Commun. 2021, 12, 4278. [Google Scholar] [CrossRef] [PubMed]
- Liu, H.; Chang, Z.; Zhao, S.; Gong, P.; Zhang, M.; Lozano-Duran, R.; Yan, H.; Zhou, X.; Li, F. Functional identification of a novel C7 protein of tomato yellow leaf curl virus. Virology 2023, 585, 117–126. [Google Scholar] [CrossRef] [PubMed]
- Fondong, V.N. Geminivirus protein structure and function. Mol. Plant Pathol. 2013, 14, 635–649. [Google Scholar] [CrossRef]
- Ferro, C.G.; Zerbini, F.M.; Navas-Castillo, J.; Fiallo-Olivé, E. Revealing the complexity of sweepovirus-deltasatellite–plant host interactions: Expanded natural and experimental helper virus range and effect dependence on virus-host combination. Microorganisms 2021, 9, 1018. [Google Scholar] [CrossRef]
- Sattar, M.N.; Ligthart, M.; Kvarnheden, A. Compatibility and interaction of begomoviruses and DNA-satellites causing leaf curl disease in Asia, Africa and Mediterranean Region. Eur. J. Plant Pathol. 2019, 155, 111–124. [Google Scholar] [CrossRef]
- Yang, X.; Guo, W.; Li, F.; Sunter, G.; Zhou, X. Geminivirus-associated betasatellites: Exploiting chinks in the antiviral arsenal of plants. Trends Plant Sci. 2019, 24, 519–529. [Google Scholar] [CrossRef]
- Gupta, N.; Reddy, K.; Gnanasekaran, P.; Zhai, Y.; Chakraborty, S.; Pappu, H.R. Functional characterization of a new ORF βV1 encoded by radish leaf curl betasatellite. Front. Plant Sci. 2022, 13, 972386. [Google Scholar] [CrossRef]
- Briddon, R.W.; Martin, D.P.; Roumagnac, P.; Navas-Castillo, J.; Fiallo-Olive, E.; Moriones, E.; Lett, J.M.; Zerbini, F.M.; Varsani, A. Alphasatellitidae: A new family with two subfamilies for the classification of geminivirus- and nanovirus-associated alphasatellites. Arch. Virol. 2018, 163, 2587–2600. [Google Scholar] [CrossRef] [PubMed]
- Gilbertson, R.L.; Batuman, O.; Webster, C.G.; Adkins, S. Role of the insect supervectors Bemisia tabaci and frankliniella occidentalis in the emergence and global spread of plant viruses. Ann. Rev. Virol. 2015, 2, 67–93. [Google Scholar] [CrossRef] [PubMed]
- Just, K.; Leke, W.N.; Sattar, M.N.; Luik, A.; Kvarnheden, A. Detection of Tomato yellow leaf curl virus in imported tomato fruit in northern Europe. Plant Pathol. 2014, 63, 1454–1460. [Google Scholar] [CrossRef]
- Rojas, M.R.; Macedo, M.A.; Maliano, M.R.; Soto-Aguilar, M.; Souza, J.O.; Briddon, R.W.; Kenyon, L.; Rivera Bustamante, R.F.; Zerbini, F.M.; Adkins, S. World management of geminiviruses. Annu. Rev. Phytopathol. 2018, 56, 637–677. [Google Scholar] [CrossRef]
- Mabvakure, B.; Martin, D.P.; Kraberger, S.; Cloete, L.; van Brunschot, S.; Geering, A.D.W.; Thomas, J.E.; Bananej, K.; Lett, J.M.; Lefeuvre, P.; et al. Ongoing geographical spread of Tomato yellow leaf curl virus. Virology 2016, 498, 257–264. [Google Scholar] [CrossRef]
- Sobh, H.; Samsatly, J.; Jawhari, M.; Najjar, C.; Haidar, A.; Abou-Jawdah, Y. First report of Squash leaf curl virus in cucurbits in Lebanon. Plant Dis. 2012, 96, 1231. [Google Scholar] [CrossRef]
- Sattar, M.N.; Kvarnheden, A.; Saeed, M.; Briddon, R.W. Cotton leaf curl disease—An emerging threat to cotton production worldwide. J. Gen. Virol. 2013, 94, 695–710. [Google Scholar] [CrossRef]
- Dominguez-Duran, G.; Rodriguez-Negrete, E.A.; Morales-Aguilar, J.J.; Camacho-Beltran, E.; Romero-Romero, J.L.; Rivera-Acosta, M.A.; Leyva-Lopez, N.E.; Arroyo-Becerra, A.; Mendez-Lozano, J. Molecular and biological characterization of Watermelon chlorotic stunt virus (WmCSV): An Eastern Hemisphere begomovirus introduced in the Western Hemisphere. Crop Protect. 2018, 103, 51–55. [Google Scholar] [CrossRef]
- Fontenele, R.S.; Bhaskara, A.; Cobb, I.N.; Majure, L.C.; Salywon, A.M.; Avalos-Calleros, J.A.; Argüello-Astorga, G.R.; Schmidlin, K.; Roumagnac, P.; Ribeiro, S.G. Identification of the begomoviruses squash leaf curl virus and watermelon chlorotic stunt virus in various plant samples in North America. Viruses 2021, 13, 810. [Google Scholar] [CrossRef]
- AlHudaib, K.A.; Almaghasla, M.I.; El-Ganainy, S.M.; Arshad, M.; Drou, N.; Sattar, M.N. High-throughput sequencing identified distinct bipartite and monopartite begomovirus variants associated with DNA-satellites from tomato and muskmelon plants in Saudi Arabia. Plants 2022, 12, 6. [Google Scholar] [CrossRef]
- Heydarnejad, J.; Hesari, M.; Massumi, H.; Varsani, A. Incidence and natural hosts of Tomato leaf curl Palampur virus in Iran. Australas. Plant Pathol. 2013, 42, 195–203. [Google Scholar] [CrossRef]
- Sattar, M.N.; Biju, B.V.; ElGanainy, S.M.; Almaghaslah, M.I.; Al Hashedi, S.A.; Al-Shoaibi, A.A. Genetic diversity, population structure, and cross-border dispersal patterns of tomato leaf curl Palampur virus in South and West Asia. Viruses 2025, 17, 678. [Google Scholar] [CrossRef]
- Hosseinzadeh, M.R.; Shams-Bakhsh, M.; Osaloo, S.K.; Brown, J.K. Phylogenetic relationships, recombination analysis, and genetic variability among diverse variants of tomato yellow leaf curl virus in Iran and the Arabian Peninsula: Further support for a TYLCV center of diversity. Arch. Virol. 2014, 159, 485–497. [Google Scholar] [CrossRef] [PubMed]
- Akbar, A.; Al Hashash, H.; Al-Ali, E. Tomato yellow leaf curl virus (TYLCV) in Kuwait and global analysis of the population structure and evolutionary pattern of TYLCV. Virol. J. 2024, 21, 308. [Google Scholar] [CrossRef]
- Idris, A.; Al-Saleh, M.; Piatek, M.J.; Al-Shahwan, I.; Ali, S.; Brown, J.K. Viral metagenomics: Analysis of begomoviruses by illumina high-throughput sequencing. Viruses 2014, 6, 1219–1236. [Google Scholar] [CrossRef] [PubMed]
- Rezk, A.A.; Sattar, M.N.; Alhudaib, K.A.; Soliman, A.M. Identification of watermelon chlorotic stunt virus from watermelon and zucchini in Saudi Arabia. Can. J. Plant Pathol. 2019, 41, 285–290. [Google Scholar] [CrossRef]
- Sohrab, S.S. Genetic diversity of begomoviruses infecting tomato plant in Saudi Arabia. Saudi J. Biol. Sci. 2020, 27, 222–228. [Google Scholar] [CrossRef] [PubMed]
- Sattar, M.N. Identification and molecular analysis of watermelon chlorotic stunt virus infecting snake gourd in Saudi Arabia. Not. Bot. Horti. Agrobot. 2024, 52, 13857. [Google Scholar] [CrossRef]
- Sattar, M.N.; Almaghasla, M.I.; Tahir, M.N.; El-Ganainy, S.M.; Chellappan, B.V.; Arshad, M.; Drou, N. High-throughput sequencing discovered diverse monopartite and bipartite begomoviruses infecting cucumbers in Saudi Arabia. Front. Plant Sci. 2024, 15, 1375405. [Google Scholar] [CrossRef]
- Hadid, M.; Ahmed, S.M. Role of smart agriculture on food security in Saudi Arabia. In Food and Nutrition Security in the Kingdom of Saudi Arabia, Vol. 1: National Analysis of Agricultural and Food Security; Ahmed, A.E., Al-Khayri, J.M., Elbushra, A.A., Eds.; Springer: Berlin/Heidelberg, Germany, 2024; pp. 229–248. [Google Scholar]
- Bharadwaj, D.; Karmakar, P.; Maurya, B.K.; Singh, H.; Singh, P.; Vinay, N. Snake Gourd Genetic Resources. In Vegetable Crops; Kalia, P., Ed.; Springer: Berlin/Heidelberg, Germany, 2025; pp. 405–421. [Google Scholar]
- Shahid, M.S.; Sattar, M.N.; Iqbal, Z.; Raza, A.; Al-Sadi, A.M. Next-generation sequencing and the CRISPR-Cas nexus: A molecular plant virology perspective. Front. Microbiol. 2021, 11, 609376. [Google Scholar] [CrossRef] [PubMed]
- Wyatt, S.; Brown, J.K. Detection of subgroup III geminivirus isolates in leaf extracts by degenerate primers and polymerase chain reaction. Phytopathology 1996, 86, 1288–1293. [Google Scholar] [CrossRef]
- Andrews, S. FastQC: A Quality Control Tool for High-Throughput Sequence Data. Available online: http://www.bioinformatics.babraham.ac.uk/projects/fastqc/ (accessed on 22 March 2024).
- Bolger, A.M.; Lohse, M.; Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 2014, 30, 2114–2120. [Google Scholar] [CrossRef] [PubMed]
- Vasimuddin, M.; Misra, S.; Li, H.; Aluru, S. Efficient architecture-aware acceleration of BWA-MEM for multicore systems. In Proceedings of the 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS), Rio de Janeiro, Brazil, 20–24 May 2019; pp. 314–324. [Google Scholar]
- Umair, M.; Ikram, A.; Salman, M.; Khurshid, A.; Alam, M.; Badar, N.; Suleman, R.; Tahir, F.; Sharif, S.; Montgomery, J. Whole-genome sequencing of SARS-CoV-2 reveals the detection of G614 variant in Pakistan. PLoS ONE 2021, 16, e0248371. [Google Scholar] [CrossRef]
- Kumar, S.; Stecher, G.; Suleski, M.; Sanderford, M.; Sharma, S.; Tamura, K. MEGA12: Molecular evolutionary genetic analysis version 12 for adaptive and green computing. Mol. Biol. Evol. 2024, 41, msae263. [Google Scholar] [CrossRef]
- Muhire, B.; Martin, D.P.; Brown, J.K.; Navas-Castillo, J.; Moriones, E.; Zerbini, F.M.; Rivera-Bustamante, R.; Malathi, V.G.; Briddon, R.W.; Varsani, A. A genome-wide pairwise-identity-based proposal for the classification of viruses in the genus Mastrevirus (family Geminiviridae). Arch. Virol. 2013, 158, 1411–1424. [Google Scholar] [CrossRef]
- Crespo-Bellido, A.; Martin, D.P.; Duffy, S. Recombination analysis of geminiviruses using Recombination Detection Program (RDP). In Geminiviruses: Methods and Protocols; Springer: Berlin/Heidelberg, Germany, 2025; pp. 125–143. [Google Scholar]
- Martin, D.P.; Varsani, A.; Roumagnac, P.; Botha, G.; Maslamoney, S.; Schwab, T.; Kelz, Z.; Kumar, V.; Murrell, B. RDP5: A computer program for analyzing recombination in, and removing signals of recombination from, nucleotide sequence datasets. Virus Evol. 2021, 7, veaa087. [Google Scholar] [CrossRef]
- Lefeuvre, P.; Martin, D.P.; Harkins, G.; Lemey, P.; Gray, A.J.; Meredith, S.; Lakay, F.; Monjane, A.; Lett, J.M.; Varsani, A.; et al. The spread of tomato yellow leaf curl virus from the Middle East to the world. PLoS Path. 2010, 6, e1001164. [Google Scholar] [CrossRef]
- Brown, J.K.; Zerbini, F.M.; Navas-Castillo, J.; Moriones, E.; Ramos-Sobrinho, R.; Silva, J.C.; Fiallo-Olivé, E.; Briddon, R.W.; Hernández-Zepeda, C.; Idris, A. Revision of Begomovirus taxonomy based on pairwise sequence comparisons. Arch. Virol. 2015, 160, 1593–1619. [Google Scholar] [CrossRef]
- Kheyr-Pour, A.; Bananej, K.; Dafalla, G.A.; Caciagli, P.; Noris, E.; Ahoonmanesh, A.; Lecoq, H.; Gronenborn, B. Watermelon chlorotic stunt virus from the Sudan and Iran: Sequence comparisons and identification of a whitefly-transmission determinant. Phytopathology 2000, 90, 629–635. [Google Scholar] [CrossRef]
- Al-Saleh, M.A.; Al-Shahwan, I.M. Viruses infecting cucurbits in Riyadh, Gassim and Hail regions of Saudi Arabia. Arab. Gulf J. Sci. Res. 1997, 15, 223–254. [Google Scholar]
- Al-Saleh, M.; Amer, M.; Al-Shahwan, I.; Omer, A.; Mohammed, A. Characterization of different isolates of Zucchini yellow mosaic virus from cucurbits in Saudi Arabia. Afr. J. Microbiol. Res. 2014, 8, 1987–1994. [Google Scholar] [CrossRef]
- Soliman, A. Identification of cucumber mosaic virus infecting some vegetable crops in Saudi Arabia. J. Virol. Sci. 2019, 6, 46–54. [Google Scholar]
- Ahmad, M.; Al-Saleh, M.; Al-Shahwan, I.; Shakeel, T.; Ibrahim, Y.; Amer, M. Characterization of cucurbit yellow stunting disorder virus associated with yellowing disease of watermelon in Saudi Arabia. J. Anim. Plant Sci. 2020, 30, 1206–1214. [Google Scholar] [CrossRef]
- Astaraki, S.; Atighi, M.R.; Shams-bakhsh, M. High-throughput sequencing revealed the symptomatic common bean (Phaseolus vulgaris L.) virome in Iran. Sci. Rep. 2025, 15, 1621. [Google Scholar] [CrossRef] [PubMed]
- Fiallo-Olivé, E.; Navas-Castillo, J. The Role of Extensive Recombination in the Evolution of Geminiviruses. In Viral Fitness and Evolution: Population Dynamics and Adaptive Mechanisms; Domingo, E., Schuster, P., Elena, S.F., Perales, C., Eds.; Springer International Publishing: Cham, Switzerland, 2023; pp. 139–166. [Google Scholar]
- Fiallo-Olivé, E.; Navas-Castillo, J. Molecular and biological characterization of a New World monobipartite begomovirus/deltasatellite complex infecting Corchorus siliquosus. Front. Microbiol. 2020, 11, 1755. [Google Scholar] [CrossRef] [PubMed]
- Mirzazadeh, M.; Heydarnejad, J.; Salari, A.; Massumi, H.; Esmaeili, M. Association of tomato leaf curl betasatellite (Betasatellite solani) with the turncurtovirus sesame curly top virus (Turncurtovirus sesami) increases the severity of symptoms in watermelon plants. Trop. Plant Pathol. 2025, 50, 5. [Google Scholar] [CrossRef]
- Heydari-Gharaei, F.; Heydarnejad, J.; Salari, A.; Massumi, H. Interaction between tomato leaf curl Palampur virus and its associated betasatellite enhances disease severity in zucchini. J. Phytopathol. 2025, 173, e70098. [Google Scholar] [CrossRef]
- Shafiq, M.; Sattar, M.N.; Shahid, M.S.; Al-Sadi, A.M.; Briddon, R.W. Interaction of watermelon chlorotic stunt virus with satellites. Australas. Plant Pathol. 2021, 50, 117–128. [Google Scholar] [CrossRef]
- Al-Waeli, M.; Shahmohammadi, N.; Tavakoli, S.; Dizadji, A.; Kvarnheden, A. Infection of tomato in Iraq with tomato leaf curl Palampur virus and multiple variants of tomato yellow leaf curl virus. J. Plant Pathol. 2024, 106, 1283–1294. [Google Scholar] [CrossRef]
- James, A.; Kryovrysanaki, N.; Andronis, C.; Pappi, P.G.; Kalantidis, K.; Katsarou, K. Identification and characterisation of Zucchini yellow fleck virus and a novel Nepovirus from next-generation sequencing of mixed virus infections in cucumbers (Cucumis sativus) from Crete. Ann. Appl. Biol. 2025, 186, 248–261. [Google Scholar] [CrossRef]
- Kutnjak, D.; Tamisier, L.; Adams, I.; Boonham, N.; Candresse, T.; Chiumenti, M.; De Jonghe, K.; Kreuze, J.F.; Lefebvre, M.; Silva, G.; et al. A Primer on the analysis of high-throughput sequencing data for detection of plant viruses. Microorganisms 2021, 9, 841. [Google Scholar] [CrossRef] [PubMed]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.