You are currently on the new version of our website. Access the old version .
VirusesViruses
  • Article
  • Open Access

8 January 2026

Genomic Organization of the Newly Discovered Cassava Congo Cheravirus Reveals a Unique Maf/HAM1 Motif in the C-Terminal Region of the RNA1 Polyprotein and Suggests the Presence of Two Protein Domains Upstream of the Putative Helicase Domain

,
,
,
,
,
and
1
Plant Pathology Laboratory, TERRA-Gembloux Agro-Bio Tech, University of Liège, 5030 Gembloux, Belgium
2
Department of Crop Production, Agriculture Sciences and Environment Faculty, Catholic University of Bukavu, Bukavu, Democratic Republic of the Congo
3
Plant Virus Department, Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures, 38124 Braunschweig, Germany
4
Plant Genetics and Rhizospheric Processes Laboratory, TERRA-Gembloux Agro-Bio Tech, University of Liège, 5030 Gembloux, Belgium
This article belongs to the Section Viruses of Plants, Fungi and Protozoa

Abstract

Cassava (Manihot esculenta) is a staple crop in sub-Saharan Africa threatened by several viral diseases. Here, we describe the genome sequence of a novel bipartite cheravirus (family Secoviridae) infecting cassava in the Democratic Republic of Congo and Tanzania. We designate the new virus “cassava Congo cheravirus”. Each RNA segment encodes a single polyprotein (P1 and P2 for RNA1 and RNA2, respectively), embedded with various putative cleavage sites (six and three in P1 and P2, respectively), consistent with members of the genus Cheravirus. We note two new features in the P1: (i) the presence of two domains, X1 and X2, upstream of the putative helicase region, which we also predict in other cheraviruses and (ii) the presence of a Maf/HAM1-like inosine triphosphatase (ITPase) domain, a rare motif among viruses only previously detected in three potyviruses and a torradovirus, all of which infect plants from the Euphorbia family. Phylogenetic analyses placed the virus firmly within the genus Cheravirus, with amino acid identities in the Pro-Pol and coat protein regions well below existing ICTV species thresholds, supporting its classification as a virus belonging to a new species in the Cheravirus genus. Spatially distinct isolates from Bas-Congo, South-Kivu, and Tanzania form three genetic clusters, with evidence of recombination in both RNA segments. These results expand the known diversity of cassava viruses and suggest possible adaptation to the cassava host via ITPase acquisition.

1. Introduction

In several countries worldwide, the daily caloric needs of millions of people are met through cassava-based foods. The importance of cassava for food security and income is high in sub-Saharan Africa due to its ability to yield storage roots and produce reasonable yields in poor soils with minimal inputs [1]. The threats posed by climate change are increasing the magnitude of the daily challenges facing African agriculture, food security, and nutrition [2]. In this particular context, cassava is recognized as a crop on which African farmers could rely in the future, compared to other leading crops in sub-Saharan Africa (SSA) [3]. The main factors on which this reputation of cassava as a “food bank for the poor” is based include its capacity to grow under environmentally challenging conditions (e.g., drought, low fertility, high temperatures) [4,5,6].
One of the most important limitations for cassava production in Africa is its susceptibility to various pathogens, arthropods and mammal pests [7]. The primary factors that facilitate disease establishment and spread are the vegetative propagation of the crop, the presence of virus vectors, and the long vegetation period that helps to maintain pathogens and pests. From the long list of known pathogens affecting cassava [4,8], Xanthomonas axonopodis pv. manihoti, the cassava brown streak viruses (CBSVs) and the cassava mosaic viruses (CMVs) [1,9,10,11,12] cause the most economic impact. In East and Central Africa, the two virus species associated with cassava brown streak disease (CBSD) and the nine virus species associated with cassava mosaic disease (CMD) are the most important pathogens causing severe yield losses [13]. The list of potentially threatening pathogens is continually expanding with the discovery of new viruses infecting cassava in Africa [8] and America [14,15].
Here, we present the biological and molecular characteristics of a new bipartite virus from the family Secoviridae, detected and isolated from cassava plants grown in the Democratic Republic of the Congo (D.R. Congo).
Secoviridae is the only family of plant-infecting viruses [16,17] in the order picornavirales. The viruses in the family share common properties with other members of the order: (a) a linear positive-strand RNA genome which can be monopartite or bipartite, with each RNA flanked at its 5′ end by a small viral protein, the “VPg”; (b) the presence of a large ORF that encodes a single large polyprotein cleaved by a 3C-like cysteine protease; (c) a conserved “replication bloc” within the RNA1-encoded polyprotein that includes a type III helicase (also referred to as NTP-binding protein or NTB), the VPg, a cysteine protease and a type I RNA-dependent RNA-polymerase; (d) a capsid, composed of coat protein (CP) subunits, that can be divided into a single large CP, or into two or three smaller CPs depending on the genus [18,19,20]. To date, 165 virus species belonging to one subfamily, ten genera and five subgenera have been assigned to the family Secoviridae (ICTV online taxonomy release, 22 August 2025). The genus Cheravirus comprises viruses with a bipartite genome that encodes three capsid proteins on RNA2. RNA1 encodes replication proteins and the protease. Cheraviruses are transmitted by nematodes and seeds [18]. Most cheraviruses can also be transmissible experimentally by mechanical inoculation [18]. Five criteria, not to be simultaneously met, are used to separate genera within the family Secoviridae, whereas seven are helpful in the demarcation of species: a CP amino acid sequence with less than 75% identity, a conserved amino acid sequence region located between the protease and the polymerase domains (Pro-Pol) with less than 80% identity, differences in antigenic reactions, distinct host range, distinct vector specificity, absence of cross-protection and absence of re-assortment between RNA1 and RNA2 for viruses with a bipartite genome [18].
The unique feature of this new secovirus isolate from cassava is the presence of a Ham1 gene in its genome. Maf1/ham1-like proteins (HAM1) are ubiquitous proteins among all living organisms, with a pyrophosphatase activity (ITPase, also termed the “house-cleaning” activity) to detoxify harmful non-canonical nucleotides: inosine triphosphate (ITP), xanthosine triphosphate (XTP) and their deoxy analogues (dITP/dXTP) [21,22,23]. Until recently, only three viruses, all from the Potyviridae family, were known to encode a HAM1 protein homolog to ITPases: CBSV and UCBSV (genus Ipomovirus) [24] and Euphorbia ringspot virus (EuRSV, genus Potyvirus) [25]. Recently, the cassava torrado-like virus (CsTLV, genus TORRADOVIRUS) of the Secoviridae family has also been shown to encode a HAM1-like protein [15,26,27]. Interestingly, all these viruses were isolated from euphorbiaceous host plants (cassava and the ornamental plant Euphorbia milii), for which previous studies have hypothesized higher concentrations of ITP/XTP in the cytoplasm due to the relocation of the plant HAM1 into the nucleus [21,24,28,29]. This high cytoplasmic pool of ITP may act as a selection pressure, particularly on cytoplasm-localized viruses, and promote the acquisition of the HAM1 gene in the viral genome to enable their survival in euphorbiaceous hosts [30].

2. Materials and Methods

2.1. Virus Isolation from Cassava Collected in Bas-Congo

In 2007, during field studies on the diversity of CMD conducted in Democratic Republic of the Congo (Bas-Congo), samples (stem cuttings) were collected from cassava plants. Cuttings were rooted, and plants were established and maintained under greenhouse conditions at the Plant Virus Collection, Leibniz-Institute German Collection of Microorganisms and Cell Cultures (DSMZ). Leaves from virus-infected cuttings exhibited typical mosaic symptoms; however, one cassava plant showed severe mosaic symptoms, leaf blistering and deformation, and conspicuous red necrotic spots, indicating the presence of additional viruses or other pathogens (Figure 1).
Figure 1. Cassava plant collected in 2007 from stem cuttings and maintained in the DSMZ greenhouse at DSMZ. Red necrotic spots (indicated by red arrows) and severe mosaic symptoms suggest the presence of a virus or other pathogen.
Leaf extracts were prepared from leaves of infected plants and mechanically inoculated to a set of herbaceous virus indicator hosts from the families Cucurbitaceae, Solanaceae and Chenopodiaceae, including Nicotiana occidentalis and N. benthamiana. For electron microscopy, sap extracts from symptomatic N. benthamiana leaves were adsorbed onto copper grids (400 mesh × 62 μm pitch), negatively stained with uranyl acetate, and examined by electron microscopy (EM).
For virus purification, leaves from virus-infected N. occidentalis plants were homogenized 10 days post-infection and subsequently subjected to differential centrifugation, including density gradient centrifugation, essentially following a protocol originally designed for tomato ringspot virus [31]. An antiserum against purified virus particles was then prepared (DSMZ AS-0896).
Soluble leaf proteins from virus-infected N. benthamiana plants and purified particle preparations were subjected to a discontinuous sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE, gradient 8–20%) to determine the size of the viral coat protein. Following electrophoresis and transfer to nitrocellulose membranes, viral proteins were detected by incubating the membranes with the antiserum DSMZ AS-0896.
Purified virus preparations were subjected to cDNA synthesis using random-primed cDNA synthesis followed by 2nd strand synthesis and cloning into a pBluescript SK- cloning vector. Positive clones from blue-white screening were sequenced with vector-specific primers, and contigs were assembled using Vector NTI 10.5. PCRs spanning missing genome sequences and 5′ and 3′ RACEs (as described below) were performed to reconstruct the complete viral RNA1 (7405 nt) and RNA2 (3460 nt) genomes. Genome sequences for this virus isolate PV-0896 were deposited into GenBank (RNA1, ON_924323; RNA2, ON_924324) and were included in the molecular analysis and diversity study of cassava virus samples collected in South Kivu Province, analyzed by HTS.

2.2. Virus Survey in South Kivu Province

2.2.1. Sampling, Pooling and Greenhouse Cultivation

From April to October 2019, during investigations on the diversity of viruses present in South Kivu/Eastern Democratic Republic of the Congo, samples were collected in the Uvira territory as previously indicated [32]. Surveyed fields in each village were randomly selected along transects oriented within the main cassava-growing villages, with a minimum distance of 2 km between fields. Two hundred forty-two fields were sampled in total. In each field, a portion of the third fully expanded young leaf on a shoot was collected on ten cassava plants selected following diagonals. Samples from individual plants, symptomatic or not, were conserved in tubes dried with silica gel. All leaf samples from the same field were pooled together to make a unique field-core sample. Stem cuttings from each sampled plant were also collected and grown in an experimental field in Bukavu, while others were shipped to Belgium and grown in a glasshouse at 27 °C.
Cassava plants regenerated through stem cuttings, kept in a glasshouse (27 °C) showed a chlorotic mottle pattern on middle leaves, consisting of irregular, pale interveinal yellow-green patches scattered across the lamina, spreading from secondary veins (Figure 2). These symptoms were absent on newly expanded leaves but developed on mature leaves. The plants tested negative for CBSVs [32] and Manihot esculenta-associated ampelovirus (MEaV-1 and MEaV-2) [8] by RT-PCR.
Figure 2. Cassava landraces ((a): Nabwilalanga, (b): Nabwigoma, (c,d): Rava) collected in 2019 as stem cuttings and grown in a greenhouse at Gembloux Agro Bio-tech, showing pale and irregular chlorotic mottle pattern of leaves.

2.2.2. RNA Extraction, Pooling of Field Samples, and High-Throughput Sequencing

Total RNA was extracted from each field sample (n = 242) using a modified CTAB protocol as previously described [8]. Extracted RNA was further tested for infection with cassava brown streak virus (CBSV) and Ugandan cassava brown streak virus (UCBSV) by RT-PCR, as previously described [32].
All samples that tested negative with this primer-based screening (71 field-core samples representing 710 individual plants collected into 71 farmer fields) were regrouped into 12 equimolar HTS pools (Pool number 1 to Pool number 12) that were subjected to high-throughput sequencing (HTS). DNase treatment was performed using Amplification-grade DNase I according to the manufacturer’s instructions (Life Technologies, Carlsbad, CA, USA). In addition, the RiboZero plant leaf kit for RNA-seq (Life Technologies Limited, Paisley, UK) was used for depleting ribosomal RNAs, and a TruSeq stranded total RNA kit (Illumina, New York, NY, USA) was used for library preparation following the manufacturer’s instructions. RNA libraries were then subjected to NextSeq 500 HTS at the University of Liège GIGA facilities (Liège, Belgium), with a read length of 2 × 75 bp in high output mode. Each library (pool) was sequenced at 5 million reads at 30× depth. In total, approximately 60 million reads were generated. All sequenced read data were deposited in the Sequence Read Archive (SRA) under the bioproject reference PRJNA1311516. Detailed SRA accessions for each sequenced pool are provided in Table S1.

2.2.3. Bioinformatic Analyses

Sequence reconstruction and analysis were performed using Geneious 2021.1.1 software (www.geneious.com) and embedded plugins. Following demultiplexing and quality trimming (BBDUCK, BBMerge, and Dedupe plugins), reads were de novo assembled into contigs using RNA-SPADES (embedded in Geneious). Reconstructed contigs were screened against the RefSeq non-redundant viral database (release 202, 8 September 2020, retrieved from NCBI) using BLASTn and tBLASTx searches (version 2.10.1) [33]. Early screening of reconstructed contigs from 4 HTS pools (Pool number 3, Pool number 4, Pool number 5, Pool number 12) against the RefSeq Viral database revealed nucleotide sequence identities with members of the family Secoviridae. Twenty-five rounds of iterative read mapping extended the size of corresponding contigs to yield the final sequences.
Multiple sequence alignments were reconstructed using ClustalW embedded in Mega X [34]. Maximum-likelihood phylogenetic trees were reconstructed in MEGA X after selecting the appropriate model for amino acid or nucleotide sequence alignments. The Le Gascuel (LG) matrix-based model of amino acid substitution using a discrete Gamma distribution with invariant sites (+G+I) was used on the alignment of the conserved amino acid “Pro-Pol” region, which is defined as the region between the conserved CG motif of the protease and GDD motif of the polymerase (in the RNA1 polyprotein). In contrast, the same substitution matrix (LG), but with frequencies and a discrete Gamma distribution (+G+F), was used to align the coat protein amino acid sequences (in the RNA2 polyprotein). These trees were constructed using translated amino acid sequences derived from nucleotide sequences of the RNA1- and RNA2-encoded polyproteins for reconstructed genomes and from all available genomes of Secoviridae members retrieved from GenBank. Table S2 lists the names of these viruses and the accession numbers for their corresponding RNA1 and RNA2 genomes. Bootstrap analysis (1000 replicates) was performed to evaluate the stability and significance of branches.
Putative reassortments and recombination events within the RNA segments were investigated using the methods implemented by the RDP4 software (version Beta 4.97). Recombination events supported by more than four of the nine integrated methods were considered as possible [35].

2.2.4. Confirmatory RT-PCR, Rapid Amplification of cDNA Ends (RACE), and Sanger Sequencing

Primers were designed to confirm the presence of viral sequences in pooled samples from multiple fields and in individual samples. Primers were also designed to reconstruct the 5′ and the 3′ ends of the assembled genomes. Additional primers were also designed to confirm the presence of the identified HAM1 motif in the RNA1 polyprotein by amplifying the region spanning the motif. All primers are shown in Table S3.
For confirmation of virus presence in pools of individual fields, RNA was extracted using the RNEasy Plus® Kit (QIAGEN®, Hilden, Germany) following the manufacturer’s recommendations. For the RT-PCR confirmation, the first-strand cDNA synthesis was performed using Tetro™ Reverse Transcriptase (Meridian Bioscience, Luckenwalde/Berlin, Germany) and random hexamers, following the manufacturer’s instructions. The PCR was conducted using MangoTaq™ DNA polymerase (Meridian Bioscience). Primer pairs were designed to amplify both extremities (5′ and 3′) of RNA1 and RNA2 (Table S3). A 10 µL PCR mixture containing 5.8 µL nuclease-free water, 2 µL PCR buffer (5X), 0.40 µL MgCl2 (50 mM), 0.20 µL dNTPs (10 mM), 0.20 µL of each primer (10 mM), 0.4 µL Mango Taq DNA polymerase (Meridian Bioscience®, Luckenwalde/Berlin, Germany), and 1.0 µL of cDNA. PCR products were analyzed by electrophoresis in TAE buffer (1X) on a 1% agarose gel stained with Gel red® (Biotium, Fremont, CA, USA), visualized under UV light, and photographed using a gel documentation system (E-Box CX5 Edge, Vilber/Fisher Biotech, Paris, France).
Reconstruction of the 5′ and 3′ ends of genomes was conducted using the SMARTer® RACE 5′/3′ Kit (Takara Bio, San Jose, CA, USA) following the manufacturer’s instructions. Obtained RACE products were Sanger-sequenced using virus-specific primers (Table S3).

2.3. Identification of the RNA1 and RNA2-Encoded Cleavage Sites

Nucleotide sequences corresponding to the RNA1 and RNA2 encoded polyproteins of genomes reconstructed in this study, together with those of representative members of the Secoviridae family (Table S4), were translated into amino acid sequences and aligned using Clustal Omega (Version 1.2.4) [36]. These alignments were used to deduce functional domains based on sequence homologies with related viruses. Putative cleavage sites were then inferred from alignments of cassava Congo cheravirus polyproteins (P1 and P2, respectively) with related Secoviridae members for which protease cleavage sites are experimentally characterized [37,38]. All isolates carried the conserved substrate-binding histidine in the protease domain (Figure S1), typically associated with cleavage specificity at Q or E in the −1 position, as shown for picornaviruses [39], and several members of the family Secoviridae, including cheraviruses [16]. However, in the case of nepoviruses, some proteases with the histidine in the substrate-binding pocket also recognize N, D or H at the −1 position [40].We therefore searched for putative cleavage sites that included Q, E, N, D or H in that position.

2.4. Data Mining

To screen for the newly identified Secoviridae members in publicly available RNASeq data, the nucleotide sequences corresponding to the polymerase domain encoded into the RNA1 segment of the secovirids detected in field-grown cassava plants were screened against the SRA [41] using an RdRp-based search. This search uses the core of the polymerase palm subdomain [42]. The Serratus cloud computing infrastructure was used to implement this search [43]. Raw reads data identified by the Serratus search as containing palm-id motifs closely related to the RdRp of the novel Secoviridae members were retrieved from the SRA using the “Faster Download and Extract Reads in FASTQ format from NCBI SRA” tool integrated into the GALAXY Europe platform [44]. The reads were processed following the bioinformatic analyses detailed in Section 2.2.3 above, using the pipeline and tools stated above. The contigs showing homology to Secoviridae were generated and included in subsequent analyses.
All generated contigs were deposited in Genbank and assigned accession numbers shown in Table 1.
Table 1. Genome organization of the 13 contigs reconstructed in this study.

3. Results

3.1. Isolation, Electron Microscopy, Purification, and Complete Genome Reconstruction of a Cheravirus Isolate from Bas-Congo

N. benthamiana plants infected by mechanical inoculation started showing chlorotic mottle and deformation of the youngest leaves approximately 15 days after inoculation, followed by vein clearing and oak leaf patterns developing on older leaves (Figure 3a). Electron microscopical examinations revealed virus-like particles (VLps) of icosahedral structure (Figure 3b, red arrows), some of which were penetrated by uranyl acetate stain, indicating that they were empty protein shells (Figure 3b, black arrows).
Figure 3. (a) Virus symptoms in N. benthamiana starting with chlorotic mottle and deformation of the youngest leaves. (b) Electron micrograph of leaf homogenates showing VLp of icosahedral structure (red arrows), sometimes empty (black arrows).
Electrophoresis in SDS-PAGE revealed three protein bands in purified virus preparations (Figure 4A, lane 6), indicating the presence of three putative coat protein units of sizes varying between 20 and 30 kDa (Figure 4A). Western immunoblot analysis with the virus-specific antiserum confirmed this finding (Figure 4B, lane 6). Also, it showed a faint reaction with protein preparations from virus-infected N. benthamiana (Figure 4B, lane 8).
Figure 4. (A). Electrophoretic separation (8–20% SDS-PAGE) of soluble leaf proteins from infected N. benthamiana (lanes 1–5, 7, and 8) and purified virus preparation (lane 6). (B), Nitrocellulose western immunoblot incubated with the antiserum DSMZ AS-0896, indicating virus coat protein units (lane 6).
Results from the complete genome assembly revealed that RNA1 and RNA2 are 7405 and 3460 nt long, respectively (Table 1). The presence of three coat proteins and the bipartite nature of the virus genome is typical of members of the genus Cheravirus. The genome sequence of the virus isolate designated CGO-BC, which serves as the reference sequence, has been submitted to GenBank (see below).

3.2. Genome Sequences of the New Cheravirus from Eastern Democratic Republic of the Congo and Data Mining of Publicly Available RNASeq Data

Annotation of contigs assembled from field-grown cassava plants in South Kivu identified eight contigs that showed high nucleotide and amino acid sequence homology with the genome sequence of the cheravirus isolate from Bas-Congo. These contigs originated from four of the twelve sequenced HTS pools (Pool 3, 4, 5 and 12—Table S1).
To uncover other secoviruses genome sequences from the publicly available RNASeq data, the polymerase sequence of the CGO-BC isolate was used for an RdRp search using the Serratus tool [43]. This data screening identified >80% identity with an RdRp palm_id reference u138608 from the RNASeq ERR996012 generated in 2015 to detect and assemble the genomes of CBSV and UCBSV [45]. The corresponding RNASeq dataset was downloaded, and, following demultiplexing, de novo assembly, and BLAST (version 2.10.1) searches as specified above, three contigs showing homology to secovirids were retrieved.
The sequence annotation of the 11 reconstructed contigs via BLASTn on nucleotide database revealed that in total, two groups could be defined according to their sizes: one group of five long contigs (four from field sampling and one from data mining) with sizes varying between 7405 nt and 7795 nt, showing distant similarities with the RNA1 of the stocky prune virus (YP_009665965.1), and another group of six shorter contigs (four from field sampling and two from data mining) with sizes varying between 3460 and 3496 nt (Table 1) showing distant homologies with RNA2 of the same virus (YP_009665964.1).
For the genome sequences identified in South Kivu province and through data mining in Tanzania, the CGO-SK and TZ initials were used, respectively. All 3′ ends were successfully retrieved by 3′ RACE (Note that the contigs retrieved through the SRA RNASeq data were not included in the RACE assay). Alignment of the TZ-55 contig with the CGO-BC reference suggested that the complete 5′ end was recovered. The 11 complete or nearly complete genome sequences were annotated and submitted to NCBI (reference numbers in Table S1). On the other hand, results from field-grown plants also indicated the presence of begomovirus-associated satellites II and III in one pool (Pool 4) and UCBSV in two pools (4 and 5) (accession numbers MW961202.1, MZ486425.1, MW961214.1, MW961221.1). Using publicly available RNASeq data, the results also allowed the identification of both CBSV (KR108830.1, 100% id) and UCBSV (KR108837.1, 100% id) in the same read run in which the secovirid contigs were identified (ERR996012).

3.3. Genome Annotation and Phylogenetic Analysis

3.3.1. Genome Sequences and Annotation

Each of the 13 genome segments showed the canonical organization of cheraviruses, with RNA1 and RNA2 each encoding a single polyprotein. The RNA1 polyproteins ranged from 2369 to 2483 aa, while RNA2 polyproteins ranged from 1067 to 1069 aa (Table 1). The size of the 5′ and 3′UTR regions of the reference isolate (CGO-BC) was 131 and 139 nucleotides for RNA1 and 95 and 139 nt for RNA2. The length of 3′UTR for RNA1 was 153 or 154 nucleotides for South Kivu isolates (fully sequenced) and only 120 nt for the Tanzania isolate (partial sequence). For RNA2, the length of 3′UTR ranged between 150 and 153 nt for South Kivu isolates (fully sequenced). A similar size (153 and 158 nt) was observed for the two RNA2 sequences from Tanzania, but the polyA tail was not reached. All 13 generated contigs were deposited in GenBank and assigned accession numbers shown in Table 1.

3.3.2. Phylogenetic Analysis

Pairwise amino acid comparisons and phylogenetic reconstructions were conducted using two regions: (i) the conserved protease-to-polymerase domain (the Pro-Pol region between the “CG” motif of the protease and the “GDD” motif of the polymerase) of RNA1, and (ii) the coat protein (CP) sequences. The 13 genomes of the putative new cheravirus were included, as well as 63 genomes of other representative Secoviridae species (see Table S2).
Both analyses yielded congruent topologies; the Pro-Pol-based tree is presented in Figure 5 and the CP-based tree in Figure S2.
Figure 5. Phylogenetic analysis of the aligned Pro-Pol amino acid region (between the “CG” motif of the protease and the “GDD” motif of the polymerase) of the cassava Congo cheravirus isolates (highlighted in green together with members from the cheravirus genus) and of the exemplar isolates from recognized species in the family Secoviridae (detailed information on these viruses is provided in Table S2). The amino acid sequence of the “Pro-Pol” region was deduced from the nucleotide sequence of the corresponding genomic RNA from the representative virus of each genus in the Secoviridae family. The alignments were generated using ClustalW integrated into MEGA X. The Maximum likelihood phylogenetic trees were also reconstructed in MEGA X using the Poisson model with uniform distribution for amino acid sequence alignments. The trees were annotated and formatted using iTOL v6 [46]. Bootstrap values are displayed at internal branch nodes and expressed as percentages based on 1000 replicates. The tree was rooted using the Pro-Pol sequence of poliovirus (EVC, species Enterovirus C, genus Enterovirus, family Picornaviridae).
The cassava isolates consistently clustered with recognized members of the genus Cheravirus, including stocky prune virus (StPV), arracacha virus B (AVB), currant latent virus (CuLV), cherry rasp leaf virus (CRLV), and apple latent spherical virus (ALSV) with strong bootstrap support.
Amino acid sequence identities between the cassava isolates and known Secoviridae species members were below 47% for the Pro-Pol region and below 13% for the CP region (Table 2). These values fall well below the ICTV species demarcation thresholds of 80% (Pro-Pol) and 75% (CP), supporting the classification of these isolates as a novel Cheravirus species. The Pro-Pol region of all isolates described for the new cheravirus exhibited a 36-aa insertion, as shown in Figure S3.
Table 2. Range of identity percentages for the conserved “Pro-Pol” region and the coat protein amino acid sequences between isolates described in this study and viruses included in representative species of various genera in the family Secoviridae.

3.4. Genomic Diversity of the Viruses Included in the New Cheravirus Species

Pairwise amino acid sequence identities amongst isolates of the new virus indicated that the lowest identity percentage for the Pro-Pol and CP sequences were 84% (BC and TZ isolates) and 70% (SK and TZ isolates), respectively (Table 3). The isolates from South Kivu showed high sequence identities, with a minimum of 96.5% and 97.6% for Pro-Pol and CP, respectively.
Table 3. Intergroup and intragroup average pairwise identity and standard deviation were calculated for the “Pro-Pol” region, the CP block, the RNA1 and RNA2-encoded polyproteins (P1 and P2, respectively), and the nucleotide sequences of (nearly) complete genomes of the RNA1 and RNA2 segments.
Phylogenetic reconstructions based on these alignments (and on the complete genome sequences) revealed that the 4 isolates from South Kivu formed a distinct cluster regardless of the genomic region considered. In addition, for RNA2, the two Tanzanian isolates always clustered together (Figure S4). The phylogenetic placement of the Bas-Congo reference isolate (CGO-BC) varied depending on the genomic region analyzed: clustering with South-Kivu (Pro-Pol and Polyprotein 1), with Tanzania (CP and RNA1 genome), or in a single cluster (Polyprotein 2 and RNA2 genome), suggesting possible reassortment or recombination of the RNAs.
Recombination analysis (RDP4) suggested recombination events in both RNA1 and RNA2, with several events supported by at least four algorithms. For example, CGO-SK3 and CGO-SK5 were identified as recombinants in RNA1 (the Pro-Pol region comprised between nucleotides 189 and 406 of the reference genome), while CGO-SK12 and TZ55 displayed recombination features in RNA2 (nucleotides 1 to 328) (Table S5).
Given that the ICTV demarcation criteria for amino acid identity (Pro-Pol < 80% and CP < 75%) were not consistently met across comparisons, we propose that all cassava isolates described here represent a single novel species within the genus Cheravirus. We suggest the name “cassava Congo cheravirus” for this novel virus. Further separation into additional species may require biological evidence, such as host range, vector specificity, or re-assortment capacity [18].

3.5. Prevalence of the Newly Characterized Cheravirus in South Kivu Province

Using HTS analyses of pooled samples, the new cheravirus was detected in 4 pools, including 33 fields, whereas the 8 other pools (representing 38 fields) tested negative (no viral reads). Among the four pools, RT-PCR testing of field-core samples indicated that 23 fields were infected with the virus. The field prevalence was therefore 32% (23 of 71 sequenced). The individual samples were also tested by RT-PCR, yielding 130 positive results among the 230 tested plants. The incidence within an infected field ranged from 20 to 60% with an average of 40%. The geographical repartition of infected and non-infected fields is illustrated in Figure 6.
Figure 6. Geographical distribution and prevalence of cassava Congo cheravirus in the 71 cassava fields of Uvira territory in the South Kivu province/Democratic Republic of the Congo. The central map shows the surveyed territory, with its three collectivities (Bafulero, Bavira, and Plaine de la Ruzizi). A dot represents each surveyed field. Yellow dots correspond to negative fields, while orange and dark red dots indicate positive fields with increasing prevalence. The panels on the left locate the study area at the national and continental scales.
The highest proportion of positive fields with higher prevalence (30–60%) clustered around the “Plaine de la Ruzizi” collectivity, located in the North-Western part close to Burundi and Rwanda, while southern and eastern areas showed mostly negative or low-prevalence detections (0–20%), suggesting either lower virus circulation or undersampling. These negative fields are interspersed with positive fields, indicating that infection is not uniform within hotspot-like zones, likely due to the presence of landraces that may not have been inoculated or that did not originate from virus-carrying material. Hotspots could be linked to intensive cassava cultivation, movement of planting material, or favorable agroecological conditions for virus persistence. This patchiness suggests localized spread rather than territory-wide prevalence.

3.6. Identification of the RNA1 and RNA2 Encoded Proteins and Cleavage Sites

The genomic organization of the RNA2 polyprotein was consistent with that of other cheraviruses, with three predicted cleavage sites (Figure 7, Table 4 and Table 5), delineating a putative movement protein (MP) and three capsid subunits (CP1-CP3) [26,47,48,49]. These proposed cleavage sites included a D (or E) at the −1 position, a small amino acid (G, A, M, or S) at the +1 position, and a highly conserved G at the −2 position.
Figure 7. Putative consensus cleavage sites delineating putative functional domains predicted within the RNA1 (P1) and the RNA2 (P2)-encoded polyproteins. Predicted cleavage sites are shown as short vertical lines above and behind the full-length polyproteins, along with the predicted cleaved peptide in italic. Deduced functional domains based on sequence homologies with related viruses are presented as follows: X1; X2; NTB, nucleoside triphosphate binding protein or putative helicase; VPg, viral genome-linked protein; Pro, protease; Pol, polymerase; HAM1, putative inosine triphosphate pyrophosphatase; MP, movement protein; CP1, CP2, CP3, capsid protein 1,2, and 3, respectively.
Table 4. Location of putative cleavage sites predicted in the RNA1 and RNA2-encoded polyproteins for isolates identified in cassava plants in this study.
Table 5. Consensus cleavage sites of isolates from this study and those of previously described viruses from the genus Cheravirus.
Previous analyses of the cheravirus P1 polyproteins had identified four cleavage sites, delineating five protein domains (Pro-Co, NTB, VPg, Pro, and Pol) [26,47,49,50]. We identified putative cleavage sites that aligned well with these four cleavage sites and were consistent with the consensus sequence derived from the analysis of the cassava Congo cheravirus P2 polyprotein described above. The sequence GD/A was predicted as a putative cleavage site upstream of the NTB domain. This cleavage site aligned well with cleavage sites previously reported for other cheraviruses (GQ/G for other cheraviruses). A tentative GD/T cleavage site is proposed at the VPg-Pro junction for the cassava Congo cheravirus isolates, corresponding to the putative GQ/G or GE/G cleavage sites identified for other cheraviruses. The proposed NTB-VPg cleavage site is GQ/G (Table 4). This cleavage site was identical to that of ALSV and CuLV and aligned well with cleavage sites predicted for other cheraviruses. Cleavage between the Pro and Pol domain could occur at one of the two possible cleavage sites suggested in Table 4 (AQ/A or LD/A). These two putative cleavage sites are in proximity to each other, separated by only two amino acids.
Further analysis identified an additional cleavage site in the N-terminal region of the polyprotein that would delineate two protein domains upstream of the NTB domain, similar to the X1 and X2 domains previously identified in the P1 polyprotein of viruses in the genera Nepovirus and Sadwavirus [19,37,38,40,51]. The proposed cassava Congo cheravirus X1-X2 cleavage site has a sequence of GD/G, which is consistent with the cleavage site consensus sequence emerging for this virus. We reexamined the P1 polyprotein sequence of other cheraviruses. We could also find putative X1-X2 cleavage sites at a corresponding position for all cheraviruses, suggesting that the presence of two protein domains upstream of the NTB domain is a conserved feature in the genus Cheravirus.
We also noted that the C-terminal region of the P1 polyprotein (downstream of the Pro-Pol cleavage site) is longer than that observed in other cheraviruses, with a possible additional protein domain downstream of the polymerase (the Ham1 domain, which encodes an ITPase; see below). We searched the polyprotein for an additional cleavage site. We identified a putative cleavage site downstream of the Pol domain that was generally consistent with the consensus for other cleavage sites in the P1 polyprotein (FMSAD/GL), with the G at the −2 position replaced by another small amino acid. Thus, our analysis identifies six cleavage sites in the RNA1-encoded polyprotein, delineating seven protein domains (Figure 7).
Most putative cleavage sites included an aspartic acid (D) at the −1 position, although two putative cleavage sites had a Q at the −1 position (NTB-VPg and possibly Pro-Pol cleavage sites). A glycine (G) was most often found in the +1 position of the cleavage sites, although some putative cleavage sites also had other small amino acids (T, A, S or M) at this position. Comparing the consensus cleavage site of the cassava Congo cheravirus isolates to those of other cheraviruses previously described, the strong preference for an aspartic acid (D) in the −1 position is unique to cassava Congo cheravirus isolates (Table 5). This preference has not yet been described for other viruses in the family Secoviridae. The protease of blackcurrant reversion virus, a nepovirus, prefers an N at the −1 position of the cleavage sites and occasionally recognizes cleavage sites with a D at the −1 position [40].
In conclusion, the putative cleavage site consensus for the cassava Congo cheravirus is (G, a) (D, Q)/(G, A, t, s, m). The comparison of this consensus to what was observed previously for other cheraviruses suggests a strong preference for a G at −2 position for all cheraviruses, including the ones reported in this study. The strong preference for a D at the −1 position in the cassava Congo cheravirus is somewhat unique. However, the predicted NTB-VPg cleavage site for CRLV also has an aspartic acid (D) at the −1 position (Table 4). Considering findings from [26], who previously compared cleavage sites of three characterized cheraviruses, the putative cleavage-site consensus for viruses in the genus Cheravirus can be summarized as (G, a) (Q, E, D)/(G, s) (Figure 8).
Figure 8. Sequence logos from the P6 to P1′ positions of cleavage sites corresponding to the cheravirus genus, including the CaSeCoV (the exemplar isolate CGO-BC), and a single isolate for each apple latent spherical virus (ALSV), cherry rasp leaf virus (CRLV), currant latent virus (CuLV), and arracacha virus B (AVB). The number of cleavage site sequences included in each sequence logo is N = 7. Sequence logos were generated as described in the methodology, using confidently predicted or experimentally validated cleavage sites listed in Table 4. Amino acid colours correspond to the chemistry: polar amino acids (G, S, T, Y, C, green), neutral amino acids (Q, N, purple), basic amino acids (K, R, H, blue), acidic amino acids (D, E, red) and hydrophobic amino acids (A, V, L, I, P, W, F, M, black).

3.7. Protein Domains in the N-Terminal Region of the RNA1 Polyprotein

The X1 protein is highly variable among nepoviruses and cheraviruses, and its function remains unknown [40]. Similarly, this domain was the most variable in size amongst the cassava Congo cheravirus isolates. The X1 domain of the isolate CGO-BC was the shortest (418 aa) while that of CGO-SK12 was the longest (531 aa).
The function of the X2 domain has previously been investigated in the nepovirus tomato ringspot virus (ToRSV). The X2 protein crosses the membrane twice since it possesses two transmembrane helices [52]. Using the algorithm of Kyte and Doolittle with a window size of 17 amino acids [53], two hydrophobic domains (putative transmembrane helices) were also predicted in the X2 domain of the cassava isolates, as well as that of other cheraviruses (Figure 9).
Figure 9. Computer-assisted prediction of transmembrane helices (TMH) in the X2-encoded domain of the RNA1 segment: (A) The hydrophobicity plot of X2, calculated using the algorithm of Kyte and Doolittle with a window size of 17 amino acids [53]. The RNA1-encoded polyprotein is shown at the top of the panel, with the indicated protein domains. Vertical lines represent the putative cleavage sites recognized by the RNA1-encoded 3C-like proteinase of the new cheravirus. (B) Multiple amino acid sequence alignment of the X2 domains for the new cheravirus isolates, along with other existing cheraviruses, showing amino acid residues associated with the putative double trans-membrane helices for a 2-pass transmembrane X2 domain of viruses reported in this study similar to that of the experimentally confirmed ToRSV (nepovirus) (dark-yellow highlighting in alignment). Amino acids are numbered from the first amino acid of the P1 polyprotein.
With sizes varying between 906 and 916 amino acids for the six isolates, the polymerase was the longest protein of the RNA1-encoded polyprotein. The “Pro-Pol” sequence, defined as the sequence between the conserved catalytic cysteine (or serine) of the 3C-like protease (CG) and the GDD motif of the RNA-dependent RNA polymerase, characteristic of viruses in the family Secoviridae (used for phylogenetic classification) [19], was quite “particular” for the viruses described in this study. They had an insertion of 36 additional amino acids, an uncommon feature compared to the sequences of all other included viruses (Figure S3).

3.8. A Putative Maf/HAM1 Motif Within the Polymerase Domain

The RNA1-encoded polymerase displayed an insertion of 36 amino acids and contained an extended C-terminal region with similarity to inosine triphosphate pyrophosphatases (ITPases; HAM1 family). This putative HAM1 domain was approximately 232 amino acids in length and likely cleaved from the polymerase at a consensus site FMSAD/GL) (Figure S5). Phylogenetic analysis grouped the viral HAM1-like proteins with those from cassava torrado-like virus (CsTLV) and euphorbia ringspot virus (EuRSV), forming a clade adjacent to bacterial ITPases (Figure 10).
Figure 10. Phylogenetic relationships of Maf/HAM1 motifs between viruses reported in this study (framed in blue) with those of representative members of different kingdoms of life (viruses are pointed in solid red circles). The alignment was generated using ClustalW integrated into MEGA X, and a maximum-likelihood phylogenetic tree was reconstructed in MEGA X using the LG + G model for amino acid sequence alignments. Detailed information on these sequences (complete names) is provided in Table S6.
The exact function of the viral ITPase has not been broadly characterized except for investigations by [30], who proposed that the presence of Ham1 proteins with highly conserved ITPase motifs for the U/CBSV served as a necrosis determinant in Nicotiana benthamiana and could act as a selection determinant during infections of cassava. Moreover, ref. [54] categorized key residues/motifs that influence ITPase substrate specificity in human ITPase. They identified three essential, four intermediates, and one dispensable amino acid residue.
To investigate whether crucial residues from the human ITPase active sites are conserved in ITPases from other organisms, including the viruses reported in this study, we performed an amino acid alignment (Figure S6). The results of this alignment are summarized in Table 6.
Table 6. Comparison of conserved amino acid residues in ITPases of representative members of various kingdoms of life (including viruses reported in this study) to essential amino acid residues that influence substrate specificity in human ITPase.
Alignment of active-site residues revealed conservation of two essential amino acids (arginine and tryptophan) and conservative substitutions at the third (glutamic acid to aspartic acid). This pattern parallels findings in CsTLV and contrasts with Potyviridae HAM1 proteins (e.g., CBSV, UCBSV), where all key residues are conserved [30,54].
These observations suggest that the viral HAM1 motif identified for cassava Congo cheraviruses could be functionally conserved, potentially influencing substrate specificity and pathogenicity during cassava infections.

4. Discussion

This study reports the genome characterization of cassava Congo cheravirus, the first cheravirus documented in cassava. Genome architecture, including a bipartite organization and the presence of three coat proteins, is consistent with viruses of the genus Cheravirus. However, two new features were identified for the P1 polyprotein: (i) the presence of two domains (X1 and X2) upstream the NTB region, which was previously only identified in members of the genera Nepovirus (Sanfacon, 2022) [37,38], and Sadwavirus [51], and appears to be conserved in all characterized members of the genus Cheravirus; and (ii) a unique Maf/HAM1-like inosine triphosphatase (ITPase) motif, not present in other cheraviruses and only previously identified in a few viruses infecting euphorbiaceous hosts [15,24,25,55,56]. The latter may confer a selective advantage in cassava by mitigating the incorporation of non-canonical nucleotides, a hypothesis that requires experimental validation [56].
A notable outcome of this study is the detailed prediction of P1 and P2 cleavage sites based on multiple sequence alignment with well-characterized secovirids. While experimental validation is pending, the congruence between alignment-based predictions and SDS-PAGE evidence for the three coat proteins provides high confidence in the proposed proteolytic map. The identification of two cleavage sites upstream of the NTB further strongly supports the presence of X1 and X2 domains, a newly documented feature for this genus [51]. Additionally, the putative consensus cleavage peptide motifs exhibit a distinct preference for aspartic acid at the −1 position, suggesting functional divergence in protease specificity relative to other cheravirus members. This refined proteolytic framework offers a valuable basis for future mutagenesis and functional studies [57].
Phylogenetic analysis placed the virus within the genus Cheravirus, with amino acid identities in both the conserved Pro-Pol and coat protein regions falling well below the ICTV species demarcation thresholds. Isolates from Bas-Congo, South-Kivu, and Tanzania formed three distinct clusters, and recombination signals in both RNA segments suggest ongoing genetic exchange. The recovery of viral sequences from historical RNAseq datasets indicates that the virus has circulated undetected for years, potentially aided by mild or latent symptom expression.
From a taxonomic perspective, the virus meets species-level criteria for Cheravirus but does not satisfy genus-level demarcation criteria to warrant reclassification [58]. The identification of the HAM1 motif within a Cheravirus genome expands the functional diversity known within this genus and may inform studies of viral adaptation in perennial and woody euphorbiaceous hosts [27].
The biological characterization of the newly identified virus has achieved several steps proposed in a recently published framework [59]. Viral particles of this proposed new species have been observed, the expression of the coat protein validated, and its mechanical transmission to N. benthamiana demonstrated. In addition, we report its presence in two countries, at locations at least 1700 km apart. A more in-depth study in South Kivu province underscored its widespread distribution (32% field prevalence and an average of 40% individual-plant incidence per field). This relatively high prevalence might suggest a common asymptomatic presence in propagated planting material, or rapid horizontal dissemination within or between fields, or from other ecosystems.
The geographical distribution of infected fields indicates heterogeneity, with the virus concentrated in specific foci rather than evenly spread across the Uvira territory. The coexistence of high-prevalence fields in proximity with lower-prevalence/negative fields could indicate recent introduction followed by local spread. Alternatively, this situation could reflect the persistence of vegetative propagation practices. Whatever the situation, this coexistence suggests that local transmission is incomplete, and virus spread may depend heavily on movement of cutting between farms rather than airborne or insect-only transmission.
Cheraviruses are classically transmitted by soil-borne nematodes, whose limited mobility typically results in aggregated or patchy distributions of infected plants within fields. The spatial distribution observed for cassava Congo cheravirus, characterized by localized hotspots interspaced with virus-free fields, is consistent with such transmission dynamics. However, cassava is predominantly propagated vegetatively, and the exchange of infected cuttings between neighboring farms [32] may also contribute to clustered infection patterns. In the absence of direct evidence for nematode vectors in this study, the observed distribution should therefore be interpreted as compatible with, but not diagnostic of, nematode-mediated transmission. Identification of potential vectors and the development of transmission assays will be required to clarify the epidemiological drivers of viral spread.
Additionally, the geographical concentration of highly prevalent fields in the Western part of the Uvira Territory highlighted that this virus is more prevalent in areas with more widespread cassava cultivation [32], located close to Burundi and Rwanda, which might also facilitate cross-border dissemination. This situation would raise concern about potential regional spread via the exchange of planting material, provided that it has a significant impact on yield.
Future research opportunities include characterizing potential transmission routes and vectors, and expanding symptomology and yield impact assessments. Performing functional assays for ITPase would also bring more insights into this function identified for virus infecting Euphorbiaceae hosts, including two recently discovered viruses carrying this motif [27].
In conclusion, all cassava isolates described in this study represent a single novel species within the genus Cheravirus. This virus, the “cassava Congo cheravirus”, has unique genomic traits that may influence its interactions with cassava and its epidemiology. Its detection in geographically distant regions underscores the importance of coordinated surveillance and functional studies to determine its roles in cassava growth and health.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/v18010084/s1, Table S1: Lengths, number of reads integrated, percent of total reads, and average coverage for viruses included in the family Secoviridae contigs from the various analyzed pools of cassava samples. Table S2: Complete names of selected Secoviridae members used for the phylogenetic study. Table S3: Primers used for Confirmatory RT-PCR, RACE, and Sanger sequencing for HAM1 motif confirmation. Table S4: Accession numbers and abbreviations for selected Secoviridae members used to deduce cleavage sites and functional domains of the RNA1 and RNA2 encoded polyproteins (respectively P1 and P2). Figure S1: Multiple amino acid sequence alignment of the protease domain localises the conserved Histidine of the substrate binding pocket region and the protease catalytic Cysteine. Figure S2: Phylogenetic analysis of the coat protein amino acid for the five isolates detected in cassava (strong green triangles and green frame) and of type isolates from recognized species in the family Secoviridae. Figure S3: Multiple amino acid sequence alignment of the conserved “CG-GDD” motif shows the insertion of 36 amino acids (highlighted in red) upstream of the “GDD” motif in the polymerase domain. Figure S4: Phylogenetic trees reconstructed using (A) the amino acid sequences of the “Pro-Pol” region, (B) the amino acid sequences of the entire P1 polyprotein, (C) the complete/nearly nucleotide sequence of the RNA1 segment (D) the amino acid sequences of the coat protein block, (E) the amino acid sequences of the entire P2 polyprotein and (F) the nearly/complete nucleotide sequence of the RNA2 segment for the new Secoviridae members described in this study. Figure S5: Multiple amino acid sequence alignment of the C-terminal region of the polymerase domain of viruses reported in this study, together with the arracacha virus A. The figure shows a putative cleavage site (FMSAD/G) that could delineate a putative domain downstream of the polymerase. Table S5: Recombination events detected among isolates from the new cheravirus. Figure S6: Multiple amino acid sequence alignment showing conserved sites of ITPases from representative members of various kingdoms of life. The alignment was built using Mview [60]. Table S6: Complete names and accession numbers for representative members of various kingdoms of life used for phylogenetic study of the HAM1 motifs.

Author Contributions

Conceptualization, S.M., H.V., S.W. and H.S.; methodology, S.M.; Y.B.K., S.W. and H.S.; software, Y.B.K., and S.M.; validation, S.M., S.W., H.S. and H.V.; formal analysis, Y.B.K., S.W., S.M. and H.S.; investigation, Y.B.K., S.W. and K.V.; resources, S.M., S.W. and H.V.; data curation, Y.B.K., S.W. and S.M.; writing—original draft preparation, Y.B.K.; writing—review and editing, S.M., H.S., S.W., H.V. and E.B.B.; visualization, Y.B.K. and S.W.; supervision, S.M.; project administration, H.V. and S.M.; funding acquisition, H.V. and S.M. All authors have read and agreed to the published version of the manuscript.

Funding

This study was carried out as part of the ARES (Académie de Recherche et d’Enseignement Supérieur, Fédération Wallonie-Bruxelles, Belgium)-funded project iCARE (Improved Cassava Virus Resistance mitigation strategies and development of a disease-free seed system).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The sequencing reads generated through this study are available via the Sequence Reads Archive as a bioproject reference PRJNA1311516 (SRR35169868, SRR35169867, SRR35169866, SRR35169865). The genome sequence datasets generated and analyzed in this study are available in the GenBank repository under the following accession numbers: ON_924323, OP440440, OP440437, OP440438, OP440439, OP440441, ON_924324, OP440433, OP440432, OP440431, OP440434, OP440436, OP440435.

Acknowledgments

The authors gratefully acknowledge Serge Mukotanyi for his valuable contributions to GIS analysis and cartography.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Parmar, A.; Sturm, B.; Hensel, O. Crops that feed the world: Production and improvement of cassava for food, feed, and industrial uses. Food Secur. 2017, 9, 907–927. [Google Scholar] [CrossRef]
  2. Kabirou Dodo, M. Understanding Africa’s Food Security Challenges. In Food Security in Africa; IntechOpen: London, UK, 2021. [Google Scholar] [CrossRef]
  3. Jarvis, A.; Ramirez-Villegas, J.; Campo, B.V.H.; Navarro-Racines, C. Is Cassava the Answer to African Climate Change Adaptation? Trop. Plant Biol. 2012, 5, 9–29. [Google Scholar] [CrossRef]
  4. Lebot, V. Tropical Root and Tuber Crops: Cassava, Sweet Potato, Yams and Aroids; Crop Production Science in Horticulture, 17; CABI: Wallingford, Oxfordshire, UK, 2009. Available online: https://books.google.be/books?id=rFwyrKRSMUMC (accessed on 15 October 2022).
  5. Burns, A.; Gleadow, R.; Cliff, J.; Zacarias, A.; Cavagnaro, T. Cassava: The Drought, War and Famine Crop in a Changing World. Sustainability 2010, 2, 3572–3607. [Google Scholar] [CrossRef]
  6. Howeler, R.H. The Cassava Handbook: A Reference Manual based on the Asian Regional Cassava Training Course Held in Thailand; The International Center for Tropical Agriculture (CIAT): Bangkok, Thailand, 2012. [Google Scholar]
  7. Otun, S.; Escrich, A.; Achilonu, I.; Rauwane, M.; Lerma-Escalera, J.A.; Morones-Ramírez, J.R.; Rios-Solis, L. The future of cassava in the era of biotechnology in Southern Africa. Crit. Rev. Biotechnol. 2022, 43, 594–612. [Google Scholar] [CrossRef]
  8. Kwibuka, Y.; Bisimwa, E.; Blouin, A.G.; Bragard, C.; Candresse, T.; Faure, C.; Filloux, D.; Lett, J.-M.; Maclot, F.; Marais, A.; et al. Novel ampeloviruses infecting cassava in central Africa and the south-west Indian ocean islands. Viruses 2021, 13, 1030. [Google Scholar] [CrossRef] [PubMed]
  9. Maruthi, M.N. Limiting the impact of Cassava Brown Streak Disease on Smalholders, Womens and the Cassava Value Chain, Interim Project Narrative Report Submitted to the African Union Commission; African Union Commission: Addis Abeba, Ethiopia, 2015. [Google Scholar]
  10. Thresh, J.M.; Otim-Nape, G.W.; Legg, J.P.; Fargette, D. African cassava mosaic virus disease: The magnitude of the problem. Afr. J. Root Tuber Crop 1997, 2, 13–19. [Google Scholar]
  11. Legg, J.P.; Owor, B.; Sseruwagi, P.; Ndunguru, J. Cassava mosaic virus disease in East and Central Africa: Epidemiology and management of a regional pandemic. Adv. Virus Res. 2006, 67, 355–418. [Google Scholar] [PubMed]
  12. Rey, C.; Vanderschuren, H. Cassava Mosaic and Brown Streak Diseases: Current Perspectives and Beyond. Annu. Rev. Virol. 2017, 4, 429–452. [Google Scholar] [CrossRef]
  13. McCallum, E.J.; Anjanappa, R.B.; Gruissem, W. Tackling agriculturally relevant diseases in the staple crop cassava (Manihot esculenta). Curr. Opin. Plant Biol. 2017, 38, 50–58. [Google Scholar] [CrossRef]
  14. Jimenez, J.; Caicedo, S.; Pardo, J.M.; Gil-Ordóñez, A.; Alvarez-Quinto, R.; Mollov, D.; Cuellar, W.J. Single torradovirus infections explain the mysterious cassava frogskin disease in the Americas. Sci. Rep. 2024, 14, 29648. [Google Scholar] [CrossRef]
  15. Leiva, A.M.; Jimenez, J.; Sandoval, H.; Perez, S.; Cuellar, W.J. Complete genome sequence of a novel secovirid infecting cassava in the Americas. Arch. Virol. 2022, 167, 665–668. [Google Scholar] [CrossRef]
  16. Sanfaçon, H.; Wellink, J.; Le Gall, O.; Karasev, A.; Van Der Vlugt, R.; Wetzel, T. Secoviridae: A proposed family of plant viruses within the order Picornavirales that combines the families Sequiviridae and Comoviridae, the unassigned genera Cheravirus and Sadwavirus, and the proposed genus Torradovirus. Arch. Virol. 2009, 154, 899–907. [Google Scholar] [CrossRef]
  17. Walker, P.J.; Siddell, S.G.; Lefkowitz, E.J.; Mushegian, A.R.; Adriaenssens, E.M.; Dempsey, D.M.; Dutilh, B.E.; Harrach, B.; Harrison, R.L.; Hendrickson, R.C.; et al. Changes to virus taxonomy and the Statutes ratified by the International Committee on Taxonomy of Viruses (2020). Arch. Virol. 2020, 165, 2737–2748. [Google Scholar] [CrossRef]
  18. Thompson, J.R.; Dasgupta, I.; Fuchs, M.; Iwanami, T.; Karasev, A.V.; Petrzik, K.; Sanfaçon, H.; Tzanetakis, I.; van Der Vlugt, R.; Wetzel, T.; et al. ICTV virus taxonomy profile: Secoviridae. J. Gen. Virol. 2017, 98, 529–531. [Google Scholar] [CrossRef]
  19. Sanfaçon, H.; Dasgupta, I.; Fuchs, M.; Karasev, A.V.; Petrzik, K.; Thompson, J.R.; Tzanetakis, I.; van der Vlugt, R.; Wetzel, T.; Yoshikawa, N. Proposed revision of the family Secoviridae taxonomy to create three subgenera, ‘Satsumavirus’, ‘Stramovirus’ and ‘Cholivirus’, in the genus Sadwavirus. Arch. Virol. 2020, 165, 527–533. [Google Scholar] [CrossRef]
  20. Zell, R.; Delwart, E.; Gorbalenya, A.E.; Hovi, T.; King, A.M.; Knowles, N.J.; Lindberg, A.M.; Pallansch, M.A.; Palmenberg, A.C.; Reuter, G.; et al. ICTV virus taxonomy profile: Picornaviridae. J. Gen. Virol. 2017, 98, 2421–2422. [Google Scholar] [CrossRef]
  21. Xie, X.; He, Z.; Chen, N.; Tang, Z.; Wang, Q.; Cai, Y. The Roles of Environmental Factors in Regulation of Oxidative Stress in Plant. BioMed Res. Int. 2019, 2019, 21–27. [Google Scholar] [CrossRef]
  22. Parker, J.E.; Hessler, G.; Cui, H. A new biochemistry connecting pathogen detection to induced defense in plants. New Phytol. 2022, 234, 819–826. [Google Scholar] [CrossRef]
  23. Zamzami, M.A. Inosine Triphosphate Pyrophosphatase (ITPase): Functions, Mutations, Polymorphisms and Its Impact on Cancer Therapies. Cells 2022, 11, 384. [Google Scholar] [CrossRef]
  24. Mbanzibwa, D.R.; Tian, Y.; Mukasa, S.B.; Valkonen, J.P.T. Cassava brown streak virus (Potyviridae) encodes a putative Maf/HAM1 pyrophosphatase implicated in reduction of mutations and a P1 proteinase that suppresses RNA silencing but contains no HC-Pro. J. Virol. 2009, 83, 6934–6940. [Google Scholar] [CrossRef]
  25. Knierim, D.; Menzel, W.; Winter, S. Analysis of the complete genome sequence of euphorbia ringspot virus, an atypical member of the genus Potyvirus. Arch. Virol. 2017, 162, 291–293. [Google Scholar] [CrossRef]
  26. Petrzik, K.; Koloniuk, I.; Přibylová, J.; Špak, J. Complete genome sequence of currant latent virus (genus Cheravirus, family Secoviridae). Arch. Virol. 2016, 161, 491–493. [Google Scholar] [CrossRef]
  27. Mahillon, M.; Brodard, J.; Dubuis, N.; Gugerli, P.; Blouin, A.G.; Schumpp, O. Mixed infection of ITPase-encoding potyvirid and secovirid in Mercurialis perennis: Evidences for a convergent euphorbia-specific viral counterstrike. Virol. J. 2024, 21, 6. [Google Scholar] [CrossRef]
  28. James, A.M.; Seal, S.E.; Bailey, A.M.; Foster, G.D. Viral inosine triphosphatase: A mysterious enzyme with typical activity, but an atypical function. Mol. Plant Pathol. 2021, 22, 382–389. [Google Scholar] [CrossRef]
  29. Lopez, R.-G.; Gonzalo, I.; Garcia, J.-A.; Valli, A.A. Cassava and Ugandan cassava brown streak virus, an attractive case of plant-pathogen coevolution. In III Reunion de la Red Nacional de Virologia de Plantas; Lopez, P.-G., Ed.; RENAVIPLANT: Murcia, Spain, 2022; p. 78. [Google Scholar]
  30. Tomlinson, K.R.; Pablo-Rodriguez, J.L.; Bunawan, H.; Nanyiti, S.; Green, P.; Miller, J.; Alicai, T.; Seal, S.E.; Bailey, A.M.; Foster, G.D. Cassava brown streak virus Ham1 protein hydrolyses mutagenic nucleotides and is a necrosis determinant. Mol. Plant Pathol. 2019, 20, 1080–1092. [Google Scholar] [CrossRef]
  31. Stace-Smith, R. Purification and properties of tomato ringspot virus and an RNA-deficient component. Virology 1966, 29, 240–247. [Google Scholar] [CrossRef] [PubMed]
  32. Kwibuka, Y.; Nyirakanani, C.; Bizimana, J.P.; Bisimwa, E.; Brostaux, Y.; Lassois, L.; Vanderschuren, H.; Massart, S. Risk factors associated with cassava brown streak disease dissemination through seed pathways in Eastern D.R. Congo. Front. Plant Sci. 2022, 13, 803980. [Google Scholar] [CrossRef]
  33. Sayers, E.W.; Bolton, E.E.; Brister, J.R.; Canese, K.; Chan, J.; Comeau, C.; Connor, R.; Funk, K.; Kelly, C.; Kim, S.; et al. Database resources of the national center for biotechnology information. Nucleic Acids Res. 2022, 50, D20–D26. [Google Scholar] [CrossRef]
  34. Kumar, S.; Stecher, G.; Li, M.; Knyaz, C.; Tamura, K. MEGA X: Molecular evolutionary genetics analysis across computing platforms. Mol. Biol. Evol. 2018, 35, 1547–1549. [Google Scholar] [CrossRef]
  35. Martin, D.P.; Murrell, B.; Golden, M.; Khoosal, A.; Muhire, B. RDP4: Detection and analysis of recombination patterns in virus genomes. Virus Evol. 2015, 1, vev003. [Google Scholar] [CrossRef]
  36. Madeira, F.; Park, Y.M.; Lee, J.; Buso, N.; Gur, T.; Madhusoodanan, N.; Basutkar, P.; Tivey, A.R.; Potter, S.C.; Finn, R.D.; et al. The EMBL-EBI search and sequence analysis tools APIs in 2019. Nucleic Acids Res. 2019, 47, W636–W641. [Google Scholar] [CrossRef]
  37. Wang, A.; Sanfacon, H. Proteolytic processing at a novel cleavage site in the N-terminal region of the tomato ringspot nepovirus RNA-1-encoded polyprotein in vitro. J. Gen. Virol. 2000, 81, 2771–2781. [Google Scholar] [CrossRef]
  38. Wetzel, T.; Chisholm, J.; Bassler, A.; Sanfaçon, H. Characterization of proteinase cleavage sites in the N-terminal region of the RNA1-encoded polyprotein from Arabis mosaic virus (subgroup A nepovirus). Virology 2008, 375, 159–169. [Google Scholar] [CrossRef]
  39. Bergmann, E.M.; Mosimann, S.C.; Chernaia, M.M.; Malcolm, B.A.; James, M.N. The refined crystal structure of the 3C gene product from hepatitis A virus: Specific proteinase activity and RNA recognition. J. Virol. 1997, 71, 2436–2448. [Google Scholar] [CrossRef]
  40. Sanfaçon, H. Re-examination of nepovirus polyprotein cleavage sites highlights the diverse specificities and evolutionary relationships of nepovirus 3C-like proteases. Arch. Virol. 2022, 167, 2529–2543. [Google Scholar] [CrossRef]
  41. Leinonen, R.; Sugawara, H.; Shumway, M. The sequence read archive. Nucleic Acids Res. 2011, 39, 2010–2012. [Google Scholar] [CrossRef]
  42. Babaian, A.; Edgar, R.C. Ribovirus classification by a polymerase barcode sequence. bioRxiv 2021. [Google Scholar] [CrossRef]
  43. Edgar, R.C.; Taylor, B.; Lin, V.; Altman, T.; Barbera, P.; Meleshko, D.; Lohr, D.; Novakovsky, G.; Buchfink, B.; Al-Shayeb, B.; et al. Petabase-scale sequence alignment catalyses viral discovery. Nature 2022, 602, 142–147. [Google Scholar] [CrossRef]
  44. Afgan, E.; Baker, D.; Batut, B.; Van Den Beek, M.; Bouvier, D.; Čech, M.; Chilton, J.; Clements, D.; Coraor, N.; Grüning, B.A.; et al. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic Acids Res. 2018, 46, W537–W544. [Google Scholar] [CrossRef]
  45. Ndunguru, J.; Sseruwagi, P.; Tairo, F.; Stomeo, F.; Maina, S.; Djinkeng, A.; Kehoe, M.; Boykin, L.M. Analyses of twelve new whole genome sequences of cassava brown streak viruses and ugandan cassava brown streak viruses from East Africa: Diversity, supercomputing and evidence for further speciation. PLoS ONE 2015, 10, e0139321. [Google Scholar] [CrossRef]
  46. Letunic, I.; Bork, P. Interactive Tree Of Life (iTOL) v5: An online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021, 49, W293–W296. [Google Scholar] [CrossRef]
  47. Li, C.; Yoshikawa, N.; Takahashi, T.; Ito, T.; Yoshida, K.; Koganezawa, H. Nucleotide sequence and genome organization of Apple latent spherical virus: A new virus classified into the family Comoviridae. Microbiology 2000, 81, 541–547. [Google Scholar] [CrossRef]
  48. James, D.; Upton, C. Nucleotide sequence analysis of RNA-2 of a flat apple isolate of Cherry rasp leaf virus with regions showing greater identity to animal picornaviruses than to related plant viruses. Arch. Virol. 2002, 147, 1631–1641. [Google Scholar] [CrossRef]
  49. Adams, I.P.; Glover, R.; Souza-Richards, R.; Bennett, S.; Hany, U.; Boonham, N. Complete genome sequence of arracacha virus B: A novel cheravirus. Arch. Virol. 2013, 158, 909–913. [Google Scholar] [CrossRef] [PubMed]
  50. James, D.; Upton, C. Genome segment RNA-1 of a flat apple isolate of Cherry rasp leaf virus: Nucleotide sequence analysis and RT-PCR detection. Arch. Virol. 2005, 150, 1469–1476. [Google Scholar] [CrossRef]
  51. Mann, K.S.; Walker, M.; Sanfaçon, H. Identification of cleavage sites recognized by the 3C-like cysteine protease within the two polyproteins of strawberry mottle virus. Front. Microbiol. 2017, 8, 745. [Google Scholar] [CrossRef]
  52. Zhang, G.; Sanfaçon, H. Characterization of Membrane Association Domains within the Tomato Ringspot Nepovirus X2 Protein, an Endoplasmic Reticulum-Targeted Polytopic Membrane Protein. J. Virol. 2006, 80, 10847–10857. [Google Scholar] [CrossRef]
  53. Kyte, J.; Doolittle, R.F. A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 1982, 157, 105–132. [Google Scholar] [CrossRef]
  54. Gall, A.D.; Gall, A.; Moore, A.C.; Aune, M.K.; Heid, S.; Mori, A.; Burgis, N.E. Analysis of human ITPase nucleobase specificity by site-directed mutagenesis. Biochimie 2013, 95, 1711–1721. [Google Scholar] [CrossRef]
  55. Valli, A.A.; Gonzalo, I.; Pasin, F.; Montanuy, I.; Antonio, J. Maf/ham1-like pyrophosphatases, host-specific partners of viral RNA-dependent RNA polymerases. bioRxiv 2021. [Google Scholar] [CrossRef]
  56. Valli, A.A.; García López, R.; Ribaya, M.; Martínez, F.J.; Gómez, D.G.; García, B.; Gonzalo, I.; Gonzalez de Pradena, A.; Pasin, F.; Montanuy, I.; et al. Maf/ham1-like pyrophosphatases of noncanonical nucleotides are host-specific partners of viral RNA-dependent RNA polymerases. PLoS Pathog. 2022, 18, e1010332. [Google Scholar] [CrossRef]
  57. Ferriol, I.; Junior, D.M.S.; Nigg, J.C.; Zamora-Macorra, E.J.; Falk, B.W. Identification of the cleavage sites of the RNA2-encoded polyproteins for two members of the genus Torradovirus by N-terminal sequencing of the virion capsid proteins. Virology 2016, 498, 109–115. [Google Scholar] [CrossRef]
  58. Fuchs, M.; Hily, J.M.; Petrzik, K.; Sanfaçon, H.; Thompson, J.R.; van der Vlugt, R.; Wetzel, T. ICTV Report Consortium. ICTV Virus Taxonomy Profile: Secoviridae 2022. J. Gen. Virol. 2022, 103, 881–899. [Google Scholar] [CrossRef]
  59. Fontdevila Pareta, N.; Khalili, M.; Maachi, A.; Rivarez, M.P.S.; Rollin, J.; Salavert, F.; Temple, C.; Aranda, M.A.; Boonham, N.; Botermans, M.; et al. Managing the deluge of newly discovered plant viruses and viroids: An optimized scientific and regulatory framework for their characterization and risk analysis. Front. Microbiol. 2023, 14, 2023. [Google Scholar] [CrossRef] [PubMed]
  60. Brown, N.P.; Leroy, C.; Sander, C. MView: A web-compatible database search or multiple alignment viewer. Bioinformatics 1998, 14, 380–381. [Google Scholar] [CrossRef] [PubMed]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.