Next Article in Journal
The Infectivity and Pathogenicity Characteristics of a Recombinant Porcine Epidemic Diarrhea Virus, CHFJFQ
Previous Article in Journal
Targeting Latent HIV Reservoirs: Effectiveness of Combination Therapy with HDAC and PARP Inhibitors
Previous Article in Special Issue
Exogenous dsRNA-Mediated RNAi: Mechanisms, Applications, Delivery Methods and Challenges in the Induction of Viral Disease Resistance in Plants
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Integrative Computational Approach for Identifying Cotton Host Plant MicroRNAs with Potential to Abate CLCuKoV-Bur Infection

by
Muhammad Aleem Ashraf
1,2,*,†,
Imran Shahid
3,†,
Judith K. Brown
4,† and
Naitong Yu
1,*
1
Institute of Tropical Biosciences and Biotechnology, Chinese Academy of Tropical Agricultural Sciences, Haikou 571101, China
2
Department of Biosciences and Technology, Emerson University, Multan 60000, Pakistan
3
Department of Pharmacology and Toxicology, Faculty of Medicine, Umm Al-Qura University, Makkah 21955, Saudi Arabia
4
School of Plant Sciences, The University of Arizona, Tucson, AZ 85721, USA
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Viruses 2025, 17(3), 399; https://doi.org/10.3390/v17030399
Submission received: 13 January 2025 / Revised: 8 March 2025 / Accepted: 10 March 2025 / Published: 12 March 2025
(This article belongs to the Special Issue Roles of Small RNAs in Virus–Plant Interactions)

Abstract

:
Cotton leaf curl Kokhran virus-Burewala (CLCuKoV-Bur) has a circular single-stranded ssDNA genome of 2759 nucleotides in length and belongs to the genus Begomovirus (family, Geminiviridae). CLCuKoV-Bur causes cotton leaf curl disease (CLCuD) and is transmitted by the whitefly Bemisis tabaci cryptic species. Monopartite begomoviruses encode five open reading frames (ORFs). CLCuKoV-Bur replicates through a dsDNA intermediate. Five open reading frames (ORFs) are organized in the small circular, single-stranded (ss)-DNA genome of CLCuKoV-Bur (2759 bases). RNA interference (RNAi) is a naturally occurring process that has revolutionized the targeting of gene regulation in eukaryotic organisms to combat virus infection. The aim of this study was to elucidate the potential binding attractions of cotton-genome-encoded microRNAs (Gossypium hirsutum-microRNAs, ghr-miRNAs) on CLCuKoV-Bur ssDNA-encoded mRNAs using online bioinformatics target prediction tools, RNA22, psRNATarget, RNAhybrid, and TAPIR. Using this suite of robust algorithms, the predicted repertoire of the cotton microRNA-binding landscape was determined for a CLCuKoV-Bur consensus genome sequence. Previously experimentally validated cotton (Gossypium hirsutum L.) miRNAs (n = 80) were selected from a public repository miRNA registry miRBase (v22) and hybridized in silico into the CLCuKoV-Bur genome (AM421522) coding and non-coding sequences. Of the 80 ghr-miRNAs interrogated, 18 ghr-miRNAs were identified by two to four algorithms evaluated. Among them, the ghr-miR399d (accession no. MIMAT0014350), located at coordinate 1747 in the CLCuKoV-Bur genome, was predicted by a consensus or “union” of all four algorithms and represents an optimal target for designing an artificial microRNA (amiRNA) silencing construct for in planta expression. Based on all robust predictions, an in silico ghr-miRNA-regulatory network was developed for CLCuKoV-Bur ORFs using Circos software version 0.6. These results represent the first predictions of ghr-miRNAs with the therapeutic potential for developing CLCuD resistance in upland cotton plants.

1. Introduction

Upland cotton, Gossypium hirsutum L. (allotetraploid), is an economically important agronomic crop grown for fiber and seed in many locations worldwide [1,2]. The first whole genome sequence of allotetraploid upland cotton (52 chromosomes (2n = 4x = 52)) and physical mapping were completed in 2015 [3]. Infection of the cotton crop in Pakistan and India by leaf curl disease (CLCuD) causes a reduced yield and reduced quality of the fiber and seed [4,5,6,7]. CLCuD is caused by whitefly-transmitted multiple species of begomoviruses, including cotton leaf curl Kokhran virus-Burewala (CLCuKoV-Bur) [6,8,9,10,11,12,13]. Begomoviruses are transmitted by the whitefly Bemisia tabaci (Gennadius) cryptic species complex (Hemiptera; Aleyrodidae) in a persistent-circulative manner [14,15,16,17]. CLCuKoV-Bur emerged as a virus and resistant breaking “strain” of CLCuD [7,10,13]. Begomoviruses have an ssDNA genome and can be monopartite or bipartite. They typically encode six proteins. The ORFs of CLCuKoV-Bur include a plus (+) virion-sense (VS) (V1 and V2) and negative (−) complementary-sense (CS) strand (C1, C3, and C4), respectively [7,10,18]. The large intergenic region (LIR) contains sequences involved in virus replication and regulation of transcription of the C1 and V1 ORFs of begomoviruses [19,20].
Despite multiple efforts to control the spread of CLCuKoV-Bur variants through improved whitefly management and introgression of resistance genes into G. hirsutum, the virus has continued to diversify and repeatedly overcome genetic resistance in cotton. RNA interference (RNAi) is a naturally occurring process in eukaryotes that mediates gene silencing in eukaryotes and, recently, has been harnessed as an invaluable research tool [21,22].
MicroRNAs (miRNAs) are a class of non-coding, single-stranded RNA, 18–25 nucleotide long, that are involved in the regulation of fundamental processes, including gene expression by silencing protein synthesis [23]. They are modulators of cell growth, differentiation, development, and antiviral defense responses. They have been used experimentally to modulate genes in cells through the activation or suppression of protein expression from mRNA to better understand the function and also as a therapeutic agent for interfering with specific host–virus interactions [24,25,26,27].
Artificial microRNAs (amiRNAs) are considered the most effective, preferred antiviral defense that triggers the silencing of a target gene [28,29]. The use of amiRNA for modulating gene expression in eukaryotes [30] has facilitated the customizable, highly reversible gene inactivation of selected targets [31]. Thus, amiRNA can be used to produce high-efficiency gene silencing capabilities that can complement viral genome-editing approaches for plants [32,33]. Relatively small, highly efficient amiRNAs cloned in plasmid vectors have been developed to trigger post-transcriptional gene silencing to combat virus infections [34,35,36,37,38]. And, in a short time, the miRBase has become a substantial resource for annotating miRNA molecules. Recently, 80 conserved mature G. hirsutum genome-encoded ghr-miRNAs have been identified and annotated using the most updated miRBase v22 [39], which has made available a number of mature upland cotton miRNAs involved in regulating biotic (i.e., insect and fungal pathogens) and abiotic (salt and drought) stressors [40,41,42,43,44,45].
The objective of this study was to develop an innovative, integrated approach to identify high-affinity target-binding sites of mature cotton ghr-miRNAs with the potential to silence CLCuKoV-Bur target genes. The goal was accomplished through the in silico screening of mature miRNAs using an integrated miRNA prediction process that implemented the RNA22, psRNATarget, RNAhybrid, and TAPIR algorithms for the identification of homologous amiRNAs. The sequences are derived from the unbiased mapping of ghr-miRNA–mRNA target site interactions and represent the first step toward constructing amiRNA-expressing silencing plasmid vectors and tools for the transformation of upland cotton. The hypotheses reported here for upland cotton are expected to lead to the development of CLCuKoV-Bur resistance in cotton plants engineered to express the cotton-encoded miRNA candidates to exploit an amiRNA approach.

2. Materials and Methods

2.1. Upland Cotton (G. hirsutum) MicroRNAs and CLCuKoV-Bur Genome Retrieval

Eighty experimentally validated mature cotton microRNA molecules (Gossypium hirsutum-microRNAs) (ghr-miRNA156-ghr-miR7514) have been archived in the miRBase database, and the number of cotton ghr-miRNAs in the database is expected to continue to increase. These available mature cotton ghr-miRNAs have been identified and characterized from the pre-miRNA stem–loop structured hairpin precursor sequences, ghr-MIR156-ghr-MIR7514. Subsequently, the mature ghr-miRNAs (Accession IDs: MIMAT0005806-MIMAT0029164) (Table S1 (Supplementary Materials)) and precursor sequences (n = 78) (Accession IDs: MI0005638-MI0024206) (Table S2) were obtained from the miRBase v22 (http://mirbase.org/) for screening in this study [39]. The complete genome sequence of a representative CLCuKoV-Bur (Accession no. AM421522) was downloaded from the NCBI GenBank database (http://ncbi.nlm.nih.gov) [46].

2.2. RNA22 Algorithm

The RNA22 algorithm is available as a web-based tool to detect evidence of direct interactions between pairs of miRNAs–mRNAs [47]. The publicly available, user-friendly RNA22 web-based tool relies on a non-seed-based approach for identifying statistically significant sequence motifs or “patterns” that represent miRNA-binding sites with corresponding hetero-duplexes, available at http://cm.jefferson.edu/rna22v1.0/ (accessed on 27 April 2022). This tool is ideal for interactively exploring and visualizing miRNA target-site prediction, in context [48]. The maximum folding energy (MFE) was set as −15.00 Kcal/mol.

2.3. psRNATarget Algorithm

The psRNATarget is a small plant RNA target analysis software that uses complementary scoring (schema V2) to identify predicted miRNA targets [49]. The psRNATarget is a standard target prediction program with an inherently high sensitivity for the detection of inhibition patterns. The psRNATarget is designed for plant genomes and is available on the web server, http://www.zhaolab.org/psRNATarget/analysis?function=2 [50] (accessed on 18 March 2019). The mature miRNA and mRNA profiles were uploaded and employed for putative target predictions using the default criteria available in Schema v2 (2017 release), as follows: seed region: 2–13 (NT); HSP size = 19; default expectation threshold = 7.0.

2.4. RNAhybrid Algorithm

The RNAhybrid is available at a publicly available online web server (https://bibiserv.cebitec.uni-bielefeld.de, accessed on 26 July 2021) developed to computationally predict miRNA–target binding sites based on intermolecular hybridization [51]. The RNAhybrid algorithm identifies the energetically most favorable hybridization sites of a small RNA in a large RNA and uses a seed binding site as the key biological element for miRNA-binding site detection in the coding and non-coding regions. It has been updated with features that show promise for predicting non-canonical target sites in miRNA molecules. The algorithm was used to estimate the minimum free energy (MFE) of the miRNA and mRNA target pair using the MFE setting of −20.00 Kcal/mol.

2.5. TAPIR Algorithm

Tracking Any Point with Per-frame Initiation, or TAPIR (http://bioinformatics.psb.ugent.be/webtools/tapir, accessed on 24 July 2023), is a state-of-the-art plant algorithm designed to identify targets for plant miRNAs. TAPIR is used to estimate seed-based miRNA–mRNA interactions and has two available modes, “fast” and “precise”, to identify target sites [52]. Here, TAPIR was used in the “precise” mode to select the most highly specific miRNA–mRNA duplexes and calculate the free energy ratio of the duplex. The analysis used the standard (default) parameters, with a maximum score of 9.0, and the MFE ratio was set at ≥0.2.

2.6. RNAfold Algorithm

The consensus miRNA precursor (cotton) sequences were obtained from miRBase, and the optimal RNA secondary structures were predicted using RNAfold, a folding algorithm based on thermodynamic principles [53]. The analysis was conducted using the default settings.

2.7. RNAcofold Algorithm

RNAcofold (http://rna.tbi.univie.ac.at/cgi-bin/RNAWebSuite/RNAcofold.cgi, accessed on 27 December 2024) is a recently developed web-based algorithm designed to estimate the free energy (ΔG) of the miRNA–mRNA duplex [54]. It was included as a complementary program that disallows pseudoknot configurations and specifically evaluates the concentration dependency of dimerization. In this study, cotton sequences were scanned with RNAcofold using default settings.

2.8. Discovering Cotton Genome-Encoded miRNAs–Target Interaction

The Circos program is used to visualize multi-miRNA binding sites and target interactions. In this study, the Circos algorithm was used to map the ghr-miRNA–CLCuKoV-Bur-mRNA target site interactions using the default settings [55].

2.9. Statistical Analysis

The R statistical software (version 3.1.1) was used to analyze and visualize multidimensional cotton miRNA data (Tables S3 and S4) and depict the results as graphical representations [56].

2.10. CLCuKoV-Bur Genome Annotation

The complete CLCuKoV-Bur genome sequence validation and annotation of coding regions were carried out using AcaClone software (v 1.1.152) to produce the pDRAW32 plasmid map (http://www.acaclone.com/) (accessed on 20 August 2022).

3. Results

3.1. High-Probability miRNA Binding Sites in CLCuKoV-Bur Genome

The computational predictive approaches available for identifying prospective miRNA targets have faced major bottlenecks, and target predictions have often suffered from high false-positive rates. To address the potential shortcomings of individual algorithms that offer different strengths and weaknesses, the following four widely used algorithms were deployed in this study for identifying the putative target sites of ghr-miRNAs: RNA22, psRNATarget, RNAhybrid, and TAPIR (Figure 1). The ghr-miRNA sequences and annotations have been aggregated in the miRBase (v22). Eighty mature ghr-miRNA sequences were curated following their extraction from miRBase (v22). By implementing an in silico workflow, the biological data and miRNA prediction algorithms were integrated to identify hybridization sites of the ghr-miRNAs with the predicted potential to silence coding and non-coding sequences homologous to CLCuKoV-Bur genome/mRNA sequences (Figure 2). The CLCuKoV-Bur genome is a single-stranded circular DNA molecule of 2759 nucleotides in length with five coding regions (Figure 3). The cotton miRNA-binding landscape for CLCuKoV-Bur-encoded mRNAs was mapped, and miRNA-binding sites were identified in the context of the viral genome-encoded mRNA and non-coding sequences. The small size and ease of accurate annotation of a begomovirus genome make it highly amenable to the unambiguous mapping of multi-ghr-miRNA genome binding sites.
The number and locations of the predicted binding sites varied, depending on the predictive tools. For example, the RNA22 algorithm predicted more than ten target sites. A comparison of complementary cotton host and CLCuKoV-Bur DNA miRNA targeting sites predicted the putative binding of 14 ghr-miRNAs, with the potential to target 18 CLCuKoV-Bur genomic sites.
The psRNATarget identified 22 ghr-miRNAs, with the potential to target 23 CLCuKoV-Bur binding sites that were scored as highly “cleavable targets”. The RNAhybrid detected 78 ghr-miRNAs corresponding to 78 target sites, whereas the TAPIR algorithm identified 16 ghr-miRNA–target pairs. Several miRBase ghr-miRNAs predicted by the computational pipeline were found to bind the CLCuKoV-Bur genome with particularly high affinity and included ghr-miR396 (a, b), ghr-miR7488, and ghr-miR7493, which were predicted by three of the four algorithms, psRNATarget, RNAhybrid, and TAPIR. And two ghr-miRNAs, ghr-miR169b and ghr-miR399d, were predicted to bind virus genome sequences based on the consensus by all four tools (Figure 4, Figure 5 and Figure 6, Table 1 and Supplementary File S1).

3.2. Coat Protein (CP) of CLCuKoV-Bur Genome

The coat protein (CP) of the CLCuKoV-Bur genome is encoded by the V1 ORF, located between the coordinates 292 and 1062 (770 nt). The CP is required for encapsidation, cell-to-cell movement, and the whitefly vector transmission of begomoviruses [57,58,59]. The results of the RNA22 analysis identified ghr-miR169 (a, b) (start site 692) and ghr-miR7512 (918) as the most optimal target ghr-miRNAs (Figure 6A).
The eight miRNAs predicted to target the V1 ORF were ghr-miR3476-5p (start site 766), ghr-miR7485 (736), ghr-miR7492 (a, b, c) (902), ghr-miR7500 (675), and ghr-miR7510 (a, b) (703, 806), based on psRNATarget analysis (Figure 6B), while RNAhybrid predicted the miRNA binding sites (n = 8) ghr (miR394 (a, b) (852), miR482b (582), miR7486 (a, b) (850), miR7490 (563), miR7492 (a, b, c) (708), miR7510 (a, b) (695, 786), and miR7512 (918) (Figure 6C). By comparison, the TAPIR algorithm identified the V1 predicted targets as ghr-miR394 (a, b) (432) and ghr-miR2949 (b, c) (854) (Figure 6D and Tables S3 and S4).

3.3. Predicted Targets for the V2 ORF of CLCuKoV-Bur

The V2 ORF of CLCuKoV-Bur (132–488 nt) overlaps with the V1 ORF (CP) and encodes a pre-coat protein that is required for symptom development [60,61,62]. No host miRNAs were predicted to have the potential to target the VI and V2 overlapping regions based on the RNA22 and TAPIR algorithms (Figure 6A,D). However, the psRNATarget algorithm identified a target in the V1 and V2 ORF overlapping region, consisting of ghr-miR7497 located at nucleotide coordinate 460 (Figure 6B), whereas RNAhybrid predicted multiple targets in this region that consisted of ghr-miR2950 (320), ghr-miR3476-5p (422), ghr-miR7485 (473), ghr-miR7497 (363), and ghr-miR7498 (371) (Figure 6C and Tables S3 and S4)).

3.4. Predicted Targets for C1 ORF of CLCuKoV-Bur

The C1 ORF of the CLCuKoV-Bur genome (1505–2596) encodes a replication initiator protein (Rep) on the complementary strand. Rep is a multi-functional begomovirus protein that is essential for the replication of the circular ssDNA genome [63,64,65]. The RNA22 algorithm predicted five miRNAs in the rep coding region consisting of ghr-miR156c (1751), ghr-miR393 (1743), and ghr-miR399 (c, d, and e) (1747) (Figure 6A). By comparison, the psRNATarget identified four ghr-miRNAs with the potential to target C1 ORF, including ghr-miR399 (d, e) (1749), ghr-miR7493 (2095), and ghr-miR7505 (2059) (Figure 6B).
Predicted candidate ghr-miRNAs with the potential to target C1 ORF were identified as ghr-miR160 (2054), ghr-miR162a (1682), ghr-miR167 (a, b) (1954), ghr-miR399 (c, d, e) (1747), ghr-miR7484 (a, b) (1549), ghr-miR7487 (1575), ghr-miR7495 (a, b) (1596), ghr-miR7499 (1677), ghr-miR7502 (1656), ghr-miR7503 (1664), ghr-miR7505 (2046), ghr-miR7506 (1804), and ghr-miR7514 (1830) (Figure 6C). Also, the TAPIR algorithm predicted two “potentially efficient” host ghr-miRNAs: ghr-miR399d (1749) and ghr-miR2949a-3p (2079) (Figure 6D and Tables S3 and S4).

3.5. Predicted Targets for C3 ORF of CLCuKoV-Bur

The C3 ORF of the CLCuKoV-Bur genome (1059–1463 nt, coordinates) encodes the begomovirus replication enhancer protein (REn) [59,66,67]. The RNA22 algorithm predicted the following binding sites in this coding region, ghr (miR169b (1448), miR390 (a, b, c), (1410), and miR7491 (1422) (Figure 6A).
Analysis of the C3 ORF by the psRNATarget algorithm identified binding targets ghr-miR396 (a, b) (1249) and ghr-miR7493 (1163) (Figure 6B). The following nine hybridization sites within cotton miRNAs were predicted by RNAhybrid analysis, ghr-miR164 (1409), ghr-miR390 (a, b, c) (1410), ghr-miR 393 (1402), ghr-miR396 (a, b) (1252), ghr-miR 2949 (a-5p, b, c) (1241), ghr-miR7488 (1443), ghr-miR7489 (1451), ghr-miR7493 (1351), and ghr-miR7494 (1299) (Figure 6C). Finally, the TAPIR algorithm identified four miRNAs: ghr (miR172 (1236), miR396 (a, b) (1249), and miR7493 (1163) (Figure 6D and Tables S3 and S4).

3.6. Predicted Targets for the C4 ORF of CLCuKoV-Bur

The C4 ORF of the CLCuKoV-Bur genome (2137–2682, nt coordinates) encodes this smallest viral protein. It has been attributed to multiple functions, which are mostly poorly understood. The best-characterized function is as a ‘symptom determinant’ [68,69,70]. The ghr-miR390 (a, b, c) and ghr-miR7486 (a, b) located at nt coordinates 2443 and 2499, respectively, were identified in the C4-C1 overlapping region by RNA22 (Figure 6A).
The following four cotton-encoded miRNA binding sites were predicted with high confidence by psRNATarget, ghr-miR169b, ghr-miR7486 (a, b), and ghr-miR7513 located at the nt coordinates, 2190, 2499, and 2551, respectively (Figure 6B). By comparison, the RNAhybrid algorithm predicted the following cotton ghr-miRNAs, ghr-miR 156 (a, b, d), ghr-miR166b, ghr-miR169 (a, b), ghr-miR172, ghr-miR398, ghr-miR482a, ghr-miR 827 (a, b, c), ghr-miR3476-3p, ghr-miR7500, ghr-miR7501, ghr-miR7504b, ghr-miR7507, ghr-miR7509, and ghr-miR7513 located at the begomovirus genome coordinates 2203, 2584, 2190, 2187, 2480, 2580, 2484, 2481, 2478, 2440, 2205, 2152, 2172, and 2328, respectively (Figure 6C).
Finally, two target binding sites of cotton ghr-miRNAs predicted by the TAPIR algorithm were ghr-miR156 (a, b, d) (2202, nt coordinate) and ghr-miR169 (a, b) (2191, nt coordinate) (Figure 6D and Tables S3 and S4).

3.7. Large Intergenic Region of CLCuKoV-Bur

The large intergenic region, or LIR, of begomoviruses harbors non-coding sequences that are required for CLCuKoV-Bur replication and for the transcriptional regulation of the V1 and C1 ORFs [19,20]. The psRNATarget predicted the miRNAs, ghr-miR2948-5p (start site 72) and ghr-miR7491 (11) (Figure 6B).
By comparison, the RNAhybrid algorithm predicted the following six miRNAs, ghr-miR479 (150), ghr-miR2948-5p (66), ghr-miR7491 (11), ghr-miR7504a (131), ghr-miR7508 (84), and ghr-miR7511 (47). Also, RNAhybrid predicted the following three ghr-miRNAs in the LIR/C4 overlapping region: ghr-miR399 (a, b) (2649) and ghr-miR2949a-3p (2625) (Figure 6C). No predicted target binding sites were identified in the CLCuKoV-Bur genome by the RNA22 and TAPIR algorithms (Figure 6A,D and Tables S3 and S4).

3.8. Consensus miRNAs Predictions

Finally, a combination of algorithms, or a consensus approach, was evaluated for identifying conserved seed-binding sites among the cotton miRNAs. Among the 80 ghr-miRNAs in miRBase v22, 18 ghr-miRNAs were identified by two or more of the four algorithms evaluated (Table 2). Nine target binding sites were identified by consensus of at least two of the four algorithms: ghr-miR (156 (start site a, b, d), 169 (a, b), 390 (a, b, c), 396 (a, b), 399 (c, d, e), 7486 (a, b), 7488, 7493 and 7512) at nucleotide (nt) positions 2202, 2190, 1410, 1249, 1747, 2499, 2588, 1163, and 918, respectively (Figure 7 and Table 2).
Of the 18 consensus ghr-miRNAs, two upland cotton-derived miRNAs, ghr-miR169b and ghr-miR399e, were predicted to target the viral genome at coordinates 2190 and 1747, respectively, and were detected by the consensus of three of the four algorithms implemented. Of the eighteen consensus ghr-miRNAs, one cotton-derived miRNA, ghr-miR399d, was predicted to target the viral genome at coordinate 1747, located within the C1 coding region, and was detected by the consensus of all the four algorithms implemented: RNA22, psRNATarget, RNAhybrid, and TAPIR (Figure 8 and Table 2 and Table 3). The efficacies of the consensus ghr-miRNAs, ghr-miR169b, ghr-miR399d, and ghr-miR399e targeting CLCuKoV-Bur genes were validated by the suppression of RNAi-mediated cleavage or translational inhibition (Table 4).

3.9. Predicted miRNA–Target Interaction

A Circos plot was compiled to enable the exploration and follow-up validation of potentially complex miRNA–mRNA interactions involved in the upland cotton-CLCuKoV-Bur-pathosystem (Figure 9)

3.10. Estimation of the Free Energy (ΔG) of Consensus miRNA–mRNA Pairs

The binding free energies (ΔG) of the three consensus ghr-miRNA–miRBS (miRBS: miRNA binding site) duplexes, calculated by RNAcofold, ghr-miR169b, ghr-miR399d, and ghr-miR399e, were identified as the most effective seed matches for two interacting miRNA and mRNA molecules (Table 5).

3.10.1. Secondary Structure Predictions

The MFE hairpin secondary structure parameters for the three consensus pre-miRNA sequences, ghr-MIR169b, ghr-MIR399d, and ghr-MIR399e, were determined by RNAfold and revealed the optimal secondary structure of the consensus precursors (Table 6).

4. Discussion

CLCuKoV-Bur is a highly transmissible and virulent begomovirus of upland cotton that has emerged in cotton–vegetable-producing regions of Pakistan beginning in the mid-1990s. Since then, CLCuD has occurred in cotton annually, and the composition of the virus populations that have predominated at any one time has been dynamic, with new species and strains emerging, most likely in response to attempts to release resistant germplasm from breeding programs in Pakistan to control the disease. The CLCuKoV-Bur species was responsible for the second major CLCuD pandemic in cotton that resulted in reduced yield and quality of cotton and an overall decline in productivity [6,7,8,10,13,71,72].
Recently, in silico approaches have been developed and evaluated to facilitate the prediction of multi-miRNA-binding sites to elucidate and modulate plant host–virus interactions [73,74,75,76]. Here, 78 precursors and 80 miRNA mature molecules, available at miRBase (v22), were downloaded and analyzed accordingly to identify upland cotton miRNAs with predicted binding sites in the CLCuKoV-Bur genome. Four widely accepted, variably robust, predictive tools that consider different criteria for predicting viral binding site interactions involve plant host miRNAs. These four algorithms, RNA22, psRNATarget, RNAhybrid, and TAPIR (Figure 1), were selected to take advantage of their computational complementarity and ability to reduce false-positive predictions. The upland cotton genome sequence queried was found to encode 18 predicted consensus ghr-miRNAs with the potential for binding CLCuKoV-Bur targets and silence replication and other key steps in the infection cycle (Table 2).
Individually, the RNA22 algorithm produced the most robust evidence of a direct interaction between miRNA and mRNA pairs, which is based on analysis of Watson–Crick and G:U pairs in the seed region [48]. The high-confidence consensual cotton miRNAs with the potential to bind to CLCuKoV-Bur sequences are summarized in Table 2. Also, in a previous study, the CLCuKoV-Bur genome was analyzed using the miRanda algorithm alone. The analysis yielded a large number of predicted ghr-miRNA target sites on the genome, making it difficult to select the several most optimal sites for in planta validation [77]. Other studies have reported attempts to identify potential miRNA-binding sites to target plant–virus–host interactions by in silico predictions in anticipation of curbing infections. A number of studies have explored amiRNA approaches for silencing other begomoviruses toward combat disease development in the plant host [28,29,78,79], and to attempt to interfere with the circulative, non-propagative transmission by the whitefly vector [31,80].
Overall, among the most optimal binding sites or targets of consensus of ghr-miRNAs were ghr-miR169b and ghr-miR399 (d, e) (Figure 8) located in the CLCuKoV-Bur C1 gene and C4-C1 overlapping region that encodes a symptom determinant. Interestingly, although robust predicted binding sites were identified in the C1 ORF and C4-C1 overlapping region as targets of consensus ghr-miRNAs that may feasibly inhibit CLCuKoV-Bur replication, no such consensus ghr-miRNAs were identified in the CLCuKoV-Bur C3 or V1 ORFs, even though the latter genes are likewise indispensable for virus infections (Figure 8). Extensive studies have already been conducted to determine the high-confidence miRNA binding sites for the targeted pathogen genome to alter host–virus interactions based on in silico criteria [81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96].
Together, the four algorithms, RNA22, psRNATarget, RNAhybrid, and TAPIR, predicted that a consensus high-confidence binding site of ghr-miR399d (start site 1747) (Accession no. MIMAT0014350) is capable of silencing the viral Rep protein (Figure 8; Table 2, Table 3 and Table 4). Both the single and consensus algorithms identified ghr-miR399d as the most highly promising molecule (Table 2, Table 3 and Table 4).
The most robust interactions between the CLCuKoV-Bur genome and host miRNAs that are predicted to inhibit translations are summarized in Figure 9. Silencing of the C1 ORF that encodes a Rep-associated protein of CLCuKoV-Bur is anticipated to reduce viral replication. Among the predicted ghr-miRNAs, the ghr-miR399d target site exhibited the strongest MFE value. The MFE of a consensus target pair (ghr-miR399d) was variably estimated by the different algorithms as −17.80 Kcal/mol (RNA22), −22.50 Kcal/mol (RNAhybrid), and −18.20 Kcal/mol (RNAcofold) (Table 3), making the pair potential “true targets” [75,97,98].
The complement between miRNAs and targets was predicted by the psRNATarget and RNAhybrid algorithms (Table 4). The RNAhybrid is a powerful algorithm that identifies seed-based interactions. The MFE calculated by RNAcofold was in the acceptance range for the formation of amiRNA–target duplexes. The cotton consensus ghr-miR399d identified by analyses is predicted to target the CLCuKoV-Bur C1 ORF sequence for silencing (Figure 8). Interestingly, a previous study has shown that the ghr-miR399 regulates salt stress in cotton [99] by targeting the non-coding RNA GhlncRNA973, which is a predicted target of ghr-miR399 [100].
To date, no experimental data have been published specifically focusing on cotton miRNAs for targeting the CLCuKoV-Bur DNA genome. Consequently, here, an integrative computational approach was evaluated with the hope of increasing the robustness of the analysis through the cross-validation of individual and multiple algorithms. The goal was to reduce the number and increase the accuracy of predicted potentially lucrative ghr-miRNA targets by combining the results of individual, union, and intersection analyses. The algorithms were evaluated individually and in different combinations to arrive at a consensus binding site identification. This computationally integrative approach was designed to authenticate the predicted results at individual, union, and intersection levels. The union-level prediction is based on combined results of in silico tools to detect ‘true targets’. The sensitivity of the prediction was also enhanced due to decreased specificity. The intersection approach was based on the combination of two or more in silico prediction tools (Figure 4 and Figure 8) [101,102,103].
While promising intermolecular RNA–RNA interactions between ghr-miRNAs and the CLCuKoV-Bur ORF genome have been established, the next step is to design optimal amiRNA expression vectors that impart precise sequence specificity and minimize deleterious off-target effects.
The small size of amiRNA molecules makes both vector construction and amiRNA expression tractable for in planta validation to determine if CLCuKoV-Bur replication can be reduced substantially or inhibited altogether. Once validated, the goal is to deploy multiple or “stacked” amiRNA targets to achieve durable, high-level tolerance against CLCuKoV-Bur and, ultimately, other species and strains associated with the dynamic CLCuD complex that is endemic to cotton cropping systems in Pakistan and India. Finally, to circumvent the development of resistance-breaking species and strains likely to occur with the continuous cultivation of virus-resistant plants [104,105,106,107], the amiRNAs must be selected to target multiple sites and achieve maximum binding affinity and high specificity to the CLCuD begomoviral mRNAs or non-coding targets, together with the assurance of the stable inheritance of the miRNAs in resultant germplasm [30,33,34,108,109].
In a prior study, the complete genome was sequenced and annotated for an isolate of CLCuKoV-Bur from naturally infected cotton in Pakistan [10], and the bidirectional promoter of CLCuKoV-Bur was characterized to elucidate transcription mechanisms [19,20]. In this study, 80 ghr-miRNAs were identified that have homology (based on in silico predictions) with the CLCuKoV-Bur genome, and multiple ghr-miRNA-binding sites were identified. Among them, predicted binding sites of consensus ghr-miRNAs were found to reside in multiple consensus regions of the CLCuKoV-Bur genome (Table 4). Through the host miRNA and CLCuKoV-Bur interactions with viral sequences, both the suppression and induction of viral gene expression and/or altered regulation of non-coding regions may occur when a virus infection activates the host plant defense systems. The Circos software was used to link mature miRNAs to their target genes by producing a network for ease of visualizing the interactions (Figure 9). This is the first report of the predicted relationships between specific host miRNAs, and their sequence targets have been characterized to create a holistic regulatory network map using a suite of computational tools to predict miRNAs encoded in the upland cotton genome and their putative targets in the CLCuKoV-Bur genome. Given the exciting progress toward the design of miRNA constructs of the CLCuKoV-Bur genome to undermine replication and other key steps in the infection cycle within the cotton host plant virus genome, targeting is poised to become the next generation of therapy that exploits knowledge of miRNA interactions with viral targets.

5. Conclusions

CLCuD is caused by a complex of whitefly-transmitted begomoviruses that cause leaf curl disease in cotton that has reached epidemic levels throughout the last four decades, with the “Burewala” CLCuKoV-Bur evolving as one of the most prevalent species among multiple species and strains associated with the ongoing CLCuD pandemics that have resulted in major losses to the cotton crop in Pakistan. In this study, the first predicted relationships between specific host miRNAs and their sequence targets have been characterized to create a holistic regulatory network map using a suite of computational tools to predict miRNAs encoded in the upland cotton genome and their putative targets in the CLCuKoV-Bur genome. In this study, 80 ghr-miRNAs were identified that have homology (based on in silico hybridization) to the CLCuKoV-Bur genome, and multiple ghr-miRNA-binding sites were identified. Among them, ghr-miR399d (Accession no. MIMAT0014350), located at coordinate 1747 in the CLCuKoV-Bur genome, exhibited more binding to their targets and concluded as the top effective ghr-miRNA with the predicted potential to target the C1 ORF. The C1 is a potential site for the antiviral construct to bind and inhibit the virus replication. In addition to ghr-miR-399d, the consensus cotton miRNAs ghr-miR169b and ghr-miR399e were also positively validated with a stable thermodynamic binding energy. Definitive experiments are also needed to determine the binding strength of the predicted ghr-miRNAs in transgenic cotton plants. The low regeneration capability of upland cotton callus is another constraint to the development of CLCuD-resistant cotton plants. The intersection of the results obtained from these integrated miRNA prediction tools allowed for identifying novel targets for designing amiRNA-based constructs. Future work will focus on the validation of these promising cotton ghr-miRNAs to develop CLCuKoV-Bur resistance in cotton plants. We can suppose that an amiRNA-based targeted approach has been expanded for long-term resistance against CLCuKoV-Bur infections.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/v17030399/s1, Table S1: List of miRBase mature upland cotton miRNA sequences; Table S2: List of miRBase upland cotton hairpin precursor miRNA sequences; Table S3: Prediction of binding sites of upland cotton miRNAs; Table S4: Gene-wise prediction of binding sites of ghr-miRNAs; File S1: Prediction results analyzed by in silico algorithms.

Author Contributions

Conceptualization, M.A.A., J.K.B. and N.Y.; methodology, all authors; validation, M.A.A.; formal analysis, M.A.A.; investigation, M.A.A.; resources, N.Y.; data curation, M.A.A.; writing—original draft preparation, M.A.A. and I.S.; writing—review and editing, all authors; visualization, I.S.; supervision, N.Y. and J.K.B.; project administration, M.A.A., J.K.B. and N.Y.; funding acquisition, N.Y. and J.K.B. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by International Science & Technology Cooperation Program of Hainan Province (GHYF2023010), Hainan Provincial Natural Science Foundation of China (322RC769), and Central Public Interest Scientific Institution Basal Research Fund (NO. 1630052023003). The authors acknowledge the invaluable support provided by Emerson University, Multan.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article/supplementary material. Further inquiries can be directed to the corresponding authors.

Acknowledgments

The authors acknowledge the IPFP of HEC and TYSP of China.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Paterson, A.H.; Wendel, J.F.; Gundlach, H.; Guo, H.; Jenkins, J.; Jin, D.; Llewellyn, D.; Showmaker, K.C.; Shu, S.; Udall, J. Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres. Nature 2012, 492, 423–427. [Google Scholar] [CrossRef] [PubMed]
  2. Zhai, Z.; Zhang, K.; Fang, Y.; Yang, Y.; Cao, X.; Liu, L.; Tian, Y. Systematically and comprehensively understanding the regulation of cotton fiber initiation: A review. Plants 2023, 12, 3771. [Google Scholar] [CrossRef] [PubMed]
  3. Zhang, T.; Hu, Y.; Jiang, W.; Fang, L.; Guan, X.; Chen, J.; Zhang, J.; Saski, C.A.; Scheffler, B.E.; Stelly, D.M. Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM-1) provides a resource for fiber improvement. Nat. Biotechnol. 2015, 33, 531–537. [Google Scholar] [CrossRef]
  4. Sattar, M.N.; Kvarnheden, A.; Saeed, M.; Briddon, R.W. Cotton leaf curl disease–an emerging threat to cotton production worldwide. J. Gen. Virol. 2013, 94, 695–710. [Google Scholar] [CrossRef] [PubMed]
  5. Nadeem, S.; Riaz Ahmed, S.; Luqman, T.; Tan, D.K.; Maryum, Z.; Akhtar, K.P.; Muhy Ud Din Khan, S.; Tariq, M.S.; Muhammad, N.; Khan, M.K.R. A comprehensive review on Gossypium hirsutum resistance against cotton leaf curl virus. Front. Genet. 2024, 15, 1306469. [Google Scholar] [CrossRef]
  6. Zubair, M.; Zaidi, S.S.-e.-A.; Shakir, S.; Farooq, M.; Amin, I.; Scheffler, J.A.; Scheffler, B.E.; Mansoor, S. Multiple begomoviruses found associated with cotton leaf curl disease in Pakistan in early 1990 are back in cultivated cotton. Sci. Rep. 2017, 7, 680. [Google Scholar] [CrossRef]
  7. Amrao, L.; Amin, I.; Shahid, M.S.; Briddon, R.W.; Mansoor, S. Cotton leaf curl disease in resistant cotton is associated with a single begomovirus that lacks an intact transcriptional activator protein. Virus. Res. 2010, 152, 153–163. [Google Scholar] [CrossRef]
  8. Iqbal, M.J.; Zia-Ur-Rehman, M.; Ilyas, M.; Hameed, U.; Herrmann, H.W.; Chingandu, N.; Manzoor, M.T.; Haider, M.S.; Brown, J.K. Sentinel plot surveillance of cotton leaf curl disease in Pakistan—A case study at the cultivated cotton-wild host plant interface. Virus Res. 2023, 333, 199144. [Google Scholar] [CrossRef]
  9. Mahmood, M.A.; Ahmed, N.; Hussain, A.; Naqvi, R.Z.; Amin, I.; Mansoor, S. Dominance of cotton leaf curl Multan virus-Rajasthan strain associated with third epidemic of cotton leaf curl disease in Pakistan. Sci. Rep. 2024, 14, 13532. [Google Scholar] [CrossRef]
  10. Ashraf, M.A.; Shahid, A.A.; Mohamed, B.B.; Dahab, A.A.; Bajwa, K.S.; Rao, A.Q.; Khan, M.A.U.; Ilyas, M.; Haider, M.S.; Husnain, T. Molecular characterization and phylogenetic analysis of a variant of highly infectious cotton leaf curl Burewala virus associated with CLCuD from Pakistan. Aust. J. Crop Sci. 2013, 7, 1113–1122. [Google Scholar]
  11. Pawar, T.; Sharma, S.; Arora, R.K.; Biswas, K.K. Distribution of cotton leaf curl virus species/strain and characterization of associated satellite molecules in Gossypium hirsutum. Indian Phytopathol. 2024, 77, 815–823. [Google Scholar] [CrossRef]
  12. Afzal, M.; Saeed, S.; Riaz, H.; Ishtiaq, M.; Habib ur Rahman, M. Transmission efficiency of cotton leaf curl Khokhran virus/Cotton leaf curl Multan betasatellite complex by two whitefly cryptic species in Pakistan. Int. J. Trop. Insect Sci. 2023, 43, 819–830. [Google Scholar] [CrossRef]
  13. Shuja, M.N.; Briddon, R.W.; Tahir, M. Identification of a distinct strain of cotton leaf curl Burewala virus. Arch. Virol. 2014, 159, 2787–2790. [Google Scholar] [CrossRef]
  14. Idrees, M.A.; Abbas, A.; Saddam, B.; Bashir, M.H.; Naveed, H.; Khan, A.K.; Dara, M.Z.N. A comprehensive review: Persistence, circulative transmission of begomovirus by whitefly vectors. Int. J. Trop. Insect Sci. 2024, 44, 405–417. [Google Scholar] [CrossRef]
  15. de Moya, R.S.; Brown, J.K.; Sweet, A.D.; Walden, K.K.; Paredes-Montero, J.R.; Waterhouse, R.M.; Johnson, K.P. Nuclear orthologs derived from whole genome sequencing indicate cryptic diversity in the Bemisia tabaci (Insecta: Aleyrodidae) complex of whiteflies. Diversity 2019, 11, 151. [Google Scholar] [CrossRef]
  16. Shah, S.H.J.; Paredes-Montero, J.R.; Malik, A.H.; Brown, J.K.; Qazi, J. Distribution of Bemisia tabaci (Gennadius) (Hemiptera: Aleyrodidae) mitotypes in commercial cotton fields in the Punjab province of Pakistan. Fla. Entomol. 2020, 103, 41–47. [Google Scholar] [CrossRef]
  17. Chen, T.; Saeed, Q.; He, Z.; Lu, L. Transmission efficiency of Cotton leaf curl Multan virus by three cryptic species of Bemisia tabaci complex in cotton cultivars. PeerJ 2019, 7, e7788. [Google Scholar] [CrossRef] [PubMed]
  18. Brown, J.K.; Zerbini, F.M.; Navas-Castillo, J.; Moriones, E.; Ramos-Sobrinho, R.; Silva, J.C.; Fiallo-Olivé, E.; Briddon, R.W.; Hernández-Zepeda, C.; Idris, A. Revision of Begomovirus taxonomy based on pairwise sequence comparisons. Arch. Virol. 2015, 160, 1593–1619. [Google Scholar] [CrossRef]
  19. Ashraf, M.A.; Shahid, A.A.; Rao, A.Q.; Bajwa, K.S.; Husnain, T. Functional characterization of a bidirectional plant promoter from cotton leaf curl Burewala virus using an Agrobacterium-mediated transient assay. Viruses 2014, 6, 223–242. [Google Scholar] [CrossRef]
  20. Ashraf, M.A.; Shahid, A.A.; Rao, A.Q.; Brown, J.K.; Husnain, T. Development and evaluation of the cotton leaf curl Kokhran virus-Burewala bidirectional promoter for enhanced Cry1Ac endotoxin expression in Bt transgenic cotton. Appl. Sci. 2022, 12, 11275. [Google Scholar] [CrossRef]
  21. Koeppe, S.; Kawchuk, L.; Kalischuk, M. RNA interference past and future applications in plants. Int. J. Mol. Sci. 2023, 24, 9755. [Google Scholar] [CrossRef] [PubMed]
  22. Akbar, S.; Wei, Y.; Zhang, M.-Q. RNA interference: Promising approach to combat plant viruses. Int. J. Mol. Sci. 2022, 23, 5312. [Google Scholar] [CrossRef] [PubMed]
  23. Shang, R.; Lee, S.; Senavirathne, G.; Lai, E.C. microRNAs in action: Biogenesis, function and regulation. Nat. Rev. Genet. 2023, 24, 816–833. [Google Scholar] [CrossRef] [PubMed]
  24. Xu, T.; Zhu, Y.; Lin, Z.; Lei, J.; Li, L.; Zhu, W.; Wu, D. Evidence of cross-kingdom gene regulation by plant MicroRNAs and possible reasons for inconsistencies. J. Agric. Food. Chem. 2024, 72, 4564–4573. [Google Scholar] [CrossRef]
  25. Bajczyk, M.; Jarmolowski, A.; Jozwiak, M.; Pacak, A.; Pietrykowska, H.; Sierocka, I.; Swida-Barteczka, A.; Szewc, L.; Szweykowska-Kulinska, Z. Recent insights into plant miRNA biogenesis: Multiple layers of miRNA level regulation. Plants 2023, 12, 342. [Google Scholar] [CrossRef]
  26. Luo, C.; Bashir, N.H.; Li, Z.; Liu, C.; Shi, Y.; Chu, H. Plant microRNAs regulate the defense response against pathogens. Front. Microbiol. 2024, 15, 1434798. [Google Scholar] [CrossRef]
  27. Yang, X.; Zhang, L.; Yang, Y.; Schmid, M.; Wang, Y. miRNA mediated regulation and interaction between plants and pathogens. Int. J. Mol. Sci. 2021, 22, 2913. [Google Scholar] [CrossRef]
  28. Al-Roshdi, M.R.; Ammara, U.; Khan, J.; Al-Sadi, A.M.; Shahid, M.S. Artificial microRNA-mediated resistance against Oman strain of tomato yellow leaf curl virus. Front. Plant Sci. 2023, 14, 1150. [Google Scholar] [CrossRef]
  29. Khalid, A.; Zhang, X.; Ji, H.; Yasir, M.; Farooq, T.; Dai, X.; Li, F. Large artificial microRNA cluster genes confer effective resistance against multiple tomato yellow leaf curl viruses in transgenic tomato. Plants 2023, 12, 2179. [Google Scholar] [CrossRef]
  30. Teotia, S.; Wang, X.; Zhou, N.; Wang, M.; Liu, H.; Qin, J.; Han, D.; Li, C.; Li, C.E.; Pan, S. A high-efficiency gene silencing in plants using two-hit asymmetrical artificial MicroRNAs. Plant Biotechnol. J. 2023, 21, 1799–1811. [Google Scholar] [CrossRef]
  31. Ma, Z.; Wang, J.; Li, C. Research Progress on miRNAs and artificial miRNAs in insect and disease resistance and breeding in plants. Genes 2024, 15, 1200. [Google Scholar] [CrossRef] [PubMed]
  32. Cisneros, A.E.; Carbonell, A. Artificial small RNA-based silencing tools for antiviral resistance in plants. Plants 2020, 9, 669. [Google Scholar] [CrossRef]
  33. Cisneros, A.E.; Martín-García, T.; Primc, A.; Kuziuta, W.; Sánchez-Vicente, J.; Aragonés, V.; Daròs, J.-A.; Carbonell, A. Transgene-free, virus-based gene silencing in plants by artificial microRNAs derived from minimal precursors. Nucleic Acids Res. 2023, 51, 10719–10736. [Google Scholar] [CrossRef] [PubMed]
  34. Niu, Q.-W.; Lin, S.-S.; Reyes, J.L.; Chen, K.-C.; Wu, H.-W.; Yeh, S.-D.; Chua, N.-H. Expression of artificial microRNAs in transgenic Arabidopsis thaliana confers virus resistance. Nat. Biotechnol. 2006, 24, 1420–1428. [Google Scholar] [CrossRef] [PubMed]
  35. Liang, C.; Hao, J.; Li, J.; Baker, B.; Luo, L. Artificial microRNA-mediated resistance to cucumber green mottle mosaic virus in Nicotiana benthamiana. Planta 2019, 250, 1591–1601. [Google Scholar] [CrossRef]
  36. Miao, S.; Liang, C.; Li, J.; Baker, B.; Luo, L. Polycistronic artificial microRNA-mediated resistance to cucumber green mottle mosaic virus in cucumber. Int. J. Mol. Sci. 2021, 22, 12237. [Google Scholar] [CrossRef]
  37. Zhou, L.; Yuan, Q.; Ai, X.; Chen, J.; Lu, Y.; Yan, F. Transgenic Rice plants expressing artificial miRNA targeting the Rice stripe virus MP gene are highly resistant to the virus. Biology 2022, 11, 332. [Google Scholar] [CrossRef]
  38. Petchthai, U.; Yee, C.S.L.; Wong, S.-M. Resistance to CymMV and ORSV in artificial microRNA transgenic Nicotiana benthamiana plants. Sci. Rep. 2018, 8, 9958. [Google Scholar] [CrossRef]
  39. Kozomara, A.; Birgaoanu, M.; Griffiths-Jones, S. miRBase: From microRNA sequences to function. Nucleic Acids Res. 2019, 47, D155–D162. [Google Scholar] [CrossRef]
  40. Li, J.; Hull, J.J.; Liang, S.; Wang, Q.; Chen, L.; Zhang, Q.; Wang, M.; Mansoor, S.; Zhang, X.; Jin, S. Genome-wide analysis of cotton miRNAs during whitefly infestation offers new insights into plant-herbivore interaction. Int. J. Mol. Sci. 2019, 20, 5357. [Google Scholar] [CrossRef]
  41. Wang, W.; Liu, D.; Chen, D.; Cheng, Y.; Zhang, X.; Song, L.; Hu, M.; Dong, J.; Shen, F. MicroRNA414c affects salt tolerance of cotton by regulating reactive oxygen species metabolism under salinity stress. RNA Biol. 2019, 16, 362–375. [Google Scholar] [CrossRef] [PubMed]
  42. Dong, Z.; Zhang, J.; Zhu, Q.; Zhao, L.; Sui, S.; Li, Z.; Zhang, Y.; Wang, H.; Tian, D.; Zhao, Y. Identification of microRNAs involved in drought stress responses in early-maturing cotton by high-throughput sequencing. Genes Genom. 2018, 40, 305–314. [Google Scholar] [CrossRef]
  43. Mei, J.; Wu, Y.; Niu, Q.; Miao, M.; Zhang, D.; Zhao, Y.; Cai, F.; Yu, D.; Ke, L.; Feng, H. Integrative analysis of expression profiles of mRNA and microRNA provides insights of cotton response to Verticillium dahliae. Int. J. Mol. Sci. 2022, 23, 4702. [Google Scholar] [CrossRef] [PubMed]
  44. Li, B.; Feng, C.; Zhang, W.; Sun, S.; Yue, D.; Zhang, X.; Yang, X. Comprehensive non-coding RNA analysis reveals specific lncRNA/circRNA–miRNA–mRNA regulatory networks in the cotton response to drought stress. Int. J. Biol. Macromol. 2023, 253, 126558. [Google Scholar] [CrossRef] [PubMed]
  45. Zhan, J.; Diao, Y.; Yin, G.; Sajjad, M.; Wei, X.; Lu, Z.; Wang, Y. Integration of mRNA and miRNA analysis reveals the molecular mechanism of cotton response to salt stress. Front. Plant Sci. 2021, 12, 767984. [Google Scholar] [CrossRef]
  46. Sayers, E.W.; Beck, J.; Bolton, E.E.; Bourexis, D.; Brister, J.R.; Canese, K.; Comeau, D.C.; Funk, K.; Kim, S.; Klimke, W. Database resources of the national center for biotechnology information. Nucleic Acids Res. 2021, 49, D10. [Google Scholar] [CrossRef]
  47. Miranda, K.C.; Huynh, T.; Tay, Y.; Ang, Y.-S.; Tam, W.-L.; Thomson, A.M.; Lim, B.; Rigoutsos, I. A pattern-based method for the identification of microRNA binding sites and their corresponding heteroduplexes. Cell 2006, 126, 1203–1217. [Google Scholar] [CrossRef]
  48. Loher, P.; Rigoutsos, I. Interactive exploration of RNA22 microRNA target predictions. Bioinformatics 2012, 28, 3322–3323. [Google Scholar] [CrossRef]
  49. Dai, X.; Zhao, P.X. psRNATarget: A plant small RNA target analysis server. Nucleic Acids Res. 2011, 39, W155–W159. [Google Scholar] [CrossRef]
  50. Dai, X.; Zhuang, Z.; Zhao, P.X. psRNATarget: A plant small RNA target analysis server (2017 release). Nucleic Acids Res. 2018, 46, W49–W54. [Google Scholar] [CrossRef]
  51. Krüger, J.; Rehmsmeier, M. RNAhybrid: microRNA target prediction easy, fast and flexible. Nucleic Acids Res. 2006, 34, W451–W454. [Google Scholar] [CrossRef] [PubMed]
  52. Bonnet, E.; He, Y.; Billiau, K.; Van de Peer, Y. TAPIR, a web server for the prediction of plant microRNA targets, including target mimics. Bioinformatics 2010, 26, 1566–1568. [Google Scholar] [CrossRef] [PubMed]
  53. Lorenz, R.; Bernhart, S.; Siederdissen, C.; Tafer, H.; Flamm, C.; Stadler, P.; Hofacker, I. ViennaRNA package 2.0. Algorithms Mol. Biol. 2011, 6, 26. [Google Scholar] [CrossRef]
  54. Bernhart, S.H.; Tafer, H.; Mückstein, U.; Flamm, C.; Stadler, P.F.; Hofacker, I.L. Partition function and base pairing probabilities of RNA heterodimers. Algorithms Mol. Biol. 2006, 1, 3. [Google Scholar] [CrossRef]
  55. Krzywinski, M.; Schein, J.; Birol, I.; Connors, J.; Gascoyne, R.; Horsman, D.; Jones, S.J.; Marra, M.A. Circos: An information aesthetic for comparative genomics. Genome Res. 2009, 19, 1639–1645. [Google Scholar] [CrossRef]
  56. Gandrud, C. Reproducible Research with R and RStudio; Chapman and Hall/CRC: Boca Raton, FL, USA, 2018. [Google Scholar]
  57. Poornima Priyadarshini, C.; Ambika, M.; Tippeswamy, R.; Savithri, H. Functional characterization of coat protein and V2 involved in cell to cell movement of Cotton leaf curl Kokhran virus-Dabawali. PLoS ONE 2011, 6, e26929. [Google Scholar] [CrossRef]
  58. Rasool, G.; Yousaf, S.; Ammara, U.e.; Iqbal, A.; Saeed, M.; Amin, I.; Mansoor, S. Transgenic expression of synthetic coat protein and synthetic replication associated protein produces mild symptoms and reduce begomovirus-betasatellite accumulation in Nicotiana benthamiana. Front. Agron. 2021, 3, 676820. [Google Scholar] [CrossRef]
  59. Breves, S.S.; Silva, F.A.; Euclydes, N.C.; Saia, T.F.; Jean-Baptiste, J.; Andrade Neto, E.R.; Fontes, E.P. Begomovirus–host interactions: Viral proteins orchestrating intra and intercellular transport of viral DNA while suppressing host defense mechanisms. Viruses 2023, 15, 1593. [Google Scholar] [CrossRef]
  60. Bahari, A.; Castillo, A.G.; Safaie, N.; Bejarano, E.R.; Luna, A.P.; Shams-Bakhsh, M. Functional analysis of V2 protein of Beet curly top Iran virus. Plants 2022, 11, 3351. [Google Scholar] [CrossRef]
  61. Li, M.; Li, C.; Jiang, K.; Li, K.; Zhang, J.; Sun, M.; Wu, G.; Qing, L. Characterization of pathogenicity-associated V2 protein of tobacco curly shoot virus. Int. J. Mol. Sci. 2021, 22, 923. [Google Scholar] [CrossRef]
  62. Wang, Y.; Wu, Y.; Gong, Q.; Ismayil, A.; Yuan, Y.; Lian, B.; Jia, Q.; Han, M.; Deng, H.; Hong, Y. Geminiviral V2 protein suppresses transcriptional gene silencing through interaction with AGO4. J. Virol. 2019, 93, e01675-01618. [Google Scholar] [CrossRef]
  63. Torres-Herrera, S.I.; Romero-Osorio, A.; Moreno-Valenzuela, O.; Pastor-Palacios, G.; Cardenas-Conejo, Y.; Ramírez-Prado, J.H.; Riego-Ruiz, L.; Minero-García, Y.; Ambriz-Granados, S.; Argüello-Astorga, G.R. A lineage of begomoviruses encode Rep and AC4 proteins of enigmatic ancestry: Hints on the evolution of geminiviruses in the New World. Viruses 2019, 11, 644. [Google Scholar] [CrossRef] [PubMed]
  64. Kamal, H.; Zafar, M.M.; Razzaq, A.; Parvaiz, A.; Ercisli, S.; Qiao, F.; Jiang, X. Functional role of geminivirus encoded proteins in the host: Past and Present. Biotechnol. J. 2024, 19, 2300736. [Google Scholar] [CrossRef] [PubMed]
  65. Ascencio-Ibáñez, J.T.; Bobay, B.G. Conserved structural motif identified in peptides that bind to geminivirus replication protein rep. Biochemistry 2021, 60, 2795–2809. [Google Scholar] [CrossRef]
  66. Sun, M.; Jiang, K.; Li, C.; Du, J.; Li, M.; Ghanem, H.; Wu, G.; Qing, L. Tobacco curly shoot virus C3 protein enhances viral replication and gene expression in Nicotiana benthamiana plants. Virus Res. 2020, 281, 197939. [Google Scholar] [CrossRef] [PubMed]
  67. Settlage, S.B.; See, R.G.; Hanley-Bowdoin, L. Geminivirus C3 protein: Replication enhancement and protein interactions. J. Virol. 2005, 79, 9885–9895. [Google Scholar] [CrossRef]
  68. Medina-Puche, L.; Orílio, A.F.; Zerbini, F.M.; Lozano-Durán, R. Small but mighty: Functional landscape of the versatile geminivirus-encoded C4 protein. PLoS Pathog. 2021, 17, e1009915. [Google Scholar] [CrossRef]
  69. Dai, K.-W.; Tsai, Y.-T.; Wu, C.-Y.; Lai, Y.-C.; Lin, N.-S.; Hu, C.-C. Identification of crucial amino acids in begomovirus C4 proteins involved in the modulation of the severity of leaf curling symptoms. Viruses 2022, 14, 499. [Google Scholar] [CrossRef]
  70. Zhao, W.; Ji, Y.; Zhou, Y.; Wang, X. Geminivirus C4/AC4 proteins hijack cellular coat protein complex I for chloroplast targeting and viral infections. Plant Physiol. 2024, 196, 1826–1839. [Google Scholar] [CrossRef]
  71. Sattar, M.N.; Iqbal, Z.; Tahir, M.N.; Ullah, S. The prediction of a new CLCuD epidemic in the Old World. Front. Microbiol. 2017, 8, 631. [Google Scholar] [CrossRef]
  72. Iqbal, Z.; Masood, M.; Shafiq, M.; Briddon, R.W. Temporal changes in the levels of virus and betasatellite DNA in B. tabaci feeding on CLCuD affected cotton during the growing season. Front. Microbiol. 2024, 15, 1410568. [Google Scholar] [CrossRef] [PubMed]
  73. Dweep, H.; Sticht, C.; Gretz, N. In-silico algorithms for the screening of possible microRNA binding sites and their interactions. Curr. Genom. 2013, 14, 127–136. [Google Scholar] [CrossRef] [PubMed]
  74. Ben Or, G.; Veksler-Lublinsky, I. Comprehensive machine-learning-based analysis of microRNA–target interactions reveals variable transferability of interaction rules across species. BMC Bioinform. 2021, 22, 264. [Google Scholar] [CrossRef] [PubMed]
  75. Riffo-Campos, Á.L.; Riquelme, I.; Brebi-Mieville, P. Tools for sequence-based miRNA target prediction: What to choose? Int. J. Mol. Sci. 2016, 17, 1987. [Google Scholar] [CrossRef]
  76. Arif, K.T.; Okolicsanyi, R.K.; Haupt, L.M.; Griffiths, L.R. A combinatorial in silico approach for microRNA-target identification: Order out of chaos. Biochimie 2021, 187, 121–130. [Google Scholar] [CrossRef]
  77. Akhter, Y.; Khan, J.A. Genome wide identification of cotton (Gossypium hirsutum)-encoded microRNA targets against Cotton leaf curl Burewala virus. Gene 2018, 638, 60–65. [Google Scholar]
  78. More, P.; Agarwal, P.; Anand, A.; Sanan-Mishra, N.; Agarwal, P.K. Artificial miRNA mediated resistance in tobacco against Jatropha leaf curl Gujarat virus by targeting RNA silencing suppressors. Sci. Rep. 2021, 11, 890. [Google Scholar] [CrossRef]
  79. Ali, I.; Amin, I.; Briddon, R.W.; Mansoor, S. Artificial microRNA-mediated resistance against the monopartite begomovirus cotton leaf curl Burewala virus. Virol. J. 2013, 10, 231. [Google Scholar] [CrossRef]
  80. Zubair, M.; Khan, M.Z.; Rauf, I.; Raza, A.; Shah, A.H.; Hassan, I.; Amin, I.; Mansoor, S. Artificial micro RNA (amiRNA)-mediated resistance against whitefly (Bemisia tabaci) targeting three genes. Crop Prot. 2020, 137, 105308. [Google Scholar] [CrossRef]
  81. Ashraf, F.; Ashraf, M.A.; Hu, X.; Zhang, S. A novel computational approach to the silencing of Sugarcane bacilliform Guadeloupe A virus determines potential host-derived microRNAs in sugarcane (Saccharum officinarum L.). PeerJ 2020, 8, e8359. [Google Scholar] [CrossRef]
  82. Ashraf, M.A.; Ashraf, F.; Feng, X.; Hu, X.; Shen, L.; Khan, J.; Zhang, S. Potential targets for evaluation of sugarcane yellow leaf virus resistance in sugarcane cultivars: In silico sugarcane miRNA and target network prediction. Biotechnol. Biotechnolo. Equip. 2021, 35, 1980–1991. [Google Scholar] [CrossRef]
  83. Ashraf, M.A.; Feng, X.; Hu, X.; Ashraf, F.; Shen, L.; Iqbal, M.S.; Zhang, S. In silico identification of sugarcane (Saccharum officinarum L.) genome encoded microRNAs targeting sugarcane bacilliform virus. PLoS ONE 2022, 17, e0261807. [Google Scholar] [CrossRef] [PubMed]
  84. Ashraf, M.A.; Tariq, H.K.; Hu, X.-W.; Khan, J.; Zou, Z. Computational biology and machine learning approaches identify rubber tree (Hevea brasiliensis Muell. Arg.) genome encoded microRNAs targeting rubber tree virus 1. Appl. Sci. 2022, 12, 12908. [Google Scholar] [CrossRef]
  85. Ashraf, M.A.; Ali, B.; Brown, J.K.; Shahid, I.; Yu, N. In Silico Identification of cassava genome-encoded microRNAs with predicted potential for targeting the ICMV-Kerala begomoviral pathogen of cassava. Viruses 2023, 15, 486. [Google Scholar] [CrossRef]
  86. Ashraf, M.A.; Murtaza, N.; Brown, J.K.; Yu, N. In silico apple genome-encoded microRNA target binding sites targeting apple chlorotic leaf spot virus. Horticulturae 2023, 9, 808. [Google Scholar] [CrossRef]
  87. Wenzhi, W.; Ashraf, M.A.; Ghaffar, H.; Ijaz, Z.; Zaman, W.u.; Mazhar, H.; Zulfqar, M.; Zhang, S. In Silico Identification of sugarcane genome-encoded microRNAs targeting sugarcane mosaic virus. Microbiol. Res. 2024, 15, 273–289. [Google Scholar] [CrossRef]
  88. Mohamed, N.A.; Ngah, N.M.F.N.C.; Abas, A.; Talip, N.; Sarian, M.N.; Hamezah, H.S.; Harun, S.; Bunawan, H. Candidate miRNAs from Oryza sativa for silencing the rice tungro viruses. Agriculture 2023, 13, 651. [Google Scholar] [CrossRef]
  89. Gaafar, Y.Z.A.; Ziebell, H. Novel targets for engineering physostegia chlorotic mottle and tomato brown rugose fruit virus-resistant tomatoes: In silico prediction of tomato microRNA targets. PeerJ 2020, 8, e10096. [Google Scholar] [CrossRef]
  90. Iqbal, M.S.; Jabbar, B.; Sharif, M.N.; Ali, Q.; Husnain, T.; Nasir, I.A. In silico MCMV silencing concludes potential host-derived miRNAs in maize. Front. Plant Sci. 2017, 8, 372. [Google Scholar] [CrossRef]
  91. Shahid, M.N.; Rasheed, S.; Iqbal, M.S.; Jamal, A.; Khalid, S.; Shamim, Z. In Silico prediction of potential miRNAs to target ZYMV in Cucumis melo. Pak. J. Bot 2022, 54, 1319–1325. [Google Scholar]
  92. Jabbar, B.; Iqbal, M.S.; Batcho, A.A.; Nasir, I.A.; Rashid, B.; Husnain, T.; Henry, R.J. Target prediction of candidate miRNAs from Oryza sativa for silencing the RYMV genome. Comput. Biol. Chem. 2019, 83, 107127. [Google Scholar] [CrossRef] [PubMed]
  93. Srivastava, A.; Pandey, V.; Singh, N.; Marwal, A.; Shahid, M.S.; Gaur, R.K. In silico identification of papaya genome-encoded microRNAs to target begomovirus genes in papaya leaf curl disease. Front. Microbiol. 2024, 15, 1340275. [Google Scholar] [CrossRef] [PubMed]
  94. Ashraf, M.A.; Brown, J.K.; Iqbal, M.S.; Yu, N. Genome-wide identification of cotton microRNAs predicted for targeting cotton leaf curl Kokhran virus-Lucknow. Microbiol. Res. 2023, 15, 1–19. [Google Scholar] [CrossRef]
  95. Pandey, V.; Srivastava, A.; Ali, A.; Gupta, R.; Shahid, M.S.; Gaur, R.K. Predicting candidate miRNAs for targeting begomovirus to induce sequence-specific gene silencing in chilli plants. Front. Plant Sci. 2024, 15, 1460540. [Google Scholar] [CrossRef]
  96. PANDEY, V.; Srivastava, A.; Gupta, R.; Shafiq Shahid, M.; Gaur, R.K. In Silico identification of chilli genome encoded microRNAs targeting the 16S rRNA and secA genes of ‘Candidatus Phytoplasma trifolii’. Front. Bioinform. 2025, 4, 1493712. [Google Scholar] [CrossRef]
  97. Homberg, N.; Galvão Ferrarini, M.; Gaspin, C.; Sagot, M.-F. MicroRNA target identification: Revisiting accessibility and seed anchoring. Genes 2023, 14, 664. [Google Scholar] [CrossRef]
  98. Peterson, S.M.; Thompson, J.A.; Ufkin, M.L.; Sathyanarayana, P.; Liaw, L.; Congdon, C.B. Common features of microRNA target prediction tools. Front. Genet. 2014, 5, 23. [Google Scholar] [CrossRef]
  99. Deng, F.; Zhang, X.; Wang, W.; Yuan, R.; Shen, F. Identification of Gossypium hirsutum long non-coding RNAs (lncRNAs) under salt stress. BMC Plant Biol. 2018, 18, 23. [Google Scholar] [CrossRef] [PubMed]
  100. Zhang, X.; Dong, J.; Deng, F.; Wang, W.; Cheng, Y.; Song, L.; Hu, M.; Shen, J.; Xu, Q.; Shen, F. The long non-coding RNA lncRNA973 is involved in cotton response to salt stress. BMC Plant Biol. 2019, 19, 459. [Google Scholar] [CrossRef]
  101. Quillet, A.; Anouar, Y.; Lecroq, T.; Dubessy, C. Prediction methods for microRNA targets in bilaterian animals: Toward a better understanding by biologists. Comput. Struct. Biotechnol. J. 2021, 19, 5811–5825. [Google Scholar] [CrossRef]
  102. Oliveira, A.C.; Bovolenta, L.A.; Nachtigall, P.G.; Herkenhoff, M.E.; Lemke, N.; Pinhal, D. Combining results from distinct microRNA target prediction tools enhances the performance of analyses. Front. Genet. 2017, 8, 59. [Google Scholar] [CrossRef] [PubMed]
  103. Min, H.; Yoon, S. Got target?: Computational methods for microRNA target prediction and their extension. Exp. Mol. Med. 2010, 42, 233–244. [Google Scholar] [CrossRef]
  104. Tsatsakis, A.M.; Nawaz, M.A.; Kouretas, D.; Balias, G.; Savolainen, K.; Tutelyan, V.A.; Golokhvast, K.S.; Lee, J.D.; Yang, S.H.; Chung, G. Environmental impacts of genetically modified plants: A review. Environ. Res. 2017, 156, 818–833. [Google Scholar] [CrossRef] [PubMed]
  105. Bauer-Panskus, A.; Miyazaki, J.; Kawall, K.; Then, C. Risk assessment of genetically engineered plants that can persist and propagate in the environment. Environ. Sci. Eur. 2020, 32, 32. [Google Scholar] [CrossRef]
  106. Hokanson, K.E.; Ellstrand, N.C.; Dixon, A.G.; Kulembeka, H.P.; Olsen, K.M.; Raybould, A. Risk assessment of gene flow from genetically engineered virus resistant cassava to wild relatives in Africa: An expert panel report. Transgenic Res. 2016, 25, 71–81. [Google Scholar] [CrossRef]
  107. Tepfer, M.; Jacquemond, M.; García-Arenal, F. A critical evaluation of whether recombination in virus-resistant transgenic plants will lead to the emergence of novel viral diseases. New Phytol. 2015, 207, 536–541. [Google Scholar] [CrossRef]
  108. Ossowski, S.; Schwab, R.; Weigel, D. Gene silencing in plants using artificial microRNAs and other small RNAs. Plant J. 2008, 53, 674–690. [Google Scholar] [CrossRef] [PubMed]
  109. Carbonell, A.; Fahlgren, N.; Mitchell, S.; Cox, K.L., Jr.; Reilly, K.C.; Mockler, T.C.; Carrington, J.C. Highly specific gene silencing in a monocot species by artificial microRNAs derived from chimeric miRNA precursors. Plant J. 2015, 82, 1061–1075. [Google Scholar] [CrossRef]
Figure 1. Features of common computational algorithms used for miRNA prediction. Four computational tools, RNA22, psRNATarget, RNAhybrid and TAPIR, were dedicated to showcase the predictive capacity of four widely used algorithms. The TAPIR and psRNATarget algorithms were used for plant miRNA predictions, whereas TAPIR and RNAhybrid are used to detect seed-based miRNA–target interaction. All four tools are multi-frame target detection algorithms and among the most widely used for analogous predictions.
Figure 1. Features of common computational algorithms used for miRNA prediction. Four computational tools, RNA22, psRNATarget, RNAhybrid and TAPIR, were dedicated to showcase the predictive capacity of four widely used algorithms. The TAPIR and psRNATarget algorithms were used for plant miRNA predictions, whereas TAPIR and RNAhybrid are used to detect seed-based miRNA–target interaction. All four tools are multi-frame target detection algorithms and among the most widely used for analogous predictions.
Viruses 17 00399 g001
Figure 2. Schematic representation of selected miRNA target sites was designed in CLCuKoV-Bur genome that enables the integration of biological data and computational tools. The computational framework composed of six different types of in silico tools.
Figure 2. Schematic representation of selected miRNA target sites was designed in CLCuKoV-Bur genome that enables the integration of biological data and computational tools. The computational framework composed of six different types of in silico tools.
Viruses 17 00399 g002
Figure 3. Diagram of the CLCuKoV-Bur genome sequence and coordinates for NCBI Accession no. AM421522. The predicted coding regions of the CLCuKoV-Bur genome indicated by the colored arrows.
Figure 3. Diagram of the CLCuKoV-Bur genome sequence and coordinates for NCBI Accession no. AM421522. The predicted coding regions of the CLCuKoV-Bur genome indicated by the colored arrows.
Viruses 17 00399 g003
Figure 4. Venn diagram showing the number of ghr-miRNAs predicted by four algorithms RNA22, psRNATarget, RNAhybrid and TAPIR. The degree of overlap between in silico tool and multi-miRNA-binding site predictions.
Figure 4. Venn diagram showing the number of ghr-miRNAs predicted by four algorithms RNA22, psRNATarget, RNAhybrid and TAPIR. The degree of overlap between in silico tool and multi-miRNA-binding site predictions.
Viruses 17 00399 g004
Figure 5. The union or consensus plot, showing the number of ghr-miRNA–target sites predicted by all four predictive tools. The y-axis shows the minimum free energy (MFE) based on −0–30 Kcal/mol threshold set for the RNA22 and RNAhybrid analyses. The MFE ratio for TAPIR was set over the range of 0–1, and the psRNATarget expectation score range was set at 0–10. The x-axis indicates the location in the virus genome sequence, spanning the 2759 nucleotides.
Figure 5. The union or consensus plot, showing the number of ghr-miRNA–target sites predicted by all four predictive tools. The y-axis shows the minimum free energy (MFE) based on −0–30 Kcal/mol threshold set for the RNA22 and RNAhybrid analyses. The MFE ratio for TAPIR was set over the range of 0–1, and the psRNATarget expectation score range was set at 0–10. The x-axis indicates the location in the virus genome sequence, spanning the 2759 nucleotides.
Viruses 17 00399 g005
Figure 6. Individual cotton-encoded ghr-miRNA predicted to target CLCuKoV-Bur sequences in the ssDNA CLCuKoV-Bur. Four widely used miRNA predictive tools with high-dimension settings. The putative miRNA binding sites identified in the virus genome were predicted by (A) RNA22, (B) psRNATarget, (C) RNAhybrid, (D) TAPIR algorithms, respectively. The viral open reading frames that were identified as predicted targets are indicated by the colored dots.
Figure 6. Individual cotton-encoded ghr-miRNA predicted to target CLCuKoV-Bur sequences in the ssDNA CLCuKoV-Bur. Four widely used miRNA predictive tools with high-dimension settings. The putative miRNA binding sites identified in the virus genome were predicted by (A) RNA22, (B) psRNATarget, (C) RNAhybrid, (D) TAPIR algorithms, respectively. The viral open reading frames that were identified as predicted targets are indicated by the colored dots.
Viruses 17 00399 g006
Figure 7. Intersection plot was created for ease of locating the consensus binding sites of cotton ghr-miRNAs. Nine high affinity predicted binding sites of 15 ghr-miRNAs in the CLCuKoV-Bur C1, C3, C4 and V1 coding region (s), identified by at least two of four algorithms implemented in this study.
Figure 7. Intersection plot was created for ease of locating the consensus binding sites of cotton ghr-miRNAs. Nine high affinity predicted binding sites of 15 ghr-miRNAs in the CLCuKoV-Bur C1, C3, C4 and V1 coding region (s), identified by at least two of four algorithms implemented in this study.
Viruses 17 00399 g007
Figure 8. Intersection plot was created for ease of locating consensus binding sites of ghr-miRNAs. Two high affinity binding sites were identified to target different coding regions of CLCuKoV-Bur C1, and C4 by at least three of the four algorithms implemented. The x-axis indicates the genome position within the viral genome of 1–2759 nucleotides in length. The Y-axis indicates the consensus ghr-miRNAs.
Figure 8. Intersection plot was created for ease of locating consensus binding sites of ghr-miRNAs. Two high affinity binding sites were identified to target different coding regions of CLCuKoV-Bur C1, and C4 by at least three of the four algorithms implemented. The x-axis indicates the genome position within the viral genome of 1–2759 nucleotides in length. The Y-axis indicates the consensus ghr-miRNAs.
Viruses 17 00399 g008
Figure 9. A Circos plot highlighting all predicted interactions between cotton ghr-miRNAs and CLCuKoV-Bur open reading frames (ORFs). This integrated interaction map shows miRNA–mRNA target interactions. The CLCuKoV-Bur ORFs are represented by colored lines.
Figure 9. A Circos plot highlighting all predicted interactions between cotton ghr-miRNAs and CLCuKoV-Bur open reading frames (ORFs). This integrated interaction map shows miRNA–mRNA target interactions. The CLCuKoV-Bur ORFs are represented by colored lines.
Viruses 17 00399 g009
Table 1. The number of cotton miRNAs predicted to target the CLCuKoV-Bur coding region.
Table 1. The number of cotton miRNAs predicted to target the CLCuKoV-Bur coding region.
GenesProteinRNA22
(Sites)
psRNATarget
(miRNAs)
RNAhybrid
(miRNAs)
TAPIR
(miRNAs)
V1Coat38124
V2Pre-Coat0150
C1Rep54182
C3REn53144
C4C455196
LIR-02100
Table 2. Summary of target binding sites of consensus ghr-miRNAs predicted by two or more of the four algorithms to target the CLCuKoV-Bur genome.
Table 2. Summary of target binding sites of consensus ghr-miRNAs predicted by two or more of the four algorithms to target the CLCuKoV-Bur genome.
Cotton
miRNA
Site/ORF
RNA22
Site/ORF psRNATargetSite/ORF
RNAhybrid
Site/ORF
TAPIR
MFE *
RNA22
Expectation
psRNATarget
MFE **
RNAhybrid
MFE Ratio
TAPIR
ghr-miR156 (a, b, d) 2203 (C1/C4)2202 (C1/C4) −23.500.51
ghr-miR169a692 (V1) 2190 (C1/C4)2191 (C1/C4)−16.70 −28.900.69
ghr-miR169b692 (V1)2190 (C1/C4)2190 (C1/C4)2191 (C1/C4)−15.406.00−31.900.64
ghr-miR390 (a, b, c)1410 (C3) 1410 (C3) −26.80 −33.40
ghr-miR396 (a, b) 1249 (C3)1225 (C3)1249 (C3) 6.00−22.500.58
ghr-miR399c1750 (C1) 1752 (C1) −19.00 −24.70
ghr-miR3999d1749 (C1)1749 (C1)1747 (C1)1749 (C1)−16.306.50−22.500.49
ghr-miR399e1747 (C1)1749 (C1)1747 (C1) −17.806.50−23.90
ghr-miR7486 (a, b)2499 (C1/C4)2499 (C1/C4)850 (V1) −21.305.00−30.70
ghr-miR7488 2558 (C4)1443 (C3)2558 (C4) 5.50−24.800.42
ghr-miR7493 1163 (C3)1351 (C3)1163 (C3) 6.00−22.200.52
ghr-miR7512918 (V1) 918 (V1) −16.70 −23.50
* MFE represents maximum free energy, calculated by RNA22. ** MFE denotes minimum free energy, estimated by RNAhybrid.
Table 3. The number of cotton miRNA-binding sites to target each gene of CLCuKoV-Bur.
Table 3. The number of cotton miRNA-binding sites to target each gene of CLCuKoV-Bur.
MicroRNAsRNA22psRNATargetRNAhybridTAPIR
Folding Energy (p-Value)ExpectationMinimum Free EnergyMFE Ratio
ghr-miR169b 6.00−31.900.64
ghr-miR399d−16.30 (0.319)6.50−22.500.49
ghr-miR399e−17.80 (0.319)6.50−23.90-
Table 4. The consensus cotton ghr-miRNAs were predicted for targeting (C1 and C4) genes.
Table 4. The consensus cotton ghr-miRNAs were predicted for targeting (C1 and C4) genes.
miRNA IDAccession IDMature Sequence (5′–3′)Target
ORF(s)
Genomic Target
(nt)
Mode of
Inhibition
ghr-miR169bMIMAT0029157CAGCCAAGGAUGAUUUGCCGGC1/C42190–2212Cleavage
ghr-miR399dMIMAT0014350UGCCAAAGGAGAUUUGCCCUGC11747–1769Cleavage
ghr-miR399eMIMAT0025840UGCCAAAGGAGAUUUGCCCCGC11747–1767Cleavage
Table 5. The binding interaction (ΔG) for the dimer cofold was estimated for each consensus pairs.
Table 5. The binding interaction (ΔG) for the dimer cofold was estimated for each consensus pairs.
miRNA IDmiRNA–mRNA Sequence (5′–3′)ΔG Duplex
(Kcal/mol)
ΔG Binding
(Kcal/mol)
ghr-miR169b5′ CAGCCAAGGAUGAUUUGCCGG 3′
5′ GCGGCGTAAGCGTCGTTGGCTGT 3′
−27.00−19.15
ghr-miR399d5′ UGCCAAAGGAGAUUUGCCCUG 3′
5′ TGGACTGCCAGTCTCTTTGGGCC 3′
−18.20−12.81
ghr-miR399e5′ UGCCAAAGGAGAUUUGCCCCG 3′
5′ TGGACTGCCAGTCTCTTTGGGCC 3′
−19.40−14.88
Table 6. The stem–loop hairpin precursor for the three consensus ghr-miRNAs was characterized.
Table 6. The stem–loop hairpin precursor for the three consensus ghr-miRNAs was characterized.
miRNA IDAccession IDLength PrecursorMFE */Kcal/molAMFE **MFEI ***(G + C)%
ghr-MIR169bMI0024199210 nt−61.80−29.42−0.81736.00
ghr-MIR399dMI001355798 nt−47.00−47.95−1.16941.00
ghr-MIR399eMI0022547157 nt−69.10−44.01−0.88050.00
* MFE is minimum free energy. ** AMFE is the abbreviation of adjusted free energy. *** MFEI is defined as free energy index.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ashraf, M.A.; Shahid, I.; Brown, J.K.; Yu, N. An Integrative Computational Approach for Identifying Cotton Host Plant MicroRNAs with Potential to Abate CLCuKoV-Bur Infection. Viruses 2025, 17, 399. https://doi.org/10.3390/v17030399

AMA Style

Ashraf MA, Shahid I, Brown JK, Yu N. An Integrative Computational Approach for Identifying Cotton Host Plant MicroRNAs with Potential to Abate CLCuKoV-Bur Infection. Viruses. 2025; 17(3):399. https://doi.org/10.3390/v17030399

Chicago/Turabian Style

Ashraf, Muhammad Aleem, Imran Shahid, Judith K. Brown, and Naitong Yu. 2025. "An Integrative Computational Approach for Identifying Cotton Host Plant MicroRNAs with Potential to Abate CLCuKoV-Bur Infection" Viruses 17, no. 3: 399. https://doi.org/10.3390/v17030399

APA Style

Ashraf, M. A., Shahid, I., Brown, J. K., & Yu, N. (2025). An Integrative Computational Approach for Identifying Cotton Host Plant MicroRNAs with Potential to Abate CLCuKoV-Bur Infection. Viruses, 17(3), 399. https://doi.org/10.3390/v17030399

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop