Next Article in Journal
Analysis of the Roles of the Arabidopsis nMAT2 and PMH2 Proteins Provided with New Insights into the Regulation of Group II Intron Splicing in Land-Plant Mitochondria
Next Article in Special Issue
Development of An Impedimetric Aptasensor for the Detection of Staphylococcus aureus
Previous Article in Journal
Sphingosine 1-Phosphate (S1P) Signaling in Glioblastoma Multiforme—A Systematic Review
Previous Article in Special Issue
Nucleic Acid Aptamers: Emerging Applications in Medical Imaging, Nanotechnology, Neurosciences, and Drug Delivery
 
 
Order Article Reprints
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Computational Methods for Modeling Aptamers and Designing Riboswitches

1
Hubei Key Laboratory of Economic Forest Germplasm Improvement and Resources Comprehensive Utilization, Hubei Collaborative Innovation Center for the Characteristic Resources Exploitation of Dabie Mountains, Huanggang Normal University, Huanggang 438000, China
2
Department of Physics, Wuhan University, Wuhan 430072, China
*
Author to whom correspondence should be addressed.
Int. J. Mol. Sci. 2017, 18(11), 2442; https://doi.org/10.3390/ijms18112442
Received: 19 October 2017 / Revised: 12 November 2017 / Accepted: 14 November 2017 / Published: 17 November 2017
(This article belongs to the Special Issue Aptamers)

Abstract

:
Riboswitches, which are located within certain noncoding RNA region perform functions as genetic “switches”, regulating when and where genes are expressed in response to certain ligands. Understanding the numerous functions of riboswitches requires computation models to predict structures and structural changes of the aptamer domains. Although aptamers often form a complex structure, computational approaches, such as RNAComposer and Rosetta, have already been applied to model the tertiary (three-dimensional (3D)) structure for several aptamers. As structural changes in aptamers must be achieved within the certain time window for effective regulation, kinetics is another key point for understanding aptamer function in riboswitch-mediated gene regulation. The coarse-grained self-organized polymer (SOP) model using Langevin dynamics simulation has been successfully developed to investigate folding kinetics of aptamers, while their co-transcriptional folding kinetics can be modeled by the helix-based computational method and BarMap approach. Based on the known aptamers, the web server Riboswitch Calculator and other theoretical methods provide a new tool to design synthetic riboswitches. This review will represent an overview of these computational methods for modeling structure and kinetics of riboswitch aptamers and for designing riboswitches.

Graphical Abstract

1. Introduction

Many noncoding RNAs (ncRNAs) have been found to bear important and diverse biological functions in all domains of life, including catalysis, protection of genomes, and regulation of cell activities [1,2,3]. This diversity in biological functions is attributed to the remarkable structure variety accommodated by RNAs. Riboswitches, which present a fundamental example of ncRNAs, are involved in cellular regulation through vast structural rearrangement in response to the intracellular physical signals, such as metabolites [1,4,5] and ions [6,7,8,9,10]. Among previously validated riboswitches, metabolite-specific riboswitches are the most widespread and the nature of their ligands is well defined in most cases, except for several “riboswitch-like”, presumably cis-acting, RNA structures for which no ligand has been found yet [11,12]. In order to function, most riboswitches usually consist of two domains: a conserved aptamer domain that is responsible for ligand binding, and an expression platform that converts changes in the aptamer domain into changes in gene expression. In contrast to these riboswitches, the SMK (SAM-III) riboswitch is the one that can use single domain for both ligand binding and gene regulation [13,14,15]. Aptamer domains, which are typically 35~200 nucleotides in length [16,17,18], often form a ligand-binding pocket (the aptamer structure) to bind the ligands with high specificity. To date, according to the aptamer structures (Figure 1), more than 30 riboswitch classes have been found in all three kingdoms of life [19,20,21]. Due to their specificity and function as genetic regulators, riboswitches represent a novel class of molecular target for developing antibiotics and chemical tools [22]. Thus, a comprehensive understanding of riboswitches is important to facilitate the design for riboswitch-targeted drug, molecular robotics, and new molecular sensors.
The signal-dependent conformational shifts of riboswitches, usually between two distinct functional states, i.e., ligand bound state and unbound state, regulate the downstream gene expression (Figure 2). One of the alternative states serves as the genetic off state (OFF state) by forming an intrinsic terminator hairpin or a repression stem to repress gene expression [26,27,28]. The other state acts as the genetic on state (ON state), which induces gene expression by preventing the formation of these regulatory elements. Riboswitches can also regulate RNA splicing by controlling the structural flexibility near the relevant splice site [25]. During the regulatory activities, one of the two structures is adopted by riboswitches, depending on whether the aptamer domain can form the pocket and bind its ligand on time or not. Therefore, to investigate the regulation mechanism of these functional ncRNAs, one of the major challenges is the information of the aptamer structure (Figure 2). In contrast to proteins, a much smaller number of RNA structures have been solved by using the traditional experimental methods [29], such as nuclear magnetic resonance (NMR) spectroscopy and X-ray crystallography [15,30]. Sensitive X-ray crystallography must take special care to avoid RNA aggregation and misfolding prior to crystal formation, and NMR experiments easily suffer from poor long-range correlations for RNA. These create a great demand to obtain information of RNA structures by using theoretical approaches. Up to now, many packages and methods have been developed to predict RNA secondary structure (two-dimensional (2D)) [31,32,33], as well as the three-dimensional (3D) structure for small RNAs [32,33,34,35,36,37,38]. These methods can quickly produce structure for a given RNA based on the input data. Some computational methods, such as RNAComposer and Rosetta [34,35,39], have been successfully applied to modeling 3D structure for several complex aptamers. As classical experimental methods may be limited in applicability to RNA [40], these theoretical approaches are likely to circumvent the bottleneck from experimental methods.
Another scientific challenge for the elucidation of the riboswitch function is to model aptamer kinetics, which is intrinsic to folding and conformational switching. Traditional molecular dynamics (MD) simulations [41,42], are able to provide a direct link between structure and dynamics. Nonetheless, due to the extreme complexity of force fields, a large number of atoms and a high number of degrees of freedom in RNA molecules, the detailed all of the atom simulations are difficult to produce trajectories in the time frame relevant for function of aptamers, which usually need to undergo large structural changes for purposes. Coarse-grained structural model with a less exhaustive representation is therefore particularly efficient to deal with these problems [43,44]. For example, the folding kinetics of pbuE [45], addA, and SMK riboswitch aptamers [43,46], have already been studied by using the coarse-grained SOP model. Another efficient alternative based on coarse-grained model is to investigate the kinetics of RNA 2D structure [47,48], which can also capture enough detail to understand the functions of aptamers.
Like protein, which can fold as soon as the N-terminal part emerges from the ribosome (co-translational folding) [49], nascent RNA also folds spontaneously as the nucleotides are synthesized by RNA polymerase in living cells (co-transcriptional folding) [50]. As riboswitches fold co-transcriptionally, folding patterns of the aptamer domain can direct the folding of the downstream expression platform. In fact, since many riboswitches regulate transcription, ligand binding can only occur during transcription for these to control gene expression [26,51,52,53]. Hence, the co-transcriptional folding kinetics is crucial for understanding the intracellular function of aptamers. Besides the optical-trapping assay and other experimental approaches [28,50,54], the kinetic Monte Carlo (MC) method was used to study the co-transcriptional folding of riboswitches [55], but it only considered the base pairing interactions closed in the native structures. By combining the master equation and the free energy landscape, BarMap and the helix-based computational methods have been applied to modeling the co-transcriptional folding behaviors for several riboswitches [56,57,58,59,60]. The results suggest that the aptamer domain folds into the pocket structure as soon as the nucleotides are transcribed [57,58], while the riboswitch without a separate aptamer domain is more likely to form an alternative structure instead of the pocket during the transcription [56]. These computation models, which can predict stable and metastable structures, kinetics, and transition states, bridge the gap in understanding the relationship between the structure and biological function of aptamers. Furthermore, computational RNA design has also made a great progress to construct synthetic riboswitches by using different strategies [61,62]. Here, we will provide a collection of these computational methods for modeling the structure and kinetics of aptamers, and for designing riboswitches.

2. Computational Method for Predicting Aptamer Structures

Since the structure of RNA determines its biological function, a complete understanding of the aptamer structure is the necessary prerequisite to understand the riboswitch-mediated regulation processes in the cell. RNAs fold into complex structures; the linear ribonucleotide sequence is the determinant of base-pairing interactions (2D structure), which in turn, determines the spatial shape (3D structure). Since most computational methods use the input of 2D structure to produce the RNA 3D structure [29,32,63], the precise prediction of 2D structure becomes more and more important. Early computational approaches for predicting RNA 2D structure (e.g., RNAfold [64]), only find the structure with the lowest free energy for a given RNA. As the functional structure may not be the one with the lowest energy, methods such as mfold and RNAsubopt [31,65] are developed to predict a set of low-energy structures. Other prediction methods, such as PPfold and RNAalifold [66,67], are based on evolutionary considerations. For a given aptamer, these methods can quickly produce 2D structure and the free energy. The methods for the modeling structure do not consist of a process that assumes co-transcriptional folding. Recently, RNA 2D structure prediction has been reinforced by incorporating the constraints from the experiments [65].

2.1. RNAComposer

RNA 2D structure is a crucial step in the functional characterization, but a thorough understanding of aptamer functions depends critically on the 3D structure, which is the key determinant of their interactions with ions and other molecules in cell. Based on the 2D structure tree graph representation and homology of structural elements, the RNAComposer method (Table 1) was developed to automatically predict the 3D structure for large RNAs [29]. As a knowledge-based method, it uses RNA sequence and 2D structure topology in dot-bracket notation as an input for 3D structure prediction (Figure 3). In this notation [64], unpaired nucleotides and the nucleotides that are involved in base pairs are represented by dots and brackets, respectively; square brackets and curly brackets refer to first-order pseudoknots and higher order structures, respectively. Although the input RNA 2D structure can be obtained by using the methods that are incorporated within the RNAComposer system: RNAfold, RNAstructure, and Contrafold, experimentally adjusted 2D structure is able to largely improve quality of the prediction [39].
The input RNA 2D structure first is divided into stems, loops, and single strands in the program. 3D structure elements corresponding to these fragmentations are searched within the structure elements dictionary, which is tailored from the RNA FRABASE database and consists of a 3D structural element with good structural properties [68]. After the searching process, the 3D elements whose heavy-atom root mean square deviation (RMSD) is lower than 1.0 Å relative to the parent PDB structure, are selected according to the 2D structure topology, sequence similarity, and so on. By merging the selected 3D structure elements, the initial RNA 3D structure is obtained, and then refined by the energy minimization in the torsion angle space and Cartesian atom coordinate space to get the final 3D structure. RNAComposer has been used to accurately build the 3D structure of several complex riboswitch families [39], such as FMN riboswitch aptamer, TPP, and purine riboswitch aptamers. It is offered at two sites: http://rnacomposer.ibch.poznan.pl and http://rnacomposer.cs.put.poznan.pl. By typing the 2D structure of THI riboswitch aptamer in Figure 3 on the website [69], the related 3D structure will be released in PDB formatted file to users within 10 s. RNAComposer is automated, efficient, especially suited for RNA 3D structure prediction of large RNAs, but it highly depends on the 3D structural elemental dictionary, and the applicability of the method is limited to RNAs with a few complex kink turn motifs.

2.2. Rosetta and Discrete Molecular Dynamics

RNAComposer is motif library-based, while Rosetta is a fragment-based method that is available online [35]. In the Rosetta approach, according to the RNA 2D structure and the experimental proximity mapping data, low-resolution models are generated by using Fragment Assembly of RNA (FARNA) [34], where models are assembled using RNA fragments from a crystallographic database via a MC algorithm. The Rosetta all-atom energy function is then used to minimize a small number of the low-resolution models with the lowest Rosetta energy scores. To select a representative set of 3D models, the largest and lowest-energy subsets of models that fall within a certain RMSD threshold of each other are collected to reflect the native fold of the RNA. Taking the aptamer from Fusobacterium nucleatum double glycine riboswitch as an example, the Rosetta 3D modeling can give a similar prediction as RMdetect and JAR3D [70,71,72,73].
Also, starting from 2D structures, a hierarchical computational approach (RAGTOP) was modified to predict 10 representative riboswitch aptamers with diverse structural features [74], by combining the coarse-grained graph sampling approach [76]. Through integration of computational and experimental methods, a three-bead coarse-grained model of RNA for discrete molecular dynamics simulation have gotten precise 3D structure predictions for the M-box riboswitch and TPP riboswitch aptamers [77], and for RNA in the range of a few hundred nucleotides [40]. In this coarse-grained model, each RNA nucleotide is represented by three beads corresponding to the base, sugar, and phosphate groups. The potential terms includes bonded, non-bonded interactions, and additional potential terms based on the experimental hydroxyl radical probing data. This method uses RNA sequences and base pairing as inputs to generate structures, and then applies replica exchange simulations with the potential to find a representative structure. Based on this method, the web server iFoldRNA was created for prediction of RNA 3D structure [75].
The 3D structures of aptamers modeled by these approaches suggest that the aptamer domains often form a compact structure involving many complicated tertiary interactions. Since the entire prediction of these approaches depends on the input data, the correct 2D structure is critical for the accurate 3D structure modeling. Experimental data provides powerful constraints to reinforce 2D structure prediction, but these methods currently can only achieve subhelix-resolution accuracy or near-atomic accuracy for RNAs.

3. Computation Model to Characterize Structural Changes in Aptamers

A key event in the biological function of riboswitches is the structural change within the aptamer domain. This change can lead to a change of the folding pattern within the expression platform, thereby directly modulating the gene expression. For effective flipping of riboswitches, the structural change of aptamers must be achieved within the certain time window. Therefore, characterizing the structural changes of aptamers is also important for fully understanding their function in the cell. The conventional all atom MD simulation has been widely used to describe the time-dependent motions of biological molecules [78]. However, even though modern parallelization of MD simulation is able to model trajectories on the order of milliseconds, it still fails to address the majority of biological processes, including folding or unfolding of aptamers that occur on much longer timescales (in seconds). In order to model time-dependent structural changes of aptamers, an effective solution is to use a coarse-grained structural model.

3.1. The Master Equation Approach

The master equation approach or kinetic MC method based on coarse-grained system [79,80,81], namely RNA 2D structure, is advantageous for accessing behaviors at long time scales, even minutes or hours [80]. In the master equation approach, structural changes are usually modeled on the RNA energy landscape, which specifies the conformation space and the transition rates between conformations. For a given RNA, the conformation space is sampled by all of the possible 2D structures that are constructed by using the formation or disruption of an entire helix as the elementary step to allow for large structural changes. Their free energies can be calculated using the Turner energy parameters [82,83]. The transition rates between states are obtained based on the free energy landscape analysis [48,80,81]: (1) formation; (2) disruption of a helix; and, (3) helix formation with concomitant partial melting of a competing helix. With these key concepts, the folding process can be described by the master equation:
d p i ( t ) / d t = i i [ k j i p j ( t ) k i j p i ( t ) ]
where p i ( t ) is the population of state i at time t and k i j is the transition rate from state i to state j. This computational method which integrates RNA 2D structure with the static energy landscape, provides a basic idea to predict the folding kinetics of the Hepatitis delta virus ribozyme and SMK riboswitch [56,80].

3.2. Coarse-Grained SOP Model

Besides these approaches, the coarse-grained SOP model using Langevin dynamics simulation can also be used to study the kinetics of RNA molecules by characterizing the folding landscape [43,45,46]. Actually, many complex biological processes, such as unfolding and refolding of various RNA and proteins [44,84,85], are described with great success by using the SOP model. In this coarse-grained model, each nucleotide, as well as the ligand, is represented as a single site. The total potential energy of the bound aptamer is
V T = V APT + V APT L
where V APT is the energy function of the aptamer; and, V APT L is the interaction between the ligand and the aptamer. The dynamics of the system can be simulated by using Brownian dynamics or the Langevin equation in the overdamped limit. During the force ramp simulation, the 5′-end of RNA is attached to a spring pulled with a constant speed, while the 3′-end is fixed. The free energy profiles are obtained using
G z = k B T ln P ( z )
where k B and T are the Boltzmann constant and temperature, respectively; and, the probability P ( z ) of the extension between z and z + dz, is calculated from the folding trajectories. Based on the theory of mean first-passage times [84], the transition rate between two folding states can be calculated from the time traces of the extension.
The kinetics of the SAM-III and addA riboswitch aptamers have been successfully studied with this approach [43,46]. As the crystal structure of pbuE riboswitch aptamer is not available, its atomic structure is produced via conformational sampling with MD by substituting the sequence of pbuE aptamer into the crystal structure of addA aptamer (PDB code: 1Y26), when considering the structural similarity between the two aptamers. But despite this structural similarity, their folding behaviors are different in the pulling simulation [43,45]. In addA riboswitch aptamer (Figure 1), the unfolding occurs in the order of F P 2 | P 3 P 3 U , while the unfolding order of pbuE aptamer is F P 2 | P 3 P 2 U , where F and U denote the fully folded state and unfolded state, respectively; Pi is the hairpin structure with helix Pi ; P 2 | P 3 denotes the state with helices P2 and P3. The different unfolding order of P2 and P3 in the two aptamers suggests that the riboswitches carry out different regulatory activities in bacteria, even though they belong to the same class. Helix P3 in pbuE riboswitch is unfolded ahead of helix P2, because of its unstability. This can explain why helix P3 is disrupted in OFF state of pbuE riboswitch but keeps folded in that of addA riboswitch [28,86]. Due to this greater conformational change in pbuE riboswitch, its OFF state can hardly transit to the aptamer structure, implying an irreversible kinetic riboswitch [57]. On the contrary, addA riboswitch is able to quickly reach equilibrium between the OFF state and the aptamer structure, which is consistent with a reversible thermodynamic switch [28]. The different unfolding kinetics of the aptamers under force thus can provide the information of their function.

4. Methods to Predict the Structure Transitions during Transcription

RNA folding occurs in two different ways [87]: (i) folding after synthesis of the entire RNA molecule (refolding); and, (ii) sequential folding occurs during transcription, namely co-transcriptional folding. In vivo, most RNAs fold co-transcriptionally, due to the sequential nature of RNA synthesis. The sequential folding during transcription is crucial for riboswitches, especially the kinetic riboswitches, to exert their regulation. Since kinetic switches are trapped in one state depending on whether the trigger is present at the time of folding, the mechanism of their function can only be understood in the context of co-transcriptional folding. For example, the full length pbuE riboswitch quickly folds to OFF state, which hinders adenine binding, while the aptamer structure that is responsible for ligand binding is not observed [53,57]. However, as nucleotides of the aptamer domain are transcribed first, the sequential folding may allow for the aptamer to form the pocket and bind to the ligand before formation of OFF state during transcription. Thus, the co-transcriptional folding kinetics of aptamers during transcription plays an important role for understanding their function in living cells.
In the case of co-transcriptional folding, as the RNA chain grows, the whole transcription process can be divided into a series of transcription steps [48,57,59], with each corresponding to adding one nucleotide. RNA folding kinetics within each step is modeled on the energy landscape as a certain RNA chain. But a link between two consecutive steps should be constructed due to the sequential folding in the transcription context. Based on this idea, the BarMap approach and helix-based computational approach have been developed to investigate the folding behaviors of several riboswitches under different transcription conditions [56,57,58,88].

4.1. BarMap

The BarMap approach integrates RNA 2D structure with the dynamic energy landscape to explore co-transcriptional folding kinetics [59]. The main idea of this approach is to model RNA kinetics on individual landscapes, where external triggers are considered as discrete changes. For successive kinetic simulations, a map between states of adjacent landscapes is computed to define the transfer of population densities.
In the case of transcription, the folding is treated as a process on a time-varying landscape in this approach. For each RNA elongation step, an energy landscape is first computed using the barriers program, which can simulate RNA folding kinetics [89,90]. Then, BarMap constructs a map between the landscapes of two successive steps, as shown in Figure 4a. According to this map, the final population at the previous step can be mapped to the initial population on the landscape at the current step. Finally, starting with the first landscape with more than one state, RNA folding kinetics can be simulated by the treekin program [59], which integrates the master equation for arbitrary times through calculating the matrix exponential. The amount of time t on a particular landscape corresponds to the elongation time of the polymerase. Co-transcriptional folding behaviors of RNAs can be obtained from the relationship between the population density and the transcription step that is given by the approach.
The BarMap approach has been used to study the RNA thermometer, RNA refolding during pore translocation, and co-transcriptional folding [59,88]. The effects of transcriptional pause sites, transcription rates and ligand concentrations, can easily be included in the approach by specifying the amount of time t and changing the binding energy added to all of the ligand-competent states, respectively. Using the example of a recently designed theophylline-dependent RS10 riboswitch, the BarMap approach predicts the folding behaviors in good agreement with experimental observations [88].

4.2. The Helix-Based Computational Method

In helix-based computation model [60], the folding time window for each RNA elongation step also depends on the time required for RNAP to transcribe the relevant nucleotide. At each step, the population kinetics is calculated in the same manner as the refolding kinetics that are calculated in the master equation approach [48,60]: first, the conformation space is generated and the transition rates are calculated; then, the population relaxation within the folding time window is described with the master equation, where the initial population at the current step is determined by the folding results of the previous step. Based on possible structural changes as the RNA chain is elongated by one nucleotide, a link between the initial population distribution at the current step and the final population distribution at the previous step is constructed in Figure 4b. Like the BarMap approach, the effects of the transcription speed, transcriptional pause, and ligand concentration can be mimicked by modifying the folding time window for each step and the binding energy in this method.
The helix-based computational method has been used to explore the regulation mechanisms of several well-studied riboswitches, such as pbuE [57], SMK [56], metF, and yitJ riboswitches [58]. The results show an excellent agreement of predicted trajectories with that from experiments and other methods [50,55,91]. The folding behavior of addA riboswitch aptamer, as shown in Figure 5, suggests that as the chain grows, the nascent RNA chain folds through a series of discrete intermediate states (from S0 to S5). When the first 25th nucleotide is transcribed, the open RNA chain S0 begins to form structure S1, which is replaced by S2 from step on 32. As the 49th nucleotide is free to form structures, S3 is formed and occupies most of the population till the 59th step. From step on 59, S3 begins to transit to S4. When helix P1 can be nucleated, the chain folds into S5.
This aptamer domain can fold into the pocket S5 as soon as the relevant nucleotides are transcribed (Figure 5a,d), which has also been found in pbuE [57], yitJ, and metF riboswitches [58]. During the refolding process (Figure 5b,e), the entire molecule folds into S5 mainly through structure I and S3. Although the refolding pathway is different from the co-transcriptional folding pathway, the aptamer domain also can form S5 within 0.1 s, implying that the aptamer domain is highly evolved. In contrast to these riboswitch aptamers, the SAM-III riboswitch, which utilizes a single domain to exert functions, quickly folds to ON state instead of the pocket (OFF) structure (as shown in Figure 1) under both the transcription context and the refolding condition (Figure 5) [56]. This thermodynamic switch is not sensitive to co-transcriptional folding kinetics, so it can be understood by the equilibrium properties. The co-transcriptional folding pathway of addA aptamer is similar to that of the pbuE aptamer [57], possibly because of the conservation within the aptamer domains. For many riboswitches, since the expression platforms are required to form a terminator or a repression stem [1,4], their sequences could be helpful to decipher the different folding intermediates as well.
Previous studies suggest that transcriptional pause plays a key role for riboswitches and other RNAs to exert function [92,93,94,95]. The major transcriptional pause sites found within several riboswitches are located immediately after the aptamer domains [51,86,96]. As the time window that is allowed for ligands binding is limited during transcription, the pauses in these regions can give the aptamers extra time to bind to the ligands but prevent the unbound functional states from being formed. Their effects have been assessed by the helix-based computational method and experimental approaches [51,57,58,93]. The results from the helix-based computational method suggest that removing the pause sites leads to a demand for an even higher ligand concentration to trigger the switch [57,58].
The good agreement of the results from these theoretical approaches with the experiments and other methods implies that they provide a reliable tool to understand the function of aptamers in the riboswitch-mediated gene regulation. However, all of these approaches based on 2D structure prediction ignore tertiary interactions and the effects of ions. Although the ligand binding can be mimicked by modifying the binding energy from experiments, the 2D structure model still cannot precisely predict the ligand–RNA interactions. Hence, there is a significant requirement in incorporating these factors to fully understand the function of riboswitches.

5. Computation Design for Synthetic Riboswitches

Artificial riboswitches established recently demonstrate that they can be used as a new tool for the drug-regulated expression of viral genes [97]. To design artificial riboswitches or other RNA devices for industrial and medical applications, different computation strategies have been developed over the years [62,98,99,100,101,102]. Based on a biophysical model of translation initiation, a web server called Riboswitch Calculator can be used to design synthetic riboswitches form various RNA aptamers [62]. In this model, a riboswitch is considered as a long mRNA molecule. The interactions of RNA–RNA, RNA–ribosome and ligand–RNA, control the translation initiation rate, which can be calculated as:
r = exp ( β Δ G total )
where β , Δ G total are the apparent Boltzmann coefficient and total energy change between mRNA and initiating 30S ribosomal subunit. The energies of RNA folding and the ribosome binding to mRNA are calculated using the ViennaRNA suite and RBS Calculator [64,103], respectively.
Riboswitch Calculator provides a useful tool to design translation regulating riboswitches. The four inputs are necessary for the design: the sequence and structural constraint of an aptamer, the ligand binding energy, and the protein coding section. These inputs are used by the optimization algorithm to generate an initial set of riboswitch candidates. Rounds of random mutations, evaluation, selection, and recombination are performed using these candidates. The biophysical model calculations are employed for the fitness evaluation to select the candidates that meet the objective function requirements, such as reaching targeted translation rates in OFF and ON states. The selected candidates will be sent back for the next round to find the riboswitch candidates with the highest fitness.
Transcription regulating riboswitches can also be designed by using the computational method [61]. The design algorithm used the aptamer with the known sequence and secondary structure. When considering the terminators always with a minimal size and the U stretch, sequences with lengths between 6 and 20 nucleotides are randomly generated to create a spacer database. The following step is to generate the sequences that are complementary to the subsequences of the 3′ part of the given aptamer. With these terms, a riboswitch candidate can be created by concatenating the aptamer, a spacer, a complementary sequence, and the U stretch. Using the RNAfold program [64], these candidates are evaluated by the folding simulations to select the elite members. A theophylline-specific riboswitch that is generated by this approach can activate transcription on binding of the target ligand [61].
These computational methods can be applied to constructing synthetic riboswitches, but the evaluation of synthetic riboswitch candidates is based on thermodynamic calculations in both of the approaches. Besides the above approaches, other methods that use different strategies are developed to design RNA devices recently [99,100,101,104]. It is known that co-transcriptional folding kinetics is fundamental to the action of functional RNAs in cell. However, it is not well incorporated into these methods, possibly because of the complexity of co-transcriptional folding. Although the transcription context may pose a serious challenge in the quest for designing RNA devices, these methods have made a great progress in the development of computational methods for designing RNA devices.

6. Conclusions

Since functions of biological molecules are determined by the formed structures, precise structure precondition is crucial to a complete understanding of many RNA-mediated processes, such as the regulation activities of riboswitches. Two domains of riboswitches, namely, the aptamer domain and the expression platform, often share nucleotides. The aptamer domain usually needs to form a unique ligand-binding pocket for specifically binding its ligand, which in turn, locks the conformation of the aptamer domain and directs the folding of the downstream expression platform. However, if the aptamer fails to bind its ligand, part of the nucleotides within the aptamer domain will pair with the nucleotides within the expression platform to form an alternative structure. For the riboswitches with one single domain, conformational change induced by the ligand binding would result in the formation of the structure that is different from the unbound functional state.
As structural changes within the aptamer domain produce a substantial change in the gene expression, it therefore demands to characterize structure and kinetics of aptamers for a thorough understanding of the riboswitch-mediated regulation. Given the fact that most aptamers have been extensively studied experimentally, yet they are not well characterized in some aspects. In the last several decades, many computational methods have been developed to investigate aptamers. One primary concern for aptamers is to precisely model the structure that is responsible for ligand recognizing and binding. Effective approaches, such as the motif library-based and fragment-based methods, have made significant progress in 3D structure prediction by integrating knowledge-based algorithm with experimental data or physics-based model. However, 3D modeling for aptamers and other large RNAs has not achieved a consistent atomic accuracy and the data may not enable statistical meta-analysis currently. Most of them are not suitable for predicting the 3D structure for highly complex RNAs. Furthermore, RNA chaperones may play a role in RNA folding [87,105,106], which are not considered in the model. High quality 3D structure prediction may require more experimental mapping data and significant human insight to build accurate models. Further advancements may hold great promise for making progress toward the goal.
Another important issue for aptamers comes from describing their great structural changes. Modern parallelization of MD simulation largely improves the computation efficiency, but it still cannot address the structural rearrangements of aptamers that occur on a seconds’ timescale. The coarse-grained SOP model and other computational methods have been applied to characterizing the kinetics of several riboswitch aptamers with great success. These demonstrate that the coarse-grained structural model, which enhances the conformation sampling, can be used to study the large scale conformational fluctuations of RNAs.
Also based on the coarse-grained structural model, a number of computational methods, such as RNAkinetics [107], Kinefold [108], and COFOLD [47] are developed to simulate co-transcriptional folding pathways of mRNAs from the early 1980s when key experiments show that structure formation happens co-transcriptionally [109]. In order to investigate the riboswitch-mediated regulation mechanisms, ligand binding should be taken into account. By incorporating the effect of ligand binding, the BarMap package and helix-based computational method have been successfully used to predict the co-transcriptional folding for several riboswitches. The application of these approaches in riboswitches implies that the 2D structure model can capture enough details of their behaviors in living cells.
Since conformational changes in aptamers can be induced by ligands, some other theoretical approaches focus on the prediction about interactions between ligands and aptamers or other features [110,111,112]. In recent years, computational methods for designing riboswitches have also made excellent progress in synthetic biology. As discussed above, several notable limitations of these computational methods still exist. Continuing developments in computational and hybrid methods are expected to overcome these limitations.

Acknowledgments

This work was partly supported by the National Natural Science Foundation of China under Grants No. 31600592 (to Sha Gong), 31270761 and 11574234 (to Wenbing Zhang). It was supported in part from Hubei Key Laboratory of Economic Forest Germplasm Improvement and Resources Comprehensive Utilization, Hubei Collaborative Innovation Center for the Characteristic Resources Exploitation of Dabie Mountains [2017BX08].

Author Contributions

Sha Gong, Yanli Wang, Zhen Wang and Wenbing Zhang designed the paper. Sha Gong and Wenbing Zhang wrote the paper. All authors have contributed in the improving of the manuscript, and read and approved its final version.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

ncRNAnoncoding RNA
SOPSelf-Organized Polymer
MCMonte Carlo
FMNFlavin Mononucleotide
SAMS-adenosylmethionine
TPPThiamine Pyrophosphate
NMRNuclear Magnetic Resonance
MDMolecular Dynamics
RNAPRNA polymerase
RMSDRoot Mean Square Deviation
FARNAFragment Assembly of RNA
RAGTOPRNA-As-Graph-Topologies

References

  1. Breaker, R.R. Riboswitches and the RNA world. Cold Spring Harb. Perspect. Biol. 2012, 4, 1–15. [Google Scholar] [CrossRef] [PubMed]
  2. Winkler, W.C. Riboswitches and the role of noncoding RNAs in bacterial metabolic control. Curr. Opin. Chem. Biol. 2005, 9, 594–602. [Google Scholar] [CrossRef] [PubMed]
  3. Cech, T.R.; Steitz, J.A. The noncoding RNA revolution—Trashing old rules to forge new ones. Cell 2014, 157, 77–94. [Google Scholar] [CrossRef] [PubMed]
  4. Chen, J.; Gottesman, S. Riboswitch regulates RNA. Science 2014, 345, 876–877. [Google Scholar] [CrossRef] [PubMed]
  5. Tucker, B.J.; Breaker, R.R. Riboswitches as versatile gene control elements. Curr. Opin. Struct. Biol. 2005, 15, 342–348. [Google Scholar] [CrossRef] [PubMed]
  6. Baker, J.L.; Sudarsan, N.; Weinberg, Z.; Roth, A.; Stockbridge, R.B.; Breaker, R.R. Widespread Genetic Switches and Toxicity Resistance Proteins for Fluoride. Science 2012, 335, 233–235. [Google Scholar] [CrossRef] [PubMed]
  7. Dambach, M.; Sandoval, M.; Updegrove, T.B.; Anantharaman, V.; Aravind, L.; Waters, L.S.; Storz, G. The Ubiquitous yybP-ykoY Riboswitch Is a Manganese-Responsive Regulatory Element. Mol. Cell 2015, 57, 1099–1109. [Google Scholar] [CrossRef] [PubMed]
  8. Dann, C.E.; Wakeman, C.A.; Sieling, C.L.; Baker, S.C.; Irnov, I.; Winkler, W.C. Structure and Mechanism of a Metal-Sensing Regulatory RNA. Cell 2007, 130, 878–892. [Google Scholar] [CrossRef] [PubMed]
  9. Price, I.R.; Gaballa, A.; Ding, F.; Helmann, J.D.; Ke, A. Mn2+-Sensing Mechanisms of yybP-ykoY Orphan Riboswitches. Mol. Cell 2015, 57, 1110–1123. [Google Scholar] [CrossRef] [PubMed]
  10. Cromie, M.J.; Shi, Y.; Latifi, T.; Groisman, E.A. An RNA Sensor for Intracellular Mg2+. Cell 2006, 125, 71–84. [Google Scholar] [CrossRef] [PubMed]
  11. Li, S.; Breaker, R.R. Identification of 15 candidate structured noncoding RNA motifs in fungi by comparative genomics. BMC Genom. 2017, 18, 785. [Google Scholar] [CrossRef] [PubMed]
  12. Weinberg, Z.; Lünse, C.E.; Corbino, K.A.; Ames, T.D.; Nelson, J.W.; Roth, A.; Perkins, K.R.; Sherlock, M.E.; Breaker, R.R. Detection of 224 candidate structured RNAs by comparative analysis of specific subsets of intergenic regions. Nucleic Acids Res. 2017, 45, 10811–10823. [Google Scholar] [CrossRef] [PubMed]
  13. Fuchs, R.T.; Grundy, F.J.; Henkin, T.M. S-adenosylmethionine directly inhibits binding of 30S ribosomal subunits to the SMK box translational riboswitch RNA. Proc. Natl. Acad. Sci. USA 2007, 104, 4876–4880. [Google Scholar] [CrossRef] [PubMed]
  14. Lu, C.; Smith, A.M.; Ding, F.; Chowdhury, A.; Henkin, T.M.; Ke, A. Variable sequences outside the SAM-binding core critically influence the conformational dynamics of the SAM-III/SMK box riboswitch. J. Mol. Biol. 2011, 409, 786–799. [Google Scholar] [CrossRef] [PubMed]
  15. Wilson, R.C.; Smith, A.M.; Fuchs, R.T.; Kleckner, I.R.; Henkin, T.M.; Foster, M.P. Tuning riboswitch regulation through conformational selection. J. Mol. Biol. 2011, 405, 926–938. [Google Scholar] [CrossRef] [PubMed]
  16. DebRoy, S.; Gebbie, M.; Ramesh, A.; Goodson, J.R.; Cruz, M.R.; van Hoof, A.; Winkler, W.C.; Garsin, D.A. A riboswitch-containing sRNA controls gene expression by sequestration of a response regulator. Science 2014, 345, 937–940. [Google Scholar] [CrossRef] [PubMed]
  17. Rinaldi, A.J.; Lund, P.E.; Blanco, M.R.; Walter, N.G. The Shine-Dalgarno sequence of riboswitch-regulated single mRNAs shows ligand-dependent accessibility bursts. Nat. Commun. 2016, 7, 8976. [Google Scholar] [CrossRef] [PubMed]
  18. Barrick, J.E.; Breaker, R.R. The distributions, mechanisms, and structures of metabolite-binding riboswitches. Genome Biol. 2007, 8, R239. [Google Scholar] [CrossRef] [PubMed]
  19. Breaker, R.R. Prospects for Riboswitch Discovery and Analysis. Mol. Cell 2011, 43, 867–879. [Google Scholar] [CrossRef] [PubMed]
  20. Li, S.; Hwang, X.Y.; Stav, S.; Breaker, R.R. The yjdF riboswitch candidate regulates gene expression by binding diverse azaaromatic compounds. RNA 2016, 22, 530–541. [Google Scholar] [CrossRef] [PubMed]
  21. Serganov, A.; Nudler, E. A Decade of Riboswitches. Cell 2013, 152, 17–24. [Google Scholar] [CrossRef] [PubMed]
  22. Deigan, K.E.; FerrÉ-D’AmarÉ, A.R. Riboswitches: Discovery of Drugs That Target Bacterial Gene-Regulatory RNAs. Acc. Chem. Res. 2011, 44, 1329–1338. [Google Scholar] [CrossRef] [PubMed]
  23. Liberman, J.A.; Wedekind, J.E. Riboswitch structure in the ligand-free state. Wiley Interdiscip. Rev. RNA 2012, 3, 369–384. [Google Scholar] [CrossRef] [PubMed]
  24. Gilbert, S.D.; Rambo, R.P.; Van Tyne, D.; Batey, R.T. Structure of the SAM-II riboswitch bound to S-adenosylmethionine. Nat. Struct. Mol. Biol. 2008, 15, 177–182. [Google Scholar] [CrossRef] [PubMed]
  25. Cheah, M.T.; Wachter, A.; Sudarsan, N.; Breaker, R.R. Control of alternative RNA splicing and gene expression by eukaryotic riboswitches. Nature 2007, 447, 497–500. [Google Scholar] [CrossRef] [PubMed]
  26. Mandal, M. A Glycine-Dependent Riboswitch That Uses Cooperative Binding to Control Gene Expression. Science 2004, 306, 275–279. [Google Scholar] [CrossRef] [PubMed]
  27. Diegelman-Parente, A.; Bevilacqua, P.C. A mechanistic framework for Co-transcriptional folding of the HDV genomic ribozyme in the presence of downstream sequence. J. Mol. Biol. 2002, 324, 1–16. [Google Scholar] [CrossRef]
  28. Lemay, J.-F.; Desnoyers, G.; Blouin, S.; Heppell, B.; Bastet, L.; St-Pierre, P.; Massé, E.; Lafontaine, D.A. Comparative Study between Transcriptionally- and Translationally-Acting Adenine Riboswitches Reveals Key Differences in Riboswitch Regulatory Mechanisms. PLoS Genet. 2011, 7, e1001278. [Google Scholar] [CrossRef] [PubMed]
  29. Popenda, M.; Szachniuk, M.; Antczak, M.; Purzycka, K.J.; Lukasiak, P.; Bartol, N.; Blazewicz, J.; Adamiak, R.W. Automated 3D structure composition for large RNAs. Nucleic Acids Res. 2012, 40, 1–12. [Google Scholar] [CrossRef] [PubMed]
  30. Chen, B.; Zuo, X.; Wang, Y.-X.; Dayie, T.K. Multiple conformations of SAM-II riboswitch detected with SAXS and NMR spectroscopy. Nucleic Acids Res. 2012, 40, 3117–3130. [Google Scholar] [CrossRef] [PubMed]
  31. Zuker, M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res. 2003, 31, 3406–3415. [Google Scholar] [CrossRef] [PubMed]
  32. Xu, X.; Zhao, P.; Chen, S.J. Vfold: A web server for RNA structure and folding thermodynamics prediction. PLoS ONE 2014, 9. [Google Scholar] [CrossRef] [PubMed]
  33. Reuter, J.S.; Mathews, D.H. RNAstructure: Software for RNA secondary structure prediction and analysis. BMC Bioinform. 2010, 11, 129. [Google Scholar] [CrossRef] [PubMed]
  34. Das, R.; Baker, D. Automated de novo prediction of native-like RNA tertiary structures. Proc. Natl. Acad. Sci. USA 2007, 104, 14664–14669. [Google Scholar] [CrossRef] [PubMed]
  35. Das, R.; Karanicolas, J.; Baker, D. Atomic accuracy in predicting and designing noncanonical RNA structure. Nat. Methods 2010, 7, 291–294. [Google Scholar] [CrossRef] [PubMed]
  36. Jonikas, M.A.; Radmer, R.J.; Altman, R.B. Knowledge-based instantiation of full atomic detail into coarse-grain RNA 3D structural models. Bioinformatics 2009, 25, 3259–3266. [Google Scholar] [CrossRef] [PubMed]
  37. Parisien, M.; Major, F. The MC-Fold and MC-Sym pipeline infers RNA structure from sequence data. Nature 2008, 452, 51–55. [Google Scholar] [CrossRef] [PubMed]
  38. Shi, Y.-Z.; Wang, F.-H.; Wu, Y.-Y.; Tan, Z.-J. A coarse-grained model with implicit salt for RNAs: Predicting 3D structure, stability and salt effect. J. Chem. Phys. 2014, 141, 105102. [Google Scholar] [CrossRef] [PubMed]
  39. Purzycka, K.J.; Popenda, M.; Szachniuk, M.; Antczak, M.; Lukasiak, P.; Blazewicz, J.; Adamiak, R.W. Automated 3D RNA Structure Prediction Using the RNAComposer Method for Riboswitches. In Methods in Enzymology; Elsevier Inc.: Waltham, MA, USA, 2015; Volume 553, pp. 3–34. ISBN 0076-6879. [Google Scholar]
  40. Ding, F.; Lavender, C.A.; Weeks, K.M.; Dokholyan, N.V. Three-dimensional RNA structure refinement by hydroxyl radical probing. Nat. Methods 2012, 9, 603–608. [Google Scholar] [CrossRef] [PubMed]
  41. Higgs, P.G. RNA secondary structure: Physical and computational aspects. Q. Rev. Biophys. 2000, 33, 199–253. [Google Scholar] [CrossRef] [PubMed]
  42. Sharma, M.; Bulusu, G.; Mitra, A. MD simulations of ligand-bound and ligand-free aptamer: Molecular level insights into the binding and switching mechanism of the add A-riboswitch. RNA 2009, 15, 1673–1692. [Google Scholar] [CrossRef] [PubMed]
  43. Lin, J.-C.; Thirumalai, D. Relative stability of helices determines the folding landscape of adenine riboswitch aptamers. J. Am. Chem. Soc. 2008, 130, 14080–14081. [Google Scholar] [CrossRef] [PubMed]
  44. Hyeon, C.; Thirumalai, D. Mechanical Unfolding of RNA: From Hairpins to Structures with Internal Multiloops. Biophys. J. 2007, 92, 731–743. [Google Scholar] [CrossRef] [PubMed]
  45. Lin, J.-C.; Hyeon, C.; Thirumalai, D. Sequence-dependent folding landscapes of adenine riboswitch aptamers. Phys. Chem. Chem. Phys. 2014, 16, 6376–6382. [Google Scholar] [CrossRef] [PubMed]
  46. Lin, J.-C.C.; Thirumalai, D. Kinetics of allosteric transitions in S-adenosylmethionine riboswitch are accurately predicted from the folding landscape. J. Am. Chem. Soc. 2013, 135, 16641–16650. [Google Scholar] [CrossRef] [PubMed]
  47. Proctor, J.R.; Meyer, I.M. CoFold: An RNA secondary structure prediction method that takes co-transcriptional folding into account. Nucleic Acids Res. 2013, 41, e102. [Google Scholar] [CrossRef] [PubMed]
  48. Zhao, P.; Zhang, W.-B.; Chen, S.-J. Predicting Secondary Structural Folding Kinetics for Nucleic Acids. Biophys. J. 2010, 98, 1617–1625. [Google Scholar] [CrossRef] [PubMed]
  49. Holtkamp, W.; Kokic, G.; Jäger, M.; Mittelstaet, J.; Komar, A.A.; Rodnina, M.V. Cotranslational protein folding on the ribosome monitored in real time. Science 2015, 350, 1104–1107. [Google Scholar] [CrossRef] [PubMed]
  50. Frieda, K.L.; Block, S.M. Direct Observation of Cotranscriptional Folding in an Adenine Riboswitch. Science 2012, 338, 397–400. [Google Scholar] [CrossRef] [PubMed]
  51. Wickiser, J.K.; Winkler, W.C.; Breaker, R.R.; Crothers, D.M. The Speed of RNA Transcription and Metabolite Binding Kinetics Operate an FMN Riboswitch. Mol. Cell 2005, 18, 49–60. [Google Scholar] [CrossRef] [PubMed]
  52. Hennelly, S.P.; Novikova, I.V.; Sanbonmatsu, K.Y. The expression platform and the aptamer: Cooperativity between Mg2+ and ligand in the SAM-I riboswitch. Nucleic Acids Res. 2013, 41, 1922–1935. [Google Scholar] [CrossRef] [PubMed]
  53. Lemay, J.-F.; Penedo, J.C.; Tremblay, R.; Lilley, D.M.J.; Lafontaine, D.A. Folding of the Adenine Riboswitch. Chem. Biol. 2006, 13, 857–868. [Google Scholar] [CrossRef] [PubMed]
  54. Watters, K.E.; Strobel, E.J.; Yu, A.M.; Lis, J.T.; Lucks, J.B. Cotranscriptional folding of a riboswitch at nucleotide resolution. Nat. Struct. Mol. Biol. 2016, 23, 1124–1131. [Google Scholar] [CrossRef] [PubMed]
  55. Lutz, B.; Faber, M.; Verma, A.; Klumpp, S.; Schug, A. Differences between cotranscriptional and free riboswitch folding. Nucleic Acids Res. 2014, 42, 2687–2696. [Google Scholar] [CrossRef] [PubMed]
  56. Gong, S.; Wang, Y.; Wang, Z.; Wang, Y.; Zhang, W. Reversible-Switch Mechanism of the SAM-III Riboswitch. J. Phys. Chem. B 2016, 120, 12305–12311. [Google Scholar] [CrossRef] [PubMed]
  57. Gong, S.; Wang, Y.; Zhang, W. Kinetic regulation mechanism of pbuE riboswitch. J. Chem. Phys. 2015, 142, 15103. [Google Scholar] [CrossRef] [PubMed]
  58. Gong, S.; Wang, Y.; Zhang, W. The regulation mechanism of yitJ and metF riboswitches. J. Chem. Phys. 2015, 143, 45103. [Google Scholar] [CrossRef] [PubMed]
  59. Hofacker, I.L.; Flamm, C.; Heine, C.; Wolfinger, M.T.; Scheuermann, G.; Stadler, P.F. BarMap: RNA folding on dynamic energy landscapes. RNA 2010, 16, 1308–1316. [Google Scholar] [CrossRef] [PubMed]
  60. Zhao, P.; Zhang, W.; Chen, S.-J. Cotranscriptional folding kinetics of ribonucleic acid secondary structures. J. Chem. Phys. 2011, 135, 245101. [Google Scholar] [CrossRef] [PubMed]
  61. Wachsmuth, M.; Findeiß, S.; Weissheimer, N.; Stadler, P.F.; Mörl, M. De novo design of a synthetic riboswitch that regulates transcription termination. Nucleic Acids Res. 2013, 41, 2541–2551. [Google Scholar] [CrossRef] [PubMed]
  62. Espah Borujeni, A.; Mishler, D.M.; Wang, J.; Huso, W.; Salis, H.M. Automated physics-based design of synthetic riboswitches from diverse RNA aptamers. Nucleic Acids Res. 2016, 44, 1–13. [Google Scholar] [CrossRef] [PubMed]
  63. Zhao, Y.; Huang, Y.; Gong, Z.; Wang, Y.; Man, J.; Xiao, Y. Automated and fast building of three-dimensional RNA structures. Sci. Rep. 2012, 2, 734. [Google Scholar] [CrossRef] [PubMed]
  64. Hofacker, I.L. RNA Secondary Structure Analysis Using the Vienna RNA Package. In Current Protocols in Bioinformatics; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2009; pp. 1–16. ISBN 0471250953. [Google Scholar]
  65. Puton, T.; Kozlowski, L.P.; Rother, K.M.; Bujnicki, J.M. CompaRNA: A server for continuous benchmarking of automated methods for RNA secondary structure prediction. Nucleic Acids Res. 2013, 41, 4307–4323. [Google Scholar] [CrossRef] [PubMed]
  66. Bernhart, S.H.; Hofacker, I.L.; Will, S.; Gruber, A.R.; Stadler, P.F. RNAalifold: Improved consensus structure prediction for RNA alignments. BMC Bioinform. 2008, 9, 474. [Google Scholar] [CrossRef] [PubMed]
  67. Sükösd, Z.; Knudsen, B.; Vaerum, M.; Kjems, J.; Andersen, E.S. Multithreaded comparative RNA secondary structure prediction using stochastic context-free grammars. BMC Bioinform. 2011, 12, 103. [Google Scholar] [CrossRef] [PubMed]
  68. Popenda, M.; Szachniuk, M.; Blazewicz, M.; Wasik, S.; Burke, E.K.; Blazewicz, J.; Adamiak, R.W. RNA FRABASE 2.0: An advanced web-accessible database with the capacity to search the three-dimensional fragments within RNA structures. BMC Bioinform. 2010, 11, 231. [Google Scholar] [CrossRef] [PubMed]
  69. Miranda-Rios, J.; Navarro, M.; Soberon, M. A conserved RNA structure (thi box) is involved in regulation of thiamin biosynthetic gene expression in bacteria. Proc. Natl. Acad. Sci. USA 2001, 98, 9736–9741. [Google Scholar] [CrossRef] [PubMed]
  70. Zirbel, C.L.; Roll, J.; Sweeney, B.A.; Petrov, A.I.; Pirrung, M.; Leontis, N.B. Identifying novel sequence variants of RNA 3D motifs. Nucleic Acids Res. 2015, 43, 7504–7520. [Google Scholar] [CrossRef] [PubMed]
  71. Rahrig, R.R.; Leontis, N.B.; Zirbel, C.L. R3D align: Global pairwise alignment of RNA 3D structures using local superpositions. Bioinformatics 2010, 26, 2689–2697. [Google Scholar] [CrossRef] [PubMed]
  72. Kladwang, W.; Chou, F.C.; Das, R. Automated RNA structure prediction uncovers a kink-turn linker in double glycine riboswitches. J. Am. Chem. Soc. 2012, 134, 1404–1407. [Google Scholar] [CrossRef] [PubMed]
  73. Cruz, J.A.; Westhof, E. Sequence-based identification of 3D structural modules in RNA with RMDetect. Nat. Methods 2011, 8, 513–519. [Google Scholar] [CrossRef] [PubMed]
  74. Kim, N.; Zahran, M.; Schlick, T. Computational Prediction of Riboswitch Tertiary Structures Including Pseudoknots by RAGTOP. In Methods in Enzymology; Elsevier Inc.: Waltham, MA, USA, 2015; Volume 553, pp. 115–135. ISBN 1557-7988 (Electronic) 0076-6879. [Google Scholar]
  75. Krokhotin, A.; Houlihan, K.; Dokholyan, N.V. iFoldRNA v2: Folding RNA with constraints. Bioinformatics 2015, 31, 2891–2893. [Google Scholar] [CrossRef] [PubMed]
  76. Kim, N.; Laing, C.; Elmetwaly, S.; Jung, S.; Curuksu, J.; Schlick, T. Graph-based sampling for approximating global helical topologies of RNA. Proc. Natl. Acad. Sci. USA 2014, 111, 4079–4084. [Google Scholar] [CrossRef] [PubMed]
  77. Krokhotin, A.; Dokholyan, N. V Computational methods toward accurate RNA structure prediction using coarse-grained and all-atom models. Methods Enzymol. 2015, 553, 65–89. [Google Scholar] [CrossRef] [PubMed]
  78. Wang, Y.; Gong, S.; Wang, Z.; Zhang, W. The thermodynamics and kinetics of a nucleotide base pair. J. Chem. Phys. 2016, 144, 115101. [Google Scholar] [CrossRef] [PubMed]
  79. Gusarov, I.; Nudler, E. Control of intrinsic transcription termination by N and NusA: The basic mechanisms. Cell 2001, 107, 437–449. [Google Scholar] [CrossRef]
  80. Chen, J.; Gong, S.; Wang, Y.; Zhang, W. Kinetic partitioning mechanism of HDV ribozyme folding. J. Chem. Phys. 2014, 140, 25102. [Google Scholar] [CrossRef] [PubMed]
  81. Chen, J.; Zhang, W. Kinetic analysis of the effects of target structure on siRNA efficiency. J. Chem. Phys. 2012, 137, 225102. [Google Scholar] [CrossRef] [PubMed]
  82. Xia, T.; SantaLucia, J.; Burkard, M.E.; Kierzek, R.; Schroeder, S.J.; Jiao, X.; Cox, C.; Turner, D.H. Thermodynamic Parameters for an Expanded Nearest-Neighbor Model for Formation of RNA Duplexes with Watson−Crick Base Pairs. Biochemistry 1998, 37, 14719–14735. [Google Scholar] [CrossRef] [PubMed]
  83. Mathews, D.H.; Sabina, J.; Zuker, M.; Turner, D.H. Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure. J. Mol. Biol. 1999, 288, 911–940. [Google Scholar] [CrossRef] [PubMed]
  84. Hyeon, C.; Morrison, G.; Thirumalai, D. Force-dependent hopping rates of RNA hairpins can be estimated from accurate measurement of the folding landscapes. Proc. Natl. Acad. Sci. USA 2008, 105, 9604–9609. [Google Scholar] [CrossRef] [PubMed]
  85. Hyeon, C.; Dima, R.I.; Thirumalai, D. Pathways and Kinetic Barriers in Mechanical Unfolding and Refolding of RNA and Proteins. Structure 2006, 14, 1633–1645. [Google Scholar] [CrossRef] [PubMed]
  86. Rieder, R.; Lang, K.; Graber, D.; Micura, R. Ligand-Induced Folding of the Adenosine Deaminase A-Riboswitch and Implications on Riboswitch Translational Control. ChemBioChem 2007, 8, 896–902. [Google Scholar] [CrossRef] [PubMed]
  87. Schroeder, R.; Grossberger, R.; Pichler, A.; Waldsich, C. RNA folding in vivo. Curr. Opin. Struct. Biol. 2002, 12, 296–300. [Google Scholar] [CrossRef]
  88. Badelt, S.; Hammer, S.; Flamm, C.; Hofacker, I.L. Thermodynamic and kinetic folding of riboswitches. Methods Enzymol. 2015, 553, 193–213. [Google Scholar] [CrossRef] [PubMed]
  89. Wolfinger, M.T.; Svrcek-Seiler, W.A.; Flamm, C.; Hofacker, I.L.; Stadler, P.F. Efficient computation of RNA folding dynamics. J. Phys. A. Math. Gen. 2004, 37, 4731–4741. [Google Scholar] [CrossRef]
  90. Flamm, C.; Hofacker, I.L.; Stadler, P.F.; Wolfinger, M.T. Barrier Trees of Degenerate Landscapes. Z. Phys. Chem. 2002, 216, 155. [Google Scholar] [CrossRef]
  91. Geis, M.; Flamm, C.; Wolfinger, M.T.; Tanzer, A.; Hofacker, I.L.; Middendorf, M.; Mandl, C.; Stadler, P.F.; Thurner, C. Folding kinetics of large RNAs. J. Mol. Biol. 2008, 379, 160–173. [Google Scholar] [CrossRef] [PubMed]
  92. Huang, W.; Kim, J.; Jha, S.; Aboul-ela, F. A mechanism for S-adenosyl methionine assisted formation of a riboswitch conformation: A small molecule with a strong arm. Nucleic Acids Res. 2009, 37, 6528–6539. [Google Scholar] [CrossRef] [PubMed]
  93. Chauvier, A.; Picard-Jean, F.; Berger-Dancause, J.-C.; Bastet, L.; Naghdi, M.R.; Dubé, A.; Turcotte, P.; Perreault, J.; Lafontaine, D.A. Transcriptional pausing at the translation start site operates as a critical checkpoint for riboswitch regulation. Nat. Commun. 2017, 8, 13892. [Google Scholar] [CrossRef] [PubMed]
  94. Perdrizet II, G.A.; Artsimovitch, I.; Furman, R.; Sosnick, T.R.; Pan, T. Transcriptional pausing coordinates folding of the aptamer domain and the expression platform of a riboswitch. Proc. Natl. Acad. Sci. USA 2012, 109, 3323–3328. [Google Scholar] [CrossRef] [PubMed]
  95. Hollands, K.; Sevostiyanova, A.; Groisman, E.A. Unusually long-lived pause required for regulation of a Rho-dependent transcription terminator. Proc. Natl. Acad. Sci. USA 2014, 111, E1999–E2007. [Google Scholar] [CrossRef] [PubMed]
  96. Wong, T.N.; Pan, T. RNA Folding During Transcription: Protocols and Studies. In Methods in Enzymology; Elsevier Inc.: San Diego, CA, USA, 2009; Volume 468, pp. 167–193. ISBN 1557-7988 (Electronic)r0076-6879. [Google Scholar]
  97. Ketzer, P.; Kaufmann, J.K.; Engelhardt, S.; Bossow, S.; von Kalle, C.; Hartig, J.S.; Ungerechts, G.; Nettelbeck, D.M. Artificial riboswitches for gene expression and replication control of DNA and RNA viruses. Proc. Natl. Acad. Sci. USA 2014, 111, E554–E562. [Google Scholar] [CrossRef] [PubMed]
  98. Zhang, F.; Carothers, J.M.; Keasling, J.D. Design of a dynamic sensor-regulator system for production of chemicals and fuels derived from fatty acids. Nat. Biotechnol. 2012, 30, 354–359. [Google Scholar] [CrossRef] [PubMed]
  99. Carothers, J.M.; Goler, J.A.; Juminaga, D.; Keasling, J.D. Model-Driven Engineering of RNA Devices to Quantitatively Program Gene Expression. Science 2011, 334, 1716–1719. [Google Scholar] [CrossRef] [PubMed]
  100. Townshend, B.; Kennedy, A.B.; Xiang, J.S.; Smolke, C.D. High-throughput cellular RNA device engineering. Nat. Methods 2015, 12, 989–994. [Google Scholar] [CrossRef] [PubMed]
  101. Endoh, T.; Sugimoto, N. Rational Design and Tuning of Functional RNA Switch to Control an Allosteric Intermolecular Interaction. Anal. Chem. 2015, 87, 7628–7635. [Google Scholar] [CrossRef] [PubMed]
  102. Wei, K.Y.; Smolke, C.D. Engineering dynamic cell cycle control with synthetic small molecule-responsive RNA devices. J. Biol. Eng. 2015, 9, 21. [Google Scholar] [CrossRef] [PubMed]
  103. Espah Borujeni, A.; Channarasappa, A.S.; Salis, H.M. Translation rate is controlled by coupled trade-offs between site accessibility, selective RNA unfolding and sliding at upstream standby sites. Nucleic Acids Res. 2014, 42, 2646–2659. [Google Scholar] [CrossRef] [PubMed]
  104. McKeague, M.; Wong, R.S.; Smolke, C.D. Opportunities in the design and application of RNA for gene expression control. Nucleic Acids Res. 2016, 44, 2987–2999. [Google Scholar] [CrossRef] [PubMed]
  105. Zemora, G.; Waldsich, C. RNA folding in living cells. RNA Biol. 2010, 7, 634–641. [Google Scholar] [CrossRef] [PubMed]
  106. Herschlag, D. RNA Chaperones and the RNA Folding Problem. J. Biol. Chem. 1995, 270, 20871–20874. [Google Scholar] [CrossRef] [PubMed]
  107. Danilova, L.V.; Pervouchine, D.D.; Favorov, A.V.; Mironov, A.A. RNAKinetics: A web server that models secondary structure kinetics of an elongating RNA. J. Bioinform. Comput. Biol. 2006, 4, 589–596. [Google Scholar] [CrossRef] [PubMed]
  108. Xayaphoummine, A.; Bucher, T.; Isambert, H. Kinefold web server for RNA/DNA folding path and structure prediction including pseudoknots and knots. Nucleic Acids Res. 2005, 33, 605–610. [Google Scholar] [CrossRef] [PubMed]
  109. Kramer, F.R.; Mills, D.R. Secondary structure formation during RNA synthesis. Nucleic Acids Res. 1981, 9, 5109–5124. [Google Scholar] [CrossRef] [PubMed]
  110. Gong, Z.; Zhao, Y.; Chen, C.; Xiao, Y. Role of Ligand Binding in Structural Organization of Add A-riboswitch Aptamer: A Molecular Dynamics Simulation. J. Biomol. Struct. Dyn. 2011, 29, 403–416. [Google Scholar] [CrossRef] [PubMed]
  111. Philips, A.; Milanowska, K.; Lach, G.; Boniecki, M.; Rother, K.; Bujnicki, J.M. MetalionRNA: Computational predictor of metal-binding sites in RNA structures. Bioinformatics 2012, 28, 198–205. [Google Scholar] [CrossRef] [PubMed]
  112. Philips, A.; Milanowska, K.; Ach, G.L.; Bujnicki, J.M. LigandRNA : Computational predictor of RNA—Ligand interactions. RNA 2013, 1605–1616. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Structural models of aptamers from several extensively studied riboswitch candidates. The structures can bind their nature ligands, such as S-adenosylmethionine (SAM), thiamine pyrophosphate (TPP), adenine and flavin mononucleotide (FMN), to form a ligand bound conformation. Except the SAM-II riboswitch from the Sargasso Sea metagenome [23,24] and TPP riboswitch aptamer from Neurospora crassa [25], other aptamers are from bacteria. Nucleotides within helices P1, P2, P3, P4, P5, and P6 found within the bound aptamers, are colored differently. The dash line denotes the long helix region in the structure of TPP riboswitch aptamer.
Figure 1. Structural models of aptamers from several extensively studied riboswitch candidates. The structures can bind their nature ligands, such as S-adenosylmethionine (SAM), thiamine pyrophosphate (TPP), adenine and flavin mononucleotide (FMN), to form a ligand bound conformation. Except the SAM-II riboswitch from the Sargasso Sea metagenome [23,24] and TPP riboswitch aptamer from Neurospora crassa [25], other aptamers are from bacteria. Nucleotides within helices P1, P2, P3, P4, P5, and P6 found within the bound aptamers, are colored differently. The dash line denotes the long helix region in the structure of TPP riboswitch aptamer.
Ijms 18 02442 g001
Figure 2. Schematic representation of riboswitch behaviors in cells. Nascent chain of the riboswitch and the ligand are colored red and blue respectively. The green line denotes the ribosome binding site (RBS) and RNA polymerase (RNAP) is denoted by yellow-green. The structure, structural changes, and co-transcriptional folding should be investigated for fully understanding the regulation mechanisms of riboswitches.
Figure 2. Schematic representation of riboswitch behaviors in cells. Nascent chain of the riboswitch and the ligand are colored red and blue respectively. The green line denotes the ribosome binding site (RBS) and RNA polymerase (RNAP) is denoted by yellow-green. The structure, structural changes, and co-transcriptional folding should be investigated for fully understanding the regulation mechanisms of riboswitches.
Ijms 18 02442 g002
Figure 3. Three-dimensional (3D) structure for THF riboswitch aptamer (PDB code: 3SD3) predicted by using RNAComposer program online.
Figure 3. Three-dimensional (3D) structure for THF riboswitch aptamer (PDB code: 3SD3) predicted by using RNAComposer program online.
Ijms 18 02442 g003
Figure 4. Relationship of landscapes in the BarMap approach (a) and structures in the helix-based computational method (b) between two adjacent steps. In (a), there are three types of events in landscapes: (i) A one-to-one correspondence between two minima (right); (ii) A two-to-one correspondence (left); and, (iii) a new local minima (marked by *) appears in the landscape at step N + 1. In (b), the triangles and red dots denote the newly transcribed nucleotide at step N and N + 1, respectively. For structures belonging to the first three types, their initial population at step N + 1 “directly inherit” from step N, while for type d, the initial population of the structure at step N + 1 is zero.
Figure 4. Relationship of landscapes in the BarMap approach (a) and structures in the helix-based computational method (b) between two adjacent steps. In (a), there are three types of events in landscapes: (i) A one-to-one correspondence between two minima (right); (ii) A two-to-one correspondence (left); and, (iii) a new local minima (marked by *) appears in the landscape at step N + 1. In (b), the triangles and red dots denote the newly transcribed nucleotide at step N and N + 1, respectively. For structures belonging to the first three types, their initial population at step N + 1 “directly inherit” from step N, while for type d, the initial population of the structure at step N + 1 is zero.
Ijms 18 02442 g004
Figure 5. The co-transcriptional folding kinetics (a,d) and refolding kinetics (b,e) of addA riboswitch aptamer (up panel) and SMK riboswitch (bottom panel). The transcription rates for addA aptamer and SMK riboswitch are 25 nt/s and 50 nt/s, respectively, and the main intermediate states formed during the transcription are shown in (c,f). The dash dot lines in (b) for addA riboswitch aptamer denote the unpaired nucleotides within the large inter-loop. OFF state of SMK riboswitch is the structure that is responsible for SAM binding shown in Figure 1.
Figure 5. The co-transcriptional folding kinetics (a,d) and refolding kinetics (b,e) of addA riboswitch aptamer (up panel) and SMK riboswitch (bottom panel). The transcription rates for addA aptamer and SMK riboswitch are 25 nt/s and 50 nt/s, respectively, and the main intermediate states formed during the transcription are shown in (c,f). The dash dot lines in (b) for addA riboswitch aptamer denote the unpaired nucleotides within the large inter-loop. OFF state of SMK riboswitch is the structure that is responsible for SAM binding shown in Figure 1.
Ijms 18 02442 g005
Table 1. Methods which have been used for modeling 3D structure for aptamers.
Table 1. Methods which have been used for modeling 3D structure for aptamers.
MethodsDescriptionAvailabilityReference
RNAComposerA motif library-based method that uses the dictionary tailored from RNA FRABASED database to build initial 3D structure.Web server[39]
RosettaA fragment-based method that uses FARFAR optimizes RNA conformations in the context of a physically realistic energy function.Local installation[72]
RMdetectA bioinformatics tool for identifying known 3D structural modules on genomic sequences.Local installation[73]
JAR3DScoring sequences to motif groups based on sequences’ ability to form the same pattern of interactions in motif.Web server[70]
RAGTOPPredicting RNA topologies by a coarse-grained sampling of 3D graphs guided by statistical knowledge-based potentials.Not available online[74]
iFoldRNAIncorporating SHAPE into discrete molecular dynamics to predict RNA structure.Web server[75]

Share and Cite

MDPI and ACS Style

Gong, S.; Wang, Y.; Wang, Z.; Zhang, W. Computational Methods for Modeling Aptamers and Designing Riboswitches. Int. J. Mol. Sci. 2017, 18, 2442. https://doi.org/10.3390/ijms18112442

AMA Style

Gong S, Wang Y, Wang Z, Zhang W. Computational Methods for Modeling Aptamers and Designing Riboswitches. International Journal of Molecular Sciences. 2017; 18(11):2442. https://doi.org/10.3390/ijms18112442

Chicago/Turabian Style

Gong, Sha, Yanli Wang, Zhen Wang, and Wenbing Zhang. 2017. "Computational Methods for Modeling Aptamers and Designing Riboswitches" International Journal of Molecular Sciences 18, no. 11: 2442. https://doi.org/10.3390/ijms18112442

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop