Exploring Successful Parameter Region for Coarse-Grained Simulation of Biomolecules by Bayesian Optimization and Active Learning

Kanada, Ryo; Tokuhisa, Atsushi; Tsuda, Koji; Okuno, Yasushi; Terayama, Kei

doi:10.3390/biom10030482

Open AccessArticle

Exploring Successful Parameter Region for Coarse-Grained Simulation of Biomolecules by Bayesian Optimization and Active Learning

by

Ryo Kanada

^1,*

,

Atsushi Tokuhisa

^1,2,3,

Koji Tsuda

^4,5,6,

Yasushi Okuno

^1,3,7 and

Kei Terayama

^3,5,7,*

¹

RIKEN Compass to Healthy Life Research Complex Program, Kobe 650-0047, Japan

²

RIKEN Center for Computational Science, Kobe 650-0047, Japan

³

RIKEN Medical Sciences Innovation Hub Program, Yokohama 230-0045, Japan

⁴

Graduate School of Frontier Sciences, he UTniversity of Tokyo, Kashiwa 277-8561, Japan

⁵

RIKEN Center for Advanced Intelligence Project, Tokyo 103-0027, Japan

⁶

Research and Services Division of Materials Data and Integrated System, National Institute for Materials Science, Tsukuba 305-0047, Japan

⁷

Graduate School of Medicine, Kyoto University, Kyoto 606-8507, Japan

^*

Authors to whom correspondence should be addressed.

Biomolecules 2020, 10(3), 482; https://doi.org/10.3390/biom10030482

Submission received: 15 February 2020 / Revised: 11 March 2020 / Accepted: 19 March 2020 / Published: 21 March 2020

(This article belongs to the Special Issue Application of Artificial Intelligence for Medical Research)

Download

Browse Figures

Versions Notes

Abstract

Accompanied with an increase of revealed biomolecular structures owing to advancements in structural biology, the molecular dynamics (MD) approach, especially coarse-grained (CG) MD suitable for macromolecules, is becoming increasingly important for elucidating their dynamics and behavior. In fact, CG-MD simulation has succeeded in qualitatively reproducing numerous biological processes for various biomolecules such as conformational changes and protein folding with reasonable calculation costs. However, CG-MD simulations strongly depend on various parameters, and selecting an appropriate parameter set is necessary to reproduce a particular biological process. Because exhaustive examination of all candidate parameters is inefficient, it is important to identify successful parameters. Furthermore, the successful region, in which the desired process is reproducible, is essential for describing the detailed mechanics of functional processes and environmental sensitivity and robustness. We propose an efficient search method for identifying the successful region by using two machine learning techniques, Bayesian optimization and active learning. We evaluated its performance using F1-ATPase, a biological rotary motor, with CG-MD simulations. We successfully identified the successful region with lower computational costs (12.3% in the best case) without sacrificing accuracy compared to exhaustive search. This method can accelerate not only parameter search but also biological discussion of the detailed mechanics of functional processes and environmental sensitivity based on MD simulation studies.

Keywords:

coarse-grained molecular dynamics simulation; biological rotary motor; machine learning; active learning; bayesian optimization

Graphical Abstract

1. Introduction

Owing to significant advancements in molecular structural analysis such as electron cryo-microscopy (Cryo-EM) and X-ray free-electron laser (XFEL), thousands of three-dimensional biomolecular structures including macromolecules and complex systems have been revealed. Furthermore, with the progress of molecular dynamics (MD) simulations using GPUs and supercomputers, it has become possible to understand the dynamics and behavior of biomolecular systems. Although all-atom molecular dynamics (AA-MD) simulation is a powerful tool for studying the dynamics of a biomolecule, it remains difficult to simulate entire functional processes of large molecules such as membrane proteins and motor proteins whose typical time scales are around milliseconds or longer. For example, even using a special-purpose supercomputer dedicated to AA-MD simulations, the protein folding process can be simulated only for proteins that are small and have a relatively high folding speed [1]. To overcome this problem, the application of coarse-grained (CG) MD simulations has been attracting attention because of its lower calculation cost compared to that of AA-MD. In fact, CG-MD simulation has succeeded in qualitatively reproducing numerous biological processes for various biomolecules [2,3,4,5,6,7,8,9,10,11,12]. Especially, C

_{α}

switching Go model [3,5] and multiple basin Go model [13] can realize large conformational change of protein easily by using available multiple native structures as references to construct model interaction.

However, CG-MD simulations strongly depend on the various model and environmental parameters and their tuning is needed to reproduce desired biological processes. In general, many CG-parameters are directly or indirectly influenced by the environment such as the temperature and ionic strength and mutation. Particularly, determining CG-parameters for a larger system with drastic conformational changes and interactions between multiple-chains is quite difficult and generally results in some uncertainty related to model parameters: for example, in the multiple-basin model [13] coupling parameter

Δ

and relative stability

Δ V

, in Langevin simulation of CG-model the friction constant

Γ

, inter-molecular interaction strength, parameter related to ion-strength dependence and so on. Various methods for determining these parameters such as force matching [14] and fluctuating matching [15] have been proposed. However, in many cases, these methods are very computationally expensive and cannot always determine valid parameters. As a result, such tuning is often performed manually.

Furthermore, to understand molecular mechanics through simulations, it is important to investigate the region of successful parameters (in other words, a phase diagram) that reproduce a targeted process. For example, a phase diagram of environmental parameters such as temperature, ionic strength and mutation is expected to provide information on the sensitivity or robustness to environmental changes and mutations of the target molecule. Particularly, the range of CG-parameters related to mutation may provide insights into the design new molecule with better functions. A few studies [4,16] have been conducted to systematically investigated and validated the dependence of the CG-model parameters by drawing a phase diagram and the limitation of CG-models have been discussed, including some parameter uncertainty.

However, exhaustively examining all candidate parameters is inefficient. Specifically, as simulations are performed stochastically, their results vary depending on a seed of the random variable and initial conditions of MD. To determine whether a process occurs stably (i.e., beyond a certain probability) under a certain parameter, it is necessary to repeatedly perform calculations while changing the conditions. As a result, the computational cost of exhaustively examining parameters with MD simulations is extremely high.

In recent years, various parameter optimization methods [17] such as Bayesian optimization (BO) and evolutionary algorithms have been proposed in the field of machine learning and applied to a wide range of actual problems such as parameter optimization of deep neural networks [18,19], combination of materials [20], and protein design [21]. Most parameter optimization techniques effectively find the optimal parameter. However, it is not necessarily appropriate to efficiently search for parameters beyond a certain criterion. In contrast, one of the authors recently proposed an effective sampling method [22,23] for constructing phase diagrams based on uncertainty sampling (US), a type of active learning technique. The method based on US can efficiently determine phases to examine the phase boundary preferentially when two or more phases are sampled. By regarding successful parameters and the failed parameters as two phases, it is possible to efficiently search for successful parameters. However, if the number of successful parameters is small, the efficiency of the method is considered to be poor, as boundary sampling becomes difficult because of difficulties in detecting successful parameters.

In this study, we propose a method named BOUS that efficiently samples in successful regions by combining BO and US described above to overcome the computational cost of parameter search in MD simulations. BOUS first searches for a successful parameter based on the success rate of the targeted process by using BO, and then switches to the US to efficiently search for successful parameter regions. To evaluate the performance of BOUS, we applied sampling methods including BOUS to parameter search problems of the rotational motion for F1-ATPase based on CG-MD simulations. We performed the CG-MD simulations based on two types of dynamics, Newtonian and Langevin. We also evaluated the sampling performances of other sampling methods: exhaustive search, random sampling (RS), US, and BO. The results showed that BOUS, BO, and US identified successful regions and construct a phase diagram with drastically reduced computation compared to exhaustive search and RS. In addition, BOUS showed better performances than BO and US.

Moreover, we confirmed that the rotational motion of the F1-motor was reproduced over a wide range containing parameters that were not reported in existing studies. We also discussed the stability information against parameter perturbation based on the constructed phase diagrams of successful parameters. These results suggest that deeper mechanical and biological discussions can be accelerated by efficiently drawing phase diagrams. Our implementation is available at https://github.com/tsudalab/SPEMD.

2. Materials and Methods

2.1. Machine Learning Based Region Search of Successful Parameters

This study was conducted to effectively identify the region of successful parameters that stably realize the desired dynamics, e.g., rotation of the F1-motor, even if random conditions such as initial velocity distribution are altered. We refer to a parameter that achieved a certain success rate or more as a successful parameter in this paper. The detailed definition of the success rate for the rotation F1-motor is described in a later section. We propose a method named as BOUS for efficiently sampling all successful parameters in the parameter space by combining BO and US. Figure 1a–c show flowcharts of the sampling methods on the basis of BO, US, and BOUS. After explaining the approaches of BO and US, the details of BOUS are described.

2.1.1. Parameter Sampling By Bo

Recently, as a type of data-driven approach, BO has been widely used as an efficient search method for parameters that satisfy a desired property [17,24]. As typical application examples, BO was used to efficiently search for the best network in deep learning [18] to optimize network parameters such as the number of networks and learning rates, and the best combination of a large number of materials [20].

Figure 1a shows the procedure used for a high score parameter search using BO. First, some parameters are selected randomly, and their scores are obtained by simulations as initialization. Next, the BO framework recommends a parameter to be examined next based on the obtained dataset of the checked parameters and their results, using a machine learning method called Gaussian process [24]. The expected mean and variance of the score for each parameter set are predicted by Gaussian process, as shown in the upper figure in Figure 1a. Then, the next parameter is selected by using an acquisition function, as shown in the lower figure in Figure 1a. Here, we used the expected improvement [25] as the acquisition function. Next, the simulation for the recommended parameter is performed and its score is added to the dataset. By repeating the process of a recommendation and simulation, we can efficiently search for the parameter with a desired property.

In BO, the score is basically assumed to be a quantitative value. In F1-motor simulation, we can apply the BO framework by regarding the success rate of its rotation as the score to efficiently find the parameter with the highest rate. However, BO may be inefficient for successful region search because BO is specialized for searching for parameters with higher scores.

In this study, we adopted the COMBO library [26] as an implementation of BO. COMBO is one of the accelerated implementation of BO using the approximation of the Gaussian process based on the random feature map [27]. This approach can be used without tuning of hyperparameters, as they are automatically determined by maximizing the type-II likelihood [24] in COMBO.

2.1.2. Parameter Sampling by Us

An active learning-based approach proposed by some authors for efficiently constructing a phase diagram [22] is expected to be useful for identifying regions of successful parameters. The approach adopted uncertainty sampling, which is a type of active learning, for preferentially sampling around predicted boundaries of multiple discrete classes by machine learning to quickly determine the boundaries. In this study, we applied the US-based sampling framework to F1-motor simulations by regarding successful and failed parameters as two phases. However, when only one phase is identified, any boundary of the phase cannot be predicted, resulting in inefficient sampling behavior close to the random sampling. In this research, if the criterion of the success rate was high, the region of successful parameters was narrow, making it difficult to detect the successful parameters.

Figure 1b shows the procedure of successful region search based on US. We implemented the method based on a combination of the label propagation (LP) [28] and the least confidence method [29], which showed the best performance among algorithms described in previous studies [22,23]. We assigned a successful or failed label to each parameter based on a success rate. In the sampling procedure of US, we first predicted the probability distributions for all unexamined parameters using LP, as shown in the upper figure in Figure 1b. Next, the least confidence method was used to select the parameter with the largest uncertainty, as shown in the lower figure in Figure 1b. The uncertainty score for each parameter x is calculated as follows:

\begin{matrix} \begin{matrix} u (x) = 1 - max_{p} P (p | x) \end{matrix}, \end{matrix}

(1)

where p is a label (success or failure) and

P (p | x)

is the predicted probability of p of the parameter

x

. That is, if the predicted probabilities of success and failure are almost the same, the score is relatively high. We used default hyperparameters in the previous paper [22]. It was shown that US works well under default parameters.

2.1.3. Combination of BO and US

Based on the advantages and disadvantages of BO and US described above, we propose the use of BOUS for searching for a successful parameter based on BO initially, and samples based on US to cover successful parameters after finding a successful parameter. Figure 1c shows the procedure of BOUS. Even if the region of the successful parameter is narrow, BOUS can efficiently find a successful parameter by BO. In the BOUS implementation, we use COMBO as a BO implementation with the success rate data to search until a successful parameter is found. After it is found, we assign successful or failed labels to the checked parameters according to the success rate and switch to the search based on US as described above.

2.2. F1-Motor and CG-MD Simulation

F1-ATPase is a biological rotary motor that catalyzes the hydrolysis of ATP into ADP and Pi. As shown in Figure 2a, the F1-motor consists of seven subunits

α_{3} β_{3} γ

of which x-ray crystal structure (Abrahams et al., 1994 [30]) was obtained from the PDB entry of 1BMF. The central stalk

γ

is a rotor, while the alternately arranged

α_{3}

and

β_{3}

subunits form a ring acting as the stator of the rotary motor. In x-ray structure nucleotide state of each subunit is classified three states: an ATP bound state “TP”, ADP-bound state “DP”, and nucleotide free state “E”. The

γ

-subunit rotates counterclockwise unidirectionally in discrete 120-degree steps as each ATP is hydrolyzed [31].

In this study, to conduct MD simulation for rotational motion of

γ

subunit in F1-motor by using CafeMol software [32] according to simulation scheme as shown in Figure S1. We applied structure-based C

_{α}

switching Go model [3] in which one particle represents each amino acid in proteins. The total energy for the F1-motor consists of four terms:

V = V_{γ} + V_{α_{3} β_{3}} + V_{α_{3} β_{3} - γ} + V_{a n c h o r}

. All parameters described by Koga’s work [3] were used, except for one related to excluded volume interaction (EVI) between

γ

and

α_{3} β_{3}

subunits:

V_{α_{3} β_{3} - γ}

is defined as

\begin{matrix} V_{α_{3} β_{3} - γ} = \sum_{i \in α_{3} β_{3}, j \in γ} E V I \times ϵ {(C / r_{i j})}^{12}, \end{matrix}

(2)

where EVI is one of the targets evaluated in the parameter survey. See the supplemental text-S1 for the detailed information of potentials:

V_{γ}

,

V_{α_{3} β_{3}}

,

V_{α_{3} β_{3} - γ}

,

V_{a n c h o r}

, and simulation protocol.

2.3. Simulation Dynamics and Sampled Parameter Space

To demonstrate that our optimization method is robust regardless of the type of dynamics of CG simulation and dimension of sampled parameter space, we simulated the time evolution of the CG F1-system with two representative dynamics: Newtonian dynamics and under-damped Langevin dynamics.

For time integration of Newton dynamics “velocity Verlet” algorithm [33] was used with a step size of

d t = 0.1

. Time steps for each phase

i (= 1, 2, 3, 4)

for switching Go (see Supplementary Figure S1) is

8 \times 10^{4}

. To achieve a constant temperature ensemble, velocity rescaling [34] was employed. The mass m for all amino acids was identical:

m = 10

. For the Newtonian dynamics system, we investigated a 2-dimensional parameter space defined by temperature

T \times 10^{3} = 2^{0}, 2^{1} \dots 2^{20} [K]

and the excluded volume interaction strength EVI

= 2^{0}, 2^{1} \dots 2^{11}

. For each combination of parameters (T and EVI) we conducted 10 trial simulations from the same initial structure (the 1994 complex structure) with different initial velocity conditions. See the supplemental text-S2 for the detailed of the investigation related to underdamped Langevin dynamics.

2.4. Definition of Success Rate for the Rotation of $γ$ Subunit in F1-Motor

In CG-simulation, we observed the rotation angle

θ (t)

of

γ

subunit from the initial structure as shown in Figure 2a (See the supplemental text-S3 for the detailed definition of the rotation angle

θ

). Figure 2b shows an example of the time-series of the rotation angle

θ

. Because we changed the reference structure

X_{i}

and potential

V_{α 3 β 3} (R | X_{i})

for each phase i(

= 1, 2, 3, 4

) according to switching Go-model scheme (Supplementary Figure S1), in an energy minimum state (

T = 0

K) the rotation angle

θ

of each phase i should ideally agree with

(i - 1) \times 120

(degree), named as the “Reference angle”. The reference angle increases in a step-wise manner as shown in Figure 2b. However, because of the thermal noise at the finite temperature T in each phase i the rotation angle

θ (t)

fluctuates and deviates from the reference angle stochastically. Furthermore, because of the influence of inertia (

m = 10.0

) and friction (

Γ

), even after switching the reference structure

X_{i}

, the rotation angle cannot respond to changes in the reference angle instantaneously. A finite relaxation time (delay) is required to achieve a rotation angle of 120 degrees after potential switching.

Therefore we set the tolerance range for the F1-motor rotation angle

θ

as shown in Figure 2b: this area allows ± 60-degree deviation of the rotation angle from the reference and allows for a delay in angle response within the half time step in each phase after the switching potential. The goodness score of F1-motor rotation is defined as the ratio of the angle

θ

s in the tolerance range during the whole trajectory. In this study, we defined a trial simulation with a goodness score of over 95% as a successful trial. The success rate is defined as the ratio of the number of successful trials out of 10 trials with different initial conditions for each parameter.

3. Results

3.1. Sampling Performances for F1-Motor Simulations Using Newtonian Dynamics

Figure 3a shows the success rate distribution of the F1-motor rotation based on Newtonian dynamics simulations for all parameters. Figure 3a-1,a-2 are examples of angle trajectories of F1-motor rotations in Figure 3a, and their success rates were 1 and 0.2, respectively. In this system, the rotation tended to be successful when the temperature was quite low, around 1 K, and when the EVI was as large as several tens to several hundreds of times of the default parameter. The region of successful parameter changes is shown in Figure 3b according to the criteria (Success threshold

τ

).

We applied the sampling algorithms BOUS, US, BO, and RS in Newtonian dynamics simulations. Figure 3c–f are typical examples of sampling results with the success threshold of 0.9 from the 10 same initial points using BOUS, US, BO, and RS, respectively. Each number in parentheses shows the number of sampled parameters. BOUS and BO began to search for successful parameters based on Bayesian optimization using the success rate information (Figure 3a), as the first 10 points did not contain successful parameters. BOUS switched to the US sampling immediately after finding the initial successful parameter at the 25th sampling and searched around the boundary between the successful and failed parameters. Finally, BOUS succeeded in searching for all (eight) successful parameters with only 41 samplings as shown in Figure 3c. In contrast, BO found the same successful parameter at the 25th sampling, but only four successful parameters were obtained at the 80th sampling, as shown in Figure 3e. The US succeeded in searching for all successful parameters with 80 samples (Figure 3d) because of intensive sampling around the boundary between the successful and failed parameters soon after finding the first successful point. In this trial, 64 points were required to find a successful parameter using RS. As a result, although 80 points were sampled in Figure 3f, only two successful parameters were obtained using RS.

To accurately evaluate the performances of the sampling methods, we calculated the performances of successful parameter search by randomly changing the initial points. Figure 4a–e show the numbers of detected successful parameters as functions of the number of sampling points for each success threshold. Here, the initial sampling number was fixed to 10. To evaluate the effect of different initial samplings, we repeated 100 trials using different initial points and averaged the results. We listed the number of iterations in the best and worst cases to find all successful parameters in Supplemental Table S1. As shown in Figure 4a–e, BOUS, US, and BO showed better performances compared to RS; particularly, BOUS surpassed the other machine learning-based approaches. For example, when the threshold

τ

= 1, BOUS identified all successful parameters with 15 iterations in the best case and with 40 iterations in the worst case as shown in Supplemental Table S1. The reason why the performance of BOUS was higher than that of US, particularly for larger thresholds, is considered to be that successful parameters can be searched quickly based on Bayesian optimization in the early stages of sampling. We showed the search performance to detect a successful parameter using BO and RS in Figure 4f. Each line is the average of 100 search trials with different initial parameters. The result showed that BO required only 30 samples to find a successful parameter even in the worst case, while more than 100 samples were needed by RS in some cases. For US, if there is only one phase, i.e., no successful cases have been found, the search by US will be closer to RS because no boundary is defined. The performance of US is considered to be inferior to BOUS, as time is required to find a successful parameter when the threshold is large. In contrast, when the threshold was less than 0.7, the performance of BOUS was nearly the same as that of US. This may be because when the threshold is small, the region of the successful parameter is large and can thus be immediately found by US. BO showed relatively high performances when the threshold was 1.0. In this case, the number of successful parameters was five, and BO could find these parameters during searching for the highest rate. On the other hand, it was confirmed that BO is not suitable for the exhaustive search of the successful parameters because the successful parameter is large when the threshold is small. We summarized the reduced performances of sampling iterations of BOUS, US, BO, and RS compared to exhaustive search in Table 1. The listed values of each sampling algorithm are the numbers of samplings required to detect all successful parameters with a probability of 95% or higher. The values in parentheses indicate the ratios of the number of calculations to the exhaustive search. From this result, the computational cost to identify the successful region can be reduced to 12.3% compared to exhaustive search by using BOUS when

τ

= 1.

3.2. Sampling Performance of F1-Motor Simulations Using Langevin Dynamics

We show the success rate distribution of all parameters based on Langevin dynamics simulations in Figure 5a, using two-dimensional maps with different

Γ

s. The simulation results seem noisy compared to those obtained by Newtonian dynamics and the maximum success rate was 0.9. Successful parameters were found to exist over a relatively wide temperature range from 50 to over 200 K and a wide range of the excluded volume term from 8 to 256. Figure 5b shows the distribution of the successful parameter when the success threshold

τ

is 0.8. Distributions of the successful parameter with other values of threshold

τ

are shown in Supplemental Figure S2.

Figure 5c–g show the numbers of detected successful parameters as functions of the number of sampling points using BOUS, US, BO, and RS. The success threshold in Figure 5c–g were 0.9, 0.8, 0.7, 0.6, and 0.5, respectively. Each dashed line shows the number of successful parameters. The initial number of sampling trials was also fixed to 10. To evaluate the effect of different initial samplings, we also repeated 100 trials using different initial points. These results show that BOUS, US, and BO achieved better performance compared to RS, and BOUS was the best sampling method among them as Newtonian dynamics cases. Furthermore, the results of successful parameter searches by BO and RS are shown in Figure 5h. Each line is the average of 100 search trials with different initial parameters. This result shows that a successful parameter can be found efficiently by using BO, even in a relatively noisy distribution of success rate (Figure 5a) and in the three-dimensional parameter space. There was a tendency for the efficiency to deteriorate as the success threshold

τ

was reduced. This is thought to be mainly because the smaller the threshold

τ

value is, the larger the number of successful parameters and thus the larger the number of samples required to enumerate all successful parameters. We also summarized the reduction performance of sampling iterations of BOUS, US, BO and RS compared to exhaustive search in Supplemental Table S2. When the threshold is 0.8, the number of samplings can be reduced to 18.9% by using BOUS compared to the exhaustive search of the parameter space. These results confirmed that machine learning-based approaches were effective for F1-motor simulation with Langevin dynamics and the combination of BO and US (BOUS) enabled efficient identification of the successful region.

4. Discussion

By conducting Newtonian dynamics and under-damped Langevin dynamics simulations of a CG-model with a few specific parameters, the past works [3,32] (Koga’s work and CafeMol manual) showed that the F1 motor can make rotational motion:

T = 4 \times 10^{- 3}

K,

2 \times 10^{- 2}

K, and EVI =

129.75

for Newtonian dynamics and

T = 100

K, EVI =

129.75

, and

Γ = 0.05

for Langevin dynamics were presented as representative successful parameters. In contrast, in this study, by drawing phase diagrams as shown in Figure 3a,b for Newtonian dynamics and Figure 5a,b for Langevin dynamics, we elucidated that the F1 motor can reproduce the rotational motion with a high success rate over wider parameter areas than the (localized) specific parameters used in previous studies [3,32] (and an example in CafeMol manual). The time trajectories displayed in Supplemental Figure S3a,b for Newtonian dynamics show that even at a higher temperature

T ≃ 2.0 \sim 8.2

(than

2 \times 10^{- 2}

) and lower EVI

= 16 \sim 64

(than

129.75

), the success rate for rotational motion of the F1 motor can be significantly high (

0.8 \sim 1

). Similarly, the time trajectories for Langevin dynamics in Supplemental Figure S4b,c showed that even at a higher temperature

T \geq 160

K and lower EVI

\leq 32 \sim 64

than those in past work [32] (Figure S4a), a high success rate could be realized. Obtaining this kind of knowledge is one of the advantages of drawing a phase diagram in a wide parameter space of CG-model.

Clarifying the area of successful parameters will enable a detailed analysis of the dynamics and mechanisms to realize important functions of target biomolecules. For example, our results provide insight into the effect of friction

Γ

on the success rate from the phase diagram Figure 5a: while in the higher temperature area

T \geq 280

K, the difference in success rates between low friction

Γ = 0.05

and high friction

Γ = 0.25

is not so significant, at the lower temperature

T \leq 70

K, the success rate seems to be decreased with higher friction

Γ

. These tendencies may be apprehended from the simulated trajectories of the rotational angle in each parameter: at the lower temperature (

T = 70

K), compared to the rapid response for the smaller friction (

Γ = 0.05

), the higher friction (

Γ = 0.25

) caused a slower response and tended to fail rotational motion, as shown in Supplemental Figure S5. However, at the higher temperature

T = 280 K

, the fluctuation amplitude of rotation with lower friction (

Γ = 0.05

) frequently exceeded the tolerance range of the angle, resulting in a lower success rate which is comparable to that obtained with high (

Γ = 0.25

), as shown in Supplement Figure S6.

In our study, it seems that the temperature with a significant success ratio is relatively smaller than room temperature (

\sim 300

K) in which the F1 motor can rotate in-vitro experiments. This probably comes from the switching go model: the immediate high activation energy of the whole system accompanied by switching potential may cause unstable rotation of gamma. We guess that the success ratio at higher temperatures can be improved by applying a multiple-basin model which can suppress activation energy.

Identifying the region of successful parameters for the CG-model will also provide information on the robustness of the biological function of the target molecule to environmental changes (temperature, ion strength and so on) and mutations. In the phase diagram, the range of successful EVI:

[16 : 256]

for Langevin dynamics (Figure 5b) and

[8 : 128]

for Newtonian dynamics with threshold

τ = 0.8

(Figure 3) is assumed to be related to the robustness in undergoing rotational motion of F1 against mutations in residues between

γ

and

α 3 β 3

subunits (as the EVI parameter should depend on the residues type in the corresponding area). Although in this study, the EVI parameter was set to a uniform value for all residues between

γ

and

α 3 β 3

, a phase diagram with residue-dependent EVI parameters will be developed in our future studies.

To select the next parameter based on machine learning algorithms, computational costs (search time) are required for learning and selection. Supplemental Figure S7 shows the averaged search time of each algorithm at each sampling step in the Langevin dynamics simulations. These results show that the search time of BO was very short (approximately 10 s) compared to the simulation times. Furthermore, in the US, the search time was less than 1 s, and the search time decreased as the number of samplings increased. It is considered that label estimation, which is the most time-consuming step in US, converged quickly when the number of sampled points is large. Because BOUS initially uses BO, a relatively long search time is needed; however, after finding a successful parameter, this method switches to the US and the search time becomes very short. These search times are considered to be sufficiently short compared to time-consuming MD simulations.

In this study, we examined phase diagrams of parameter spaces with two phases, successful and failed. The phase diagram construction method based on US is also applicable in cases where there are more than two types of phases, as described in [22]. For example, in the case of the F1-motor system, because the substep (90 + 30) for 120-degree rotation was frequently observed at lower ATP concentrations in in-vitro experiments [35] there are three possibilities: 120-degree rotation at once (without substep), substep rotation (90 + 30 degree), and others (failure for rotation). It is considered effective to apply US simply or search for all phases using BO and then switch to US like BOUS, when there are more than two types of phases. As future work, we will apply our new methodology to other biological systems to evaluate the potential of our machine learning-based approach.

5. Conclusions

In this study, we showed that parameter sampling methods based on machine learning algorithms are effective for identifying the region of successful parameters. In particular, BOUS showed better performances compared to existing methods Bayesian optimization (BO) and Uncertainty Sampling (US), which is proposed to make a phase diagram efficiently. We confirmed that our methods were effective for F1-ATPase using different calculation methods, Newtonian and Langevin dynamics, and with different dimensions of parameters. We confirmed that the F1 motor can rotate successfully in a wider range of parameter space than previously reported. Basically, the machine learning-based approaches (BOUS, US, and BO) works if the success rate (success or failure) of a given parameter can be calculated in any simulation and modeling. Therefore, these methods can be widely applicable to studies of CG models that contain not only Go-model but also non-Go-model such as an elastic network model, as long as there is a successful area. Our published implementation can be used as an alternative for parameter determination approaches for biomolecule such as force matching [14] and fluctuating matching [15].

Our methods can be applied to other target systems and calculation methods, including CG-MD, all-atom MD simulation, and others, as long as the success rate can be defined. Especially, making the phase diagram of a specific CG model and biomolecule system with the proposed method would help to properly evaluate advantages and disadvantages such as applicable limits and bias of the CG model. In addition, it is possible to use the proposed methods even if the dimension of the parameter is four or more. However, BO becomes inefficient if the parameter dimension is too high; in this case, dimension reduction methods such as random embedding [36,37] would be effective.

Supplementary Materials

The following are available online at https://www.mdpi.com/2218-273X/10/3/482/s1.

Author Contributions

Conceptualization, R.K., A.T., Y.O., and K.T. (Kei Terayama); methodology, R.K., K.T. (Koji Tsuda), and K.T. (Kei Terayama); investigation, R.K., A.T., and K.T. (Kei Terayama); writing—original draft preparation, R.K. and K.T. (Kei Terayama); writing—review and editing, R.K., A.T., K.T. (Koji Tsuda), Y.O., and K.T. (Kei Terayama); supervision, K.T. (Koji Tsuda) and Y.O.; funding acquisition, A.T., K.T. (Koji Tsuda), and Y.O. All authors have read and agreed to the published version of the manuscript.

Funding

This article is supported by project subsidized by the New Energy and Industrial Technology Development Organization (NEDO) and the Ministry of Education, Culture, Sports, Science, and Technology of Japan (MEXT) as a “Priority Issue on Post-K Computer” (Building Innovative Drug Discovery Infrastructure through Functional Control of Biomolecular Systems). This work was also supported by the Research Complex Promotion Program from the Japan Science and Technology Agency and by JSPS KAKENHI Grant Number 19K06535. This research used computational resources of the Camphor2 of CRAY XC40 provided by Kyoto University through the High Performance Computing Infrastructure System Research Project (Project ID: hp180123, hp190105) and the supercomputer centers of RAIDEN of AIP (RIKEN).

Conflicts of Interest

The authors declare no conflict of interest.

References

Shaw, D.E.; Deneroff, M.M.; Dror, R.O.; Kuskin, J.S.; Larson, R.H.; Salmon, J.K.; Young, C.; Batson, B.; Bowers, K.J.; Chao, J.C.; et al. Anton, a special-purpose machine for molecular dynamics simulation. Commun. ACM 2008, 51, 91–97. [Google Scholar] [CrossRef]
Takada, S.; Kanada, R.; Tan, C.; Terakawa, T.; Li, W.; Kenzaki, H. Modeling Structural Dynamics of Biomolecular Complexes by Coarse-Grained Molecular Simulations. Accounts Chem. Res. 2015, 48, 3026–3035. [Google Scholar] [CrossRef] [PubMed]
Koga, N.; Takada, S. Folding-based molecular simulations reveal mechanisms of the rotary motor F1–ATPase. Proc. Natl. Acad. Sci. USA 2006, 103, 5367–5372. [Google Scholar] [CrossRef] [PubMed]
Okazaki, K.i.; Takada, S. Dynamic energy landscape view of coupled binding and protein conformational change: Induced-fit versus population-shift mechanisms. Proc. Natl. Acad. Sci. USA 2008, 105, 11182–11187. [Google Scholar] [CrossRef] [PubMed]
Koga, N.; Kameda, T.; Okazaki, K.i.; Takada, S. Paddling mechanism for the substrate translocation by AAA+ motor revealed by multiscale molecular simulations. Proc. Natl. Acad. Sci. USA 2009, 106, 18237–18242. [Google Scholar] [CrossRef]
Kanada, R.; Terakawa, T.; Kenzaki, H.; Takada, S. Nucleosome Crowding in Chromatin Slows the Diffusion but Can Promote Target Search of Proteins. Biophys. J. 2019, 116, 2285–2295. [Google Scholar] [CrossRef]
Levy, Y.; Onuchic, J.N.; Wolynes, P.G. Fly-Casting in Protein-DNA Binding: Frustration between Protein Folding and Electrostatics Facilitates Target Recognition. J. Am. Chem. Soc. 2007, 129, 738–739. [Google Scholar] [CrossRef]
Marrink, S.J.; Risselada, H.J.; Yefimov, S.; Tieleman, D.P.; de Vries, A.H. The MARTINI Force Field: Coarse Grained Model for Biomolecular Simulations. J. Phys. Chem. B 2007, 111, 7812–7824. [Google Scholar] [CrossRef]
Hills, R.D.; Lu, L.; Voth, G.A. Multiscale coarse-graining of the protein energy landscape. PLoS Comput. Biol. 2010, 6, e1000827. [Google Scholar] [CrossRef]
Wang, J.; Olsson, S.; Wehmeyer, C.; Pérez, A.; Charron, N.E.; de Fabritiis, G.; Noé, F.; Clementi, C. Machine Learning of Coarse-Grained Molecular Dynamics Force Fields. ACS Cent. Sci. 2019, 5, 755–767. [Google Scholar] [CrossRef]
Bahar, I.; Rader, A. Coarse-grained normal mode analysis in structural biology. Curr. Opin. Struct. Biol. 2005, 15, 586–592. [Google Scholar] [CrossRef] [PubMed]
Sambriski, E.J.; Schwartz, D.C.; de Pablo, J.J. A mesoscale model of DNA and its renaturation. Biophys. J. 2009, 96, 1675–1690. [Google Scholar] [CrossRef] [PubMed][Green Version]
Okazaki, K.; Koga, N.; Takada, S.; Onuchic, J.N.; Wolynes, P.G. Multiple-basin energy landscapes for large-amplitude conformational motions of proteins: Structure-based molecular dynamics simulations. Proc. Natl. Acad. Sci. USA 2006, 103, 11844–11849. [Google Scholar] [CrossRef] [PubMed]
Izvekov, S.; Voth, G.A. A Multiscale Coarse-Graining Method for Biomolecular Systems. J. Phys. Chem. B 2005, 109, 2469–2473. [Google Scholar] [CrossRef] [PubMed]
Li, W.; Wolynes, P.G.; Takada, S. Frustration, specific sequence dependence, and nonlinearity in large-amplitude fluctuations of allosteric proteins. Proc. Natl. Acad. Sci. USA 2011, 108, 3504–3509. [Google Scholar] [CrossRef] [PubMed]
Yao, X.Q.; Kenzaki, H.; Murakami, S.; Takada, S. Drug export and allosteric coupling in a multidrug transporter revealed by molecular simulations. Nat. Commun. 2010, 1, 117. [Google Scholar] [CrossRef] [PubMed]
Shahriari, B.; Swersky, K.; Wang, Z.; Adams, R.P.; De Freitas, N. Taking the human out of the loop: A review of Bayesian optimization. Proc. IEEE 2015, 104, 148–175. [Google Scholar] [CrossRef]
Snoek, J.; Larochelle, H.; Adams, R.P. Practical Bayesian Optimization of Machine Learning Algorithms. In Proceedings of the Advances in Neural Information Processing Systems, Lake Tahoe, NV, USA, 3–8 December 2012; pp. 2951–2959. [Google Scholar]
Jaderberg, M.; Dalibard, V.; Osindero, S.; Czarnecki, W.M.; Donahue, J.; Razavi, A.; Vinyals, O.; Green, T.; Dunning, I.; Simonyan, K. Population based training of neural networks. arXiv Prepr. 2017, arXiv:1711.09846. [Google Scholar]
Seko, A.; Togo, A.; Hayashi, H.; Tsuda, K.; Chaput, L.; Tanaka, I. Prediction of Low-Thermal-Conductivity Compounds with First-Principles Anharmonic Lattice-Dynamics Calculations and Bayesian Optimization. Phys. Rev. Lett. 2015, 115, 205901. [Google Scholar] [CrossRef]
Saito, Y.; Oikawa, M.; Nakazawa, H.; Niide, T.; Kameda, T.; Tsuda, K.; Umetsu, M. Machine-learning-guided mutagenesis for directed evolution of fluorescent proteins. ACS Synth. Biol. 2018, 7, 2014–2022. [Google Scholar] [CrossRef]
Terayama, K.; Tamura, R.; Nose, Y.; Hiramatsu, H.; Hosono, H.; Okuno, Y.; Tsuda, K. Efficient construction method for phase diagrams using uncertainty sampling. Phys. Rev. Mater. 2019, 3, 033802. [Google Scholar] [CrossRef]
Terayama, K.; Tsuda, K.; Tamura, R. Efficient recommendation tool of materials by an executable file based on machine learning. Jpn. J. Appl. Phys. 2019, 58, 098001. [Google Scholar] [CrossRef]
Rasmussen, C.E.; Williams, C.K.I. Gaussian Processes for Machine Learning; MIT Press: Cambridge, MA, USA, 2006. [Google Scholar]
Jones, D.R.; Schonlau, M.; Welch, W.J. Efficient global optimization of expensive black-box functions. J. Glob. Optim. 1998, 13, 455–492. [Google Scholar] [CrossRef]
Ueno, T.; Rhone, T.D.; Hou, Z.; Mizoguchi, T.; Tsuda, K. COMBO: An efficient Bayesian optimization library for materials science. Mater. Discov. 2016, 4, 18–21. [Google Scholar] [CrossRef]
Rahimi, A.; Recht, B. Random features for large-scale kernel machines. In Proceedings of the 20th International Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 3–6 December 2007; pp. 1177–1184. [Google Scholar]
Zhu, X.; Ghahramani, Z.; Lafferty, J.D. Semi-supervised learning using gaussian fields and harmonic functions. In Proceedings of the 20th International Conference on Machine Learning, Washington, DC, USA, 21–24 August 2003; pp. 912–919. [Google Scholar]
Lewis, D.D.; Gale, W.A. A sequential algorithm for training text classifiers. In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland, 3–6 July 1994; Springer: New York, NY, USA, 1994; pp. 3–12. [Google Scholar]
Abrahams, J.P.; Leslie, A.G.W.; Lutter, R.; Walker, J.E. Structure at 2.8 Â resolution of F1-ATPase from bovine heart mitochondria. Nature 1994, 370, 621–628. [Google Scholar] [CrossRef] [PubMed]
Yasuda, R.; Noji, H.; Kinosita, K.; Yoshida, M. F1-ATPase Is a Highly Efficient Molecular Motor that Rotates with Discrete 120 Steps. Cell 1998, 93, 1117–1124. [Google Scholar] [CrossRef]
Kenzaki, H.; Koga, N.; Hori, N.; Kanada, R.; Li, W.; Okazaki, K.i.; Yao, X.Q.; Takada, S. CafeMol: A Coarse-Grained Biomolecular Simulator for Simulating Proteins at Work. J. Chem. Theory Comput. 2011, 7, 1979–1989. [Google Scholar] [CrossRef]
Frenkel, D.; Smit, B. Understanding Molecular Simulation: From Algorithms to Applications; Academic Press: Cambridge, UK, 1996; p. 443. [Google Scholar]
Berendsen, H.J.; Postma, J.P.; Van Gunsteren, W.F.; Dinola, A.; Haak, J.R. Molecular dynamics with coupling to an external bath. J. Chem. Phys. 1984. [Google Scholar] [CrossRef]
Yasuda, R.; Noji, H.; Yoshida, M.; Kinosita, K.; Itoh, H. Resolution of distinct rotational substeps by submillisecond kinetic analysis of F1-ATPase. Nature 2001, 410, 898–904. [Google Scholar] [CrossRef]
Wang, Z.; Zoghi, M.; Hutter, F.; Matheson, D.; De Freitas, N. Bayesian optimization in high dimensions via random embeddings. In Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence, Beijing, China, 3–19 August 2013. [Google Scholar]
Wang, Z.; Li, C.; Jegelka, S.; Kohli, P. Batched high-dimensional bayesian optimization via structural kernel learning. In Proceedings of the 34th International Conference on Machine, Sydney, Australia, 6–11 August 2017; pp. 3656–3664. [Google Scholar]

Figure 1. Flowcharts of efficient region search of successful parameters by using Bayesian optimization (a) and uncertainty sampling (b). (c) The combination of Bayesian optimization (BO) and uncertainty sampling (US) (BOUS) for successful parameter region search.

Figure 2. (a) Structure of F

_{1}

-ATPase motor and counterclockwise rotation angle

θ

from initial structure. The red part corresponds to

γ

-subunit, and the blue and green parts correspond to the

β

and

α

subunits, respectively. (b) Example of time-series of the rotation angle

θ

. Black lines indicate the switched angle, referred to as the reference angle of the gamma-subunit in coarse-grained (CG)-molecular dynamics (MD) simulations. The light-green region shows the proper angle region, which is referred to as the tolerance angle. The goodness score of F1-motor rotation is defined as the ratio of the angle

θ

s in the tolerance range.

Figure 2. (a) Structure of F

_{1}

-ATPase motor and counterclockwise rotation angle

θ

from initial structure. The red part corresponds to

γ

-subunit, and the blue and green parts correspond to the

β

and

α

subunits, respectively. (b) Example of time-series of the rotation angle

θ

. Black lines indicate the switched angle, referred to as the reference angle of the gamma-subunit in coarse-grained (CG)-molecular dynamics (MD) simulations. The light-green region shows the proper angle region, which is referred to as the tolerance angle. The goodness score of F1-motor rotation is defined as the ratio of the angle

θ

s in the tolerance range.

Figure 3. (a) Success rate distribution of F1-motor rotation based on Newtonian dynamics simulations in the parameter space. (a-1) and (a-2) show examples trajectories of F1-motor rotations in (a). The success rates were 1 and 0.2, respectively. (b) Regions of successful parameters changing the criteria (success threshold

τ

) to ensure success. The yellow and blue grids are “success” and “failure” parameters in each threshold

τ

, respectively. (c–f) are typical examples of successful parameter search using BOUS, US, BO, and RS. The numbers of sampled parameters were 41 for BOUS and 80 for US, BO, and random sampling (RS).

Figure 3. (a) Success rate distribution of F1-motor rotation based on Newtonian dynamics simulations in the parameter space. (a-1) and (a-2) show examples trajectories of F1-motor rotations in (a). The success rates were 1 and 0.2, respectively. (b) Regions of successful parameters changing the criteria (success threshold

τ

) to ensure success. The yellow and blue grids are “success” and “failure” parameters in each threshold

τ

, respectively. (c–f) are typical examples of successful parameter search using BOUS, US, BO, and RS. The numbers of sampled parameters were 41 for BOUS and 80 for US, BO, and random sampling (RS).

Figure 4. Performance of successful parameter search for Newtonian dynamics simulations using BOUS, US, BO, and RS. The success thresholds were 1.0 (a), 0.9 (b), 0.8 (c), 0.7 (d), and 0.6 (e), respectively. The dashed blue line shows the total number of successful parameters for each threshold. (f) shows the efficiency of the search for best parameter detection using BO and RS with

τ = 1.0

.

Figure 4. Performance of successful parameter search for Newtonian dynamics simulations using BOUS, US, BO, and RS. The success thresholds were 1.0 (a), 0.9 (b), 0.8 (c), 0.7 (d), and 0.6 (e), respectively. The dashed blue line shows the total number of successful parameters for each threshold. (f) shows the efficiency of the search for best parameter detection using BO and RS with

τ = 1.0

.

Figure 5. Results of Langevin dynamics simulations. (a) Success rate distribution in the parameter space. (b) Successful parameter region (yellow) with

τ = 0.8

. (c–g) show performances of parameter region search using BOUS, US, BO, and RS. The success thresholds are 0.9 (c), 0.8 (d), 0.7 (e), 0.6 (f), and 0.5 (g), respectively. (h) shows the performance of BO and RS for the best parameter search.

Figure 5. Results of Langevin dynamics simulations. (a) Success rate distribution in the parameter space. (b) Successful parameter region (yellow) with

τ = 0.8

. (c–g) show performances of parameter region search using BOUS, US, BO, and RS. The success thresholds are 0.9 (c), 0.8 (d), 0.7 (e), 0.6 (f), and 0.5 (g), respectively. (h) shows the performance of BO and RS for the best parameter search.

Table 1. Numbers of samplings required to detect all successful parameters with a probability of 95% or higher using BOUS, US, BO, and RS. SP indicates the number of successful parameters. The values in parentheses indicate the ratios of the number of calculations in the exhaustive search. The total number of parameter candidates was 252.

$τ$	SP	BOUS	US	BO	RS
1.0	5 (1.98%)	31 (12.3%)	115 (45.6%)	67 (26.6%)	237 (94.0%)
0.9	8 (3.17%)	36 (14.3%)	90 (35.7%)	100 (39.7%)	241 (95.6%)
0.8	15 (5.95%)	45 (17.9%)	56 (22.2%)	114 (45.2%)	242 (96.0%)
0.7	22 (8.73%)	60 (23.8%)	65 (25.8%)	129 (51.2%)	238 (94.4%)
0.6	29 (11.5%)	80 (31.7%)	82 (32.5%)	146 (57.9%)	241 (95.6%)

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kanada, R.; Tokuhisa, A.; Tsuda, K.; Okuno, Y.; Terayama, K. Exploring Successful Parameter Region for Coarse-Grained Simulation of Biomolecules by Bayesian Optimization and Active Learning. Biomolecules 2020, 10, 482. https://doi.org/10.3390/biom10030482

AMA Style

Kanada R, Tokuhisa A, Tsuda K, Okuno Y, Terayama K. Exploring Successful Parameter Region for Coarse-Grained Simulation of Biomolecules by Bayesian Optimization and Active Learning. Biomolecules. 2020; 10(3):482. https://doi.org/10.3390/biom10030482

Chicago/Turabian Style

Kanada, Ryo, Atsushi Tokuhisa, Koji Tsuda, Yasushi Okuno, and Kei Terayama. 2020. "Exploring Successful Parameter Region for Coarse-Grained Simulation of Biomolecules by Bayesian Optimization and Active Learning" Biomolecules 10, no. 3: 482. https://doi.org/10.3390/biom10030482

APA Style

Kanada, R., Tokuhisa, A., Tsuda, K., Okuno, Y., & Terayama, K. (2020). Exploring Successful Parameter Region for Coarse-Grained Simulation of Biomolecules by Bayesian Optimization and Active Learning. Biomolecules, 10(3), 482. https://doi.org/10.3390/biom10030482

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Exploring Successful Parameter Region for Coarse-Grained Simulation of Biomolecules by Bayesian Optimization and Active Learning

Abstract

1. Introduction

2. Materials and Methods

2.1. Machine Learning Based Region Search of Successful Parameters

2.1.1. Parameter Sampling By Bo

2.1.2. Parameter Sampling by Us

2.1.3. Combination of BO and US

2.2. F1-Motor and CG-MD Simulation

2.3. Simulation Dynamics and Sampled Parameter Space

2.4. Definition of Success Rate for the Rotation of $γ$ Subunit in F1-Motor

3. Results

3.1. Sampling Performances for F1-Motor Simulations Using Newtonian Dynamics

3.2. Sampling Performance of F1-Motor Simulations Using Langevin Dynamics

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Exploring Successful Parameter Region for Coarse-Grained Simulation of Biomolecules by Bayesian Optimization and Active Learning

Abstract

1. Introduction

2. Materials and Methods

2.1. Machine Learning Based Region Search of Successful Parameters

2.1.1. Parameter Sampling By Bo

2.1.2. Parameter Sampling by Us

2.1.3. Combination of BO and US

2.2. F1-Motor and CG-MD Simulation

2.3. Simulation Dynamics and Sampled Parameter Space

2.4. Definition of Success Rate for the Rotation of γ Subunit in F1-Motor

3. Results

3.1. Sampling Performances for F1-Motor Simulations Using Newtonian Dynamics

3.2. Sampling Performance of F1-Motor Simulations Using Langevin Dynamics

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2.4. Definition of Success Rate for the Rotation of $γ$ Subunit in F1-Motor