Next Article in Journal
Cable-Driven End-Effector Robots for Gait Rehabilitation: A Review and Future Research Directions
Previous Article in Journal
Neuromorphic Control of Robotic Systems with Numerical Simulations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Sign Gradient Descent Algorithms for Accelerated Kinetostatic Protein Folding in Nanorobotics Design

by
Alireza Mohammadi
1,* and
Mohammad Al Janaideh
2
1
Department Electrical & Computer Engineering, University of Michigan, Dearborn, MI 48128, USA
2
Department Mechanical Engineering, University of Guelph, Guelph, ON N1G 2W1, Canada
*
Author to whom correspondence should be addressed.
Robotics 2025, 14(11), 167; https://doi.org/10.3390/robotics14110167
Submission received: 28 September 2025 / Revised: 7 November 2025 / Accepted: 13 November 2025 / Published: 17 November 2025
(This article belongs to the Section Intelligent Robots and Mechatronics)

Abstract

Numerical simulations of protein folding enable the design of protein-based nanomachines and nanorobots by predicting folded three-dimensional protein structures with high accuracy and revealing the protein conformation transitions during folding and unfolding. In the kinetostatic compliance method (KCM) for folding simulations, protein molecules are represented as ensembles of rigid nano-linkages connected by chemical bonds, and the folding process is driven by the kinetostatic influence of nonlinear interatomic force fields until the system converges to a free-energy minimum of the protein. Despite its strengths, the conventional KCM framework demands an excessive number of iterations to reach folded protein conformations, with each iteration requiring costly computations of interatomic force fields. To address these limitations, this work introduces a family of sign gradient descent (SGD) algorithms for predicting folded protein structures. Unlike the heuristic-based iterations of the conventional KCM framework, the proposed SGD algorithms rely on the sign of the free-energy gradient to guide the kinetostatic folding process. Owing to their faster and more robust convergence, the proposed SGD-based algorithms reduce the computational burden of interatomic force field evaluations required to reach folded conformations. Their effectiveness is demonstrated through numerical simulations of KCM-based folding of protein backbone chains.

1. Introduction

Protein-based nanorobotic devices are capable of exerting precise forces, manipulating microscopic objects, and transducing nanoscale information into measurable signals [1,2,3,4]. Nanorobots can be built by assembling distinct protein-based components into integrated devices that operate with multiple degrees of freedom. In these systems, proteins and DNA molecules function as joints, motors, transmission elements, and sensors, enabling precise, repeatable forces, motions, and signals. For example, viral protein linear motors produce consistent motion patterns in response to controlled changes in the pH of the environment [5,6]. As another example, modular DNA-origami nanocompartments can serve as programmable chambers that actively unfold proteins, with the folding state functioning as their key regulatory signal [7]. Another promising application of nanomotors lies in the design of probes for precision imaging (see, e.g., [8]).

1.1. Protein Folding Simulations in Nanorobotics

In designing protein-based nanorobots, protein folding numerical simulations play a central role in predicting final folded structures and capturing transient conformations during folding and unfolding. Beyond structural determination, such simulations can quantitatively assess the mobility of protein-based nano-mechanisms. As demonstrated in [9], numerical simulations can characterize the range and patterns of motion of protein-based bio-springs within a class of parallel nano-mechanisms with upward and backward movements regulated by linear myosin actuators. Protein folding simulations have also been utilized to simulate the motion of cyclic protein-based mechanisms under external electric field perturbations [10,11,12,13].
Beyond estimating the range of motion of protein-based nanodevices, interactive molecular simulations such as ligand–protein docking [14] and human-guided protein/RNA folding simulators [15] rely on accurate protein folding simulations. Enhanced by virtual reality (VR) technology, these platforms allow researchers to design and characterize both individual nanorobotic components and complex assemblies through immersive 3D visualization and haptic force feedback. Haptic devices, in particular, provide operators with a tactile sense of molecular forces during flexible ligand–protein docking and RNA/protein folding [16]. In addition to visual and tactile interaction, the incorporation of sonification and spatial perception of simulation data offers a further sensory dimension to the VR-based protein folding experience [17,18].

1.2. Knowledge-Based and Physics-Based Algorithms for Protein Folding

Knowledge-based algorithms are among the most common methods for protein folding simulations. Relying on pattern recognition and machine learning, these methods leverage amino acid sequences together with extensive databases of known structures to deduce 3D protein conformations. Artificial intelligence-based tools, with Google’s AlphaFold 2 as the leading example [19], have achieved remarkable success in identifying the most probable folded configurations of proteins. However, knowledge-based approaches face important limitations: they cannot adequately capture protein–nucleic acid interactions, account for molecular stability, or incorporate the kinetics of the folding process. Moreover, although highly effective at predicting final structures, these techniques provide little insight into the folding pathway itself [20].
Unlike knowledge-based approaches, physics-based methods constitute a distinct class of techniques that rely on fundamental physical principles to simulate the folding process and compute both transient and folded three-dimensional (3D) protein structures [21,22]. To improve the accuracy of these simulations, physics-based numerical methods can be coupled with advanced optimization techniques, including those derived from optimal control theory [23] and homotopy methods [24]. Although such approaches provide reliable insights into intermediate structural states during folding, they are computationally intensive and demand specialized hardware and considerable processing time [25].

1.3. Kinetostatic Compliance Method for Protein Folding

To mitigate the computational demands of physics-based algorithms, the kinetostatic compliance method (KCM) was introduced for protein folding simulations [26,27,28,29]. KCM treats proteins as nanomechanical systems composed of rigid peptide plane linkages (Figure 1). Instead of relying on computationally prohibitive all-atom molecular dynamics, it kinetostatically adjusts protein dihedral angles. These angles, which determine the relative orientations of peptide planes, are iteratively modulated under nonlinear interatomic force fields and guided toward a folded conformation with an energetically favorable state.
KCM for protein folding simulations has been employed for designing peptide-based nanorobots [10,11,12]. Notably, the KCM framework can be leveraged to simulate the movement range of closed-loop cyclic nanorobots under external electric fields [12].
Although the KCM is computationally advantageous for the design of protein-based nanorobots, it has so far relied on iterative kinetostatic fold compliance [26,27]. In this approach, structural updates of the protein molecule are obtained through heuristic-based iteration rules. However, the conventional method typically demands significantly more iterations, which involve intensive computations of interatomic force fields.
In this paper, to overcome the shortcomings of the conventional KCM iteration, we propose a family of sign gradient descent (SGD) iterative algorithms for kinetostatic protein folding as a substitute for the established method in KCM literature. In our proposed SGD-based framework, the sign of the free energy gradient in the protein configuration space becomes key in guiding the kinetostatic folding of protein molecules. Characterized by their reliance on gradient sign information, SGD algorithms represent a class of first-order optimization techniques originally conceived for training artificial neural networks [30,31]. These algorithms are known for their numerical stability as well as solid convergence attributes [32,33]. Consequently, the presented SGD algorithms in this paper enjoy formal guarantees of convergence and robustness. Monte Carlo methods provide a powerful approach for exploring the conformational energy landscape through probabilistic sampling. Their stochastic nature enables broad and unbiased exploration, which can capture diverse configurations of disordered proteins and escaping local minima in complex energy landscapes (see, e.g., [34]). In contrast, the proposed SGD-based kinetostatic framework complements these methods by performing a deterministic, gradient-driven descent toward a minimum-energy conformation using the sign of the free-energy gradient.

1.4. Contributions of the Paper

This work introduces a novel class of sign gradient descent (SGD) algorithms for kinetostatic protein folding simulations. Our proposed algorithms diverge from the established practice in kinetostatic folding simulations of using a heuristic fold compliance scheme with no formal guarantees of convergence [26]. Thanks to the strengths of SGD algorithms, our proposed kinetostatic protein folding iterations enjoy formal guarantees of robust convergence to minimum energy configurations (i.e., the folded protein molecule conformations). Because of their convergence and stability features, our SGD-based algorithms require a lesser number of iterations for convergence and hence ease the computational burden resulting from the interatomic force field calculations. Furthermore, owing to the structural similarity between protein kinematics in the KCM framework and robotic manipulators (Figure 1), our approach is naturally applicable to robotics problems that employ gradient-based methods such as task-oriented robot control [35], collaborative 3D localization [36], and gradient-informed path smoothing [37], among others.
A preliminary version of this work was presented in the conference publication [38]. The present paper substantially extends those results in three major directions. First, it introduces an adaptive, conformation-dependent SGD algorithm in which the step size is dynamically adapted to the current protein conformation, in contrast to the earlier version that adjusted step size only with iteration count. This improvement yields a hybrid gradient descent law that preserves both the sign and magnitude information of the folding torque vector. Second, the paper offers a more comprehensive analysis of convergence and robustness, which was only partially addressed in the earlier conference work. The results in this paper provide sufficient conditions under which the SGD-based iterations are guaranteed to reach folded states. Third, this paper provides more extensive numerical validation by presenting larger and more systematic simulation studies that demonstrate the computational efficiency and scalability of the proposed algorithms.
The proposed framework achieves faster convergence and enhanced stability, making it especially well-suited for capturing the complex motions that characterize peptide-based nano-robotic devices. In particular, it can directly contribute to the active line of research in nanorobotics led by Ilieş and Kazerounian (see, e.g., [12,13]). One noteworthy example is the development of 7-R closed-loop one-degree-of-freedom spatial mechanisms based on cyclic peptides. Unlike macro-scale robots that rely on traditional prismatic or revolute actuators, these nanoscale kinematic chains exploit molecular geometry to achieve controlled, repeatable motion through predictable dihedral-angle variations. The SGD-based folding framework introduced in this work provides an efficient computational tool for predicting and refining the equilibrium configurations of such nano-actuators by modeling the influence of external stimuli, such as perturbations in electric force fields.
The remainder of this paper is structured as follows. Section 2 introduces the nano-kinematic representation of protein molecules and outlines the foundations of the KCM framework for simulating their kinetostatic folding. Section 3 and Section 4 present the core algorithms and their convergence analysis, respectively. Section 5 presents the numerical simulations, and Section 6 concludes with a discussion of future research directions.

2. Background

In this section, we briefly review the KCM framework. To avoid ambiguity, we clarify the terminology: in biochemistry, the term “conformation” is conventionally used to describe the spatial arrangement of a protein’s kinematic structure, whereas in robot kinematics the term “configuration” is more common. Throughout this work, “conformation” and “configuration” are treated as interchangeable.

2.1. The Protein Backbone Nano-Kinematics

Proteins are macromolecules distinguished by their structural complexity and dynamic behavior. They consist of extended chains of peptide planes connected through chemical bonds. A protein’s functionality is inherently tied to its 3D conformation, which is dictated by the linear arrangement of polypeptide chain amino acids. For clarity and focus, the present discussion is restricted to the protein backbone.
Figure 1 depicts the protein polypeptide backbone. As shown, each peptide plane comprises six covalently bonded atoms that lie within a single plane. The assumption of coplanarity is strongly supported by X-ray crystallography [39]. This structural property provides the basis for modeling proteins as complex nano-mechanisms with many degrees of freedom [26,29]. In this view, peptide planes act as the rigid building blocks of the kinematic chain and serve as nanoscale linkages in the molecular nano-mechanism.
The rotations of nano-linkages within the protein kinematic chain are enabled by the central carbon atoms, denoted C α and commonly referred to as alpha-carbons (Figure 1). These C α atoms act as hinges, connecting adjacent peptide planes. At the termini of the chain, the C α atoms are linked to an N-terminus at one end and a C-terminus at the other.
The nano-kinematic architecture of a peptide backbone consisting of N 1 peptide planes/nano-linkages is fully characterized by the length of its chemical bonds together with the associated pairs of dihedral angles. For the ith peptide plane, there exist two distinct rotational degrees of freedom. The first corresponds to torsional motion about the covalent bond connecting the amide nitrogen atom (N) to the central alpha carbon atom ( C α ), and this degree of freedom is represented by the dihedral angle θ 2 i + 1 . The second arises from rotation about the covalent bond linking the central alpha carbon atom ( C α ) to the carbonyl carbon atom (C), which is similarly characterized by the dihedral angle θ 2 i + 2 . Together, these two rotational axes capture the essential kinematic variability of the peptide plane, as illustrated in Figure 1. Collecting all such variables defines the vector
θ : = θ 1 , , θ 2 N Q : = S 1 × × S 1 ,
which specifies the full configuration of the backbone chain. The configuration space Q is a torus formed by the Cartesian product of 2 N unit circles S 1 .
Each dihedral angle θ i in the configuration vector θ given by (1) corresponds to one degree of freedom (DOF). It is possible to define a unit vector u j , 1 j 2 N for each DOF. Each unit vector aligns with the axis of its corresponding rotational motion. Figure 1 illustrates the unit vectors u 2 i and u 2 i + 1 within the ith peptide plane (Figure 1). These vectors capture the directions of the bonds connecting the alpha carbon to the carbonyl carbon and to the nitrogen atom, respectively. These vectors define the primary axes of rotation for each segment of the backbone chain. Finally, the vectors u 1 and u 2 N are reserved to represent the special unit directions associated with the termini of the backbone chain. Specifically, u 1 corresponds to the orientation at N-terminus, which marks the starting end of the polypeptide chain. In contrast, u 2 N designates the orientation at C-terminus, which defines the terminating end of the chain.
While the dihedral angles specify the rotational DOFs along the kinematic chain, they are not by themselves sufficient to compute the rigid peptide linkage orientations. To capture this missing geometric information, the relative spatial arrangement of the coplanar atoms are represented using the body vectors within each peptide plane, denoted by b j for 1 j 2 N . Using these body vectors, the position of any atom in a given plane can be expressed relative to another atom of the same plane through a linear combination of the form 1 m b 2 i + 2 m b 2 i + 1 , where the coefficients 1 m and 2 m , for 1 m 4 , are fixed atom-specific constants that remain consistent throughout the peptide chain [26,27]. Taken together, the body vectors b j and the unit vectors u j yield a complete mathematical representation of the protein’s conformation, expressed entirely in terms of the underlying dihedral angle variables.
Rotation matrices can be used to map the conformation vector θ to the 3D configuration of the protein’s kinematic structure [26]. Specifically, a reference configuration, given by θ = 0 , together with the corresponding reference unit vectors u j 0 and reference body vectors b j 0 for 1 j 2 N , are used in
u j ( θ ) = r = 1 j R ( θ j , u j 0 ) u j 0 , b j ( θ ) = r = 1 j R ( θ j , u j 0 ) b j 0 ,
where R ( θ j , u j 0 ) , 1 j 2 N , is the rotation matrix about the reference axis u j 0 .
To compute the peptide atom Cartesian positions, the N-terminal atom is anchored at the origin and the body vectors b j ( θ ) from (2) are utilized in the following way. Letting i = 2 k 1 and i = 2 k correspond to the nitrogen and C α atoms of the kth peptide plane, the position vectors are given by
p i ( θ ) = b 1 ( θ ) + + b i ( θ ) , 1 i 2 N 1 .

2.2. Kinetostatic Compliance Folding

The KCM framework builds on the well-established experimental observation that protein folding dynamics can be effectively captured while neglecting inertial forces (see, e.g., [23]). The protein chain in KCM is represented as a collection of kinematic linkages that deform in response to the kinetostatic effects of nonlinear force fields (Figure 1).
Consider a peptide chain composed of N a atoms arranged into N 1 peptide planes. The conformational state of this chain is fully described by the dihedral angle vector θ , introduced in Equation (1). For any two atoms, a i and a j , their Cartesian coordinate vectors are denoted by p i ( θ ) and p j ( θ ) , respectively. The Euclidean distance between them is given by p i j ( θ ) : = | p i ( θ ) p j ( θ ) | 2 . The physical parameters necessary for evaluating the interatomic forces are atomic charges, van der Waals radii, equilibrium bond lengths, potential well depths, and interaction weight factors. The value of these parameters is detailed in [27] and related references.
In KCM, the total free energy governing protein folding, denoted E ( θ ) , is expressed as the sum of electrostatic and van der Waals contributions:
E ( θ ) : = E elec ( θ ) + E vdw ( θ ) ,
where E elec ( θ ) is the electrostatic potential energy, and E vdw ( θ ) is the van der Waals interaction energy (see, e.g., [26]). The forces on each atom a i , 1 i N a , which are induced by E elec ( θ ) and E vdw ( θ ) , are equal to the negative gradients of the corresponding energy terms: F i elec ( θ ) = p i E elec and F i vdw ( θ ) = p i E vdw .
Within the KCM framework, the first step involves evaluating the net forces and torques exerted on each of the N 1 peptide planes of the protein molecule [26]. These contributions are collected into a 6 N -dimensional vector G ( θ ) , referred to as the generalized force vector, which encapsulates the overall influence driving the folding process. In order for these forces to induce conformational changes, the generalized force vector G ( θ ) is projected onto an equivalent 2 N -dimensional torque vector τ ( θ ) . This projection effectively converts the generalized forces into torsional actions on the dihedral angles, which are the primary variables controlling kinetostatic protein folding.
The kinetostatic folding torque vector is given by
τ ( θ ) = T ( θ ) G ( θ ) ,
where T ( θ ) R 6 N × 2 N is the molecular chain Jacobian at conformation θ . As described in [26], the matrix T ( θ ) functions as the intermediary between the generalized forces and the torsional dynamics of the protein. Specifically, it maps the 6 N -dimensional generalized force vector onto the reduced 2 N -dimensional space of dihedral torques.
The conformational changes in dihedral angles occur under the influence of τ ( θ ) , which aligns with the steepest-descent direction of the total free energy E ( θ ) . This folding torque vector thus guides the backbone through the conformational landscape.

3. SGD-Based Kinetostatic Protein Folding Iteration Algorithms

In this section, we present the main SGD-based algorithms of the paper. To set the stage, we first review the conventional KCM iteration in Section 3.1. We then introduce two novel SGD-based kinetostatic fold compliance algorithms, each employing a distinct step-size update rule, in Section 3.2 and Section 3.3. Finally, in Section 4, we analyze the convergence and robustness properties of these SGD-based algorithms.

3.1. Conventional KCM Iteration Algorithm

Each local minimum θ * of the protein total free energy function E ( θ ) given by (4) is characterized by a torque vector that vanishes at the folded conformation, i.e., τ ( θ * ) = 0 . This condition represents a state of kinetostatic equilibrium: at a folded conformation, the absence of net torque signifies that the internal molecular forces are perfectly balanced. Moreover, the folding torque vector is directed along the steepest descent of the free energy gradient within the conformational landscape.
In the conventional KCM framework [26], the evolution of dihedral angles is governed kinetostatically by the folding torque vector induced by interatomic force fields. Starting from an initial unfolded conformation θ 0 , the kinetostatic compliance method prescribes a numerical update law that relates the applied torques to incremental changes in dihedral angles [26]. Given a conformation θ of a protein molecule, one can compute the folding torque vector τ ( θ ) in the direction of steepest descent on the free energy landscape. However, as established in the KCM literature [26], replacing this vector with its normalized counterpart leads to better performance. In particular, one can utilize the difference equation
θ k + 1 = θ k + κ 0 τ ( θ k ) | τ ( θ k ) | , k Z 0 + ,
where Z 0 + represents the set of non-negative integers, the -norm of the torque vector is defined as | τ ( θ k ) | : = max 1 i 2 N | τ i ( θ k ) | , and the torque itself is computed from Equation (5).
Algorithm 1 summarizes the successive kinetostatic folding iteration. At each protein conformation, the most computationally demanding step corresponds to the evaluation of electrostatic and van der Waals forces. When exact interatomic force calculations are required, the complexity of this step for a molecule with N a atoms scales quadratically, i.e., O ( N a 2 ) [26]. Faster convergence rates to the protein folded conformation result in a smaller number of required interatomic force field calculations. Accordingly, devising efficient numerical algorithms for computing the variation of dihedral angles can reduce the required number of such intensive calculations.
Algorithm 1 Conventional Kinetostatic Folding Iteration Algorithm
  1:
Input: Initial dihedral angle vector θ 0 , step size κ 0 > 0 , convergence tolerance ε tol , max iterations K max
  2:
Output: Foldedconformation dihedral angle vector θ *
  3:
k 0
  4:
while   | τ ( θ k ) | 2 < ε tol and k < K max  do
  5:
   Compute the generalized force vector G ( θ k ) .
  6:
   Compute the protein chain Jacobian T ( θ k ) .
  7:
   Compute the folding torque vector τ ( θ k ) T ( θ k ) G ( θ k ) .
  8:
   Update the protein conformation using the difference equation in (6): θ k + 1 θ k + κ 0 τ ( θ k ) | τ ( θ k ) | .
  9:
    k k + 1
10:
end while
11:
return   θ * θ k
In Algorithm 1, the normalized torque vector τ ( θ k ) | τ ( θ k ) | governs the incremental updates at each conformation θ k according to the difference equation in (6). The heuristically optimized positive constant κ 0 is chosen sufficiently small to prevent extensive angle variations. As can be seen from Algorithm 1, the iterative scheme described in (6) is carried out until the total free energy reaches the vicinity of a local minimum on the underlying free-energy landscape. In practical implementations, this condition of convergence is assessed not by monitoring the energy directly, but rather by examining the magnitude of the kinetostatic folding torque vector. Specifically, the iteration converges when the Euclidean norm of τ ( θ k ) falls below a desired tolerance threshold ε tol > 0 , that is, when | τ ( θ k ) | 2 < ε tol (see Section 4 for further details).

3.2. SGD-Based Algorithm with Conformation-Independent Step Size Update Rule

As can be seen from Algorithm 1, the normalized kinetostatic folding torque vector is obtained by dividing each element of τ ( θ ) by the vector’s infinity norm | τ ( θ k ) | : = max 1 i 2 N | τ i ( θ k ) | , i.e.,
τ ( θ ) | τ ( θ ) | : = τ 1 ( θ ) | τ ( θ ) | , , τ 2 N ( θ ) | τ ( θ ) | .
Using this normalized vector for iteratively updating the protein molecule’s conformation improves both stability and convergence rate compared to the original torque vector. Indeed, normalizing the torque vector confines each normalized torque component within the interval [ 1 , 1 ] , namely, τ i ( θ k ) | τ ( θ k ) | [ 1 , 1 ] for 1 i 2 N .
A key insight from  (7) is that the kinetostatic fold compliance difference equation in (6), which is used in Algorithm 1, does not depend on the magnitude of the torque vector. Motivated by this observation, we propose an alternative to the heuristic strategy in Algorithm 1. Specifically, drawing on the established literature on sign gradient descent optimization [32], we introduce a novel SGD-based successive kinetostatic fold compliance algorithm, which exploits only the directional information encoded in the sign of the torque vector components.
In particular, we propose the following SGD-based difference equation
θ k + 1 = θ k + κ k sgn τ ( θ k ) , k Z 0 + ,
where κ k is a step size that gets updated in every iteration and
sgn τ ( θ k ) : = sgn ( τ 1 ( x ) ) , , sgn ( τ 2 N ( x ) ) .
Note that the sign function for a scalar x, denoted as sgn ( x ) , is defined according to sgn ( x ) = 1 if x < 0 , sgn ( x ) = 0 if x = 0 , and sgn ( x ) = 1 if x > 0 . This ensures that the sign operator compactly encodes the directional information of each torque component, while ignoring its exact magnitude, thereby guiding the iterative update process in a directionally consistent manner.
Remark 1.
Analyzing the two kinetostatic fold compliance difference equations given by (6) and (8) reveals a key distinction: while the conventional approach in (6) leverages the normalized torque vector, i.e., τ ( θ k ) | τ ( θ k ) | , the proposed SGD-based difference equation in (8) utilizes the function sgn ( · ) . This function operates solely on the torque vector’s sign, i.e., sgn τ ( θ k ) . Furthermore, the step size κ 0 in (6) remains fixed, whereas the SGD-based difference equation employs a dynamically changing step size κ k as discussed in what follows.
There are a variety of conventional methods for updating the step size κ k at each iteration of SGD-based kinetostatic folding. Specifically, according to the SGD optimization literature [32], the step size κ k at each step of the iteration can be adjusted according to
κ k + 1 = h ( κ k ) , k Z 0 + ,
where the inequality h ( κ k ) < κ k should hold for every positive κ k . The choice of the step size mapping h ( · ) in (9) offers significant flexibility. Several established methods can be employed for this purpose, including the step size adaptation rule proposed by Moulay et al. [32]. Specifically, the following step size update rule
h ( κ k ) = γ 0 κ k , k Z 0 + ,
where γ 0 ( 0 , 1 ) is a positive constant, leads to the step size sequence
κ k k Z 0 + = κ 0 · ( γ 0 ) k k Z 0 + .
Notably, when γ 0 = 0.5 , this rule coincides with the well-known DICHO algorithm [32].
Using the difference equation given by (8) and the step size update rule in (10), we arrive at Algorithm 2, which relies on the directional information of each component in the folding torque vector. According to the SGD optimization literature (see, e.g., [32]), the parameter κ 0 in the step size update rule given by (10) governs the size of the explored protein conformation landscape for finding folded structures. A larger κ 0 value expands the investigated conformation landscape without significantly affecting the convergence rate of the SGD folding iteration. Additionally, the parameter γ 0 determines the precision by which the protein conformation landscape will be explored. Moreover, the closer γ 0 to 1, the faster the speed of convergence to folded protein conformations.
The SGD step size update rule in Algorithm 2 does not utilize the information contained within the protein molecule conformation for varying the step sizes in each iteration. In the next section, we present an adaptive SGD-based algorithm, which utilizes a conformation-dependent step size update rule.
Algorithm 2 SGD-Based Kinetostatic Folding Iteration Algorithm with Conformation-Independent Step Size Update Rule
  1:
Input: Initial dihedral angle vector θ 0 , initial step size κ 0 > 0 , γ 0 ( 0 , 1 ) , convergence tolerance ε tol , max iterations K max
  2:
Output: Folded conformation dihedral angle vector θ *
  3:
k 0
  4:
while   | τ ( θ k ) | 2 < ε tol and k < K max  do
  5:
   Compute the generalized force vector G ( θ k ) .
  6:
   Compute the protein chain Jacobian T ( θ k ) .
  7:
   Compute the folding torque vector τ ( θ k ) T ( θ k ) G ( θ k ) .
  8:
   Update the protein conformation using the difference equation in (8): θ k + 1 θ k + κ k sgn ( τ ( θ k ) ) .
  9:
   Update the step size κ k + 1 κ 0 · ( γ 0 ) k .
10:
    k k + 1
11:
end while
12:
return   θ * θ k

3.3. SGD-Based Kinetostatic Folding Iteration Algorithm with Conformation-Dependent Step Size Update Rule

To encode the protein molecule conformation within the step sizes, we consider the following kinetostatic folding difference equation
θ k + 1 = θ k + κ k θ k sgn τ ( θ k ) , k Z 0 + ,
where κ k θ k is the conformation-dependent step size in the kth iteration.
Following the SGD optimization literature (see, e.g., [32]), we propose the conformation-dependent step size update rule
κ k θ k = κ k , 1 | τ ( θ k ) | 2 + κ k , 2 , k Z 0 + ,
where κ k , 1 0 , κ k , 2 > 0 are design parameters (to be discussed later in this section), and | τ ( θ k ) | 2 is the Euclidean norm of the kinetostatic folding torque vector at conformation θ k .
To update the parameter κ k , 1 , which modulates | τ ( θ k ) | 2 at each iteration of kinetostatic folding, it is possible to choose a positive constant value, namely, κ k , 1 = ζ 0 . Furthermore, similar to (10), the parameter κ k , 2 can be updated according to the rule κ k + 1 , 2 = h ( κ k , 2 ) , in which the discrete-time mapping h ( · ) is given by
h ( κ k , 2 ) = γ 0 κ k , 2 , k Z 0 + ,
where γ 0 ( 0 , 1 ) is a positive constant. Using these choices for the design parameters κ k , 1 and κ k , 2 , we arrive at the conformation-dependent step size update rule
κ k θ k = ζ 0 | τ ( θ k ) | 2 + κ 0 , 2 γ 0 k , k Z 0 + .
The SGD-based scheme in Algorithm 3 under the adaptive step-size update rule (15) takes the form of a hybrid gradient descent law, where both the kinetostatic folding torque vector and its sign simultaneously govern the evolution of the dihedral angle vector at each iteration. In particular, we have the identity τ θ k = | τ ( θ k ) | 2 sgn τ ( θ k ) . Using this identity along with (15) in (12), we arrive at the following difference equation
θ k + 1 = θ k + ζ 0 τ θ k + κ 0 , 2 γ 0 k sgn τ ( θ k ) , k Z 0 + .
Remark 2.
An intriguing feature of adopting the conformation-dependent step size update rule (15) is the resulting iteration in (16), where the kinetostatic folding torque vector plays a direct role in governing the dihedral angle evolution. Thanks to the conformation-dependent step size update rule (15), the information contained in | τ ( θ k ) | 2 will be preserved. This is in direct contrast with the conventional KCM difference equation in (6) and the conformation-independent SGD difference equation in (8), where the information about the magnitude of the folding torque vector is lost.
Using the difference equation given by (12) and the adaptive step-size update rule in (15), we arrive at Algorithm 3, which relies on the directional information of each component in the folding torque vector. In contrast to (9), the step size mapping given by (15) not only depends on the iteration number (i.e., k) but also on the conformation of the protein molecule at each iteration (i.e., θ k ). When κ k , 1 = 0 in (13), Algorithm 2 will be retrieved. In other words, the step size mapping given by (9) can be considered as a special case of (13), where the protein conformation does not affect the step size in each iteration. In the next section, we discuss the convergence properties of Algorithms 2 and 3.
Algorithm 3 SGD-Based Kinetostatic Folding Iteration Algorithm with Conformation-Dependent Step Size Update Rule
  1:
Input: Initial dihedral angle vector θ 0 , initial step size adaptation parameters γ 0 ( 0 , 1 ) , ζ 0 > 0 , and κ 0 , 2 > 0 , convergence tolerance ε tol , max iterations K max
  2:
Output: Folded conformation dihedral angle vector θ *
  3:
k 0
  4:
while   | τ ( θ k ) | 2 < ε tol and k < K max  do
  5:
   Compute the generalized force vector G ( θ k ) .
  6:
   Compute the protein chain Jacobian T ( θ k ) .
  7:
   Compute the folding torque vector τ ( θ k ) T ( θ k ) G ( θ k ) .
  8:
   Adapt the step size κ k θ k ζ 0 | τ ( θ k ) | 2 + κ 0 , 2 γ 0 k .
  9:
   Update the protein conformation using the difference equation in (12): θ k + 1 θ k + κ k θ k sgn τ ( θ k ) .
10:
    k k + 1
11:
end while
12:
return   θ * θ k

4. Convergence Properties of the SGD-Based Kinetostatic Folding Iteration Algorithms

In this section, we analyze the convergence and robustness properties of Algorithms 2 and 3 presented in the previous section. We remark that although the conventional method given in Algorithm 1 has long dominated the KCM literature [26], it lacks formal convergence guarantees.

4.1. Convergence Formalization

We begin by formalizing the notion of convergence to a folded conformation. As specified in Equation (5), the kinetostatic torque vector τ ( θ ) is aligned with the steepest-descent direction of the free-energy gradient within the protein’s conformational landscape. Given an initial unfolded configuration θ 0 located in a neighborhood of the native folded state θ * , we say that the difference equations in (8) and (12) converge if
lim k θ k = θ * .
The convergence condition in (17) can be expressed in terms of folding torque vectors as follows. At any folded conformation θ * , the kinetostatic folding torque vector necessarily vanishes, i.e., τ ( θ * ) = 0 . Accordingly, convergence of the two difference equations given by (8) and (12) can be checked numerically when the Euclidean norm of the kinetostatic folding torque vector falls below a predefined tolerance ε tol > 0 , i.e., when the criterion
| τ ( θ k * ) | 2 < ε tol ,
is met for some positive integer k * . This criterion underlies the stopping conditions in SGD-based Algorithms 2 and 3. Smaller values of k * correspond to faster convergence toward the folded protein conformation. The integer k * also specifies the number of interatomic force-field evaluations required to achieve convergence. Since the most computationally demanding aspect of protein folding simulations lies in computing electrostatic and van der Waals interactions, kinetostatic iterations with faster convergence (i.e., smaller k * ) impose a significantly lighter computational burden.

4.2. Conditions for Convergence of SGD-Based Algorithms

The convergence of the iterative SGD-based schemes in Algorithms 2 and 3 (i.e., the guarantee that lim k θ k = θ * ) can be understood by noting that, in the vicinity of folded conformations, the free energy landscape of protein molecules is well approximated by quadratic energy functions of the type used in simple elastic network models (see, e.g., [40,41]). In this regime, the aggregate free energy E ( θ ) given in (4) effectively reduces to a quadratic function of the dihedral angle configuration vector around a minimum-energy folded state.
By approximating the free energy of a protein molecule with a quadratic function in a neighborhood of a folded conformation θ * , one can directly invoke a convergence result from the SGD literature. In particular, ref. [32] (Theorem 1) guarantees asymptotic convergence to folded conformations, provided that θ * is an isolated local minimum of E ( θ ) and that the positivity condition ( θ * θ k ) sgn ( τ ( θ k ) ) > 0 is satisfied within an open neighborhood of θ * . Under the stated positivity condition, asymptotic convergence of Algorithms 2 and 3 is guaranteed if the following conditions are satisfied:
C1
0 < κ k < 2 ( θ * θ k ) sgn ( τ ( θ k ) ) for all k Z 0 + ;
C2
κ k ( θ * θ k ) sgn ( τ ( θ k ) ) c | θ * θ k | 2 α for some positive α and c for all k Z 0 + ; and,
C3
lim k κ k = 0 .
The first condition (i.e., C1) ensures that the step size κ k is positive but not excessively large, so that each update moves the protein configuration toward the folded conformation without overshooting the stable region. The second condition (i.e., C2) imposes a lower bound on the effective progress made in each iteration, guaranteeing that updates are strong enough to reduce the distance of the protein configuration to the folded conformation in each iteration. Finally, the last condition (i.e., C3) requires the step size to vanish asymptotically, which prevents oscillations around the equilibrium and ensures convergence to the precise minimum-energy conformation.

4.3. Robustness of Convergence of SGD-Based Kinetostatic Iteration Algorithms

In practical implementations, the protein conformation vector θ k may be subject to various sources of uncertainty arising from numerical or modeling approximations. These uncertainties include finite-precision numerical errors accumulated during iterative updates, approximation errors in force-field evaluation (e.g., truncated van der Waals or electrostatic interaction cutoffs), and stochastic perturbations or deviations in dihedral angles introduced by coarse-grained molecular representations or experimentally reconstructed initial structures (such as those derived from cryo-EM or NMR data).
The iterative SGD-based schemes in Algorithms 2 and 3 also demonstrate robustness to uncertainties beyond their formal convergence guarantees. In particular, even when the conformation vector θ k at iteration k is perturbed by an uncertainty term ϵ k both algorithms still converge reliably to the folded conformations.
Consider the case where the protein conformation at the kth iteration is affected by an additive uncertainty, that is, when it is updated as θ k θ k + ϵ k . Accordingly, the difference equation in Algorithm 2 under this uncertainty takes the form
θ k + 1 = θ k + ϵ k + κ k sgn τ ( θ k + ϵ k ) , k Z 0 + .
As demonstrated in ([32], Theorem 2), the perturbed SGD difference equation in (19) still converges to the folded conformation θ * under very mild conditions on the uncertainty vector ϵ k . A similar robustness result can be stated for Algorithm 3 with conformation-dependent step size update rule. The robustness property of the proposed SGD-based algorithms ensures convergence to the folded conformation even in the presence of such perturbations, provided that the uncertainty sequence ϵ k k Z 0 + remains bounded. Specifically, bounded uncertainties do not compromise the asymptotic stability of the kinetostatic folding process since the iterates in (19) continue to approach the minimum-energy state.

5. Numerical Simulations

In this section we present numerical simulations conducted to evaluate the effectiveness of the proposed SGD-based kinetostatic folding algorithms (Algorithms 2 and 3). Their performance is further compared against the conventional kinetostatic folding algorithm (Algorithm 1). The simulations are carried out according to the principles of Protofold I [27].
In our simulations, we focus on two fundamental secondary structure motifs— α , namely, helices and β -pleated sheets, and evaluate the performance of the proposed SGD-based algorithms in driving convergence from unfolded conformations to these structures. The α -helix (Figure 2) is one of the most common motifs in proteins, formed by a tightly coiled amino acid chain that adopts a right-handed helical conformation [39]. Likewise, the β -pleated sheet (Figure 2) represents another essential structural element, playing a key role in protein folding and stability through its ability to induce significant polypeptide chain reorientation [42].

5.1. Simulations for Convergence to α -Helix Structures

Our simulations for convergence to α -helix structures (Figure 2) were conducted on two protein backbone chains with 20-dimensional and 40-dimensional dihedral angle spaces, respectively. The numerical simulations were performed using the conventional kinetostatic folding algorithm (Algorithm 1) alongside the two SGD-based alternative schemes (Algorithms 2 and 3). The corresponding parameter values are summarized in Table 1. To ensure a fair comparison, the fixed step size κ 0 in the conventional algorithm was chosen to match the initial step size κ 0 in Algorithm 2 and the parameter κ 0 , 2 in the adaptive, conformation-dependent SGD algorithm (Algorithm 3). Similarly, the step size update weight γ 0 in Algorithm 2 was set equal to the parameter γ 0 used in Algorithm 3.
To ensure consistent initial conditions across the three algorithms, all simulations began with the same pre-coiled backbone chain, positioned near an α -helix configuration. Figure 3 and Figure 4 depict the free energy evolution of the peptide backbone under the conventional kinetostatic folding algorithm (Algorithm 1), the SGD-based algorithm with a conformation-independent step size update rule (Algorithm 2), and the SGD-based algorithm with a conformation-dependent step size update rule (Algorithm 3), each starting from this identical conformation (also shown in the figures). The figures further illustrate the protein configurations obtained at the 30th, 60th, 180th, and 540th iterations.
As can be seen from Figure 3 and Figure 4, the conventional kinetostatic folding algorithm achieve a slower convergence rate to the folded α -helix configuration. Slower convergence rates as discussed earlier necessitate increased iterations for convergence to minima of the free energy of protein molecules implying an increased computational burden on folding simulations. Remarkably, the slower convergence becomes more nuanced as the number of the peptide planes start increasing. Finally, we remark that unlike the conference version of this paper [38], which used a fixed decay factor for the step size optimized only for α -helix convergence, the present work employs an adaptive, conformation-dependent step-size rule (Algorithm 3) with a stopping criterion tied to the torque norm threshold ( | τ ( θ ) | 2 < ε tol ). This modification results in smoother convergence with fewer oscillations and naturally leads to different iteration counts across structures.
Table 2 summarizes the runtime comparison between the conventional KCM framework and the proposed SGD-based algorithms for α -helix folding simulations. Because each iteration entails a complete interatomic force evaluation, the reduction in iteration count leads proportionally to a lower computational cost and shorter overall runtime.

5.2. Simulations for Convergence to β -Pleated Sheet Structures

To examine convergence toward β -pleated sheet structures (Figure 2), we performed simulations on a protein backbone chain characterized by a 20-dimensional dihedral angle space (see Table 3). Interestingly, compact β -sheet motifs of this type have also been identified in small proteins such as chignolin, which is a synthetic mini-protein consisting of only 10 amino acids. This protein nonetheless exhibits stable β -hairpin folding (see, e.g., [43]).
To ensure consistent initial conditions across the three algorithms, all simulations began with the same configuration in the proximity of a β -pleated sheet motif. Figure 3 and Figure 4 depict the free energy evolution of the peptide backbone under the conventional kinetostatic folding algorithm (Algorithm 1), the SGD-based algorithm with a conformation-independent step size update (Algorithm 2), and the SGD-based algorithm with a conformation-dependent step size update (Algorithm 3), each starting from this identical conformation (also shown in the figures). The figures further illustrate the protein configurations obtained at the 30th, 60th, 180th, and 540th iterations for all three algorithms.
Closer scrutiny of Figure 5 reveals a slower convergence rate toward the β -pleated sheet configuration for the conventional kinetostatic folding algorithm (Algorithm 1) compared to its SGD-based counterparts (Algorithms 2 and 3). As previously discussed, this translates to a requirement for a greater number of iterations to reach the free energy minima of protein molecules, leading to a heightened computational demand for kinetostatic folding simulations. Additionally, as can be seen from Figure 5a, the free energy oscillates noticeably around the folded configuration, further highlighting the inefficiency of the conventional kinetostatic folding algorithm. Conversely, the free energy profiles under the SGD-based algorithms exhibit monotonic convergence towards a lower minimum within a shorter timeframe. As can be seen from the free energy profiles in Figure 5b,c, the SGD-based algorithm with adaptive conformation-dependent step size update rule provides a relatively faster convergence rate. Table 4 summarizes the runtime comparison between the conventional KCM framework and the proposed SGD-based algorithms for β -hairpin folding simulations. Because each iteration entails a complete interatomic force evaluation, the reduction in iteration count leads proportionally to a lower computational cost and shorter overall runtime.

5.3. Discussion

Building on the simulation results presented in Section 5.1 and Section 5.2, this section discusses how the computational performance of the proposed SGD-based algorithms scales with increasing protein size and discusses the observed trends in iteration count and runtime efficiency.
The computational cost of the proposed SGD-based kinetostatic folding algorithms is dominated by the evaluation of interatomic force fields, including electrostatic and van der Waals interactions. Similar to the conventional KCM formulation, the per-iteration complexity scales as O ( N a 2 ) with respect to the number of atoms N a , since each iteration requires computing all pairwise atom–atom interactions. The key advantage of the SGD-based framework lies in its ability to significantly reduce the number of iterations required to reach convergence. As a result, the total computational complexity improves from O ( K conv N a 2 ) for the conventional algorithm to O ( K SGD N a 2 ) for the proposed approach, where K SGD < K conv . The runtime and convergence comparisons presented in Table 2 and Table 4 demonstrate a reduction of approximately 34– 63 % in the total number of iterations (and hence in force-field evaluations), depending on the protein size. Consequently, the proposed algorithms offer a scalable and computationally efficient framework for simulating larger protein systems without modifying the underlying kinetostatic model.
Finally, we conclude this section by noting that the transient performance gap between Algorithms 2 and 3 arises from the conformation-dependent term ζ 0 | τ ( θ k ) | 2 in Algorithm 3, which preserves magnitude information during descent. Consequently, Algorithm 3 exhibits a more monotonic convergence behavior compared to the purely sign-based updates of Algorithm 2.

6. Conclusions

With the rapid development of peptide-based nanorobots, the need for fast and accurate numerical folding simulations has become increasingly important. This work departs significantly from conventional successive kinetostatic fold-compliance methods traditionally used in protein folding simulations. In particular, we introduced a novel family of SGD-based kinetostatic folding algorithms as an alternative to the conventional approach. Since each iteration requires a full evaluation of interatomic forces, these improvements directly translate to reduced computational cost, as the observed decrease in iteration count proportionally lowers the overall runtime. Beyond the canonical α -helix and β -sheet motifs studied here, the proposed SGD-based kinetostatic folding framework remains applicable to proteins with mixed or multi-domain topologies. Because the algorithms operate directly in the dihedral-angle configuration space, they can accommodate arbitrarily complex backbone geometries without modification to the underlying update law. The convergence guarantees derived in this paper extend to broader classes of folding problems including topologically constrained or multi-stable protein systems.
We expect the SGD-based successive kinetostatic fold compliance framework to be a powerful tool for investigating KCM-driven protein folding under solvation and entropy constraints. The proposed SGD-based kinetostatic folding framework can also be applied to refine experimentally derived structures such as cryo-EM maps (e.g., Apoferritin) and structures obtained from X-ray crystallography or NMR. Because the method operates directly in dihedral-angle space and performs a deterministic descent on the free-energy landscape, it can be initialized from partially resolved experimental conformations and used to iteratively refine backbone configurations toward a nearby minimum-energy state. This makes the approach suitable for cryo-EM models with uncertain backbone placement or missing loop density, as it requires only an initial conformation and topology, does not rely on full landscape sampling, and can be constrained to refine local dihedral angles while preserving well-resolved structural regions. In addition, its inherent compatibility with stochastic SGD extensions (e.g., [44]) opens the door to designing new stochastic algorithms specifically tailored for kinetostatic folding simulations.

Author Contributions

Conceptualization, A.M.; methodology, A.M.; software, A.M.; formal analysis, A.M. and M.A.J.; investigation, A.M.; resources, A.M.; data curation, A.M.; writing—original draft preparation, A.M. and M.A.J.; writing—review and editing, A.M. and M.A.J.; visualization, A.M.; supervision, A.M.; project administration, A.M.; funding acquisition, A.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the National Science Foundation (NSF) through the award number CMMI-2153744.

Data Availability Statement

The datasets generated during and/or analysed during the current study are available from the corresponding author.

Conflicts of Interest

The authors declare that they have no conflict of interest.

References

  1. Neettiyath, A.; Pumera, M. Micro/Nanorobots for Advanced Light-Based Biosensing and Imaging. Adv. Funct. Mater. 2025, 35, 2415875. [Google Scholar] [CrossRef]
  2. Peng, Z.; Iwabuchi, S.; Izumi, K.; Takiguchi, S.; Yamaji, M.; Fujita, S.; Suzuki, H.; Kambara, F.; Fukasawa, G.; Cooney, A.; et al. Lipid vesicle-based molecular robots. Lab Chip 2024, 24, 996–1029. [Google Scholar] [CrossRef]
  3. Woo, S.; Saka, S.K.; Xuan, F.; Yin, P. Molecular robotic agents that survey molecular landscapes for information retrieval. Nat. Commun. 2024, 15, 3293. [Google Scholar] [CrossRef]
  4. Huang, C.M.; Kucinic, A.; Johnson, J.A.; Su, H.J.; Castro, C.E. Integrated computer-aided engineering and design for DNA assemblies. Nat. Mater. 2021, 20, 1264–1271. [Google Scholar] [CrossRef]
  5. Dubey, A.; Sharma, G.; Mavroidis, C.; Tomassone, S.M.; Nikitczuk, K.; Yarmush, M.L. Dynamics and kinematics of viral protein linear nano-actuators for bio-nano robotic systems. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), New Orleans, LA, USA, 26 April–1 May 2004; Volume 2, pp. 1628–1633. [Google Scholar] [CrossRef]
  6. Ruiz, P.A.S.; Ziebert, F.; Kulić, I.M. Physics of self-rolling viruses. Phys. Rev. E 2022, 105, 054411. [Google Scholar] [CrossRef]
  7. Huang, J.; Jaekel, A.; van den Boom, J.; Podlesainski, D.; Elnaggar, M.; Heuer-Jungemann, A.; Kaiser, M.; Meyer, H.; Saccà, B. A modular DNA origami nanocompartment for engineering a cell-free, protein unfolding and degradation pathway. Nat. Nanotechnol. 2024, 19, 1521–1531. [Google Scholar] [CrossRef]
  8. Zhang, D.; Lin, L.; Deng, C.; Osman, M.S.; Rodriguez, P.E.S.; Han, F.; Li, M.; Wang, L. Advanced Imaging Strategies Based on Intelligent Micro/Nanomotors. Cyborg Bionic Syst. 2025, 6, 384. [Google Scholar] [CrossRef] [PubMed]
  9. Hamdi, M.; Ferreira, A. Multiscale design and modeling of protein-based nanomechanisms for nanorobotics. Int. J. Robot. Res. 2009, 28, 436–449. [Google Scholar] [CrossRef]
  10. Chorsi, M.T.; Tavousi, P.; Mundrane, C.; Gorbatyuk, V.; Ilieş, H.; Kazerounian, K. One Degree of Freedom 7-R Closed Loop Linkage as a Building Block of Nanorobots. In Proceedings of the International Symposium on Advances in Robot Kinematics, Bilbao, Spain, 26–30 June 2022; pp. 41–48. [Google Scholar] [CrossRef]
  11. Chorsi, M.T.; Tavousi, P.; Mundrane, C.; Gorbatyuk, V.; Kazerounian, K.; Ilies, H. Kinematic design of functional nanoscale mechanisms from molecular primitives. J. Micro Nano-Manuf. 2021, 9, 021005. [Google Scholar] [CrossRef]
  12. Mundrane, C.; Chorsi, M.; Vinogradova, O.; Ilieş, H.; Kazerounian, K. Exploring Electric Field Perturbations as the Actuator for Nanorobots and Nanomachines. In Proceedings of the International Symposium on Advances in Robot Kinematics, Bologna, Italy, 26–30 June 2022; Springer: Berlin/Heidelberg, Germany, 2022; pp. 257–265. [Google Scholar] [CrossRef]
  13. Kazerounian, K.; Ilieş, H. The Evolving Role of Robot Kinematics in Bio-Nanotechnology. In Proceedings of the International Symposium on Advances in Robot Kinematics, Ljubljana, Slovenia, 1–5 July 2024; Springer: Berlin/Heidelberg, Germany, 2024; pp. 77–87. [Google Scholar]
  14. Bolopion, A.; Cagneau, B.; Redon, S.; Régnier, S. Comparing position and force control for interactive molecular simulators with haptic feedback. J. Mol. Graph. Model. 2010, 29, 280–289. [Google Scholar] [CrossRef]
  15. Lanrezac, A.; Férey, N.; Baaden, M. Interactive Molecular Dynamics. In Comprehensive Computational Chemistry, 1st ed.; Yáñez, M., Boyd, R.J., Eds.; Elsevier: Oxford, UK, 2024; pp. 454–474. [Google Scholar] [CrossRef]
  16. Daunay, B.; Régnier, S. Stable six degrees of freedom haptic feedback for flexible ligand–protein docking. Comput.-Aided Des. 2009, 41, 886–895. [Google Scholar] [CrossRef]
  17. Kacem, A.; Zbiss, K.; Watta, P.; Mohammadi, A. Wave space sonification of the folding pathways of protein molecules modeled as hyper-redundant robotic mechanisms. Multimed. Tools Appl. 2024, 83, 4929–4949. [Google Scholar] [CrossRef]
  18. Mishra, S.; Corro-Flores, M.; Krum, D.; Forouzesh, N. Molecular docking improved with human spatial perception using virtual reality. IEEE Trans. Vis. Comput. Graph. 2024, 30, 2269–2275. [Google Scholar] [CrossRef] [PubMed]
  19. AlQuraishi, M. A watershed moment for protein structure prediction. Nature 2020, 577, 627–628. [Google Scholar] [CrossRef] [PubMed]
  20. Fersht, A.R. AlphaFold—A Personal Perspective on the Impact of Machine Learning. J. Mol. Biol. 2021, 433, 167088. [Google Scholar] [CrossRef] [PubMed]
  21. Heo, L.; Arbour, C.F.; Feig, M. Driven to near-experimental accuracy by refinement via molecular dynamics simulations. Proteins Struct. Funct. Bioinform. 2019, 87, 1263–1275. [Google Scholar] [CrossRef]
  22. Ołdziej, S.; Czaplewski, C.; Liwo, A.; Chinchio, M.; Nanias, M.; Vila, J.; Khalili, M.; Arnautova, Y.; Jagielska, A.; Makowski, M.O.; et al. Physics-based protein-structure prediction using a hierarchical protocol based on the UNRES force field: Assessment in two blind tests. Proc. Natl. Acad. Sci. USA 2005, 102, 7547–7552. [Google Scholar] [CrossRef]
  23. Arkun, Y.; Gür, M. Combining optimal control theory and molecular dynamics for protein folding. PLoS ONE 2012, 7, e29628. [Google Scholar] [CrossRef]
  24. Dunlavy, D.M.; O’leary, D.P.; Klimov, D.; Thirumalai, D. HOPE: A homotopy optimization method for protein structure prediction. J. Comput. Biol. 2005, 12, 1275–1288. [Google Scholar] [CrossRef]
  25. Lane, T.J.; Shukla, D.; Beauchamp, K.A.; Pande, V.S. To milliseconds and beyond: Challenges in the simulation of protein folding. Curr. Opin. Struct. Biol. 2013, 23, 58–65. [Google Scholar] [CrossRef]
  26. Tavousi, P.; Behandish, M.; Ilieş, H.T.; Kazerounian, K. Protofold II: Enhanced model and implementation for kinetostatic protein folding. ASME J. Nanotechnol. Eng. Med. 2015, 6, 034601. [Google Scholar] [CrossRef]
  27. Tavousi, P. On the Systematic Design and Analysis of Artificial Molecular Machines. Ph.D Thesis, University of Connecticut, Mansfield, CT, USA, 2016. [Google Scholar]
  28. Kacem, A.; Zbiss, K.; Mohammadi, A. A numerical integrator for kinetostatic folding of protein molecules modeled as robots with hyper degrees of freedom. Robotics 2024, 13, 150. [Google Scholar] [CrossRef]
  29. Mohammadi, A.; Spong, M.W. Predicting Protein Folding Pathways With Quadratic Constraints on Rates of Entropy Change: A Nonlinear Optimization-Based Control Approach. J. Dyn. Syst. Meas. Control 2025, 147, 051011. [Google Scholar] [CrossRef]
  30. Pascanu, R.; Mikolov, T.; Bengio, Y. On the difficulty of training recurrent neural networks. In Proceedings of the International Conference on Machine Learning, Atlanta, GA, USA, 16–21 June 2013; pp. 1310–1318. [Google Scholar]
  31. Riedmiller, M.; Braun, H. A direct adaptive method for faster backpropagation learning: The RPROP algorithm. In Proceedings of the IEEE International Conference on Neural Networks, San Francisco, CA, USA, 28 March–1 April 1993; pp. 586–591. [Google Scholar] [CrossRef]
  32. Moulay, E.; Léchappé, V.; Plestan, F. Properties of the sign gradient descent algorithms. Inf. Sci. 2019, 492, 29–39. [Google Scholar] [CrossRef]
  33. Li, X.; Lin, K.Y.; Li, L.; Hong, Y.; Chen, J. On Faster Convergence of Scaled Sign Gradient Descent. IEEE Trans. Ind. Inform. 2023, 20, 1732–1741. [Google Scholar] [CrossRef]
  34. Ciemny, M.P.; Badaczewska-Dawid, A.E.; Pikuzinska, M.; Kolinski, A.; Kmiecik, S. Modeling of disordered protein structures using monte carlo simulations and knowledge-based statistical force fields. Int. J. Mol. Sci. 2019, 20, 606. [Google Scholar] [CrossRef] [PubMed]
  35. Roveda, L.; Pavone, M. Gradient Descent-Based Task-Orientation Robot Control Enhanced with Gaussian Process Predictions. IEEE Robot. Autom. Lett. 2024, 9, 8035–8042. [Google Scholar] [CrossRef]
  36. Knuth, J.; Barooah, P. Collaborative 3D localization of robots from relative pose measurements using gradient descent on manifolds. In Proceedings of the 2012 IEEE International Conference on Robotics and Automation (ICRA), Saint Paul, MN, USA, 14–18 May 2012; pp. 1101–1106. [Google Scholar] [CrossRef]
  37. Heiden, E.; Palmieri, L.; Koenig, S.; Arras, K.O.; Sukhatme, G.S. Gradient-Informed Path Smoothing for Wheeled Mobile Robots. In Proceedings of the 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, Australia, 21–25 May 2018; pp. 1710–1717. [Google Scholar] [CrossRef]
  38. Mohammadi, A.; Al Janaideh, M. Sign Gradient Descent Algorithms for Kinetostatic Protein Folding. In Proceedings of the 2023 International Conference on Manipulation, Automation and Robotics at Small Scales (MARSS), Abu Dhabi, United Arab Emirates, 2–6 October 2023; pp. 1–6. [Google Scholar] [CrossRef]
  39. Finkelstein, A.V.; Ptitsyn, O. Protein Physics: A Course of Lectures, 2nd ed.; Academic Press: Amsterdam, The Netherlands, 2016. [Google Scholar]
  40. Perez, A.; Yang, Z.; Bahar, I.; Dill, K.A.; MacCallum, J.L. FlexE: Using elastic network models to compare models of protein structure. J. Chem. Theory Comput. 2012, 8, 3985–3991. [Google Scholar] [CrossRef] [PubMed][Green Version]
  41. López-Blanco, J.R.; Chacón, P. New generation of elastic network models. Curr. Opin. Struct. Biol. 2016, 37, 46–53. [Google Scholar] [CrossRef]
  42. Kountouris, P.; Hirst, J.D. Predicting β-turns and their types using predicted backbone dihedral angles and secondary structures. BMC Bioinform. 2010, 11, 407. [Google Scholar] [CrossRef]
  43. Maruyama, Y.; Mitsutake, A. Analysis of structural stability of chignolin. J. Phys. Chem. B 2018, 122, 3801–3814. [Google Scholar] [CrossRef] [PubMed]
  44. Safaryan, M.; Richtárik, P. Stochastic sign descent methods: New algorithms and better theory. In Proceedings of the 38th International Conference on Machine Learning, Virtual, 18–24 July 2021; pp. 9224–9234. [Google Scholar]
Figure 1. In KCM, the protein backbone is represented as a nanoscale kinematic system in which peptide planes act as rigid links joined by revolute joints located at the alpha-carbon atoms, Adapted from ref. [28]. Each alpha-carbon serves as a hinge between successive peptide planes and kinematically functions like a universal joint with two degrees of freedom.
Figure 1. In KCM, the protein backbone is represented as a nanoscale kinematic system in which peptide planes act as rigid links joined by revolute joints located at the alpha-carbon atoms, Adapted from ref. [28]. Each alpha-carbon serves as a hinge between successive peptide planes and kinematically functions like a universal joint with two degrees of freedom.
Robotics 14 00167 g001
Figure 2. Schematic depiction of two secondary structures: (left) an α -helix with protein data bank (PDB) code 1DJF; (right) a β -hairpin with PDB code 1UAO.
Figure 2. Schematic depiction of two secondary structures: (left) an α -helix with protein data bank (PDB) code 1DJF; (right) a β -hairpin with PDB code 1UAO.
Robotics 14 00167 g002
Figure 3. α -helix simulations using a protein backbone chain with a 20-dimensional dihedral angle vector space. (a) Conventional kinetostatic folding (Algorithm 1). (b) SGD-based kinetostatic folding with conformation-independent step size update rule (Algorithm 2). (c) SGD-based kinetostatic folding with adaptive conformation-dependent step size update rule (Algorithm 3).
Figure 3. α -helix simulations using a protein backbone chain with a 20-dimensional dihedral angle vector space. (a) Conventional kinetostatic folding (Algorithm 1). (b) SGD-based kinetostatic folding with conformation-independent step size update rule (Algorithm 2). (c) SGD-based kinetostatic folding with adaptive conformation-dependent step size update rule (Algorithm 3).
Robotics 14 00167 g003
Figure 4. α -helix simulations using a protein backbone chain with a 40-dimensional dihedral angle vector space. (a) Conventional kinetostatic folding (Algorithm 1). (b) SGD-based kinetostatic folding with conformation-independent step size update rule (Algorithm 2). (c) SGD-based kinetostatic folding with adaptive conformation-dependent step size update rule (Algorithm 3).
Figure 4. α -helix simulations using a protein backbone chain with a 40-dimensional dihedral angle vector space. (a) Conventional kinetostatic folding (Algorithm 1). (b) SGD-based kinetostatic folding with conformation-independent step size update rule (Algorithm 2). (c) SGD-based kinetostatic folding with adaptive conformation-dependent step size update rule (Algorithm 3).
Robotics 14 00167 g004
Figure 5. β -pleated sheet simulations using a protein backbone chain with a 20-dimensional dihedral angle vector space. (a) Conventional kinetostatic folding (Algorithm 1). (b) SGD-based kinetostatic folding with conformation-independent step size update rule (Algorithm 2). (c) SGD-based kinetostatic folding with adaptive conformation-dependent step size update rule (Algorithm 3).
Figure 5. β -pleated sheet simulations using a protein backbone chain with a 20-dimensional dihedral angle vector space. (a) Conventional kinetostatic folding (Algorithm 1). (b) SGD-based kinetostatic folding with conformation-independent step size update rule (Algorithm 2). (c) SGD-based kinetostatic folding with adaptive conformation-dependent step size update rule (Algorithm 3).
Robotics 14 00167 g005
Table 1. Parameters for α -helix kinetostatic folding simulations.
Table 1. Parameters for α -helix kinetostatic folding simulations.
Folding AlgorithmN (Num. Peptide Planes) κ 0 ( κ 0 , 2 ) γ 0 ζ 0
Algorithm 110 2 × 10 3
Algorithm 210 2 × 10 3 0.995
Algorithm 310 2 × 10 3 0.995 1 × 10 4
Algorithm 120 4 × 10 3
Algorithm 220 4 × 10 3 0.995
Algorithm 320 4 × 10 3 0.995 0.5 × 10 4
Table 2. Runtime and convergence comparison for α -helix folding simulations.
Table 2. Runtime and convergence comparison for α -helix folding simulations.
NAlgorithmIterations to ConvergenceReduction vs. KCM
10Conventional KCM420
10SGD28033% ↓
10Adaptive SGD20052% ↓
20Conventional KCM760
20SGD40047% ↓
20Adaptive SGD29062% ↓
Table 3. Parameters for β -pleated sheet kinetostatic folding simulations.
Table 3. Parameters for β -pleated sheet kinetostatic folding simulations.
Folding AlgorithmN (Num. Peptide Planes) κ 0 ( κ 0 , 2 ) γ 0 ζ 0
Algorithm 110 7.5 × 10 3
Algorithm 210 7.5 × 10 3 0.995
Algorithm 310 7.5 × 10 3 0.995 1 × 10 4
Table 4. Runtime and convergence comparison for β -pleated sheet folding simulations.
Table 4. Runtime and convergence comparison for β -pleated sheet folding simulations.
NAlgorithmIterations to ConvergenceReduction vs. KCM
10Conventional KCM610
10SGD39036% ↓
10Adaptive SGD25059% ↓
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mohammadi, A.; Al Janaideh, M. Sign Gradient Descent Algorithms for Accelerated Kinetostatic Protein Folding in Nanorobotics Design. Robotics 2025, 14, 167. https://doi.org/10.3390/robotics14110167

AMA Style

Mohammadi A, Al Janaideh M. Sign Gradient Descent Algorithms for Accelerated Kinetostatic Protein Folding in Nanorobotics Design. Robotics. 2025; 14(11):167. https://doi.org/10.3390/robotics14110167

Chicago/Turabian Style

Mohammadi, Alireza, and Mohammad Al Janaideh. 2025. "Sign Gradient Descent Algorithms for Accelerated Kinetostatic Protein Folding in Nanorobotics Design" Robotics 14, no. 11: 167. https://doi.org/10.3390/robotics14110167

APA Style

Mohammadi, A., & Al Janaideh, M. (2025). Sign Gradient Descent Algorithms for Accelerated Kinetostatic Protein Folding in Nanorobotics Design. Robotics, 14(11), 167. https://doi.org/10.3390/robotics14110167

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop