In this section, we briefly review the KCM framework. To avoid ambiguity, we clarify the terminology: in biochemistry, the term “conformation” is conventionally used to describe the spatial arrangement of a protein’s kinematic structure, whereas in robot kinematics the term “configuration” is more common. Throughout this work, “conformation” and “configuration” are treated as interchangeable.
2.1. The Protein Backbone Nano-Kinematics
Proteins are macromolecules distinguished by their structural complexity and dynamic behavior. They consist of extended chains of peptide planes connected through chemical bonds. A protein’s functionality is inherently tied to its 3D conformation, which is dictated by the linear arrangement of polypeptide chain amino acids. For clarity and focus, the present discussion is restricted to the protein backbone.
Figure 1 depicts the protein polypeptide backbone. As shown, each peptide plane comprises six covalently bonded atoms that lie within a single plane. The assumption of coplanarity is strongly supported by X-ray crystallography [
39]. This structural property provides the basis for modeling proteins as complex nano-mechanisms with many degrees of freedom [
26,
29]. In this view, peptide planes act as the rigid building blocks of the kinematic chain and serve as nanoscale linkages in the molecular nano-mechanism.
The rotations of nano-linkages within the protein kinematic chain are enabled by the central carbon atoms, denoted
and commonly referred to as alpha-carbons (
Figure 1). These
atoms act as hinges, connecting adjacent peptide planes. At the termini of the chain, the
atoms are linked to an N-terminus at one end and a C-terminus at the other.
The nano-kinematic architecture of a peptide backbone consisting of
peptide planes/nano-linkages is fully characterized by the length of its chemical bonds together with the associated pairs of dihedral angles. For the
ith peptide plane, there exist two distinct rotational degrees of freedom. The first corresponds to torsional motion about the covalent bond connecting the amide nitrogen atom (N) to the central alpha carbon atom (
), and this degree of freedom is represented by the dihedral angle
. The second arises from rotation about the covalent bond linking the central alpha carbon atom (
) to the carbonyl carbon atom (C), which is similarly characterized by the dihedral angle
. Together, these two rotational axes capture the essential kinematic variability of the peptide plane, as illustrated in
Figure 1. Collecting all such variables defines the vector
which specifies the full configuration of the backbone chain. The configuration space
is a torus formed by the Cartesian product of
unit circles
.
Each dihedral angle
in the configuration vector
given by (1) corresponds to one degree of freedom (DOF). It is possible to define a unit vector
,
for each DOF. Each unit vector aligns with the axis of its corresponding rotational motion.
Figure 1 illustrates the unit vectors
and
within the
ith peptide plane (
Figure 1). These vectors capture the directions of the bonds connecting the alpha carbon to the carbonyl carbon and to the nitrogen atom, respectively. These vectors define the primary axes of rotation for each segment of the backbone chain. Finally, the vectors
and
are reserved to represent the special unit directions associated with the termini of the backbone chain. Specifically,
corresponds to the orientation at N-terminus, which marks the starting end of the polypeptide chain. In contrast,
designates the orientation at C-terminus, which defines the terminating end of the chain.
While the dihedral angles specify the rotational DOFs along the kinematic chain, they are not by themselves sufficient to compute the rigid peptide linkage orientations. To capture this missing geometric information, the relative spatial arrangement of the coplanar atoms are represented using the body vectors within each peptide plane, denoted by
for
. Using these body vectors, the position of any atom in a given plane can be expressed relative to another atom of the same plane through a linear combination of the form
, where the coefficients
and
, for
, are fixed atom-specific constants that remain consistent throughout the peptide chain [
26,
27]. Taken together, the body vectors
and the unit vectors
yield a complete mathematical representation of the protein’s conformation, expressed entirely in terms of the underlying dihedral angle variables.
Rotation matrices can be used to map the conformation vector
to the 3D configuration of the protein’s kinematic structure [
26]. Specifically, a reference configuration, given by
, together with the corresponding reference unit vectors
and reference body vectors
for
, are used in
where
,
, is the rotation matrix about the reference axis
.
To compute the peptide atom Cartesian positions, the N-terminal atom is anchored at the origin and the body vectors
from (2) are utilized in the following way. Letting
and
correspond to the nitrogen and
atoms of the
kth peptide plane, the position vectors are given by
2.2. Kinetostatic Compliance Folding
The KCM framework builds on the well-established experimental observation that protein folding dynamics can be effectively captured while neglecting inertial forces (see, e.g., [
23]). The protein chain in KCM is represented as a collection of kinematic linkages that deform in response to the kinetostatic effects of nonlinear force fields (
Figure 1).
Consider a peptide chain composed of
atoms arranged into
peptide planes. The conformational state of this chain is fully described by the dihedral angle vector
, introduced in Equation (1). For any two atoms,
and
, their Cartesian coordinate vectors are denoted by
and
, respectively. The Euclidean distance between them is given by
. The physical parameters necessary for evaluating the interatomic forces are atomic charges, van der Waals radii, equilibrium bond lengths, potential well depths, and interaction weight factors. The value of these parameters is detailed in [
27] and related references.
In KCM, the total free energy governing protein folding, denoted
, is expressed as the sum of electrostatic and van der Waals contributions:
where
is the electrostatic potential energy, and
is the van der Waals interaction energy (see, e.g., [
26]). The forces on each atom
,
, which are induced by
and
, are equal to the negative gradients of the corresponding energy terms:
and
.
Within the KCM framework, the first step involves evaluating the net forces and torques exerted on each of the
peptide planes of the protein molecule [
26]. These contributions are collected into a
-dimensional vector
, referred to as the generalized force vector, which encapsulates the overall influence driving the folding process. In order for these forces to induce conformational changes, the generalized force vector
is projected onto an equivalent
-dimensional torque vector
. This projection effectively converts the generalized forces into torsional actions on the dihedral angles, which are the primary variables controlling kinetostatic protein folding.
The kinetostatic folding torque vector is given by
where
is the molecular chain Jacobian at conformation
. As described in [
26], the matrix
functions as the intermediary between the generalized forces and the torsional dynamics of the protein. Specifically, it maps the
-dimensional generalized force vector onto the reduced
-dimensional space of dihedral torques.
The conformational changes in dihedral angles occur under the influence of , which aligns with the steepest-descent direction of the total free energy . This folding torque vector thus guides the backbone through the conformational landscape.