Neural Adaptive Nonlinear MIMO Control for Bipedal Walking Robot Locomotion in Hazardous and Complex Task Applications

Bekhiti, Belkacem; Iqbal, Jamshed; Hariche, Kamel; Fragulis, George F.

doi:10.3390/robotics14060084

Open AccessArticle

Neural Adaptive Nonlinear MIMO Control for Bipedal Walking Robot Locomotion in Hazardous and Complex Task Applications

by

Belkacem Bekhiti

¹

,

Jamshed Iqbal

^2,*

,

Kamel Hariche

³ and

George F. Fragulis

⁴

¹

Laboratory of Aeronautical Sciences, Institute of Aeronautics and Space Studies (IASS), Blida 09000, Algeria

²

School of Digital and Physical Sciences, Faculty of Science and Engineering, University of Hull, Hull HU6 7RX, UK

³

Institute of Electrical and Electronic Engineering, Boumerdes 35000, Algeria

⁴

Department of Electrical and Computer Engineering, ZEP Campus, University of Western Macedonia, 50100 Kozani, Greece

^*

Author to whom correspondence should be addressed.

Robotics 2025, 14(6), 84; https://doi.org/10.3390/robotics14060084

Submission received: 9 May 2025 / Revised: 7 June 2025 / Accepted: 12 June 2025 / Published: 17 June 2025

(This article belongs to the Section Humanoid and Human Robotics)

Download

Browse Figures

Versions Notes

Abstract

This paper introduces a robust neural adaptive MIMO control strategy to improve the stability and adaptability of bipedal locomotion amid uncertainties and external disturbances. The control combines nonlinear dynamic inversion, finite-time convergence, and radial basis function (RBF) neural networks for fast, accurate trajectory tracking. The main novelty of the presented control strategy lies in unifying instantaneous feedback, real-time learning, and dynamic adaptation within a multivariable feedback framework, delivering superior robustness, precision, and real-time performance under extreme conditions. The control scheme is implemented on a 5-DOF underactuated

R A B B I T

robot using a

d S P A C E D S 1103

platform with a sampling rate of

∆ t = 1.5 m s (667 H z)

. The experimental results show excellent performance with the following: The robot achieved stable cyclic gaits while keeping the tracking error within

e = \pm 0.04 r a d

under nominal conditions. Under severe uncertainties of trunk mass variations

{∆ m}_{t r u n k} = + 100 %

, limb inertia changes

{∆ I}_{l i m b} = \pm 30 %

, and actuator torque saturation at

τ = \pm 150 N m

, the robot maintains stable limit cycles with smooth control. The performance of the proposed controller is compared with classical nonlinear decoupling, non-adaptive finite-time, neural-fuzzy learning, and deep learning controls. The results demonstrate that the proposed method outperforms the four benchmark strategies, achieving the lowest errors and fastest convergence with the following:

I A E = 1.36

,

I T A E = 2.43

,

I S E = 0.68

,

t_{s s} = 1.24 s

, and

M_{p} = 2.21 %

. These results demonstrate evidence of high stability, rapid convergence, and robustness to disturbances and foot-slip.

Keywords:

neural adaptive control; MIMO control; RABBIT robot; robust tracking; bipedal walking robot

1. Introduction

Bipedal locomotion has long been central to robotics and biomechanics, aiming to replicate human-like walking. A major challenge has been ensuring dynamic stability, addressed early on by Gubina [1], who introduced analytical models based on the inverted pendulum. This framework became foundational for later stability assessments and advanced control techniques [2]. Lyapunov’s second method [3] added mathematical rigor to postural equilibrium, while feedback-based on–off control [4] marked early practical implementations. Modeling-constrained dynamic systems further integrated stability into robotic biped design [5]. Throughout the late 20th century, strategies evolved via pole assignment, on–off feedback, and constrained modeling [6,7,8,9]. Saleem [10] and Afifa [11] introduced fractional-order and sliding-mode control for better adaptability. Later, optimization and intelligent control—e.g., genetic algorithms [12], terrain adaptation [13], other adaptive systems [14,15]—boosted robustness. Dariush [16] enhanced gait precision via external measurement-based motion analysis.

Recent work by Chevallereau [17,18,19] on underactuated bipedal locomotion, Grizzle’s nonlinear control frameworks [20], and Djoudi’s optimal motion planning [21,22,23] emphasized the role of zero moment point (ZMP) in dynamic stability [24,25]. Neural networks and machine learning [26,27] have improved adaptability to environmental changes. Feedback control advances by Westervelt [28], Djoudi [29], and Chevallereau [30,31] further enhanced dynamic stability. Grizzle [32] addressed challenges in 3D locomotion, spurring new control methods. For human-like gait, Wang [33] and Liu [34] introduced foot rotation and toe support, improving energy efficiency, Hemami [35] introduced human and robotic movement in the air. Kim [36] and Kakaei [37] developed robust, whole-body control strategies, aiding real-world humanoid applications. Passive dynamics were explored by Martínez and Villarreal-Cervantes [38], while fuzzy logic [39] and deep learning [40] expanded adaptability in varied conditions.

Modern advancements continue to push the boundaries of bipedal locomotion, incorporating bio-inspired control strategies to enhance gait stability and adaptability. Wu [41] proposed a bionic walking control approach based on central pattern generators (CPGs), utilizing an improved particle swarm algorithm to refine locomotion patterns. Extending this research, Wu [42] introduced a multivariate linear mapping technique for stabilizing CPG-based control, further improving adaptability in dynamic environments. Additionally, passive tendon mechanisms [43] and hybrid control methodologies that blend adaptive techniques with conventional feedback control have been explored to optimize gait efficiency and stability [44]. Mou [45] introduced high-dynamic bipedal robots featuring underactuated telescopic straight legs, enhancing both agility and adaptability. Furthermore, model-predictive control strategies [46,47] have been investigated to achieve more natural and efficient gait dynamics. These ongoing developments underscore the continuous effort to bridge the gap between theoretical control frameworks and practical walking robot applications.

A key gap in the current literature is the absence of a real-time, adaptive control framework that seamlessly combines trajectory planning, dynamic modeling, instantaneous feedback, and hybrid gait transitions for bipedal robots operating in uncertain and dynamically changing environments. Motivated by the persistent limitations of existing neural adaptive MIMO control strategies, particularly their inability to maintain stability under structural damage, abrupt disturbances, or partial system failures, this work proposes a unified and robust control solution. Unlike fragmented approaches that separately address adaptation, learning, and feedback correction, our framework tightly integrates these components in real time to enhance performance in dynamically changing environments. In summary, the primary novel contributions of the proposed control method of this work are as follows:

Unified adaptive control: Combines adaptation, learning, and real-time feedback within a cohesive architecture, addressing the fragmentation in prior methods.
Robustness under critical failures: Maintains gait stability under severe conditions, including structural damage, disturbances, and system degradation.
Real-time feedback and learning: Achieves fast correction using finite-time dynamic feedback and RBF-based online learning in hazardous tasks.
Synergistic adaptation and learning integration: Combines MIMO adaptive control and neural approximation for precise trajectory tracking under uncertainty.
Scalability and generalization: Applicable across diverse walking modes and robotic platforms, extending its use beyond the tested 5 DOF RABBIT robot.

This paper is organized as follows: Section 2 describes a bipedal robot and its dynamic models for various walking phases, including single support, impact dynamics, symbolic modeling, and trajectory generation. Section 3 presents the control strategy, combining nonlinear decoupling with finite-time convergent control, and it includes hybrid dynamics analysis via the Poincaré method and a neural network-based adaptive framework. Section 4 reports the experimental results of the RABBIT robot across three scenarios. Section 5 concludes with key insights and future directions.

2. Mathematical Modeling of Bipedal Locomotion

This section deals with the control of bipedal walking robots, with an application to the planar RABBIT prototype robot with un-motorized ankles. Efforts have focused on the control and generation of reference trajectories, defined in the joint geometric space, allowing stable cyclic walking (see Figure 1) in a context of underactuation [31].

Walking is the alternating repetition of elementary movements from one leg to the other, resulting in a cyclic pattern that can be described in two ways. It is best modeled as a sequence of steps—elementary reference motions executed in various forms but defined by fixed kinematic properties. Each step consists of two distinct phases:

▪: Single-support phase or swing phase, during which the locomotion system evolves as an open kinematic chain (one foot on the ground and the other is free).
▪: Double-support phase, during which the locomotion system moves as a closed kinematic loop (both feet on the ground).

The single-support phase, a transfer phase, spans from the toe-off to heel-touch of the swing leg. Double support extends from the heel-touch of the front foot to the toe-off of the rear foot. In this phase, the system becomes overactuated, and its dynamics is underdetermined, requiring specialized methods to manage actuation redundancies [28].

2.1. Kinematic and Dynamic Modeling of Biped Robots (Diff Phases)

The Lagrange equations give the dynamics of the system directly in the joint space

q \in R^{n}

in the following vector form:

\frac{d}{d t} (\frac{\partial L}{\partial \dot{q}}) - \frac{\partial L}{\partial q} = Q_{e x t}

(1)

where

▪: $Q_{e x t} (t)$ is the vector of generalized forces/torques;
▪: $L = E_{k} - E_{p}$ is the $L a g r a n g i a n$ of the system;
▪: $E_{k} (t) a n d E_{p} (t)$ are the total kinetic and potential energies of the system;
▪: $q (t) a n d \dot{q} (t)$ are the vectors of the generalized positions and velocities.

From the Lagrangian, the equations of motion are as follows [25]:

\frac{d}{d t} (\frac{\partial E_{k}}{\partial \dot{q}}) - \frac{\partial E_{k}}{\partial q} + \frac{\partial E_{p}}{\partial q} = Q_{e x t} = D (q, \dot{q}) + B^{T} Γ + J^{⊤} λ

(2)

where

λ

is the Lagrange multiplier vector, and

J

is the Jacobian matrix of constraints. The last equation gives the following (see [48,49]):

A (q) \ddot{q} + C (q, \dot{q}) \dot{q} + G (q) = Q_{e x t} e q u a t i o n o f m o t i o n

(3)

where

A

(n × n) is the inertia matrix.

C

(n × n) corresponds to Coriolis and centrifugal effects.

G

(n × 1) corresponds to the effects of gravity. Matrices

A

,

C

, and

G

are given by:

\{A (q) = \frac{\partial^{2} E_{k}}{\partial {\dot{q}}^{2}}; G (q) = {[\frac{\partial E_{p}}{\partial q}]}^{⊤}; C (q, \dot{q}) = (C_{i j}); \{\begin{array}{l} C_{i j} = \sum_{k = 1}^{n} c_{i, j k} {\dot{q}}_{k} \\ c_{i, j k} = \frac{1}{2} [\frac{\partial A_{i j}}{\partial q_{k}} + \frac{\partial A_{i k}}{\partial q_{j}} - \frac{\partial A_{j k}}{\partial q_{i}}] \end{array}\}

(4)

The defining characteristic of a walking robot is the repetitive and alternating contact of its legs with the ground, making it a fundamental aspect of locomotion (Figure 2). Accurately modeling this contact is essential to account for the constraints imposed by the robot’s interaction with its environment. These constraints dictate the behavior of the foot during ground contact, ensuring that it neither lifts off nor slips, thereby maintaining stable and controlled movement.

In rigid contact (un-deformable ground), foot velocity and acceleration are zero and enforced by unknown contact forces. In compliant contact, forces depend on the ground’s deformation and its rate. The position

χ_{i} = {[x_{χ i}, y_{χ i}]}^{⊤} \in R^{2 \times 1}

of the tip

S_{i}

of the supporting foot (Figure 3) is a function of the joints

q = {[q_{1} q_{2} q_{3} q_{4} q_{5}]}^{⊤} \in R^{5 \times 1}

and the position of the center of gravity (CG) according to the following:

{[\begin{matrix} x_{g} & z_{g} \end{matrix}]}^{⊤} = χ_{i} + f_{i} (q)

. The vector

f_{i} (q) = {[f_{i x}, f_{i y}]}^{⊤} \in R^{2 \times 1}

is only a function of

q

and the parameters of the robot (lengths of the bodies and positions of the centers of gravity). In rigid contact, foot

i

, which is resting on the ground, is fixed. Its position

χ_{i}

is constant, and depending on the type of contact, its orientation can also be constant. This results in an equation of the following type:

χ_{i} (q) = c

; here,

q

is the vector of generalized coordinates describing the biped, and

i = 1,2

is the index. The number and type of equations written in equation (

χ_{i} (q) = c

) depend on the type connection that exists between the foot in contact and the ground. The zero velocity/acceleration constraints are written by differentiating

χ_{i} = c

:

χ_{i} = c ⟹ \{\begin{array}{l} {\dot{χ}}_{i} (q) = A_{c i} (q) \dot{x} = 0 \\ {\ddot{χ}}_{i} (q) = A_{c i} (q) \ddot{x} + H_{i} (q, \dot{q}) = 0 \end{array} \begin{matrix} w i t h A_{c i} = \frac{\partial χ_{i}}{\partial x} = [- \frac{\partial f_{i}}{\partial q} I_{2}] \in R^{2 \times 7} \end{matrix}

(5)

where

x = {[q^{T} x_{g} z_{g}]}^{⊤} \in R^{7 \times 1}

, and

A_{c i} (q) = \partial χ_{i} (q) / \partial x

is the Jacobian matrix of

χ_{i}

with respect to

x

.

H_{i} (q, \dot{q})

contains the other terms appearing during the derivation. With fixed foot contact, foot velocity and acceleration are at zero.

The considered robot is an “underactuated RABBIT” prototype model presented in Figure 3. The biped is composed of five bodies (links), a trunk, and two identical legs. Each leg consists of two bodies articulated at the knee. Knees and hips are pivot joints assumed to be frictionless and without mechanical play. The vector of generalized coordinates

x = {[q_{1} q_{2} q_{3} q_{4} q_{5} x_{g} z_{g}]}^{⊤}

, which describes the configuration of the biped, is composed of

q_{c} = {[q_{1} q_{2} q_{3} q_{4}]}^{⊤} \in R^{4 \times 1}

of the relative-angle motors defining the shape of the robot and

q_{a} = {[q_{5} x_{g} z_{g}]}^{⊤} \in R^{3 \times 1}

, defining the situation of the trunk of the robot with respect to a fixed frame in the vertical plane

(x, z)

.

q_{5}

is the absolute orientation of the trunk, and

x_{g}

and

z_{g}

are the coordinates of the center of mass of the biped. The robot’s position is described by its center of gravity, simplifying impact modeling and integrating it into the system’s dynamics. The biped, powered by four motors, has joint torques

Γ = {[Γ_{1} Γ_{2} Γ_{3} Γ_{4}]}^{⊤}

acting on the knees and hips (see Figure 4 and Figure 5 [25,28,31]).

Table 1 lists the RABBIT robot’s dynamic and geometric parameters [19,21]:

$m_{i}$ is the mass of the $i^{t h}$ body $i = 1 \dots 7$ (if we neglect feet, then $i = 1 \dots 5$ ).
$I_{G i}$ is the $i^{t h}$ moment of inertia at the center of gravity along axis $y$ .
$I_{A i}$ is the moment of inertia of motor $i$ around the $y$ axis.
$s_{i}$ is the distance from body $i$ ’s center of gravity to its frame origin.

The 7 DOF dynamic model of the robot is given according to the Lagrange formalism by the general equation of motion. With our choice of coordinates, we have [22,29]

\{\begin{matrix} E_{k} = \frac{1}{2} {\dot{x}}^{⊤} (t) [\begin{matrix} A_{5} (q_{c}) & 0_{5 \times 2} \\ 0_{2 \times 5} & m I_{2 \times 2} \end{matrix}] \dot{x} (t); E_{p} = m g z_{g} \\ Q_{e x t} (t) = D (q, \dot{q}) + {(\frac{\partial q_{c}}{\partial x})}^{⊤} Γ (t) + \sum_{i = 1}^{2} {(\frac{δ χ_{i}}{δ x})}^{⊤} R_{i} \end{matrix}\}

(6)

where

m

is the total mass of the biped,

g

is the gravity acceleration, and

A_{5} (q_{c}) \in R^{5 \times 5}

is the inertia matrix.

D (q, \dot{q}) \in R^{7 \times 1}

is a vector that regroups the terms of dissipative forces at the joints.

R_{i}

denotes ground reaction forces. Notice that

A (q_{c}) \ddot{x} + C (q, \dot{q}) \dot{x} + G (q) = Q_{e x t}

, so we can write

D (q, \dot{q}) + B^{⊤} Γ + J^{⊤} λ = D (q, \dot{q}) + {(\frac{\partial q_{c}}{\partial x})}^{⊤} Γ + [A_{c 1}^{⊤} (q) R_{1} + A_{c 2}^{⊤} (q) R_{2}]

(7)

where

A (q_{c}) \in R^{7 \times 7}

is the symmetric positive–definite inertia matrix, which defines kinetic energy.

C (q, \dot{q}) \in R^{7 \times 7}

is a matrix that groups the centrifugal and Coriolis inertia terms together.

G (q) \in R^{7 \times 1}

represents the terms of gravity.

D (q, \dot{q}) \in R^{7 \times 1}

is a vector that regroups the terms of dissipative forces at the joints, and

B \in R^{4 \times 7}

. The presented model is convenient for all phases of planar bipedal locomotion. For double-support phases, both ground reactions are not zero. For single-support phases, only one reaction force is not zero. For flight phases, both reaction forces are zero. The direct dynamic model of the RABBIT robot during the single-support phase has four joint couples

Γ = {[Γ_{1} Γ_{2} Γ_{3} Γ_{4}]}^{⊤}

as inputs and the state vector of the robot

q = {[q_{1} q_{2} q_{3} q_{4} q_{5}]}^{⊤}

. With respect to

\dot{q} = {[{\dot{q}}_{1} {\dot{q}}_{2} {\dot{q}}_{3} {\dot{q}}_{4} {\dot{q}}_{5}]}^{⊤}

, this model is the robot simulator, at its output gives the real articular acceleration

\ddot{x} (t)

with

x^{⊤} = [q^{⊤} x_{g} z_{g}]

and the ground reaction force

R_{i}

. In this section, we provide the calculation of this model. The equations of motion governing the robot when it is in the single-support mode can be deduced from Equation (6). If we consider that the biped is resting on foot 1, there is only one force from the reaction of two components,

R_{1} = {[R_{1 x}, R_{1 z}]}^{⊤}

, and the friction torque is zero at

R_{2} = {[0, 0]}^{⊤}

. The dynamic model is written as follows:

A (q_{c}) \ddot{x} + C (q, \dot{q}) \dot{x} + G (q) = B^{⊤} Γ + A_{c 1}^{⊤} (q) R_{1}; A (q_{c}) = b l k d i a g ([A_{5} (q_{c}) m I_{2 \times 2}])

(8)

The model comprises seven differential equations, and we have nine unknowns, which are joint acceleration

\ddot{x}

with seven components and the two components of the force of reaction

R_{1} = {[R_{1 x}, R_{1 z}]}^{⊤}

. This system can be completely defined by writing the stress equation with respect to the foot in contact with the ground. The foot in contact must not slip or take off, so its position is constant, and therefore, its speed and acceleration are zero. Thus, we have the position of the foot in support denoted by

χ_{1} (t)

, and it is as follows:

χ_{1} (t) = c o n s t

;

{\dot{χ}}_{1} = 0

;

{\ddot{χ}}_{1} = 0

. We deduce, analogously to the kinematic and dynamic constraints,

A_{c 1} (q) \dot{x} (t) = 0

and

A_{c 1} (q) \ddot{x} + H_{1} (q, \dot{q}) = 0

, where

H_{1} (q, \dot{q}) = - A_{c 1} (q) \ddot{x}

is a matrix of dimension

2 \times 1

, and

A_{c 1} (q)

is given as

A_{c 1} (q) = [- \partial f_{1} (q) / \partial q I_{2}]

. By combining the equations of the dynamic model and the second equation’s constraint on the acceleration of the foot in support, we obtain a ninth-order system,

A (q_{c}) \ddot{x} - A_{c 1}^{⊤} (q) R_{1} = B^{⊤} Γ - C (q, \dot{q}) \dot{x} - G (q)

, with

A_{c 1} (q) \ddot{x} + H_{1} (q, \dot{q}) = 0

. Thus, acceleration

\ddot{x}

and the reaction force

R_{1}

are given by the following [17,18,19,30]:

[\begin{matrix} \ddot{x} \\ R_{1} \end{matrix}] = {[\begin{matrix} A (q_{c}) \\ A_{c 1} (q) \end{matrix} \begin{matrix} - A_{c 1}^{⊤} (q) \\ 0_{2 \times 2} \end{matrix}]}^{- 1} [\begin{matrix} B^{⊤} Γ - C (q, \dot{q}) \dot{x} - G (q) \\ - H_{1} (q, \dot{q}) \end{matrix}]; \{A (q_{c}) = [\begin{matrix} A_{5} (q_{c}) & 0_{5 \times 2} \\ 0_{2 \times 5} & m I_{2 \times 2} \end{matrix}]; B = [\begin{matrix} I_{4 \times 4} & 0_{3 \times 4} \end{matrix}]\}

(9)

In the case of single-phase support, we will use the 5 DOF model:

A_{5 \times 5} (q) \ddot{q} + C_{5 \times 5} (q, \dot{q}) \dot{q} + G_{5 \times 1} (q) = {[\begin{matrix} Γ^{⊤} & 0 \end{matrix}]}^{⊤}

(10)

2.2. The Model of Impacts

Impacts are modeled as high-intensity forces acting over an infinitesimal time

∆ t

, represented by the integral of the contact force during this interval. Since

∆ t

is very small, positional changes are neglected. Thus, impact causes a sudden change in velocity and not position, and it is often expressed using a Dirac function scaled by the impulse’s magnitude. The goal is to compute post-impact velocities and impulses, assuming that pre-impact velocities are known. At the end of single support, when swing leg

j

contacts the ground, an inelastic impact occurs. The ground reaction is modeled as a Dirac delta-function with intensity

I_{R j}

, and the foot’s velocity is assumed to drop to zero immediately after contact. Depending on whether the stance leg lifts off, two types of impact may occur. The rebound between two links is classically described by the coefficient of restitution

ε

, which—via Newton’s impact law—relates the normal component of their relative velocity before and after impacts. For foot–ground collision during walking, it becomes

v_{i n}^{+} = ε v_{i n}^{-}

, where

v_{i n}

is the normal velocity upon contact. The value of

ε

, ranging from

0

(perfectly inelastic) to 1 (perfectly elastic), depends on the contact materials. By integrating the dynamic equations over an infinitesimal impact duration, the impulsive dynamics governing this phase can be derived. Since the matrices

A (q_{c})

and

A_{c j} (q)

depend solely on the robot’s configuration, they are considered constant over the short collision duration. The resulting impact model is expressed as follows:

A (q_{f}) ({\dot{x}}^{+} - {\dot{x}}^{-}) = A_{c j}^{⊤} (q_{f}) I_{R j}

.

[\begin{matrix} A_{5} (q_{f}) & 0_{5 \times 2} \\ 0_{2 \times 5} & m I_{2 \times 2} \end{matrix}] ({\dot{x}}^{+} - {\dot{x}}^{-}) = [\begin{matrix} {- [\partial f_{j} (q) / \partial q]}^{⊤} \\ I_{2 \times 2} \end{matrix}] I_{R j}

(11)

Here, the superscripts (+) and (−) denote post- and pre-impact velocities, respectively, while

q_{f}

represents the final configuration of the single-support phase. The contact forces, being theoretically infinite upon impact, yield finite impulsive effort

I_{R j}

when integrated over the vanishing impact duration. The acceleration of the robot is taken as infinite, and by integrating it, we obtain the velocity variation

({\dot{x}}^{+} - {\dot{x}}^{-})

. As

A (q_{f})

is invertible, the velocity after impact is

{\dot{x}}^{+} = {\dot{x}}^{-} + {[A (q_{f})]}^{- 1} A_{c j}^{⊤} (q_{f}) I_{R j}

, so we obtain

A_{c j} (q_{f}) {\dot{x}}^{+} = A_{c j} (q_{f}) {\dot{x}}^{-} + A_{c j} (q_{f}) {[A (q_{f})]}^{- 1} A_{c j}^{⊤} (q_{f}) I_{R j}

. When matrix

A_{c j} (q_{f})

is of full rank, this equation enables us to calculate

I_{R j}

under the assumption that (

A_{c j} (q_{f}) {\dot{x}}^{+} = 0

), so

I_{R j} = - {[A_{c j} (q_{f}) {[A (q_{f})]}^{- 1} A_{c j}^{⊤} (q_{f})]}^{- 1} A_{c j} (q_{f}) {\dot{x}}^{-}

. Moreover, one can derive an alternative formulation without the use of

{\dot{x}}^{-}

(see [23,25]). First, we define

δ_{q} = {\dot{q}}^{+} - {\dot{q}}^{-};

then,

A_{5} (q_{f}) δ_{q} = - m {[\frac{\partial f_{j} (q_{f})}{\partial q}]}^{⊤} [\begin{matrix} {\dot{x}}_{g}^{+} - {\dot{x}}_{g}^{-} \\ {\dot{z}}_{g}^{+} - {\dot{z}}_{g}^{-} \end{matrix}] = - m {[\frac{\partial f_{j} (q_{f})}{\partial q}]}^{⊤} \{\frac{\partial f_{j} (q_{f})}{\partial q} {\dot{q}}^{+} - \frac{\partial f_{i} (q_{f})}{\partial q} {\dot{q}}^{-}\}

(12)

Thus, the biped angular velocity vectors before and after impact are related by linear equation

{\dot{q}}^{+} = ∆ (q_{f}) {\dot{q}}^{-}

, where

∆ (q_{f}) = ∆_{1}^{- 1} (q_{f}) ∆_{2} (q_{f}) w i t h

∆_{1} (q_{f}) = \{A_{5} (q_{f}) + m {[\frac{\partial f_{j} (q_{f})}{\partial q}]}^{⊤} \frac{\partial f_{j} (q_{f})}{\partial q}\}; ∆_{2} (q_{f}) = \{A_{5} (q_{f}) + m {[\frac{\partial f_{j} (q_{f})}{\partial q}]}^{⊤} \frac{\partial f_{i} (q_{f})}{\partial q}\}

(13)

The impulse efforts are modeled by Formalsky’s equation (see [17,25]):

I_{R j} = m [\frac{\partial f_{j} (q_{f})}{\partial q} ∆ (q_{f}) - \frac{\partial f_{i} (q_{f})}{\partial q}] {\dot{q}}^{-}

(14)

A walking gait consists of alternating single-support, impact, and double-support phases. The system state

η (t)

, comprising configuration and velocity variables, evolves according to a hybrid model

\dot{η} (t) = f (η (t)) + g (η) u w h e n η (t) \notin S

, and

η (t^{+}) = ϕ (η (t^{-})) w h e n η (t) \in S

. Here,

u

is the control input, and

f

,

g

, and

ϕ

are nonlinear continuous functions. The impact set

S

defines the switching surface containing all states that trigger a ground impact. The second expression (as given by Westervelt and Chevallereau) models the instantaneous velocity change at impact (see [28]).

During single support, the robot evolves under a continuous dynamic model. An impact is detected when the swing foot reaches ground level (

ϕ = 0

), causing a discontinuous change in velocity. This is followed by either single or double support on the opposite foot (see Figure 6). The double-support phase ends through a control-triggered vertical acceleration that lifts the foot off the ground.

2.3. Symbolic Calculations of the Model

During the initial and final single-support configurations and the double-support phase, both feet of the robot are in contact with the ground. These two feet thus form a closed kinematic loop. Three parameters are sufficient to determine these configurations: the distance between the feet

d

, the abscissa of the hip

x_{h}

, and the height of the hip

z_{h}

.

Now, we present the geometric model to express joint variables in terms of Cartesian variables. To simplify calculations, we define the absolute angle

α_{i}

on the robot (see Figure 7) and compute them based on Cartesian parameters

z_{h}

,

x_{h}

, and

d

. A change in variable allows expressing the geometric model in terms of relative variables used for dynamics.

We decompose our calculation on the two legs, and the benchmark with respect to which all calculations are performed is related to the ankle of the rear support leg. The loop closure makes it such that the height of the hip,

z_{h}

, is the same for the two legs, and the distance

d

between the legs is constant. This leads to the following constraints:

x_{h} = L_{1} S_{α_{1}} + L_{2} S_{α_{2}}

;

z_{h} = L_{1} C_{α_{1}} + L_{2} C_{α_{2}}

;

d - x_{h} = L_{3} S_{α_{3}} + L_{4} S_{α_{4}}; a n d z_{h} = L_{3} C_{α_{3}} + L_{4} C_{α_{4}}

, where

S_{•} = \sin (•)

and

S_{•} = \cos (•)

. In Table 1, we have lengths

L_{1} = L_{2} = L_{3} = L_{4}

; this equality can greatly simplify calculations, but we prefer to keep different lengths to generalize the calculation and leave the choice of adding robustness to others. From the inverse geometric model, we have absolute angles

α_{1}

,

α_{2}

,

α_{3}

, and

α_{4}

in the function of step length

d

, the abscissa of the hip

x_{h}

, and the height of the hip

z_{h}

. We already have the absolute angle

q_{5}

, which is also known. We obtain the following relations:

q_{5} + q_{2} + α_{2} = π

,

q_{5} + q_{4} + α_{4} = π

,

q_{5} + q_{2} + q_{1} - α_{1} = π

and

q_{5} + q_{2} + q_{1} + α_{3} = π ⟹ q_{1} = α_{1} + α_{2}

,

q_{2} = π - q_{5} - α_{2}

,

q_{3} = α_{4} - α_{3}

, and

q_{4} = π - q_{5} - α_{4}

. With angle

q_{5}

being known, we have a system of four equations, and four unknowns can be solved easily to give us the expressions of the required positions:

\{\begin{matrix} p_{f_{1}} = [\begin{matrix} 0 \\ 0 \end{matrix}]; p_{g_{1}} = p_{f_{1}} + L_{1} [\begin{matrix} - \sin (q_{5} + q_{2} + q_{1}) \\ - \cos (q_{5} + q_{2} + q_{1}) \end{matrix}]; p_{h} = p_{g_{1}} + L_{2} [\begin{matrix} - \sin (q_{5} + q_{2}) \\ - \cos (q_{5} + q_{2}) \end{matrix}] \\ p_{g_{2}} = p_{h} + L_{4} [\begin{matrix} \sin (q_{5} + q_{4}) \\ \cos (q_{5} + q_{4}) \end{matrix}]; p_{f_{2}} = p_{g_{2}} + L_{3} [\begin{matrix} \sin (q_{5} + q_{3} + q_{4}) \\ \cos (q_{5} + q_{3} + q_{4}) \end{matrix}] \end{matrix}\}

(15)

where

p_{f_{1}}

is the position vector of the first foot (first leg),

p_{f_{2}}

is the position vector of the second foot (second leg),

p_{g_{1}}

is the position vector of the first knee,

p_{g_{2}}

is the position vector of the second knee, and

p_{h}

is the position vector of the hip.

The center of mass of each arm can be calculated from the previous relation by

\{\begin{matrix} p_{c_{1}} = p_{f_{1}} + (s_{1} - L_{1}) [\begin{matrix} \sin (q_{5} + q_{2} + q_{1}) \\ \cos (q_{5} + q_{2} + q_{1}) \end{matrix}]; p_{c_{2}} = p_{g_{1}} + (s_{2} - L_{2}) [\begin{matrix} \sin (q_{5} + q_{2}) \\ \cos (q_{5} + q_{2}) \end{matrix}] \\ p_{c_{T}} = p_{h} + s_{5} [\begin{matrix} \sin (q_{5}) \\ \cos (q_{5}) \end{matrix}]; p_{c_{4}} = p_{h} + s_{4} [\begin{matrix} \sin (q_{5} + q_{4}) \\ \cos (q_{5} + q_{4}) \end{matrix}]; p_{c_{3}} = p_{g_{2}} + s_{3} [\begin{matrix} \sin (q_{5} + q_{4} + q_{3}) \\ \cos (q_{5} + q_{4} + q_{3}) \end{matrix}] \end{matrix}\}

(16)

The total

C G

of the robot is:

p_{g} = [m_{1} p_{c_{1}} + m_{2} p_{c_{2}} + m_{3} p_{c_{T}} + m_{4} p_{c_{4}} + m_{5} p_{c_{3}}] / m

, with

m = m_{1} + \dots + m_{5}

. The coordinates of the feet and

C G

in the absolute frame are

\{\begin{matrix} P_{g} = {[q_{6}, q_{7}]}^{T}; \\ P_{f_{1}} = P_{g} - p_{g}; \\ P_{f_{2}} = P_{f_{1}} + p_{f_{2}}; \end{matrix} \begin{matrix} P_{c_{1}} = P_{f_{1}} + p_{c_{1}}; \\ P_{c_{2}} = P_{f_{1}} + p_{c_{2}}; \\ P_{c_{T}} = P_{f_{1}} + p_{c_{T}}; \end{matrix} \begin{matrix} P_{c_{3}} = P_{f_{1}} + p_{c_{3}}; \\ P_{c_{4}} = P_{f_{1}} + p_{c_{4}} . \end{matrix}\}

(17)

The potential energy is: E_p = g[0 1]

[m_{1} P_{c_{1}} + m_{2} P_{c_{2}} + m_{5} P_{c_{T}} + m_{4} P_{c_{4}} + m_{3} P_{c_{3}}]

. The absolute joint velocities are as follows:

ω_{a_{1}} = {\dot{q}}_{5} + {\dot{q}}_{2} + {\dot{q}}_{1}

;

ω_{a_{2}} = {\dot{q}}_{5} + {\dot{q}}_{2}

;

ω_{a_{3}} = {\dot{q}}_{5} + {\dot{q}}_{4} + {\dot{q}}_{3}

;

ω_{a_{4}} = {\dot{q}}_{5} + {\dot{q}}_{4}

and

ω_{a_{T}} = {\dot{q}}_{5}

. The linear velocity of the center mass of each body is given by

v_{c_{1}} = \frac{\partial P_{c_{1}}}{\partial q} \dot{q}; v_{c_{2}} = \frac{\partial P_{c_{2}}}{\partial q} \dot{q}; v_{c_{3}} = \frac{\partial P_{c_{3}}}{\partial q} \dot{q}; v_{c_{4}} = \frac{\partial P_{c_{4}}}{\partial q} \dot{q}; a n d v_{c_{T}} = \frac{\partial P_{c_{T}}}{\partial q} \dot{q}

(18)

The kinetic energies of each body of the RABBIT robot are

E_{k j} = \frac{1}{2} [m_{j} v_{c_{j}}^{T} v_{c_{j}} + I_{j} ω_{a_{j}}^{2} + I_{A_{j}} {\dot{q}}_{j}^{2}]; f o r j = 1 \dots 4 a n d E_{k T} = \frac{1}{2} [m_{5} v_{c_{T}}^{T} v_{c_{T}} + I_{5} ω_{a_{T}}^{2}]

(19)

Thus, we have

E_{k} = E_{k 1} + E_{k 2} + E_{k T} + E_{k 3} + E_{k 4}

, with

I_{j} = I_{j 0} - m_{j} s_{j}^{2}

and

I_{5} = I_{50}

. The matrices of the 7 DOF RABBIT robot model during single support were computed using MATLAB’s (R2023) symbolic toolbox (see Appendix A). Figure 8 clearly shows the detailed modeling flowchart, from defining symbolic variables to deriving the system matrices via the

L a g r a n g i a n

.

2.4. Optimal Trajectories Generation

Generally, in robotics, the reference trajectory is a motion written as a function of time for different link positions and velocities. This is valid for all types of fully actuated robots, but in our case, the five degrees of freedom is not fully motorized during the single-support phase. The four angles

q_{1} \dots q_{4}

are motorized, but the rotation about the standing leg is free. For such cases, many solutions have been proposed in the literatures. The robot will track polynomial reference trajectories written in terms of curvilinear abscissa

s \in [0 1]

, which is

q^{d} (t) = q^{d} (s (t)), {\dot{q}}^{d} (t) = \frac{d q^{d} (s)}{d s} \dot{s} a n d {\ddot{q}}^{d} (t) = \frac{d^{2} q^{d} (s)}{{d s}^{2}} {\dot{s}}^{2} + \frac{d q^{d} (s)}{d s} \ddot{s}

(20)

Here, $q^{d} (0)$ = initial configuration, and $q^{d} (1)$ = final configuration.
For the first single support, the two legs change their roles from one step to another $q^{d} (0) = E q^{d} (1)$ , where the matrix $E = [e_{3} e_{4} e_{1} e_{2} e_{5}] \in R^{5 \times 5}$ is the permutation matrix. The trajectory in polynomial form is $q^{d} (s) = α_{0} + α_{1} s + α_{2} s^{2} + α_{3} s^{3} + α_{4} s^{4}$ , and ${\dot{q}}^{d} (s) = (α_{1} + 2 α_{2} + 3 α_{3} s^{2} + 4 α_{4} s^{3}) \dot{s} (t)$ . The optimal values of $q^{d} (s)$ , ${\dot{q}}^{d} (s)$ are given in [21]. Figure 9 shows how joint motion differs from the path.

The coefficients can be obtained by (according to [25])

[\begin{matrix} α_{0} \\ \begin{matrix} α_{1} \\ α_{2} \\ α_{3} \end{matrix} \\ α_{4} \end{matrix}] = 16 {[\begin{matrix} 16 I_{5} \\ \begin{matrix} 16 I_{5} \\ 16 I_{5} \\ 0_{5} \end{matrix} \\ 0_{5} \end{matrix} \begin{matrix} 0_{5} \\ \begin{matrix} 4 I_{5} \\ 16 I_{5} \\ 16 I_{5} \end{matrix} \\ 16 I_{5} \end{matrix} \begin{matrix} 0_{5} \\ \begin{matrix} 4 I_{5} \\ I_{5} \\ 0_{5} \end{matrix} \\ 32 I_{5} \end{matrix} \begin{matrix} 0_{5} \\ \begin{matrix} 2 I_{5} \\ 16 I_{5} \\ 0_{5} \end{matrix} \\ 48 I_{5} \end{matrix} \begin{matrix} 0_{5} \\ \begin{matrix} I_{5} \\ 16 I_{5} \\ 0_{5} \end{matrix} \\ 64 I_{5} \end{matrix}]}^{- 1} [\begin{array}{l} {q^{d}|}_{s = 0} \\ \begin{array}{l} {q^{d}|}_{s = 0.5} \\ {q^{d}|}_{s = 1} \\ {d q^{d} / d s|}_{s = 0} \end{array} \\ {d q^{d} / d s|}_{s = 1} \end{array}]

(21)

3. Basics of Intelligent Adaptive Control and Neural Networks

The dynamic modeling of walking robots via Lagrange–Euler methods and its finite time convergent control is complex and error-prone due to underactuation, multi-link setups, and hybrid phases [50,51]. Neural networks (NNs) provide a data-driven alternative, with feedforward architectures approximating nonlinear dynamics. However, static networks cannot handle time-varying control, requiring dynamic NNs that capture system evolution. These are computationally intensive and prone to overfitting. Structure-aware algorithms help reduce the degrees of freedom. Parameter-linear models like RBF networks, despite their universal approximation capabilities, face the curse of dimensionality. Exploiting known system structures allows static, compact networks to efficiently approximate robot dynamics, offering better generalization, lower computational costs, and suitability for real-time control [52,53,54,55].

3.1. Neural Networks and Global Approximation Theory

Neural networks are composed of interconnected nodes (or neurons) linked by weighted connections, where the weights serve as trainable parameters. The specific arrangement of nodes and interconnections defines the network architecture, which varies across models and must be selected carefully based on the target application. As network capabilities differ by structure, choosing an appropriate architecture is essential for achieving optimal performance in a given control or modeling task.

Definition 1.

(Function Approximation): If

f (x) : R^{n} ⟶ R^{m}

is a continuous function in a compact set

Ω_{x}

and

y (W, x) : R^{s} \times R^{n} ⟶ R^{m}

is an approximating function that depends continuously on

W

and

x

, then the approximation problem is to determine the optimal parameter

W^{⋆}

for some metric (or distance function)

d

such that

d (y (W^{⋆}, x), f (x)) \leq ϵ

for an acceptable small e [53].

In function approximation, a neural network defines an estimator

y (W, x)

for an unknown function

f (x)

, where the weight

W

is adjusted to minimize output errors over a training dataset. This involves two core challenges: the representation problem, which concerns selecting a suitable function structure

y (W, x)

, and the learning problem, which focuses on optimizing

W

to best match the target function. A neural network is a parallel, distributed computational model composed of simple processing units capable of learning from experience and storing knowledge through training. It mimics the brain in two key ways: learning occurs through interaction with the environment, and acquired knowledge is encoded in the synaptic weights connecting the neurons. A neuron serves as the fundamental processing unit of a neural network. Its structure is illustrated in Figure 10, and it provides the foundation for constructing a wide range of neural network architectures.

Claim 1.

If

A \subset C (K, R^{n})

is an algebra of continuous vector-valued function

f

that separates points and contains constant functions, then

A

is dense in

C (K, R^{n})

on a compact domain

K

. Furthermore, neural nets (including RBFs) with sufficient width and appropriate activation

σ

are universal approximators of

f

under some conditions.

Moreover, in many practical scenarios, a two-layer neural network (single hidden layer) is often sufficient. Due to their fundamental universal approximation capability, such architectures are typically adequate for a wide range of control applications.

Theorem 1

(Stone–Weierstrass [55]). If

f : K \subset R^{m} \to R^{n}

is a continuous function on a compact set

K

of certain sub-algebras, then, for any

ε > 0

, there exists a continuous function

\hat{f} (x) = f u n c t i o n (W^{⊤}, V^{⊤}, σ, x, b)

such that

{‖f (x) - \hat{f} (x)‖}_{\infty} < ε

for all

x \in K

with:

$σ$ is a nonlinear function that acts element-wise on vectors;
$V \in R^{p \times m}$ , $b \in R^{p \times m}$ , and $W \in R^{n \times p}$ are the approximator parameters;
$p \in N$ is an index of approximation.

Theorem 1 asserts that certain sub-algebras of continuous functions—if they separate points and contain constants—are dense in the space of continuous functions. This gives a general framework for function approximation using algebras.

Theorem 2

(Cybenko [55]). For any continuous vector-valued function

f \in C (K, R^{n})

with a compact

K \subset R^{m}

, for any

ϵ > 0

, there exists a neural network of the form

\hat{f} (x) = ⅀_{j = 1}^{N} w_{j} σ_{j} (a_{j}^{⊤} x + b_{1 j}) + b_{2 j},

where

w_{j} \in R^{n}

,

a_{j} \in R^{m}

, and

b_{1 j}, b_{2 j} \in R

such that

\max_{x \in K} ‖\hat{f} (x) - f (x)‖ < ε

. We know that

σ_{j} : R ⟶ R

is a non-constant, bounded, and continuous activation function. In matrix form,

\hat{f} (x) = W^{⊤} σ (A^{⊤} x + b_{1}) + b_{2}

, where

σ (x) = {[σ_{1} (x), \dots, σ_{m} (x)]}^{⊤} \in R^{m}

such that

\max_{x \in K} ‖\hat{f} (x) - f (x)‖ < ε

for all

x \in K .

Cybenko’s theorem (2) is a result from neural network theory. It provides constructive proof that single-hidden-layer feedforward neural networks with a suitable nonlinear activation function can approximate any continuous function. The basic structure of the multi-layer perceptron (MLP) network (see Figure 11) is very flexible and can be employed in a wide variety of modeling and control tasks.

Despite their universal approximation power, MLPs suffer from slow, non-convergent training, complex architecture tuning, nonlinear parameterization, and forgetting. These issues are addressed using a single-layer network (usually RBFs) or adaptive modular networks with local learning, offering faster, simpler, and more stable performance [56,57,58,59,60]. Figure 12 shows the functional diagram of this learning process.

Commonly used activation functions

σ (.)

include sigmoid functions, hyperbolic tangent, and radial basis functions, among others. In recent years, various types of neural network architectures have been developed to suit different application domains, particularly in control systems. The most widely adopted structures are as follows:

• Fuzzy neural networks.
• Polynomial basis function network.
• Gaussian RBF networks.

• Radial basis function networks.
• Wavelet neural networks.
• General form neural networks.

A three-layer feedforward neural network can be defined by specifying the input vector

x \in R^{n}

, the output vector

y \in R^{m}

, and the hidden layer activation vector

α \in R^{h}

. The interconnection weights between the input and hidden layers are denoted by

v_{i j}

, and those between the hidden and output layers are denoted by

w_{j k}

. The inputs to each activation function and the overall network output are determined by these weighted connections and the following chosen activation functions:

z (x) = V^{⊤} x; a n d y (z) = W^{⊤} σ (z) = W^{⊤} σ (V^{⊤} x)

, where

W^{⊤} = {[w_{j k}]}^{⊤}

,

V^{⊤} = {[v_{i j}]}^{⊤}

. A general multivariable function

f (x) \in R^{m}

can be approximated by a neural network in the form

f (x) = W^{⊤} σ (V^{⊤} x) + ϵ (x)

, representing the approximation error. If there exist matrices

W

and

V

of appropriate dimensions such that

ϵ (x) = 0

, the function

f (x)

is said to lie within the functional range of the neural network. It is well established—based on the Stone–Weierstrass theorem—that any sufficiently smooth function can be approximated over a compact domain to arbitrary accuracy by increasing the network’s size and choosing a suitable activation function

σ (.)

.

Let

f (x) \in R^{m}

represent an unknown nonlinear function (e.g., modeling error or disturbance), where

x \in R^{n}

is the system’s state. The universal approximation theorem states that, for any continuous function

f (x)

, there exists a feedforward neural network (three-layer feedforward NN) of the form

f (x) \approx \hat{f} (x) = ⅀_{i = 1}^{p} W_{i} σ_{i} (z)

, with

z = A x + b

, or we omit the intermediary variable

z

as

f (x) = W^{⊤} σ (A x + b)

or

f (x) = {[\begin{matrix} W_{11} \\ ⋮ \\ W_{p 1} \end{matrix} \begin{matrix} \dots \\ ⋱ \\ \dots \end{matrix} \begin{matrix} W_{1 m} \\ ⋮ \\ W_{p m} \end{matrix}]}^{⊤} [\begin{matrix} σ_{1} (x) \\ ⋮ \\ σ_{1} (x) \end{matrix}]; I f σ i s a s i g m o i d s t h e n f (x) = \frac{W^{⊤}}{1 + e^{- (A x + b)}} = W^{⊤} {[\frac{1}{1 + e^{- (a_{i}^{⊤} x + b_{i})}}]}_{i = 1}^{p}

$σ (x) = {[σ_{1} (x), \dots, σ_{p} (x)]}^{⊤} \in R^{p}$ is the activation vector.
$W \in R^{p \times m}$ is the output weight matrix.
$σ_{i} (x)$ denotes nonlinear sigmoid activation functions (e.g., tanh, ReLUs, etc.).
$A \in R^{p \times n}$ is the stacked weight matrix ( $i^{t h}$ row is $a_{i}^{⊤}$ ); $b \in R^{p}$ denotes biases.
The exponential and division are element-wise.

The coefficients

a_{i}^{⊤}

and

b_{i}

are the hidden-layer parameters (

a_{i}^{⊤} \in R^{n}

input-to-hidden-weight “nonlinear layer”, and

b_{i} \in R

denotes the bias of neuron

i

).

3.2. Radial Basis Function Neural Networks and Training

Radial basis function neural networks (RBFNNs) consist of three layers: an input layer, a hidden layer of nonlinear units, and a linear output layer. Each hidden unit computes a localized activation—typically a Gaussian—based on the Euclidean distance between the input and its associated center. The output is a linear combination of these activations [58,59]. RBFNNs are well-suited for online nonlinear adaptive modeling and control due to their linear parameterization with respect to output weights (enabling efficient online adaptation), localized activations (ensuring spatially local learning), and fast initial convergence—making them ideal for real-time applications. When the input weight matrix is

V = I

, the feedforward neural network becomes an RBFNN, with output expressed as

f (x) = W^{⋆ ⊤} μ (x) + ϵ (x)

, where

ϵ (x)

is a bounded approximation error and

W^{⋆}

is an ideal weight vector. In RBFNNs,

μ (x) = {[μ_{1} (x), \dots, μ_{p} (x)]}^{⊤} \in R^{p}

are a radial basis functions, and they are typically Gaussian, with

μ_{i} (x) = \exp (- {‖x - c_{i}‖}^{2} / σ_{i}^{2})

and

$c_{i} \in R^{n}$ is the center of the $i^{t h}$ basis function, $i = 1, \dots, m$ ;
$μ_{i} (x) > 0$ is the width (spread), and it measures the similarity between $x$ and $c_{i}$ ;
$W^{⋆} = \arg \min_{W \in R^{p \times n}} \{\underset{x \in Ω_{x}}{s u p} |f (x) - W^{⊤} σ (x)|\}$ .

So, the RBFNN becomes

f (x) = ⅀_{i = 1}^{p} w_{i} μ_{i} (x) = W^{⊤} μ (x)

. In order to train the RBFNN, we should minimize the objective function:

J = {‖e_{n}‖}^{2} = {‖{\hat{y}}_{n} - y_{n}‖}^{2}

. To adapt the center

c_{j}

, perform

\partial J / \partial c_{j} = - e_{n}^{⊤} w_{j} . \partial μ_{j} (x_{n}) / c_{j} = - e_{n}^{⊤} w_{j} . μ_{j} (x_{n}) . (x_{n} - c_{j}) / σ_{j}^{2}

. Again, for

σ_{j}

, we have

\partial J / \partial σ_{j} = - e_{n}^{⊤} w_{j} . \partial μ_{j} (x_{n}) / \partial σ_{j} = - e_{n}^{⊤} w_{j} . μ_{j} (x_{n}) . {‖x_{n} - c_{j}‖}^{2} / σ_{j}^{3}

. The formal description of the training is given by Algorithm 1.

Algorithm 1: RBF Neural Network Training via Gradient Descent

3.3. Adaptive Neural Network Control

Neural network-based adaptive control has emerged as a robust alternative to traditional model reference adaptive control (MRAC), especially for systems with high uncertainty and nonlinearities. Among neural models, radial basis function neural networks stand out for their universal approximation, rapid convergence, and structural simplicity (see Figure 13). Unlike MRAC, which adapts parameters via a reference model, RBFNNs use data-driven learning to directly approximate unknown dynamics, enhancing flexibility and real-time robustness. This contrast forms a foundation for advancing intelligent adaptive control in complex environments [26].

We employ a radial basis function neural network to approximate the unknown nonlinear functions and compensate for unmodeled dynamics and external disturbances in the bipedal locomotion control model. The system dynamics are expressed as

M (q) \ddot{q} + C (q, \dot{q}) \dot{q} + G (q) = F + ∆ (q, \dot{q}, t)

. Here,

∆ (.)

represents the lumped model uncertainties and external disturbances, which are approximated by the neural network

∆ (x) = W^{⊤} μ (x) + ϵ (x)

, where

ϵ (x)

is the bounded error:

‖ϵ (x)‖ \leq \bar{ϵ}

and

$x = {[q^{⊤}, {\dot{q}}^{⊤}]}^{⊤} \in R^{2 n}$ is the input vector (joint positions and velocities);
$μ (x) \in R^{m}$ is the RBF vector, defined by $μ_{i} (x) = \exp (- {‖x - c_{i}‖}^{2} / σ_{i}^{2})$ , $i = 1, \dots, m$ ;
$W \in R^{m \times n}$ is the ideal weight matrix.

The adaptive update law is derived using Lyapunov theory by

\begin{matrix} \underset{\hat{W}}{•} \end{matrix} = - Γ_{W} ϕ (x) e^{⊤}

, where

\hat{W}

is the online estimated weight matrix,

Γ_{W} > 0

is the adaptation gain matrix, and

e (t) = q (t) - q_{d} (t)

is the velocity tracking error. To cancel uncertainty, the control input is given as

F (t) = F_{0} (t) - W^{⊤} μ (x)

, where the nonlinear dynamic inversion control is

F_{0} (t) = M (q) a_{0} (t) + C (q, \dot{q}) \dot{q} + G (q) - W^{⊤} μ (q, \dot{q})

, with

α \in (0, 1)

being the fractional power and

a_{0} (t) = {\ddot{q}}_{d} - K_{1} e^{α} - K_{2} \dot{e}

.

K_{1} a n d K_{2}

are gain matrices.

Theorem 3.

Under the control law

F (t) = F_{0} (t) - W^{⊤} μ (x)

and adaptation mechanism, the tracking error

e (t) = q (t) - q_{d} (t)

converges to zero in finite time, and all signals in the closed-loop system remain bounded.

Proof.

Define the Lyapunov Candidate Function

V (t) = [{‖e (t)‖}^{2} + T r (e_{W}^{⊤} P^{- 1} e_{W})] / 2

, with

e_{W} = W - W^{⋆}

. Taking the derivative and substituting the control law (with the Schwartz inequality) yields

\dot{V} (t) \leq - e^{⊤} K_{1} {|e (t)|}^{α / (2 - α)} - {\dot{e}}^{⊤} K_{2} {|\dot{e} (t)|}^{α} + ε_{M} . ‖\dot{e} (t)‖

with

0 < α < 1

and

‖∆ (x) - W^{⊤} μ (x)‖ \leq ε_{M}

. By choosing sufficiently large gains

K_{1} a n d K_{2}

, the right-hand side becomes

\dot{V} (t) < - c {[V (t)]}^{β}

. This differential inequality implies the finite-time convergence of

‖e (t)‖

to zero in the time bounded by

t_{f} \leq {[V (0)]}^{β - 1} / (1 - β) c

. □

4. Stability and Control of RABBIT Robot

In this section, we propose a control law for the RABBIT robot and analytically demonstrate its asymptotic stability over a compound walking cycle consisting of simple-e-support phases separated by instantaneous impacts. Optimal trajectories are generated for the underactuated case. While the goal of walking control is to keep the robot within its viability space, designing a strategy around this concept is challenging due to the need to explore all possible biped movements. We therefore adopt a more restrictive approach focused on cyclic walking, where each step repeats identically with alternating leg roles. This periodic motion is represented in phase space by closed curves—known as limit cycles—where velocity is plotted against position (see Slotine and Li, 1991 [49]).

4.1. Nonlinear Decoupling Control

During flat-foot contact, a biped robot behaves as a fully actuated mechanical structure, allowing the use of classical control methods like PD, PID, or sliding mode. This section focuses on the computed torque control (input/output linearization), which uses the system’s dynamic model. With minimal parameterization, the stance-phase dynamics are expressed as follows [3,4,5,6,7,8]:

A_{5 \times 5} (q) \ddot{q} + C_{5 \times 5} (q, \dot{q}) \dot{q} + G_{5 \times 1} (q) = {[\begin{matrix} Γ^{⊤} & 0 \end{matrix}]}^{⊤}

(22)

To track

q^{d} (t)

, acceleration must be

\ddot{q} = w^{d} = {\ddot{q}}^{d} + K_{v} ({\dot{q}}^{d} - \dot{q}) + K_{p} (q^{d} - q)

, so

{[Γ^{⊤} 0]}^{⊤} = [A_{5 \times 5} (q) w^{d} + C_{5 \times 5} (q, \dot{q}) \dot{q} + G_{5 \times 1} (q)]

(23)

In a detailed manner, we can summarize the nonlinear decoupling control law (NDC) in

\{\begin{array}{l} Γ = M_{4 \times 5} (q) w^{d} + E_{4} (C_{5 \times 5} (q, \dot{q}) \dot{q} + G_{5 \times 1} (q)) w h e r e \\ \begin{array}{l} \begin{array}{l} \begin{array}{l} \begin{array}{l} w^{d} = [\frac{d q^{d} (s)}{d s} \ddot{s} + ϑ] \\ ϑ = \frac{d^{2} q^{d} (s)}{{d s}^{2}} {\dot{s}}^{2} + K_{v} \dot{e} + K_{p} e \end{array} \\ e = (q^{d} - q); \dot{e} = ({\dot{q}}^{d} - \dot{q}) \end{array} \\ κ = N_{1 \times 5} (q) \frac{d q^{d} (s)}{d s} \end{array}| \begin{array}{l} \begin{array}{l} \begin{array}{l} A_{5 \times 5} (q) = [\begin{array}{l} M_{4 \times 5} (q) \\ N_{1 \times 5} (q) \end{array}] \\ \begin{array}{l} M_{4 \times 5} (q) = E_{4} A_{5 \times 5} (q) \\ N_{1 \times 5} (q) = e_{4} A_{5 \times 5} (q) \end{array} \\ E_{4} = [\begin{array}{l} I_{4 \times 4} & 0_{4 \times 1} \end{array}] \end{array} \\ e_{4} = [\begin{array}{l} 0_{1 \times 4} & 1 \end{array}] \end{array} \end{array} \\ \ddot{s} = - [N_{4 \times 5} (q) ϑ + e_{4} (C_{5 \times 5} (q, \dot{q}) \dot{q} + G_{5 \times 1} (q))] / κ \end{array} \end{array}\}

(24)

Chevallereau [19,29] defined the conditions for a joint path to yield stable, cyclic motion, with its attraction region bounded by angular momentum. Reaction force constraints (e.g., no take-off or slip) limit initial velocity. The proposed control law ensures convergence to a reference path in finite time, guaranteeing admissible motion.

4.2. Finite-Time Convergent Control

This type of control was proposed by Bhat and Bernstein in 1998 [28,29,50], and its basic idea is the choice of some gains such that the error will converges rapidly to zero. The high gains may directly affect motor torques, but they should be adjusted so that the maximum developed torques will never exceed 150 Nm. The acceleration of the robot has the form

\ddot{q} = w^{d}

, which is

\ddot{q} = {\ddot{q}}^{d} + \frac{1}{ε^{2}} ψ = \frac{d q^{d}}{d s} \ddot{s} + ϑ (s, \dot{s}, q, \dot{q}) w h e r e ϑ = \frac{d^{2} q^{d}}{{d s}^{2}} {\dot{s}}^{2} + \frac{1}{ε^{2}} ψ; ψ = {[ψ_{1} \dots ψ_{5}]}^{⊤}

(25)

and the component

ψ_{k}

is given by

\{\begin{matrix} ψ_{k} = sign (ε {\dot{e}}_{q k}) {|ε {\dot{e}}_{q k}|}^{v} + sign (ϕ_{k}) {|ϕ_{k}|}^{\frac{v}{2 - v}}, 0 < v < 1 \\ ϕ_{k} = e_{q k} + [\frac{{|ε {\dot{e}}_{q k}|}^{2 - v}}{2 - v}] sign (ε {\dot{e}}_{q k}) \end{matrix}\}

(26)

When we replace acceleration

\ddot{q} = w^{d}

in the 5 DOF model, we get the model

{[\begin{matrix} Γ^{⊤} & 0 \end{matrix}]}^{⊤} = [A_{5 \times 5} (q) w^{d} + C_{5 \times 5} (q, \dot{q}) \dot{q} + G_{5 \times 1} (q)]

, and this can be written as

[\begin{matrix} Γ \\ 0 \end{matrix}] = A_{5 \times 5} (q) [\frac{d q^{d}}{d s} \ddot{s} + ϑ] + C_{5 \times 5} (q, \dot{q}) \dot{q} + G_{5 \times 1} (q)

(27)

If we multiply both sides by

E_{4} = [\begin{matrix} I_{4 \times 4} & 0_{4 \times 1} \end{matrix}]

, we get

Γ = M_{4 \times 5} (q) w^{d} + C_{4 \times 5} (q, \dot{q}) \dot{q} + G_{4 \times 1} (q)

(28)

where

M_{4 \times 5} (q) = E_{4} A_{5 \times 5}; C_{4 \times 5} (q, \dot{q}) = E_{4} C_{5 \times 5} (q, \dot{q})

; and

G_{4 \times 1} (q) = E_{4} G_{5 \times 1} (q)

.

From the other side, if we multiply both sides by matrix

e_{4} = [\begin{matrix} 0_{1 \times 4} & 1 \end{matrix}]

, we get

N_{1 \times 5} (q) [\frac{d q^{d}}{d s} \ddot{s} + ϑ] + e_{4} (C_{5 \times 5} (q, \dot{q}) \dot{q} + G_{5 \times 1} (q)) = 0

(29)

which implies that (i.e., with

κ = N_{1 \times 5} (q) d q^{d} (s) / d s

)

\ddot{s} = - κ^{- 1} [N_{1 \times 5} (q) ϑ + e_{4} (C_{5 \times 5} (q, \dot{q}) \dot{q} + G_{5 \times 1} (q))]

(30)

Finally, we summarize the finite-time convergent control (FTCC) in

\{\begin{array}{l} Γ = M_{4 \times 5} (q) [\frac{d q^{d}}{d s} \ddot{s} + ϑ] + E_{4} (C_{5 \times 5} (q, \dot{q}) \dot{q} + G_{5 \times 1} (q)) \\ \begin{array}{l} \begin{array}{l} ϑ = [\frac{d^{2} q^{d}}{{d s}^{2}} {\dot{s}}^{2} + \frac{ψ}{ε^{2}}]; κ = N_{1 \times 5} (q) \frac{d q^{d} (s)}{d s} \\ \begin{array}{l} ψ = sign (ε \dot{e}) {|ε \dot{e}|}^{v} + sign (ϕ) {|ϕ|}^{\frac{v}{2 - v}} \\ ϕ = e + [\frac{{|ε \dot{e}|}^{2 - v}}{2 - v}] sign (ε \dot{e}) \end{array} 0 < v < 1 \end{array} \\ \ddot{s} = - [N_{4 \times 5} (q) ϑ + e_{4} (C_{5 \times 5} (q, \dot{q}) \dot{q} + G_{5 \times 1} (q))] / κ \end{array} \end{array}\}

(31)

Figure 14 illustrates the switching process between dynamic inversion and finite-time convergent control for a biped robot (the complete functional algorithm of this process is given in Appendix B). From the input state, joint variables and desired trajectories are computed. Based on the selected control type, the system derives the control law from the dynamic model to ensure accurate motion tracking.

Parameters

v

and

ε

are used to adjust the settling time of the controller. The key difference between the two control strategies lies in replacing

[K_{v} \dot{e} + K_{p} e]

with

ψ / ε^{2}

. The primary goal of walking robot control is to achieve cyclic movement, which is composed of several phases (such as the single-support phase, double-support phase, etc.) and impacts. Achieving this objective does not necessarily require the control to be stable during each independent phase, but rather, it demands convergence toward a limit cycle. In this context, the phase-plane representation of the nonlinear dynamic system serves as a valuable analytical tool. Starting from the initial conditions, the robot’s movement is traced in the phase plane, and the characteristics of the resulting curves are analyzed. Since robot walkers have more than two states, each joint’s movement is represented in its own phase plane and plotted according to the joint position. The robot’s movement corresponds to the succession of these differential phases (see Figure 15), and cyclic movement forms a closed curve in each phase plane.

If this closed cycle (see Figure 15) is isolated, it represents a limit cycle, which can be stable, unstable, or semi-stable. Movements near the limit cycle may converge toward it. Henri Poincaré developed a technique to analyze dynamic system stability by creating a hyper-surface of dimension

n - 1

, transversal to the limit cycle. The flow’s intersection with this hyper-surface leads to the Poincaré return map. The control law uses computed torque, a standard method in robotics, with a slight modification to ensure finite-time convergence to the desired path. The finite-time feedback control proposed in [17,18,19,20,21,22,23] and [31] is employed. Joint-tracking errors are defined relative to trajectories satisfying the desired path as follows:

e (t) = q^{d} (s (t)) - q (t)

,

\dot{e} (t) = (d q^{d} / d s) \dot{s} - \dot{q} (t)

. The desired behavior of the configuration variables in the closed loop is given by

\ddot{q} = {\ddot{q}}^{d} + ψ / ε^{2}

, where

ψ (q, \dot{q}, s, \dot{s})

from [25,28] is the term that ensures

\{q (t) - q^{d} (s (t))\} \to 0

in finite time. In fact, the settling time can be chosen to be shorter than the time duration of a step.

4.3. The Proposed Adaptive Finite-Time Convergent Control

The previous approach relies on the exact cancelation of system nonlinearities. However, uncertainties in parameter values, computational round-off errors, and the computational burden of modeling complex systems make exact cancelation impractical. To address this, it is often necessary to simplify the equations of motion by neglecting certain terms, enabling the faster computation of the control law. Thus, the adaptive nonlinear control law is more realistically expressed as a linear combination of two control signals:

Γ (t) = Γ_{i d e a l} (t) + Γ_{a d a p t i v e} (t)

, where

Γ_{a d a p t i v e} (t) = K_{ϕ} (t) μ (q)

and

Γ_{i d e a l} (t) = M_{4 \times 5} (q) w^{d} (t) + E_{4} (C_{5 \times 5} (q, \dot{q}) \dot{q} + G_{5 \times 1} (q))

. The vector

q (t)

represents the state of the controlled system, while

μ (q)

encapsulates the disturbance model features.

K_{ϕ} (t)

is an adaptive weight vector for the disturbance model. The controller calculates the error

e (t)

between the system’s state and the reference model’s state. This error is then used to adapt the value of

K_{ϕ} (t)

in real time. The adaptive controller estimates the model’s uncertainty dynamically and generates an adaptive control action

Γ_{a d a p t i v e} (t)

that cancels out the uncertainty, thereby restoring the nominal system for the baseline controller. The adaptive control term models system uncertainty as follows:

Γ_{a d a p t i v e} = K_{ϕ} μ (q)

, where

K_{ϕ} (t) = E_{4} {W^{⋆}}^{T} (t)

;

W (t)

contains the network weights adjusted by the controller.

μ (q)

is the feature vector of the uncertainty model. For nonlinear or unknown disturbance models, radial basis functions (RBFs) with Gaussian kernels are employed. In adaptive learning control, the RBF kernel is commonly used in kernel-based algorithms. Equation (32) defines the adaptive control law as the sum of an ideal component and an adaptive term that compensates for system uncertainties.

\{\begin{array}{l} Γ = M_{4 \times 5} (q) w^{d} + E_{4} (C_{5 \times 5} (q, \dot{q}) \dot{q} + G_{5 \times 1} (q) + {W^{⋆}}^{⊤} μ) \\ w^{d} = [\frac{d q^{d} (s)}{d s} \ddot{s} + ϑ (s, \dot{s}, q, \dot{q})] \\ ϑ = [\frac{d^{2} q^{d}}{{d s}^{2}} {\dot{s}}^{2} + \frac{ψ}{ε^{2}}]; κ = N_{1 \times 5} (q) \frac{d q^{d} (s)}{d s} \\ \begin{array}{l} ψ = sign (ε \dot{e}) {|ε \dot{e}|}^{v} + sign (ϕ) {|ϕ|}^{\frac{v}{2 - v}} \\ ϕ = e + [\frac{{|ε \dot{e}|}^{2 - v}}{2 - v}] sign (ε \dot{e}) \end{array} 0 < v < 1 \\ \ddot{s} = - [N_{4 \times 5} (q) ϑ + e_{4} (C_{5 \times 5} (q, \dot{q}) \dot{q} + G_{5 \times 1} (q))] / κ \\ \frac{d W^{⋆}}{d t} = P μ {[q^{d} (s) - q]}^{⊤}; μ = v e c t o r o f b a s i s f u n c t i o n s \end{array}\}

(32)

An adaptive radial basis function neural network is employed in the design of the regulator, which consists of two layers: the input-hidden layer and the output layer. The weight matrix

W

plays a crucial role here, as sensory feedback is integrated into it. The update learning rule for

W

is given by the following update rule:

\dot{W} (t) = P μ e^{⊤} (t)

, where

μ

(i.e.,

μ_{i} = \exp [- γ {‖x - x_{i}‖}^{2}]

) is the vector of radial basis functions, and

γ

is the learning rate. The matrix

P

is positive definite. The unknown variations of

Δ (t) = W^{⊤} μ

are approximated by a neural network (such as RBFNN or multilayer perceptron), where

W

denotes the matrix of unknown weights to be identified. This formulation can represent various feedforward and recurrent neural networks. Given the universal approximation property of neural networks, we have

\hat{Δ} (t) = {W^{⋆}}^{⊤} μ + ε

, where

W^{⋆}

is the optimal (unknown) weight, and

ε

represents the bounded approximation error (

‖ ε ‖ \leq ε_{M}

).

Condition of convergence and existence of cyclic motion: To establish convergence and the existence of cyclic motion in our planar biped, we examine the single-support phase using the selected coordinate system. The derivative

\partial E_{k} / \partial {\dot{q}}_{5}

represents the angular momentum

σ

of the biped about the stance leg tip

S_{i}

. This quantity is given by

σ = \partial E_{k} / \partial {\dot{q}}_{5} = N (q_{c}) \dot{q}

, where

N (q_{c})

corresponds to the fifth row of the matrix

A_{5} (q_{c}) + m {(\partial f_{i} (q) / \partial q)}^{⊤} (\partial f_{i} (q) / \partial q)

. Furthermore, the derivative of the potential energy with respect to

q_{5}

is

\partial E_{p} / \partial q_{5} = m g (x_{S_{i}} - x_{g})

. Accordingly, the fifth equation of the dynamic model during the single-support phase can be expressed in the following simple form:

\dot{σ} (t) = N (q_{c}) \ddot{q} (t) + {\dot{q}}^{⊤} (t) [\partial N (q_{c}) / \partial q] \dot{q} (t) = m g \{x_{S_{i}} (t) - x_{g} (t)\}

(33)

We define the desired configuration of the biped as

q^{d} (t) = q^{r} (s (t))

, where

q^{r} (s)

is a prescribed vector function parameterized by the scalar

s

. If

κ = N (q_{c}) [\partial q^{r} / \partial s] \neq 0

, then the proposed control law guarantees the convergence of

q (t)

to

q^{r} (s (t))

in finite time, which can be made shorter than single-step durations [19,20,50]. In the absence of initial errors, the trajectory

q^{r} (s (t))

is tracked exactly. The evolution of

s (t)

is derived from

\ddot{s} (t)

, provided that the initial conditions

s (0)

and

\dot{s} (0)

are known. We set

s (0) = 0

and select

\dot{s} (0)

to minimize the initial joint velocity error.

ϵ = {|\dot{q} (0) - {\dot{q}}^{r} (0)|}^{2} = {|\dot{q} (0) - [\partial q^{r} (s (0)) / \partial s] \dot{s} (0)|}^{2}

(34)

Thus,

\dot{s} (0)

is such that

\partial ϵ / \partial \dot{s} (0) = 0

. We obtain

\dot{s} (0) = {\dot{q}}^{T} (0) [\partial q^{r} (s (0)) / \partial s] / ({[\partial q^{r} (s (0)) / \partial s]}^{T} [\partial q^{r} (s (0)) / \partial s])

(35)

From the equation for

\ddot{s} (t)

, it follows that a singularity arises in the proposed control law if

κ = N (q_{c}) [\partial q^{r} / \partial s] = 0

. For the reference trajectory

q (s) = q^{d} (s)

, we define the function

f (s) = N (q^{r} (s)) [\partial q^{⋆} (s) / \partial s]

. Although

N

depends only on the first four components of

q^{r} (s)

, we retain notation

N (q^{r} (s))

for clarity. If

f (s)

remains sufficiently bounded away from zero along the reference path and the tracking error stays small, singularities are avoided.

The objective is to develop a control strategy that guarantees the stable periodic motion of the biped robot. The control input

\ddot{s} (t)

ensures finite-time convergence to a reference trajectory

q^{d} (t) = q^{r} (s (t))

, with the convergence time chosen to be shorter than the one-step duration. Consequently, from the second step onward, the biped accurately tracks the reference path. In this framework, the 5 DOF biped model is reduced to a 1 DOF model in terms of

s (t)

, based on the prescribed reference. This reduced model resembles an inverted pendulum, and its analysis focuses on the existence and uniqueness conditions for admissible cyclic trajectories.

During the single-support phase, the biped behaves as an underactuated system, meaning that it cannot arbitrarily track a desired trajectory

q^{d} (t)

. A trajectory

q^{r} (s (t))

is termed an admissible reference motion if it satisfies the system’s dynamics. The analysis of angular momentum

σ

is sufficient to determine the evolution of the scalar parameter

s

, from which the full motion of the robot can be derived. Since

σ

is linear with respect to the velocity vector

\dot{q} (t)

and given that the velocity along the reference motion is proportional to

\dot{s}

, the angular momentum can be expressed as

σ = f (s) \dot{s}

. The scalar function

f (s)

depends on

q^{r} (s)

and the physical parameters of the biped. Assuming

f (s) \neq 0

over the interval

0 \leq s \leq 1

, the function is either strictly positive or strictly negative. In the remainder of this discussion, we assume

f (s) > 0

. It follows that

\dot{s} = σ / f (s)

. If the reference path

q^{r} (s)

is known, then the horizontal position

x_{g}

of the center of mass can also be expressed as a function of

s

, denoted as

x_{g} = x_{g} (s)

. In this case, the derivative of the angular momentum becomes

\dot{σ} = m g (x_{g} (s) - x_{S_{i}})

. Under a prescribed joint trajectory, the equations for

\dot{σ} (t)

and

\dot{s} (t)

together form a reduced-order model that captures the admissible dynamics. Given the initial values for

σ

and

s

, the functions

σ (t)

and

s (t)

can be uniquely determined. The second system, which is composed of two subsystems

\dot{s} (t) = σ / f (s); \dot{σ} (t) = m g (x_{g} (s) - x_{S_{i}})

is analogous to the classical equation of motion for a physical pendulum with a single degree of freedom [19,51]. As such, it admits an integral of motion similar to the energy integral of pendulum dynamics. Specifically, the system has conserved quantity

σ^{2} (t) - ϕ (s) = c n s t

, where

ϕ (s)

is a scalar potential function associated with the configuration-dependent gravitational term:

ϕ (s) = 2 m g \int_{k^{+}}^{s} (x_{g} (ξ) - x_{S_{i}}) f (ξ) d ξ; k d e n o t e t h e k^{t h} s t e p

(36)

Using the expression

σ = f (s) \dot{s}

, the integral of motion

σ^{2} (t) - ϕ (s) = c n s t

can be rewritten as

f^{2} (s) {\dot{s}}^{2} (s) - ϕ (s) = c o n s t a n t

or, equivalently,

f^{2} (s) {\dot{s}}^{2} (s) - f^{2} (k^{+}) {\dot{s}}^{2} (k^{+}) = ϕ (s)

. The functions

f (s)

and

ϕ (s)

can be computed once the reference trajectory

q^{r} (s)

is specified. These functions are periodic with period 1, so the robot’s behavior can be analyzed within the interval of

0 \leq s \leq 1

, corresponding to a single step. For human walking, the abscissa

x_{g}

of the center of mass increases over time. To mimic this behavior, we choose

q^{r} (s)

such that

x_{g}

increases with

s

on the interval

[0, 1]

. The single-support phase starts when

x_{g} < x_{S_{i}}

and ends when

x_{g} > x_{S_{i}}

. The potential function

ϕ (s)

reaches a negative minimum value

ϕ_{m} (s) = \min_{0 < < 1} ϕ (s) = ϕ (s_{g})

at

s = s_{g}

, where

x_{g} (s_{g}) = x_{S_{i}}

. After this point,

x_{g} > x_{S_{i}}

and

ϕ (s)

increase strictly monotonically. These results are summarized in the following theorem.

Theorem 4.

The path

q^{r} (s)

with

k < s < k + 1

can be achieved by the biped if and only if

\dot{σ} (k^{+}) > \sqrt{- ϕ_{m}}

or

\dot{s} (k^{+}) > \sqrt{- ϕ_{m}} / f (0^{+})

.

A cyclic admissible reference motion is defined by a periodic evolution of the angular momentum

σ

or, equivalently, by a periodic evolution of the scalar velocity

\dot{s}

, denoted as

{\dot{s}}_{c}

. All admissible reference motions satisfy the energy-like relation

σ^{2} (t) - ϕ (s) = c n s t

and are fully characterized by the definition of

ϕ (s)

. Therefore, a cyclic admissible reference motion exists

i f f

there exists an initial angular momentum

σ (k^{+})

such that

σ (k + 1^{+}) = σ (k^{+})

or, equivalently, if and only if there exists an initial velocity

\dot{s} (k)

, denoted as

{\dot{s}}_{c} (k)

, such that the time-increasing

s (t)

satisfies

\dot{s} (k + 1) = \dot{s} (k) = {\dot{s}}_{c} (k) = {\dot{s}}_{c} (0)

. Under these conditions, the biped’s states are identical at the beginning of steps

k

and

k + 1

(except for the role-swapping of the legs). Given the periodic nature of

f (s)

and

ϕ (s)

, the following must hold:

f^{2} (1^{-}) {\dot{s}}_{c}^{2} (0) - f^{2} (0^{+}) {\dot{s}}_{c}^{2} (0) = ϕ (1^{-})

. Analyzing this equation, we conclude the following necessary condition for the existence of a cyclic admissible reference motion:

If $f (0^{+}) = f (1^{-})$ and $ϕ (1^{-}) = 0$ , any initial value $\dot{s} (k) > {\dot{s}}_{m}$ yields cyclic ref-motion.
If $f (0^{+}) = f (1^{-})$ and $ϕ (1^{-}) \neq 0$ or if values $ϕ (1^{-})$ and $f^{2} (1^{-}) - f^{2} (0^{+})$ have different signs, then there is no cyclic reference motion.
The previous equation has a unique solution ${\dot{s}}_{c} (0) = {[ϕ (1^{-}) / (f^{2} (1^{-}) - f^{2} (0^{+}))]}^{1 / 2}$ if and only if values $ϕ (1^{-})$ and $f^{2} (1^{-}) - f^{2} (0^{+})$ have the same sign.

Using equation of

{\dot{s}}_{c} (0)

, the next theorem can be formulated.

Theorem 5.

A unique cyclic reference motion exists if and only if the following is satisfied:

[ϕ (1^{-}) / (f^{2} (1^{-}) - f^{2} (0^{+}))] + [ϕ_{m} / f^{2} (0^{+})] > 0

. The initial cyclic velocity for one step is defined by equation

{\dot{s}}_{c} (0) = {[ϕ (1^{-}) / (f^{2} (1^{-}) - f^{2} (0^{+}))]}^{1 / 2}

.

We assume the existence of a unique cyclic admissible reference motion and that the initial velocity

\dot{s}

is sufficiently large to ensure a monotonic evolution of the parameter

s (t)

. We define the relative velocity error, or “velocity diff”, between the actual velocity

\dot{s} (s)

and the nominal cyclic velocity

{\dot{s}}_{c} (s)

by the expression

e (s) = (\dot{s} (s) - {\dot{s}}_{c} (s)) / {\dot{s}}_{c} (s)

. The biped motion converges to the cyclic reference if and only if

e (s)

converges to zero, i.e., if the actual velocity

\dot{s} (s)

converges to the nominal cyclic velocity

{\dot{s}}_{c} (s)

. Using the energy relation derived previously,

e (s)

can be explicitly expressed as a function of the initial error

e (k)

for

k < s < k + 1

, yielding the following equation:

e (s) = {[1 + \frac{e (k) (e (k) + 2) f^{2} (0^{+}) {\dot{s}}_{c}^{2} (0)}{(f^{2} (0^{+}) {\dot{s}}_{c}^{2} (0) + ϕ (s))}]}^{1 / 2} - 1

(37)

The expression of the error function

e (s)

includes a square root and is well-defined only for positive arguments. This imposes the following condition on the initial error

e (k)

:

e (k) > \sqrt{ϕ_{m}} / (f (0^{+}) {\dot{s}}_{c} (0))

. This inequality is equivalent to a lower bound on the initial velocity:

\dot{s} (k^{+}) > \sqrt{- ϕ_{m}} / f (0^{+})

. This condition guarantees that the argument under the square root in

e (s)

remains strictly positive, allowing a valid evaluation of the velocity error over the interval. The evolution of the velocity difference

e (s)

over a single step can be directly deduced from the evolution of the function

ϕ (s)

. While the function

ϕ (s)

is cyclic, it is not continuous at the step boundary

s = k

. Therefore, Equation (37) for

e (s)

is only valid within a single-step interval

k < s < k + 1

. However, the velocity difference

e (s)

itself remains continuous at

s = k

because both

\dot{s} (s)

and

{\dot{s}}_{c} (s)

are continuous at that point. This allows a consistent evaluation of

e (k + 1)

based on

e (k)

. Using the earlier relation,

f^{2} (1^{-}) {\dot{s}}_{c}^{2} (0) - f^{2} (0^{+}) {\dot{s}}_{c}^{2} (0) = ϕ (1^{-})

, we derive an iterative formula for the evolution of

e (k)

from one step to the next:

e (k + 1) = {[1 + e (k) (e (k) + 2) {(\frac{f (0^{+})}{f (1^{-})})}^{2}]}^{1 / 2} - 1

(38)

Theorem 6.

(Condition of Convergence): The admissible reference motion converges towards the cyclic admissible reference motion if and only if

\dot{s} (0) > \sqrt{- ϕ_{m}} / f (0^{+})

and

f (0^{+}) > f (1^{-})

(or equivalently

ϕ (1^{-}) > 0

).

Proof.

With

k < s < k + 1

, if

k \to \infty

, then error function

e (s) \to 0

is uniform for any

s

if and only if

e (k) \to 0

when

k \to \infty

because

e (s)

is defined by the equation of

e (s)

, and the function

[f^{2} (0^{+}) {\dot{s}}_{c}^{2} (0) / (f^{2} (0^{+}) {\dot{s}}_{c}^{2} (0) + ϕ (s))]

is cyclic and bounded. Thus, to prove that biped motion converges to the cyclic admissible reference motion, it is necessary and sufficient to prove the convergence of

e (s)

towards 0 when

k \to \infty

. If

f (0^{+}) < f (1^{-})

, then using the equation of error

e (k + 1)

and inequality

f (s) > 0

, we can deduce that

e (k + 1) \leq [f (0^{+}) / f (1^{-})] . |e (k)|

, and we can conclude that

e (k) \to 0

when

k \to \infty

. It follows from the equation of

e (k + 1)

that if

f (0^{+}) = f (1^{-})

, then

e (k + 1) = |e (k)|

. If

f (0^{+}) > f (1^{-})

, then

e (k + 1) \sqrt [f (0^{+}) / f (1^{-})] . |e (k)|

, and there is no convergence. The condition

\dot{s} (0) > \sqrt{- ϕ_{m}} / f (0^{+})

ensures that

s (t)

is an increasing function during the first step. If

f (0^{+}) < f (1^{-})

, the condition

\dot{s} (k) > \sqrt{- ϕ_{m}} / f (0^{+})

will be satisfied for all

k

, and the function

s (t)

increases for all steps. □

The convergence of the admissible reference motion can also be shown using a section of the Poincare map as in [17,19,25]. The equation of

e (k + 1)

easily allows the use of

e (k + 1)

as a function of

e (k)

. Combining Theorems 4–6, the next corollary can be deduced.

Corollary 1.

The admissible reference cyclic motion is orbitally asymptotically stable if and only if the reference joint path is such that

f (0^{+}) < f (1^{-}), ϕ (1^{-}) f^{2} (0^{+}) + ϕ_{m} (f^{2} (1^{-}) - f^{2} (0^{+})) > 0

(39)

According to the above theorems and this corollary, the conditions for the existence and convergence of cyclic motion are defined by inequalities. Therefore, the proposed control strategy inherently includes robustness. Despite tracking or modeling errors, the robot’s behavior will converge to cyclic motion provided that the reference path is suitable (i.e., it satisfies the inequalities with some margin). In the presence of modeling errors, the resulting cycle may deviate slightly from the predicted cycle, but stable walking will still be achieved, as demonstrated in the experimental results.

Discrete implementation of the proposed algorithm: Although the control algorithm is formulated in continuous time, its real-world implementation requires discretization due to the sampled nature of digital platforms like dSPACE DS1103, which features a 400 MHz PowerPC 604e DSP. The algorithm was executed with a fixed sampling period of Δt = 1.5 ms (667 Hz). Using a zero-order hold (ZOH), control inputs are assumed constant between samples, and all states are updated at each discrete time step:

Sampling and Discretization Framework: The control algorithm was implemented on a real-time platform with a fixed sampling rate of $Δ t = 1.5 m s (f r e q = 667 H z)$ . To discretize the continuous-time controller, we adopted zero-order hold (ZOH) assumptions, which consider control inputs constant between samples. The state updates are computed at each sampling instant.
Discretization of the Control Law: The control law includes the following:
- An FTC correction term $Γ_{F T C} (t)$ (including the dynamic inversion);
- An adaptive RBF neural network compensation term $Γ_{N N} (t)$

Each term is computed at sampling

t_{k} = k . ∆ t

as

Γ [k] = Γ_{F T C} [k] + E_{4} ⋆ Γ_{N N} [k]

, where the following is the case:

$Γ_{F T C} [k]$ includes fractional or high-gain error terms computed via FTCC logic;
$Γ_{N N} [k] = {\hat{W}}^{⊤} [k] μ (q [k], \dot{q} [k])$ .

Care is taken to handle potential high-gain effects to avoid numerical instability.

3.

Numerical Differentiation: Velocity

\dot{q}

and acceleration

\ddot{q}

are approximated by

\dot{q} [k] \approx (q [k] - q [k - 1]) / Δ t

and

\ddot{q} [k] \approx (\dot{q} [k] - \dot{q} [k - 1]) / Δ t

. Given the sensitivity of numerical differentiation to noise, digital filters (e.g., low-pass filters, Kalman filter, or Savitzky–Golay filter) are employed to smooth the signals, ensuring accurate derivative estimation without amplifying measurement noise.

4.

Neural Network Online Update: The RBF neural network is trained online using discrete adaptation:

\hat{W} [k + 1] = \hat{W} [k] + Δ t P μ [k] e^{⊤} [k]

. This ensures real-time learning while keeping the update rate within the system’s computational limits.

5.

Real-Time Execution: The full control loop is executed at each sampling instant on the dSPACE DS1103 platform. The timing constraints (

1.5 m s

were met through:

Efficient MATLAB/Simulink code generation;
Pre-computation and lookup tables for RBF activations;
Optimization of matrix operations and control scheduling.

6.

Validation and Testing: Discrete implementation was tested via the following:

Hardware-in-the-loop (HIL) simulation;
Real-time experiments under various scenarios.

The scenarios included actuator limits, external disturbances, inertia variations, and sensor noise. Feedback was provided by high-resolution encoders (C4 and CHM 506) fil-tered for accuracy. Robustness to mechanical imperfections and terrain variation was handled through adaptive compensation/safety mechanisms (e.g., emergency stops).

7.

Performance Preservation: The discretized control law successfully preserved key properties of the original continuous-time formulation, including the following:

FTC convergence through gain-tuned discretization framework;
Robustness to uncertainties maintained via filtering and adaptive NN updates;
Bounded control effort and tracking ensured through careful discretization and real-time computation strategies.

The control loop ran reliably at a 1.5 ms cycle despite hardware constraints like backlash, compliance, and actuator saturation. The discretized control law preserved finite-time convergence, robustness through filtering and adaptation, and bounded control efforts, enabling stable and accurate bipedal locomotion in real-world tests.

Presented below (in Algorithm 2) is a formal pseudocode of the complete implemented control algorithm.

Algorithm 2: The complete algorithm of the implemented discrete control law

5. Experimental Results of the RABBIT Robot

To implement the proposed control algorithm, the dSPACE DS1103 system was employed as the real-time control platform. This system enables the automatic generation and cross-compilation of SIMULINK diagrams into run-time software for its 400 MHz PowerPC 604e DSP, allowing controller development using a high-level language. The DS1103 also integrates essential functionalities—including low-level computations, digital-to-analog and analog-to-digital conversions, and a user interface—within a single package. This eliminates the need for low-level I/O programming and significantly streamlines the debugging process. Control computations were executed with a sampling period of 1.5 ms (667 Hz). The used components are given in Table 2.

Modeling errors are an inherent challenge in any control strategy. Given the complexity of the RABBIT mechanism (Figure 16), various factors contribute to discrepancies between the idealized dynamics and the actual system, leading to modeling errors:

Friction in the motor–belt–gear assemblies and at the tower’s universal joint;
Unmodeled flex dynamics due to cabling, gear torsion, and deformation;
Inaccuracies due to poor estimates of link inertias + dSPACE + power electronics;
Digital implementation limitations (e.g., sampling, quantization);
Non-rigid impacts due to compliance at the end of the leg, etc.

In addition to control design and modeling, the robot’s mechanical structure contributes to robustness. Several imperfections and structural tolerances, though not explicitly included in the model, play a practical role in ensuring stable locomotion:

Joint Backlash Tolerance: ±0.8° due to slack and gear tolerances.
Linkage Compliance: Up to 1.2 mm deflection under peak load.
Assembly Misalignments: Errors up to ±0.5 mm between joint axes.
Flexibility: ~2–3 mm compliance in the foot–ground interface, damping vibrations.
Unmodeled Dynamics: 4–6% damping from residual friction stabilizes oscillations.

These mechanical flexibilities, although not accounted for in the

L a g r a n g i a n

model, act as passive stabilizers. They absorb impact shocks, reduce vibration transmission, and help maintain the integrity of the gait cycle under experimental conditions.

The RABBIT robot is the result of a collaborative effort among seven French laboratories (IRCCYN Nantes, LAG Grenoble, LMS Poitiers, LVR Bourges, LGIPM Metz, LRV Versailles, and LIRMM Montpellier), developed under the CNRS ROBEA project focused on the control of bipedal robots for walking and running. As illustrated in Figure 16, the robot consists of two legs and a trunk but lacks feet. Its joints are positioned at the hips and knees, and it is equipped with four actuators—one at each knee and one at each hip. The physical properties of RABBIT, including limb masses and lengths, are detailed in Table 1. Movement is constrained to the sagittal plane through a radial bar linked to a central column, which guides the robot’s forward motion along a circular path. RABBIT represents a minimal bipedal system capable of producing both walking and running gaits. Below is an overall scheme for the controlled RABBIT robot simulator (Figure 17).

To provide clarity and ensure the reproducibility of this experimental study, we list the fourteen functions utilized throughout the implementation in the following table.

•The 1 function: Main Program	The primary script coordinating the simulation.
•The 2 function: robot_parametres	Defines the physical parameters of the robot.
•The 3 function: [Out1] = dm7dof (Entree1)	Computes the dynamic equations of the 7 DOF robot.
•The 4 function: [Out] = reaction_forcesde (R1)	Calculates the ground reaction forces.
•The 5 function: [qp,Ir] = impact (qf,qpf)	Models the impact phase/computes post-impact velocities.
•The 6 function: [A,Ac1,Ac2] = dynam_impact (qf)	Computes dynamic matrices related to the impact.
•The 7 function: [qd_s,dqd_s,ddqd_s] = Traj (s)	Generates the desired trajectory.
•The 8 function: stick_inf (x)	Displays or manages the stick-figure representation.
•The 9 function: [Out] = pos_swing_leg (Entree)	Computes the position of the swing leg.
•The 10 function: [Out] = discrete_control _law (Entree)	Implements the discrete-time control law.
•The 11 function: [A,C,G,B,Ac1,hxs] = dyn_7dof_rabbit (q,dq)	Returns the full dynamic model of the 7 DOF RABBIT.
•The 12 function: [A,C,G,B] = dynam_5dof_rabbit (q,dq)	Provides the reduced 5 DOF dynamic model.
•The 13 function: [dqp,F] = post_impact_dynamics (q,dqm)	Calculates post-impact dynamics.
•The 14 function: dess (tout,ERR,ERRP,qp,qpp,R,pos_p,Gama)	Handles the graphical visualization of the results.

Challenges in early model development and solutions: During development, challenges included modeling complex nonlinear dynamics, ensuring controller stability under uncertainties, and managing hardware limitations and disturbances. These issues were addressed through advanced simulation, adaptive and robust control, noise filtering, phased testing, and safety protocols. The proposed control system utilizes high-performance hardware, notably a dSPACE DS1103 controller with a 400 MHz PowerPC 604e DSP capable of executing complex algorithms—including neural updates and filtering—within a 1.5 ms sampling period. Real-time performance is maintained via optimized codes, pre-computation, and efficient matrix operations. The system employs high-torque DC motors (RS420J) driven by RS420 RTS10/20-60 current drives, which enable precise control but are limited by thermal and current constraints that risk saturation under high loads. Motion transmission through HFUS-2UH gear reducers (1/50) introduces compliance, backlash, and delay, affecting accuracy. Position feedback from C4 incremental encoders (250 counts/rev) and CHM 506 P426R absolute encoders (8192 counts/rev) provides high resolutions, necessitating filtering methods like Kalman filters to mitigate noise. Mechanical imperfections and backlash induce nonlinearities that require robust control strategies, while external disturbances and terrain variability challenge stability and are managed adaptively. Safety measures include soft limits, emergency stops, and fault detection. The hardware’s operational limits highlight the importance of optimized and adaptive control to ensure reliable bipedal locomotion. Table 3 summarizes these issues and their corresponding solutions.

First experimental scenario: The theoretical results previously established were evaluated using a closed-loop test of the locomotion system. The experiment (Figure 18) involved implementing the dynamic control law with parameters

ε = 0.1 a n d ν = 0.8

. The system was initiated on a cyclic trajectory under ideal conditions without introducing model errors. The results demonstrate that the proposed control law effectively drives the robot’s states to accurately track the reference trajectory.

Figure 18 illustrates the robot’s behavior during the first five steps under the proposed control law, showing a transition from the initial state to a stable, symmetric gait with constant forward velocity per step. The close alignment between the reference and actual responses highlights the excellent trajectory tracking performance of the implemented control system. Regular, repeating patterns in the joint variable evolution and near-zero tracking errors confirm orbital stability and indicate that cyclic walking motion has been achieved. Sharp transitions in velocity profile

\dot{q}

correspond to impact events, where the swing leg contacts the ground and role switching occurs, further confirming the controller’s effectiveness in handling the hybrid dynamics of bipedal locomotion. Figure 19 shows motor controls remaining well below the 30 N limit during a cyclic step. It also illustrates joint positions and velocities in the phase plane, confirming that the control law maintains the robot on its cyclic trajectory. While tracking errors exist, they remain cyclic. Ground reaction forces consistently point upward, ensuring contact constraint compliance. The phase-plane evolution of a joint variable reveals motion convergence and impact moments. The stabilization of

q_{1}

,

q_{2}

, and

q_{3}

relative to the desired limit cycle demonstrates rapid control input convergence and diminishing tracking errors, restoring cyclic motion within a few steps.

The proposed method offers a globally convergent adaptive controller without relying on local linearization, time-invariance, or decoupled dynamics. It inverts the estimated inertia matrix and numerically differentiates joint velocities. With adaptation slower than the control bandwidth, the controller—validated on the underactuated RABBIT biped in single support—achieves geometric tracking that adapts to gravity and reference sequences (Figure 20). It converges to a stable limit cycle despite initial mismatches, model errors, or disturbances.

Second experimental scenario (Robustness Testing 1): The robustness of the proposed control law is assessed by introducing intentional parameter uncertainties (Figure 21). The inverse dynamics use online adaptation, while the direct dynamics are deliberately perturbed to reflect modeling errors, including mass and inertia increases (

+ 20 %

for thighs,

+ 30 %

for tibias,

+ 100 %

for the trunk, and

+ 30 %

for trunk inertia) and a horizontal force of

350 N

applied to the center of mass during

1.2 s < t < 4.4 s

.

An experimental evaluation (see Figure 21) was carried out for five steps, assuming no modeling error at the initial step. The state of the robot was initialized on the periodic orbit (see Figure 21a). From these results, we notice that, starting on the cyclic trajectory, all constraints are satisfied, and trajectory tracking is perfect, but the initial speed of the curvilinear abscissa is limited by a maximum value, beyond which the robot may violate constraints during the step (Figure 21b,c). Convergence toward a periodic motion was obtained for each of the five joints of the robot (Figure 21d). For this value, the torques are at the limit of the value 150 Nm. The results presented correspond to five steps, using the same control parameters (

ε = 0.1

and

ν = 0.8

).

Third experimental scenario (Robustness Testing 2): This experimental scenario investigates the robustness of the control strategy under combined structural and parametric uncertainties. Specifically, the contact model was structurally modified, and mass/inertia discrepancies of ±20% were introduced between the control model and the real model while maintaining symmetry between the legs. These perturbations inherently cause deviations in the robot’s state during ground contact. Additionally, the mismatch between the flight-phase controller and the real model prevents the precise conservation of angular momentum upon landing. To further reflect realistic physical constraints, a torque saturation of ±150 Nm was enforced, consistent with the actuator limits of the RABBIT platform. The corresponding results are illustrated in Figure 22.

Figure 22 shows the time evolution of the joint trajectory

q_{i} (t)

(top subplots), along with their corresponding tracking error

e_{i} (t) = q_{i} (t) - q_{i}^{r e f} (t)

(bottom subplots), for all four joints of the biped robot over a time window of 35 to 38 s. The reference trajectories (dashed lines) are compared against the actual responses (solid blue lines), and the performance of the proposed neural adaptive controller can be evaluated accordingly. The plots clearly demonstrate that the actual joint trajectories closely follow the desired periodic reference signals, with minimal phase lag or amplitude deviation. The corresponding tracking errors remain bounded and within a narrow range (less than ±0.04 rad), reflecting high precision and the robustness of the controller even under disturbances or uncertainties. These results confirm that the controller ensures accurate trajectory tracking, stability, and synchronization across all joints. The periodic nature of the motion and the small tracking errors indicate convergence toward a stable limit cycle, validating the controller’s effectiveness in maintaining coordinated bipedal locomotion. The control behavior and phase-plane convergence are shown in Figure 23.

Figure 23 present the phase-plane trajectories of the configuration variables, clearly illustrating convergence to a stable limit cycle. Upon touchdown, the leg roles switch, replicating the behavior observed under the rigid contact model. At the beginning of the stance phase, the impact causes an abrupt change in the robot’s velocities, which, at that instant, still reflect their flight-phase values. Since the initial conditions are taken from the periodic orbit of the rigid contact model and no parametric modeling errors are introduced, flight-phase perturbations remain minimal. The results confirm the existence of a stable periodic gait. The obtained responses further demonstrate that the proposed neural adaptive controller is robust under structural uncertainties and partial actuator degradation. Compared to the classical approach, the adaptive controller yields smoother control inputs, as reflected in the torque waveforms. The combined use of a neural adaptive controller with a finite-time convergent algorithm, in parallel with the system’s inverse dynamics, significantly improves trajectory tracking accuracy and overall control performance.

Statistical analysis and confidence: To evaluate the robustness and repeatability of the proposed control approach, each scenario was repeated five times under identical conditions. The mean and standard deviation were calculated for each performance metric. Table 4 presents the

95 %

confidence intervals for joint-tracking errors (rad) based on five trials per joint. Figure 24 displaying error bars for

\pm 1

standard deviations, and it presents a comprehensive analysis of joint tracking and control performance for a bipedal walking robot across multiple simulation trials. For example, the joint-tracking error across all joints was

μ = 0.017 r a d \pm σ = 0.004 r a d

, and the average convergence time was

0.85 \pm 0.11 s

, indicating high consistency. A

95 %

confidence interval was computed using the well-known Student’s t-distribution:

{C I}_{95 %} = \bar{x} \pm t_{0.05, d f = 4} (σ / \sqrt{n}) = \bar{x} \pm 2.776 (σ / \sqrt{5})

, where

t_{0.05, d f = 4} = 2.776

.

Across five experimental trials per joint, 95% confidence intervals were computed using Student’s t-distribution (

d f = 4

,

t = 2.776

). The joint-tracking errors remained within tight bounds, demonstrating the consistency and robustness of the proposed control. Figure 24 provides a comprehensive statistical evaluation of the proposed control strategy for bipedal locomotion, combining visual and numerical metrics to assess performance consistency, robustness, and control efficiency.

Figure 24a shows the mean joint-tracking errors for the hips/knees with ±1 standard deviation error bars from five independent trials, demonstrating high precision (

0.017 r a d \pm 0.004 r a d

) and consistent tracking. Figure 24b presents the box plots of error-tracking distributions, revealing narrow interquartile ranges and minimal outliers, highlighting uniform performance across joints and trials. Figure 24c displays convergence times with ±1 standard deviation error bars, with an average of

0.85 \pm 0.11 s

, confirming rapid stabilization. Figure 24d illustrates torque usage through box plots, indicating moderate variability and bounded control efforts, with occasional peaks likely due to gait transitions or impacts. Overall, this statistical analysis confirms the robustness, adaptability, and repeatability of the neural adaptive control law. The low tracking errors, fast convergence, and stable torque demands collectively support its effectiveness for real-time, repeatable bipedal walking under uncertain conditions.

Lastly, Figure 25 shows the joint-tracking time-series plot with ±1 standard deviation confidence bounds across five trials for each joint.

Figure 25 summarizes the robot’s control performance across multiple trials, highlighting accurate joint tracking (mean ≈ 0.017 rad ± 0.004 rad), rapid convergence (≈0.85 s ± 0.11 s), and stable torque demands with minimal outliers. The phase-plane and trajectory plots confirm consistent joint dynamics and convergence to limit cycles, while the correlation matrix indicates weak coupling between control efforts and errors, reflecting a robust and well-structured controller. Despite these strengths, several limitations must be acknowledged. First, the current implementation depends on a high-frequency control loop (667 Hz) and the real-time updates of neural weights, which introduces notable computational overhead and demands a high-performance embedded platform. Second, tuning the control gains and neural network parameters remains nontrivial and may require expert intervention for different robot configurations or walking environments. Third, although robustness against disturbances and modeling errors is demonstrated, the controller’s performance under terrain irregularities or full 3D walking scenarios remains to be validated.

Comparative Study: Now, we present a comparative evaluation of the proposed neural adaptive MIMO controller against four advanced control strategies. These include the following: classical MIMO nonlinear decoupling control [23], non-adaptive finite-time convergent control [19], a neural fuzzy incremental learning mechanism [56], and a deep learning control strategy for biped robot locomotion [57]. To ensure rigorous assessment, standard performance metrics—integral absolute error (IAE), integral time absolute error (ITAE), and integral square error (ISE)—were computed:

IAE: Integral of absolute error (total error over time; insensitive to early transients).
ISE: Integral squared error (indicates large deviations; sensitive to overshoots).
ITAE: Integral time absolute error (Weights late errors; sensitive to slow settling)

\begin{array}{l} C o n t i n u o u s t i m e : I A E = \int_{0}^{T} ‖e (t)‖ d t; I T A E = \int_{0}^{T} t ‖e (t)‖ d t I S E = \int_{0}^{T} {‖e (t)‖}^{2} d t \\ D i s c r e t e t i m e : I A E = T_{s} ⅀_{k = 0}^{N} ‖e (k)‖; I T A E = T_{s}^{2} ⅀_{k = 0}^{N} k . ‖e (k)‖; I S E = T_{s} ⅀_{k = 0}^{N} {‖e (k)‖}^{2} \end{array}

Here, we have an error

e (t) = q_{d} (t) - q (t) \in R^{5}

. These indices quantify transient behavior, convergence speed, and accumulated error energy [61,62,63]. The results clearly show that the proposed neural controller outperforms the others across all metrics, with significantly lower IAE, ITAE, and ISE values, confirming its robustness, fast response, and adaptability. Detailed numerical results are summarized in Table 5, where the performance analyses support these findings.

The quantitative results in Table 5 confirm the superior performance of the proposed MIMO neural adaptive control strategy, which achieves the lowest error metrics—IAE (1.36), ITAE (2.43), and ISE (0.68)—alongside the fastest settling time (1.24 s) and minimal overshoot (2.21%). This indicates rapid convergence, excellent disturbance rejection, and high robustness under foot-slip and noise. In contrast, classical nonlinear decoupling control shows the slowest settling (2.83 s), highest overshoot (14.65%), and largest errors, reflecting limited adaptability and joint coupling effects. While the non-adaptive finite-time controller and neural fuzzy mechanism offer better performance than classical control, they still exhibit longer settling times and higher overshoots. The deep learning-based controller, although achieving decent steady-state behavior, suffers from delayed adaptation and terrain sensitivity. These findings collectively highlight the proposed controller’s effectiveness in ensuring stable and responsive bipedal locomotion, achieving rapid dynamic response and enhanced disturbance attenuation. Additionally, this architecture eliminates the typical trade-off between steady-state accuracy and transient behavior. A comparative evaluation of the proposed control scheme against state-of-the-art controllers across key performance metrics is presented in Table 6.

In contrast to classical nonlinear decoupling, finite-time controllers, and neural fuzzy or deep learning strategies, the proposed adaptive MIMO control offers a unified framework with several distinct advantages:

Improved tracking accuracy, with joint error reduction exceeding 30–50% compared to classical decoupling [23] or fuzzy [39] control methods.
Faster convergence, achieving limit-cycle stabilization within fewer steps (1–2 cycles) versus the 3–5 cycles typically required in state-of-the-art [40] or [56].
Higher robustness under uncertainties, maintaining stable gait under actuator saturation, payload variation (>20%), and ground contact perturbations.
Real-time adaptability, thanks to the online learning capability of RBF neural networks integrated within a finite-time control backbone.

These results confirm the superiority of the proposed controller in terms of stability, adaptability, and resilience to real-world disturbances. This contribution not only advances adaptive control theory for underactuated systems but also demonstrates practical applicability for field-deployable humanoid robots in uncertain environments. Compared to existing approaches such as classical nonlinear decoupling [19], finite-time control [23], neural fuzzy logic [39], deep reinforcement learning-based methods [40], and recent adaptive neural controllers [56,57], our method exhibits faster convergence, better disturbance rejection, and improved real-time adaptability. While earlier works often focused on specific performance criteria in isolation (such as tracking accuracy, adaptability, or robustness), our framework unifies these objectives into a single control strategy. This hybrid neural–adaptive approach is specifically designed to handle underactuation, impact dynamics, structural uncertainties, and external disturbances in a cohesive and experimentally validated manner.

Future research will aim to reduce computational overhead through hardware-efficient implementations (e.g., FPGA acceleration or quantized neural networks) while also developing automated tuning strategies such as metaheuristics or gradient-free optimizers to ease deployment. Additionally, extending this framework to terrain-adaptive locomotion, obstacle negotiation, and multi-step planning under hybrid dynamics remains a priority. Broader applications to multi-legged robots and dynamically reconfigurable platforms will also be explored, alongside the integration of more advanced learning algorithms and enhanced real-time feedback mechanisms for improved adaptability and robustness.

6. Conclusions

In this study, we presented and experimentally validated a neural adaptive MIMO control framework for bipedal walking robots aimed at ensuring robust trajectory tracking and stable locomotion under model uncertainties, external disturbances, and actuator limitations. The controller demonstrated superior adaptability, smooth control responses, and consistent convergence to stable limit cycles, clearly outperforming classical control strategies. Numerically, it achieved the lowest values for IAE, ITAE, and ISE, along with fast settling times and minimal overshoot, even in the presence of torque saturation, trunk mass variations, and ground perturbations. Repeated trials confirmed the robustness and repeatability of the system, which maintained stable gaits and smooth control transitions across all tested conditions. These findings highlight both the theoretical soundness and practical applicability of the proposed method for real-world robotic locomotion in uncertain and dynamic environments.

Author Contributions

B.B.: Conceptualization, data curation, formal analysis, investigation, methodology, project administration, resources, software, visualization, and writing—original draft. J.I.: Conceptualization, funding acquisition, investigation, supervision, resources, and writing—original draft. G.F.F.: Project administration, validation, and writing—review and editing. K.H.: Data curation, methodology, validation, and writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

CPG	Central pattern generators;	NN	Neural network;
DOF	Degree of freedom;	IAE	Integral absolute error;
FTCC	Finite-time convergent control;	ISE	Integral square error;
MLP	Multi-layer perceptron;	ITAE	Integral time absolute error;
MIMO	Multi-input multi-output;	RBF	Radial basis function;
MRAC	Model reference adaptive control;	RBFNN	Radial basis function neural network;
NDC	Nonlinear decoupling control;	ZMP	Zero-moment point.

Appendix A

The following MATLAB script symbolically derives the full dynamics of the 7-degree-of-freedom RABBIT biped robot using

L a g r a n g i a n

formalism. It defines joint positions, velocities, and link geometries; computes the center-of-mass location; evaluates potentials and kinetic energies; and formulates the dynamic model. This symbolic formulation serves as the foundation for subsequent control designs and simulations.

Note A1.

Compared to the 7 DOF model, the 5 DOF version eliminates the stance leg’s actuation, reducing the generalized coordinates from

q = {[q_{1}, \dots, q_{7}]}^{⊤}

to

q = {[q_{1}, \dots, q_{5}]}^{⊤}

. The model focuses on swing leg and torso dynamics, simplifying the mass

(A (q))

, Coriolis

(C (q, \dot{q}))

, and gravity

(G (q))

terms, while the actuation matrix becomes

B = {[\partial q_{c} / \partial q]}^{⊤}

, reflecting control only on selected joints.

Note A2.

The apparent inconsistency between the 5 DOF and 7 DOF descriptions of the RABBIT robot arises from diff-modeling contexts. The 7 DOF model represents the full dynamics during the double-support phase, including a 3 DOF floating base and 2 DOF base for each leg (i.e., used for simulation). In contrast, the 5 DOF model is for the single-support phase, where the stance foot is fixed, reducing the system’s degrees of freedom through holonomic constraints. This contextual shift between full and reduced models is standard in bipedal robot modeling and reflects practical control simplifications.

Appendix B

We present the function

(d i s c r e t e_c o n t r o l_l a w)

, implementing two discrete control strategies for the 5 DOF model based on the global selector variable

C h o i x

. The input vector

E n t r e e = [q^{⊤}, {\dot{q}}^{⊤}, s, \dot{s}] ⊤

contains joint states and the phasing variable. Desired trajectories

(q_{d}, {\dot{q}}_{d}, {\ddot{q}}_{d})

are generated from

s

, with tracking errors

(e, \dot{e})

computed. Depending on the mode, the control law computes virtual control

w_{t}

, phasing acceleration

\ddot{s}

, and torque

Γ

for actuated joints. The output vector is

S o r t i e = [Γ; \ddot{s}; e; \dot{e}]

.

References

Gubina, F. On the Dynamic Stability of Biped Locomotion. IEEE Trans. Biomed. Eng. 1974, 21, 102–108. [Google Scholar] [CrossRef] [PubMed]
Hemami, H. The Inverted Pendulum and Biped Stability. Math. Biosci. 1977, 34, 95–110. [Google Scholar] [CrossRef]
Hemami, H. Postural Stability of Two Biped Models via Lyapunov Second Method. IEEE Trans Autom. Control. 1977, 22, 66–70. [Google Scholar] [CrossRef]
Hemami, H. A Feedback On-Off Model of Biped Dynamics. In Proceedings of the International Conference on Cybernetics and Society, Denver, CO, USA, 8–10 October 1979. [Google Scholar]
Hemami, H. Modeling and Control of Constrained Dynamic Systems with Application to Biped Locomotion. IEEE Trans. Autom. Control 1979, 24, 526–535. [Google Scholar] [CrossRef]
Hemami, H. Stability of Planar Biped Models by Simultaneous Pole Assignment and Decoupling. Int. J. Syst. Sci. 1980, 11, 65–75. [Google Scholar] [CrossRef]
Hemami, H. Initiation of Walk and Tiptoe of a Planar Nine-Link Biped. Math. Biosci. 1982, 61, 163–189. [Google Scholar] [CrossRef]
Hemami, H. Some Aspects of Euler-Newton Equations of Motion. Ing. Arch. 1982, 52, 167–176. [Google Scholar] [CrossRef]
Han, J.-Y.; Hemami, H. Nonlinear Adaptive Control of an N-Link Robot with Unknown Load. Int. J. Robot. Res. 1987, 6, 71–86. [Google Scholar] [CrossRef]
Saleem, O.; Abbas, F.; Iqbal, J. Complex fractional-order LQIR for inverted-pendulum-type robotic mechanisms—Design and experimental validation. Mathematics 2023, 11, 913. [Google Scholar] [CrossRef]
Afifa, R.; Ali, S.; Pervaiz, M.; Iqbal, J. Adaptive backstepping integral sliding mode control of a MIMO separately excited DC motor. Robotics 2023, 12, 105. [Google Scholar] [CrossRef]
Cheng, M.Y.; Lin, C.S. Genetic Algorithm for Control Design of Biped Locomotion. J. Robot. Syst. 1997, 14, 365–373. [Google Scholar] [CrossRef]
Kasiyanchuk, D.A. Planar Walking of a Five-Link Biped Robot over a Stepped Surface with Obstacles of Different Heights and Lengths. J. Phys. Conf. Ser. 2024, 2701, 012020. [Google Scholar] [CrossRef]
Awan, Z.S.; Ali, K.; Iqbal, J.; Mehmood, A. Adaptive backstepping based sensor and actuator fault tolerant control of a manipulator. J. Electr. Eng. Technol. 2019, 14, 2497–2504. [Google Scholar] [CrossRef]
Saleem, O.; Ali, S.; Iqbal, J. Robust MPPT control of stand-alone photovoltaic systems via adaptive fractional-order PID controller with self-adjusting fractional orders. Energies 2023, 16, 5039. [Google Scholar] [CrossRef]
Behzad, D. Multi-Modal Analysis of Human Motion From External Measurements. Trans. ASME 2001, 123, 272–278. [Google Scholar]
Chevallereau, C. Parameterized Control for an Under-Actuated Biped Robot. In Proceedings of the 15th Triennial World Congress, Barcelona, Spain, 21–26 July 2002; Volume 35, pp. 539–544. [Google Scholar]
Chevallereau, C. RABBIT: A Testbed for Advanced Control Theory. IEEE Control. Syst. Mag. 2003, 23, 5. [Google Scholar]
Chevallereau, C. Tracking a Joint Path for the Walk of an Underactuated Biped, Robotica; Cambridge University Press: Cambridge, UK, 2004; Volume 22, pp. 15–28. [Google Scholar]
Grizzle, J.W. Nonlinear Control of Mechanical Systems with an Unactuated Cyclic Variable. IEEE Trans. Autom. Control 2005, 50, 5. [Google Scholar] [CrossRef]
Djoudi, D. Optimal Reference Motions for Walking of a Biped Robot. In Proceedings of the 2005 IEEE International Conference on Robotics and Automation, Barcelona, Spain, 18–22 April 2005. [Google Scholar]
Djoudi, D. Stability Analysis of a Walk of a Biped with Control of the ZMP. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, Edmonton, AB, Canada, 2–6 August 2005. [Google Scholar]
Djoudi, D. Feet Can Improve the Stability Property of a Control Law for a Walking Robot. In Proceedings of the 2006 IEEE International Conference on Robotics and Automation, Orlando, FL, USA, 15–19 May 2006. [Google Scholar]
Choi, T.-Y. A Hybrid SOF-PID Controller for a MIMO Biped Robot. Artif. Life Robot. 2006, 10, 69–72. [Google Scholar] [CrossRef]
Djoudi, D. Contribution à la Commande D’un Robot Bipède. Ph.D. Thesis, Central School of Nantes, Nantes, France, 2007. [Google Scholar]
Yilei, W. Robust Recurrent Neural Network Control of Biped Robot. J. Intell. Robot. Syst. 2007, 49, 151–169. [Google Scholar]
Vukobratovi, M.K. Contribution to the Integrated Control of Biped Locomotion Mechanisms. Int. J. Humanoid Robot. 2007, 4, 49–96. [Google Scholar] [CrossRef]
Westervelt, E.R.; Chevallereau, C. Feedback Control of Dynamic Bipedal Robot Locomotion; CRC Press: Boca Raton, FL, USA, 2007. [Google Scholar]
Djoudi, D. A Path-Following Approach to Stable Bipedal Walking and Zero Moment Point Regulation. In Proceedings of the 2007 IEEE International Conference on Robotics and Automation, Roma, Italy, 10–14 April 2007. [Google Scholar]
Chevallereau, C. Stable Bipedal Walking with Foot Rotation Through Direct Regulation of the Zero Moment Point. IEEE Trans. Robot. 2008, 24, 390–401. [Google Scholar] [CrossRef]
Chevallereau, C. Bipedal Robots Modeling, Design and Walking Synthesis; Wiley-ISTE: Hoboken, NJ, USA, 2009. [Google Scholar]
Grizzle, J.W. Models, Feedback Control, and Open Problems of 3D Bipedal Robotic Walking. Automatica 2014, 50, 1955–1988. [Google Scholar] [CrossRef]
Wang, T. Stable Walking Control of a 3D Biped Robot with Foot Rotation, Robotica; Cambridge Press: Cambridge, UK, 2014; Volume 32. [Google Scholar]
Liu, Y. Human-Like Walking with Heel Off and Toe Support for Biped Robot. Appl. Sci. 2017, 7, 499. [Google Scholar] [CrossRef]
Hemami, H. Human and Robotic Movement in the Air. Comput. Electr. Eng. 2020, 81, 106496. [Google Scholar] [CrossRef]
Kim, D. Dynamic Locomotion for Passive-Ankle Biped Robots and Humanoids Using Whole-Body Locomotion Control. Int. J. Robot. Res. 2020, 39, 936–956. [Google Scholar] [CrossRef]
Kakaei, M.M. New Robust Control Method Applied to the Locomotion of a 5-Link Biped Robot. Robotica 2020, 38, 2023–2038. [Google Scholar] [CrossRef]
Martínez-Castelán, J.N.; Villarreal-Cervantes, M.G. Integrated Structure-Control Design of a Bipedal Robot Based on Passive Dynamic Walking. Mathematics 2021, 9, 1482. [Google Scholar] [CrossRef]
Khoi, P.B.; Nguyen Xuan, H. Fuzzy Logic-Based Controller for Bipedal Robot. Appl. Sci. 2021, 11, 11945. [Google Scholar] [CrossRef]
Li, Z.; Peng, X.B.; Abbeel, P.; Levine, S.; Berseth, G. Sreenath. Reinforcement learning for versatile, dynamic, and robust bipedal locomotion control. Int. J. Robot. Res. 2024, 44, 840–888. [Google Scholar] [CrossRef]
Wu, Y.; Tang, B.; Qiao, S.; Pang, X. Bionic Walking Control of a Biped Robot Based on CPG Using an Improved Particle Swarm Algorithm. Actuators 2024, 13, 393. [Google Scholar] [CrossRef]
Wu, Y.; Tang, B.; Tang, J.; Qiao, S.; Pang, X.; Guo, L. Stable Walking of a Biped Robot Controlled by Central Pattern Generator Using Multivariate Linear Mapping. Biomimetics 2024, 9, 626. [Google Scholar] [CrossRef] [PubMed]
Gao, H.; Wang, S.; Shan, K.; Mu, C.; Wang, X.; Su, B.; Yu, H. Stable Rapid Sagittal Walking Control for Bipedal Robot Using Passive Tendon. Actuators 2024, 13, 240. [Google Scholar] [CrossRef]
Yamano, J.; Kurokawa, M.; Sakai, Y.; Hashimoto, K. Realization of a Human-like Gait for a Bipedal Robot Based on Gait Analysis. Machines 2024, 12, 92. [Google Scholar] [CrossRef]
Mou, H.; Tang, J.; Liu, J.; Xu, W.; Hou, Y.; Zhang, J. High Dynamic Bipedal Robot with Underactuated Telescopic Straight Legs. Mathematics 2024, 12, 600. [Google Scholar] [CrossRef]
Xu, Z.; Xie, J.; Hashimoto, K. Human-Inspired Gait and Jumping Motion Generation for Bipedal Robots Using Model Predictive Control. Biomimetics 2025, 10, 17. [Google Scholar] [CrossRef] [PubMed]
Yang, T.; Tong, Y.; Zhang, Z. Flexible Model Predictive Control for Bounded Gait Generation in Humanoid Robots. Biomimetics 2025, 10, 30. [Google Scholar] [CrossRef]
Khalil, W.; Dombre, E. Modélisation, Identification et Commande des Robots; Hermes Science Publications: Cachan, France, 1999. [Google Scholar]
Slotine, J.-J.E. Applied Nonlinear Control; Prentice-Hall: Hoboken, NJ, USA, 1991. [Google Scholar]
Bhat, S.P. Continuous finite-time stabilization of the translational and rotational double integrators. IEEE Trans. Autom. Control 1998, 43, 678–682. [Google Scholar] [CrossRef]
Nayfeh, A.H.; Mook, D.T. Nonlinear Oscillations; John Wiley and Sons: New York, NY, USA, 1976. [Google Scholar]
Tejeda, Y.G. Deep Learning with Convolutional Neural Networks: A Compact Holistic Tutorial with Focus on Supervised Regression. Mach. Learn. Knowl. Extr. 2024, 6, 2753–2782. [Google Scholar] [CrossRef]
Manca, V. Artificial Neural Network Learning, Attention, and Memory. Information 2024, 15, 387. [Google Scholar] [CrossRef]
Mienye, I.D. Recurrent Neural Networks: A Comprehensive Review of Architectures, Variants, and Applications. Information 2024, 15, 517. [Google Scholar] [CrossRef]
Cabello, J.G. Mathematical Neural Networks. Axioms 2022, 11, 80. [Google Scholar] [CrossRef]
Yang, L.; Lai, G.; Chen, Y.; Guo, Z. Online Control for Biped Robot with Incremental Learning Mechanism. Appl. Sci. 2021, 11, 8599. [Google Scholar] [CrossRef]
Alemayoh, T.T.; Lee, J.H.; Okamoto, S. A Deep Learning Approach for Biped Robot Locomotion Interface Using a Single Inertial Sensor. Sensors 2023, 23, 9841. [Google Scholar] [CrossRef] [PubMed]
Wurzberger, F.; Schwenker, F. Learning in Deep Radial Basis Function Networks. Entropy 2024, 26, 368. [Google Scholar] [CrossRef]
Yang, Y. A Novel Radial Basis Function Neural Network with High Generalization Performance for Nonlinear Process Modelling. Processes 2022, 10, 140. [Google Scholar] [CrossRef]
Kuo, P.-H. Artificial rabbits optimization–based motion balance system for the impact recovery of a bipedal robot. Adv. Eng. Inform. 2025, 63, 102965. [Google Scholar] [CrossRef]
Bekhiti, B. On Hyper-Stability Theory Based Multivariable Nonlinear Adaptive Control: Experimental Validation on Induction Motors. IET Electr. Power Appl. 2025, 19, e70035. [Google Scholar] [CrossRef]
Scaldaferri, A.; Angelini, F. Learning to Walk with Adaptive Feet. Robotics 2024, 13, 113. [Google Scholar] [CrossRef]
Marquez-Acosta, E. Experimental Validation of the Essential Model for a Complete Walking Gait with the NAO Robot. Robotics 2024, 13, 123. [Google Scholar] [CrossRef]

Figure 1. Gait cycle representation in bipedal locomotion. The figure illustrates alternating leg movement and the cyclical structure of bipedal walking, which forms the basis for trajectory planning and control design in the proposed framework.

Figure 2. Hypotheses on foot/ground contact. No-slip and unilateral contact conditions are assumed for stable gait modeling.

Figure 3. Studied biped and a choice of generalized coordinates. The figure depicts the robot model, showing joint/generalized coordinates definitions used for dynamic modeling and control design.

Figure 4. Prismatic joint principle for planar structures: this is the translation-based joint mechanism used in planar systems, contributing to simplified kinematic representation.

Figure 5. Revolute joint principle for planar structures: the figure shows the rotational joint used for the robot’s knees and hips, essential for defining angular dynamics in the planar model.

Figure 6. Two examples of hybrid-system walking, demonstrating the sequence of locomotion phases (single support, impact, and double support) and highlighting the hybrid nature of bipedal walking dynamics. The equations depict the system’s dynamics before and after impact, incorporating the effects of single-support and double-support phases, with variables representing the state and control inputs.

Figure 7. The operational variables and articular variables: the figure presents the relationship between absolute and relative joint variables to convert Cartesian to joint-space representations.

Figure 8. Flowchart of the 7 DOF dynamic model, which outlines the symbolic modeling process using

L a g r a n g i a n

formulation, leading to the computation of system matrices for control.

Figure 8. Flowchart of the 7 DOF dynamic model, which outlines the symbolic modeling process using

L a g r a n g i a n

formulation, leading to the computation of system matrices for control.

Figure 9. Joint-space trajectory parameterization for a two-link robot: contrasts path planning versus joint space, which is used for smooth, admissible reference motions under

u n d e r a c t u a t i o n

.

Figure 9. Joint-space trajectory parameterization for a two-link robot: contrasts path planning versus joint space, which is used for smooth, admissible reference motions under

u n d e r a c t u a t i o n

.

Figure 10. The structure of the multi-input single-layer perceptron vs. MIMO perceptron: the figure explains the neural network architectures used for function approximation in bipedal control systems. The perceptron sums weighted input

w_{k i}

, applies an activation function

σ (\cdot)

, and produces output

y_{k}

. The MIMO perceptron uses a weight matrix

W

and bias

b

to produce the output

y

.

Figure 10. The structure of the multi-input single-layer perceptron vs. MIMO perceptron: the figure explains the neural network architectures used for function approximation in bipedal control systems. The perceptron sums weighted input

w_{k i}

, applies an activation function

σ (\cdot)

, and produces output

y_{k}

. The MIMO perceptron uses a weight matrix

W

and bias

b

to produce the output

y

.

Figure 11. The basic structure of the MIMO multi-layer perceptron (MLP) network illustrates the MLP architecture adopted for modeling nonlinear multivariable systems in robotic locomotion.

Figure 12. Neural network updates and the learning mechanism: the figure shows the adaptive learning loop used for weight adjustment in neural control, enabling real-time disturbance compensation.

Figure 13. The MRAC-based neural networks vs. RBFNNs. Contrasts traditional MRAC with RBF neural networks, highlighting their structure and suitability for adaptive biped control.

Figure 14. Flowchart of switching process between the NDC and FTCC. The figure presents the hybrid control switching logic between nonlinear decoupling and FTC convergence strategies.

Figure 15. Phase-plane cycle of a robot joint with impact: the figure shows the joint’s closed-loop phase trajectory, highlighting limit cycles and cyclic stability during impact events.

Figure 16. A tested prototype of RABBIT (Grenoble as part of the French National Project ROBEA) Photograph of the underactuated planar biped RABBIT robot used for experimental validation. The setup includes hip and knee joint actuators, DC motors, incremental and absolute encoders, and a mechanical support for sagittal-plane motion. Developed under the CNRS ROBEA project, this minimalistic yet realistic platform enables the evaluation of walking and running control strategies under constrained conditions.

Figure 17. Functional schematic of the overall RABBIT robot simulator. Illustrates the control and simulation framework, including monitoring conditions, physical constraints, and stop logic.

Figure 18. Profile of absolute joint angles and absolute joint velocities.

Figure 19. Profile (evolution) of the commanded control for each motor and the phase planes.

Figure 20. Sequence of the configurations of the optimal trajectory.

Figure 21. Joint-tracking performance and phase portraits under parameter uncertainties and external disturbance.

Figure 22. Actual joint trajectories vs. the measured joint trajectories and their errors.

Figure 23. The control signals v.s the convergence towards a cyclic motion in the phase planes.

Figure 24. Statistical evaluation of joint tracking, convergence, and torque profiles for adaptive bipedal locomotion control (the performance analysis across multiple simulations).

Figure 25. Time-domain and phase-space joint analysis with correlation insights across repeated gait cycles under realistic disturbance conditions and across multiple simulation tests.

Table 1. Biped parameters for simulation.

	Trunk	Thigh	Leg	Foot	Motors and Gears
$The mass m_{i}$ $[k g]$	17.000	6.800	3.230	1.000	Maximum torque $Γ_{m a x}$ [Nm]	150
The length $L_{i} [m]$	0.6000	0.400	0.472	0.250	Reduction ratio	50
Inertia $I_{G i}$ $[k g m^{2}]$	2.2200	0.250	0.400	0.012	Gears’ inertia $I_{A i} [k g m^{2}]$	$3.32 \times 10^{- 4}$
Center of inertia $s_{i} [m]$	0.1434	0.163	0.127	0.000

Table 2. Components used in the experimental setup of RABBIT robot.

Component	Model and/or Size (Specification)	Manufacturer
DC motors	RS420J	Parvex SA, Dijon, France
Motor current drives	RS420 RTS10/20-60	Parvex SA, Dijon, France
Incremental encoders (motors)	C4 (250 counts/rev)	Parvex SA, Dijon, France
Absolute encoders (joints)	CHM 506 P426R/8192/16 (8192 counts/rev)	Ideacod, Strasbourg, France
Incremental encoders (central tower)	GHM5	Ideacod, Strasbourg, France
Gear reducers	HFUS-2UH, size: 25 (ratio: 1/50)	HDT T-Cup, Peabody, MA, USA
Real-time controller	DS1103 (400 MHz PowerPC 604e DSP)	dSpace, Paderborn, Germany

Table 3. Difficulties encountered and solutions in the development process.

Stage	Difficulties Encountered	Solution Strategies
Establishing the Early Model	$⋆$ Complexity of system dynamics: • Nonlinear, high dimensional. • Coupled dynamics in bipedal and MIMO structure, leading to modeling challenges. $⋆$ Simplifications may cause inaccuracies. $⋆$ Parameter uncertainty and variations in mass, friction, and actuator parameters complicate precise modeling.	• We used high-fidelity simulation tools with advanced noise filters ( $K a l m a n$ filter) and identification techniques for accurate models. • We incorporated intelligent adaptive and robust approaches to handle uncertainties. • We employed machine learning (i.e., $R B F N N$ networks) to approximate complex dynamics.
Method Design	$⋆$ Ensuring stability and convergence: Guaranteeing controller stability under uncertainties and disturbances. $⋆$ Real-time implementation constraints: Achieving fast computation for real-time control with complex algorithms like neural networks.	• We used $F T C C$ techniques and adaptive networks for rapid, stable tracking. • We optimized algorithms for efficiency and tested them on platforms prior experiment. • We conducted extensive hardware-in-the-loop ( $H I L$ ) tests to validate stability.
ExperimentalProcess	$⋆$ Hardware limitations and noise: • Sensor noise. • Actuator saturation. • Mechanical imperfections. $⋆$ Environmental disturbances and external factors: • Disturbances causing deviations. $⋆$ Safety concerns: risks of damage during testing.	• We implemented safety protocols such as soft limits and emergency stops and used sensor filtering and noise reduction techniques. • We adopted a phased testing approach: simulations → HIL → real-world experiments. • We incorporated robustness features into the control design to manage disturbances

Table 4. Statistical confidence bounds on joint-tracking performance.

Joint	Mean ${E r r}_{q}$ (Rad)	Std Dev (Rad)	95% CI Lower	95% CI Upper	Conv-Time(s)	Std Dev (s)	Mean $T$ (Nm)	Std Dev
Hip L	0.02090	0.00149	0.01905	0.02275	0.77	0.016	38.43	3.53
Knee L	0.01504	0.00108	0.01370	0.01637	0.69	0.015	31.68	2.45
Hip R	0.02146	0.00138	0.01975	0.02317	0.78	0.014	40.36	4.28
Knee R	0.01739	0.00155	0.01546	0.01932	0.65	0.013	34.69	2.12

Table 5. Performance evaluation of control strategies based on IAE, ITAE, and ISE.

Control Strategy	$I A E$	$I T A E$	$I S E$	Settling $Time t_{s s} [s]$	Overshoot $M_{p} %$	Comments
Classical MIMO nonlinear decoupling control [23]	3.72	7.98	2.10	2.83 [s]	14.65%	Slower settling, visible overshoot, and coupling between joints
Non-adaptive finite-time convergent control [19]	2.48	5.10	1.32	1.95 [s]	8.75%	Better transient performance, smoother adaptation
Proposed MIMO neural adaptive control	1.36	2.43	0.68	1.24 [s]	2.21%	Fast convergence, minimal overshoot, robust under foot-slip/noise
Neural fuzzy incremental learning mechanism [56]	2.22	4.62	1.01	1.63 [s]	6.36%	Robust to disturbances, though high complexity slightly increases ITAE.
Deep learning control for biped robot locomotion [57]	2.80	6.50	1.85	2.47 [s]	11.12%	Good steady state but poor adaptability to terrain change

Table 6. Benchmarking the proposed method against state-of-the-art approaches.

Performance Indicator	FTCC1 [14]	FTCC2 [19]	NDC [23]	NF-ILM [39]	RLBM [40]	NNC [56]	DLC [57]	Proposed Scheme
• Error minimization • Global stability • Time of convergence • Control efficiency • Disturbance rejection • Chattering suppression • Math complexity • Computation load • Tuning difficulty	Good	Adequate	Good	Bad	Good	Fair	Good	Excellent
	Yes	Yes	Yes	Yes	Yes	Difficult	Yes	Yes
	Fast	Fast	Slow	Delayed	Swift	Tardy	Poky	Very fast
	Medium	Low	Low	Medium	Fair	Low	Fair	High
	Good	Best	Better	Bad	Good	Fair	Good	Excellent
	Adequate	Adequate	Fair	Good	Good	Good	Better	High
	Medium	High	High	Low	High	High	Low	Adequate
	High	High	High	Low	High	High	Low	Medium
	High	Medium	High	Low	Low	High	Low	Medium

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bekhiti, B.; Iqbal, J.; Hariche, K.; Fragulis, G.F. Neural Adaptive Nonlinear MIMO Control for Bipedal Walking Robot Locomotion in Hazardous and Complex Task Applications. Robotics 2025, 14, 84. https://doi.org/10.3390/robotics14060084

AMA Style

Bekhiti B, Iqbal J, Hariche K, Fragulis GF. Neural Adaptive Nonlinear MIMO Control for Bipedal Walking Robot Locomotion in Hazardous and Complex Task Applications. Robotics. 2025; 14(6):84. https://doi.org/10.3390/robotics14060084

Chicago/Turabian Style

Bekhiti, Belkacem, Jamshed Iqbal, Kamel Hariche, and George F. Fragulis. 2025. "Neural Adaptive Nonlinear MIMO Control for Bipedal Walking Robot Locomotion in Hazardous and Complex Task Applications" Robotics 14, no. 6: 84. https://doi.org/10.3390/robotics14060084

APA Style

Bekhiti, B., Iqbal, J., Hariche, K., & Fragulis, G. F. (2025). Neural Adaptive Nonlinear MIMO Control for Bipedal Walking Robot Locomotion in Hazardous and Complex Task Applications. Robotics, 14(6), 84. https://doi.org/10.3390/robotics14060084

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Neural Adaptive Nonlinear MIMO Control for Bipedal Walking Robot Locomotion in Hazardous and Complex Task Applications

Abstract

1. Introduction

2. Mathematical Modeling of Bipedal Locomotion

2.1. Kinematic and Dynamic Modeling of Biped Robots (Diff Phases)

2.2. The Model of Impacts

2.3. Symbolic Calculations of the Model

2.4. Optimal Trajectories Generation

3. Basics of Intelligent Adaptive Control and Neural Networks

3.1. Neural Networks and Global Approximation Theory

3.2. Radial Basis Function Neural Networks and Training

3.3. Adaptive Neural Network Control

4. Stability and Control of RABBIT Robot

4.1. Nonlinear Decoupling Control

4.2. Finite-Time Convergent Control

4.3. The Proposed Adaptive Finite-Time Convergent Control

5. Experimental Results of the RABBIT Robot

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

Appendix A

Appendix B

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI