Next Article in Journal
Vehicle-to-Grid (V2G) Research: A Decade of Progress, Achievements, and Future Directions
Previous Article in Journal
Charting the Landscape of Data Envelopment Analysis in Renewable Energy and Carbon Emission Efficiency
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On Increasing the Energy Efficiency and Performance of a Bicycle Robot Stabilization—Towards Event-Triggered Control Considerations

1
Faculty of Automatic Control, Robotics and Electrical Engineering, Poznan University of Technology, Piotrowo 3a Str., 60-965 Poznan, Poland
2
Institute of Robotics and Machine Intelligence, Poznan University of Technology, Piotrowo 3a Str., 60-965 Poznan, Poland
3
Department of Electrical Engineering and Information Technology, University of Naples, Corso Umberto I 40, 80125 Napoli, Italy
*
Author to whom correspondence should be addressed.
Energies 2025, 18(23), 6146; https://doi.org/10.3390/en18236146
Submission received: 29 October 2025 / Revised: 15 November 2025 / Accepted: 21 November 2025 / Published: 24 November 2025

Abstract

This paper addresses the critical problem of stabilizing an inherently unstable bicycle robot at low or zero speeds using advanced control strategies. The key objective is to develop a feedback controller that actively operates a handlebar, as well as a reaction wheel, maintaining balance while keeping performance indices low, including energy consumption. It is the energy consumption that makes the platform attractive for the users by extending the time of operation, not only due to the actions connected to motor design and battery design, but to the control system itself. To design and verify the control law that enables stabilization, and reduces energy consumption, bicycle dynamics and controllers based on linear quadratic regulator (LQR) techniques are designed, along with their robustified variants accounting for actuator failures and model uncertainties, and an innovative event-triggered control (ETC) scheme to reduce control updates and energy use. Simulations demonstrate effective stabilization from various initial lean angles under realistic conditions, with robustness notably improved at zero speed, where balancing is most challenging. Extending control to include both reaction wheel and handlebar actuation enhances dynamic maneuverability and energy efficiency. The event-triggered approach significantly lowers computational and actuator energy demands by updating controls only when needed, reducing mechanical wear. This research contributes to electromobility by improving the control and energy efficiency of personal transport devices like electric bicycles and autonomous bicycles, which rely on stable balancing mechanisms for safety and user comfort. The findings support future experimental validation and IoT implementation, pushing forward controlled, energy-conscious stabilization for emerging personal electromobility solutions.

1. Introduction

Personal electromobility faces different challenges, spanning from battery faults, limited range, or charging infrastructure issues, especially in developing countries, which strongly affect user comfort. User comfort is further influenced by factors such as temperature impact on battery life [1] and how fast-charging stations are available. Modern and advanced control laws, also with large degrees of robustness, as well as energy management systems designed to optimize motor operation, by means of energy consumption, can strongly extend battery life, energy range, as well as energy efficiency. This might be achieved either by the construction of motors, or by the design of energy-efficient control strategies [2] to reduce the total energy consumption. Every energy-efficient control solution balances between user comfort, performance, as well as energy savings. However, every such solution contributes to user satisfaction eventually from extended battery use and contributes to pollution reduction by reducing emissions.
Modern energy storage units and energy management solutions for the means of personal electromobility present a visible decrease in energy consumption, typically achieving improvements in vehicle energy efficiency by about 10–15% in comparison to standard solutions. These numbers are even more impressive when advanced control systems are included in the loop. They are of course not only attributed to maximizing regenerative braking, reducing vehicle weight, general maintenance issues, but also to smart battery management systems, and aptly designed control laws.
Taking a look at the past years, the technology offering the greatest energy savings and increase in terms of performance for personal electromobility means is connected mainly to solid-state batteries and advanced lithium-ion batteries with silicon and graphene anodes [3]. These provide a density of energy at the level of 300–500 Wh/kg, much increased with ca. 100–270 Wh/kg for old lithium-ion batteries. Higher energy density reduces the weight of the vehicle, thus enabling the companies to support the user comfort when riding motorcycles or bicycles with self-balancing systems. These typically use gyroscopic measurements, IMUs, and are followed by advanced control algorithms to maintain stability. The balancing torques produced by such solutions are large to enable stabilization in stationary condition, of much value in the so-called inclusive mobility [4], where enhancement of user comfort, safety, and reduction in probability of accidents, is of great value.
Research on self-balancing properties in electric vehicles such as in the case of Honda or Lit Motors reveals that advanced control algorithms are needed to maintain stability, but these are often connected to extra energy consumption, where continuous operation results in its increase [5]. The current paper shows it is still possible to achieve similar aims, but with the energy of the signals greatly reduced, bridging the gap between the market demand and the availability of technology, applied to the basic electromobility unit, in the form of a bicycle.
Bicycles as an attractive means of transport offer possibilities to reduce the mobility gap for the elderly and the young. For those underprivileged, riding a bike is sometimes a great challenge and comes with a lot of effort. Also for this reason, electric bicycles have been developed, but it is the problem of stabilization that still prevails. In order to stabilize a bicycle, one needs to learn how to keep balance. However, the same can be done using effective actuation and using various techniques and control algorithms, which take advantage of a reaction wheel, handlebar control, or other approaches. It is to be kept in mind that the stabilization efficiency depends also on the linear velocity of a bicycle, and is crucial when the bicycle is at near-zero or even zero velocity. This calls for special actuation methods.
The most common technique is to use a reaction wheel to generate reaction torque to keep the bicycle in balance. The construction is relatively simple and requires a motor to drive the reaction wheel, along with proper sensors to enable stabilization. Apart from the reaction wheel, one can also use the handlebar to use the steering/heading angle to keep the balance of the bicycle, to move the center of gravity to such a position as to stabilize the structure, extending the possibility of stabilization by the reaction torque to more dynamic scenarios, such as, e.g., passing a curve [6]. Handlebar control can also be used to couple its action with the reaction wheel, to restore maneuverability in dynamic situations, as well as to recover the property in case the bicycle carries a load [7].
Such systems are usually controlled by proportional–integral–derivative controllers (PIDs) or state-vector-based controllers, on the basis of a linearized model, obtained for the upright (unstable) equilibrium point [8]. A linear quadratic control (LQR) [9,10], unfortunately requires the knowledge of the linearized dynamics. Obviously, one can also expect the linear quadratic controller with integral action to be applied as well, to reduce energy consumption. On the other side, sliding-mode controllers can be used as well, to offer agile dynamics in a closed loop along with satisfactory disturbance rejection [11].
Other approaches also use a bank of controllers to enable gain-scheduled-like control, or implement expert knowledge in fuzzy sliding-mode controllers. All the approaches try to design the steering strategy to lean the bicycle so as to enter curves, or enable combined operation coupled with the reaction wheel. Obviously, the classical PID-like control is the easiest to understand; however, it lacks robustness, and the state feedback control requires a linearized model with proper accuracy to obtain stabilization near the linearization set-point, thus is sensitive to modeling errors.
More advanced approaches, like the fuzzy controller with sliding-mode feature, require fuzzy logic expertise, replacing the knowledge about the model, whereas the other approaches consume energy to compensate for the uncertainties present in the system, by making agile moves. There are also the approaches that use model predictive control or nonlinear model predictive control to achieve stabilization and balance [10].
Every approach usually undergoes a V-cycle procedure, starting from the software-in-the-loop stage, and ending on the deployment. For the actions presented in the paper, Arduino Engineering Rev 2 Kit [12] was used, where the gyroscopic effect of the reaction wheel is used to stabilize the structure, along with the handlebar action, enabling the researchers to pursue the implementation of various control strategies eventually on the hardware. In this paper, the authors use the simulation environment of Arduino hardware under Matlab and Simulink to couple LQR control with a robustifying approach, along with the reduction of control effort by using event-triggered control to reduce the energy consumption.
The adopted strategy forms a basic step prior to conducting experiments on the real hardware, enabling one to get a deep insight into the control strategies to be adopted, and to identify weak points of the techniques selected. It potentially allows other researchers to follow this line of inquiry and formulate their own approach, further extending the LQR capabilities. The model of the bicycle can be visualized using the Simscape library, to enable a better view of the structure during simulation, as in Figure 1. Such an approach enables one to test if all the code written fits to Arduino Nano IoT 33 board memory to enable future hardware tests.
The paper innovatively integrates classical control techniques (LQR) with a robustified version to handle actuator faults and model uncertainties, demonstrating improved performance and reliability. Another key strength is the introduction of event-triggered control (ETC), which effectively reduces the frequency of control updates, thereby lowering energy consumption and mechanical wear, an important consideration for battery-powered personal electromobility devices.
The inclusion of both reaction wheel and handlebar actuation, especially with the adoption of event-triggering mechanisms, along with introduced degrees of robustness, extend the control strategy to dynamic scenarios, enhancing maneuverability and further energy savings, which are so welcomed by the users.
A similar approach to the reduction of control updates for saturated systems can be found in [13], based on data-driven conditions. As per the connection to linear matrix inequalities, the rules obtained in the paper can be difficult to apply in real time, since the co-design of event-triggering and state feedback formulation is tightly coupled. In addition, no uncertainty can be handled here. On the contrary, the authors of [14] enable a robustification approach for additive uncertainties, unfortunately for continuous-time systems only, extending the well-established triggering conditions. The most promising alternative approach is presented in [15] for the predictive control, to adjust the sampling intervals to keep a balance between computational cost and cost of gathering the data, unfortunately, again for continuous-time systems. One of the latest approaches to the event-triggered control systems has been reported in [16], However, this technique is incomparably more demanding as far as computation burden is concerned. The natural aim is to maximize the cross-update intervals, here by the use of game theory, in a two-step fashion. In the first step, the decision whether to update or not is made; in the second step, an adaptive scheme based on dynamic programming is used. The advantage is the suitability for nonlinear multiple-input, multiple-output systems; the disadvantage is its continuous-time form, heavy computation burden, and no views toward robustification in the case of uncertain system description.
The major contribution can be listed as follows:
  • Extension of a standard LQR technique to handle model uncertainty;
  • Introduction of event-triggered mechanism and analysis of various performance indices to grasp the interplay between control performance and the frequency of control updates;
  • Complete evaluation of the performance of the solution in the case of a single actuator (reaction wheel), and two actuators (handlebar action added);
  • Critical comparison with the alternative solutions, from the viewpoint of the event-triggering strategy.
The structure of the paper is as follows: Section 2 outlines the basic model of the bicycle, Section 3 presents the considered control strategies, including LQR, linear quadratic plus integral, robustified, and event-triggered ones, along with model predictive and sliding-mode controllers. In Section 4, the results of a vast simulation campaign are presented, and finally, Section 5 outlines future research directions and presents the conclusions, paving the way to deployment on the real hardware.

2. Basic Model of the Bicycle

As per the inverted pendulum-like structure of the bicycle with a stabilizing reaction wheel, the basic motion law follows the second principle of Newton dynamics for the rotational motion, i.e.,
T ( t ) = J θ ¨ ( t ) ,
where J is the moment of inertia, θ is the angle of rotation with respect to the unstable upright equilibrium point, and T is the torque exerted by the reaction wheel.
In rotational motion, torques and moments of inertia are additive, thus the torque T is a combination of the torque generated by gravity forces T g , acting on the center of the mass m of the inverted pendulum rod, the torque generated by the gravity forces T w , acting on the reaction wheel of mass m w , and last, the torque generated by the motor of the reaction wheel T m , counteracting the effects of gravity,
T ( t ) = T g + T w T m .
Let us adopt the following notation:
  • θ —lean angle from the upright position;
  • θ ˙ —rotational velocity (lean velocity);
  • ϕ —rotation angle of the reaction wheel;
  • ω —rotational velocity of the reaction wheel;
  • T m —torque generated by the reaction wheel.
Now, the torque can be expressed as
m g l r sin ( θ ( t ) ) + m w g l w sin ( θ ( t ) ) J w ( ω ˙ ( t ) + θ ¨ ( t ) ) = J θ ¨ ( t ) ,
with
  • m—mass of the bicycle without the mass of the reaction wheel;
  • g—gravity force;
  • l r —distance between the ground level and the center of mass of the bicycle;
  • m w —mass of the reaction wheel;
  • l w —distance between the ground and the axis of rotation of the reaction wheel;
  • R—reaction wheel radius;
  • r—inverted pendulum radius (the bicycle is treated as an inverted pendulum);
  • J—auxiliary moment of inertia;
  • J w —moment of inertia of the reaction wheel with respect to its axis of rotation
and
J = m w ( 0.5 R 2 + l w 2 ) + 1 12 m ( 3 r 2 + l w 2 ) + m l w 2
Taking reaction wheel actions into account, the following state-space vector can be used to describe the state of the bicycle:
x ̲ ( t ) = x 1 ( t ) x 2 ( t ) x 3 ( t ) x 4 ( t ) = θ ( t ) θ ˙ ( t ) ϕ ( t ) ω ( t ) .
In the Engineering Rev Kit 2 documentation, one can find the nominal values of the parameters listed above, resulting in the following state-space description:
x ˙ ̲ ( t ) = x 2 ( t ) 92.1727 sin ( x 1 ( t ) ) 333.333 u ( t ) x 4 ( t ) 92.1727 sin ( x 1 ( t ) ) + 11844.120 u ( t ) = f ̲ ( t , x ̲ , u ) ,
with the input signal u ( t ) = T m ( t ) .
Let us linearize the nonlinear description of the model in the following unstable equilibrium point:
f ̲ ( t , x ̲ , u ) = 0 ̲ x ̲ 0 = 0 0 0 0 , u 0 = 0 ,
resulting in
A = f ̲ x ̲ x ̲ = x ̲ 0 , u = u 0 = 0 1 0 0 92.1727 0 0 0 0 0 0 1 92.1727 0 0 0 ,
b ̲ = f ̲ u x ̲ = x ̲ 0 , u = u 0 = 0 333.333 0 11844.120
in the model
x ˙ ̲ ( t ) = A x ̲ ( t ) + b ̲ u ( t ) .
Since this model reflects the real dimensions of the hardware to be used in the future research stage, the authors have decided to perform the linearization on the basis of the Simulink nonlinear model of the bicycle using the Model Linearizer Tool, assuming zero initial conditions and no constraints imposed on the control signal. This procedure resulted in the following state-space model:
x ˙ ̲ ( t ) = 0 1 0 0 93.1217 0 1.0619 × 10 7 0 0 0 0 1 93.1250 0 0.0192 0 x ̲ ( t ) + 0 246.216 0 11004 u ( t ) ,
and
y ̲ ( t ) = 1 0 0 0 0 1 0 0 0 0 1 0 0 0 0 1 x ̲ ( t )
as the output vector. This state-space model will eventually be taken as the source of the values of the parameters taken for further consideration.
Matrix A of this model has the following eigenvalues:
λ ̲ = 9.6500 9.6500 0.1386 0.1386 ,
connected to the unstable behavior of the bicycle without actuation.

3. Considered Control Strategies

3.1. Software-in-the-Loop Model Development for Reaction Wheel Actuation

In order to perform simulations of a complete setup, to reflect the behavior of the real bicycle robot, one needs to read a technical specification of the DC motor with the encoder from the setup, providing the information about the operating voltage (12 V DC) along with the stall current value of 1.85 A.
Now, the parameters of the DC motor installed in the system can be calculated from the following dependency:
k t = M I = k e = U ω ,
where U = 12   V , I = 1.85   A , ω = 7000   rpm = 733.038   rad sec . The maximum torque of the motor becomes
T max = U I ω = 0.0302   Nm ,
which has been taken as the constraint level in the simulations carried out in the paper, related to the reaction wheel torque. In order to balance the system, it must hold that
T max T g + T w .
Based on this value, one can obtain the maximum deflection of the inverted pendulum from the unstable equilibrium point, which the motor is capable of bringing back to the vertical position, by compensation of the gravity force:
T g = 0.497   kg · 9.807 m sec 2 · 0.065   m · sin ( θ m a x ) , T g = 0.03   kg · 9.807 m sec 2 · 0.13   m · sin ( θ max ) , T m = 0.0302   Nm , θ max = asin 0.0302 0.355 = 0.08517   rad = 4 . 88 .
The considered model has been discretized with T s = 0.05   s using step-invariant transformation into
A d = 1.1187 0.0520 0 0 4.8389 1.1187 0 0 0.1187 0.002 1 0.05 4.8391 0.1187 0.001 1 ,
b ̲ d = 0.3138 12.7941 13.7609 550.6824 ,
C d = 1 0 0 0 0 1 0 0 0 0 1 0 0 0 0 1 ,
resulting in the following eigenvalues of A d :
λ ̲ A = 1.6201 0.6172 0.9931 1.007 .

3.2. LQR Controller for the Operation of the Reaction Wheel

A full-state feedback control has been used at the first stage of the research, minimizing the standard LQR performance index [17]
J = t = 0 x ̲ t T Q x ̲ t + u ̲ t T R u ̲ t ,
where, Q = diag ( q i ) and R = diag ( r i ) have been chosen to conform to Bryson’s rule [18]:
q i = 1 max deflection in x i 2 ,
r j = 1 max deflection in u j 2 .
In the considered case,
q 1 = 1 θ max 2 = 137.8563 ,
q 2 = 1 θ ˙ max 2 = 11.11 ; θ ˙ max = 0.3 rad sec ,
where θ ˙ max 2 is a halved value of the maximum rotational velocity allowing the system to stabilize subject to constraints imposed on the control signal. The other weights become
q 3 = 1 ( 60 π ) 2 = 0.00168 ,
as per 30 rotations of the reaction wheel, and
q 4 = 1 733 . 038 2 = 0.00000186 ,
since 733.038   rad sec mirrors the maximal rotational velocity of the motor. The weight of the control signal is
R = R = 1 T m 2 = 1 0.0302 2 = 1096.44 .
Following the LQR calculation of the optimal control law, one obtains
Q = 137.8563 0 0 0 0 11.11 0 0 0 0 0.00168 0 0 0 0 0.00000186 ,
R = 1096.44 ,
k ̲ = [ 0.9147 , 0.1026 , 0.0005 , 0.0004 ] T ,
u t = k ̲ T x ̲ t .
The closed-loop system matrix
A c = 0.8316 0.0198 0.0002 0.0001 6.8644 0.1945 0.0065 0.0047 12.469 1.4105 1.0071 0.0551 498.8918 54.4047 0.2826 1.2023
has the following eigenvalues:
λ ̲ c = 0.2527 0.8975 + 0.1018 i 0.8975 0.1018 i 0.7887
with all the magnitudes below 1.
The control signal in all the simulations has been constrained so as to mimic the voltage constraint of the real system at the level of 0.03 Nm.

3.3. LQI Controller for the Operation of the Reaction Wheel

The state vector has alternatively been extended with four additional entries, corresponding to integrals of the errors [19]:
x ̲ ext , t = [ x ̲ t T , x ̲ I , t T ] T ,
where x ̲ I , t is the appropriate integral of the full-state vector with the vice-last component non-zero to compensate for excessive energy use connected to the rotating reaction wheel, and
u t = [ k ̲ T , 0 , 0 , 10 9 , 0 ] T x ̲ ext , t .
The reason to affect the motion of the reaction wheel is given rise by the research reported in the paper cited above, when the wheel exhibited constant rotations when the bicycle was in an upright state, to reduce the energy consumption.

3.4. Robustified Version of the LQR Controller for the Operation of the Reaction Wheel

The robustified version of the LQR controller takes into consideration the so-called actuator failure model, which basically takes the form [19]
u t F = ρ u t
where 0 ρ ρ ρ + , ρ 1 , ρ + 1 mimics the uncertainty introduced either by improper actuation, or modeling errors, in this case—introduced by the adopted linearization scheme, or mismodeling of certain phenomena. Additionally, limits imposed on the control signal also admit to such a failure model, where the calculated control signal (vector) is not all all time instants the same as the applied control signal (vector).
Having adopted this strategy, in the current paper, 0 δ < 1 defines the certainty level as ρ = 1 δ , ρ + = 1 + δ , along with the algorithm presented below, enabling one to recalculate the optimal feedback vector k ̲ to introduce robustness.
This algorithm is left here without any other description, referring the reader to the prior publication of one of the authors of this paper on the topic, providing the detailed formulae [19].

3.5. Extension of the Considered Software-in-the-Loop Model to Handlebar Actions

When the handlebar is in use, the bicycle has an additional torque exerted on the front frame, and the angle α refers to the angle between the planes of the two wheels. Thus, the state vector is extended to x ̲ ( t ) R 6 , with α ( t ) and α ˙ ( t ) as the last two entries (discretized when a discrete-time model is in use).
Any non-zero angle α results in both centrifugal force and a lean angle of the bicycle when the robot is in motion. An additional component which appears is T g , which introduces an additional degree of freedom, and the so-called steer equation, whose derivation is out of the scope of this paper (strong coupling effects between lean angle, centrifugal force, net force due to gravity when the bicycle leans over, etc.) [20]. Having launched the Model Linearizer Tool in the case of the input vector
u ̲ ( t ) = T m ( t ) T h ( t ) ,
where the last component corresponds to the torque exerted on the handlebar, and discretizing the model with T s = 0.05   s using step-invariant method results in
A d h = 1.1169 0.0519 0 0 0.0003 0 4.7663 1.1169 0 0 0.0144 0.0003 0.1169 0.0019 1.0000 0.05 0.0003 0 4.7664 0.1169 0.0010 1 0.0144 0.0003 0.0845 0.0014 0 0 1.1471 0.0524 3.5250 0.0845 0 0 6.0244 1.1471 ,
B d h = 0.3260 0.5881 13.2873 23.9097 13.7687 0.5882 550.9967 23.9149 0.5882 28.3359 23.9146 1160.1 ,
C d = I
with the eigenvalues of A d h as
λ ̲ h = 1.7119 1.6114 0.5841 0.6206 0.9931 1.0070 .

3.6. LQR Controller for the Operation of the Reaction Wheel and the Handlebar

A full-state six-element feedback control has been used at the second stage of the research, with the entries of Q and R defined as in the original case, and now extended with
q 5 = 1 ( π 2 ) 2 = 0.4053 ,
as the maximum value of α is 90 , and
q 6 = 1 5.6 2 = 0.03188 ,
since the system gets destabilized when | α ˙ |   >   5.6   rad sec , along with
r 2 = 1 0.2255 2 = 19.6655 ,
to mimic the maximum torque generated by the handlebar motor of 0.2255 Nm.
Following the LQR calculation of the optimal control law, one obtains
Q = 137.8563 0 0 0 0 0 0 11.111 0 0 0 0 0 0 0.00168 0 0 0 0 0 0 0.00000186 0 0 0 0 0 0 0.4053 0 0 0 0 0 0 0.03188 ,
R = 1096.443138 0 0 19.6655 ,
K = 0.9037 0.1014 0.0005 0.0004 0.0241 0.0022 0.0543 0.0039 0.0001 0 0.0095 0.0011 ,
u ̲ t = K x ̲ t .
The closed-loop system matrix
A h c = 0.8543 0.0211 0.0001 0.0001 0.0020 0 5.9426 0.1382 0.0052 0.0037 0.0804 0.0017 12.294 1.3923 1.0071 0.0549 0.3266 0.0298 491.8737 55.6754 0.2824 1.1973 13.0639 1.1926 0.9231 0.0486 0.0017 0.0011 0.8934 0.0213 38.8828 1.9798 0.0681 0.0455 4.3679 0.127
has the following eigenvalues:
λ ̲ h c = 0.0004 0.278 0.8943 + 0.1041 i 0.8943 0.1041 i 0.7798 0.8399
with all the magnitudes below 1, again.
The control signals in all the simulations have been constrained so as to mimic the voltage constraints of the real system.

3.7. Event-Triggered Control Strategy Under LQR Framework with Robustness

Let x ̲ t denote the last received measurement of the state of a bicycle, with the corresponding control vector/signal u ̲ t = K T x ̲ t . The next update of the control action takes place on the basis of a triggering condition; otherwise, the control action is kept unchanged.
On the basis of [21,22], the following event-triggering condition is considered:
u ̲ t u ̲ t + l 2   >   γ u ̲ t + l 2
where γ > 0 actually defines the threshold value, and the control action is calculated as previously discussed. As in [23], the index l increases as long as the control action is updated, saving the computational burden of the processor, and at the same time, reducing the occupation of the communication channel.
This strategy can be easily applied to the considered robustified LQR as well. Its major task is to reduce the energy consumption of the control law, as well as to reduce wear and tear effects in mechanical parts, as the input torque is altered less frequently, preserving the overall performance of the system at a similar level.

3.8. Can the ETC Mechanism Be Applied to the Other State-of-the-Art Approaches?

It is vital to show the advantage of the proposed approach in terms of computational burden, related to the necessary calculations, in comparison to the state-of-the-art solutions. The first one to compare with would be the model predictive control, where the control performance index mirrors the one from (21), though calculated for a limited number of samples in the prediction horizon N
J = j = 0 N 1 x ̲ t + j T Q x ̲ t + j + u ̲ t + j T R u ̲ t + j .
On the basis of minimization of (52), for X ̲ = [ x ̲ t + 1 T , , x ̲ t + N T ] T a set of future control inputs is calculated U ̲ = [ u ̲ t T , , u ̲ t + N 1 T ] T .
Using the transition matrix Φ , and by introduction of Γ matrix in the forms
Φ = A A 2 A N , Γ = B 0 0 AB A 0 A N 1 B A N 2 B B ,
the cost function can be rewritten as
J = X ̲ T Q ˜ X ̲ + U ̲ T R ˜ X ̲ ,
with Q ˜ = diag { Q , , Q } , R ˜ = diag { R , , R } .
Now, substituting X ̲ = Φ x ̲ t + Γ ̲ U ̲ the performance index can be rewritten as
J = ( Φ x ̲ t + Γ ̲ U ̲ ) T Q ˜ ( Φ x ̲ t + Γ ̲ U ̲ ) + U ̲ T R ˜ U ̲
1 2 U ̲ T H U ̲ + U ̲ T F T x ̲ t ( dropping the constant term ) ,
with
H = 2 ( Γ T Q ˜ Γ + R ˜ ) , F = 2 Φ T Q ˜ Γ .
Finally, the control vector in the prediction horizon of N samples becomes
U ̲ = H 1 F x ̲ t .
Naturally, this mechanism can allow an easy implementation of the ETC methodology, and also offers comparable performance in a finite prediction horizon, as per the same Q and R matrices used in comparison with the LQR approach.
In the case of a sliding-mode control, the control is aimed at keeping the states on the so-called sliding surface σ ( x ̲ ) , which is selected individually for the specific system [24]. In addition, the selection criteria for this surface are also related to the desired dynamics of the closed loop, which in the considered case are of the fourth order. This control strategy is not suited for the introduction of the ETC mechanism, as it could potentially prevent the system from reaching (and staying on) the sliding surface, by infrequent changes in the controls.
In this paper, the SMC algorithm was implemented by the use of a sliding-mode controller Simulink block, developed in Matlab 2025a, where the sliding surface is automatically selected.

4. Results

4.1. Verification of the Adopted Linearization Scheme

In order to clearly verify the applicability of the linearized model of the system, with the linearization point taken as
x ̲ 0 = 0 0 0 0 , u 0 = 0 ,
and based on the LQR control law, from the range of the initial conditions allowing stabilization, three different initial conditions have been selected with the results depicted in Figure 2.
As expected, the linear model gives a worse result of approximating the nonlinear dynamics of the accurate model of the real bicycle, as per the linearization made in proximity of the unstable equilibrium point. However, on the basis of the indirect Lyapunov method [25], it is fully supported to use this linearized model as the starting point to calculate the feedback gains in the LQR control.

4.2. Simulation-Based Basin of Attraction Estimation

For the selected nonlinear control system, and the LQR gains calculated with no robustness ( δ = 0 ), a set of simulations has been performed to estimate the area of initial conditions to result in attracting the ( x 1 , x 2 ) phase trajectory to the origin. The results are presented in Figure 3.
The green color depicts the configuration of the initial conditions, resulting in a stable closed-loop system, whereas the red color mirrors the configurations for which it was impossible to stabilize the system, as per active constraints disabling proper stabilization.

4.3. LQI Considerations

The extended control law, by adding the LQI term, surprisingly gave no improved results in terms of x 3 or x 4 —it was expected that the added gain penalizing excessive rotations of the reaction wheel (extending the battery use time and keeping voltage levels with real hardware) would allow this variable to stay closer to zero. On the contrary, in the range of the feedback gain smaller than 10 7 in the absolute value change, only negligible changes have been observed, and for the values exceeding 10 7 in the absolute value change, deterioration of control performance has been observed, see Figure 4.
This could only be explained in this way that the design parameters of the LQR control law have already pushed the dynamics of the closed-loop system to its limits, with no further improvement possible. The origin of this step was the will to compare the results to [19], where a clear improvement both in energy consumption connected to constant operation of the reaction wheel, as well as improvement in overall performance, has been observed. The other reason could potentially be connected to the fact that the simulations performed used a software-in-the-loop model that does not mirror all the nonlinear phenomena connected to the real hardware.

4.4. LQR-Based Stabilization of the Bicycle Using a Reaction Wheel Only with Introduced Robustness

In the control system with the LQR feedback vector calculated on the basis of the algorithm with the introduced robustness degree defined by δ , a series of simulations has been performed over a grid of δ values, to depict the core differences in dynamics between the obtained control systems.
Figure 5 presents the simulation results both for the system with classical LQR ( δ = 0 ), as well as for the controller taking robustness into account ( δ > 0 ). It is clearly visible that the improvement in the lean angle regulation and its first derivative performance is strongly coupled with the effort of the control signal, and the energy consumed by the motor driving the reaction wheel. Relating to [19], the increase in δ allows one to take the unmodeled uncertainties inherently connected to the introduction of a linearized model into account, to improve the overall performance of the control system, as they mimic actuator fault-like behavior, where ‘inaccurate’ control signal is fed to the system (or, otherwise, ‘unexpected’ reaction to a proper control signal is observed, due to mismodeling errors).
In order to evaluate the performance of the system in terms of the energy-connected performance indices, the following quantities have been calculated for every δ ,
J x j = i = 0 N x j , i 2 ,
J u k = i = 0 N u k , i 2 ,
where j = 1 , , 4 refers to the state vector entry, and N is the number of samples per observation horizon, k is the control input number, and is presented in Figure 6 and Figure 7. As can be seen, introduction of robustness ( δ > 0 ) enables one to obtain improved performance indices in comparison to a classical LQR control ( δ = 0 ), in terms of reduction of lean angles, their derivatives, and what is of paramount importance—in terms of input energy consumption.
The latter is possible due to the modification of the feedback vector k ̲ by the robustifying approach to the values presented in Figure 8.
As can be seen in Figure 6 and Figure 7, it is possible to reduce the absolute value of x 1 and x 2 by a slight deterioration in the operation of the reaction wheel, while reducing the energy consumption in the control channel over a wide range of δ parameters, what is inherently connected to stability margins of a classical LQR control, supported here by the robustifying algorithm. In a real system, this would improve the performance, while increasing the battery lifetime, and prevent voltage drops.
The corresponding plots, taken for the LQI approach referred to previously, are presented in Figure 9. The results clearly indicate that there is no reason to include the LQI term in the system, as per the increase in all the performance indices’ components, when the gain attached under the LQI framework exceeds 10 7 . On the contrary, as per the values of x 3 vs. the other state vector components, a decrease in this coefficient results in no impact on the overall control performance.

4.5. LQR-Based Stabilization of the Bicycle Using Reaction Wheel and Handlebar Actions with Introduced Robustness

Similar tests were performed in the case of two control inputs, namely input torque to the reaction wheel ( u 1 ), and to the handlebar ( u 2 ). The performance indices related to components of the state vector have been presented in Figure 10, and to the control efforts in Figure 11, followed by the program of gain scheduling for various δ s (Figure 12).
Again, it can be observed that the performance indices connected to the first two, and the last two components of the state vector get reduced in comparison with the classical LQR ( δ = 0 ), but also comparing the first two with the case without a handlebar—the initial values of the plots with δ = 0 are lower in comparison with the case for a reaction wheel actuation.
The same holds for the energy use, where the two components presented in Figure 11 have been reduced as well. Naturally, the overall consumption by summing up the two values for δ = 0 is greater in comparison to the situation from Figure 7; nevertheless, two control inputs are available here.
All the plots presented in Figure 13 provide a time-based comparison of the performance presenting the improvement in dynamics, of the state components (comparable between no handlebar and handlebar configurations, connected to the ability to make more agile movements, by taking advantage of the two control inputs).
It is to be mentioned again that all control inputs are saturated on the limit corresponding to the maximum torques, which can be generated by the real motors, mirrored here in the simulation environment.

4.6. Towards the Event-Triggered Control Actions in the Stabilization Task

The last stage of the research referred to the use of event-triggered control actions, to be updated whenever the considered event-triggering condition defined by γ is met. For γ = 0 one obtains the classical LQR (with δ = 0 ) or the robustified one with δ > 0 .
At first, in the case of the situation with the reaction wheel actuation only, it can be seen in Figure 14a that the performance indices connected to the lean angle are over a wide range of δ below the case with no ETC condition. Thus, the precision of stabilization gets improved, with comparable velocities of the angular deflection, while usually providing also better results in the case of the reaction wheel operation (compare performance for x 3 and x 4 ).
The latter is also visible in Figure 15a where one can see improvement energy consumption indices’ values, reducing with an increase in δ . The less frequent the triggering is (i.e., greater values of γ ), the lower energy consumption becomes. This is connected to changes in controls over time.
When two control inputs are available, it turns out that on the basis of the performance indices presented in Figure 14b that the results comparable to the single actuator only case can be seen only for greater values of δ for small values of γ , i.e., for frequent actuation. In Figure 15b, one can see no deterioration in energy consumption of the first control channel, but it is visible in the second one, as it is the handlebar which takes over the major action of improving the performance.
This can be avoided by adding linear velocity to the bicycle, to exert stabilization in motion, where far less control effort should be put into the handlebar, which will be the core of future hardware-in-the-loop experiments.
On the basis of the time plots, one can identify the reason for a potential increase in performance indices, summing up over a longer period of time. In Figure 16a when robustness is introduced, one can see a long tail in time response, and a lower speed of operation of the reaction wheel x 4 , resulting in greater angles x 3 . In turn, these results in terms of x 1 and x 2 and the comfort of the potential user, present no oscillatory behavior.
The same holds for the case presented in Figure 17a, where the changes in control vector have smaller absolute values and are non-zero over a wider time horizon. At the same time, they compensate model imperfections, not resulting in agile movements, aligning with the principles of robust control.
Figure 16b and Figure 17b present the plots for two control inputs, where the operation of the ETC mechanism is clearly seen. Some of the solutions in terms of time plots are similar to one another, and should be observed in terms of qualitative comparison, from the viewpoint of the performance. However, one should bear in mind that despite certain costs connected to the robustifying algorithm, in terms of a slight deterioration in channel u 2 , the robust control law is obtained, tolerating control imperfections.
In Figure 18, the number of triggering actions has been depicted, showing the reduction from 600 in the case without ETC, down to even 100 in the case of the event-triggering mechanism considered. This admits to avoiding occupation of communication channels, in passing control actions to the system, as well as reducing frequent changes in control inputs, reducing negative effects on the mechanics of the system. One could also visualize the reduction in the computational burden of the edge devices, where less frequent control updates take place, without major deterioration of the performance, and by the use of the robustifying algorithm—improvement over a wide range of δ s .
Also in terms of average value of differences across control updates is concerned (Figure 19), and their variances (Figure 20), the mean value of the control torques remains the same over the first 5 s of each simulation (there is no reason to calculate means or variance distorted by near-zero signal values in 30 s duration of the experiment), and the variance of changes is clearly greater in the case of less frequent control updates, what compensates effects of reduction of the occupation of communication channels.
Table 1 summarizes the energy reduction connected to the control input signal with reference to Figure 15a and Figure 18a, identified as the best configuration visible in the plots. These results have been obtained in the case of robustified ETC with a reduction of triggers down by −83% in the best configuration. Mean reduction of the number of triggers was 58 % ( δ = 0.1 ), 67 % ( δ = 0.2 ), 75 % ( δ = 0.5 ), and 80 % ( δ = 0.5 ).
Table 2 summarizes energy reduction (taken for channel 1, as channel 2 energy is one order of magnitude lower, thus the changes are negligible) connected to the control input signals with reference to Figure 15b and Figure 18b, identified as the best configuration visible in the plots.
These results have been obtained in the case of robustified ETC with reduction of triggers (taking two separate triggering conditions into account for two control inputs) down by −61% in the best configuration. Mean reduction of the number of triggers was 21 % ( δ = 0.1 ), 24 % ( δ = 0.2 ), 43 % ( δ = 0.5 ), and 51 % ( δ = 0.5 ).
In Figure 21, a sample experimental run for x 1 ( 0 ) = 2 , and the remaining entries of the state vector are set to zero, to compare the performance between LQR and ETC on a real platform has been added, to depict that this control scheme is deployable and allows stabilization of the bicycle robot. However, due to the issues connected to the implementation, noise mitigation, and mismodeling issues, the complete vast statistical campaign to present the properties of the considered control actions, is left as a future research direction.
The deployment on the real hardware requires necessary changes to the strategy of taking tests, initial condition changes, and at this point is premature to be presented here.
In Table 3 and Table 4 the comparison between different ETC versions has been presented, showing the impact of γ on the overall performance. Starting from the bottom row, it can be easily identified that the greater the threshold value, the more frequent the updates become; thus, it is vital to keep this value low. The first two rows present the already-introduced performance indices related to the lean angle and its dynamics—the degradation does not introduce drastic changes, and is completely acceptable. The overall control energy-related performance index is also not hardly affected, which leads the way to a reduction of mechanical wear and tear effects by the introduction of ETC.
The same holds in the case when robustness is introduced to ETC ( δ = 0.3 ), since not only lean angle-related performance indices are reduced by roughly 10%, but also the control effort is reduced by approximately 5–8%, whilst the control updates are reduced as well. The increase in γ results in the same effects as in the case of no robustness introduced, since the lean angle-related performance indices get reduced, at the cost of an increase in the frequency of updates (control energy consumption decreases in such a case).
Table 5 and Table 6 mirror the same situation, but for another initial lean angle. The control energy performance index reduction for the robustified case reaches down to 1–4%, which is less impressive, however, it is still coupled with a decrease in frequency of control updates. It is to be kept in mind though that for another δ (consult the initial tests of the impact of δ ), the situation might be more advantageous, as a broad range of reduction of performance indices has been identified for greater values of data, as per mismodeling errors.
Definitely, as a direction of future research, one could list the selection rule for γ to obtain improved control performance, which would lead to the presentation of a combined, possibly weighted, performance measure capturing various features of the event-triggered control system.

4.7. Alternative Approaches—Can the ETC Mechanism Be Applied to the State-of-the-Art Methods?

It is natural to discuss whether the other state-of-the-art methods are comparable or offer similar performance in comparison to the LQR+ETC mechanism, or at least the pure LQR mechanism. For this purpose, the authors have decided to put into direct comparison the MPC method for the prediction horizon of N = 10 samples, with Q and R matrices as in the case of the considered LQR algorithm. The results in terms of the performance indices are presented in Table 7 and Table 8. The calculated control vector has the first entry applied to the system, post-saturation. It can be seen that the MPC has strong averaging properties, thus the performance indices in terms of x 1 are almost doubled values of the LQR ones, and the improvement is observed for intermediate values of γ , since the prediction mechanism allows good control quality when the distortion of the sequence of control inputs is not present to a high extent, i.e., for low values of γ . Nevertheless, over a wide range of γ s the performance index for x 1 stays roughly the same, as in the case of LQR control. The averaging properties are also seen as far as x 2 is considered, as lower values of the performance index are visible for comparable initial lean angles, or comparable γ s . That is interesting, the MPC framework has much greater control updates in comparison to the LQR case, this is why the reduction in the number of control updates is less visible for small values of γ , also in the case of the higher initial lean angle. Otherwise, the MPC controller could not get full control authority over the unstable system.
Since the performance in terms of control indices connected to the input action, along with x 1 as the major effect of the control effort is considered, the intermediate values of γ are favored, to allow the ETC mechanism to mimic the trajectories feasible with γ = 0 .
The situation in the case of the second considered state-of-the-art solution is more complicated, as one cannot actually impose ETC mechanism without major performance deterioration, if not even instability. To keep the system on the sliding surface, the control should switch constantly, and the sequence of switch-overs cannot be modified, without divergence from the sliding motion, actually showing no features of SMC in this case.
The former is the reason why tests for various values of γ were not performed, and the performance indices forming the first column of the to-be-conducted simulations are 7.231, 157.88, 4261.7, 2255.3, and 0.3604, respectively. This shows much greater energy of the control signal necessary to keep the system on the sliding surface, actually deterring the purpose of the paper, to introduce the ETC mechanism in order to lower the energy consumption, while maintaining the overall performance at a comparable level.
The final comparison in terms of responses to the same initial lean angle has been presented in Figure 22, where the introduction of the ETC mechanism to LQR and MPC results in comparable performance, with the LQR method more applicable in the environment where the computational burden should be lower.
A short summary of the considered approaches is presented in Table 9, to evaluate possible suitability for the adopted task. It is to be stressed that the strategies have been compared in their classical (most commonly met) forms, with special view of the possibility to calculate the state feedback vector off-line, to introduce robustification ( δ ), to introduce a triggering mechanism, as well as introduction of low performance degradation in comparison to the case without ETC. As a result, the LQR method outperforms the other classical ones due to the reasons listed above in this Section.

5. Conclusions and Future Research Directions

The paper addressed the complex challenge of stabilizing an inherently unstable bicycle robot at zero speed where balancing is most difficult. The key problem here is maintaining upright balance while keeping energy consumption low—so crucial for future personal electromobility vehicles such as electric bicycles, allowing the handicapped to potentially indirectly use the proposed control solution advantages. To tackle this, the authors model bicycle dynamics and develop feedback control strategies including linear quadratic regulation, its robustified version, and an event-triggered control scheme. The approaches use reaction wheel actuation and handlebar action to stabilize the bicycle with an emphasis on (energy) efficiency. Event-triggered control significantly reduces control signal updates, thereby saving energy and reducing mechanical wear without compromising stability. Simulation results demonstrate robust stabilization across a range of initial lean angles and operational uncertainties, highlighting improved energy profiles and stability margins by means of time-domain analysis. This is particularly important for extending battery life and improving user comfort and safety in autonomous or rider-assisted vehicles. By integration of robust and event-based techniques, the study offers a promising direction for efficient balancing control in emerging personal electromobility platforms, contributing to sustainable and accessible urban mobility solutions.
Balance support is widely available beyond motorcycles and bicycles as well, also in various personal mobility units aimed at customer use, such as self-balancing scooters or wheelchairs for example. The global market demand is about to exceed USD 2.5 billion in 2025 with growth propelled not only by the adoption of energy-efficient motors, but also, as in this paper, intelligent control systems to maintain stability. All these approaches must be energy-consumption aware.
As the paper actually proposed two alternative control inputs, namely the reaction wheel torque, and the handlebar-related torque, the final discussion should also refer to different operating conditions, and the effective use of the actuators. Reaction wheels consume more electrical energy because they operate by spinning a mass to generate the input torque. This process, related to x 3 and x 4 variables, is clearly seen throughout the paper, and is connected to a sustained energy consumption, even at zero speed. In the case of the handlebar action, this input is more energy-efficient in a broader context, as per the gyroscopic effects of the front wheel and trail geometry, to get a self-stabilization property, not investigated in this paper. Nevertheless, at low or zero speeds, the reaction wheel action is preferred. At low speeds, the combination of reaction wheel and handlebar actions is most effective. The criterion that actuation to use should eventually depend on efficient control allocation, by the use of optimization, as in the case of the discussed LQR approach, where one could connect different weights to the less preferred input, and change these adaptively, subject to the change in the linear speed of the bicycle.
In the future, it is planned to use experimental validation on real hardware to confirm simulation outcomes, also in the context of more advanced control strategies as model predictive control (presented here in simulation context only) and adaptive approaches. These potentially would allow the spread control effort over a wider range of steps, in the case of predictive control, to capture the interplay between prediction horizons and control efforts better. In the case of adaptive control, it would allow alteration of control system parameters to specific conditions, as driving at zero, low, moderate, and high speeds, to build a bank of controllers to switch between. The key challenge here would be to design a switching law to change feedback gains on the basis of a specified condition. Implementing the control schemes on IoT-enabled embedded systems can potentially pave the way for intelligent, connected electromobility platforms. The authors also plan to include trajectory tracking in moving bicycles with variable velocities and external disturbances into account, including taking banked turns. This could eventually support the advantage of using a single actuation method, a combined actuation method for various conditions, as well as taking advantage of a gyroscopic effect, and self-stabilizing property support to better increase the energy efficiency. The major challenge here is to obtain results ready for generalization to other platforms with a similar mechanism of motion. The topic of balancing energy use, stability, and user comfort remains the critical goal for practical, widespread adoption.

Author Contributions

Conceptualization, D.H. and A.F.; methodology, D.H.; software, A.F.; validation, A.F., D.H. and R.I.; formal analysis, D.H. and R.I.; investigation, A.F. and R.I.; resources, D.H.; data curation, A.F.; writing—original draft preparation, D.H. and A.F.; writing—review and editing, D.H. and R.I.; visualization, A.F.; supervision, D.H.; project administration, D.H.; funding acquisition, D.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Poznan University of Technology under grant no. 214/SBAD/0247.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Solntsev, A.; Asoyan, A.; Nikitin, D.; Bagrin, V.; Fediushkina, O.; Evtykov, S.; Marusin, A. Influence of temperature on the performance and life cycle of storage batteries. Transp. Res. Procedia 2021, 57, 652–659. [Google Scholar] [CrossRef]
  2. Widmer, F.; Ritter, A.; Duhr, P.; Onder, C.H. Battery lifetime extension through optimal design and control of traction and heating systems in hybrid drivetrains. eTransportation 2022, 14, 100196. [Google Scholar] [CrossRef]
  3. Nzereogu, P.; Oyesanya, A.; Ogba, S.; Ayanwunmi, S.; Sobajo, M.; Chimsunum, V.; Ayanwunmi, V.; Amoo, M.; Adefemi, O.; Chukwudi, C. Solid-State lithium-ion battery electrolytes: Revolutionizing energy density and safety. Hybrid Adv. 2025, 8, 100339. [Google Scholar] [CrossRef]
  4. Lee, K.; Sener, I.N. E-bikes Toward Inclusive Mobility: A Literature Review of Perceptions, Concerns, and Barriers. Transp. Res. Interdiscip. Perspect. 2023, 22, 100940. [Google Scholar] [CrossRef]
  5. Magalingam, S.K.; Muthuswamy, P.; Sekar, P.; Muniyan, R.; Raj, A.S. Design and fabrication self balancing single-wheel vehicle. AIP Conf. Proc. 2024, 3042, 020024. [Google Scholar] [CrossRef]
  6. Saguchi, T.; Yoshida, K.; Takahashi, M. Stable running control of autonomous bicycle robot. Nihon Kikai Gakkai Ronbunshu C Hen/Trans. Jpn. Soc. Mech. Eng. Part C 2007, 73, 2036–2041. [Google Scholar] [CrossRef]
  7. Huang, W.H.; Nguyen, P.T.T.; Nguyen, D.D.; Doan, H.P.; Chuang, M.Y.; Kuo, C.H. Stability Control and Path Tracking of a Self- Balancing Bicycle With a Reaction Wheel. IEEE/ASME Trans. Mechatronics 2025, 30, 2928–2936. [Google Scholar] [CrossRef]
  8. Brizuela-Mendoza, J.A.; Astorga-Zaragoza, C.M.; Zavala-Río, A.; Canales-Abarca, F.; Martínez-García, M. Gain-scheduled linear quadratic regulator applied to the stabilization of a riderless bicycle. Proc. Inst. Mech. Eng. Part I J. Syst. Control Eng. 2017, 231, 669–682. [Google Scholar] [CrossRef]
  9. Zhu, X.; Deng, Y.; Zheng, X.; Zheng, Q.; Chen, Z.; Liang, B.; Liu, Y. Online Series-Parallel Reinforcement-Learning- Based Balancing Control for Reaction Wheel Bicycle Robots on a Curved Pavement. IEEE Access 2023, 11, 66756–66766. [Google Scholar] [CrossRef]
  10. Li, C.; Xie, Y.F.; Wang, G.; Zeng, X.F.; Jing, H. Lateral stability regulation of intelligent electric vehicle based on model predictive control. J. Intell. Connect. Veh. 2021, 4, 104–114. [Google Scholar] [CrossRef]
  11. Chen, L.; Liu, J.; Wang, H.; Hu, Y.; Zheng, X.; Ye, M.; Zhang, J. Robust control of reaction wheel bicycle robot via adaptive integral terminal sliding mode. Nonlinear Dyn. 2021, 104, 2291–2302. [Google Scholar] [CrossRef]
  12. Arduino Engineering Kit Rev 2 (Documentation). 2025. Available online: https://edu-content-preview.arduino.cc/content-preview/university/project/CONTENTPREVIEW+AEKR2 (accessed on 20 October 2025).
  13. Seuret, A.; Tarbouriech, S. Data-driven event-triggering mechanism for linear systems subject to input saturation. Eur. J. Control 2024, 80, 101045. [Google Scholar] [CrossRef]
  14. Alyahia, S.; Barbalata, C.; Malisoff, M.; Mazenc, F. Dynamic event-triggered control of linear continuous-time systems using a positive systems approach. Nonlinear Anal. Hybrid Syst. 2024, 54, 101508. [Google Scholar] [CrossRef]
  15. Luo, Z.; Zhu, B.; Meng, X.; Zuo, Z. Event-based model predictive control with two-phase predictive detection. J. Frankl. Inst. 2024, 361, 107172. [Google Scholar] [CrossRef]
  16. Xu, W.; Wang, T.; Qiu, J.; Liu, X. A Novel Framework for Game-Based Optimal Event-Triggered Control of Multi-Input Nonlinear Systems. IEEE Trans. Autom. Control 2024, 69, 4867–4874. [Google Scholar] [CrossRef]
  17. Kwakernaak, H.; Sivan, R. Linear Optimal Control Systems; Wiley-Interscience: New York, NY, USA, 1972. [Google Scholar]
  18. Bryson, A.E.; Ho, Y.C. Applied Optimal Control: Optimization, Estimation, and Control; Blaisdell Publishing: Waltham, MA, USA, 1969. [Google Scholar]
  19. Owczarkowski, A.; Horla, D. Robust LQR and LQI control with actuator failure of a 2DOF unmanned bicycle robot stabilized by an inertial wheel. Int. J. Appl. Math. Comput. Sci. 2016, 26, 325–334. [Google Scholar] [CrossRef]
  20. Limebeer, D.J.N.; Massaro, M. Dynamics and Optimal Control of Road Vehicles; Oxford University Press: Oxford, UK, 2018. [Google Scholar] [CrossRef]
  21. Heemels, W.; Donkers, M.; Teel, A. Periodic Event-Triggered Control for Linear Systems. IEEE Trans. Autom. Control 2013, 58, 847–861. [Google Scholar] [CrossRef]
  22. Liu, T.; Jiang, Z.P. Event-based control of nonlinear systems with partial state and output feedback. Automatica 2015, 53, 10–22. [Google Scholar] [CrossRef]
  23. Zietkiewicz, J.; Horla, D.; Owczarkowski, A. Sparse in the Time Stabilization of a Bicycle Robot Model: Strategies for Event- and Self-Triggered Control Approaches. Robotics 2018, 7, 77. [Google Scholar] [CrossRef]
  24. Veselić, B.; Milosavljević, C.; Peruničić-Draženović, B.; Huseinbegović, S.; Petronijević, M. Discrete–Time Sliding Mode Control of Linear Systems with Input Saturation. Int. J. Appl. Math. Comput. Sci. 2020, 30, 517–528. [Google Scholar] [CrossRef]
  25. Liu, J.; Liu, J.; Yan, R.; Ding, T. Deep Lyapunov Learning: Embedding the Lyapunov Stability Theory in Interpretable Neural Networks for Transient Stability Assessment. IEEE Trans. Power Syst. 2024, 39, 7437–7440. [Google Scholar] [CrossRef]
Figure 1. Simscape model of the considered bicycle, where each colour denotes a separate rigid body, interconnected into one structure.
Figure 1. Simscape model of the considered bicycle, where each colour denotes a separate rigid body, interconnected into one structure.
Energies 18 06146 g001
Figure 2. Stabilization of a bicycle robot for various initial lean angles and x 2 ( 0 ) = 0   rad sec , x 3 ( 0 ) = 0   rad , and x 4 ( 0 ) = 0   rad sec .
Figure 2. Stabilization of a bicycle robot for various initial lean angles and x 2 ( 0 ) = 0   rad sec , x 3 ( 0 ) = 0   rad , and x 4 ( 0 ) = 0   rad sec .
Energies 18 06146 g002aEnergies 18 06146 g002b
Figure 3. Initial conditions vs. closed-loop BIBO stability.
Figure 3. Initial conditions vs. closed-loop BIBO stability.
Energies 18 06146 g003
Figure 4. Comparison of LQR- and LQI-driven closed-loop system performance (the two curves overlap).
Figure 4. Comparison of LQR- and LQI-driven closed-loop system performance (the two curves overlap).
Energies 18 06146 g004
Figure 5. Effect of added robustness degree δ on time plots in stabilization task.
Figure 5. Effect of added robustness degree δ on time plots in stabilization task.
Energies 18 06146 g005
Figure 6. Performance indices for the state.
Figure 6. Performance indices for the state.
Energies 18 06146 g006
Figure 7. Performance index for u.
Figure 7. Performance index for u.
Energies 18 06146 g007
Figure 8. Feedback gains.
Figure 8. Feedback gains.
Energies 18 06146 g008
Figure 9. Performance indices for the LQI control.
Figure 9. Performance indices for the LQI control.
Energies 18 06146 g009
Figure 10. Performance indices for the state when handlebar action is included.
Figure 10. Performance indices for the state when handlebar action is included.
Energies 18 06146 g010
Figure 11. Performance index for u ̲ (handlebar action included).
Figure 11. Performance index for u ̲ (handlebar action included).
Energies 18 06146 g011
Figure 12. Feedback gains when handlebar action is included.
Figure 12. Feedback gains when handlebar action is included.
Energies 18 06146 g012
Figure 13. Stabilization of a bicycle robot for different configuration parameters (with and without handlebar action), x 1 ( 0 ) = 2 , x 2 ( 0 ) = 0   rad sec , x 3 ( 0 ) = 0   rad , x 4 ( 0 ) = 0   rad sec .
Figure 13. Stabilization of a bicycle robot for different configuration parameters (with and without handlebar action), x 1 ( 0 ) = 2 , x 2 ( 0 ) = 0   rad sec , x 3 ( 0 ) = 0   rad , x 4 ( 0 ) = 0   rad sec .
Energies 18 06146 g013
Figure 14. Performance indices connected to states’ values.
Figure 14. Performance indices connected to states’ values.
Energies 18 06146 g014
Figure 15. Performance indices for the control input(s)’ values.
Figure 15. Performance indices for the control input(s)’ values.
Energies 18 06146 g015
Figure 16. Time plots (states)—comparison.
Figure 16. Time plots (states)—comparison.
Energies 18 06146 g016
Figure 17. Time plots (controls)—comparison.
Figure 17. Time plots (controls)—comparison.
Energies 18 06146 g017
Figure 18. Number of control triggers.
Figure 18. Number of control triggers.
Energies 18 06146 g018
Figure 19. Mean values of the control signal(s).
Figure 19. Mean values of the control signal(s).
Energies 18 06146 g019aEnergies 18 06146 g019b
Figure 20. Variance of the control signal(s).
Figure 20. Variance of the control signal(s).
Energies 18 06146 g020
Figure 21. Sample experimental run with and without ETC, both for δ = 0 (no handlebar action).
Figure 21. Sample experimental run with and without ETC, both for δ = 0 (no handlebar action).
Energies 18 06146 g021
Figure 22. Comparison of sample LQR, LQR-ETC, MPC-ETC and plain SMC controller performance.
Figure 22. Comparison of sample LQR, LQR-ETC, MPC-ETC and plain SMC controller performance.
Energies 18 06146 g022
Table 1. Comparison of the decrease in the best configuration of energy of the input signal and the alternative solution—reaction wheel actuation (∗ denoted skipped comparison).
Table 1. Comparison of the decrease in the best configuration of energy of the input signal and the alternative solution—reaction wheel actuation (∗ denoted skipped comparison).
vs.LQRLQR + δ ETC
LQR + δ −28%
ETC−5%−4%
ETC + δ −21%−4%−18%
Table 2. Comparison of the decrease in the best configuration of energy of the input signal and the alternative solution—reaction wheel and handlebar actuation (∗ denotes skipped comparison).
Table 2. Comparison of the decrease in the best configuration of energy of the input signal and the alternative solution—reaction wheel and handlebar actuation (∗ denotes skipped comparison).
vs.LQRLQR + δ ETC
LQR + δ −15%
ETC0%−3%
ETC + δ −7%−3%−9%
Table 3. Effect of γ in ETC on the overall performance, δ = 0 , x 1 ( 0 ) = 1 (bold numbers denote small values).
Table 3. Effect of γ in ETC on the overall performance, δ = 0 , x 1 ( 0 ) = 1 (bold numbers denote small values).
δ /perf.ind.0.000.050.100.150.200.250.300.350.400.450.500.550.600.650.70
J x 1 0.5380.5380.5360.5370.5300.5310.5400.5330.5200.5440.5430.5190.5190.5190.519
J x 2 2.7532.7722.7662.7652.8472.8482.7322.7753.0713.1513.1493.0793.0793.0783.078
J x 3 273.0271.1274.9274.1267.3266.9265.2263.4246.0236.7236.9253.9254.9255.1255.0
J x 4 656.0654.1655.2655.1647.4648.3651.8639.5619.6624.9624.6617.0617.9618.0618.0
J u 0.00470.00470.00470.00470.00470.00470.00480.00470.00460.00480.00480.00460.00460.00460.0046
TRG [%]0−41.7−40.2−37.3−34.7−30.5−27.7−27.7−24.7−25.8−25.2−22.3−22.2−22.0−21.8
Table 4. Effect of γ in ETC on the overall performance, δ = 0.3 , x 1 ( 0 ) = 1 (bold denotes small values).
Table 4. Effect of γ in ETC on the overall performance, δ = 0.3 , x 1 ( 0 ) = 1 (bold denotes small values).
δ /perf.ind.0.000.050.100.150.200.250.300.350.400.450.500.550.600.650.70
J x 1 0.5120.5120.5110.5110.5100.5130.5030.5020.5030.5020.5020.5000.5000.4830.483
J x 2 2.5492.5482.5602.5982.6052.5452.7362.7402.7422.7402.7392.7472.7482.9632.963
J x 3 309.9310.2310.2305.9304.5299.3290.0288.2287.8287.5289.6286.9286.9260.3260.3
J x 4 666.0665.7663.1660.1657.7662.0637.0635.2634.9631.7633.8626.3626.1587.7587.7
J u 0.00450.00450.00450.00450.00450.00450.00440.00440.00440.00440.00440.00440.00440.00430.0043
TRG [%]0−45.2−38.8−32.7−29.3−28.5−26.5−27.0−24.5−25.2−25.0−26.3−24.5−25.0−25.0
Table 5. Effect of γ in ETC on the overall performance, δ = 0 , x 1 ( 0 ) = 2 (bold denotes small values).
Table 5. Effect of γ in ETC on the overall performance, δ = 0 , x 1 ( 0 ) = 2 (bold denotes small values).
δ /perf.ind.0.000.050.100.150.200.250.300.350.400.450.500.550.600.650.70
J x 1 1.5181.5191.5141.5181.4971.4961.4941.5281.4791.5371.5361.5331.4501.4501.450
J x 2 6.9496.9906.9786.9997.2127.2397.2367.0017.8578.1238.1298.1197.7297.7297.730
J x 3 773.3770.5779.6776.0757.75750.0752.7738.8701.4667.6666.8667.6717.0717.0717.2
J x 4 185018491848184918271815181518301779176917671766172517251726
J u 0.01310.01310.01300.01310.01290.01280.01280.01320.01280.01330.01330.01330.01270.01270.0127
TRG [%]0−42.3−40.7−39.3−34.2−31.3−29.0−28.2−26.3−25.2−25.0−24.3−22.8−23.0−22.2
Table 6. Effect of γ in ETC on the overall performance, δ = 0.3 , x 1 ( 0 ) = 2 (bold denotes small values).
Table 6. Effect of γ in ETC on the overall performance, δ = 0.3 , x 1 ( 0 ) = 2 (bold denotes small values).
δ /perf.ind.0.000.050.100.150.200.250.300.350.400.450.500.550.600.650.70
J x 1 1.4761.4761.4681.4721.4711.4841.4811.4471.4461.4451.4471.4371.4311.4181.426
J x 2 6.4626.4616.4896.6346.6356.5156.5537.0817.0927.0927.1067.1447.0076.9807.045
J x 3 895.3896.2899.7879.5879.5856.1853.4825.1822.0821.4829.4815.7820.0852.0849.1
J x 4 191719161908189418941908189918241813181218271788178617741780
J u 0.01260.01260.01260.01260.01260.01270.01270.01240.01240.01240.01240.01230.01250.01250.0126
TRG [%]0−49.0−35.8−31.7−29.2−28.7−27.2−26.3−25.7−25.8−25.8−25.7−25.3−25.8−25.3
Table 7. Effect of γ in MPC-ETC on the overall performance, x 1 ( 0 ) = 1 (bold denotes small values).
Table 7. Effect of γ in MPC-ETC on the overall performance, x 1 ( 0 ) = 1 (bold denotes small values).
δ /perf.ind.0.000.050.100.150.200.250.300.350.400.450.500.550.600.650.70
J x 1 1.2191.2181.2191.2211.2321.2381.2351.2251.2111.2151.2241.2321.2391.2731.354
J x 2 2.2782.2832.2942.3252.3242.3882.4252.4412.4802.6092.7032.7952.8822.9983.321
J x 3 521251105103510851595144511151345031493849094876486050735334
J x 4 462746214620462846554662464746294537450545244527453546744957
J u 0.00890.00890.00890.00890.00900.00900.00900.00890.00880.00890.00890.00900.00900.00930.0099
TRG [%]0−28.8−35.5−41.2−40.5−38.7−38.0−36.5−38.5−40.3−42.3−41.0−40.3−40.8−42.0
Table 8. Effect of γ in MPC−ETC on the overall performance, x 1 ( 0 ) = 2 (bold denotes small values).
Table 8. Effect of γ in MPC−ETC on the overall performance, x 1 ( 0 ) = 2 (bold denotes small values).
δ /perf.ind.0.000.050.100.150.200.250.300.350.400.450.500.550.600.650.70
J x 1 3.4633.4633.4683.5173.5333.5273.5033.4383.4813.4113.4383.4283.5983.5663.625
J x 2 5.6725.6835.7355.7835.8365.9776.0506.1906.3436.5286.8227.0227.5427.7798.313
J x 3 145301451914494147391483814681145631420614395139161379513614141121391013899
J x 4 131371313113137133251339513273131921291213079126431268712600131741291313072
J u 0.02460.02460.02460.02490.02510.02500.02490.02440.02470.02420.02440.02440.02560.02550.0261
TRG [%]0−29.0−37.7−40.7−39.3−38.0−39.8−37.3−38.8−38.5−40.0−40.8−41.2−40.7−40.5
Table 9. Comparison of the selected control approaches in the framework presented in the paper.
Table 9. Comparison of the selected control approaches in the framework presented in the paper.
StrategyOff-Line Calc.Robustif.ETC Intr’dPost-ETC Low Perf. Degrad.
LQR++++
MPC+++
SMC+
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Frański, A.; Horla, D.; Iervolino, R. On Increasing the Energy Efficiency and Performance of a Bicycle Robot Stabilization—Towards Event-Triggered Control Considerations. Energies 2025, 18, 6146. https://doi.org/10.3390/en18236146

AMA Style

Frański A, Horla D, Iervolino R. On Increasing the Energy Efficiency and Performance of a Bicycle Robot Stabilization—Towards Event-Triggered Control Considerations. Energies. 2025; 18(23):6146. https://doi.org/10.3390/en18236146

Chicago/Turabian Style

Frański, Adam, Dariusz Horla, and Raffaele Iervolino. 2025. "On Increasing the Energy Efficiency and Performance of a Bicycle Robot Stabilization—Towards Event-Triggered Control Considerations" Energies 18, no. 23: 6146. https://doi.org/10.3390/en18236146

APA Style

Frański, A., Horla, D., & Iervolino, R. (2025). On Increasing the Energy Efficiency and Performance of a Bicycle Robot Stabilization—Towards Event-Triggered Control Considerations. Energies, 18(23), 6146. https://doi.org/10.3390/en18236146

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop