Article

An Adaptive Weight Collaborative Driving Strategy Based on Stackelberg Game Theory

by Zhongjin Zhou 1,2, Jingbo Zhao 1,2,3,*, Jianfeng Zheng 1 and Haimei Liu 3
1 School of Mechanical Engineering and Rail Transit, Changzhou University, Changzhou 213164, China
2 School of Automotive Engineering, Changzhou Institute of Technology, Changzhou 213032, China
3 School of Intelligent Manufacturing and Control Engineering, Shanghai Polytechnic University, Shanghai 201209, China
* Author to whom correspondence should be addressed.
World Electr. Veh. J. 2025, 16(7), 386; https://doi.org/10.3390/wevj16070386
Submission received: 12 June 2025 / Revised: 5 July 2025 / Accepted: 7 July 2025 / Published: 9 July 2025

Abstract

In response to the problem of cooperative steering control between drivers and intelligent driving systems, a master–slave game-based human–machine cooperative steering control framework with adaptive weight fuzzy decision-making is constructed. First, within this framework, a dynamic weight approach is established that accounts for the driver's state, traffic environment risks, and the vehicle's global control deviation to adjust the driving weights between human and machine. Second, the human–machine cooperative relationship with unconscious competition is characterized as a master–slave game interaction, and cooperative steering control under this master–slave game is transformed into a model predictive control optimization problem. Through theoretical derivation, the optimal control strategies of both parties at the equilibrium of the human–machine master–slave game are obtained, and the manipulation actions of the driver and the intelligent driving system are coordinated by balancing the game. Finally, different types of drivers are simulated by varying the parameters of the driver model, and the effectiveness of the proposed driving weight allocation scheme is validated on the constructed simulation test platform. The results indicate that the human–machine collaborative control strategy can effectively mitigate conflicts between human and machine. By giving due consideration to the driver's operational intentions, the strategy reduces the driver's workload. In high-risk scenarios, while ensuring driving safety and providing the driver with a satisfactory experience, it significantly enhances the stability of vehicle motion.

1. Introduction

The emergence of autonomous driving has relieved the burden on drivers and even mitigated the risk of accidents [1,2]. Nevertheless, achieving fully autonomous driving (Level 5) remains a long-term goal. Given the intricate traffic environment and legal complexities, vehicle control still cannot be completely decoupled from drivers [3]. Thus, despite rapid progress in autonomous driving technology, the human–machine co-driving stage will persist for some time into the future. The core technology in human–machine co-driving is human–machine collaborative driving technology. Here, driving authority is not monopolized by one party but is jointly determined through collaboration between the vehicle and the driver, integrating their respective decisions [4].
In the context of human–machine collaborative driving, the real-time driving authority allocation strategy dictates the vehicle's driving trajectory. For example, Fang et al. [5] utilized the tracking error in vehicle state data to quantify the driver's path-following ability under diverse driving conditions and determined the driving authority allocation accordingly: as the driver's ability improved, the driver's weight increased. Xu et al. [6] constructed a multi-factor-driven weight allocation module considering human–machine characteristics and collaborative performance. Taking into account the potential field of environmental risks and the human–machine conflict coefficient, Xie et al. [7] designed an adaptive method to allocate weights between the driver and the automated system. Zhao et al. [8] adopted Gaussian process regression to evaluate driving risks across multiple dimensions, including vehicle speed, the front wheel's steering angle, vehicle stability, and human–machine conflicts, and dynamically allocated the driving authority between human and machine based on the evaluation results. Other investigations have approached the allocation of driving authority from the perspective of the vehicle or the traffic environment. For instance, Liu Jun et al. [9] established a risk potential field and adjusted the driving weights in accordance with the prevailing driving risk: when the driving risk was elevated, the vehicle's driving weight was augmented. Dai et al. [10] employed bargaining games and proposed a utility function for a collaborative driving system, deriving the optimal driving weight allocation scheme by maximizing the system's utility function.
In human–machine collaborative driving, both the driver and the autonomous driving system are capable of independently fulfilling driving tasks and are thus regarded as two autonomous agents. Game theory has proven to be an effective approach to addressing collaborative control between two agents. Zhou et al. [11] established a methodology, founded on game theory and distributed model predictive control, that accounts for the uncertainties in the driving behaviors of surrounding autonomous vehicles and ultimately integrated it into a multi-objective constrained problem. In human–machine shared steering control, Feng et al. [12] incorporated cooperative game theory to determine the level of interactive control and put forward a robust H∞ compensation control approach to enhance the anti-interference capability of the system, thus guaranteeing the stability of the vehicle. Lu et al. [13] integrated a designed game entry mechanism with a re-planned game sequence under different intersection conditions and constructed a variable game model to address the driving conflicts of autonomous vehicles at unsignalized intersections. Liu et al. [14,15] put forward a three-level game-based decision-making framework in which a neural network learns the decision-making outcomes derived from the game approach; this enables autonomous vehicles to respond in a timely manner when interacting with surrounding drivers of diverse driving styles. Li et al. [16] employed game theory to address the conflict between tracking accuracy and driving stability in lane changing for driverless vehicles. Wang et al. [17] introduced cooperative game theory to model the steering interaction of dual controllers: an adaptive weight adjustment strategy based on the predicted output deviation was designed to achieve collaborative optimization of dual steering control, and fault-tolerant steering control is achieved through cooperative game theory even when the steering partially fails.
This study focuses on describing the relationship between the driver and the co-driving controller, as well as on the issue of driving authority allocation. The aim is to alleviate the driver's operational burden while ensuring vehicle driving safety. A master–slave game model of human–machine interaction behavior is constructed. On this basis, an adaptive weight decision-making model grounded in fuzzy control rules is proposed; this model modifies the conditions of the human–machine game equilibrium to accomplish the path-tracking task. The experimental results indicate that the proposed method can effectively reduce the driver's burden, enhance the stability of vehicle motion, and assist drivers in various states to fulfill driving tasks.

2. Modeling of the Vehicle Dynamics and Human–Machine Steering Controllers

2.1. The Control Architecture

The co-driving control system diverges from traditional driving assistance systems. As an independent virtual co-driver, the co-driving controller possesses a certain level of autonomy. It can make autonomous decisions based on environmental perception information to provide assistance to the driver. The human–machine control actions are accomplished through a hybrid approach in accordance with the driving authority allocation, thereby realizing human–machine collaborative steering control. The autonomous driving system constructs a human–machine interaction model by obtaining its own expectation ($T_a$) and the driver's expectation ($T_d$). It predicts the driver's control action ($u_d$) and formulates its own optimal response ($u_a$). The global steering controller computes the global optimal control variable ($u_g$). The fuzzy logic controller takes the driver's state ($D_s$), the degree of deviation between the actual vehicle control input and the global optimal control, and the traffic environment risk ($Risk$) as inputs, and outputs the human and machine driving weights ($\alpha$, $\beta$) in real time. The human–machine integration module forms the linear combination of the human and machine control variables according to their weights and uses it as the final control variable ($u$) to command the vehicle to execute steering maneuvers. Non-cooperative game theory is employed to establish the human–machine interaction model, which is characterized as a Stackelberg game. The human–machine conflict issue is addressed by deriving the optimal control strategies for both the human and the machine. Subsequently, an adaptive weight decision model based on fuzzy rules is utilized to dynamically adjust the driving authorities of the human and the machine. This adjustment modifies the equilibrium conditions of the human–machine game, ensuring the safe operation of the vehicle. The overall framework is presented in Figure 1.

2.2. Modeling of the Vehicle Dynamics

In order to design the human–machine collaborative control model and reduce the computational complexity of the control algorithm, a three-degree-of-freedom single-track vehicle dynamics model [18], as depicted in Figure 2, is established. This is achieved by disregarding factors such as changes in the longitudinal velocity, the differences between the front and rear wheels, longitudinal and lateral aerodynamics, and the impact of the suspension system. $XOY$ is the global coordinate system, while $xoy$ is the vehicle-fixed coordinate system. $F_{lf}$ and $F_{lr}$ represent the longitudinal resultant forces of the front and rear tires, respectively; $F_{cf}$ and $F_{cr}$ represent the net lateral forces of the front and rear tires, respectively; $v_{cf}$ is the lateral velocity of the front wheel; $v_{lf}$ is the longitudinal velocity of the front wheel; $\delta_f$ and $\delta_r$ are the steering angles of the front and rear wheels, respectively; $a$ and $b$ are the distances from the vehicle's center of mass to the front and rear axles, respectively; $v_f$ is the velocity of the front axle's center; $\dot\varphi$ is the yaw rate; $\alpha_f$ is the slip angle of the front wheel; $m$ is the mass of the vehicle; $I_z$ is the moment of inertia; $s$ is the slip ratio; $C_l$ is the longitudinal stiffness of the tire; and $C_c$ is the lateral (cornering) stiffness of the tire.
The transformation relationship between the global coordinate system and the vehicle-fixed coordinate system is
$$\begin{cases}\dot Y = \dot x \sin\varphi + \dot y \cos\varphi\\ \dot X = \dot x \cos\varphi - \dot y \sin\varphi\end{cases}$$
In accordance with Newton’s second law, force-equilibrium equations are established along the x-axis and y-axis and about the z-axis, respectively:
$$\begin{cases}m\ddot x = m\dot y\dot\varphi + 2F_{xf} + 2F_{xr}\\ m\ddot y = -m\dot x\dot\varphi + 2F_{yf} + 2F_{yr}\\ I_z\ddot\varphi = 2aF_{yf} - 2bF_{yr}\end{cases}$$
The relationships between the lateral force, the longitudinal force, and the resultant force of the tires in the x-direction and the y-direction are as follows:
$$\begin{cases}F_{xf} = F_{lf}\cos\delta_f - F_{cf}\sin\delta_f\\ F_{xr} = F_{lr}\cos\delta_r - F_{cr}\sin\delta_r\\ F_{yf} = F_{lf}\sin\delta_f + F_{cf}\cos\delta_f\\ F_{yr} = F_{lr}\sin\delta_r + F_{cr}\cos\delta_r\end{cases}$$
In order to simplify the model, a small-angle assumption is employed for the steering angle, yielding the approximate relationships
$$\sin\delta \approx \delta,\qquad \cos\delta \approx 1,\qquad \tan\delta \approx \delta$$
Then, applying Equation (4), Equation (3) can be simplified into
$$\begin{cases}F_{xf} = F_{lf} - F_{cf}\delta_f\\ F_{xr} = F_{lr}\\ F_{yf} = F_{lf}\delta_f + F_{cf}\\ F_{yr} = F_{cr}\end{cases}$$
To calculate the longitudinal force $F_l$ and the lateral force $F_c$ of the tire, the model is simplified further, and the Pacejka tire model [18] is adopted. When the lateral acceleration $a_y \le 0.4g$ and the tire slip angle $\alpha \le 5^\circ$, according to the magic formula, it is known that
$$F_l = C_l s,\qquad F_c = C_c\alpha$$
To calculate the tire slip angle, by analyzing the geometric relationships shown in Figure 2, we obtain
$$\alpha = \arctan\frac{v_c}{v_l}$$
$$\begin{cases}v_l = v_y\sin\delta + v_x\cos\delta\\ v_c = v_y\cos\delta - v_x\sin\delta\end{cases}$$
$$\begin{cases}v_{yf} = \dot y + a\dot\varphi,\quad v_{yr} = \dot y - b\dot\varphi\\ v_{xf} = \dot x,\quad v_{xr} = \dot x\end{cases}$$
Under the small-angle assumption, by jointly considering Equations (7) through (9), the following result is obtained:
$$\alpha_f = \frac{\dot y + a\dot\varphi}{\dot x} - \delta_f$$
Regarding the rear wheel's slip angle, for a front-wheel-steering vehicle with $\delta_r = 0$, it follows that
$$\alpha_r = \frac{\dot y - b\dot\varphi}{\dot x}$$
By substituting Equations (10) and (11) into Equation (6), the lateral forces and the longitudinal forces of the front and rear wheels are derived as follows:
$$F_{cf} = C_{cf}\left(\frac{\dot y + a\dot\varphi}{\dot x} - \delta_f\right),\qquad F_{cr} = C_{cr}\frac{\dot y - b\dot\varphi}{\dot x}$$
$$F_{lf} = C_{lf}s_f,\qquad F_{lr} = C_{lr}s_r$$
When Equations (12) and (13) are substituted into Equation (5), we obtain
$$\begin{cases}F_{xf} = C_{lf}s_f - C_{cf}\left(\dfrac{\dot y + a\dot\varphi}{\dot x} - \delta_f\right)\delta_f\\[4pt] F_{xr} = C_{lr}s_r\\[2pt] F_{yf} = C_{lf}s_f\delta_f + C_{cf}\left(\dfrac{\dot y + a\dot\varphi}{\dot x} - \delta_f\right)\\[4pt] F_{yr} = C_{cr}\dfrac{\dot y - b\dot\varphi}{\dot x}\end{cases}$$
Upon substituting the above equation into the force equilibrium Equation (2), simplified vehicle dynamics equations can be derived:
$$\begin{cases}m\ddot x = m\dot y\dot\varphi + 2\left[C_{lf}s_f + C_{cf}\left(\delta_f - \dfrac{\dot y + a\dot\varphi}{\dot x}\right)\delta_f + C_{lr}s_r\right]\\[6pt] m\ddot y = -m\dot x\dot\varphi + 2\left[C_{lf}s_f\delta_f + C_{cf}\left(\dfrac{\dot y + a\dot\varphi}{\dot x} - \delta_f\right) + C_{cr}\dfrac{\dot y - b\dot\varphi}{\dot x}\right]\\[6pt] I_z\ddot\varphi = 2\left[a\,C_{lf}s_f\delta_f + a\,C_{cf}\left(\dfrac{\dot y + a\dot\varphi}{\dot x} - \delta_f\right) - b\,C_{cr}\dfrac{\dot y - b\dot\varphi}{\dot x}\right]\\[6pt] \dot Y = \dot x\sin\varphi + \dot y\cos\varphi\\ \dot X = \dot x\cos\varphi - \dot y\sin\varphi\end{cases}$$
The position, velocity, yaw angle, and other information for the vehicle are chosen as the vehicle state variables, denoted as $\zeta = [\dot y, \dot x, \varphi, \dot\varphi, Y, X]^T$. When the control variable is chosen as $u = \delta_f$ and the output variable as $\eta = [\varphi, Y]^T$, the nonlinear dynamics of the vehicle can be formulated as
$$\dot\zeta = f(\zeta, u)$$
First, the forward Euler method is employed to discretize the nonlinear model. Given a sampling time $T$, we have
$$\dot\zeta(k) = \frac{\zeta(k+1) - \zeta(k)}{T}$$
By simultaneously considering Equations (16) and (17), we arrive at
$$\zeta(k+1) = \zeta(k) + T f[\zeta(k), u(k)]$$
Subsequently, by applying the state-trajectory method to linearize the system state Equation (18), we obtain
$$\zeta(k+1) - \hat\zeta_0(k+1) = A_{k,0}\left[\zeta(k) - \hat\zeta_0(k)\right] + B_{k,0}\left[u(k) - u_0(k)\right]$$
The equation above represents the linear model derived at the operating point $(\zeta_0, u_0)$. This model can be extended further to an arbitrary point $(\zeta_t, u_t)$. Consequently, a discrete linearized vehicle system model is obtained:
$$\begin{cases}\zeta(k+1) = A\zeta(k) + Bu(k) + d(k)\\ \eta = C\zeta\end{cases}$$
In the formula, $d(k) = \hat\zeta_t(k+1) - A_{k,t}\hat\zeta_t(k) - B_{k,t}u_t(k)$, $A = A_{k,t}$, $B = B_{k,t}$, and
$$C = \begin{bmatrix}0 & 0 & 1 & 0 & 0 & 0\\ 0 & 0 & 0 & 0 & 1 & 0\end{bmatrix}$$
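To make the pipeline of Equations (16)–(20) concrete, the following Python sketch forward-Euler discretizes a nonlinear single-track model and linearizes it about an operating point via numerical Jacobians. The dynamics function and all parameter values here are illustrative assumptions (the longitudinal tire forces are neglected), not the paper's calibrated model.

```python
import numpy as np

def f(zeta, u, m=1500.0, Iz=2500.0, a=1.2, b=1.4, Ccf=-60000.0, Ccr=-60000.0):
    """Illustrative nonlinear dynamics; zeta = [y_dot, x_dot, phi, phi_dot, Y, X], u = delta_f."""
    y_d, x_d, phi, phi_d, Y, X = zeta
    af = (y_d + a * phi_d) / x_d - u        # front slip angle, Eq. (10)
    ar = (y_d - b * phi_d) / x_d            # rear slip angle, Eq. (11)
    Fcf, Fcr = Ccf * af, Ccr * ar           # linear tire forces, Eq. (12)
    y_dd = -x_d * phi_d + 2.0 * (Fcf + Fcr) / m
    x_dd = y_d * phi_d                      # longitudinal tire forces neglected
    phi_dd = 2.0 * (a * Fcf - b * Fcr) / Iz
    Y_d = x_d * np.sin(phi) + y_d * np.cos(phi)
    X_d = x_d * np.cos(phi) - y_d * np.sin(phi)
    return np.array([y_dd, x_dd, phi_d, phi_dd, Y_d, X_d])

def discretize_linearize(zeta_t, u_t, T=0.02, eps=1e-6):
    """Return A, B, d of Eq. (20) at the operating point (zeta_t, u_t)."""
    g = lambda z, v: z + T * f(z, v)        # forward Euler step, Eq. (18)
    n = zeta_t.size
    A = np.zeros((n, n))
    B = np.zeros((n, 1))
    for i in range(n):                      # central-difference Jacobian wrt the state
        dz = np.zeros(n)
        dz[i] = eps
        A[:, i] = (g(zeta_t + dz, u_t) - g(zeta_t - dz, u_t)) / (2.0 * eps)
    B[:, 0] = (g(zeta_t, u_t + eps) - g(zeta_t, u_t - eps)) / (2.0 * eps)
    d = g(zeta_t, u_t) - A @ zeta_t - B[:, 0] * u_t   # offset term d(k) of Eq. (20)
    return A, B, d
```

By construction, the linearized model reproduces the nonlinear step exactly at the operating point, which is a quick sanity check for the offset term $d(k)$.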

2.3. The Driver Model

Given the practical difficulty of running repeatable experiments with real drivers, and in order to analyze the experimental results with greater precision, this study employs the optimal preview lateral acceleration driver model [19]. The steering wheel angle output by this model is presented in Equation (21). The model simultaneously considers the driver's reaction latency and correction mechanism. By simulating how the driver makes judgments and rolling adjustments based on the current vehicle state, comparing the deviation between the predicted trajectory at a previewed timepoint and the target trajectory, it can effectively mirror the performance of human drivers during driving tasks.
$$\begin{cases}\delta_{sw} = \ddot y(t)\cdot C_1\cdot C_2\\[4pt] \ddot y(t) = \dfrac{2}{T_p^2}\left[y(t+T_p) - y(t) - T_p\dot y(t)\right]\\[4pt] C_1 = \dfrac{1 + T_c s}{G_{ay}}\\[4pt] C_2 = \dfrac{\exp(-T_d s)}{1 + T_h s}\end{cases}$$
Specifically, $\ddot y(t)$ represents the ideal lateral acceleration derived from the optimization objective of minimizing the tracking error; $C_1$ denotes the correction element; $G_{ay}$ stands for the steady-state gain of the lateral acceleration; $C_2$ refers to the reaction lag component; $T_p$ is the preview time; $T_h$ is the lag time constant of muscle action; and $T_d$ is the neural reaction delay.
Real drivers' states can be influenced by factors such as emotions, the environment, their physical condition, and even alcohol consumption, which directly impacts driving safety. The driving state of drivers can be classified or quantified by means of a driver state detection system, operational signals, and vehicle status information. In the driver model, different parameters can be used to characterize the driver's state. In the relevant literature, the variable $D_s \in (0.001, 1.00)$ is employed to quantify the driving state of the driver model; a lower value of $D_s$ indicates a worse driver state. In this section, drivers A and B are introduced to simulate the manipulation behaviors of drivers in different states, in order to analyze the effectiveness of the proposed human–machine collaborative strategy under uncertain driver behaviors. The values of the driver parameters are presented in Table 1.
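As a minimal illustration of the preview law in Equation (21), the sketch below computes the ideal lateral acceleration term and a simple Euler discretization of the first-order muscle-lag link inside $C_2$. The function names, the discretization, and the omission of the gain correction $C_1$ and neural delay are assumptions for illustration only.

```python
def ideal_lateral_accel(y, y_dot, y_preview, Tp):
    """Ideal lateral acceleration of Eq. (21): closes the preview error over Tp."""
    return 2.0 / Tp ** 2 * (y_preview - y - Tp * y_dot)

def lag_step(x_in, x_state, Th, dt):
    """One forward-Euler step of the first-order lag 1/(1 + Th*s) in C2."""
    x_state += dt / Th * (x_in - x_state)
    return x_state
```

For example, with the vehicle at rest laterally and a previewed target 1 m to the side with $T_p = 1$ s, the ideal lateral acceleration is 2 m/s².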

2.4. A Lateral Controller Based on Linear Time-Varying Model Predictive Control

In this study, a lateral controller that takes control stability into account is developed based on model predictive control theory. This controller serves as the globally optimal controller during the human–machine collaborative driving process. To guarantee the stability and tracking ability of the control system, a linear time-varying model is employed as the prediction model. Linear time-varying model predictive control offers simple computation and high real-time performance; compared to nonlinear model predictive control, it is better suited to vehicle motion control. A fixed prediction horizon and control horizon are employed, and a quadratic programming (QP) solver is utilized for online optimization under linear constraints, which enhances the computational speed of the algorithm. An objective function for autonomous driving path tracking is established with the aim of minimizing both the path-tracking error and the magnitude of the variation in the control input. Consequently, a new state space is constructed.
$$\xi(k+1) = \begin{bmatrix}\zeta(k+1)\\ u(k)\end{bmatrix} = \begin{bmatrix}A & B\\ 0_{m\times n} & I_m\end{bmatrix}\begin{bmatrix}\zeta(k)\\ u(k-1)\end{bmatrix} + \begin{bmatrix}B\\ I_m\end{bmatrix}\Delta u(k) + \begin{bmatrix}d(k)\\ 0\end{bmatrix}$$
After rearrangement, we obtain
$$\begin{cases}\xi(k+1) = A_\Delta\xi(k) + B_\Delta\Delta u(k) + \bar d(k)\\ \eta_\Delta(k) = C_\Delta\xi(k)\end{cases}$$
In the formula,
$$A_\Delta = \begin{bmatrix}A & B\\ 0_{m\times n} & I_m\end{bmatrix},\qquad B_\Delta = \begin{bmatrix}B\\ I_m\end{bmatrix},\qquad \bar d(k) = \begin{bmatrix}d(k)\\ 0\end{bmatrix}$$
The values of $\xi$ and $\eta$ at the next $N_p$ time instants can be obtained through iteration. Then, the matrix expression of the system's output over the future $N_p$ time instants is
$$Y(k) = \Psi_\Delta\xi(k) + \Theta_\Delta\Delta U(k) + \Upsilon_\Delta\Xi(k)$$
Among these,
$$Y(k) = \begin{bmatrix}\eta_\Delta(k+1)\\ \eta_\Delta(k+2)\\ \vdots\\ \eta_\Delta(k+N_c)\\ \vdots\\ \eta_\Delta(k+N_p)\end{bmatrix},\quad \Psi_\Delta = \begin{bmatrix}C_\Delta A_\Delta\\ C_\Delta A_\Delta^2\\ \vdots\\ C_\Delta A_\Delta^{N_c}\\ \vdots\\ C_\Delta A_\Delta^{N_p}\end{bmatrix},\quad \Delta U(k) = \begin{bmatrix}\Delta u(k)\\ \Delta u(k+1)\\ \vdots\\ \Delta u(k+N_c-1)\end{bmatrix},$$
$$\Theta_\Delta = \begin{bmatrix}C_\Delta B_\Delta & 0 & \cdots & 0\\ C_\Delta A_\Delta B_\Delta & C_\Delta B_\Delta & \cdots & 0\\ \vdots & \vdots & \ddots & \vdots\\ C_\Delta A_\Delta^{N_c-1}B_\Delta & C_\Delta A_\Delta^{N_c-2}B_\Delta & \cdots & C_\Delta B_\Delta\\ \vdots & \vdots & & \vdots\\ C_\Delta A_\Delta^{N_p-1}B_\Delta & C_\Delta A_\Delta^{N_p-2}B_\Delta & \cdots & C_\Delta A_\Delta^{N_p-N_c}B_\Delta\end{bmatrix},$$
$$\Upsilon_\Delta = \begin{bmatrix}C_\Delta & 0 & \cdots & 0\\ C_\Delta A_\Delta & C_\Delta & \cdots & 0\\ C_\Delta A_\Delta^2 & C_\Delta A_\Delta & \cdots & 0\\ \vdots & \vdots & \ddots & \vdots\\ C_\Delta A_\Delta^{N_p-1} & C_\Delta A_\Delta^{N_p-2} & \cdots & C_\Delta\end{bmatrix},\qquad \Xi(k) = \begin{bmatrix}\bar d(k)\\ \bar d(k+1)\\ \vdots\\ \bar d(k+N_p-1)\end{bmatrix}$$
Let
$$Y_{ref}(k) = \left[\eta_{ref}(k+1), \ldots, \eta_{ref}(k+N_p)\right]^T$$
Consequently, the optimization objective of the controller is
$$\begin{aligned}\min J_\Delta = {} & \sum_{i=1}^{N_p}\left[\eta_\Delta(k+i) - \eta_{ref}(k+i)\right]^T Q_\Delta\left[\eta_\Delta(k+i) - \eta_{ref}(k+i)\right] + \sum_{i=0}^{N_c-1}\left\|\Delta u(k+i)\right\|^2_{R_\Delta} + \varepsilon^T\rho\,\varepsilon\\ \text{s.t.}\quad & \Delta u_{\min} \le \Delta u \le \Delta u_{\max},\qquad u_{\min} \le u \le u_{\max},\qquad 0 \le \varepsilon \le M\end{aligned}$$
Herein, the first term represents the global controller's requirement for a relatively small path-tracking error, while the second term reflects the global controller's demand for smooth variation in the control variable. $Q_\Delta$ and $R_\Delta$ are the penalty matrices for these two terms.
We define the control increment sequence over the control horizon $N_c$ as
$$\Delta U(k) = \left(\Delta u(k), \Delta u(k+1), \ldots, \Delta u(k+N_c-1)\right)^T$$
Consequently, the optimization objective function can be formulated as
$$J_\Delta = \left[Y(k) - Y_{ref}(k)\right]^T Q_\Delta\left[Y(k) - Y_{ref}(k)\right] + \Delta U^T R_\Delta\Delta U + \varepsilon^T\rho\,\varepsilon$$
Substitute Equation (24) into Equation (28) and set
$$E(k) = Y_{ref}(k) - \Psi_\Delta\xi(k) - \Upsilon_\Delta\Xi(k)$$
Given that the dimension of $E(k)^T$ is $1\times N_p$ and the dimension of $\Delta U(k)$ is $N_c\times 1$, the dimension of $E(k)^T Q_\Delta\Theta_\Delta\Delta U(k)$ is $1\times 1$. Thus, it holds that
$$J_\Delta = \Delta U(k)^T\left[\Theta_\Delta^T Q_\Delta\Theta_\Delta + R_\Delta\right]\Delta U(k) - 2E(k)^T Q_\Delta\Theta_\Delta\Delta U(k) + E(k)^T Q_\Delta E(k) + \varepsilon^T\rho\,\varepsilon$$
Moreover, since $E(k)^T Q_\Delta E(k)$ is a constant, it can be disregarded during the optimization. Thus, Equation (30) can be rewritten as
$$J_\Delta = \frac{1}{2}\begin{bmatrix}\Delta U(k)\\ \varepsilon\end{bmatrix}^T H\begin{bmatrix}\Delta U(k)\\ \varepsilon\end{bmatrix} + f^T\begin{bmatrix}\Delta U(k)\\ \varepsilon\end{bmatrix}$$
Among these,
$$H = \begin{bmatrix}2\left(\Theta_\Delta^T Q_\Delta\Theta_\Delta + R_\Delta\right) & 0\\ 0 & 2\rho\end{bmatrix},\qquad f^T = \begin{bmatrix}-2E(k)^T Q_\Delta\Theta_\Delta & 0\end{bmatrix}$$
Then, the optimal control increment sequence $\Delta U_g(k)$ can be obtained by solving Equation (31) through quadratic programming. Its first element $\Delta u_g(k)$ serves as the actual control increment, and thus the actual control input of the lateral controller is
$$u(k) = u(k-1) + \Delta u_g(k)$$
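The condensed problem of Equations (28)–(33) can be sketched as one receding-horizon step in a few lines. Constraints and the slack variable $\varepsilon$ are omitted here, so the minimizer reduces to a linear solve; the paper's controller uses a constrained QP solver instead. Matrix names follow the text, while the function itself is an illustrative assumption.

```python
import numpy as np

def mpc_step(Psi, Theta, Upsilon, Q, R, xi, Xi, Y_ref, u_prev):
    """One unconstrained LTV-MPC step: build E(k), H, f and minimize Eq. (31)."""
    E = Y_ref - Psi @ xi - Upsilon @ Xi        # tracking residual, Eq. (29)
    H = 2.0 * (Theta.T @ Q @ Theta + R)        # Hessian block of Eq. (32)
    f = -2.0 * Theta.T @ Q @ E                 # linear term of Eq. (32)
    dU = np.linalg.solve(H, -f)                # unconstrained QP minimizer
    return u_prev + dU[0], dU                  # apply the first increment, Eq. (33)
```

Only the first increment of the sequence is applied, and the optimization is repeated at the next sampling instant with updated matrices, which is the receding-horizon principle the section relies on.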
The robustness and stability of the designed lateral controller were experimentally verified in CarSim 2020 (shown in Figure 3) and MATLAB 2023a (shown in Figure 4). Simulink, a visual modeling and simulation tool within MATLAB, is dedicated to the construction of dynamic system models; within Simulink, users can directly access MATLAB scripts and functions, enabling the execution of more sophisticated algorithms and in-depth analysis. CarSim accommodates a wide array of simulation scenarios, and its most distinctive advantage is its seamless integration with MATLAB/Simulink, which allows simulation outcomes from CarSim to be imported into MATLAB for further analysis and processing.

3. The Leader–Follower Game-Based Optimal Control Strategy

3.1. A Solution to the Leader–Follower Game

The vehicle control variable $u$ is derived from the weighted sum of the driver's steering angle $u_d$ and the steering angle $u_a$ of the automated driving system. We denote $\alpha$ and $\beta$ as the weights of $u_d$ and $u_a$, respectively. Then,
$$u = \alpha u_d + \beta u_a$$
In the formula,
$$\alpha\in[0,1],\qquad \beta\in[0,1],\qquad \alpha + \beta = 1$$
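The weighted human–machine integration of Equation (34) amounts to a one-line blend. The helper below (its name is an assumption) is a direct transcription of $u = \alpha u_d + \beta u_a$ with $\beta = 1 - \alpha$.

```python
def blend_controls(u_d, u_a, alpha):
    """Final steering command u = alpha*u_d + beta*u_a, with alpha + beta = 1."""
    beta = 1.0 - alpha          # machine weight complements the driver weight
    return alpha * u_d + beta * u_a
```

At the extremes, $\alpha = 1$ hands full authority to the driver and $\alpha = 0$ to the automated system; the fuzzy decision model of Section 3.2 moves $\alpha$ between these extremes.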
Consequently, the discrete linear system model of the vehicle (Equation (20)) can also be represented as
$$\begin{cases}\zeta(k+1) = A\zeta(k) + B_a u_a(k) + B_d u_d(k) + d(k)\\ \eta(k) = C\zeta(k)\end{cases}$$
In the formula,
$$B_d = \alpha B,\qquad B_a = \beta B$$
We denote $N_p$ as the prediction horizon and $N_c$ as the control horizon. Then, by iterating the above discrete state equation, the prediction equations for the next $N_p$ steps under human–machine cooperative control can be derived as
$$Z(k) = \Psi\zeta(k) + \Theta_a U_a(k) + \Theta_d U_d(k) + \Upsilon\Phi(k)$$
In the formula,
$$Z(k) = \begin{bmatrix}\eta(k+1)\\ \eta(k+2)\\ \vdots\\ \eta(k+N_c)\\ \vdots\\ \eta(k+N_p)\end{bmatrix},\quad \Psi = \begin{bmatrix}CA\\ CA^2\\ \vdots\\ CA^{N_c}\\ \vdots\\ CA^{N_p}\end{bmatrix},\quad U_i(k) = \begin{bmatrix}u_i(k)\\ u_i(k+1)\\ \vdots\\ u_i(k+N_c-1)\end{bmatrix},\quad \Phi(k) = \begin{bmatrix}d(k)\\ d(k+1)\\ \vdots\\ d(k+N_p-1)\end{bmatrix},$$
$$\Theta_i = \begin{bmatrix}CB_i & 0 & \cdots & 0\\ CAB_i & CB_i & \cdots & 0\\ \vdots & \vdots & \ddots & \vdots\\ CA^{N_c-1}B_i & CA^{N_c-2}B_i & \cdots & CB_i\\ \vdots & \vdots & & \vdots\\ CA^{N_p-1}B_i & CA^{N_p-2}B_i & \cdots & CA^{N_p-N_c}B_i\end{bmatrix},\quad \Upsilon = \begin{bmatrix}C & 0 & \cdots & 0\\ CA & C & \cdots & 0\\ CA^2 & CA & \cdots & 0\\ \vdots & \vdots & \ddots & \vdots\\ CA^{N_p-1} & CA^{N_p-2} & \cdots & C\end{bmatrix},$$
$$i = a, d$$
In the process of human–machine collaborative driving, the driver and the automated driving system are two independent agents, each pursuing its own control objectives; a two-player game relationship is thus formed unintentionally. First, the path-tracking processes of both the driver and the automated driving system are simplified as model predictive controllers, each aiming to minimize its respective cost function:
$$\min_{U_d} J_d(k),\qquad \min_{U_a} J_a(k)$$
Herein, $J_d(k)$ denotes the cost function of the driver, and $J_a(k)$ denotes the cost function of the automated driving system.
With the requirements that each participant's vehicle lateral displacement and yaw angle are consistent with its own expectations and that its own effort is minimized, the cost functions of the driver and the automated driving system are defined as shown in Equation (37). Herein, $T_a(k)$ represents the preview points for the next $N_p$ steps obtained by the automated driving system according to the obstacle avoidance trajectory output from the path-planning module, and $T_d(k)$ represents the driver's preview points for the next $N_p$ steps.
$$\begin{cases}J_d(k) = \left\|Z(k) - T_d(k)\right\|^2_{Q_d} + \left\|U_d(k)\right\|^2_{R_d}\\[2pt] J_a(k) = \left\|Z(k) - T_a(k)\right\|^2_{Q_a} + \left\|U_a(k)\right\|^2_{R_a}\end{cases}$$
In the formula,
$$Q_i = \begin{bmatrix}q_i & 0 & \cdots & 0\\ 0 & q_i & \cdots & 0\\ \vdots & \vdots & \ddots & \vdots\\ 0 & 0 & \cdots & q_i\end{bmatrix},\quad q_i = \begin{bmatrix}q_{i\varphi} & 0\\ 0 & q_{iy}\end{bmatrix},\quad R_i = \begin{bmatrix}r_i & 0 & \cdots & 0\\ 0 & r_i & \cdots & 0\\ \vdots & \vdots & \ddots & \vdots\\ 0 & 0 & \cdots & r_i\end{bmatrix},\quad T_i(k) = \begin{bmatrix}t_i(k+1)\\ t_i(k+2)\\ \vdots\\ t_i(k+N_p)\end{bmatrix}$$
In the leader–follower game model, this study employs the backward induction method to solve for the Stackelberg equilibrium. That is, an analytical expression for the optimal response of the automated driving system to the driver's actions is first obtained. Based on this expression, the driver's optimal control action $U_d$ is determined, and the optimal control action $U_a$ of the automated driving system is then deduced in reverse.
Firstly, the error variables are defined:
$$\begin{cases}\varepsilon_a(k) = T_a(k) - \Psi\zeta(k) - \Theta_d U_d(k) - \Upsilon\Phi(k)\\ \varepsilon_d(k) = T_d(k) - \Psi\zeta(k) - \Theta_a U_a(k) - \Upsilon\Phi(k)\end{cases}$$
Then, the partial derivative of the cost function $J_i(k)$ with respect to $U_i(k)$ is
$$\frac{\partial J_i}{\partial U_i(k)} = \frac{\partial\left(\left\|\Theta_i U_i(k) - \varepsilon_i(k)\right\|^2_{Q_i} + \left\|U_i(k)\right\|^2_{R_i}\right)}{\partial U_i(k)} = 2\Theta_i^T Q_i\left[\Theta_i U_i(k) - \varepsilon_i(k)\right] + 2R_i U_i(k)$$
Letting $\frac{\partial J_a}{\partial U_a(k)} = 0$, the optimal control sequence for the automated driving system is obtained as
$$U_a(k) = L_a\left(2\Theta_a^T Q_a\varepsilon_a(k)\right)$$
In the formula,
$$L_a = \left(2\Theta_a^T Q_a\Theta_a + 2R_a\right)^{-1}$$
Equation (40) indicates that $U_a(k)$ depends not only on the automated driving system's own expectations but also on the leader's strategy $U_d(k)$. Since the driver serves as the leader and can estimate the operations of the automated driving system and strategize in advance, we substitute the follower's response (Equation (40), with $\varepsilon_a(k)$ given by Equation (38)) into the driver's payoff function $J_d(k)$ in Equation (37) and take the derivative to obtain
$$\frac{\partial J_d}{\partial U_d(k)} = \left(2\Theta_{d2}^T Q_d\Theta_{d2} + 2R_d\right)U_d(k) - 2\Theta_{d2}^T Q_d\varepsilon_d(k)$$
In the formula,
$$\varepsilon_d(k) = T_d(k) - \left(\Psi - \Theta_a L_a 2\Theta_a^T Q_a\Psi\right)\zeta(k) - \Theta_a L_a 2\Theta_a^T Q_a T_a(k) - \left(\Upsilon - \Theta_a L_a 2\Theta_a^T Q_a\Upsilon\right)\Phi(k),$$
$$\Theta_{d2} = \Theta_d - \Theta_a L_a 2\Theta_a^T Q_a\Theta_d$$
Letting $\frac{\partial J_d}{\partial U_d(k)} = 0$, the optimal control strategy of the driver can be derived as
$$U_d(k) = L_d\left(2\Theta_{d2}^T Q_d\varepsilon_d(k)\right)$$
In the formula,
$$L_d = \left(2\Theta_{d2}^T Q_d\Theta_{d2} + 2R_d\right)^{-1}$$
Substituting Equations (38) and (42) into the expression for the automated driving system's solution in Equation (40), the optimal response of the automated driving system to the driver's operation is derived as
$$U_a(k) = L_a\left(2\Theta_a^T Q_a\varepsilon_a(k)\right)$$
In the formula,
$$\varepsilon_a(k) = T_a(k) - \Psi\zeta(k) - \Theta_d L_d\left(2\Theta_{d2}^T Q_d\varepsilon_d(k)\right) - \Upsilon\Phi(k)$$
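The backward-induction chain of Equations (38)–(43) can be sketched numerically as below. Matrix names follow the text; the helper's structure and any dimensions used when calling it are illustrative assumptions, and the code simply chains the closed-form expressions derived above (follower response gain first, then the anticipating leader, then the follower's realized response).

```python
import numpy as np

def stackelberg(Psi, Theta_a, Theta_d, Upsilon, Qa, Qd, Ra, Rd,
                zeta, Phi, Ta, Td):
    """Driver-leader / automation-follower Stackelberg solution, Eqs. (38)-(43)."""
    La = np.linalg.inv(2.0 * Theta_a.T @ Qa @ Theta_a + 2.0 * Ra)
    Ka = La @ (2.0 * Theta_a.T @ Qa)          # follower response gain of Eq. (40)
    # the leader anticipates the follower: effective input matrix and residual
    Theta_d2 = Theta_d - Theta_a @ Ka @ Theta_d
    base = Psi @ zeta + Upsilon @ Phi
    eps_d = Td - base + Theta_a @ Ka @ base - Theta_a @ Ka @ Ta   # Eq. (41)
    Ld = np.linalg.inv(2.0 * Theta_d2.T @ Qd @ Theta_d2 + 2.0 * Rd)
    Ud = Ld @ (2.0 * Theta_d2.T @ Qd @ eps_d)  # leader optimum, Eq. (42)
    eps_a = Ta - base - Theta_d @ Ud           # follower residual, Eq. (38)
    Ua = Ka @ eps_a                            # follower optimum, Eq. (43)
    return Ud, Ua
```

A useful sanity check is that the returned pair satisfies the follower's first-order condition of Equation (39) exactly.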

3.2. The Adaptive Weight Fuzzy Decision-Making Model

The allocation of driving authority is a crucial issue in human–machine collaborative control. Inadequate intervention fails to guarantee driving safety, whereas excessive intervention will exacerbate human–machine conflicts and impose additional burdens on drivers. Consequently, a novel dynamic weight adjustment approach is proposed. This approach takes into account the driver’s state, the risks of the traffic environment, and the deviation in the global control of the vehicle, aiming to supply reliable collaborative control factors for the human–machine integration module within the control architecture.

3.2.1. Assessment of Environmental Risks

The environmental risk $Risk$ is primarily composed of two components: the obstacle risk $Risk_o$ and the constraint risk $Risk_r$ stemming from traffic regulations:
$$Risk = Risk_o + Risk_r$$
First, with respect to the obstacle risk, the assessment model established in Ref. [20] is
$$Risk_o = \frac{K_{obst}}{k^b}$$
Herein, $k = \sqrt{\left(\dfrac{d_X}{X_n}\right)^2 + \left(\dfrac{d_Y}{Y_n}\right)^2}$, where $K_{obst}$ is a constant determining the intensity of the risk domain, and $b$ is a constant determining the shape of the risk domain. $d_X$ and $d_Y$ represent the relative longitudinal and lateral displacements between the current position and the obstacle, respectively. $X_n$ and $Y_n$ are the longitudinal and lateral normalization constants, respectively.
Taking into account the disparities in risk across different directions, the obstacle vehicle is first dilated by the vehicle's length and width to establish an elliptical driving risk domain. Subsequently, the longitudinal normalization constant is calibrated using the product of the relative longitudinal velocity $\Delta v_x$ and the minimum safe lane-changing time $T_m$ [21] to accommodate diverse traffic scenarios. Given that the collision risk surges rapidly as the distance to the obstacle vehicle decreases, an exponential form is employed for the risk assessment model, as presented in Equation (45).
$$Risk_o = \begin{cases}Risk_{\max}, & (x, y)\in S\\[2pt] \lambda_1\left(e^{\lambda_2/|k|} - 1\right), & (x, y)\notin S\end{cases}$$
Herein, $k = \sqrt{\left(\dfrac{x - x_o}{X_n}\right)^2 + \left(\dfrac{y - y_o}{Y_n}\right)^2}$, $X_n = l + \Delta v_x T_m$, and $Y_n = w$; $l$ and $w$ denote the length and width of the obstacle, respectively. $(x_o, y_o)$ represents the position of the centroid of the obstacle. $\lambda_1$ and $\lambda_2$ are the coefficients regulating the intensity of the risk domain. $S: k \le 1$ is the inflated elliptical region of the obstacle.
When there are $n$ obstacles in the driving environment, the obstacle risk $Risk_o$ of the ego vehicle is the sum of the risks $Risk_{oi}$ ($i = 1, 2, \ldots, n$) posed to it by all obstacles, as presented in Equation (46):
$$Risk_o = \sum_{i=1}^{n} Risk_{oi}$$
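The elliptical obstacle field of Equations (45) and (46) can be sketched as follows. The saturation value and the $\lambda$ coefficients are assumed example values, and the helper names are not from the paper.

```python
import math

def obstacle_risk(x, y, xo, yo, l, w, dvx, Tm,
                  risk_max=10.0, lam1=1.0, lam2=2.0):
    """Elliptical obstacle risk of Eq. (45); coefficient values are assumed."""
    Xn = l + dvx * Tm                          # velocity-scaled longitudinal axis
    Yn = w                                     # lateral axis from obstacle width
    k = math.sqrt(((x - xo) / Xn) ** 2 + ((y - yo) / Yn) ** 2)
    if k <= 1.0:                               # inside the inflated ellipse S
        return risk_max
    return lam1 * (math.exp(lam2 / k) - 1.0)   # decays toward zero far away

def total_obstacle_risk(ego_x, ego_y, obstacles):
    """Eq. (46): sum the field over all obstacles (iterable of parameter tuples)."""
    return sum(obstacle_risk(ego_x, ego_y, *ob) for ob in obstacles)
```

Inside the inflated ellipse the risk saturates at its maximum; outside, it decays monotonically with the normalized elliptical distance, so nearer obstacles always dominate the sum.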
Furthermore, road boundaries impose constraints on drivers' behaviors, preventing them from breaching traffic regulations through, for example, lane deviation. In this paper, a road boundary risk assessment model is developed. When a vehicle travels near the centerline of a lane, the risk is relatively low, and a trigonometric function is employed for the modeling; as the vehicle approaches a road boundary line, the risk escalates rapidly, and an exponential function is utilized. A two-lane road is considered in the research scenario of this paper. Let $Y = 0$ denote the road centerline, $L$ the width of a single lane, $Y_l = L/2$ the centerline of the left-hand lane, and $Y_r = -L/2$ the centerline of the right-hand lane. The road risk assessment model is established as presented in Equation (47).
R i s k r = γ 1 e | Y Y l | 1 Y L / 2   o r   Y L / 2 γ 2 sin 2 π Y Y l 2 L L / 2 < Y < L / 2
Herein, γ 1 and γ 2 are the coefficients for modulating the intensity of the risk domain.
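A minimal sketch of the road boundary risk of Equation (47) follows. The exact branch expressions are reconstructed from the surrounding text (zero risk at the lane centerlines, a trigonometric bump at the dividing line, exponential growth beyond the lane centerlines toward the boundary), so treat them as an illustrative reading; the lane width and coefficients are placeholder values.

```python
import math

def road_risk(Y, L=3.5, gamma1=1.0, gamma2=0.3):
    """Road boundary risk (Eq. 47, reconstructed) for a two-lane road.
    Y = 0 is the road centerline; lane centerlines sit at Y = +-L/2."""
    Yl = L / 2.0                       # centerline of the left-hand lane
    if abs(Y) >= L / 2.0:              # past a lane centerline, toward the road edge
        return gamma1 * (math.exp(abs(Y) - Yl) - 1.0)   # rapid exponential growth
    # between the two lane centerlines: zero at Y = +-L/2,
    # peaking at the lane-dividing line Y = 0
    return gamma2 * math.sin(2.0 * math.pi * (Y - Yl) / (2.0 * L)) ** 2
```

Note that both branches vanish at |Y| = L/2, so the reconstructed field is continuous where the branches meet.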

3.2.2. The Deviation in Vehicle Control

During the game process, human and machine operations interact with each other. Given the uncertainty of the driver’s operations, a global controller is required to conduct comprehensive monitoring of the vehicle’s driving state, thereby preventing the vehicle from entering an unstable state. The global steering control scheme is deduced by means of the quadratic dynamic optimization approach. The optimization objective of the global controller is formulated as presented in Equation (48).
The optimization problem is solved to obtain the control sequence U_g(k), and the first element u_g(k) of this sequence is taken as the actual control increment. The control deviation is then defined as
e(k) = (1/H) · Σ_{j=k−H+1}^{k} ( u_g(j) − u(j) )
In this context, H denotes the length of the time window.
We define the normalized deviation in global control as follows:
Offset = e(k) / max( e(k) )
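The windowed deviation and its normalization can be sketched as a small stateful helper. The normalizing bound max_dev is an assumed calibration constant standing in for the max(e(k)) term, since the source does not specify how the maximum is obtained online.

```python
from collections import deque

class ControlDeviation:
    """Sliding-window deviation between the global controller's input u_g
    and the actually applied input u, averaged over the last H steps."""

    def __init__(self, H=10, max_dev=1.0):
        self.window = deque(maxlen=H)   # keeps only the most recent H deviations
        self.max_dev = max_dev          # assumed calibration bound for normalization

    def update(self, u_g, u):
        """Push one sample; return the raw deviation e(k) and normalized Offset."""
        self.window.append(u_g - u)
        e_k = sum(self.window) / len(self.window)   # mean deviation over the window
        offset = min(abs(e_k) / self.max_dev, 1.0)  # clipped to [0, 1] for the FLC
        return e_k, offset
```

In the overall scheme, the returned Offset is one of the three inputs fed to the fuzzy weight controller described in the next subsection.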

3.2.3. Dynamic Weight Adjustment of Driving Authority Based on Fuzzy Rules

The driving risk and the deviation from the global controller directly impact the driving-weight distribution between the human driver and the automated system. Nevertheless, it remains challenging to formulate an analytical mathematical relationship between these quantities and the variation in driving weight. Specifically, when the driving risk is elevated and the driver's state is suboptimal, it is desirable to increase the driving weight of the co-driving controller; this augmentation of its intervention level serves to safeguard driving safety. Conversely, when the driving risk is relatively low, the driving weight should be transferred to the driver, ensuring that the vehicle adheres to the driver's control commands. Therefore, on the premise of ensuring safe driving operations, it is essential to offer more assistance to drivers in a poor state and to minimize intervention for those in a good state. This approach aims to alleviate the driver's workload and enhance overall driving performance and safety.
A fuzzy logic controller is employed to design the weights. The structure of the fuzzy logic controller is depicted in Figure 5. Herein, the driving environment risk R i s k , the global control deviation O f f s e t , and the driver’s state D s serve as the inputs, while the weight β of the autonomous driving system acts as the output.
The driving environment risk and the deviation in global control can be obtained via the methods described above. The state of the human driver can be obtained by monitoring the driver's physiological signals or vehicle-related operational behaviors; in simulation, the driver's state is represented by the driver-model parameters.
The triangular membership function features a simple structure, low computational complexity, and high sensitivity. Its vertices correspond to typical values, facilitating adjustment. In this study, the triangular membership function is employed to characterize the fuzzy set. Regarding situations where the values of D s and O f f s e t are either relatively large or relatively small, a semi-trapezoidal membership function is utilized for the description. The semi-trapezoidal function is capable of effectively dealing with boundary conditions. It helps to circumvent blind spots in the rules, thereby ensuring that even the maximum value of D s or the minimum value of O f f s e t can still activate the corresponding rules. Finally, the membership functions for each input and output variable are presented in Figure 6.
Each fuzzy rule is associated with a fuzzy relation. The fuzzy relation R i corresponding to the i -th rule is expressed as
R_i = ( Risk_l × Ds_m × Offset_n )^{T_1} × β_k
Here, i ∈ {1, 2, 3, …, 7}, l, m, n ∈ {1, 2, 3}, k ∈ {1, 2, 3, 4, 5}, and T_1 is the transfer matrix that converts the antecedent into a column vector.
The synthetic inference rule is adopted to calculate the fuzzy output, and the output fuzzy quantity is then defuzzified using the centroid method to obtain a corresponding crisp output value. The resulting driving weight surface is depicted in Figure 7.
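The inference-and-defuzzification pipeline can be sketched as follows. The triangular memberships, the three rules, and the singleton consequents are illustrative placeholders, not the calibrated rule base and membership functions of Figure 6; they only show the mechanics of min-AND firing and centroid-style defuzzification.

```python
def tri(x, a, b, c):
    """Triangular membership with vertex b; semi-trapezoids at the domain
    edges can be obtained by clamping a or c to the boundary value."""
    if x <= a or x >= c:
        return 0.0
    return (x - a) / (b - a) if x <= b else (c - x) / (c - b)

def fuzzy_weight(risk, offset, ds):
    """Toy Mamdani-style inference for the automation weight beta.
    Inputs are assumed normalized to [0, 1]; rules are illustrative."""
    lo = lambda x: tri(x, -0.5, 0.0, 0.5)   # "low" set, peak at 0
    hi = lambda x: tri(x, 0.5, 1.0, 1.5)    # "high" set, peak at 1
    rules = [
        (min(hi(risk), lo(ds)), 0.8),   # high risk AND poor driver -> beta large
        (min(lo(risk), hi(ds)), 0.1),   # low risk AND good driver  -> beta small
        (hi(offset),            0.5),   # large control deviation   -> beta medium
    ]
    num = sum(strength * center for strength, center in rules)
    den = sum(strength for strength, _ in rules)
    # weighted-centroid defuzzification; default keeps authority with the driver
    return num / den if den > 0 else 0.1
```

Calling `fuzzy_weight(1.0, 0.0, 0.0)` (high risk, poor driver state) yields the large-beta consequent, while `fuzzy_weight(0.0, 0.0, 1.0)` hands authority back to the driver, mirroring the qualitative behavior described above.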

4. Simulation Results and Their Analysis

As depicted in Figure 8, the scenario under consideration is a stretch of a two-lane highway, where each lane has a width of 3.5 m; the speed of the vehicle V_s is 60 km/h, while that of the vehicle V_f is 70 km/h. All vehicles are located on the centerlines of their lanes. To assess the effectiveness of dynamic weight allocation within risk scenarios, two obstacle vehicles are introduced into the experimental scenario. The ego vehicle V_s is traveling at a constant speed while following the leading vehicle V_f in the right-hand lane. There is a stationary obstacle (accident vehicle V_o1) 100 m ahead in the ego vehicle's lane and another stationary obstacle (accident vehicle V_o2) 150 m ahead in the left-hand lane. Owing to occlusion by the leading vehicle, however, the driver of the ego vehicle fails to recognize the danger. At a particular instant, the leading vehicle abruptly changes lanes; the driver of the ego vehicle then spots the obstacles and initiates an emergency obstacle avoidance operation. To validate the steering interaction, it is specified that all drivers uniformly implement the lane-changing obstacle avoidance strategy. During the co-simulation of CarSim and Simulink, two operating conditions are applied separately: the driver driving alone and human–machine collaborative driving.
The results of the experimental simulation are presented in Figure 9 and Figure 10. Specifically, Figure 9 depicts the outcomes of driver A's participation in driving, and Figure 10 shows those of driver B's involvement. Here, DA_a and DA_b denote driver A and driver B driving alone, respectively, while SC_a and SC_b represent the collaborative conditions in which driver A and driver B, respectively, cooperate with the autonomous driving system.
By contrasting the curves for various indicators when the drivers are driving solo in Figure 9 and Figure 10, it can be observed that driver A demonstrates a superior driving state to that of driver B. For driver A, the driving trajectory and the control input variables exhibit smooth variations. Specifically, the peak value of the yaw rate is maintained below 0.25 rad·s−1, which indicates that driver A is capable of accurately fulfilling the driving task. In contrast, driver B fails to successfully execute the steering and obstacle avoidance tasks. Notably, at the longitudinal position of X = 100 m, driver B comes perilously close to colliding with the road boundary.
Regarding driver A, the trajectory during collaborative driving with the autonomous driving system approximately coincides with that during solo driving. As can be observed from Figure 9b, the weight of the autonomous driving system generally remains at around 0.1. Only when entering high-risk areas does the weight increase slightly, yet it still remains below 0.2. This suggests that driver A maintains a relatively high driving weight and a high level of driving freedom throughout the collaborative process.
Concerning driver B, owing to the suboptimal driving state, the initial driving weight assigned to the autonomous driving system is relatively higher. Consequently, when solving the non-cooperative interaction model, a greater advantage can be achieved. This enables more substantial intervention in the vehicle’s operation to help the driver complete their steering maneuvers.
As depicted in Figure 10b, the weight adjustment strategy proposed in this section effectively mitigates drastic fluctuations in the driving weights between the human driver and the autonomous driving system. During the first lane-changing process, taking into account the driver’s state, the driving weight of the autonomous driving system only experiences a minor increase when the vehicle reaches the longitudinal position of X = 50 m, which is attributed to the relatively high environmental risk at that moment. During the second lane-changing process, when the vehicle reaches the longitudinal position of X = 96 m, despite a rapid surge in risk, the weight of the autonomous driving system only shows a marginal change. This is because the global control deviation remains within a narrow range. When the driver is in a less-than-ideal state, minimizing significant weight variations can effectively alleviate the driver’s psychological stress and operational burden. As is evident from Figure 10a, the vehicle’s driving path during collaborative driving is smoother. Additionally, as shown in Figure 10c, the collaborative driving mode significantly reduces the driver’s large-scale control inputs, thereby decreasing the driver’s level of exertion.
In view of the overall outcomes of the aforementioned simulation experiments, during the collaborative driving process, the degree of driver effort is lower, and the vehicle exhibits a lower yaw angular velocity. This indicates that the collaborative driving strategy can assist drivers in alleviating their driving burden and enhancing driving safety. When comparing the results of collaborative driving among different drivers, the proposed strategy can provide varying degrees of assistance to drivers in different states, thereby ensuring driving freedom. In contrast to a strategy that solely relies on environmental risks to adjust the driving weights, the proposed adaptive weight decision-making model can yield weights with a smaller range of variation. This is more conducive to enabling drivers to understand their driving characteristics and promoting the achievement of equilibrium in the human–machine interaction, thus reducing driver discomfort.

5. General Conclusions

This paper presents a human–machine collaborative steering control method grounded in the leader–follower game theory. Taking into account the characteristics of unconscious human–machine competition, the human–machine interaction is characterized as a leader–follower game relationship. Through the application of backward induction, the optimal control strategy under equilibrium conditions is theoretically derived. This enables the conditions for achieving game equilibrium to be determined and the optimal human–machine control strategy that incorporates the ability for dynamic adjustment to the driving authority under such equilibrium to be formulated. Subsequently, an adaptive weight decision-making model based on fuzzy control rules is designed. This model modifies the conditions for human–machine game equilibrium to fulfill the path-tracking task. In this adaptive weight decision-making model, a globally optimal steering controller is innovatively introduced. This controller serves to monitor the stability of human–machine control. Additionally, a fuzzy controller is constructed by integrating the traffic environment risks and the driver’s state, thereby realizing dynamic adjustments of human–machine driving authority.
The experimental results demonstrate that the proposed method can effectively alleviate the driver's workload, enhance the stability of vehicle motion, and assist drivers in diverse states in accomplishing driving tasks. In comparison with strategies that adjust the driving weights according to risk factors alone, the adaptive weight decision-making model proposed in this study exhibits smaller fluctuations in the weight values. When game theory is applied to address conflict, smaller weight variations are more conducive to reaching equilibrium, causing less disruption to the human–machine balance and reducing driver discomfort. Existing studies have focused primarily on collaborative steering control of drivers and controllers, yet they have overlooked the latent cooperative potential in longitudinal control; in future research, intelligent driving vehicles will need to execute lateral and longitudinal control simultaneously to cope with more complex traffic scenarios.
From the derived analytical expression of the leader–follower game equilibrium control strategy, it can be seen that the optimal human–machine steering control strategy within the leader–follower collaborative framework is directly associated with the driving intentions of both the human and the machine, the vehicle's state, and the driving weights. Accurate extraction of the driver's intentions and the rational selection of the driving weights therefore have a direct bearing on the performance of the human–machine hybrid system. Future research should center on improving the ability to predict driver intentions, along with applying reinforcement learning to the field of intelligent driving [22,23,24]; for instance, reinforcement learning can be employed as the driving system for intelligent vehicles in specific scenarios, with the aim of further enhancing the overall performance of the human–machine co-driving system. The development of high-fidelity traffic models that simulate the social behavior of vehicles is conducive to the preliminary calibration of the control algorithm parameters and can effectively save development time and effort. In this study, only joint simulation verification was conducted, which has certain deficiencies for practical engineering applications; specifically, the control performance of the designed strategy has not been verified through hardware-in-the-loop bench tests with drivers or in real vehicles.
In future hardware-in-the-loop tests, the following aspects need to be addressed: first, the encapsulation of the controller model and code generation; second, the deployment on real-time platforms to achieve real-time operation of the controller on physical hardware; third, the establishment of software and hardware interfaces for signal interaction with the virtual vehicle dynamics model to form a closed-loop operation; and fourth, data collection and performance evaluation to ensure that the system meets the requirements of embedded deployment and real-time performance. This will promote its application in actual intelligent driving systems.

Author Contributions

Conceptualization: Z.Z., J.Z. (Jianfeng Zheng), and J.Z. (Jingbo Zhao); methodology: Z.Z.; software: Z.Z. and J.Z. (Jingbo Zhao); validation: J.Z. (Jingbo Zhao); investigation: J.Z. (Jianfeng Zheng), H.L., and J.Z. (Jingbo Zhao); data curation: Z.Z.; writing—original draft preparation: Z.Z.; writing—review and editing: Z.Z., J.Z. (Jianfeng Zheng), H.L., and J.Z. (Jingbo Zhao); supervision: J.Z. (Jingbo Zhao); project administration: Z.Z., J.Z. (Jianfeng Zheng), and J.Z. (Jingbo Zhao); funding acquisition: J.Z. (Jingbo Zhao) and H.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the International Joint Laboratory for Operation Safety and Integrated Control of New Energy Vehicles, grant number CZ20230026; the Chang-zhou Intelligent Networked Vehicle Collaborative Control International Joint Laboratory, grant number CZ20220030; the Basic Science (Natural Science) Research Project of Higher Education in Jiangsu Province, grant number 22KJA580001; and the National Natural Science Foundation of China, grant number 62273061.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Tang, F.; Gao, F.; Wang, Z. Driving Capability-Based Transition Strategy for Cooperative Driving: From Manual to Automatic. IEEE Access 2020, 8, 139013–139022. [Google Scholar] [CrossRef]
  2. Gao, F.; He, B.; He, Y. Detection of Driving Capability Degradation for Human-Machine Cooperative Driving. Sensors 2020, 20, 1968. [Google Scholar] [CrossRef]
  3. Wu, J.; Kong, Q.; Yang, K.; Liu, Y.; Cao, D.; Li, Z. Research on the Steering Torque Control for Intelligent Vehicles Co-Driving With the Penalty Factor of Human–Machine Intervention. IEEE Trans. Syst. Man Cybern. Syst. 2023, 53, 59–70. [Google Scholar] [CrossRef]
  4. Xing, Y.; Lv, C.; Cao, D.; Hang, P. Toward human-vehicle collaboration: Review and perspectives on human-centered collaborative automated driving. Transp. Res. Part C Emerg. Technol. 2021, 128, 103199. [Google Scholar] [CrossRef]
  5. Fang, Z.; Wang, J.; Wang, Z.; Liang, J.; Liu, Y.; Yin, G. A Human-Machine Shared Control Framework Considering Time-Varying Driver Characteristics. IEEE Trans. Intell. Veh. 2023, 8, 3826–3838. [Google Scholar] [CrossRef]
  6. Xu, Q.; Zhang, X.; Lei, Z.; Wen, G.; Li, X.; Su, Y.; Gu, S. Robust path-tracking control for intelligent vehicles considering human–vehicle multi-factors. J. Vib. Control. 2025. [Google Scholar] [CrossRef]
  7. Xie, W.; Lu, S.; Dai, L.; Jia, R. Interactive strategy of shared control vehicle and automated vehicles based on adaptive non-cooperative game. Proc. Inst. Mech. Eng. Part D J. Automob. Eng. 2024. [Google Scholar] [CrossRef]
  8. Zhao, X.; Yin, Z.; He, Z.; Nie, L.; Li, K.; Kuang, Y.; Lei, C. Indirect Shared Control Strategy for Human-Machine Cooperative Driving on Hazardous Curvy Roads. IEEE Trans. Intell. Veh. 2023, 8, 2257–2270. [Google Scholar] [CrossRef]
  9. Liu, J.; Guo, H.; Meng, Q.; Shi, W.; Gao, Z.; Chen, H. Game-Theoretic Driver-Automation Cooperative Steering Control on Low-Adhesion Roads With Driver Neuromuscular Delay. IEEE Trans. Intell. Transp. Syst. 2024, 25, 10115–10130. [Google Scholar] [CrossRef]
  10. Dai, C.; Zong, C.; Zhang, D.; Hua, M.; Zheng, H.; Chuyo, K. A Bargaining Game-Based Human–Machine Shared Driving Control Authority Allocation Strategy. IEEE Trans. Intell. Transp. Syst. 2023, 24, 10572–10586. [Google Scholar] [CrossRef]
  11. Zhou, Y.; Huang, C.; Hang, P. Game Theory-Based Interactive Control for Human–Machine Cooperative Driving. Appl. Sci. 2024, 14, 2441. [Google Scholar] [CrossRef]
  12. Feng, J.; Yin, G.; Liang, J.; Lu, Y.; Xu, L.; Zhou, C.; Peng, P.; Cai, G. A Robust Cooperative Game Theory-Based Human–Machine Shared Steering Control Framework. IEEE Trans. Transp. Electrif. 2024, 10, 6825–6840. [Google Scholar] [CrossRef]
  13. Lu, X.; Zhao, H.; Li, C.; Gao, B.; Chen, H. A Game-Theoretic Approach on Conflict Resolution of Autonomous Vehicles at Unsignalized Intersections. IEEE Trans. Intell. Transp. Syst. 2023, 24, 12535–12548. [Google Scholar] [CrossRef]
  14. Liu, M.; Wan, Y.; Lewis, F.L.; Nageshrao, S.; Filev, D. A Three-Level Game-Theoretic Decision-Making Framework for Autonomous Vehicles. IEEE Trans. Intell. Transp. Syst. 2022, 23, 20298–20308. [Google Scholar] [CrossRef]
  15. Liu, M.; Kolmanovsky, I.; Tseng, H.E.; Huang, S.; Filev, D.; Girard, A. Potential Game-Based Decision-Making for Autonomous Driving. IEEE Trans. Intell. Transp. Syst. 2023, 24, 8014–8027. [Google Scholar] [CrossRef]
  16. Li, G.; Tian, T.; Song, J.; Li, N.; Bai, H. Research on trajectory tracking control of driverless cars based on game theory. Proc. Inst. Mech. Eng. Part D J. Automob. Eng. 2024, 239, 1536–1553. [Google Scholar] [CrossRef]
  17. Wang, H.; Zheng, W.; Zhou, J.; Feng, L.; Du, H. Intelligent vehicle path tracking coordinated optimization based on dual-steering cooperative game with fault-tolerant function. Appl. Math. Model. 2024, 139, 115808. [Google Scholar] [CrossRef]
  18. Gong, J. Model Predictive Control of Unmanned Vehicles; Beijing Institute of Technology Press Co., Ltd.: Beijing, China, 2020. [Google Scholar]
  19. Guo, K.; Guan, H. Modelling of driver/vehicle directional control system. Veh. Syst. Dyn. 1993, 22, 141–184. [Google Scholar] [CrossRef]
  20. Rasekhipour, Y.; Khajepour, A.; Chen, S.K.; Litkouhi, B. A potential field-based model predictive path-planning controller for autonomous road vehicles. IEEE Trans. Intell. Transp. Syst. 2016, 18, 1255–1267. [Google Scholar] [CrossRef]
  21. Wakasugi, T. A study on warning timing for lane change decision aid systems based on driver’s lane change maneuver. In Proceedings of the 19th International Technical Conference on the Enhanced Safety of Vehicles, Washington, DC, USA, 6–9 June 2005. Paper No. 05-0290. [Google Scholar]
  22. Wang, S.; Jia, D.; Weng, X. Deep reinforcement learning for autonomous driving. arXiv 2018, arXiv:1811.11329. [Google Scholar]
  23. Tseng, K.-K.; Yang, H.; Wang, H.; Yung, K.L.; Lin, R.F.Y. Autonomous driving for natural paths using an improved deep reinforcement learning algorithm. IEEE Trans. Aerosp. Electron. Syst. 2022, 58, 5118–5128. [Google Scholar] [CrossRef]
  24. Islam, F.; Ball, J.E.; Goodin, C.T. Enhancing longitudinal velocity control with attention mechanism-based deep deterministic policy gradient (DDPG) for safety and comfort. IEEE Access 2024, 12, 30765–30780. [Google Scholar] [CrossRef]
Figure 1. The human–machine master–slave game-based cooperative control structure with adaptive weights.
Figure 2. Three-degree-of-freedom vehicle model.
Figure 3. An image of the CarSim interface.
Figure 4. A control diagram of MPC.
Figure 5. The structure of the fuzzy logic controller.
Figure 6. The membership functions of the variables in the fuzzy logic controller.
Figure 7. Driving weight surface.
Figure 8. A schematic illustration of the experimental scenario.
Figure 9. The simulation results of the model for driver A. (a) Travel trajectory, (b) the curve of the variation in the FLC variable, (c) the degree of the driver's effort, and (d) yaw rate.
Figure 10. The simulation results of the model for driver B. (a) Travel trajectory, (b) the curve of the variation in the FLC variable, (c) the degree of the driver's effort, and (d) yaw rate.
Table 1. The values of the driver model parameters.

Parameter    Driver A    Driver B
T_p          1           0.9
T_d          0.1         0.2
T_h          0.2         0.3
D_s          1           0.6
