Next Article in Journal
Recent Advances in Raman Spectral Classification with Machine Learning
Previous Article in Journal
Real-Time Fluorescence-Based COVID-19 Diagnosis Using a Lightweight Deep Learning System
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Learning-Augmented MPC for Autonomous Vehicle Path Tracking via Ensemble Residual Dynamics Learning

School of Automotive Studies, Tongji University, 4800 Cao’an Highway, Jiading District, Shanghai 201804, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Sensors 2026, 26(1), 340; https://doi.org/10.3390/s26010340
Submission received: 10 November 2025 / Revised: 24 December 2025 / Accepted: 29 December 2025 / Published: 5 January 2026
(This article belongs to the Section Vehicular Sensing)

Abstract

Accurate vehicle dynamics modeling is essential for path tracking control, especially under sharp-curvature or rapidly changing conditions where nonlinear and time-varying behaviors introduce significant discrepancies between the nominal model and real vehicle responses, ultimately degrading the performance of traditional Model Predictive Control (MPC). To address this challenge, this paper proposes a learning-augmented MPC framework that incorporates an ensemble learning-based Data-Driven Dynamics Refinement (DDR) Model to enhance predictive accuracy and control robustness. The DDR Model complements nominal vehicle dynamics by capturing complex behaviors that are difficult to represent analytically. An ensemble of independently trained neural predictors is employed to improve generalization performance and provide stable refinement across diverse driving conditions. Furthermore, a feature-driven activation mechanism is designed to selectively apply refinement only when pronounced nonlinear behaviors arise, thereby reducing unnecessary computational burden. High-fidelity simulation studies validate the effectiveness of the proposed method. In single- and double-lane-change scenarios, the refined dynamics reduce maximum lateral deviation by approximately 6 cm and 4 cm, and decrease the maximum vehicle heading error by 0.02 rad and 0.015 rad, respectively, demonstrating significant improvements in tracking accuracy and robustness.

1. Introduction

1.1. Motivation

The evolution of autonomous driving technologies is driving significant progress in vehicle motion control techniques [1,2]. MPC, owing to its well-recognized capability to handle multiple constraints and generate optimal control sequences, has been widely adopted for autonomous vehicle motion control. Within the theoretical framework of MPC, the controller performs multi-step predictions of the dynamic system states based on a predictive model, which constitutes the core of the control architecture. The accuracy of these predictions directly determines the control performance: inaccurate models introduce prediction errors that can significantly degrade trajectory tracking and stability, whereas precise models enable robust control and reliable vehicle motion. Consequently, developing an accurate and reliable predictive model is essential for achieving high-performance vehicle motion control, particularly under complex and time-varying driving conditions.
Nevertheless, achieving accurate vehicle dynamics modeling remains highly challenging, as the system consists of multiple strongly coupled subsystems with intricate interdependencies [3]. A key challenge lies in tire–road interaction, which exhibits pronounced nonlinearities and uncertainties that are difficult to capture using conventional mathematical models. Consequently, existing vehicle models face a fundamental trade-off: simplified kinematic or low-order dynamic models ensure computational efficiency but fail to reproduce critical nonlinear behaviors such as tire saturation, load transfer, and varying road conditions; in contrast, high-fidelity models provide improved accuracy but are computationally prohibitive for real-time applications. Therefore, reconciling real-time feasibility with high modeling accuracy remains a central challenge in the design of MPC-based motion control systems for autonomous vehicles. These limitations suggest that physics-based models alone are insufficient for representing all dynamic effects. However, instead of replacing the nominal model, a more practical approach is to retain the analytical model as the structural foundation and use data-driven learning to specifically capture the modeling error, i.e., the unmodeled and time-varying dynamics.

1.2. Literature Review

Many scholars have proposed various physical models to characterize vehicle dynamics. Some studies [4,5,6] have employed simplified kinematic models in the design of MPC. However, kinematic models primarily focus on the geometric relationships of vehicle motion and fail to accurately capture the vehicle’s dynamic characteristics. Building upon this topic, subsequent research [7,8,9] has proposed a single-track dynamic model that incorporates tire cornering characteristics, providing a more accurate representation of vehicle dynamics. Kabzan et al. [10] proposed a scaling factor-based approach that integrates the single-track dynamic model with the kinematic model, thereby extending the applicability of the fused model and significantly enhancing its accuracy across a broader operational range. However, due to inherent characteristics such as axle load transfer and adherence ellipse constraints during vehicle motion, tire models that assume fixed cornering stiffness exhibit significantly reduced accuracy under varying operating conditions.
In vehicle dynamics research, many scholars have focused on accurately characterizing the interaction forces between tires and the road surface, among which cornering stiffness plays a critical role in lateral and yaw dynamics. However, cornering stiffness is not a fixed value; it is influenced by various factors such as tire construction, inflation pressure, normal ground force, longitudinal tire force, wheel camber angle, and tire wear. Consequently, vehicle models that assume a fixed cornering stiffness struggle to accurately capture lateral and yaw behaviors.
Some studies [11,12] have employed the Recursive Least Squares (RLS) algorithm to enable real-time estimation of tire cornering stiffness for trajectory tracking control. Chu et al. [13] employed Unscented Kalman Filtering (UKF) to estimate cornering stiffness, dynamically correcting the vehicle dynamics model with the estimated results, thereby significantly enhancing both model accuracy and adaptability to varying driving conditions. Additionally, Zhang et al. [14] introduced a Square-Root Cubature Kalman Filtering (SCKF)-based approach for tire lateral force observation. By computing the deviation between observed values and predictions from a linear tire model, a cornering stiffness correction factor is incorporated to enable adaptive adjustment of tire cornering stiffness. While cornering stiffness correction enhances model accuracy, it compensates only for a subset of the modeling errors in vehicle dynamics.
In addition, in traditional control frameworks, Active Disturbance Rejection Control (ADRC) and other model-free or disturbance-estimation-based techniques are often employed to compensate for unknown disturbances and enhance system robustness [15,16]. For example, Wang et al. [17] proposed an ultra-local model predictive control (ULMPC) approach, which replaces an explicit system model with an ultra-local representation and estimates aggregated dynamics online, enabling model-free predictive control for automated vehicle trajectory tracking. Xie et al. [18] developed a disturbance rejection MPC (DRMPC) framework for input-affine nonlinear systems, where a disturbance observer actively compensates matched disturbances, while residual uncertainties are handled through constraint tightening and terminal design to ensure robustness and stability. More recently, Vázquez-Cruz et al. [19] combined MPC with ADRC principles by estimating a lumped total disturbance via an extended state observer and incorporating it into both feedforward compensation and constrained MPC optimization, achieving computationally efficient and robust control under limited computational resources. However, despite their effectiveness in disturbance rejection, these approaches typically lump complex uncertainties into low-dimensional disturbance terms and therefore remain insufficient for capturing the high-dimensional, strongly nonlinear, and operating-condition-dependent modeling errors inherent in vehicle dynamics.
In recent years, data-driven approaches have garnered widespread attention due to their powerful fitting capabilities. These methods have been introduced into vehicle dynamics modeling, where the vehicle model is typically decomposed into a standard dynamics model and a residual dynamics model. Some studies [10,20,21] have employed Gaussian Process Regression (GPR) to fit the residual model, capturing the discrepancy between the single-track dynamics model and actual vehicle dynamics. To enhance the computational efficiency of the GPR-based residual model, Li et al. [22] conducted an in-depth analysis of the error generation mechanisms in the dynamic model. By reducing the input features of GPR from five dimensions to three, while imposing strict physical constraints on these features, a low-dimensional residual model was constructed. The GPR-based residual modeling paradigm significantly improves the accuracy of vehicle dynamics modeling, thereby enhancing the control performance of high-speed autonomous racing vehicles.
To reduce the computational burden caused by the number of inducing points in GPR, clustering methods are typically employed to partition the dataset [23], with a separate GPR model fitted to each cluster. While this clustering-based GPR residual modeling approach has demonstrated significant effectiveness in data-driven modeling, it still suffers from the following fundamental limitations:
  • High-Dimensional and Nonlinear Characteristics of Vehicle Dynamics. Vehicle dynamics exhibit strong nonlinearities and complex couplings among multiple states, resulting in a high-dimensional and highly correlated feature space. Clustering-based modeling approaches rely heavily on manual feature engineering and predefined similarity metrics, which may overlook latent dependencies and fail to capture the intrinsic structure of nonlinear dynamic behaviors. Furthermore, the need to specify the number of clusters (e.g., the K value in K-means) and the ambiguity of cluster boundaries restricts the generalization capability of these models.
  • Inherent Trade-off Between Modeling Accuracy and Real-time Efficiency. Achieving high prediction accuracy often requires sophisticated models capable of capturing nonlinear and time-varying behaviors. However, such models tend to be computationally expensive and unsuitable for real-time control. Conversely, simplified models offer high computational efficiency but inevitably sacrifice accuracy. This trade-off between precision and real-time performance remains a persistent bottleneck in model-based vehicle motion control frameworks.
  • Information Loss Caused by Feature Reduction. To reduce computational complexity, many existing clustering-based modeling methods apply dimensionality reduction to the input features prior to regression or learning. Such preprocessing inevitably discards critical dynamic information, weakening the model’s ability to represent coupled vehicle dynamics under diverse conditions. The resulting information loss limits both accuracy and adaptability, especially when the model is used for predictive control in complex scenarios.

1.3. Contribution

Accurate vehicle dynamics modeling is fundamental to the performance of MPC. However, conventional physics-based models often struggle to balance prediction accuracy with real-time feasibility. To address this issue, this study proposes a hybrid modeling framework that combines a physics-based vehicle dynamics model with a DDR module. The physics-based model serves as the primary prediction structure, while the ensemble learning refinement module is selectively activated based on prediction errors to compensate for unmodeled and time-varying dynamics, thereby improving modeling accuracy without compromising real-time performance. Compared with traditional residual modeling methods based on GPR or clustering, the proposed approach employs an ensemble of lightweight predictors, which eliminates the need for manual clustering and provides stronger nonlinear approximation capability, enhanced robustness, and computational efficiency. The refined dynamics model is directly integrated into the MPC prediction layer, enabling more accurate and robust trajectory tracking across various driving scenarios. The main contributions of this study are summarized as follows:
  • An ensemble learning-based dynamics refinement method is proposed. The ensemble model integrates multiple independently trained predictors to characterize the discrepancy between the nominal model and real vehicle responses. This approach eliminates the need for manual clustering and enhances the adaptability, stability, and accuracy of data-driven dynamic modeling.
  • A feature-driven activation mechanism is developed to adaptively regulate the refinement model. By comparing prediction errors with a predefined threshold, the controller dynamically determines whether refinement is necessary: when errors remain small, refinement is disabled to reduce computational cost; once the threshold is exceeded, the refinement model is activated to enhance predictive accuracy. This strategy effectively balances modeling precision and real-time feasibility.
  • The proposed ensemble learning-enhanced MPC framework is validated through high-fidelity simulations under single lane-change, double lane-change, and slalom (serpentine) scenarios. The results demonstrate that the proposed method significantly improves modeling accuracy, real-time performance, and trajectory tracking precision compared with conventional MPC. In lane-change maneuvers, lateral tracking errors are reduced by approximately 15% and 20%, while the slalom test further verifies the superior stability and robustness of the proposed approach under rapidly varying curvature and highly dynamic steering conditions.
The remainder of this paper is organized as follows: Section 2 presents the construction of the ensemble learning-based dynamics refinement model and describes the corresponding training and activation mechanisms in detail. Section 3 introduces the trajectory tracking control framework, in which the refined dynamic model is integrated into the MPC formulation. Section 4 provides high-fidelity simulation results under single and double lane-change scenarios to validate the effectiveness of the proposed approach. Finally, Section 5 concludes the paper and discusses future research directions.

2. Vehicle Dynamics Model

The established vehicle dynamics model is shown in Figure 1. This three-degree-of-freedom (3-DOF) model is a widely adopted and well-validated framework in the vehicle dynamics and control literature, used to describe the longitudinal, lateral, and yaw motions of the vehicle. The state variables of the model are as follows:
x = X , Y , φ , v x , v y , ω , δ T .
where p = [ X , Y ] T denotes the vehicle position, and ϕ represents the heading angle. v x and v y are the longitudinal and lateral velocities, respectively, and ω is the yaw rate. These states are measured or estimated using onboard pose sensors, such as GNSS/INS and inertial sensors. The front wheel steering angle is denoted by δ , which is obtained from steering sensors. The control input is defined as u = Δ δ , representing the variation in the steering angle. Here, the state variables are defined in the global (inertial) coordinate frame, which is defined as a right-handed Cartesian coordinate system in the horizontal plane. The X- and Y-axes represent the global longitudinal and lateral directions, respectively.
We construct vehicle dynamics using the Dynamics with Data-driven Refinement (DDR) model, which augments a physics-based analytical formulation with an additional compensation term learned through an ensemble learning framework. The analytical component provides the structured representation of the vehicle dynamics, while the ensemble-based refinement captures nonlinear and time-varying behaviors that are difficult to express explicitly. Together, these components form the DDR model, which improves the fidelity of the overall dynamic representation.
x ˙ = f ( x , u ) = f a n a ( x , u ) + Γ k g E N S ( z )
z = B z [ x T u T ] T
where f a n a represents the analytical dynamics, g E N S the ensemble learning-based compensation model, z is the feature vector formed from states x and inputs u via the feature-selection matrix B z , and Γ k via a component-wise activation matrix determining which states are refined at time step k .
This formulation ensures that the DDR model retains the computational efficiency and interpretability of the analytical dynamics while benefiting from the adaptive refinement capability provided by ensemble learning.

2.1. Refinement Activation Rule

When the vehicle dynamics operate within the linear region, the contribution of the refinement model is typically small. In such cases, the compensation mechanism yields only marginal improvement, as the refined prediction becomes nearly identical to that of the analytical model, while unnecessarily increasing computational cost.
To address the aforementioned issue, we propose a conditional activation strategy based on dynamic characteristics. In this framework, the analytical dynamics are employed as an integrator, where the measured state x k of the vehicle at time step k is combined with the optimal control input u k computed by the MPC to predict the next-step state x ^ k + 1 . Subsequently, the prediction error e k + 1 is calculated as the difference between the predicted state x ^ k + 1 and the measured state x k + 1 obtained from sensors at time step k + 1 . The data-driven refinement is then applied in a component-wise manner. We define a diagonal gating matrix Γ k = diag ( γ v y , k , γ ω , k ) , where
γ v y , k = H e v y , k τ v , γ ω , k = H e ω , k τ ω
and H denotes the Heaviside step function defined as
H ( x ) = 1 , x > 0 0 , x 0
Here, τ v y and τ ω predefined activation thresholds for the lateral velocity-related and yaw-rate-related components, respectively. This activation rule enables the refinement model to operate selectively by turning on compensation only when the prediction error exceeds the threshold. Consequently, redundant computations are avoided during benign operating conditions, while the model’s adaptability is enhanced under nonlinear or time-varying dynamics, providing a more efficient and robust refinement mechanism for complex vehicle systems.

2.2. Analytical Dynamic Model

In this study, the analytical dynamic model is derived by comprehensively considering both the kinematic relationships and the dynamic characteristics of the vehicle. Based on the single-track assumption, a three-degree-of-freedom (3-DOF) model is formulated to capture the longitudinal, lateral, and yaw motions of the vehicle. The model integrates the effects of lateral tire forces, yaw moment, and vehicle motion constraints to accurately describe the vehicle’s dynamic behavior. The structure and variables of the model are illustrated in Figure 1. The model expression is given as follows:
x ˙ = v x cos φ v y sin φ v x sin φ + v y cos φ ω 1 m F x r F y f sin δ + F x f cos δ + v y ω 1 m F y r + F y f cos δ + F x f sin δ v x ω 1 I z ( F y f cos δ + F x f sin δ ) l f F r , y l r δ ˙
where F x f and F x r represent the longitudinal forces exerted by the tires on the front and rear axles, respectively, while F y f and F y r correspond to the lateral forces on the front and rear tires. The variable m denotes the mass of the vehicle, I z refers to the vehicle’s yaw moment of inertia, and l f , l r are the distances from the vehicle’s center of gravity to the front and rear axles, respectively. The tire forces F x f , F x r , F y f , and F y r are modeled using the widely adopted simplified Magic Formula (Pacejka) model [24], which characterizes the nonlinear mapping from slip ratio λ and slip angle α to longitudinal and lateral tire forces.
α f = arctan v y + l f ω v x δ
α r = arctan v y l r ω v x
λ = n R v x n R , v x < n R v x n R v x , v x n R
where n denotes the wheel rotational speed, and the Magic Formula is given by the following:
Y ( x ) = D sin C arctan B x E ( B x arctan ( B x ) ) )
where B , C , D , and E are parameters of the Magic Formula following the fitting process, and the parameters are presented in Table 1.

2.3. DDR Model

To enhance the predictive capability of the analytical dynamic model and improve the overall performance of MPC, this study develops a data-driven dynamics refinement framework based on an ensemble learning architecture. As illustrated in Figure 2, the workflow consists of three main stages: (i) data collection and preprocessing using a high-fidelity simulation platform; (ii) training the ensemble learning-based refinement model with the processed dataset; and (iii) validating and fine-tuning the trained model through iterative optimization to ensure robust performance under diverse driving conditions.

2.3.1. Data Collection and Processing

A high-fidelity vehicle dynamics model is established in a CarSim–Simulink (CarSim 2022.1; Simulink R2023a) co-simulation environment. Open-loop simulation data are collected at 10 Hz under a wide range of speed and steering inputs to cover representative driving scenarios. In addition, controller-in-the-loop simulations are conducted, where an MPC executes trajectory tracking tasks to generate closed-loop data for model learning. All collected data are filtered, normalized, and synchronized to form the final dataset for training, comprising approximately 2.6 GB of recordings.

2.3.2. Model Training

To establish the mapping between the system state–input pair and the dynamics refinement term, the ensemble refinement model is trained using supervised learning. The sampling period is defined as Δ t = t k + 1 t k . Given ( x k , u k ) and using the analytical dynamic model as the state integrator, the one-step prediction is
x ^ k + 1 = x k + Δ t f a n a ( x k , u k )
The refinement target is computed by comparing the predicted state with the sensor-derived state at the next time step. Specifically, the correction term is defined as
y e , k = x k + 1 x ^ k + 1 Δ t + w k
where w k denotes the stochastic disturbance term. The input consists of the state vector x k and the control input u k , while the supervisory signal corresponds to the residual output y e , k . Through supervised regression, the ensemble learning framework establishes a nonlinear mapping
g E N S : z k y k
which enables the model to capture unmodeled dynamics, time-varying behaviors, and systematic prediction biases of the analytical model. To improve robustness and generalization, the ensemble model integrates M lightweight base learners. Each learner independently estimates a refinement component, and the final output is obtained by averaging the predictions:
g E N S z k = 1 M i = 1 M g i z k
Each base learner g E N S is implemented as a shallow multilayer perceptron (MLP) to ensure fast inference. All input features are standardized using z-scores, and the training target corresponds to the normalized one-step refinement term y k .
The ensemble model is trained using a supervised regression loss formulated as follows:
L = B d g E N S z k y k 2 2 + λ w d θ 2 2
where θ denotes all learnable parameters of the ensemble, and λ w d is a weight-decay coefficient used to improve generalization and prevent overfitting.
After training, the ensemble-based refinement model is evaluated under representative driving scenarios to assess its prediction accuracy, generalization capability, and real-time computational performance. The validation focuses on the convergence of the refinement term, the consistency between refined and measured dynamics, and the stability of the ensemble output across varying operating conditions. The overall framework and computational procedure of the refinement-enhanced MPC are summarized in Figure 3 and Algorithm 1.
Algorithm 1 Ensemble-Enhanced MPC Algorithm
Require : Current   state   x k ;   lateral   velocity   error   threshold   e v y ;   yaw - rate   error   threshold   e ω , t
Ensure :   Optimal   control   input u k
1:
If   x k is uninitialized then
2:
x k 0
3:
end if
4:
Prediction errors:
5:
e v y , k x v y , k x ^ v y , k Δ t , e ω , k x ω , k x ^ ω , k Δ t
6:
Compute   γ ω , k   and   γ v y , k by (4).
7:
Update the MPC prediction equations by (2).
8:
Set the MPC reference trajectory using the multi-point tracking method.
9:
Solve   the   MPC   to   obtain   the   optimal   control   u k .
10:
State propagation:
11:
x ^ k + 1 x k + Δ t f a n a ( x k , u k )
12:
k k + 1

3. Vehicle Motion Control Framework

3.1. Lateral MPC with Ensemble-Based Dynamic Refinement

In general terms, MPC adjusts the control action by minimizing a cost function L ( x , u ) over a receding prediction horizon, subject to the system dynamics constraints x ˙ = f ( x , u ) , in order to ensure that the system output closely follows the reference states x r ( t ) and control inputs u r ( t ) :
m i n u L ( x , u ) s . t . x ˙ = f ( x , u ) x ( t 0 ) = x 0 r ( x , u ) = 0 h ( x , u ) > 0
where x 0 represents the initial conditions, and h and r are the inequality and equality constraints, respectively. In MPC, the cost function L x , u is most commonly quadratic, as this form enhances the numerical stability of the optimization process. The system is discretized over the prediction horizon T into N steps, with a step size of Δ t = T / N . Additionally, during the MPC optimization process. It is essential to impose appropriate constraints u min u k u max to satisfy the physical limitations of the actuators. Ultimately, the activation rule determines whether the refinement term produced by the ensemble learning model should be incorporated into the dynamic constraints. If the activation condition is not satisfied, the controller relies solely on the analytical dynamic model. Conversely, if the residual rule is triggered, the state sequence x 1 x 2 x N p is first computed using MPC. This state sequence is then input into the model to obtain the predicted residual sequence ω 1 ω 2 ω N p , which then imposes the DDR dynamics as constraints over the subsequent N p steps.
x ˙ k = f ( x , u ) + ω 1 x ˙ k + 1 = f ( x , u ) + ω 2 x ˙ k + N p = f ( x , u ) + ω N p
The system states are discretized using the fourth-order Runge–Kutta method, after which the MPC objective function is expressed in discrete-time form
min u k k = 0 N ( ( y k y r , k ) T Q ( y k y r , k ) + ( u k u r , k ) T R 1 ( u k u r , k ) ) s . t .   x k + 1 = f R K 4 ( x k , u k , Δ t ) x k = x 0 u min u k u max
In this formulation, the control input represents the increment of the front-wheel steering angle, ensuring smooth steering behavior and avoiding abrupt changes in the steering command. The objective function penalizes both the tracking error and the control effort, guiding the vehicle to follow the reference trajectory while maintaining steering smoothness.
To define the physical constraints of the steering actuator, we conducted a dedicated vehicle steering test. The front-wheel angle and its rate of change were recorded under normal operating conditions, and the averaged results were used to determine the admissible bounds. Based on experimental data, the maximum steering angle is limited to 30°, and the allowable increment of the steering angle per control step is constrained to 0.47°. These limits are incorporated into the MPC optimization as input constraints to ensure that the generated control commands remain feasible for the real steering mechanism.
In solving the MPC, the reference control input within the prediction horizon is initialized with the control value from the previous step, and the reference state variables y r , k = X r , k Y r , k φ r , k ( k = 1 , 2 , , N ) are generated through a multi-point tracking method. Specifically, the reference state is determined based on the current longitudinal speed v x and the discrete time step. The theoretical travel distance for a single time step is calculated as s = v x Δ t . A real-time trajectory projection is employed to find the matching point P d on the reference path corresponding to the vehicle’s look-ahead control point:
P e = X Y + l s cos φ sin φ ,
where l s denotes the look-ahead distance, the corresponding state values X , Y , and φ are then taken as the reference state for the first step of the prediction horizon y r 1 . To construct the complete reference sequence, the number of trajectory point jumps n = s δ s is calculated based on the discretization step size of the reference trajectory, where denotes the floor operator to ensure valid indexing, δ s denotes the arc-length spacing between adjacent points on the reference trajectory. Starting from the matching point P d , the reference states for the subsequent N steps are sequentially extracted along the reference trajectory with a step size of n , forming the reference sequence. Multi-point tracking enables the controller to consider the upcoming curves along the path, allowing for more effective adjustment of the control strategy. Using multi-point preview enables the controller to anticipate upcoming curvature and heading changes within the prediction horizon, improving tracking accuracy.
To efficiently solve this quadratic optimization problem, a multiple shooting method is employed to formulate the problem model, and the Sequential Quadratic Programming (SQP) method is used for real-time iterative solving. All implementations are carried out based on the high-performance computing frameworks ACADOS (v0.2.5) [25] and CasADi (v3.3.5) [26].

3.2. Feedforward Augmentation

Feedforward control predicts and adjusts the system output based on known system input information, effectively reducing system errors. When the curvature of the reference trajectory is large, feedforward control can respond proactively, mitigating overshoot and bringing the output signal closer to the desired value. Due to the inherent understeering characteristics in vehicle design, such as high-speed operation, changes in wheel toe, and elastic deformations of the steering system, the understeering behavior becomes more pronounced. Vehicle understeering can be represented as follows:
k u s = m · l r K f · l m · l f K r · l
where l is the wheelbase of the vehicle; K f is the front axle cornering stiffness; and K r is the rear axle cornering stiffness. The nonlinear feedforward control law is designed based on the curvature of the matching point corresponding to the vehicle’s lookahead point:
δ f f = l · ρ + k u s · v x 2 · ρ
where δ f f represents the feedforward steering angle, and ρ denotes the curvature of the matching point. Ultimately, the feedforward steering angle computed by the feedforward algorithm is combined with the steering angle obtained from the MPC to form the final steering command, which is then input into the actuator to perform the corresponding steering operation.

4. Simulation Results and Analysis

This study develops a high-fidelity validation environment by integrating the CarSim-Python co-simulation platform with real-time control systems. A vehicle simulation model is built in CarSim, and the key parameters of this model are provided in Table 2. All vehicle parameter values are factory-calibrated and used to construct a high-fidelity vehicle model. They represent the intrinsic physical properties of the vehicle and therefore do not vary with weather or road conditions. Within this environment, the Industrial PC (IPC) executes the proposed control strategy and facilitates data exchange with the CarSim FMU interface on a PC, where dynamic simulations are carried out using Python (v3.9.13). A USB-CAN interface enables bidirectional CAN signal communication between the IPC and a secondary PC running Python. Acados is employed to generate C++ code, serving as a fast embedded development framework with efficient nonlinear optimization solving capabilities based on the HPIPM algorithm. This allows rapid deployment and real-time execution of the control algorithm. In addition, the ensemble learning model is accelerated using TensorRT (v8.6.1) to ensure low-latency inference during closed-loop simulations. The overall workflow, illustrated in Figure 4, establishes a reliable and efficient experimental environment for validating advanced and computationally demanding control strategies.

4.1. Ensemble Model Training and Evaluation

The ensemble-based refinement model is trained using the collected offline simulation data. The model training and evaluation pipeline is developed based on the PyTorch (v1.13.1)deep learning framework. The Adam optimizer is employed with an initial learning rate of 0.001, while an exponential decay learning rate scheduler (decay factor of 0.98) is used to gradually reduce the learning rate and improve model stability. In the ensemble architecture, multiple lightweight base learners are trained jointly, and their outputs are averaged to produce the final refinement term. The dataset used for the experiments is split into 70% for training and 30% for testing. The loaded dataset is standardized to ensure that input features have zero mean and unit variance, enhancing the model’s convergence stability. The training is conducted on an NVIDIA RTX 4090 GPU with 300 training epochs. During training, the model is evaluated on the test set every 10 epochs, and the average loss is recorded. The training results are shown in Figure 5.
Figure 6 shows validation on the collected test dataset: rolling integration predictions of vehicle states over a 0.5 s horizon, given random initial conditions and varying control inputs. In this study, the MPC variable is defined as the front-wheel steering angle increment. Figure 6a,d present the variations in control inputs and the corresponding front-wheel steering angles. Figure 6b,e show that, after incorporating the ensemble-based refinement model, the analytical dynamic model achieves a substantial improvement in the prediction accuracy of the vehicle’s lateral velocity. Similarly, Figure 6c,f demonstrate that the refined model also provides more accurate yaw-rate predictions compared with the uncompensated analytical model. The quantitative results are summarized in Table 3.

4.2. Real-Time Control Strategy Validation Through Co-Simulation

At relatively high vehicle speeds, the mismatch between nominal physics-based vehicle dynamics models and actual vehicle behavior tends to become more pronounced, particularly in large-curvature maneuvers such as single and double lane-change scenarios. Therefore, a reference speed of 72 km/h is selected in this study to sufficiently expose model uncertainties and to more clearly evaluate the effectiveness of the proposed DDR model in compensating for modeling errors under high-speed and highly dynamic conditions.
Meanwhile, to further enhance the completeness of the comparison and to systematically assess the robustness of the proposed method across different operating conditions, a slalom test at 50 km/h is also designed. Although the vehicle speed in this scenario is relatively lower, the frequent steering reversals and pronounced dynamic characteristics impose more stringent requirements on the transient response and predictive accuracy of the control system, thereby providing a strong and complementary validation to the high-speed lane-change maneuvers.

4.2.1. Double-Lane Change Test

Figure 7 presents the trajectory tracking results under a double-lane-change scenario at a target speed of 72 km/h and a road adhesion coefficient of 0.9. As shown in Figure 7a, both models follow the reference path well, while the refined model exhibits faster and more accurate responses in curved segments. The steering input in Figure 7b is smoother, avoiding high-frequency oscillations. From Figure 7c,d, both controllers maintain stable longitudinal velocity; however, the refined model achieves a smaller overshoot and faster convergence in yaw rate. As seen in Figure 7e,f, the refined model reduces the maximum lateral deviation by approximately 6 cm and decreases the heading error by about 0.02 rad, indicating improved tracking accuracy. Furthermore, Figure 7g,h demonstrate that the refined model predicts lateral and yaw dynamics more consistently with the reference, enabling timely correction of dynamic errors and enhancing overall vehicle stability. Overall, the proposed MPC framework with ensemble-based dynamic refinement achieves superior path-tracking accuracy, smoother control actions, and enhanced dynamic stability compared with the nominal model.

4.2.2. Single-Lane Change Test

Figure 8 illustrates the path-tracking performance under a single-lane-change maneuver at a reference speed of 72 km/h and a road adhesion coefficient of 0.9. As shown in Figure 8a, both control models successfully follow the planned trajectory, while the refined model exhibits higher responsiveness in regions with large steering variations. In Figure 8b, the refined model generates smoother steering inputs and effectively suppresses oscillatory behavior. From Figure 8c,d, both models maintain stable longitudinal velocity, but the refined model produces a yaw-rate response that aligns more closely with the reference and shows reduced overshoot. Figure 8e,f reveal that the refined model achieves smaller lateral and heading errors, reducing the maximum lateral deviation by about 4 cm and the maximum vehicle heading error by approximately 0.015 rad. Furthermore, as seen in Figure 8g,h, the refined model more accurately captures the variations in lateral and yaw dynamics, thereby improving prediction precision and overall dynamic consistency of the control system. The training and testing losses of the ensemble-based refinement model eventually converge, indicating that the network rapidly learns effective feature representations. Moreover, the small gap between the testing and training losses, without any noticeable overshoot, demonstrates that the model possesses strong generalization capability.

4.2.3. Slalom Test

A serpentine path-tracking experiment is conducted at a constant speed of 50 km/h to evaluate controller performance under rapidly varying curvature conditions. As shown in Figure 9a, both DDR and ANA are able to stably follow the reference path without trajectory divergence, while the longitudinal speed remains well-regulated throughout the maneuver Figure 9c. From the error perspective, Figure 9e,f present the lateral displacement error and heading angle error, respectively. Compared with ANA, DDR exhibits smaller error amplitudes during peak steering phases, with the peak LDE maintained within approximately 0.1 m, indicating improved tracking accuracy under high-frequency curvature transitions. Further analysis of Figure 9b,d show that DDR achieves this error reduction without introducing more aggressive control actions, as evidenced by smoother yaw rate and steering angle responses. Finally, the close agreement between the predicted and reference yaw and lateral accelerations in Figure 9g,h confirm that DDR maintains higher dynamic prediction fidelity under the serpentine maneuver, thereby contributing to improved tracking stability and robustness.

5. Conclusions

This study presents an enhanced MPC framework that integrates an ensemble learning-based DDR model to address fundamental limitations of conventional physics-based vehicle dynamics modeling in high-dynamic driving scenarios. By retaining a nominal physics-based model as the structural backbone and augmenting it with data-driven residual dynamics learning, the proposed framework achieves a favorable balance between modeling accuracy, interpretability, and real-time feasibility.
Methodologically, the vehicle dynamics are explicitly decomposed into nominal and refinement components, enabling the proposed DDR model to effectively capture high-dimensional, nonlinear, and time-varying effects arising from tire–road interactions, load transfer, and parameter uncertainties. Compared with learning-based MPC approaches that rely on Gaussian processes, clustering strategies, or fully data-driven models, the ensemble learning-based DDR framework provides a scalable and computationally efficient alternative without requiring manual feature engineering or predefined operating-condition partitions. Furthermore, the proposed feature-driven activation mechanism selectively enables refinement of dynamics only when significant modeling discrepancies are detected, thereby mitigating unnecessary computational burden and enhancing suitability for real-time control applications.
From an application perspective, extensive high-fidelity simulations under representative high-dynamic driving conditions—including single lane-change, double lane-change, and slalom maneuvers—demonstrate that the proposed DDR-enhanced MPC consistently outperforms traditional MPC. The results show improved model prediction accuracy, smoother control actions, enhanced trajectory tracking performance, and increased robustness across different speeds and maneuver types. These findings indicate that the proposed approach effectively enhances the reliability of MPC in sharp-curvature and rapidly changing driving scenarios, which are critical for real-world intelligent and autonomous vehicle deployment.
Overall, this work contributes a unified, implementable, and extensible learning-augmented MPC solution that bridges the gap between theoretical data-driven modeling and practical vehicle motion control. Future research will focus on extending the framework to coupled longitudinal–lateral control, as well as incorporating online adaptive learning mechanisms to further improve robustness and generalization under more complex and uncertain operating conditions.

Author Contributions

All authors contributed to the design of the study and conducted data analysis and interpretation. Methodology, M.L.; Software, M.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the National Natural Science Foundation of China under Grant No. 52432014; the National Natural Science Foundation of China under Grant No. 52325212; the Fundamental Research Funds for the Central Universities and the United Automotive Electronic Systems Co., Ltd. under Grant No. NE-2003-10; and the SAIC Motor Corporation Limited under Grant No. 2023023.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare that this study received funding from the United Automotive Electronic Systems Co., Ltd. The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication.

References

  1. Stano, P.; Montanaro, U.; Tavernini, D.; Tufo, M.; Fiengo, G.; Novella, L.; Sorniotti, A. Model predictive path tracking control for automated road vehicles: A review. Annu. Rev. Control 2023, 55, 194–236. [Google Scholar] [CrossRef]
  2. Zha, Y.; Deng, J.; Qiu, Y.; Zhang, K.; Wang, Y. A survey of intelligent driving vehicle trajectory tracking based on vehicle dynamics. SAE Int. J. Veh. Dyn. Stab. NVH 2023, 7, 221–248. [Google Scholar] [CrossRef]
  3. Langåker, H.-A. Cautious mpc-Based Control with Machine Learning. Master’s Thesis, NTNU (Norwegian University of Science and Technology), Trondheim, Norway, 2018. [Google Scholar]
  4. Kong, J.; Pfeiffer, M.; Schildbach, G.; Borrelli, F. Kinematic and dynamic vehicle models for autonomous driving control design. In Proceedings of the 2015 IEEE Intelligent Vehicles Symposium (IV), Seoul, South Korea, 28 June–1 July 2015; pp. 1094–1099. [Google Scholar]
  5. Taherian, S.; Montanaro, U.; Dixit, S.; Fallah, S. Integrated trajectory planning and torque vectoring for autonomous emergency collision avoidance. In Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference (ITSC), Auckland, New Zealand, 27–30 October 2019; pp. 2714–2721. [Google Scholar]
  6. Tang, L.; Yan, F.; Zou, B.; Wang, K.; Lv, C. An improved kinematic model predictive control for high-speed path tracking of autonomous vehicles. IEEE Access 2020, 8, 51400–51413. [Google Scholar] [CrossRef]
  7. Vallon, C.; Ercan, Z.; Carvalho, A.; Borrelli, F. A machine learning approach for personalized autonomous lane change initiation and control. In Proceedings of the 2017 IEEE Intelligent Vehicles Symposium (IV), Los Angeles, CA, USA, 11–14 June 2017; pp. 1590–1595. [Google Scholar]
  8. Xie, J.; Xu, X.; Wang, F.; Tang, Z.; Chen, L. Coordinated control based path following of distributed drive autonomous electric vehicles with yaw-moment control. Control Eng. Pract. 2021, 106, 104659. [Google Scholar] [CrossRef]
  9. Hang, P.; Xia, X.; Chen, G.; Chen, X. Active safety control of automated electric vehicles at driving limits: A tube-based mpc approach. IEEE Trans. Transp. Electrif. 2021, 8, 1338–1349. [Google Scholar] [CrossRef]
  10. Kabzan, J.; Valls, M.I.; Reijgwart, V.J.; Hendrikx, H.F.; Ehmke, C.; Prajapat, M.; Bühler, A.; Gosala, N.; Gupta, M.; Sivanesan, R.; et al. Amz driverless: The full autonomous racing system. J. Field Robot. 2020, 37, 1267–1294. [Google Scholar] [CrossRef]
  11. Xie, H.; Xi, Y.; Song, K.; Guo, F. A trajectory tracking control algorithm of selfdriving tractor-trailer based on online estimation of cornering stiffness. In 2022 41st Chinese Control Conference (CCC); IEEE: New York City, NY, USA, 2022; pp. 5414–5419. [Google Scholar]
  12. Zhang, Y.; Wang, W.; Yang, C.; Ma, M. An adaptive model predictive control strategy for path following of autonomous vehicles based on tire cornering stiffness estimation. In Proceedings of the 2021 33rd Chinese Control and Decision Conference (CCDC), Kunming, China, 22–24 May 2021; pp. 1904–1909. [Google Scholar]
  13. Chu, Z.; Chen, N.; Zhang, N.; Li, G. Path-tracking for autonomous vehicles with on-line estimation of axle cornering stiffnesses. In Proceedings of the 2019 Chinese Control Conference (CCC), Guangzhou, China, 27–30 July 2019; pp. 6651–6657. [Google Scholar]
  14. Zhang, Z.; Zheng, L.; Li, Y.; Li, S.; Liang, Y. Cooperative strategy of trajectory tracking and stability control for 4wid autonomous vehicles under extreme conditions. IEEE Trans. Veh. Technol. 2023, 72, 3105–3118. [Google Scholar] [CrossRef]
  15. Yang, Z.; Wang, Z.; Yan, M. An optimization design of adaptive cruise control system based on MPC and ADRC. Actuators 2021, 10, 110. [Google Scholar] [CrossRef]
  16. Xie, H.; Xiao, P. Cooperative adaptive cruise algorithm based on trajectory prediction for driverless buses. Machines 2022, 10, 893. [Google Scholar] [CrossRef]
  17. Wang, Z.; Wang, J. Ultra-local model predictive control: A model-free approach and its application on automated vehicle trajectory tracking. Control Eng. Pract. 2020, 101, 104482. [Google Scholar] [CrossRef]
  18. Xie, H.; Dai, L.; Lu, Y.; Xia, Y. Disturbance rejection MPC framework for input-affine nonlinear systems. IEEE Trans. Autom. Control 2022, 67, 6595–6610. [Google Scholar] [CrossRef]
  19. Vázquez-Cruz, R.I.; Castellanos-Velasco, E.; Guerrero-Castellanos, J.F. Frugal model predictive control and active disturbance rejection for laser beam steering systems. Control Theory Technol. 2025, 23, 513–528. [Google Scholar] [CrossRef]
  20. Hewing, L.; Kabzan, J.; Zeilinger, M.N. Cautious model predictive control using gaussian process regression. IEEE Trans. Control Syst. Technol. 2019, 28, 2736–2743. [Google Scholar] [CrossRef]
  21. Kabzan, J.; Hewing, L.; Liniger, A.; Zeilinger, M.N. Learning-based model predictive control for autonomous racing. IEEE Robot. Autom. Lett. 2019, 4, 3363–3370. [Google Scholar] [CrossRef]
  22. Li, Y.; Huang, C.; Yang, D.; Liu, W.; Li, J. Learning based mpc for autonomous driving using a low dimensional residual model. arXiv 2024, arXiv:2412.03874. [Google Scholar] [CrossRef]
  23. Torrente, G.; Kaufmann, E.; Föhn, P.; Scaramuzza, D. Data-driven mpc for quadrotors. IEEE Robot. Autom. Lett. 2021, 6, 3769–3776. [Google Scholar] [CrossRef]
  24. Pacejka, H.B.; Bakker, E. The magic formula tyre model. Veh. Syst. Dyn. 1992, 21, 1–18. [Google Scholar] [CrossRef]
  25. Verschueren, R.; Frison, G.; Kouzoupis, D.; Frey, J.; Duijkeren, N.v.; Zanelli, A.; Novoselnik, B.; Albin, T.; Quirynen, R.; Diehl, M. Acados—A modular opensource framework for fast embedded optimal control. Math. Program. Comput. 2022, 14, 147–183. [Google Scholar] [CrossRef]
  26. Andersson, J.A.; Gillis, J.; Horn, G.; Rawlings, J.B.; Diehl, M. Casadi: A software framework for nonlinear optimization and optimal control. Math. Program. Comput. 2019, 11, 1–36. [Google Scholar] [CrossRef]
Figure 1. Three-degree-of-freedom single-track vehicle dynamic model.
Figure 1. Three-degree-of-freedom single-track vehicle dynamic model.
Sensors 26 00340 g001
Figure 2. Data-Driven Dynamics Refinement with f DDR = f ANA + f LRD : data are collected from a high-fidelity vehicle model under multiple operating conditions, including open-loop control and driver-in-the-loop control. The analytical dynamics model (ANA) is augmented by a learned residual dynamics model (LRD), which is trained on high-fidelity data using an ensemble learning strategy. The resulting data-driven dynamics refinement (DDR) is then integrated into the MPC framework to achieve accurate and robust trajectory tracking.
Figure 2. Data-Driven Dynamics Refinement with f DDR = f ANA + f LRD : data are collected from a high-fidelity vehicle model under multiple operating conditions, including open-loop control and driver-in-the-loop control. The analytical dynamics model (ANA) is augmented by a learned residual dynamics model (LRD), which is trained on high-fidelity data using an ensemble learning strategy. The resulting data-driven dynamics refinement (DDR) is then integrated into the MPC framework to achieve accurate and robust trajectory tracking.
Sensors 26 00340 g002
Figure 3. Ensemble-Enhanced MPC framework.
Figure 3. Ensemble-Enhanced MPC framework.
Sensors 26 00340 g003
Figure 4. Ensemble-enhanced MPC: an activation rule adaptively determines whether the analytical dynamic model is used alone or augmented with the refinement model during trajectory prediction.
Figure 4. Ensemble-enhanced MPC: an activation rule adaptively determines whether the analytical dynamic model is used alone or augmented with the refinement model during trajectory prediction.
Sensors 26 00340 g004
Figure 5. Residual Dynamics Training for DDR Model.
Figure 5. Residual Dynamics Training for DDR Model.
Sensors 26 00340 g005
Figure 6. Under Identical Inputs: DDR Model and ANA Model Predictions vs. Reference.
Figure 6. Under Identical Inputs: DDR Model and ANA Model Predictions vs. Reference.
Sensors 26 00340 g006aSensors 26 00340 g006b
Figure 7. Double-lane change results.
Figure 7. Double-lane change results.
Sensors 26 00340 g007
Figure 8. Single-lane change results.
Figure 8. Single-lane change results.
Sensors 26 00340 g008aSensors 26 00340 g008b
Figure 9. Slalom results.
Figure 9. Slalom results.
Sensors 26 00340 g009aSensors 26 00340 g009b
Table 1. Fitted Magic Formula parameters for longitudinal and lateral forces.
Table 1. Fitted Magic Formula parameters for longitudinal and lateral forces.
Direction B C D E
Longitudinal force ( F x )−0.2030.9783513.520−9.620
Lateral force ( F y )13.59−2.093−4206.900−1.218
Table 2. Vehicle parameter values used in the simulation.
Table 2. Vehicle parameter values used in the simulation.
ParameterValueParameterValue
Vehicle curb weight (kg)1360Moment of inertia about the Z-axis (kg · m2) 1805
Wheelbase (m)2.035Track width (mm)1356 (front, rear)
Distance from the center of gravity to the front axle (m)1.117Distance from the center of gravity to the rear axle (m)1.188
Center of gravity height (m)0.525Rolling radius of the wheel (m)0.283
Front axle cornering stiffness (N/rad)105,991Rear axle cornering stiffness (N/rad)106,456
Table 3. Comparison of DDR and ANA prediction errors vs. reference under identical inputs.
Table 3. Comparison of DDR and ANA prediction errors vs. reference under identical inputs.
DDR Prediction vs. ReferenceANA Prediction vs. Reference
AverageRMSEAverageRMSE
Test 1 v y 0.01000.01250.09730.1071
ω 0.00380.00450.06270.0670
Test 2 v y 0.00420.00530.09350.1001
ω 0.00460.00630.04860.0573
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xiong, L.; Liu, M.; Xie, Z.; Leng, B.; Zhang, Y. Learning-Augmented MPC for Autonomous Vehicle Path Tracking via Ensemble Residual Dynamics Learning. Sensors 2026, 26, 340. https://doi.org/10.3390/s26010340

AMA Style

Xiong L, Liu M, Xie Z, Leng B, Zhang Y. Learning-Augmented MPC for Autonomous Vehicle Path Tracking via Ensemble Residual Dynamics Learning. Sensors. 2026; 26(1):340. https://doi.org/10.3390/s26010340

Chicago/Turabian Style

Xiong, Lu, Ming Liu, Zhihao Xie, Bo Leng, and Yuanjian Zhang. 2026. "Learning-Augmented MPC for Autonomous Vehicle Path Tracking via Ensemble Residual Dynamics Learning" Sensors 26, no. 1: 340. https://doi.org/10.3390/s26010340

APA Style

Xiong, L., Liu, M., Xie, Z., Leng, B., & Zhang, Y. (2026). Learning-Augmented MPC for Autonomous Vehicle Path Tracking via Ensemble Residual Dynamics Learning. Sensors, 26(1), 340. https://doi.org/10.3390/s26010340

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop