Abstract
In the three-motor hybrid architecture, the auxiliary drive uses an electrically excited synchronous motor (EESM), which offers high torque density, a wide speed range, and strong resistance to demagnetization. However, the strong electromagnetic coupling between the field winding and the armature winding makes current control difficult, and traditional PID control has limitations in dynamic response and disturbance rejection. To solve this problem, a linear active disturbance rejection control (LADRC) method for the rotor of the EESM is proposed in this paper; a linear extended state observer (LESO) is used to estimate and compensate, in real time, the internal and external disturbances of the system (such as winding coupling and parameter perturbation). The method uses only the input and output of the system and does not depend on any mechanical parameters; as a result, the torque response is improved by 50% and the steady-state fluctuation is reduced by 10.2%. In addition, an adaptive dynamic programming (ADP) parameter optimization strategy is proposed to solve the bandwidth-parameter tuning problem of the LADRC algorithm under complex operating conditions, and a mathematical analysis of its optimality properties is given. Finally, the proposed method is compared with the traditional PI controller under several operating conditions of the EESM, and its effectiveness is validated by the corresponding results.
1. Introduction
With the pursuit of extreme performance and efficiency in new energy vehicles, the three-motor drive architecture is becoming increasingly popular. In this architecture, the electrically excited synchronous motor (EESM) used as the auxiliary drive shows significant advantages: its excitation current can be flexibly regulated, giving it excellent flux-weakening control capability and achieving both high-efficiency cruising and rapid dynamic response [,]. However, this also significantly increases the complexity of the motor controller. In addition to the torque distribution and dynamic coordination control mechanism between the main drive and auxiliary drive, each electrically excited motor requires an independent excitation current control loop, which is deeply coupled with the original torque/current control. This necessitates the design of advanced multivariable, multi-objective real-time optimization control strategies that integrate excitation regulation, efficiency maximization, dynamic power demand response, and redundancy management under fault conditions, ensuring that the three-motor control system achieves the optimal balance among performance, efficiency, and robustness.
Owing to its linear, fixed-gain structure, the PID controller lacks the capability to estimate and compensate for internal cross-coupling and external disturbances in real time, particularly under dynamic load conditions and d-axis current coupling disturbances, rendering it inadequate for the advanced requirements of multi-motor coordination, nonlinear decoupling, and multi-objective optimization. Especially given the auxiliary drive's demand for wide-speed operation and fast dynamic response, modern control strategies are required to break through performance bottlenecks while ensuring system robustness. Sliding mode control (SMC) [,] demonstrates improved robustness and faster response than PID, but its performance depends heavily on the design of the sliding surface and the switching gain. The chattering phenomenon inherent in SMC generates harmonic noise; moreover, chattering becomes more severe when the system states approach the sliding surface, limiting its practical application in high-precision scenarios. Model predictive control (MPC) shows good performance, but its superiority diminishes under parameter uncertainties []. Its core limitation lies in its heavy reliance on an accurate mathematical model of the machine, a critical drawback in practical applications, where machine parameters are subject to thermal drift and magnetic saturation. Additionally, for low-cost digital processors, the computational load of solving optimization problems online is a challenge. Against this backdrop, linear active disturbance rejection control (LADRC) [,,] has been proposed; it realizes disturbance-observer design and feedback control with a linear structure, without relying on precise system models, thus greatly simplifying engineering implementation. At the same time, the absence of complex computational procedures substantially reduces the overall computational burden.
Nevertheless, under the complex operating conditions of new energy vehicles, where different motor speeds and throttle depths correspond to varying torque, and frequent variations occur in the stator d-axis current and rotor excitation current, engineers still need to tune bandwidth parameters according to specific operating points to achieve optimal performance.
In the design of complex control systems, achieving a balance between optimal performance and robust stability has always been a central challenge for control engineers. To address this problem, the control theory community has developed various methods, each reflecting different design philosophies. Among them, Adaptive Dynamic Programming (ADP) and Linear Active Disturbance Rejection Control (LADRC) represent two important paradigms, i.e., performance-oriented and robustness-oriented strategies, respectively. The integration of these two approaches forms a solid theoretical foundation for constructing a new generation of intelligent and robust control architectures. From the perspective of control objectives, ADP emphasizes the optimization of control performance. At its core lies the minimization of a cost function, achieved through iterative learning to approximate the optimal control policy, allowing the system to reach the desired optimal performance. Theoretically, ADP can handle a wide range of systems, including nonlinear, time-varying, and uncertain ones. Its strength lies in unifying multiple objectives—such as control performance, energy consumption, and error minimization—within a single optimization framework. This makes ADP particularly suitable for EESM control problems, which require adaptive and dynamically optimized strategies.
In contrast, LADRC adopts a fundamentally different design philosophy. It does not aim to derive mathematically optimal control laws, but instead focuses on ensuring system stability and dynamic performance under uncertain disturbances and incomplete models. This is achieved through the use of an Extended State Observer (ESO) that estimates the “total disturbance” in real time and compensates it via feedback. A key advantage of LADRC is its minimal dependence on accurate system models. Controller design requires only the system order and a desired bandwidth. This makes LADRC highly adaptable and deployable in EESM control problems, especially those with frequent parameter variations and strong external disturbances.
The complementarity between ADP and LADRC goes beyond a simple functional combination—it reflects a deeper synergy between control philosophies and engineering implementation. ADP focuses on global performance optimization and intelligent strategy learning, while LADRC emphasizes real-time responsiveness and disturbance rejection. By integrating their strengths, this hybrid approach can overcome the limitations of using either method alone, achieving the dual goals of performance optimality and robust stability. Such a framework offers a solid theoretical foundation for next-generation EESM intelligent adaptive control systems and holds great promise for future applications in complex system control. Compared to manually tuning the bandwidth parameter, Adaptive Dynamic Programming (ADP) can automatically and iteratively design an LADRC with optimal parameters. In 1977, Werbos [] first proposed the concept of Adaptive Critic Designs (ACDs), which integrates theories such as dynamic programming, reinforcement learning, and neural networks, making it a highly valuable and applicable method. The core idea of this theory is to use function approximation structures (e.g., neural networks) to iteratively and forward-in-time approximate the Bellman optimality conditions, thereby obtaining an optimal control policy.
The structure of the adaptive critic design algorithm originates from the actor–critic framework in reinforcement learning and consists of a model network, a critic network, and an action (or actor) network. The model network is used to model the dynamic system, the critic network approximates the optimal performance index function, and the action network approximates the optimal control strategy. The combination of the critic and action networks constitutes an agent. When the agent applies an action to the dynamic system, the environment provides rewards at different stages, which are used to adjust the critic network. The agent’s task is to learn a control policy that maximizes the cumulative rewards over time. This method effectively overcomes the limitations of traditional dynamic programming, allowing for online learning without requiring a known system model. In the past decade or so, adaptive dynamic programming has become a hot topic in intelligent control and computational intelligence research. The U.S. National Science Foundation held forums on approximate dynamic programming in 2002 and 2006. The IEEE Computational Intelligence Society established a dedicated Technical Committee on Adaptive Dynamic Programming and Reinforcement Learning in 2008, and international workshops on this topic were held in 2007, 2009, and 2011. Many major journals have published special issues on adaptive dynamic programming [,,,,,,,], and important review articles include [] and [], with key monographs listed in [,,].
2. LADRC of EESM
2.1. Model Description of Electrically Excited Motor
In the drive control of the electrically excited motor, an accurate model of the control object is required. Since we focus on the dynamic response performance of the motor, the end-region iron losses are neglected. The voltage equation of the electrically excited motor is as follows:
$$\boldsymbol{u} = \boldsymbol{R}\boldsymbol{i} + \boldsymbol{\omega}\boldsymbol{\psi} + \frac{\mathrm{d}\boldsymbol{\psi}}{\mathrm{d}t}$$
in which the resistance matrix R consists of the stator resistance $R_s$ and the rotor resistance $R_f$, while the matrix ω primarily contains the electrical angular speed $\omega_e$.
The vectors in the voltage equation each consist of three components: the d-axis, q-axis, and f-axis components:
$$\boldsymbol{u} = \begin{bmatrix} u_d & u_q & u_f \end{bmatrix}^{T},\quad \boldsymbol{i} = \begin{bmatrix} i_d & i_q & i_f \end{bmatrix}^{T},\quad \boldsymbol{\psi} = \begin{bmatrix} \psi_d & \psi_q & \psi_f \end{bmatrix}^{T}$$
Ignoring the effects of magnetic saturation and motor temperature, from the definition of the inductance, the differential term of the flux linkage can be reconstructed as
$$\frac{\mathrm{d}\boldsymbol{\psi}}{\mathrm{d}t} = \boldsymbol{L}\frac{\mathrm{d}\boldsymbol{i}}{\mathrm{d}t}$$
where L is the incremental inductance matrix:
The incremental inductance matrix includes the self-inductances and the mutual inductances. The mutual inductance between the d and q axes is much smaller than the other components and can therefore be approximately ignored.
Based on the above equations, the current derivative can be derived as
$$\frac{\mathrm{d}\boldsymbol{i}}{\mathrm{d}t} = \boldsymbol{L}^{-1}\left(\boldsymbol{u} - \boldsymbol{R}\boldsymbol{i} - \boldsymbol{\omega}\boldsymbol{\psi}\right)$$
It can be seen from (7) that in the EESM model, the inputs are the voltage u and the speed ω, and the outputs are the current i and the electromagnetic torque $T_e$. At the same time, the flux linkage can be expressed by the apparent inductance as
In this paper, the interior electrically excited synchronous machine is considered; therefore $L_d \neq L_q$. Combining (8) with the definition of the apparent inductance, (7) can be rewritten as
The basics of instantaneous power theory are introduced in Appendix A. According to the instantaneous power theory [], the stator instantaneous power is calculated from the stator terminal voltage $u_{dq}$ and current $i_{dq}$:
$$p = \frac{3}{2}\left(u_d i_d + u_q i_q\right)$$
Based on (8), the motor power can be expressed in quadratic form as
Neglecting the coupling between the d and q axes, the d-axis and q-axis components can be expressed as follows:
The instantaneous torque of the motor can be derived from the instantaneous power:
Since the values of Ld and Lq are not equal and p represents the number of pole pairs, the reluctance torque component cannot be ignored, and the electromagnetic torque can be expressed as
$$T_e = \frac{3}{2}p\left[M_{af}\, i_f i_q + \left(L_d - L_q\right) i_d i_q\right]$$
As can be seen, $T_e$ consists of two components: one is the synchronous torque generated by the interaction between the excitation current and the q-axis current, and the other is the reluctance torque component.
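To make the two-component torque decomposition concrete, the following is a hedged Python sketch (the function name, and the symbol M_af for the field–armature mutual inductance, are our own; the 3/2 factor assumes the equal-amplitude dq transformation used in this paper):

```python
# Hedged sketch: dq-frame torque of a salient EESM,
# T_e = 1.5 * p * (psi_d*i_q - psi_q*i_d), with psi_d = L_d*i_d + M_af*i_f
# and psi_q = L_q*i_q (symbols assumed for illustration, not from the paper).
def electromagnetic_torque(p, M_af, L_d, L_q, i_d, i_q, i_f):
    sync = M_af * i_f * i_q          # synchronous torque: excitation x q-axis current
    rel = (L_d - L_q) * i_d * i_q    # reluctance torque from dq saliency
    return 1.5 * p * (sync + rel)
```

With $i_d = 0$ only the synchronous term survives, matching the decomposition described above.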
2.2. Problem Statements
The control architecture of the electrically excited motor is shown in Figure 1. Substituting (2), (3), and (6) into Formula (7), the stator voltage equation is obtained as
$$\begin{aligned} u_d &= R_s i_d + L_d\frac{\mathrm{d}i_d}{\mathrm{d}t} + M_{af}\frac{\mathrm{d}i_f}{\mathrm{d}t} - \omega_e L_q i_q \\ u_q &= R_s i_q + L_q\frac{\mathrm{d}i_q}{\mathrm{d}t} + \omega_e\left(L_d i_d + M_{af} i_f\right) \end{aligned}$$
Figure 1.
Control architecture diagram.
The rotor voltage equation can be expressed as
$$u_f = R_f i_f + L_f\frac{\mathrm{d}i_f}{\mathrm{d}t} + M_{af}\frac{\mathrm{d}i_d}{\mathrm{d}t}$$
Based on the voltage equations of the stator and rotor, there is strong coupling between the d-axis and the f-axis, and fluctuations of these currents will also cause fluctuations of the electromagnetic torque.
Formula (16) can be transformed into
$$\frac{\mathrm{d}i_f}{\mathrm{d}t} = \frac{1}{L_f}\left(u_f - R_f i_f - M_{af}\frac{\mathrm{d}i_d}{\mathrm{d}t}\right)$$
which is abstracted as
$$\dot{y} = f + b_0 u$$
where y and u denote the output and input of the system, respectively, and f represents the total disturbance of the system, whose lumped coefficients are unknown quantities. Combining (17) and (18) yields $b_0 = 1/L_f$. It should be noted that the parameter $b_0$ can be estimated and is mainly related to the system parameters. The above state-space equation can be expressed as
$$\dot{x}_1 = x_2 + b_0 u,\qquad \dot{x}_2 = \dot{f},\qquad y = x_1$$
among which $x_1$ indicates the excitation current and $x_2$ indicates the disturbance f. Then the linear extended state observer (LESO) can be designed as
$$\dot{z}_1 = z_2 + \beta_1\left(y - z_1\right) + b_0 u,\qquad \dot{z}_2 = \beta_2\left(y - z_1\right)$$
In order to reduce the problem to a unit-gain integral control problem subject to the disturbance estimate, the controller is designed as follows:
$$u = \frac{k_p\left(i_f^{*} - z_1\right) - z_2}{b_0}$$
where $k_p$ represents the proportional gain of the controller, $i_f^{*}$ the excitation current reference, $z_1$ the observed value of the excitation current, $y$ the sensor measurement, and $z_2$ the observed value of the perturbation.
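The LESO and the control law above can be condensed into a short discrete-time sketch (a minimal illustration, assuming the common gain parameterization $\beta_1 = 2\omega_o$, $\beta_2 = \omega_o^2$, $k_p = \omega_c$ and a forward-Euler discretization; all numeric values are illustrative, not the bench values):

```python
# Minimal sketch of a first-order LADRC loop: a discretized LESO estimates the
# output (z1) and the total disturbance (z2); the control law cancels z2.
class LADRC1:
    def __init__(self, b0, wo, wc, dt):
        self.b0, self.dt = b0, dt
        self.beta1, self.beta2 = 2.0 * wo, wo ** 2  # LESO gains from observer bandwidth
        self.kp = wc                                # controller bandwidth
        self.z1 = 0.0                               # estimate of the excitation current
        self.z2 = 0.0                               # estimate of the total disturbance f
        self.u = 0.0

    def step(self, r, y):
        e = y - self.z1
        # forward-Euler discretized LESO
        self.z1 += self.dt * (self.z2 + self.beta1 * e + self.b0 * self.u)
        self.z2 += self.dt * (self.beta2 * e)
        # disturbance-compensating control law u = (kp*(r - z1) - z2) / b0
        self.u = (self.kp * (r - self.z1) - self.z2) / self.b0
        return self.u
```

Against a toy first-order plant dy/dt = -a*y + b0*u + d, this loop drives y to the reference while z2 absorbs the unknown term -a*y + d, which is the mechanism the text describes.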
2.3. Proof of Stability
From the state space Equation (19), the state variables are taken as
$$x_1 = i_f,\qquad x_2 = f$$
The equation above can also be written in matrix form
Equation (19) can be expressed in the standard state-space form as
$$\dot{\boldsymbol{x}} = \boldsymbol{A}\boldsymbol{x} + \boldsymbol{B}u + \boldsymbol{E}\dot{f},\qquad y = \boldsymbol{C}\boldsymbol{x}$$
where the matrices and vectors A, B, C, E can be indicated as
$$\boldsymbol{A} = \begin{bmatrix} 0 & 1 \\ 0 & 0 \end{bmatrix},\quad \boldsymbol{B} = \begin{bmatrix} b_0 \\ 0 \end{bmatrix},\quad \boldsymbol{C} = \begin{bmatrix} 1 & 0 \end{bmatrix},\quad \boldsymbol{E} = \begin{bmatrix} 0 \\ 1 \end{bmatrix}$$
The state-space observer (20) can be reconstructed as
$$\dot{\boldsymbol{z}} = \boldsymbol{A}\boldsymbol{z} + \boldsymbol{B}u + \boldsymbol{L}\left(y - \boldsymbol{C}\boldsymbol{z}\right)$$
where L is the observer gain vector
$$\boldsymbol{L} = \begin{bmatrix} \beta_1 & \beta_2 \end{bmatrix}^{T}$$
Let $\boldsymbol{e} = \boldsymbol{x} - \boldsymbol{z}$; combining (25) and (26), the error dynamics can be written in the form
$$\dot{\boldsymbol{e}} = \left(\boldsymbol{A} - \boldsymbol{L}\boldsymbol{C}\right)\boldsymbol{e} + \boldsymbol{E}\dot{f}$$
where E is defined in (25), and
$$\boldsymbol{A} - \boldsymbol{L}\boldsymbol{C} = \begin{bmatrix} -\beta_1 & 1 \\ -\beta_2 & 0 \end{bmatrix}$$
The characteristic polynomial of the LESO can be expressed as
$$\lambda(s) = \det\left[s\boldsymbol{I} - \left(\boldsymbol{A} - \boldsymbol{L}\boldsymbol{C}\right)\right] = s^{2} + \beta_1 s + \beta_2$$
With the bandwidth parameterization $\beta_1 = 2\omega_o$ and $\beta_2 = \omega_o^{2}$, this reduces to $\lambda(s) = \left(s + \omega_o\right)^{2}$.
The roots of the characteristic polynomial lie in the left half of the s-plane; therefore, the LESO is BIBO stable and the estimation error of the perturbation is also bounded.
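The pole location can be checked numerically. The sketch below assumes the common bandwidth parameterization $\beta_1 = 2\omega_o$, $\beta_2 = \omega_o^2$; the value of $\omega_o$ is illustrative:

```python
import numpy as np

# Both roots of s^2 + beta1*s + beta2 sit at s = -wo when beta1 = 2*wo and
# beta2 = wo**2, i.e. strictly in the left half-plane for any wo > 0.
wo = 300.0                        # illustrative observer bandwidth, rad/s
beta1, beta2 = 2.0 * wo, wo ** 2
roots = np.roots([1.0, beta1, beta2])
print(roots)                      # both roots at (or numerically near) -wo
```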
3. Adaptive Dynamic Programming
3.1. Problem Statements
The motor system is described by the following discrete-time dynamic equation:
$$x_{k+1} = F\left(x_k, u_k, w_k\right)$$
where $x_k$ is the system state, which includes the EESM current information; $u_k$ is the LADRC action, which specifies the control voltage when the system occupies state $x_k$; and $w_k$ is the environment disturbance. Here, we denote the set of possible system states as $\mathcal{X}$ and the controller parameter space as $\mathcal{U}$. Given the current system state $x_k$ and the current action $u_k$, the next system state is determined by a probability distribution $P\left(x_{k+1} \mid x_k, u_k\right)$.
The expected total reward for initial state $x_0$ under the LADRC is defined as
$$J\left(x_0\right) = \mathbb{E}\left[\sum_{k=0}^{\infty} U\left(x_k, u_k\right)\right]$$
In (30), $U\left(x_k, u_k\right)$ is the utility function, which evaluates metrics such as the regulation time, overshoot magnitude, and fluctuation level of the excitation current. The shorter the settling time, the smaller the overshoot, and the lower the fluctuation level, the smaller the corresponding utility function and performance index function $J$.
The goal of the presented algorithm is to find the optimal LADRC to minimize the performance index function (30).
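As one concrete (assumed) form of such a utility, the sketch below scores a recorded excitation-current step response by settling time, overshoot, and steady-state fluctuation; the weights and the 2% settling band are our own choices for illustration, not values from the paper:

```python
import numpy as np

def utility(trace, ref, dt, band=0.02):
    """Cost of one step response: weighted sum of settling time, overshoot and
    steady-state fluctuation (weights and 2% band are illustrative)."""
    trace = np.asarray(trace, dtype=float)
    err = np.abs(trace - ref)
    outside = np.nonzero(err > band * abs(ref))[0]
    t_settle = (outside[-1] + 1) * dt if outside.size else 0.0  # last exit from band
    overshoot = max(trace.max() - ref, 0.0) / abs(ref)
    tail = trace[int(0.8 * len(trace)):]   # treat the last 20% as steady state
    fluctuation = tail.max() - tail.min()
    return t_settle + 10.0 * overshoot + fluctuation
```

A response that reaches the reference instantly and stays there scores zero; slower, oscillatory, or overshooting responses score higher, which is the ordering the text requires of $U$.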
3.2. ADP-Based LADRC Optimization Procedure
In this section, we develop an adaptive dynamic programming-based algorithm to obtain the optimal LADRC and the optimal performance index function (30) for the motor system.
For all $x_k \in \mathcal{X}$, let
$$V_0\left(x_k\right) = \Psi\left(x_k\right)$$
where $\Psi(\cdot)$ is an arbitrary positive semi-definite function. Then, for all $x_k \in \mathcal{X}$, the iterative LADRC is computed as
$$u_0\left(x_k\right) = \arg\min_{u_k}\left\{U\left(x_k, u_k\right) + V_0\left(x_{k+1}\right)\right\}$$
For all $i = 1, 2, \ldots$, let $V_i$ be the iterative value function that satisfies the following equation
$$V_i\left(x_k\right) = \min_{u_k}\left\{U\left(x_k, u_k\right) + V_{i-1}\left(x_{k+1}\right)\right\}$$
The iterative LADRC is computed as
$$u_i\left(x_k\right) = \arg\min_{u_k}\left\{U\left(x_k, u_k\right) + V_i\left(x_{k+1}\right)\right\}$$
The algorithm will iterate between (33) and (34).
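A toy, fully discretized version of this alternation between value update and greedy policy improvement can be sketched as follows; the tabular value function, the function names, and the small example problem are illustrative assumptions (the actual algorithm approximates the value function with a critic network):

```python
import numpy as np

# Tabular sketch of the iteration between the value update (33) and the
# greedy policy improvement (34) over a finite set of candidate actions.
def adp_iterate(states, actions, step_cost, next_state, n_iter=50):
    """step_cost(x, u): utility U(x, u); next_state(x, u): successor state index."""
    V = np.zeros(len(states))               # V_0 = 0, a positive semi-definite start
    policy = np.zeros(len(states), dtype=int)
    for _ in range(n_iter):
        # Q(x, u) = U(x, u) + V(next state): one-step lookahead cost
        Q = np.array([[step_cost(x, u) + V[next_state(x, u)] for u in actions]
                      for x in states])
        policy = Q.argmin(axis=1)           # greedy action choice, cf. (34)
        V = Q.min(axis=1)                   # value-function update, cf. (33)
    return V, policy
```

On a three-state toy problem where one action moves the state toward a zero-cost absorbing state, the iteration converges in a few sweeps to the expected values and the greedy policy.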
3.3. Algorithm Properties
Theorem 1.
For , let and be obtained by (31)–(34). Given constants , , , and that satisfy
and
respectively, for , the iterative value function satisfies
Proof.
First, we prove that
holds for .
According to (38), the left-hand side of the inequality (39) obviously holds for . Let . Based on the left-hand side of (38), it is easy to obtain
By adding to and subtracting the same term from (41), (41) can easily be transformed into
Since , (42) can be developed into
Combining similar terms of (43), we can obtain
According to the Bellman equation, (44) becomes
Assume (40) holds for . Then for , we have
By adding to and subtracting the same term from (46), (46) can easily be transformed into
Since , (47) can be developed into
Combining similar terms of (48), we can obtain
According to the Bellman equation, we obtain
The proof of
follows similar steps. The proof is completed. □
4. Experimental Results
4.1. Initial Process
To validate the effectiveness of the proposed ADP-LADRC strategy, an experimental test bench for an electrically excited synchronous machine was established. The core components of the platform include a 150 kW EESM, an inverter and a control board equipped with the Texas Instruments C2000 DSP (The EESM, inverter and control board are produced and manufactured by BYD Company in Shenzhen, China. The Texas Instruments C2000 DSP is from Texas Instruments Company in Dallas, TX, USA). The rotor current and speed were measured by a Hall-effect current sensor and a resolver, respectively. The detailed connection diagram of the system is shown in Figure 2.
Figure 2.
Experimental equipment layout.
Table 1.
AVL bench parameters.
Table 2.
EESM parameters.
To guarantee the accuracy of the results, the coefficient calibration of the system sensor devices, along with the motor parameter calibration, must be carried out; the relevant processes are listed in Table 3.
Table 3.
Process preparation.
4.2. Results
From (17) and (18), the parameter b0 can be determined from the actual measured value of the rotor's self-inductance. The initial bandwidth parameters were tuned through a series of step-response tests: starting from a conservative value to ensure stability, the bandwidth was gradually increased until a proper excitation-current response time was achieved, while carefully avoiding excessive overshoot and current chattering. The resulting initial value represents a balance between dynamic performance and robustness. To ensure the rapid convergence of the disturbance estimate and the state estimate, it is generally set that []. For a fair comparison, the PI parameters were tuned to meet the required performance. Meanwhile, the torque data recorded by the test-bench host computer are sampled at 1 kHz, which, considering the system's time constant, is sufficient to accurately track the dynamic variations in the motor torque.
To verify the effectiveness of the proposed ADP-LADRC with respect to bandwidth parameter tuning, tests were conducted at a speed of 4000 rpm. Step torque references of 50 N·m, 100 N·m, and 300 N·m were applied via the test bench host computer. The initial values of the LADRC-related bandwidth parameters were set as . It should be noted that the excitation current was set to 4 A by default after the PWM was enabled. In Figure 3, the performance comparison between the initial parameter values and the ADP-LADRC iteratively optimized bandwidth parameters at 4000 rpm is shown.
Figure 3.
ADP-LADRC iteration. (a) The current tracking under the initial and the first iterative ADP-LADRC; (b) The current tracking under the first and second iterative ADP-LADRC; (c) the current tracking under the second iterative and converged ADP-LADRC.
Figure 3a–c show the dynamic behaviors of the excitation current feedback during the ADP-LADRC iteration process. It can be observed that when using only the initial bandwidth parameters, the excitation current exhibits large steady-state fluctuations, and the feedback current is significantly affected by noise. With successive iterations of the ADP-LADRC algorithm, the steady-state fluctuations gradually decrease, and the final steady-state error is reduced to within 0.2 A.
To compare the torque dynamic responses of the ADP-LADRC and PI control algorithms, step torque commands ranging from 50 N·m to 100 N·m were applied at speeds between 1000 rpm and 4000 rpm. Due to mechanical friction and leakage effects of the test bench, there is an approximate 1.2–1.8 N·m deviation between the commanded and actual torque. The torque data were recorded by the test bench at 10 Hz. As shown in Figure 4a–c, the ADP-LADRC outperforms the PI controller in terms of torque response time, overshoot suppression, and the magnitude of steady-state fluctuations.
Figure 4.
Torque dynamic response comparison of ADP-LADRC and PI. (a) Dynamic response under speed of 1000 rpm and torque command of 50/100 N·m; (b) dynamic response under speed of 2000 rpm and torque command of 50/100 N·m; (c) dynamic response under speed of 3000 rpm and torque command of 50/100 N·m; (d) dynamic response under speed of 4000 rpm and torque command of 50/100 N·m.
In order to compare the anti-disturbance performance of the ADP-LADRC with that of the PI controller, we set the motor speed to 4000 rpm and the target excitation current to 8.5 A. Subsequently, a step torque command of 100 N·m was applied and held for a period of time; the command was then immediately reset. As shown in Figure 5, under the same conditions, the overshoots of ADP-LADRC and PI are 0.2% and 8%, respectively. The current fluctuation amplitudes of ADP-LADRC and PI were 2.86 A and 1.26 A, respectively. Furthermore, the disturbance regulation time of ADP-LADRC was approximately 905 ms shorter than that of PI control.
Figure 5.
Anti-disturbance performance comparison of ADP-LADRC and PI.
5. Discussion
The core findings of this study indicate that the control strategy combining linear active disturbance rejection control (LADRC) with adaptive dynamic programming (ADP) can effectively suppress the interference of the stator d-axis component on the excitation current control of electrically excited motors. Compared with traditional PID control and fixed-parameter LADRC, the proposed ADP-LADRC shows significant advantages in dynamic response speed and disturbance rejection. More importantly, the introduction of the ADP algorithm solves the parameter-tuning problem in the application of LADRC. The bandwidth parameters of traditional LADRC usually rely on expert experience or trial and error, making it difficult to achieve the best performance. In this study, the ADP framework, through the interactive learning of the evaluation (critic) network and the execution (action) network, optimizes the control parameters of the LADRC online, enabling it to adapt to different operating conditions. Since the LESO converts the first-order system into a cascaded-integrator form, only two discretized difference equations are needed to implement the ADP-LADRC on the DSP; the computational cost is therefore extremely low and fixed. MPC, in contrast, uses the system model to predict states and inputs over the next N steps, so its computational load depends strongly on the prediction horizon N and the problem scale; moreover, the convergence time of the iterative optimization causes fluctuations in computation time. Therefore, LADRC provides better real-time performance at lower computational cost.
However, this study has some limitations, which can serve as directions for further exploration in the future. The learning rate and structure of the ADP neural network still need to be manually set at present. In the future, research can be conducted on its adaptive adjustment strategies to further improve the convergence speed. Additionally, it can be considered to combine the ideas of predictive control with the existing framework to cope with more stringent constraint conditions.
6. Conclusions
This paper addresses the dynamic response and disturbance rejection challenges in the rotor current control of electrically excited motors and proposes an ADP-optimized Linear Active Disturbance Rejection Control (ADP-LADRC) strategy. By employing an Extended State Observer (ESO) for real-time estimation of internal and external disturbances, together with effective suppression of unmodeled dynamics and high-frequency noise, the electromagnetic torque response speed is increased by approximately 50% and the steady-state current fluctuation amplitude is reduced by 10.2%, meeting the requirements of high-dynamic operating conditions and improving torque output smoothness. The linearized error-feedback design avoids the overshoot caused by integral saturation in traditional PI controllers, enabling the rotor current to converge rapidly without overshoot under step commands and thereby enhancing system stability. Furthermore, to address the difficulty of tuning the LADRC bandwidth parameters, the ADP algorithm is introduced to construct a critic–action dual-network structure, using the system tracking error and control energy consumption as the cost function to iteratively approximate the optimal control law online. The ADP method can complete global parameter optimization within 0.5 s, improving tuning efficiency by more than 90% compared with conventional trial-and-error tuning.
Author Contributions
Conceptualization, H.L.; Formal analysis, J.Z.; Data curation, H.P.; Writing—original draft, H.L.; Writing—review & editing, J.Z. and H.P.; Visualization, H.L. and H.P. All authors have read and agreed to the published version of the manuscript.
Funding
The authors gratefully thank the anonymous reviewers for their valuable comments, as well as all the authors listed in the references. This work was supported by the National Key Research and Development Program of China (No. 2024YFB2505100).
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Research data is unavailable due to privacy restrictions.
Conflicts of Interest
Authors Heping Ling and Hua Pan were employed by the BYD Company. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Appendix A
Instantaneous Power Definition
When converting from the three-phase ABC stationary frame to a two-phase stationary frame, considering the equal-amplitude transformation, according to the theory in [], the instantaneous power can be expressed in the following complex form
where p and q, respectively, represent the three-phase instantaneous active power and the three-phase instantaneous reactive power.
Converting the above equation to the dq coordinate system
Equation (10) is obtained.
References
- Petit, Y.L. Electric Vehicle Life Cycle Analysis and Raw Material Availability. Transp. Environ. 2017. Available online: https://www.transportenvironment.org/articles (accessed on 26 October 2017).
- Widmer, J.D.; Martin, R.; Kimiabeigi, M. Electric Vehicle Traction Motors without Rare Earth Magnets. Sustain. Mater. Technol. 2015, 3, 7–13. [Google Scholar] [CrossRef]
- Zhang, X.; Li, Z. Sliding-mode observer-based mechanical parameter estimation for permanent magnet synchronous motor. IEEE Trans. Power Electron. 2015, 31, 5732–5745. [Google Scholar] [CrossRef]
- Liang, D.; Li, J.; Qu, R.; Kong, W. Adaptive second-order sliding-mode observer for PMSM sensorless control considering VSI nonlinearity. IEEE Trans. Power Electron. 2017, 33, 8994–9004. [Google Scholar] [CrossRef]
- Borhan, H.; Vahidi, A.; Phillips, A.M.; Kuang, M.L.; Kolmanovsky, I.V.; Di Cairano, S. MPC-Based Energy Management of a Power-Split Hybrid Electric Vehicle. IEEE Trans. Control Syst. Technol. 2012, 20, 593–603. [Google Scholar] [CrossRef]
- Xue, W.; Huang, Y. On frequency-domain analysis of ADRC for uncertain system. In Proceedings of the 2013 American Control Conference, Washington, DC, USA, 17–19 June 2013; IEEE: New York, NY, USA, 2013; pp. 6637–6642. [Google Scholar]
- Gao, Z. Scaling and bandwidth-parameterization based controller tuning. In Proceedings of the American Control Conference, Denver, CO, USA, 4–6 June 2003; IEEE: New York, NY, USA, 2003; Volume 6, pp. 4989–4996. [Google Scholar]
- Wang, G.; Liu, R.; Zhao, N.; Ding, D.; Xu, D. Enhanced linear ADRC strategy for HF pulse voltage signal injection-based sensorless IPMSM drives. IEEE Trans. Power Electron. 2018, 34, 514–525. [Google Scholar] [CrossRef]
- Werbos, P.J. Advanced forecasting methods for global crisis warning and models of intelligence. Gen. Syst. Yearb. 1977, 22, 25–38. [Google Scholar]
- Werbos, P.J. Approximate dynamic programming for real-time control and neural modeling. In Handbook of Intelligent Control: Neural, Fuzzy and Adaptive Approaches; White, D.A., Sofge, D.A., Eds.; Van Nostrand: New York, NY, USA, 1992; Chapter. 13. [Google Scholar]
- Murray, J.J.; Cox, C.J.; Lendaris, G.G.; Saeks, R. Adaptive dynamic programming. IEEE Trans. Syst. Man Cybern. C Appl. Rev. 2002, 32, 140–153. [Google Scholar] [CrossRef]
- Saeks, R.E.; Cox, C.J.; Mathia, K.; Maren, A.J. Asymptotic dynamic programming: Preliminary concepts and results. In Proceedings of the International Conference on Neural Networks (ICNN’97), Houston, TX, USA, 12 June 1997; pp. 2273–2278. [Google Scholar]
- Bertsekas, D.P.; Tsitsiklis, J.N. Neuro-Dynamic Programming; Athena Scientific: Belmont, MA, USA, 1996. [Google Scholar]
- Enns, R.; Si, J. Helicopter trimming and tracking control using direct neural dynamic programming. IEEE Trans. Neural Netw. 2003, 14, 929–939. [Google Scholar] [PubMed]
- Lewis, F.L.; Huang, J.; Parisini, T.; Prokhorov, D.V.; Wunsch, D.C. Special Issue on neural networks for feedback control systems. IEEE Trans. Neural Netw. 2007, 18, 969–972. [Google Scholar] [CrossRef] [PubMed]
- Lewis, F.L.; Lendaris, G.; Liu, D. Special issue on approximate dynamic programming and reinforcement learning for feedback control. IEEE Trans. Syst. Man. Cybern. B Cybern. 2008, 38, 896–897. [Google Scholar] [CrossRef]
- Ferrari, S.; Jagannathan, S.; Lewis, F.L. Special issue on approximate dynamic programming and reinforcement learning. J. Control Theory Appl. 2011, 9, 309. [Google Scholar] [CrossRef]
- Lewis, F.L.; Vrabie, D. Reinforcement learning and adaptive dynamic programming for feedback control. IEEE Circuits Syst. Mag. 2009, 9, 32–50. [Google Scholar] [CrossRef]
- Wang, F.Y.; Zhang, H.; Liu, D. Adaptive dynamic programming: An introduction. IEEE Comput. Intell. Mag. 2009, 4, 39–47. [Google Scholar] [CrossRef]
- Sutton, R.S.; Barto, A.G. Reinforcement Learning—An Introduction; MIT Press: Cambridge, MA, USA, 1998. [Google Scholar]
- Si, J.; Barto, A.; Powel, W.; Wunsch, D. Handbook of Learning and Approximate Dynamic Programming; IEEE: Piscataway, NJ, USA, 2004. [Google Scholar]
- Lewis, F.L.; Liu, D. Approximate Dynamic Programming and Reinforcement Learning for Feedback Control; Wiley: Hoboken, NJ, USA, 2012. [Google Scholar]
- Akagi, H.; Watanabe, E.H.; Aredes, M. The Instantaneous Power Theory. In Instantaneous Power Theory and Applications to Power Conditioning; Wiley-IEEE Press: Tokyo, Japan; Rio de Janeiro, Brazil, 2007; pp. 41–107. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).