Data-Driven Robust Kalman Filter-Based Fault Detection for Traction Drive Systems

Fu, Caixin; Jiang, Changhong; Wan, Zhiwei; Cheng, Peng; Wang, Shenquan

doi:10.3390/machines14050465

by Caixin Fu ¹, Changhong Jiang ², Zhiwei Wan ²,*, Peng Cheng ³ and Shenquan Wang ²

¹ School of Mechanical and Electrical Engineering, Changchun University of Technology, Changchun 130012, China

² College of Electrical and Electronic Engineering, Changchun University of Technology, Changchun 130012, China

³ CRRC Changchun Railway Vehicles Co., Ltd., Changchun 130011, China

* Author to whom correspondence should be addressed.

Machines 2026, 14(5), 465; https://doi.org/10.3390/machines14050465

Submission received: 19 March 2026 / Revised: 10 April 2026 / Accepted: 19 April 2026 / Published: 22 April 2026

(This article belongs to the Section Machines Testing and Maintenance)

Abstract

This article addresses the fault detection (FD) problem for traction drive systems in the presence of unknown noise covariances. The dynamic behavior of the traction drive system, affected by actuator and sensor faults, is first formulated. Following the philosophy of subspace identification, the system matrices are identified directly from collected process data using QR decomposition and singular value decomposition. Based on the identified model, a robust Kalman filter (KF)-based FD scheme is developed. By exploiting the iterative interaction between the estimator and measurement data within the KF framework, the noise covariance matrices are adaptively estimated, which alleviates the adverse effects caused by empirical covariance selection in conventional KF-based FD methods. Experimental results obtained from a real traction drive system verify the effectiveness and reliability of the proposed approach.

Keywords: data-driven; robust KF; subspace identification; fault detection; traction drive systems

1. Introduction

Ensuring the safe and reliable operation of industrial systems has become increasingly important with the rapid development of automation and intelligent technologies. Traction drive systems play a key role by transforming electrical energy into mechanical motion in industrial applications. The reliability of these systems is therefore of great importance, which has motivated extensive research on fault detection (FD) techniques for traction drive systems [1,2,3]. Advances in modern control theory have contributed to the design and implementation of model-based FD approaches in traction drive systems [4,5,6].

Model-based FD methods are attractive because of their high sensitivity and capability for early detection of system anomalies [7]. Depending on the structure of the residual generator, model-based FD methods for traction drive systems are typically classified into fault detection filter-based approaches and diagnostic observer-based approaches [8]. In parallel with these developments, recent studies, such as those by Cheng et al. [9], Sun et al. [10], and Xia et al. [11], have explored data-driven and model–data fusion approaches for FD in traction drive systems. Despite the diversity of these methodologies, accurate characterization of system dynamics and fault-induced residual variations remains crucial for reliable monitoring. In this context, fault detection filters are particularly appealing, as they explicitly account for state evolution and measurement updates in the residual generation. Among them, Kalman filter (KF)-based approaches are especially attractive due to the recursive estimation structure, which is well suited for dynamic systems operating under stochastic noises.

In recent years, KF-based FD methods and their variants have emerged as the dominant approach among fault detection filter-based techniques for traction drive systems [12,13,14]. Cheng et al. [12] developed a sigma-mixed unscented KF-based FD algorithm for addressing performance degradation. The approach constructs a mixture distribution using sigma points in the unscented KF framework and incorporates a Lévy process with jump characteristics to model degradation dynamics. A moving average interstate standard deviation index is finally developed for FD purposes. Foo et al. [13] introduced an extended KF-based approach for sensor FD and isolation, which ensures fault-resilient control when sensor faults occur. For stator inter-turn faults, Namdar et al. [14] proposed a KF-based FD method. In this approach, the KF is employed to extract signal features, and the standard deviation of the extracted signatures is subsequently used as an indicator for FD. Miniach et al. [15] developed a current sensor fault-tolerant control scheme based on an extended KF to achieve detection and compensation of current sensor faults in induction motor drives.

It should be noted that the practical deployment of the aforementioned methods still faces several challenges. First, most of these methods rely on prior knowledge of the system matrices. However, due to the complex internal structure of traction drive systems, constructing an accurate mathematical model is often difficult in practice, and the required system matrices are typically unavailable. As a result, the applicability of KF-based FD methods in real monitoring scenarios is limited. Second, the process and measurement noise covariance matrices in most existing FD approaches based on the KF and its variants are usually specified from prior knowledge or engineering experience. In practical operating environments, however, the actual noise statistics are generally unknown, which may lead to a mismatch between the assumed and actual noise characteristics during the iterative estimation process, thereby degrading the FD performance. Therefore, a key challenge lies not only in implementing KF-based FD for traction drive systems, but also in ensuring its effectiveness when neither the system matrices nor the noise statistics are accurately known a priori.

Although traction drive systems can be modeled in state-space form [8], the system matrices are often unavailable in practice. To address this issue, a subspace identification approach [16,17] is employed to construct the state-space model directly from measured input–output (I/O) data. The observability matrix is estimated via QR decomposition and singular value decomposition (SVD), from which the system matrices are identified. This enables FD approaches based on the KF and its variants to be implemented in a fully data-driven manner while retaining their capability for dynamic modeling and recursive estimation, without relying on prior model knowledge. Furthermore, to address the case where the process and measurement noise covariance matrices are unknown in traction drive systems, the proposed approach is inspired by the principle of iterative generalized least squares estimation [18]. The noise statistics are learned through iterative interactions between the estimator and the measurements, enabling reliable estimation of the noise covariance matrices. This mitigates the adverse impact on detection performance caused by relying on a priori noise covariance matrices that deviate from the true noise characteristics. The main contributions of this work are outlined as follows.

1. A data-driven KF-based approach is developed for FD of traction drive systems, which does not require a first-principles system model.
2. An iterative scheme is proposed to estimate the covariance matrices of noises from measured and estimated data, mitigating the adverse effects caused by mismatches between a priori covariance assumptions and the true noise statistics.
3. The proposed method accounts for the dynamic characteristics of the systems while maintaining satisfactory monitoring results.

The remainder of this study is organized as follows: Section 2 presents preliminaries, including a brief introduction to traction drive systems, the KF, and the problem formulation. A detailed description of the proposed FD approach is provided in Section 3. Section 4 presents the corresponding experimental study on a traction drive system. The study concludes with a summary in Section 5.

Notations: All notations used in this paper follow standard conventions. $R^{k}$ represents the k-dimensional Euclidean space. The symbol ${[\cdot]}^{-}$ denotes the pseudo-inverse of the matrix $[\cdot]$. For a vector $ω$, $ω_{s}(k) = {[ω^{T}(k - s) \;\; ω^{T}(k - s + 1) \;\; \dots \;\; ω^{T}(k)]}^{T}$, $Ω_{k} = [ω(k - s) \;\; ω(k - s + 1) \;\; \dots \;\; ω(k - s + N - 1)]$, and $Ω_{k, s} = [ω_{s}(k) \;\; ω_{s}(k + 1) \;\; \dots \;\; ω_{s}(k + N - 1)]$, where N and s are positive integers. The symbol $\hat{ξ}$ denotes the estimate of $ξ$. The operator $\mathrm{vec}([\cdot])$ denotes the vectorization of the matrix $[\cdot]$, and $\otimes$ denotes the Kronecker product.
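The stacking notation can be made concrete with a short NumPy sketch. It assumes the signal is stored as an array whose columns are time samples; the function names `stacked_vector` and `block_hankel` are illustrative and do not come from the paper.

```python
import numpy as np

def stacked_vector(omega, k, s):
    """omega_s(k): stack omega(k-s), ..., omega(k) into one long vector."""
    return np.concatenate([omega[:, i] for i in range(k - s, k + 1)])

def block_hankel(omega, k, s, N):
    """Omega_{k,s} = [omega_s(k), omega_s(k+1), ..., omega_s(k+N-1)]."""
    return np.column_stack([stacked_vector(omega, k + j, s) for j in range(N)])

# Toy signal (assumption): 2 channels, 10 samples, column i holds omega(i).
omega = np.arange(20, dtype=float).reshape(2, 10)
Omega = block_hankel(omega, k=3, s=2, N=4)
print(Omega.shape)  # (s + 1) * channels rows, N columns
```

Each column of `Omega` is one window of s + 1 consecutive samples, and consecutive columns shift that window by one sample.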

2. Preliminaries and Problem Formulation

This section first presents the traction drive system along with its general mathematical model, followed by an introduction to the KF. Based on this foundation, the FD problem addressed in this work is formulated.

2.1. Traction Drive Systems

Traction drive systems serve as fundamental devices for converting electrical energy into mechanical motion, delivering the required traction to the equipment in a controlled manner. The system includes a DC-link, a traction inverter, a traction motor, and a traction control unit, as illustrated in Figure 1.

A variety of sensors, such as those measuring speed, current, and voltage, are incorporated to provide data for control and monitoring purposes. A total of seven sensors are used to collect operational data, as summarized in Table 1. Figure 1 further emphasizes the pivotal role of the traction control unit in maintaining safe and stable system operation.

Effective FD requires first representing the traction drive system in a state-space model. Around a fixed operating point, the system can be described by a generalized linear time-invariant model that incorporates fault-related effects [1]:

$$\begin{aligned} x(k + 1) &= A x(k) + B u(k) + E_{f} f(k) + w(k) \\ y(k) &= C x(k) + D u(k) + F_{f} f(k) + v(k) \end{aligned} \tag{1}$$

where $f(k) \in R^{k_{f}}$, $u(k) \in R^{k_{u}}$, $x(k) \in R^{k_{x}}$, and $y(k) \in R^{k_{y}}$ represent the actuator or sensor faults, system inputs, state variables, and process outputs, respectively. Fault matrices $E_{f}$ and $F_{f}$ have compatible dimensions. The system matrices $A$, $B$, $C$, and $D$ are structured to satisfy the dimensional requirements of the state-space representation. $v(k) \in R^{k_{y}}$ and $w(k) \in R^{k_{x}}$ represent measurement and process noises, respectively, which are assumed to be statistically independent of $u(k)$ and the initial state $x(0)$ and satisfy

$$E\left(\begin{bmatrix} w(i) \\ v(i) \end{bmatrix} \begin{bmatrix} w(j) \\ v(j) \end{bmatrix}^{T}\right) = \begin{bmatrix} Σ_{w} δ_{ij} & 0 \\ 0 & Σ_{v} δ_{ij} \end{bmatrix}, \quad δ_{ij} = \begin{cases} 0, & i \neq j \\ 1, & i = j. \end{cases}$$

For designing data-driven FD schemes, an I/O model reflecting the relation between system inputs and outputs is essential. By iteratively expanding (1), the associated I/O data model can be systematically represented as [19]:

$$Y_{k, s} = Γ_{s} X_{k} + H_{u, s} U_{k, s} + H_{f, s} F_{k, s} + H_{w, s} W_{k, s} + V_{k, s} \tag{2}$$

with $Y_{k, s} \in R^{(s + 1) k_{y} \times N}$ and

$$Γ_{s} = \begin{bmatrix} C \\ C A \\ \vdots \\ C A^{s} \end{bmatrix}, \quad H_{u, s} = \begin{bmatrix} D & 0 & \dots & 0 \\ C B & D & \dots & 0 \\ \vdots & \ddots & \ddots & \vdots \\ C A^{s - 1} B & \dots & C B & D \end{bmatrix},$$

$$H_{w, s} = \begin{bmatrix} 0 & 0 & \dots & 0 \\ C & 0 & \dots & 0 \\ \vdots & \ddots & \ddots & \vdots \\ C A^{s - 1} & \dots & C & 0 \end{bmatrix}, \quad H_{f, s} = \begin{bmatrix} F_{f} & 0 & \dots & 0 \\ C E_{f} & F_{f} & \dots & 0 \\ \vdots & \ddots & \ddots & \vdots \\ C A^{s - 1} E_{f} & \dots & C E_{f} & F_{f} \end{bmatrix},$$

in which $H_{w, s} \in R^{(s + 1) k_{y} \times (s + 1) k_{x}}$, $H_{f, s} \in R^{(s + 1) k_{y} \times (s + 1) k_{f}}$, $Γ_{s} \in R^{(s + 1) k_{y} \times k_{x}}$, and $H_{u, s} \in R^{(s + 1) k_{y} \times (s + 1) k_{u}}$.
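As a sanity check on the structure of $Γ_{s}$ and $H_{u, s}$, the sketch below builds both matrices for a toy second-order system and confirms that one column of the I/O model (2) reproduces a direct simulation of (1) in the fault-free, noise-free case. The numerical values and the helper names are assumptions chosen for illustration only.

```python
import numpy as np

def extended_observability(A, C, s):
    """Gamma_s = [C; CA; ...; CA^s]."""
    blocks, M = [], C.copy()
    for _ in range(s + 1):
        blocks.append(M)
        M = M @ A
    return np.vstack(blocks)

def input_toeplitz(A, B, C, D, s):
    """H_{u,s}: D on the block diagonal, C A^(i-j-1) B below it, zeros above."""
    ky, ku = D.shape
    H = np.zeros(((s + 1) * ky, (s + 1) * ku))
    for i in range(s + 1):
        for j in range(i + 1):
            blk = D if i == j else C @ np.linalg.matrix_power(A, i - j - 1) @ B
            H[i * ky:(i + 1) * ky, j * ku:(j + 1) * ku] = blk
    return H

# Toy system (illustrative values, not identified from any real drive).
rng = np.random.default_rng(0)
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[1.0], [0.5]])
C = np.array([[1.0, 0.0]])
D = np.array([[0.2]])
s = 3

# Simulate s + 1 noise-free, fault-free steps starting from x0.
x0 = rng.standard_normal(2)
u = rng.standard_normal((s + 1, 1))
x, ys = x0.copy(), []
for t in range(s + 1):
    ys.append(C @ x + D @ u[t])
    x = A @ x + B @ u[t]
y_stack = np.concatenate(ys)

# One column of (2): stacked outputs = Gamma_s x0 + H_{u,s} (stacked inputs).
y_model = extended_observability(A, C, s) @ x0 + input_toeplitz(A, B, C, D, s) @ u.reshape(-1)
print(np.allclose(y_stack, y_model))
```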

2.2. Kalman Filter

For the fault-free case of system (1), the KF is adopted in an innovation-based predictor form. The innovation is defined as

$$r(k) = y(k) - C \hat{x}(k) - D u(k) = y(k) - \hat{y}(k), \tag{3}$$

and its covariance matrix is denoted by

$$Σ_{r, k} = E[r(k) r^{T}(k)] = Σ_{v} + C P_{k} C^{T}, \tag{4}$$

where $P_{k} = E[(x(k) - \hat{x}(k)) {(x(k) - \hat{x}(k))}^{T}]$ denotes the state estimation error covariance matrix, and $Σ_{v}$ is the measurement noise covariance matrix. Then, the KF gain is calculated by

$$K_{k} = A P_{k} C^{T} Σ_{r, k}^{-1}, \tag{5}$$

and the state estimation error covariance and state estimate are recursively updated as

$$\begin{aligned} P_{k + 1} &= A P_{k} A^{T} + Σ_{w} - K_{k} Σ_{r, k} K_{k}^{T}, \\ \hat{x}(k + 1) &= A \hat{x}(k) + B u(k) + K_{k} r(k), \end{aligned} \tag{6}$$

where $Σ_{w}$ denotes the process noise covariance matrix. The innovation sequence $r(k)$ represents the deviation between the actual measurement and the prediction, which can be employed as a residual signal for FD.
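The recursion (3)–(6) translates almost line by line into code. The following is a minimal sketch on a toy system, with all numerical values chosen purely for illustration; the function name `kf_predictor_step` is hypothetical.

```python
import numpy as np

def kf_predictor_step(x_hat, P, u, y, A, B, C, D, Sigma_w, Sigma_v):
    """One pass through (3)-(6) in predictor form."""
    r = y - C @ x_hat - D @ u                            # innovation, Eq. (3)
    Sigma_r = Sigma_v + C @ P @ C.T                      # innovation covariance, Eq. (4)
    K = A @ P @ C.T @ np.linalg.inv(Sigma_r)             # gain, Eq. (5)
    P_next = A @ P @ A.T + Sigma_w - K @ Sigma_r @ K.T   # covariance update, Eq. (6)
    x_next = A @ x_hat + B @ u + K @ r                   # state update, Eq. (6)
    return x_next, P_next, r, Sigma_r

# Toy second-order system (illustrative values only).
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[1.0], [0.5]])
C = np.array([[1.0, 0.0]])
D = np.array([[0.0]])
Sigma_w, Sigma_v = 0.01 * np.eye(2), np.array([[0.04]])

rng = np.random.default_rng(0)
x, x_hat, P = np.zeros(2), np.zeros(2), np.eye(2)
residuals = []
for k in range(300):
    u = np.array([np.sin(0.05 * k)])
    y = C @ x + D @ u + rng.normal(0.0, 0.2, size=1)     # std 0.2 matches Sigma_v
    P_prev = P
    x_hat, P, r, Sigma_r = kf_predictor_step(x_hat, P, u, y, A, B, C, D, Sigma_w, Sigma_v)
    residuals.append(r)
    x = A @ x + B @ u + rng.normal(0.0, 0.1, size=2)     # std 0.1 matches Sigma_w
residuals = np.array(residuals)
```

In this toy run the covariance $P_{k}$ settles to a steady-state value after the initial transient, so the innovation covariance $Σ_{r, k}$ also becomes constant.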

2.3. Problem Formulation

In practical operation of traction drive systems, the exact covariance matrices of the noises are often unknown, and the system parameters in the state-space representation (1) are difficult to accurately determine [8]. Consequently, conventional KF-based FD methods become less applicable. This motivates the development of a robust data-driven scheme that mitigates the impact of mismatches between assumed and true noise covariance matrices during iterative estimation, while avoiding complex system modeling. To achieve FD, the residual vector can be constructed based on (3) as

$$r(k) = y(k) - \hat{y}(k). \tag{7}$$

According to (3)–(6), the residual generation process requires not only the process I/O data but also the knowledge of system matrices A, B, C, and D, as well as the noise covariance matrices $Σ_{w}$ and $Σ_{v}$. However, due to the complexity of system modeling and the lack of accurate noise statistical information in practice, these quantities are often unavailable. Consequently, residual generation relying on parameters specified from prior knowledge may lead to unsatisfactory detection performance.

To achieve reliable FD, the proposed framework aims to identify system matrices and estimate noise covariance matrices directly from process data. Therefore, the objective of this work is to design a data-driven method capable of learning system parameters and noise statistics from I/O measurements.

3. Proposed FD Method

Since the state vector $x(k)$ is typically unavailable in practice, the I/O data model (2) cannot be identified and employed directly for generating residual signals. Consequently, the reliance on the state variable $x(k)$ needs to be eliminated. Considering the innovation sequence $e(k) = y(k) - \hat{y}(k)$ and the gain matrix $K$, the I/O relationship of the process can alternatively be described by [20]

$$\begin{aligned} \hat{x}(k + 1) &= A \hat{x}(k) + B u(k) + K e(k) = A_{K} \hat{x}(k) + B_{K} u(k) + K y(k), \\ \hat{y}(k) &= C \hat{x}(k) + D u(k), \quad B_{K} = B - K D, \quad A_{K} = A - K C. \end{aligned} \tag{8}$$

According to (8), the following can be obtained

$$\hat{x}(k) = A_{K}^{s_{p}} \hat{x}(k - s_{p}) + \sum_{i = 1}^{s_{p}} A_{K}^{i - 1} [B_{K} \;\; K] \begin{bmatrix} u(k - i) \\ y(k - i) \end{bmatrix}. \tag{9}$$

Given that all eigenvalues of $A_{K}$ lie strictly within the unit circle, it follows that $A_{K}^{s_{p}} \to 0$ for a sufficiently large positive integer $s_{p}$. This leads to

$$\hat{x}(k) \approx \sum_{i = 1}^{s_{p}} A_{K}^{i - 1} [B_{K} \;\; K] \begin{bmatrix} u(k - i) \\ y(k - i) \end{bmatrix}. \tag{10}$$

This implies that the state vector $x(k)$ can be inferred from historical I/O data. Therefore, in the absence of faults, the model (2) can be equivalently reformulated as

$$Y_{k, s} \approx Γ_{s} L_{p} Z_{p} + H_{u, s} U_{k, s} + H_{w, s} W_{k, s} + V_{k, s} \tag{11}$$

where

$$\begin{aligned} L_{p} &= [A_{K}^{s_{p} - 1} B_{K} \;\; \dots \;\; B_{K} \;\;\; A_{K}^{s_{p} - 1} K \;\; \dots \;\; K], \\ Z_{p} &= \begin{bmatrix} U_{k - s_{p}, s_{p} - 1} \\ Y_{k - s_{p}, s_{p} - 1} \end{bmatrix}. \end{aligned}$$
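The step from (9) to (10) drops the term $A_{K}^{s_{p}} \hat{x}(k - s_{p})$. A small numerical sketch shows how quickly that term vanishes for a stable $A_{K}$. The matrices below are arbitrary illustrative values, and the input and output sequences are random stand-ins, since the identity (9) is purely algebraic and holds for any sequences.

```python
import numpy as np

def state_from_past_io(A_K, B_K, K, u_hist, y_hist):
    """Eq. (10): u_hist[i-1] = u(k-i) and y_hist[i-1] = y(k-i) for i = 1..s_p."""
    x = np.zeros(A_K.shape[0])
    M = np.eye(A_K.shape[0])              # holds A_K^(i-1)
    for u_i, y_i in zip(u_hist, y_hist):
        x += M @ (B_K @ u_i + K @ y_i)
        M = M @ A_K
    return x

rng = np.random.default_rng(1)
A_K = np.array([[0.6, 0.2], [0.0, 0.5]])  # eigenvalues 0.6 and 0.5, inside the unit circle
B_K = np.array([[1.0], [0.3]])
K = np.array([[0.4], [0.1]])
T = 60
u = rng.standard_normal((T, 1))
y = rng.standard_normal((T, 1))

# Run the recursion (8) from a nonzero initial estimate.
x_hat = [rng.standard_normal(2)]
for t in range(T):
    x_hat.append(A_K @ x_hat[t] + B_K @ u[t] + K @ y[t])

def truncation_error(s_p):
    u_hist = [u[T - i] for i in range(1, s_p + 1)]
    y_hist = [y[T - i] for i in range(1, s_p + 1)]
    return np.linalg.norm(x_hat[T] - state_from_past_io(A_K, B_K, K, u_hist, y_hist))

errors = {s_p: truncation_error(s_p) for s_p in (5, 20, 40)}
print(errors)
```

The error equals the norm of the neglected term, so it shrinks geometrically as the past horizon $s_{p}$ grows.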

Applying a QR decomposition gives

$$\begin{bmatrix} Z_{p} \\ U_{k, s} \\ Y_{k, s} \end{bmatrix} = \begin{bmatrix} R_{11} & 0 & 0 \\ R_{21} & R_{22} & 0 \\ R_{31} & R_{32} & R_{33} \end{bmatrix} \begin{bmatrix} Q_{1} \\ Q_{2} \\ Q_{3} \end{bmatrix}, \tag{12}$$

from which one can obtain

$$\begin{bmatrix} Z_{p} \\ U_{k, s} \end{bmatrix} = \begin{bmatrix} R_{11} & 0 \\ R_{21} & R_{22} \end{bmatrix} \begin{bmatrix} Q_{1} \\ Q_{2} \end{bmatrix}, \tag{13}$$

$$Y_{k, s} = [R_{31} \;\; R_{32}] \begin{bmatrix} Q_{1} \\ Q_{2} \end{bmatrix} + R_{33} Q_{3}. \tag{14}$$

Specifically, the I/O data model (11) can be further expressed as

$$Y_{k, s} = [Γ_{s} L_{p} \;\; H_{u, s}] \begin{bmatrix} Z_{p} \\ U_{k, s} \end{bmatrix} + H_{w, s} W_{k, s} + V_{k, s}. \tag{15}$$

From (13)–(15), the following equivalent form can be obtained

$$\begin{aligned} & [Γ_{s} L_{p} \;\; H_{u, s}] \begin{bmatrix} R_{11} & 0 \\ R_{21} & R_{22} \end{bmatrix} \begin{bmatrix} Q_{1} \\ Q_{2} \end{bmatrix} + H_{w, s} W_{k, s} + V_{k, s} = [R_{31} \;\; R_{32}] \begin{bmatrix} Q_{1} \\ Q_{2} \end{bmatrix} + R_{33} Q_{3} \\ \Rightarrow{} & \begin{cases} [Γ_{s} L_{p} \;\; H_{u, s}] \begin{bmatrix} R_{11} & 0 \\ R_{21} & R_{22} \end{bmatrix} \triangleq [R_{31} \;\; R_{32}], \\ H_{w, s} W_{k, s} + V_{k, s} \triangleq R_{33} Q_{3}. \end{cases} \end{aligned} \tag{16}$$

In line with subspace identification methods [21], and without loss of generality, the following result can be obtained

$$\mathrm{rank}\left(\begin{bmatrix} Z_{p} \\ U_{k, s} \end{bmatrix}\right) = \text{number of rows} \;\Rightarrow\; \begin{bmatrix} R_{11} & 0 \\ R_{21} & R_{22} \end{bmatrix} \text{ is of full row rank}.$$

Therefore, one can obtain

$$[Γ_{s} L_{p} \;\; H_{u, s}] = [R_{31} \;\; R_{32}] \begin{bmatrix} R_{11} & 0 \\ R_{21} & R_{22} \end{bmatrix}^{-} \tag{17}$$

where

$$\begin{bmatrix} R_{11} & 0 \\ R_{21} & R_{22} \end{bmatrix}^{-} = \begin{bmatrix} R_{11} & 0 \\ R_{21} & R_{22} \end{bmatrix}^{T} \left(\begin{bmatrix} R_{11} & 0 \\ R_{21} & R_{22} \end{bmatrix} \begin{bmatrix} R_{11} & 0 \\ R_{21} & R_{22} \end{bmatrix}^{T}\right)^{-1}.$$
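The decomposition in (12) is lower triangular times a matrix with orthonormal rows, which can be obtained from a standard QR factorization of the transposed data matrix. The sketch below follows that route and then applies (17). The data are synthetic stand-ins: `Zp_U` plays the role of the stacked past and future input data, and `Theta` plays the role of $[Γ_{s} L_{p} \;\; H_{u, s}]$. Both names and all values are assumptions for illustration.

```python
import numpy as np

def lq_decomposition(M):
    """Return lower-triangular L and Q with orthonormal rows such that M = L @ Q."""
    Q_t, R_t = np.linalg.qr(M.T)          # reduced QR of the transpose: M.T = Q_t @ R_t
    return R_t.T, Q_t.T

def regress_output_on_data(Zp_U, Y):
    """Eq. (17): [R31 R32] times the pseudo-inverse of [[R11, 0], [R21, R22]]."""
    n = Zp_U.shape[0]
    L, _ = lq_decomposition(np.vstack([Zp_U, Y]))
    R_top = L[:n, :n]                     # [[R11, 0], [R21, R22]]
    R_low = L[n:, :n]                     # [R31 R32]
    return R_low @ np.linalg.pinv(R_top)

rng = np.random.default_rng(3)
n, m, N = 3, 2, 500
Zp_U = rng.standard_normal((n, N))        # stand-in for [Z_p; U_{k,s}]
Theta = rng.standard_normal((m, n))       # stand-in for [Gamma_s L_p, H_{u,s}]
Y = Theta @ Zp_U + 0.01 * rng.standard_normal((m, N))   # small noise term, as in (15)
Theta_hat = regress_output_on_data(Zp_U, Y)
print(np.max(np.abs(Theta_hat - Theta)))
```

Working with the triangular factors instead of the raw data matrices is numerically equivalent to ordinary least squares here, while the noise contribution is isolated in the $R_{33} Q_{3}$ block.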

With the identified matrix $[Γ_{s} L_{p} \;\; H_{u, s}]$, the matrices A, B, C, and D can be estimated. To extract A and C from the observability matrix, an estimate ${\hat{Γ}}_{s}$ is obtained by applying SVD to $Γ_{s} L_{p} Z_{p}$:

$$Γ_{s} L_{p} Z_{p} = [U_{1} \;\; U_{2}] \begin{bmatrix} S_{1} & 0 \\ 0 & S_{2} \approx 0 \end{bmatrix} \begin{bmatrix} V_{1}^{T} \\ V_{2}^{T} \end{bmatrix}. \tag{18}$$

Based on the decomposition, the estimate ${\hat{Γ}}_{s}$ is given by ${\hat{Γ}}_{s} = U_{1} S_{1}^{1 / 2}$. The matrix C is estimated by

$$\hat{C} = {\hat{Γ}}_{s}(1 : k_{y}, :). \tag{19}$$

Concurrently, the matrix A can be estimated through the following calculation

$$\begin{aligned} & {\hat{Γ}}_{s}(k_{y} + 1 : (s + 1) k_{y}, :) = \begin{bmatrix} C A \\ \vdots \\ C A^{s} \end{bmatrix} = \begin{bmatrix} C \\ \vdots \\ C A^{s - 1} \end{bmatrix} A \triangleq {\hat{Γ}}_{s - 1} A \\ \Rightarrow{} & \hat{A} = {({\hat{Γ}}_{s - 1}^{T} {\hat{Γ}}_{s - 1})}^{-1} {\hat{Γ}}_{s - 1}^{T} {\hat{Γ}}_{s}(k_{y} + 1 : (s + 1) k_{y}, :). \end{aligned} \tag{20}$$

Furthermore, B and D can be estimated based on the Toeplitz matrix $H_{u, s}$. Specifically, the estimations of B and D are obtained as follows:

$$\hat{D} = H_{u, s}(s k_{y} + 1 : (s + 1) k_{y}, \; s k_{u} + 1 : (s + 1) k_{u}), \tag{21}$$

$$H_{u, s}(k_{y} + 1 : (s + 1) k_{y}, \; 1 : k_{u}) = \begin{bmatrix} C B \\ \vdots \\ C A^{s - 1} B \end{bmatrix} \triangleq {\hat{Γ}}_{s - 1} B \tag{22}$$

$$\Rightarrow \hat{B} = {({\hat{Γ}}_{s - 1}^{T} {\hat{Γ}}_{s - 1})}^{-1} {\hat{Γ}}_{s - 1}^{T} H_{u, s}(k_{y} + 1 : (s + 1) k_{y}, \; 1 : k_{u}). \tag{23}$$
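The extraction steps (18)–(23) can be sketched as follows on a toy single-input, single-output system with known matrices, so that the result can be checked. Because ${\hat{Γ}}_{s}$ comes from an SVD, the recovered matrices generally differ from the originals by a similarity transformation; the check therefore compares invariants (eigenvalues of A, the Markov parameter CB, and D). All values and the function name are illustrative assumptions, and the random matrix `X` merely stands in for $L_{p} Z_{p}$.

```python
import numpy as np

def extract_system_matrices(Gamma, H_u, ky, ku):
    """Eqs. (19)-(23): read C and D off the blocks, solve for A and B by least squares."""
    C_hat = Gamma[:ky, :]                                              # Eq. (19)
    Gamma_up = Gamma[:-ky, :]                                          # Gamma_{s-1}
    A_hat = np.linalg.lstsq(Gamma_up, Gamma[ky:, :], rcond=None)[0]    # Eq. (20)
    D_hat = H_u[-ky:, -ku:]                                            # Eq. (21)
    B_hat = np.linalg.lstsq(Gamma_up, H_u[ky:, :ku], rcond=None)[0]    # Eqs. (22)-(23)
    return A_hat, B_hat, C_hat, D_hat

# Toy system with k_y = k_u = 1 (illustrative values only).
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[1.0], [0.5]])
C = np.array([[1.0, 0.0]])
D = np.array([[0.2]])
s, n = 3, 2

Gamma = np.vstack([C @ np.linalg.matrix_power(A, i) for i in range(s + 1)])
H_u = np.zeros((s + 1, s + 1))
for i in range(s + 1):
    for j in range(i + 1):
        blk = D if i == j else C @ np.linalg.matrix_power(A, i - j - 1) @ B
        H_u[i, j] = blk[0, 0]

# Eq. (18): SVD of Gamma_s times a full-row-rank data matrix.
rng = np.random.default_rng(2)
X = rng.standard_normal((n, 50))
U, S, _ = np.linalg.svd(Gamma @ X, full_matrices=False)
Gamma_hat = U[:, :n] * np.sqrt(S[:n])      # U_1 S_1^{1/2}

A_hat, B_hat, C_hat, D_hat = extract_system_matrices(Gamma_hat, H_u, ky=1, ku=1)
print(np.sort(np.linalg.eigvals(A_hat).real))
```

`np.linalg.lstsq` computes the same left pseudo-inverse solution written out in (20) and (23) whenever ${\hat{Γ}}_{s - 1}$ has full column rank.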

Although the measurement noise $v(k)$ and process noise $w(k)$ are commonly modeled as zero-mean Gaussian vectors, their covariance matrices $Σ_{v}$ and $Σ_{w}$ are generally unavailable in practice. This uncertainty may limit the monitoring performance when the KF-based residual generation is applied. To mitigate this difficulty, the covariance matrices $Σ_{w}$ and $Σ_{v}$ are estimated within the KF framework using the identified matrices. The estimation is achieved through iterative interactions between the estimator and the measurement data.

Remark 1. For notational simplicity, the matrices A, B, C, and D used in the subsequent derivations denote their estimated counterparts $\hat{A}$, $\hat{B}$, $\hat{C}$, and $\hat{D}$, respectively.

Based on the KF formulation in (3)–(6), the following expression can be derived:

$$Σ_{v} = Σ_{r, k} - C P_{k} C^{T} \;\Rightarrow\; \begin{cases} Σ_{v} + C Σ_{w} C^{T} = Σ_{r, k + 1} - Φ_{k, 0}, \\ Φ_{k, 0} = C (A P_{k} A^{T} - K_{k} Σ_{r, k} K_{k}^{T}) C^{T}. \end{cases} \tag{24}$$

Within the KF framework, the following formulation is obtained based on the idea of iterative interaction:

$$\begin{aligned} & Σ_{v} + \sum_{i = 0}^{j} C A^{i} Σ_{w} {(C A^{i})}^{T} = Σ_{r, k + j + 1} - Φ_{k, j}, \quad j = 0, 1, \dots \\ & Φ_{k, j} = C \left(A^{j + 1} P_{k} {(A^{T})}^{j + 1} - \sum_{i = 0}^{j} A^{j - i} K_{k + i} Σ_{r, k + i} K_{k + i}^{T} {(A^{j - i})}^{T}\right) C^{T}. \end{aligned} \tag{25}$$

Applying the vectorization operation to (24)–(25) yields

$$\begin{aligned} \mathrm{vec}(Σ_{v}) &= \mathrm{vec}(Σ_{r, k} - C P_{k} C^{T}), \\ \mathrm{vec}\left(Σ_{v} + \sum_{i = 0}^{j} C A^{i} Σ_{w} {(C A^{i})}^{T}\right) &= \mathrm{vec}(Σ_{v}) + \sum_{i = 0}^{j} [(C A^{i}) \otimes (C A^{i})] \, \mathrm{vec}(Σ_{w}) \\ &= \mathrm{vec}(Σ_{r, k + j + 1}) - \mathrm{vec}(Φ_{k, j}), \quad j = 0, 1, \dots, ϱ. \end{aligned} \tag{26}$$

From (26), the following equivalent form can be derived

$$\begin{bmatrix} I & 0 \\ I & C \otimes C \\ \vdots & \vdots \\ I & \sum_{i = 0}^{ϱ} (C A^{i}) \otimes (C A^{i}) \end{bmatrix} \begin{bmatrix} \mathrm{vec}(Σ_{v}) \\ \mathrm{vec}(Σ_{w}) \end{bmatrix} = \begin{bmatrix} \mathrm{vec}(Σ_{r, k}) - \mathrm{vec}(C P_{k} C^{T}) \\ \mathrm{vec}(Σ_{r, k + 1}) - \mathrm{vec}(Φ_{k, 0}) \\ \vdots \\ \mathrm{vec}(Σ_{r, k + ϱ + 1}) - \mathrm{vec}(Φ_{k, ϱ}) \end{bmatrix}. \tag{27}$$

It is worth noting that $Σ_{v}$ and $Σ_{w}$ are diagonal matrices. Therefore, they contain $k_{y}$ and $k_{x}$ independent elements, respectively. These independent elements are denoted by $ϑ_{v}$ and $ϑ_{w}$. Accordingly, there exists a matrix $Ψ \in R^{(k_{y}^{2} + k_{x}^{2}) \times (k_{y} + k_{x})}$ such that

$$\begin{bmatrix} \mathrm{vec}(Σ_{v}) \\ \mathrm{vec}(Σ_{w}) \end{bmatrix} = Ψ \begin{bmatrix} ϑ_{v} \\ ϑ_{w} \end{bmatrix} \tag{28}$$

where

$$\begin{aligned} ϑ_{v} &= {[σ_{v, 1}, σ_{v, 2}, \dots, σ_{v, k_{y}}]}^{T}, \\ ϑ_{w} &= {[σ_{w, 1}, σ_{w, 2}, \dots, σ_{w, k_{x}}]}^{T}, \end{aligned}$$

and $σ_{v, i}$ represents the i-th diagonal element of $Σ_{v}$, $i = 1, 2, \dots, k_{y}$, and $σ_{w, j}$ denotes the j-th diagonal element of $Σ_{w}$, $j = 1, 2, \dots, k_{x}$.

Substituting (28) into (27) yields

$$\underbrace{\begin{bmatrix} I & 0 \\ I & C \otimes C \\ \vdots & \vdots \\ I & \sum_{i = 0}^{ϱ} (C A^{i}) \otimes (C A^{i}) \end{bmatrix} Ψ}_{A_{v, w}} \begin{bmatrix} ϑ_{v} \\ ϑ_{w} \end{bmatrix} = \begin{bmatrix} \mathrm{vec}(Σ_{r, k}) - \mathrm{vec}(C P_{k} C^{T}) \\ \mathrm{vec}(Σ_{r, k + 1}) - \mathrm{vec}(Φ_{k, 0}) \\ \vdots \\ \mathrm{vec}(Σ_{r, k + ϱ + 1}) - \mathrm{vec}(Φ_{k, ϱ}) \end{bmatrix}. \tag{29}$$

It should be noted that (29) provides a general formulation for estimating the structured parameters of $Σ_{v}$ and $Σ_{w}$. Based on (29), the estimations of $Σ_{v}$ and $Σ_{w}$ can be iteratively updated by solving the following expression:

$$A_{v, w} \begin{bmatrix} ϑ_{v, i} \\ ϑ_{w, i} \end{bmatrix} = \begin{bmatrix} φ(k) - \mathrm{vec}(C P_{k} C^{T}) \\ φ(k + 1) - \mathrm{vec}(Φ_{k, 0}) \\ \vdots \\ φ(k + ϱ + 1) - \mathrm{vec}(Φ_{k, ϱ}) \end{bmatrix} \tag{30}$$

where i represents the iteration index in the covariance update procedure and

$$φ(k + j) = \mathrm{vec}\left([y(k + j) - \hat{y}(k + j)] {[y(k + j) - \hat{y}(k + j)]}^{T}\right), \quad j = 0, 1, \dots, ϱ + 1.$$

Accordingly, the estimations of $Σ_{v}$ and $Σ_{w}$ are given by

$$\begin{aligned} {\hat{Σ}}_{v} &= \mathrm{diag}({\hat{ϑ}}_{v}), \\ {\hat{Σ}}_{w} &= \mathrm{diag}({\hat{ϑ}}_{w}). \end{aligned} \tag{31}$$
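The linear system in (27)–(31) can be checked in isolation with a short sketch. It assumes diagonal covariances and, to verify the algebra only, fills the right-hand side with exact innovation covariances generated from known "true" covariances rather than with the sample quantities $φ(\cdot)$. In the actual procedure those entries come from measured residuals, and the solve is repeated as the filter is re-run with updated estimates. The helper names and all numerical values are illustrative assumptions.

```python
import numpy as np

def diag_selector(n):
    """Maps the n diagonal entries to vec(S) for a diagonal n x n matrix S."""
    Psi = np.zeros((n * n, n))
    for i in range(n):
        Psi[i * n + i, i] = 1.0
    return Psi

def solve_noise_variances(A, C, rhs):
    """Least-squares solution of (29); rhs[0] pairs with the row [I 0],
    rhs[j+1] with the row [I, sum_{i<=j} (C A^i) kron (C A^i)]."""
    ky, kx = C.shape
    Psi_v, Psi_w = diag_selector(ky), diag_selector(kx)
    acc = np.zeros((ky * ky, kx * kx))
    rows = [np.hstack([Psi_v, acc @ Psi_w])]
    CA = C.copy()
    for _ in range(len(rhs) - 1):
        acc = acc + np.kron(CA, CA)
        rows.append(np.hstack([Psi_v, acc @ Psi_w]))
        CA = CA @ A
    theta = np.linalg.lstsq(np.vstack(rows), np.concatenate(rhs), rcond=None)[0]
    return np.diag(theta[:ky]), np.diag(theta[ky:])

# Toy system and "true" covariances used only to synthesise the right-hand side.
A = np.array([[0.9, 0.3], [0.0, 0.5]])
C = np.array([[1.0, 0.0]])
Sigma_w_true, Sigma_v_true = np.diag([0.02, 0.05]), np.array([[0.1]])

P = np.eye(2)
G = P.copy()          # C G C^T equals C P_k C^T first, then Phi_{k,j} from (25)
rhs = []
for _ in range(5):
    Sigma_r = Sigma_v_true + C @ P @ C.T
    rhs.append((Sigma_r - C @ G @ C.T).flatten(order="F"))
    K = A @ P @ C.T @ np.linalg.inv(Sigma_r)
    KSK = K @ Sigma_r @ K.T
    G = A @ G @ A.T - KSK
    P = A @ P @ A.T + Sigma_w_true - KSK

Sigma_v_hat, Sigma_w_hat = solve_noise_variances(A, C, rhs)
print(np.diag(Sigma_v_hat), np.diag(Sigma_w_hat))
```

With exact right-hand sides the true variances are recovered, which confirms the structure of the regression matrix. With sample innovations the solution is only an estimate, and enough rows are needed for the matrix to have full column rank.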

To enable FD, a residual vector is required for the construction of the test statistic. Using the identified matrices A, B, C, and D, together with the estimated covariance matrices $Σ_{v}$ and $Σ_{w}$, the residual vector $r(k)$ is derived from the KF algorithm (3)–(6). Based on this residual signal, the test statistic for FD is formulated as

$$J(r) = r^{T}(k) Σ_{r, 0}^{-1} r(k) \sim χ^{2}(k_{y}) \tag{32}$$

where $Σ_{r, 0}$ denotes the covariance matrix of the residual vector under fault-free conditions. According to the statistic in (32), the threshold $J_{th}$ is given by

$$J_{th} = χ_{α}^{2}(k_{y}) \tag{33}$$

where $α$ is a user-selected significance level that sets the upper limit of the acceptable false alarm rate (FAR).

Based on (32) and (33), the following detection logic is proposed for FD; that is,

$$\begin{cases} J(r) - J_{th} > 0 \;\Rightarrow\; \text{fault alarm}, \\ J(r) - J_{th} \leq 0 \;\Rightarrow\; \text{no alarm}. \end{cases} \tag{34}$$
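The residual evaluation can be sketched in a few lines. The sketch assumes the quadratic form is weighted by the inverse of the fault-free residual covariance, which is what a $χ^{2}(k_{y})$ distribution requires. To stay within the standard library, the threshold uses the Wilson–Hilferty approximation of the chi-square quantile instead of an exact table value; this approximation is a substitution made here and is not taken from the paper. The residual values are made up for illustration.

```python
import numpy as np
from statistics import NormalDist

def chi2_threshold(alpha, dof):
    """Approximate upper-alpha quantile of chi^2(dof) via Wilson-Hilferty."""
    z = NormalDist().inv_cdf(1.0 - alpha)
    c = 2.0 / (9.0 * dof)
    return dof * (1.0 - c + z * np.sqrt(c)) ** 3

def evaluate_residual(r, Sigma_r0, J_th):
    """Test statistic and alarm decision for one residual sample."""
    J = float(r @ np.linalg.solve(Sigma_r0, r))   # r^T Sigma_r0^{-1} r
    return J, bool(J > J_th)

k_y = 3
J_th = chi2_threshold(alpha=0.05, dof=k_y)        # close to the tabulated 7.81
Sigma_r0 = 0.01 * np.eye(k_y)                     # made-up fault-free covariance

J_ok, alarm_ok = evaluate_residual(np.array([0.10, -0.20, 0.05]), Sigma_r0, J_th)
J_bad, alarm_bad = evaluate_residual(np.array([0.30, 0.00, 0.00]), Sigma_r0, J_th)
print(J_th, (J_ok, alarm_ok), (J_bad, alarm_bad))
```

Using `np.linalg.solve` avoids forming the explicit inverse of the covariance matrix, which is usually the safer choice numerically.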

To construct a reliable FD scheme for traction drive systems, the matrices A, B, C, and D are identified, and covariance matrices $Σ_{v}$ and $Σ_{w}$ are estimated in an offline procedure. During online operation, process measurements are utilized to compute the residual signals, which serve as the basis for FD. The overall design of the developed FD system is summarized in Algorithms 1 and 2.

Algorithm 1 Design of the proposed FD scheme: Offline Learning

Input: Fault-free process I/O data.

Output: $\hat{A}$, $\hat{B}$, $\hat{C}$, $\hat{D}$, ${\hat{Σ}}_{v}$, and ${\hat{Σ}}_{w}$.

1: begin
2: Load process data and construct matrices $U_{k, s}$, $Y_{k, s}$, and $Z_{p}$;
3: Execute (12)–(17) to obtain $Γ_{s} L_{p}$ and $H_{u, s}$;
4: Identify $\hat{A}$, $\hat{B}$, $\hat{C}$, and $\hat{D}$ using (19)–(23);
5: Obtain ${\hat{Σ}}_{v}$ and ${\hat{Σ}}_{w}$ using (30) and (31).
6: end

Algorithm 2 Design of the proposed FD scheme: Online FD

Input: Online I/O data and offline-estimated parameters.

Output: Online FD results.

1: begin
2: Load process I/O data;
3: Execute (3)–(6) to obtain the residual signal;
4: Design the residual evaluation unit using (32) and (33);
5: Execute FD based on the detection logic (34).
6: end

4. Experiment and Discussion

Experimental validation is conducted on a practical traction drive system to evaluate the proposed approach. The test bench consists of a high-voltage control cabinet, a data acquisition board, a computer for the program implementation, and a permanent magnet synchronous motor, as depicted in Figure 2. The main parameters of the traction motor are listed in Table 2. It should be noted that this work focuses on algorithm validation rather than sensor metrology; therefore, no separate study on sensor uncertainty was conducted. Prior to the experiments, routine checks confirmed that the sensor outputs were stable and within the expected operating ranges under normal conditions. Consequently, remaining sensor inaccuracies are treated as part of the measurement noise in collected data.

In the experimental setup, the voltage measurements are used as input variables, while the speed and current measurements are considered as output variables; that is,

$$\begin{aligned} u &= {[V_{dc} \;\; V_{a} \;\; V_{b}]}^{T}, \\ y &= {[I_{a} \;\; I_{b} \;\; s]}^{T}. \end{aligned} \tag{35}$$

I/O data for the parameter estimation are collected from monitoring nodes of the traction drive system, with representative samples shown in Figure 3. To assess the performance of the proposed approach, two distinct fault conditions are introduced in the traction drive system for analysis. These scenarios capture different operational challenges that may arise in practical applications.

1. An offset fault with a magnitude of 0.1 A is injected into $I_{a}$ at the 1001st sample (k = 1001).
2. A drift fault, defined as $f_{2}(k) = 0.15 (k - 1001)$ A, is applied to $I_{b}$ starting from the 1001st sample (k = 1001).
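For readers who want to reproduce comparable test signals, the two scenarios can be emulated on recorded data as sketched below. The sketch assumes 1-based sample indices (sample k stored at array index k - 1) and assumes that the offset persists once injected. The function names and the zero-valued placeholder signal are illustrative.

```python
import numpy as np

def inject_offset(signal, start, magnitude=0.1):
    """Add a constant offset from sample `start` (1-based) onward."""
    out = np.asarray(signal, dtype=float).copy()
    out[start - 1:] += magnitude
    return out

def inject_drift(signal, start, slope=0.15):
    """Add the ramp slope * (k - start) from sample `start` (1-based) onward."""
    out = np.asarray(signal, dtype=float).copy()
    k = np.arange(1, out.size + 1)
    out[start - 1:] += slope * (k[start - 1:] - start)
    return out

I_a = np.zeros(2000)                      # placeholder for a measured phase current
I_b = np.zeros(2000)
I_a_faulty = inject_offset(I_a, start=1001)
I_b_faulty = inject_drift(I_b, start=1001)
print(I_a_faulty[999:1002], I_b_faulty[999:1003])
```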

To verify the effectiveness of the developed method, a conventional KF-based FD approach [7], an unscented KF-based FD approach [12], and an extended KF-based FD approach [15] are selected as benchmark methods for comparison, with the system matrices considered to be known.

Figures 4–7 present the FD results for the offset fault scenario obtained using the three benchmark methods and the proposed data-driven approach, respectively. Figures 4–6 indicate that deviations between the prior-assumed noise statistics and the true system noise statistics can impair the FD performance. Although the conventional KF-based, unscented KF-based, and extended KF-based FD methods, as well as the proposed method, exhibit comparable FAR, the three benchmark methods suffer from limited fault detection rates (FDR), resulting in unbalanced detection performance. It is worth noting that the abrupt increase in the test statistic at the beginning mainly results from the initialization transient of the KF rather than the fault itself. The discrepancy between the initial state estimate and the true system state produces large innovations during the initial sampling instants. Because the residual sequence has not yet converged to its steady-state distribution, the test statistic is temporarily amplified, which leads to the observed initial spike. For the proposed method, the FAR remains within the acceptable limit, while the desired detection performance is effectively maintained.

Figures 8–11 show the detection results for the drift fault scenario using the three benchmark methods and the proposed approach, respectively. A similar initial spike of the test statistic can also be observed. This phenomenon mainly results from the transient behavior of the iterative algorithm during its initialization stage. Since the parameter estimates have not yet converged in the early iterations, the corresponding test statistic may be temporarily amplified. As observed from the figures, the proposed method exhibits a relatively low FDR for the drift fault scenario. This is mainly attributed to the gradual nature of the drift fault, where the short-term variation in system outputs may still remain within the normal fluctuation range, making early identification difficult. Nevertheless, compared with the conventional KF-based, unscented KF-based, and extended KF-based FD methods, the proposed data-driven method achieves more reliable FD, leading to improved detection performance.

To further demonstrate the effectiveness of the proposed method, Table 3 provides a quantitative comparison of the performance achieved by the benchmark FD methods and the proposed method. Although they use the same identified system matrices as the proposed method, the conventional KF-based, unscented KF-based, and extended KF-based FD methods neglect the mismatch between assumed and actual noise statistics, which degrades their detection performance, as shown in Figures 4–11 and Table 3. In contrast, the proposed approach alleviates this limitation through iterative interaction and learning, suppressing the adverse effects of the noise statistics mismatch and effectively balancing the FAR and FDR. Moreover, it does not rely on precise system modeling, thereby avoiding complex model construction while still achieving the desired detection performance.
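The two performance indices can be computed from a sequence of test statistics as sketched below. The sketch assumes the usual definitions: FAR is the fraction of fault-free samples that raise an alarm, and FDR is the fraction of faulty samples that raise an alarm. The function name and the short sequence of statistic values are made up for illustration and are unrelated to the values reported in Table 3.

```python
import numpy as np

def far_fdr(J, J_th, fault_start):
    """FAR over samples before `fault_start` (1-based), FDR from `fault_start` onward."""
    alarms = np.asarray(J) > J_th
    far = alarms[:fault_start - 1].mean()
    fdr = alarms[fault_start - 1:].mean()
    return float(far), float(fdr)

J = [1.0, 9.0, 2.0, 3.0, 10.0, 12.0, 5.0, 11.0]   # toy test statistics
far, fdr = far_fdr(J, J_th=7.8, fault_start=5)
print(far, fdr)
```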

Although the proposed method achieves satisfactory FD performance, several limitations remain. First, the current framework is built on a linear state-space model obtained by linearizing the traction drive system around a stable operating point. Its applicability may therefore be limited under strongly nonlinear conditions, when the operating state of the traction drive system varies significantly. Second, the proposed method focuses on FD and does not explicitly address fault-tolerant control. In practical traction drive systems, reliable monitoring should be further integrated with control reconfiguration to ensure safe operation under complex conditions.

5. Conclusions

In this work, a data-driven FD scheme has been developed for traction drive systems in the presence of unknown noise covariances. To avoid reliance on accurate system modeling, the system matrices were identified directly from collected I/O data based on the principle of subspace identification. To address the challenge posed by unknown noise statistics, an adaptive covariance estimation strategy was incorporated into the KF framework. By exploiting the iterative interaction between the estimator and the measurement data, the noise covariance matrices are progressively updated, thereby mitigating the performance degradation caused by empirically specified covariance matrices. The proposed approach was experimentally validated on a real traction drive system. For offset faults, it achieves an FDR of 0.8040 at an FAR comparable to that of the benchmark methods; for drift faults, it achieves the lowest FAR (0.0150) together with the highest FDR (0.5340) among the compared methods. These results demonstrate that the proposed method delivers reliable FD performance. It should be noted that the developed method is established for linear dynamic systems. Future research will aim to extend the developed framework to nonlinear systems and to incorporate fault-tolerant control under more complex operating conditions.
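The idea of letting the filter and the measurement data interact to update a noise covariance can be illustrated with the following minimal sketch. It is not the update rule developed in this work: it substitutes innovation-based covariance matching, a common adaptive-filtering heuristic, on a scalar system, and it estimates only the measurement noise variance while the process noise variance is held fixed. The function `adaptive_r_kalman`, the system parameters, and the window length are assumptions made for illustration.

```python
import numpy as np


def adaptive_r_kalman(y, a, c, q, r0, window=200, r_min=1e-6):
    """Scalar KF whose measurement-noise variance r is re-estimated online.

    Covariance matching: for the innovation e, E[e^2] = c*p_pred*c + r, so
    r is set to (windowed mean of e^2) - c*p_pred*c, floored at r_min.
    """
    x, p, r = 0.0, 1.0, r0
    innovations, r_history = [], []
    for yk in y:
        # Time update (prediction)
        x_pred = a * x
        p_pred = a * p * a + q
        # Innovation (one-step-ahead residual)
        e = yk - c * x_pred
        innovations.append(e)
        # Re-estimate r once a full window of innovations is available
        if len(innovations) >= window:
            recent = np.asarray(innovations[-window:])
            r = max(float(np.mean(recent ** 2)) - c * p_pred * c, r_min)
        # Measurement update with the current estimate of r
        gain = p_pred * c / (c * p_pred * c + r)
        x = x_pred + gain * e
        p = (1.0 - gain * c) * p_pred
        r_history.append(r)
    return np.asarray(r_history)


# Hypothetical scalar system: x[k+1] = a*x[k] + w[k], y[k] = c*x[k] + v[k]
rng = np.random.default_rng(0)
a, c, q, r_true = 0.9, 1.0, 0.01, 0.25
x, y = 0.0, []
for _ in range(2000):
    y.append(c * x + rng.normal(0.0, np.sqrt(r_true)))
    x = a * x + rng.normal(0.0, np.sqrt(q))

r_hist = adaptive_r_kalman(y, a, c, q, r0=5.0)  # deliberately poor initial guess
print(r_hist[0], r_hist[-1])
```

Because the innovations depend on the filter gain, and the gain depends on the current estimate of r, the estimate is refined iteratively as data arrive. In this toy setting the estimate moves from the poor initial guess toward the variance used to generate the data.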

Author Contributions

Conceptualization, C.F. and C.J.; methodology, C.F. and C.J.; software, C.F., S.W. and Z.W.; validation, C.F., S.W. and Z.W.; formal analysis, C.F.; investigation, C.F.; resources, C.J.; data curation, C.F. and S.W.; writing—original draft preparation, C.F., S.W. and Z.W.; writing—review and editing, P.C. and C.F.; visualization, C.F. and P.C.; supervision, C.J.; project administration, C.J.; funding acquisition, S.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work has been supported by the National Natural Science Foundation of China under Grant 62273058.

Data Availability Statement

Requests for access to the data supporting this study should be directed to the authors.

Acknowledgments

The authors would like to thank the editors and reviewers for their constructive comments, which have helped improve the quality of this work.

Conflicts of Interest

Author Peng Cheng was employed by the company CRRC Changchun Railway Vehicles Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

FD	Fault detection
KF	Kalman filter
I/O	Input–output
SVD	Singular value decomposition
FAR	False alarm rate
FDR	Fault detection rate

References

1. Chen, H.; Jiang, B.; Ding, S.X.; Huang, B. Data-Driven Fault Diagnosis for Traction Systems in High-Speed Trains: A Survey, Challenges, and Perspectives. IEEE Trans. Intell. Transp. Syst. 2022, 23, 1700–1716.
2. Xu, J.; Zhong, M.; Li, L.; Wu, Y.; Song, B. A Fuzzy H_i/H_∞ Optimization Approach to Fault Detection of High-Speed Train Traction Motor Systems. IEEE Trans. Ind. Inform. 2025, 21, 3655–3665.
3. Zhong, M.; Zhang, K.; Zhang, L.; Zhong, N.; Guo, H. An innovation re-organised parity space approach to fault detection for rectifier in high-speed train electrical traction systems. Int. J. Syst. Sci. 2026.
4. Mao, Z.; Tao, G.; Jiang, B.; Yan, X.G. Adaptive Compensation of Traction System Actuator Failures for High-Speed Trains. IEEE Trans. Intell. Transp. Syst. 2017, 18, 2950–2963.
5. Zhang, K.; Jiang, B.; Yan, X.G.; Shen, J. Interval Sliding Mode Observer Based Incipient Sensor Fault Detection with Application to a Traction Device in China Railway High-Speed. IEEE Trans. Veh. Technol. 2019, 68, 2585–2597.
6. Kolpakhchyan, P.G.; Pakhomin, S.A.; Kochin, A.E.; Evstaf’ev, A.M.; Andreev, V. Traction Induction Motor State Observer Based on an Luenberger Filter. In Proceedings of the International Conference on Intelligent Information Technologies for Industry, St. Petersburg, Russia, 25–30 September 2023; Springer: Cham, Switzerland, 2023; pp. 260–270.
7. Ding, S.X. Model-Based Fault Diagnosis Techniques: Design Schemes, Algorithms, and Tools; Springer: Berlin/Heidelberg, Germany, 2008.
8. Chen, H.; Jiang, B. A Review of Fault Detection and Diagnosis for the Traction System in High-Speed Trains. IEEE Trans. Intell. Transp. Syst. 2020, 21, 450–465.
9. Cheng, C.; Wan, Z.; Wang, W.; Sun, W.; Fu, C.; Chen, H. A Robust Data-Driven Sensor Fault Detection Method for Traction Drive Systems. IEEE Trans. Instrum. Meas. 2025, 74, 3562009.
10. Sun, X.; Song, C.; Zhang, Y.; Sha, X.; Diao, N. An Open-Circuit Fault Diagnosis Algorithm Based on Signal Normalization Preprocessing for Motor Drive Inverter. IEEE Trans. Instrum. Meas. 2023, 72, 3513712.
11. Xia, L.; Liang, Y.; Zheng, P.; Huang, X. Residual-Hypergraph Convolution Network: A Model-Based and Data-Driven Integrated Approach for Fault Diagnosis in Complex Equipment. IEEE Trans. Instrum. Meas. 2023, 72, 3501811.
12. Cheng, C.; Wang, W.; Meng, X.; Shao, H.; Chen, H. Sigma-mixed unscented Kalman filter-based fault detection for traction systems in high-speed trains. Chin. J. Electron. 2023, 32, 982–991.
13. Foo, G.H.B.; Zhang, X.; Vilathgamuwa, D.M. A Sensor Fault Detection and Isolation Method in Interior Permanent-Magnet Synchronous Motor Drives Based on an Extended Kalman Filter. IEEE Trans. Ind. Electron. 2013, 60, 3485–3495.
14. Namdar, A.; Samet, H.; Allahbakhshi, M.; Tajdinian, M.; Ghanbari, T. A robust stator inter-turn fault detection in induction motor utilizing Kalman filter-based algorithm. Measurement 2022, 187, 110181.
15. Miniach, M.; Orlowska-Kowalska, T. Innovative extended Kalman filter in a coherent system for detection and compensation of various current sensor faults in the induction motor drive. IEEE Trans. Ind. Electron. 2025, 72, 7772–7784.
16. Huang, B.; Kadali, R. Dynamic Modeling, Predictive Control and Performance Monitoring: A Data-Driven Subspace Approach; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2008; Volume 374.
17. Favoreel, W.; De Moor, B.; Van Overschee, P. Subspace state space system identification for industrial processes. J. Process Control 2000, 10, 149–155.
18. Goldstein, H. Multilevel mixed linear model analysis using iterative generalized least squares. Biometrika 1986, 73, 43–56.
19. Ding, S.X.; Yang, Y.; Zhang, Y.; Li, L. Data-driven realizations of kernel and image representations and their application to fault detection and control system design. Automatica 2014, 50, 2615–2623.
20. Ding, S.X. Data-Driven Design of Fault Diagnosis and Fault-Tolerant Control Systems; Springer: London, UK, 2014.
21. Qin, S.J. An overview of subspace identification. Comput. Chem. Eng. 2006, 30, 1502–1513.

Figure 1. Schematic diagram of the traction drive system.

Figure 2. Experimental setup of the traction drive system.

Figure 3. I/O signals of the traction drive system.

Figure 4. Offset fault detection results using the conventional KF-based method.

Figure 5. Offset fault detection results using the unscented KF-based method.

Figure 6. Offset fault detection results using the extended KF-based method.

Figure 7. Offset fault detection results using the proposed method.

Figure 8. Drift fault detection results using the conventional KF-based method.

Figure 9. Drift fault detection results using the unscented KF-based method.

Figure 10. Drift fault detection results using the extended KF-based method.

Figure 11. Drift fault detection results using the proposed data-driven method.

Table 1. Sensors used in the traction drive system.

Variable	Description	Unit
$I_{a}$	A-phase current	A
$I_{b}$	B-phase current	A
$V_{a}$	A-phase voltage	V
$V_{b}$	B-phase voltage	V
$V_{c}$	C-phase voltage	V
s	Motor speed	r/min
$V_{dc}$	System input voltage	V

Table 2. Key parameters of the traction motor.

Symbol	Parameter	Value (Unit)
$T_{e}$	rated torque	2.43 (N·m)
$J_{m}$	moment of inertia	$0.425 \times 10^{-3}$ (kg·m$^{2}$)
$L_{a}$	inductance of motor coil	$2.96 \times 10^{-3}$ (H)
$T_{L}$	load torque	3.645 (N·m)
$O_{a}$	magnet flux	$9.92 \times 10^{-2}$ (Wb)
p	pole pairs	4
$R_{a}$	resistance of motor coil	0.985 (Ω)
$D_{m}$	viscosity friction coefficient	$1 \times 10^{-4}$

Table 3. Comparison of FD performance for different methods.

Method	FAR ($f_{1}$: offset fault)	FDR ($f_{1}$: offset fault)	FAR ($f_{2}$: drift fault)	FDR ($f_{2}$: drift fault)
The method in [7]	0.0510	0.2430	0.0290	0.3340
The method in [12]	0.0450	0.2840	0.0310	0.4240
The method in [15]	0.0460	0.5920	0.0560	0.3360
The developed method	0.0460	0.8040	0.0150	0.5340

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

