1. Introduction
Owing to its salient features, the Modular Multilevel Converter (MMC) has become the mainstream topology for High Voltage Direct Current (HVDC) systems [
1,
2,
3]. Many MMC-HVDC projects have been put into operation in China, such as the Nan’ao multi-terminal flexible DC transmission project, the Zhoushan five-terminal flexible DC project, and the Zhangbei flexible DC grid project [
4,
5,
6].
In MMC systems, significant circulating currents flow within the converter, resulting in increased current stress, higher system losses, and other adverse effects. To mitigate these issues, researchers have developed various suppression strategies, including the Proportional-Resonant (PR) controller, the 
dq frame-based Proportional-Integral (PI) controller, and the repetitive controller [
7,
8,
9,
10]. However, all these strategies rely on accurate MMC arm current measurements. If the arm current measurement becomes unreliable or fails, the circulating current control could malfunction, posing a risk to system safety.
Current measurement devices in HVDC systems primarily consist of three types: zero-flux DC current transformers, active DC current transformers, and all-fiber DC optical current transformers [
11]. Due to their complex structures, these devices are prone to malfunctions. In recent years, there has been a high frequency of failures in these devices. Specifically, by 2017, there were 383 recorded current transformer faults across 21 DC systems operated by the State Grid Corporation of China [
12]. In a HVDC transmission system, measurement deviations occurred due to increased contact resistance caused by loosened fastening screws in the shunt of the current measurement device. This resulted in the incorrect operation of the longitudinal differential protection for the Pole 2 line [
13]. To mitigate these impacts, a fault detection method for current measurement devices is urgently required.
The study of fault detection in MMCs has garnered significant attention in recent years. However, most research has primarily focused on Submodule (SM) IGBT open-circuit faults [
14,
15,
16] and arm faults [
17], with limited focus on current measurement faults. Zhang et al. [
18] proposed a fault feature identification-based diagnostic strategy for MMC current sensors, enabling open-circuit fault detection and localization by monitoring the constant characteristics of submodule capacitor voltages without requiring additional hardware circuits. However, this method primarily targets open-circuit faults in output current and arm current sensors, exhibiting limited capability to address other types of current measurement failures. Belayneh et al. [
19] addressed constant gain faults and amplitude deviation faults in MMC arm current measurement by proposing a composite compensation algorithm based on integral operations and PI controllers. By extracting fault characteristics and dynamically adjusting the measured values, the method achieves fault-tolerant control. However, the focus of this paper is not on measurement fault detection, and only constant gain faults and amplitude deviation faults are considered, neglecting other potential fault cases, which presents a limitation. Therefore, the simultaneous detection of multiple fault types in arm current measurement devices deserves further research. This requires a fault detection method with advantages such as fast detection speed and strong anti-interference capability to effectively detect the four types of arm current measurement faults.
In the field of fault detection, existing methodologies are basically classified into hardware-based and software-based approaches [
20]. Hardware-based methods rely on additional sensors for fault detection, which suffer from high implementation costs. Model-based approaches are widely adopted software-based fault detection methods, which mainly include Sliding Mode Observer (SMO), Model Predictive Control (MPC), and Kalman Filter (KF). In MMC systems, SMO was first introduced for IGBT open-circuit fault detection in [
21], while [
22,
23] developed dedicated SMO for arm currents and output currents, respectively, utilizing residual signals between observed and measured values for fault localization. MPC-based approaches are explored in [
24,
25], where [
24] utilized error dynamic correction-enhanced MPC for arm current prediction to identify faulty arms, and [
25] introduced an incremental MPC model with circulating currents as state variables, employing multi-step prediction and error compensation. The KF, which incorporates process and measurement noise, enables optimal state estimation by utilizing system input-output data. In [
26], a real-time fault diagnosis for a linear drive system was developed based on the KF. In [
27], the KF was used to detect and isolate actuator and sensor faults on an aircraft engine. In [
28], a water leak fault was diagnosed through an extended KF with data obtained from a real pipeline. In [
29], a modified unscented KF was used to detect the simultaneous air data sensor faults for an aircraft. In [
30,
31], the KF was adopted to detect SM IGBT open-circuit faults. Given its advantages of strong real-time performance, robustness against noise, and suitability for state estimation in dynamic systems, the KF was considered for MMC arm current measurement fault detection.
This paper focuses on four typical MMC arm current measurement faults: constant gain faults, amplitude deviation faults, phase shift faults, and stuck faults, along with their detrimental impacts on circulating current control. These faults can induce significant fluctuations in circulating currents, which threaten system stability. To prevent compromised system stability caused by arm current measurement faults, a KF-based fault detection method is proposed. The method selects the SM average capacitor voltage as the observation variable. During normal operation, the observed voltage value converges to the actual value. However, when a fault occurs, the observed voltage value will deviate from the actual voltage. To enhance the fault detection, this paper introduces the Moving Root Mean Square (MRMS) value of the residual. The proposed method allows for the rapid detection of faults in MMC arm current measurement.
The rest of this paper is organized as follows. 
Section 2 describes the topology structure of MMCs and the characteristics of circulating current, as well as the 
dq circulating current suppression controller. 
Section 3 provides a detailed analysis of four typical arm current measurement faults and their impacts. 
Section 4 presents the principles of the KF-based fault detection method. 
Section 5 provides the results of the simulation and experimental validation. Finally, the conclusions are presented in 
Section 6.
  3. Impact of Fault on Current Measurement Device
In this section, the four typical arm current measurement faults are discussed, followed by a comprehensive analysis of the fault characteristics and their impacts on the system. The MMC-HVDC simulation model was established in MATLAB/Simulink 2025a to support the analysis. The main parameters of the simulated system are listed in 
Table 1, which are based on the Nan’ao multi-terminal project Jinniu station.
The fault of an active DC current measurement device is quite complex. For the convenience of theoretical description and further analysis, the faults occurring in actual operation are classified into constant gain faults, amplitude deviation faults, phase shift faults, and stuck faults. This paper focuses on the above-mentioned four fault types, analyzing their impacts on the MMC three-phase circulating currents and DC current characteristics.
The constant gain fault can be characterized as
      where 
 is the measured value, 
 is the actual value, and 
 is the deviation gain, which can be positive and negative according to the fault characteristics.
Considering the symmetry of MMCs, the upper arm current measurement device in phase 
 was studied, supposing that 
 and 
 arm current measurement faults occur. The typical 
dq rotating frame-based CCSC was adopted. Corresponding simulation waveforms are shown in 
Figure 3 and 
Figure 4.
In 
Figure 3, an 
 arm current measurement fault occurs at 2 s, which means that the arm current value sent to the control system is only 90% of the actual value. Before 2 s, the circulating currents are well mitigated. After that, undesired circulating currents appear. Different from the second-order negative-sequence circulating currents, these currents exhibit zero-sequence characteristics with fundamental frequency. These fundamental frequency oscillating currents also appear on the MMC DC-side, as shown in 
Figure 3b. The average SM capacitor voltages of six arms are shown in 
Figure 3c, which are also greatly distorted. Simulation results for the 
 fault case are shown in 
Figure 4. Severe fault cases induce instability in the MMC. The 
 and 
 cases simulation results are similar and are not shown here due to page limit.
- 2.
- Amplitude Deviation Faults 
The amplitude deviation fault can be characterized as
      where 
 is the amplitude deviation factor, and 
 denotes the arm current amplitude under rated system operating cases. 
 is a constant term; the amplitude deviation fault manifests as a DC component deviation between measured and actual values, with fault severity directly proportional to the absolute value of 
.
When amplitude deviation faults occur, with amplitude deviation faults occur, with 
 and 
; the simulation waveforms illustrating the fault impacts are shown in 
Figure 5 and 
Figure 6.
In 
Figure 5, a 
 arm current measurement fault occurs at 2 s. At this time, the arm current value sent to the control system is 30 A larger than the actual value. Unlike the constant gain fault, after the occurrence of an amplitude deviation fault, the three-phase circulating currents exhibit significant amplitude steps. Compared to the non-faulty state, the circulating current decreases in phase 
, while the amplitude of the circulating currents in phases 
 and 
 increases. The amplitude of the DC current also experiences an amplitude step, as shown in 
Figure 5b. As the 
 value increases, the three-phase circulating currents and DC current after the fault in 
Figure 6 exhibit larger amplitude steps compared with those in 
Figure 5, indicating that the impact of the fault on the MMC system becomes greater.
- 3.
- Phase Shift Faults 
The phase shift faults can be characterized as
      where 
 is the phase shift factor, ranging from 0 to 2, which means the phase shift angle spans from 0 to 2π—a full sinusoidal period.
Figure 7 and 
Figure 8 illustrate the circulating current and DC current waveforms under phase shift faults with 
 and 
 in the upper arm of phase 
, respectively. Following the fault occurrence, the three-phase circulating current waveforms exhibit significant fluctuations accompanied by phase shifts. Specifically, the circulating current of phases 
b and 
c becomes identical, while that of phase 
 is phase-shifted by 180 degrees relative to the other two phases. Under a phase shift fault, the DC current will exhibit a fundamental frequency oscillating current. Notably, the amplitude of the fundamental frequency oscillating current increases proportionally with larger phase shift angles.
 - 4.
- Stuck Faults 
The stuck fault refers to a case where, following the occurrence of a current measurement abnormality, the arm current remains fixed at the value recorded at the fault inception instant.
      where 
 is the stuck factor, ranging from −0.26 to 0.66. When 
, it indicates a stuck arm current measurement fault when the arm current reaches its maximum value; correspondingly, 
 signifies that the fault occurs at the minimum arm current.
This paper conducts simulation verification on the arm current under stuck fault cases at three characteristic operating points: the maximum value, median value, and minimum value, respectively. The corresponding simulation waveforms are shown in 
Figure 9, 
Figure 10 and 
Figure 11.
Similar to the above-mentioned three fault types, the stuck fault also induces severe fluctuations in both three-phase circulating currents and DC current.
In summary, all four types of faults can trigger pronounced oscillations in three-phase circulating currents. In severe fault cases, the MMC may fail, triggering system protection. To prevent these arm current measurement faults from compromising system stability, it is imperative to develop a rapid fault detection method.
  5. Simulation and Experimental Verification
In this section, the proposed fault detection method is verified through both simulation and experimental results.
  5.1. MRMS and Fault Detection
Figure 17 shows the MRMS value of the SM capacitor voltage observation residual. In this paper, the integration interval is set to 20 ms. Before 2 s, the MRMS value is almost zero. When an arm current measurement fault occurs, it quickly reaches a high level, making it a good fault indicator.
 To further illustrate the relationship between MRMS values of the residual and the degree of fault, 
Figure 18 displays MRMS values under a constant gain fault case with different 
. As the severity of the fault increases, the MRMS residual increases correspondingly. Moreover, negative deviation faults have a larger impact on the MMC and are more likely to cause instability. Based on 
Figure 18, a proper detection threshold 
 can be selected according to the desired arm current measurement fault detection sensitivity.
Figure 19 shows the MRMS value of the SM capacitor voltage observation residual under the amplitude deviation fault. Similar to the case of constant gain fault, when the system is under normal case 2 s before, the MRMS value nears 0. When an arm current measurement fault occurs at 2 s, the MRMS value rapidly increases, which can effectively detect the occurrence of the fault.
 Figure 20 displays the MRMS value under the amplitude deviation fault case with different 
. As the severity of the fault increases, the MRMS values also increase accordingly. Fault severity is positively correlated with the MRMS value; the higher the severity of the fault, the easier it is to detect it.
 Upon the occurrence of a phase shift fault, the MRMS value rapidly increases from near 0 to a stabilized plateau level. As shown in 
Figure 21a,b, the plateau value of the MRMS exhibits a positive correlation with the phase shift angle magnitude. This relationship is further validated in 
Figure 22, where the plateau value linearly increases with phase shift factor up to 
, accompanied by escalating fault severity. The MRMS value reaches its global maximum at 
. Beyond 
, the MRMS plateau value decreases progressively with increasing phase shift factor. At 
, the arm current completes a full-cycle displacement, resulting in phase realignment with the normal operating case. Consequently, the MRMS value converges to its baseline level observed at 
.
Figure 23 presents the MRMS values under different stuck fault cases. Consistent with the above-mentioned fault scenarios, the MRMS exhibits a significant surge immediately following the fault inception, enabling effective detection of the fault occurrence.
 Through 
Figure 24, the relationship between the MRMS value and 
 under the stuck fault case can be visually understood.
Figure 25, 
Figure 26 and 
Figure 27 are provided to explore the impact of one arm current measurement fault on the KF observation in other arms. 
Figure 25 illustrates the MRMS waveform when the normal current measurement in the upper arm of phase 
 and the lower arm of phase 
 experiences a constant gain fault with 
. Following the fault occurrence at 2 s, the MRMS value of the lower arm of phase 
 rises rapidly, whereas the MRMS value of the upper arm of phase 
 remains unchanged and near 0, indicating no interference from faults in the adjacent arm within the same phase.
 Figure 26 and 
Figure 27 present the MRMS waveforms under two experimental cases: (1) normal current measurement in the upper arm of phase 
 with a 
 constant gain fault in the upper arm of phase 
; and (2) normal current measurement in the upper arm of phase 
 with simultaneous 
 constant gain faults in both the lower arm of phase 
 and the upper arm of phase 
 at 2 s. Both conclusions from 
Figure 26 and 
Figure 27 confirm that neither adjacent arm measurement faults in the same phase nor arm measurement faults in different phases affect the KF observation. These results validate the independence of the proposed method.
 Based on the MRMS values observed across all the above-mentioned fault scenarios, the detection threshold must be selected to ensure the identification of all fault cases. Based on the analysis, a 4.5 V threshold is set. The fault occurrence is conclusively diagnosed when the MRMS value exceeds this threshold.
  5.2. Fault Control
Figure 28 demonstrates the circulating current waveforms under four fault scenarios before and after implementing the KCL-based control strategy. Prior to the fault occurrence, the three-phase circulating currents exhibit only a DC component. Following the fault initiation, harmonics emerge in the circulating currents, inducing waveform distortion and triggering MRMS values to increase. Upon exceeding the 4.5 V threshold, the KCL-based control is activated, which rapidly suppresses the harmonic components and restores the circulating currents to their normal state. These simulation results validate the effectiveness of both the fault detection methodology and the KCL control strategy.
   5.3. Discussion
To further verify the robustness of the proposed method, 
Figure 29 presents the MRMS values under active power changing, reactive power changing, and phase-to-ground fault cases. 
Figure 29a shows the MRMS waveform under the power changing case, where at 2 s, the system’s reactive current changes from 0 A to 200 A. 
Figure 29b shows the MRMS waveform under the active power changing case, where at 2 s, the DC load steps. 
Figure 29c shows the MRMS waveform under the phase-to-ground fault case, where at 2 s, a single phase-to-ground fault occurs on phase 
. In all cases, when a fault occurs, the MRMS value only experiences small fluctuations and does not affect the detection result.
In addition, the impact of measurement noise is also analyzed. According to the national standards for DC current measurement devices in China [
33], these devices have multiple standard accuracy levels. In this study, a typical DC current measurement device with an accuracy level of 0.1 (highest) is considered. Therefore, a 0.1% measurement noise during the measurement process is introduced. 
Figure 30 shows the MRMS waveforms under both conditions: without measurement noise and with measurement noise. When no measurement noise is added, the MRMS without faults is approximately 0.4 V. After adding measurement noise, the MRMS value remains close to that of the case without measurement noise, which confirms the robustness against noise.
  5.4. Experimental Results
Figure 31 illustrates the designed experimental platform. The MMC is powered by a voltage source and drives RL loads. The typical 
dq frame-based circulating current control method is employed.
 Figure 32, 
Figure 33, 
Figure 34 and 
Figure 35 illustrate the fault impacts of four arm current measurement faults on the DC current, upper and lower arm currents of the 
 phase, and circulating currents, as well as the experimental waveforms of fault detection and KCL control. 
Figure 32a,b show the fault impact, fault detection, and control experimental waveforms at 
, respectively. Experiments demonstrate that the proposed method can detect measurement faults within 10 ms. When the MRMS reaches the threshold, KCL control is activated, and the circulating current recovers to normal. 
Figure 33a,b present the fault impact and fault detection experimental waveforms under an amplitude deviation fault of 
. After the fault occurs, the DC current and circulating current fluctuate. After engaging fault detection and control, the fault is mitigated, and the system restores stable operation within approximately 5 ms.
 In 
Figure 34 and 
Figure 35, when arm measurement occurs in phase shift faults and stuck faults, although both faults have more severe impacts on the DC current and circulating current, the proposed strategy can still stabilize the MMC system within approximately 7 ms after introducing fault control.
These experimental results validate the effectiveness of the proposed KF-based fault detection method and KCL control method.
  5.5. Discussion of Limitations
For the simulation system:
First, the simulation platform operates with discrete time steps, which limits its ability to fully replicate the continuous dynamics of real MMC systems. Second, in the simulation, the Half-Bridge MMC module provided by Simulink is used, which is an equivalent model that ignores the switching characteristics.
For the experimental system:
First, the experimental platform is a low-voltage, low-power downscaled prototype, whereas practical MMC-HVDC systems typically operate at several hundred kilovolts. As such, the experimental results primarily provide qualitative validation. Moreover, in the laboratory setup, all necessary signals are directly routed to the DSP for processing, without accounting for the multi-level controller communication in actual systems.