Data-Based Predictive Control Based Voltage Control in Active Distribution Networks

Li, Qihan; Zhu, Yongqi; Tang, Zhiyuan; Liu, Youbo; Shi, Yang

doi:10.3390/electronics14214211


by Qihan Li ¹, Yongqi Zhu ²,*, Zhiyuan Tang ², Youbo Liu ² and Yang Shi ³

¹ College Aviation of Electronic and Electrical Engineering, Civil Aviation Flight University of China, Guanghan 618307, China

² College of Electrical Engineering, Sichuan University, Chengdu 610065, China

³ State Grid Ningbo Power Supply Company, Ningbo 315000, China

* Author to whom correspondence should be addressed.

Electronics 2025, 14(21), 4211; https://doi.org/10.3390/electronics14214211

Submission received: 29 September 2025 / Revised: 18 October 2025 / Accepted: 21 October 2025 / Published: 28 October 2025

(This article belongs to the Topic Advances in Power Science and Technology, 2nd Edition)


Abstract

The increasing integration of distributed renewable energy sources into distribution networks results in significant voltage regulation challenges. To address these challenges, we introduce a novel data-driven approach for voltage regulation that utilizes predictive control mechanisms, specifically data-enabled predictive control (DeePC). This method exploits the capabilities of photovoltaic (PV) inverters and battery energy storage systems (BESS) to manage bus voltages within the distribution network. Unlike traditional model-based approaches that require a precise physical model of the network, the DeePC algorithm operates optimally by relying solely on historical data to predict and adjust bus voltages. By employing the DeePC algorithm, the proposed controller maintains voltage profiles and the state of charge (SoC) of BESSs within operational thresholds in an optimal and robust manner. To further reduce the computational complexity, a reformulation of DeePC is developed using scoring functions, where the DeePC algorithm is efficiently approximated via differentiable convex programming. We validate our approach through simulations on the IEEE 34-bus test system, demonstrating its efficiency in maintaining desired voltage levels without the need for a detailed physical system model.

Keywords: energy storage systems; distribution networks; data-driven voltage control; distributed generation

1. Introduction

The increasing integration of distributed generation (DG) into distribution networks (DNs) has introduced significant challenges related to voltage regulation, particularly due to the reverse power flow caused by the high penetration of PV systems. These challenges primarily manifest as voltage fluctuations and voltage violations, complicating the operational management of DNs [1]. To mitigate these challenges, voltage regulation strategies have drawn increasing attention, driven by elevated levels of PV penetration. Various approaches have been explored, including PV generation curtailment [2], inverter-based control strategies [3,4], and the integration of an energy storage system (ESS) [5]. In recent years, BESSs have emerged as a promising solution to enhance voltage regulation in distribution networks, offering a flexible and efficient approach to addressing voltage fluctuations [6]. Given the variability associated with distributed energy resources (DERs), effectively coordinating systems like PV and BESS to deliver precise control actions poses a significant challenge in voltage management.

In the literature, various strategies have been proposed to mitigate the voltage impacts of PV generation. In [7], a charging/discharging strategy is designed for ESS integrated with PV generation. A droop-based control approach that combines ESS with the reactive capacity of PV generation to improve voltage profiles is presented in [8]. In [6], a distributed control scheme leveraging PV generation is introduced. A combination of localized and distributed algorithms for regulating voltage magnitudes within a specified range while minimizing the output of controllable devices is detailed in [9]. The weighted regulation method in [10] adjusts the charge/discharge rates of BESS based on their capacities. In [6], a coordinated control strategy for BESS is proposed, integrating distributed and localized control methodologies.

Although the control strategies discussed above facilitate voltage regulation effectively, their performance depends on a comprehensive network model with complete physical information. This reliance can be problematic, as obtaining and maintaining such detailed models is challenging, especially in dynamic environments with high renewable energy penetration. Consequently, the effectiveness of model-based design strategies is substantially undermined. As reported in [11], discrepancies between the actual system and the design model can significantly degrade control performance.

To address these challenges, data-driven methods have emerged as promising approaches for designing voltage control strategies in DNs. These methods are broadly categorized into two paradigms: direct data-driven control and machine learning-based control. Recent advances in machine learning have introduced innovative solutions to traditional voltage regulation problems [12]. For example, Reference [13] employs a Monte Carlo tree search-based reinforcement learning (MCTS-RL) framework to coordinate PV generation and BESS for voltage regulation. Similarly, Reference [14] proposes a deep deterministic policy gradient (DDPG)-based online scheduling algorithm to optimize PV-BESS dispatch by learning operational patterns of PV generation and grid conditions. To enhance adaptability during unexpected topology changes, Reference [15] introduces a multi-task soft actor-critic (MT-SAC) deep reinforcement learning (DRL) approach for voltage regulation via PV control. Further advancing this field, Reference [16] develops a physics-model-free dual-time-scale voltage control framework, where the scheduling of PV generation across multiple sub-networks is formulated as a Markov game and solved using a multi-agent soft actor-critic (MASAC) algorithm. In this approach, each sub-network operates as an autonomous intelligent agent.

However, machine learning-based control methods inherently depend on extensive offline datasets for initial model training. These approaches often face challenges in computational efficiency, prolonged training durations, and limited adaptability to evolving system conditions. A critical limitation arises when network topology changes render existing models obsolete, thereby requiring extensive data recollections and computationally intensive retraining processes. Such procedures introduce significant time delays and operational costs while compromising control performance in dynamic environments [12,17,18].

In contrast, direct data-driven methods rooted in behavioral system theory—such as data-enabled predictive control (DeePC)—eliminate the need for system identification by directly synthesizing control strategies from limited historical data. DeePC operates by constructing a nonparametric model using measured system trajectories, contingent on the input signals satisfying persistent excitation criteria. While conventional DeePC frameworks bypass parametric modeling and derive optimal control policies from raw datasets, their reliance on large-scale data introduces a critical limitation: computational complexity scales exponentially with the dimensionality of input–output data, posing challenges for high-dimensional or real-time systems [19,20,21]. To mitigate this issue, we propose a reformulated DeePC framework that integrates scoring functions and employs differentiable convex programming to approximate the original optimization problem efficiently.

In this study, we propose a novel control strategy for BESSs aimed at voltage regulation in distribution networks with high PV penetration. This strategy is based on a direct data-driven algorithm. The principal contributions of this article are as follows:

1. The proposed control method is entirely data-driven and requires no specific physical parameters. Instead, it constructs a nonparametric model of the system from historical data. By continuously updating the input–output data, the method achieves accurate predictions of future input–output behavior.
2. Based on scoring functions, we propose a reformulation of DeePC that offers a new perspective on data-enabled approaches. The scoring functions are parameterized as a differentiable convex program, enabling efficient approximation and broadening the applicability of DeePC.
3. The proposed BESS control strategy regulates voltage in distribution networks with high levels of PV penetration. It keeps the voltage at each bus within permissible limits, preventing overvoltage or undervoltage conditions that could compromise system stability.
4. The IEEE 34-bus test system is used to demonstrate the effectiveness of the proposed data-driven control scheme, whose control performance is comparable with that of the model-based scheme.

Compared to existing data-driven methods such as reinforcement learning, the proposed DeePC-based approach does not require extensive offline training or retraining when system conditions change. Unlike model-based methods, it eliminates the need for accurate network parameters. Furthermore, the introduction of a differentiable convex programming-based approximation significantly reduces online computational burden, making it more suitable for real-time voltage control in large-scale distribution networks.

This paper extends our previous study [22] in three aspects. First, it extends the data-driven control approach originally proposed for small networked microgrid (NMG) systems in [22] to large DNs. Second, in contrast to [22], this work not only regulates PV generation but also incorporates BESSs for voltage regulation. Third, this work employs a reformulation of DeePC instead of the general version of DeePC presented in [22].

The rest of the article is organized as follows: Section 2 provides the system model. Section 3 presents an overview of the DeePC algorithm and a reformulation of DeePC. In Section 4, a detailed discussion on the DeePC-based voltage control scheme is provided. The case studies are presented in Section 5, followed by the conclusions presented in Section 6.

2. Network Description

In this section, we introduce the DN model used to implement the DeePC method. Consider a DN consisting of N + 1 buses, represented by the set N_a := {0, 1, …, N}, with distribution lines defined by the set E_a := {(s, t)} ⊂ N_a × N_a. A typical example is illustrated in Figure 1. In the DN model, the bus indexed as zero, known as the point of common coupling (PCC) and typically located at the distribution substation, serves as the voltage reference point. Each bus is equipped with a PV for local power generation. The voltage magnitude, active power injection, and reactive power injection at bus s are denoted by V_s, P_s, and Q_s, respectively, measured in per unit (p.u.); stacking them over the buses gives the vectors V, P, Q ∈ ℝⁿ. For each line (s, t) ∈ E_a, R_st and X_st denote the line resistance and reactance, while P_st and Q_st denote the active and reactive power flowing from bus s to bus t, respectively. The DistFlow equations [17], used to model power injection and voltage for every line (s, t) ∈ E_a, are given as follows:

\begin{array}{l} P_{s t} - \sum_{k : (t, k) \in E_{a}} P_{t k} = - P_{t} + R_{s t} \frac{P_{s t}^{2} + Q_{s t}^{2}}{V_{s}^{2}}, t = 1, \dots, N \\ Q_{s t} - \sum_{k : (t, k) \in E_{a}} Q_{t k} = - Q_{t} + X_{s t} \frac{P_{s t}^{2} + Q_{s t}^{2}}{V_{s}^{2}}, t = 1, \dots, N \\ V_{s}^{2} - V_{t}^{2} = 2 (R_{s t} P_{s t} + X_{s t} Q_{s t}) - (R_{s t}^{2} + X_{s t}^{2}) \frac{P_{s t}^{2} + Q_{s t}^{2}}{V_{s}^{2}}, (s, t) \in E_{a} \end{array}

(1)

The DistFlow equations can be linearized as follows by assuming that (a) line losses are negligible compared to the power flows and (b) the voltage profile is relatively flat, i.e., V_{s}^{2} - V_{t}^{2} \approx 2 (V_{s} - V_{t}) [17]:

\begin{array}{l} P_{s t} - \sum_{k : (t, k) \in E_{a}} P_{t k} = - P_{t}, t = 1, \dots N \\ Q_{s t} - \sum_{k : (t, k) \in E_{a}} Q_{t k} = - Q_{t}, t = 1, \dots N \\ V_{s} - V_{t} = R_{s t} P_{s t} + X_{s t} Q_{s t}, (s, t) \in E_{a} \end{array}

(2)

which can be reformulated in a compact form as:

V = N^{*} P + M^{*} Q + V^{0} 1_{n}

(3)

where the reference voltage V⁰ is fixed at unity (1 p.u.), and the matrices N*, M* ∈ 𝕊ⁿ (symmetric n × n matrices) are sensitivity matrices derived from the line parameters:

\begin{array}{l} N_{s t}^{*} : = \sum_{(i, k) \in P_{s} \cap P_{t}} R_{i k}, \\ M_{s t}^{*} : = \sum_{(i, k) \in P_{s} \cap P_{t}} X_{i k}, \end{array}

(4)

where both N* and M* are positive definite (PD) matrices [23]. The total injected active power is decomposed into two parts as P = P^c + P^e, where P^{c} = (p_{1}^{c}, \dots, p_{n}^{c}) represents the controllable component (e.g., PV generation, BESS) and P^{e} = (p_{1}^{e}, \dots, p_{n}^{e}) denotes the exogenous component (e.g., uncontrollable loads). The total injected reactive power is decomposed in the same way as Q = Q^c + Q^e. Defining \bar{V} ≜ N* P^e + M* Q^e + V⁰ 1_n ∈ ℝⁿ yields:

V = N^{*} P^{c} + M^{*} Q^{c} + \bar{V} .

(5)
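The construction of the sensitivity matrices in (4) and the linear voltage map in (3) can be illustrated with a minimal sketch. It assumes, as is common for this linearized model, that P_s in (4) denotes the set of lines on the path from the PCC to bus s. The 4-bus feeder, its line parameters, and the helper name `sensitivity_matrices` are hypothetical and chosen only for illustration.

```python
import numpy as np

def sensitivity_matrices(parent, r, x):
    """Build N* and M* as in (4) for a radial feeder.

    parent[i] is the upstream bus of bus i+1 (bus 0 is the PCC);
    r[i], x[i] are the resistance and reactance of the line feeding bus i+1.
    """
    n = len(parent)

    def path(bus):
        # Lines (identified by their downstream bus) between the PCC and `bus`.
        lines = set()
        while bus != 0:
            lines.add(bus)
            bus = parent[bus - 1]
        return lines

    paths = [path(b) for b in range(1, n + 1)]
    N = np.zeros((n, n))
    M = np.zeros((n, n))
    for s in range(n):
        for t in range(n):
            shared = paths[s] & paths[t]      # lines common to both paths
            N[s, t] = sum(r[k - 1] for k in shared)
            M[s, t] = sum(x[k - 1] for k in shared)
    return N, M

# Hypothetical feeder with lines 0-1, 1-2 and 1-3 (made-up p.u. values).
parent = [0, 1, 1]
r = [0.01, 0.02, 0.03]
x = [0.02, 0.04, 0.06]
N_star, M_star = sensitivity_matrices(parent, r, x)

P = np.array([0.1, -0.2, 0.3])      # net active injections (illustrative)
Q = np.array([0.0, 0.05, -0.05])    # net reactive injections (illustrative)
V = N_star @ P + M_star @ Q + 1.0 * np.ones(3)   # equation (3) with V0 = 1 p.u.
```

In this toy case both matrices come out symmetric and positive definite, in line with the property cited from [23].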

The objective of voltage control in DNs is to maintain voltage levels within acceptable limits by regulating both the reactive and active power provided by the control resources, given the voltage condition \bar{V}. The voltage control problem can therefore be expressed as

V (t + 1) = N^{*} P^{c} (t) + M^{*} Q^{c} (t) + \bar{V} .

(6)

Defining ω^e(t) = \bar{V}(t) − \bar{V}(t − 1), we obtain the following control system:

V (t + 1) = V (t) + N^{*} P^{c} (t) + M^{*} Q^{c} (t) + ω^{e} (t) .

(7)

Within this framework, the control objective at time t is to keep bus voltages within a safe operational range by adjusting the active and reactive power injections P^c and Q^c of the controllable resources. When the sampling interval Δt is sufficiently short (e.g., minute-scale), voltage deviations caused by exogenous factors such as uncontrollable generation or load variations can be approximated as quasi-static, i.e., ω^e(t) ≈ 0. This assumption holds because slowly varying exogenous disturbances change negligibly over short time horizons.

3. Proposed Control Scheme

Based on the linearized DistFlow model in (7), the control problem is formulated as follows. The objective is to minimize the control effort of the PV systems and BESSs while satisfying the system model introduced in Section 2, the control limits, and the voltage limits:

\min \sum \frac{1}{2} ({‖V - μ‖}_{Ξ}^{2} + {‖P^{g}‖}_{R}^{2} + {‖Q^{g}‖}_{Ψ}^{2} + κ ({‖ε‖}_{2}^{2} + {‖δ‖}_{2}^{2}))

(8a)

subject to

V^{\min} - ε \leq V \leq V^{\max} + δ

(8b)

Q_{g}^{\min} \leq Q^{g} \leq Q_{g}^{\max}

(8c)

Q_{g}^{\min} = - Q_{g}^{\max}

(8d)

Q_{g}^{\max} = \sqrt{S^{2} - P_{v}^{2}}

(8e)

- P_{b a t}^{m a x} \leq P^{g} \leq P_{d i s}^{m a x}

(8f)

P^{g} = P_{d i s} - P_{b a t}

(8g)

ρ_{\min} S o C_{\max} \leq S o C (t) \leq ρ_{\max} S o C_{\max}

(8h)

S o C (t) = S o C (t - 1) - (\frac{P_{d i s}}{χ} - χ P_{b a t}) Δ t

(8i)

S o C (0) = S o C^{0}

(8j)

V (t + 1) = V (t) + N^{*} P^{c} (t) + M^{*} Q^{c} (t) + ω^{e} (t)

(8k)

where Q^g and P^g represent the reactive and active power generated by the PV systems and BESSs, respectively, and μ denotes the voltage set-points. The matrices Ξ, R and Ψ are positive definite. The vectors ε and δ serve as slack variables that soften the hard voltage constraints, with V^min and V^max defining the voltage limits. A sufficiently large penalty parameter κ is chosen to enforce constraint satisfaction in the optimization problem. The reactive power output of the PV generation is determined by its rated capacity S and the corresponding active power P_v. The SoC at a given time instant t, denoted as SoC(t), is defined as the ratio of the current available capacity to the maximum allowable capacity of the BESS under the same conditions. SoC_max indicates the maximum energy storage capacity of the BESSs, while P_{bat}^{\max} and P_{dis}^{\max} denote the maximum charging and discharging power, respectively. The per-unit limits ρ_min and ρ_max represent the minimum and maximum allowable SoC levels. The time step is represented by Δt, while χ accounts for the efficiency of the BESSs.
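The device-level constraints (8e), (8h) and (8i) can be sketched as three small helper functions. The function names, the numerical values, and the clamping of the square-root argument at zero are illustrative assumptions and are not taken from the paper.

```python
import math

def soc_update(soc_prev, p_dis, p_bat, chi, dt):
    """SoC recursion (8i): discharging drains p_dis/chi, charging stores chi*p_bat."""
    return soc_prev - (p_dis / chi - chi * p_bat) * dt

def reactive_limit(s_rated, p_v):
    """Inverter reactive capability (8e): Q_max = sqrt(S^2 - P_v^2).

    The argument is clamped at zero (an assumption) to guard against P_v > S.
    """
    return math.sqrt(max(s_rated**2 - p_v**2, 0.0))

def soc_within_limits(soc, soc_max, rho_min, rho_max):
    """Check the SoC band (8h): rho_min*SoC_max <= SoC <= rho_max*SoC_max."""
    return rho_min * soc_max <= soc <= rho_max * soc_max

# Illustrative numbers only.
soc_after_discharge = soc_update(0.5, p_dis=0.1, p_bat=0.0, chi=0.95, dt=1.0)
soc_after_charge = soc_update(0.5, p_dis=0.0, p_bat=0.1, chi=0.95, dt=1.0)
q_max = reactive_limit(1.0, 0.6)
```

With an efficiency below one, discharging removes more energy from the battery than is delivered to the grid, and charging stores less than is drawn, which is what the asymmetric term in (8i) encodes.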

Given the challenges in accurately obtaining the topology and line parameters of the DNs—i.e., the sensitivity matrices N* and M* in (4) are often unknown in practice—it becomes difficult to directly address the voltage control problem using model-based approaches that rely on precise physical representations. Consequently, it is essential to explore alternative methods for constructing a voltage control model based on system input–output data when N* and M* are unavailable. To address these challenges, the DeePC control framework is introduced in subsequent sections. This framework enables voltage control and optimization in DNs without requiring detailed physical models. Specifically, the DeePC algorithm is formulated to solve the control problem (8a)–(8k) while bypassing the need for precise network parameters.

The application of DeePC in this work is based on the following criteria: (i) the unavailability or inaccuracy of physical network models; (ii) the presence of time-varying dynamics due to high renewable penetration; (iii) the availability of sufficient historical input–output data; (iv) the ability to ensure persistent excitation through controlled perturbations; and (v) the need for real-time, scalable control. These factors make DeePC a suitable candidate for data-driven voltage regulation in active distribution networks. Furthermore, the proposed approximation via differentiable convex programming enables efficient online implementation, addressing the scalability challenge of standard DeePC in large-scale systems.

4. Overview of DeePC Algorithm

This section shows that the dynamics of a linear system can be learned directly from measured trajectories, without an explicit system model.

4.1. Data-Driven System Representation

Consider a controllable and observable linear time-invariant (LTI) system ℬ:

\begin{array}{rl} x (t + 1) & = A x (t) + B u (t) + Z_{d} d (t) \\ y (t) & = C x (t) + D u (t) \end{array}

(9)

where x(t) ∈ ℝⁿ, u(t) ∈ ℝ^m, y(t) ∈ ℝ^p and d(t) ∈ ℝ^q represent the state, input, output, and external input vectors, respectively, at discrete time steps t ∈ ℤ_{>0}. In this study, the system state and output are defined as voltage measurements, i.e., x(t) = y(t) = V(t), while u(t) represents the changes in active and reactive power injection. The term d(t) = ω^e(t) accounts for the voltage fluctuations resulting from the unregulated operation of loads and PV generation. The system matrices are specified as A = C = Z_d = I, B = [N* M*] and D = 0.

As previously discussed, solving the control problem (8a)–(8k) presents a significant challenge due to the lack of sufficiently accurate physical parameters for active distribution networks (ADNs). Specifically, matrix B (see (9)), which depends on the physical model, is unknown, making it impractical to implement optimal voltage control directly with traditional model-based approaches. Unlike conventional system theory, which requires explicit system parameterization, behavioral system theory relies solely on measured input–output data to construct a data-driven representation of the system's signal space. The challenge can therefore be addressed by formulating a data-driven model from raw measurement data, circumventing the need for an explicit physical model.

In the data-driven approach, we impose the following assumption on d(t) = ω^e(t): the signal is predictable and satisfies d(t) = d(t + 1), i.e., the exogenous voltage variation does not change between successive time steps. Combined with the quasi-static approximation of Section 2, this means that the uncontrollable voltage \bar{V} remains constant between successive time steps. The assumption is reasonable when the sampling interval is sufficiently short, so that fluctuations of the uncontrollable voltage sources are minimal. Based on the above assumption, we derive the following system model:

y (t + 1) = y (t) + B u (t)

(10)

Since the parametric matrix B in (10) is unknown, behavioral system theory is used to replace the traditional LTI model with a nonparametric data-driven representation [18]. Let u = col(u₀, u₁, …, u_{T−1}) denote a persistently exciting input trajectory of length T. The Hankel matrix, a cornerstone of behavioral theory, arranges these data into a structured matrix. For a user-defined horizon L ∈ ℤ_{>0}, the input Hankel matrix is constructed as:

H_{L} (u) = \begin{bmatrix} u (0) & u (1) & \dots & u (T - L) \\ \vdots & \vdots & \ddots & \vdots \\ u (L - 1) & u (L) & \dots & u (T - 1) \end{bmatrix}

(11)

where each column of the input Hankel matrix H_L(u^d) ∈ ℝ^{mL×(T−L+1)} is a length-L subsequence of the persistently exciting input trajectory u^d = col(u₀, u₁, …, u_{T−1}). The output Hankel matrix H_{L}(y^{d}) is defined analogously from the measured outputs y^d = col(y₀, y₁, …, y_{T−1}). Let T_ini, T_f ∈ ℤ_{>0} denote the past and future horizons, respectively, with L = T_ini + T_f. The Hankel matrices are partitioned into past and future components:

H_{T_{i n i} + T_{f}} (u^{d}) : = (\begin{matrix} U_{p} \\ U_{f} \end{matrix}), H_{T_{i n i} + T_{f}} (y^{d}) : = (\begin{matrix} Y_{p} \\ Y_{f} \end{matrix}) .

(12)

Here, U_p and Y_p represent the past components of the Hankel matrices, corresponding to the first T_ini time steps, while U_f and Y_f denote the future components associated with the next T_f time steps. If the Hankel matrix H_{L}(u^{d}) has full row rank, i.e., rank(H_{L}(u^{d})) = mL, then the input sequence u^d is said to be persistently exciting of order L. For H_{L}(u^{d}) to have full row rank, the length T of the collected trajectory must satisfy the lower bound T ≥ (m + 1)L − 1 [24]. Assume that the precollected input u^d is persistently exciting of order T_ini + T_f + n(ℬ), where n(ℬ) denotes the order of the system ℬ (the state dimension of a minimal realization). Under this condition, the Hankel matrices span the full input–output behavior space of the LTI system ℬ, and Willems' fundamental lemma [24] guarantees that any valid trajectory of length T_ini + T_f can be expressed as a linear combination of the Hankel matrix columns. Given the current time t, any trajectory col(u_ini, y_ini, u, y) generated by the system therefore admits a vector g ∈ ℝ^{T−L+1} such that [25]:

\begin{pmatrix} U_{p} \\ Y_{p} \\ U_{f} \\ Y_{f} \end{pmatrix} g = \begin{pmatrix} u_{ini} \\ y_{ini} \\ u \\ y \end{pmatrix} \Rightarrow 0 = {‖(I - \underbrace{{\begin{pmatrix} U_{p} \\ Y_{p} \\ U_{f} \end{pmatrix}}^{†} \begin{pmatrix} U_{p} \\ Y_{p} \\ U_{f} \end{pmatrix}}_{:= M}) g‖}_{p} .

(13)

This relationship forms the foundation of the DeePC framework, in which the future behavior of the system is predicted from historical input–output data without an explicit system model. The vector g serves as the decision variable that combines the precollected data, while u and y represent the predicted control inputs and outputs, respectively. The matrix [U_p^⊤ Y_p^⊤ U_f^⊤ Y_f^⊤]^⊤, constructed from historical input–output data, keeps the optimization purely data-driven. The trajectory w_ini = col(u_ini, y_ini) is formed from the latest input–output measurements and is used to estimate the system's initial condition in real time. Solving y_ini = Y_p g and u_ini = U_p g in (13) determines the decision variable g, which in turn gives the predicted output via y = Y_f g. Figure 2 provides a simplified schematic of the DeePC formulation in (13). It illustrates how the vector g selects elements from the historical trajectory library [U_p^⊤ Y_p^⊤ U_f^⊤ Y_f^⊤]^⊤ so that they are consistent with both the initial and the future trajectories col(u_ini, y_ini, u, y). The DeePC-based optimization problem can be formulated as follows:

\begin{array}{l} \min_{g, u, d, y} \sum_{t = 0}^{T_{f} - 1} ({‖y (t) - r (t + s)‖}_{Ξ}^{2} + {‖u (t)‖}_{R}^{2}) \\ s . t . (\begin{matrix} U_{p} \\ Y_{p} \\ U_{f} \\ Y_{f} \end{matrix}) g = (\begin{matrix} u_{i n i} \\ y_{i n i} \\ u \\ y \end{matrix}) \end{array}

(14)

where the Hankel matrices H_{T_{ini} + T_{f}}(u^{d}) and H_{T_{ini} + T_{f}}(y^{d}) are constructed from pre-collected input–output trajectories w^d = col(u^d, y^d) ∈ ℬ_T, where ℬ_T denotes the system's behavior over T time steps. The vector w_ini = col(u_ini, y_ini) ∈ ℬ_{T_{ini}} represents the most recent T_ini input–output measurements, while u and y denote the future control inputs and predicted outputs over the horizon T_f, with u(t) ∈ ℝ^m and y(t) ∈ ℝⁿ at each step. Let r = (r_{0}, r_{1}, \dots, r_{T_{f} - 1}) define the reference voltage trajectory. The quadratic cost function is weighted by Ξ ∈ ℝ^{n×n} (output tracking error) and R ∈ ℝ^{m×m} (control effort).
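The data-driven prediction behind the equality constraint of (14) can be illustrated with a minimal sketch. It assumes a two-bus toy system of the form (10) whose matrix B is used only to generate data (the predictor never reads it), noise-free measurements, and random inputs as the persistently exciting signal. The helper `hankel` and all numerical values are hypothetical.

```python
import numpy as np

def hankel(w, L):
    """Block Hankel matrix of depth L from a trajectory w of shape (T, dim)."""
    T = w.shape[0]
    return np.column_stack([w[j:j + L].reshape(-1) for j in range(T - L + 1)])

rng = np.random.default_rng(0)
n = m = 2                                     # toy numbers of outputs and inputs
B = np.array([[0.02, 0.01], [0.01, 0.03]])    # data generator only, unknown to the predictor
T, T_ini, T_f = 40, 2, 3
L = T_ini + T_f

# Offline data from y(t+1) = y(t) + B u(t) with random excitation.
u_d = rng.standard_normal((T, m))
y_d = np.ones((T, n))
for t in range(T - 1):
    y_d[t + 1] = y_d[t] + B @ u_d[t]

H_u, H_y = hankel(u_d, L), hankel(y_d, L)
U_p, U_f = H_u[:m * T_ini], H_u[m * T_ini:]
Y_p, Y_f = H_y[:n * T_ini], H_y[n * T_ini:]

# Persistency of excitation of order L + n: the deeper Hankel matrix has full row rank.
pe_rank = np.linalg.matrix_rank(hankel(u_d, L + n))

# Online: a fresh trajectory from a different initial voltage.
u_new = rng.standard_normal((L, m))
y_new = np.zeros((L, n))
y_new[0] = 0.98
for t in range(L - 1):
    y_new[t + 1] = y_new[t] + B @ u_new[t]
u_ini, y_ini = u_new[:T_ini].reshape(-1), y_new[:T_ini].reshape(-1)
u_fut, y_true = u_new[T_ini:].reshape(-1), y_new[T_ini:].reshape(-1)

# Find g consistent with the past data and the planned inputs, then predict y = Y_f g.
lhs = np.vstack([U_p, Y_p, U_f])
rhs = np.concatenate([u_ini, y_ini, u_fut])
g, *_ = np.linalg.lstsq(lhs, rhs, rcond=1e-10)
y_pred = Y_f @ g
```

For this noise-free linear example the prediction matches the simulated outputs to numerical precision. With noisy measurements the equality generally cannot be met exactly, which motivates the slack variables introduced next.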

Considering the presence of uncertainties and noise, we introduce slack variables σ_{u_{ini}} \in ℝ^{T_{ini} m} and σ_{y_{ini}} \in ℝ^{T_{ini} n} to relax the corresponding constraints [25,26,27], ensuring feasibility and robustness. The modified DeePC-based optimization problem, incorporating these slack variables, is formulated as follows:

\begin{array}{l} \min_{\begin{array}{l} g, u, d, y, \\ σ_{y_{i n i}}, σ_{u_{i n i}} \end{array}} \sum_{t = 0}^{T_{f} - 1} ({‖y (t) - r (t + s)‖}_{Ξ}^{2} + {‖u (t)‖}_{R}^{2}) \\ + λ_{y_{i n i}} {‖σ_{y_{i n i}}‖}_{2}^{2} + λ_{u_{i n i}} {‖σ_{u_{i n i}}‖}_{2}^{2} + λ_{g} \cdot h (g) \\ s . t . (\begin{matrix} U_{p} \\ Y_{p} \\ U_{f} \\ Y_{f} \end{matrix}) g = (\begin{matrix} u_{i n i} + σ_{u_{i n i}} \\ y_{i n i} + σ_{y_{i n i}} \\ u \\ y \end{matrix}) \end{array}

(15)

where the regularization term h(g) = {‖g‖}_{2}^{2} is included to mitigate the effects of noise and data inaccuracies, thereby enhancing the robustness of the control strategy, including when the data come from a system that is not exactly linear. The regularization parameters λ_{u_{ini}}, λ_{y_{ini}}, λ_{g} ∈ ℝ_{>0} weight the slack variables and the regularization term, promoting a smooth and stable input trajectory.
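When no inequality constraints are active, problem (15) reduces to a regularized least-squares problem in g, which the following sketch solves in closed form. It assumes scalar weights (Ξ = xi_w·I, R = r_w·I), a constant reference, and the same two-bus toy data generator as a stand-in for real measurements. The function `deepc_step` and all parameter values are hypothetical and are not the implementation used in the paper.

```python
import numpy as np

def hankel(w, L):
    """Block Hankel matrix of depth L from a trajectory w of shape (T, dim)."""
    T = w.shape[0]
    return np.column_stack([w[j:j + L].reshape(-1) for j in range(T - L + 1)])

def deepc_step(U_p, Y_p, U_f, Y_f, u_ini, y_ini, r, xi_w, r_w, lam_u, lam_y, lam_g):
    """Unconstrained version of (15) with h(g) = ||g||^2.

    Substituting u = U_f g, y = Y_f g, sigma_u = U_p g - u_ini and
    sigma_y = Y_p g - y_ini leaves a least-squares problem in g alone.
    """
    n_g = U_p.shape[1]
    A = np.vstack([np.sqrt(xi_w) * Y_f, np.sqrt(r_w) * U_f,
                   np.sqrt(lam_y) * Y_p, np.sqrt(lam_u) * U_p,
                   np.sqrt(lam_g) * np.eye(n_g)])
    b = np.concatenate([np.sqrt(xi_w) * r, np.zeros(U_f.shape[0]),
                        np.sqrt(lam_y) * y_ini, np.sqrt(lam_u) * u_ini,
                        np.zeros(n_g)])
    g, *_ = np.linalg.lstsq(A, b, rcond=None)
    return U_f @ g, Y_f @ g, g

rng = np.random.default_rng(1)
n = m = 2
B = np.array([[0.02, 0.01], [0.01, 0.03]])   # toy data generator, not used by the controller
T, T_ini, T_f = 40, 2, 3
L = T_ini + T_f
u_d = rng.standard_normal((T, m))
y_d = np.ones((T, n))
for t in range(T - 1):
    y_d[t + 1] = y_d[t] + B @ u_d[t]
H_u, H_y = hankel(u_d, L), hankel(y_d, L)
U_p, U_f = H_u[:m * T_ini], H_u[m * T_ini:]
Y_p, Y_f = H_y[:n * T_ini], H_y[n * T_ini:]

# Recent measurements: both voltages at 0.95 p.u. with no control action applied.
u_ini = np.zeros(m * T_ini)
y_ini = 0.95 * np.ones(n * T_ini)
r = np.ones(n * T_f)                          # reference of 1.0 p.u. over the horizon
u_opt, y_pred, g = deepc_step(U_p, Y_p, U_f, Y_f, u_ini, y_ini, r,
                              xi_w=1.0, r_w=1e-6, lam_u=1e4, lam_y=1e4, lam_g=1e-8)
```

Voltage, power and SoC limits such as (8b)–(8h) would turn this into a constrained quadratic program that needs a QP solver; the closed form above only illustrates the structure of the cost.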

4.2. Approximation of DeePC

The DeePC framework constructs nonparametric system representations directly from input–output trajectories, bypassing explicit physical modeling while capturing dynamic system behavior through historical data. By embedding implicit operational patterns from measured trajectories, DeePC enhances control flexibility and robustness in uncertain grid environments. However, its computational complexity scales exponentially with the dimensionality of the data matrix M ∈ ℝ^{(m+p)L×(T−L+1)}, where L = T_ini + T_f is the total prediction horizon. This arises because the dimension of the decision variable g ∈ ℝ^{T−L+1} grows linearly with the dataset size T, rendering real-time optimization intractable for large-scale distribution networks.

To address this limitation, we propose a learning-based approximate DeePC framework that replaces the full-data optimization with a parametric surrogate model [28]. This model is trained offline to approximate the optimal control policy using supervised learning on historical DeePC solutions, thereby reducing online computation to a low-dimensional function evaluation.

Consider the predicted I/O trajectory sequence τ = col(u, y), where u ∈ ℝ^{mL} and y ∈ ℝ^{pL}. To balance computational efficiency with control performance, we reformulate the DeePC optimization (15) by decoupling the objective into two components:

\min_{τ} A (τ) + Β (τ)

(16)

where the objective function is defined as follows:

\begin{array}{l} A (τ) = {‖y - r‖}_{Ξ}^{2} + {‖u‖}_{R}^{2} + {‖V - r‖}_{Ξ}^{2} + {‖P‖}_{R}^{2} + {‖Q‖}_{Ψ}^{2} \\ + \mathbb{I}_{\mathcal{U}^{L} \times \mathcal{Y}^{L}} (τ) + \mathbb{I}_{\{= τ_{ini}\}} (F_{ini} τ) \end{array}

(17)

\begin{array}{l} B (τ) = \min λ_{y_{ini}} {‖σ_{y_{ini}}‖}_{2}^{2} + λ_{u_{ini}} {‖σ_{u_{ini}}‖}_{2}^{2} + λ_{g} \cdot h (g) \\ \text{s.t.} \; \tilde{H} \begin{bmatrix} g \\ σ_{u_{ini}} \\ σ_{y_{ini}} \end{bmatrix} = τ, \end{array}

(18)

and, with a slight abuse of notation, the stacked reference and weights are r ≜ r ⊗ 1_L, R ≜ R ⊗ I_L, and Ξ ≜ Ξ ⊗ I_L. Here τ_ini ≜ col(u_ini, y_ini), and

F_{ini} ≜ \begin{bmatrix} I & 0 & 0 & 0 \\ 0 & 0 & I & 0 \end{bmatrix}

is a selection matrix that extracts the initial trajectory τ_ini from the full I/O sequence τ. The matrix

\tilde{H} ≜ \begin{bmatrix} U_{p} & - I & 0 \\ Y_{p} & 0 & - I \\ U_{f} & 0 & 0 \\ Y_{f} & 0 & 0 \end{bmatrix}

is the augmented Hankel matrix, which encodes the historical data constraints for predictive modeling. \mathbb{I} denotes the indicator function; in \mathbb{I}_{\mathcal{U}^{L} \times \mathcal{Y}^{L}}, the superscript L denotes the L-fold Cartesian power of the input and output constraint sets \mathcal{U} and \mathcal{Y}. By employing an approximate form \tilde{B}(τ) of the scoring function B(τ), we reformulate the optimization problem. The new formulation seeks to balance computational efficiency with control performance while ensuring robust trajectory predictions. Thus, the modified control problem is given by:

\min_{τ} A (τ) + \tilde{Β} (τ) .

(19)

The goal of the approximation is to ensure that the control input obtained from the learned problem \min_{τ} A(τ) + \tilde{B}(τ) closely matches the solution obtained with the true scoring function, \min_{τ} A(τ) + B(τ). To achieve this, we define the learning objective as minimizing the error between the proximal operators of the approximate scoring function \tilde{B}(τ) and the true scoring function B(τ). Figure 3 illustrates the overall framework, depicting how the approximation process refines control performance while maintaining computational efficiency.

The proximal operator [29] of \tilde{B}(τ) is defined as:

\mathrm{Prox}_{\tilde{B}} (τ) = \underset{\hat{τ}}{\arg \min} \; \tilde{B} (\hat{τ}) + \frac{1}{2} {‖\hat{τ} - τ‖}^{2} .

(20)

The optimal solution {\hat{τ}}^{*} of (20) can equivalently be obtained with operator splitting techniques, by working with the proximal operators of A and of the scoring function \tilde{B}. Given an appropriate formulation of these operators, {\hat{τ}}^{*} can be computed efficiently. Based on differentiable convex programming, the approximate scoring function can be expressed as:

\begin{array}{l} \tilde{B} (τ) = \min_{z \in ℝ^{n_{z}}} {‖\mathrm{diag} (d_{a}) z‖}_{2}^{2} \\ \text{s.t.} \; G_{t} z + W_{t} τ = 0 \end{array}

(21)

where \tilde{B} is obtained through convex optimization with learnable parameters d_{a} \in ℝ^{n_{z}}, G_{t} \in ℝ^{m_{z} \times n_{z}}, and W_{t} \in ℝ^{m_{z} \times L (m + p)}. To achieve the learning objective, i.e., ensuring that \mathrm{Prox}_{\tilde{B}}(τ) ≈ \mathrm{Prox}_{B}(τ), the differentiable convex program is trained with a gradient-based approach that iteratively updates the parameters d_{a}, G_{t}, and W_{t}.

Let

d_{a} = {[\underbrace{\sqrt{λ_{g}}, \dots, \sqrt{λ_{g}}}_{M \text{ times}}, \underbrace{\sqrt{λ_{u_{ini}}}, \dots, \sqrt{λ_{u_{ini}}}}_{m T_{ini} \text{ times}}, \underbrace{\sqrt{λ_{y_{ini}}}, \dots, \sqrt{λ_{y_{ini}}}}_{p T_{ini} \text{ times}}]}^{⊤},

G_{t} = \tilde{H}, and W_{t} = −I. Under these conditions, \tilde{B}(τ) is equivalent to B(τ). Based on (20), we derive the following results:

\begin{array}{l} (z^{*}, {\hat{τ}}^{*}) = \arg \min {‖d i a g (d_{a}) z‖}_{2}^{2} + \\ {‖\hat{τ} - τ‖}^{2} / 2, s . t . G_{t} z + W_{t} \hat{τ} = 0 . \\ P r o x_{\tilde{B}} (τ) = {\hat{τ}}^{*} . \end{array}

(22)

To solve (22) and compute the gradient of $\hat{τ}^{*}$ with respect to the learnable parameters $d_{a}$, $G_{t}$, and $W_{t}$, we employ an unrolling-based approach [30]. Specifically, the Douglas-Rachford Splitting (DRS) method [31] is used to solve (22) iteratively:

\begin{array}{l} \begin{bmatrix} z^{k_{a} + 1/2} \\ \hat{τ}^{k_{a} + 1/2} \end{bmatrix} = \begin{bmatrix} \mathrm{sh}_{d_{a}}(ξ_{a}^{k_{a}}) \\ \frac{τ + η_{a}^{k_{a}}}{2} \end{bmatrix}, \\ \begin{bmatrix} z^{k_{a} + 1} \\ \hat{τ}^{k_{a} + 1} \end{bmatrix} = (I - \tilde{G}_{t}^{†} \tilde{G}_{t}) \begin{bmatrix} 2 z^{k_{a} + 1/2} - ξ_{a}^{k_{a}} \\ 2 \hat{τ}^{k_{a} + 1/2} - η_{a}^{k_{a}} \end{bmatrix}, \\ \begin{bmatrix} ξ_{a}^{k_{a} + 1} \\ η_{a}^{k_{a} + 1} \end{bmatrix} = \begin{bmatrix} ξ_{a}^{k_{a}} + z^{k_{a} + 1} - z^{k_{a} + 1/2} \\ η_{a}^{k_{a}} + \hat{τ}^{k_{a} + 1} - \hat{τ}^{k_{a} + 1/2} \end{bmatrix}, \end{array}

(23)

where $ξ_{a}$ and $η_{a}$ are auxiliary variables, $\tilde{G}_{t} = [G_{t} \; W_{t}]$, $\tilde{G}_{t}^{†}$ denotes its pseudoinverse, and $\mathrm{sh}_{d_{a}}(y)_{i}$ is defined as:

\mathrm{sh}_{d_{a}}(y)_{i} = \begin{cases} \frac{y_{i}}{1 + 2 (d_{a,i})^{2}} & \text{if } y_{i} > 0, \\ \frac{y_{i}}{1 + 2 (d_{a,i})^{2}} & \text{if } y_{i} < 0, \\ 0 & \text{otherwise}. \end{cases}

(24)

Here, $d_{a,i}$ is the i-th element of $d_{a}$. Figure 4 illustrates the unrolled DRS iterations.
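The DRS iteration can be sketched in NumPy as follows. This is a minimal illustration and not the authors' implementation: the projection step is interpreted here as the orthogonal projector onto the null space of $\tilde{G}_{t}$, i.e., $I - \tilde{G}_{t}^{†} \tilde{G}_{t}$, and the function names, problem sizes, and the KKT-based reference solver are assumptions introduced only to check the result.

```python
import numpy as np


def prox_bhat_drs(tau, d_a, G_t, W_t, num_iters=300):
    """Approximate Prox_{B_hat}(tau) by running the DRS iteration (23)."""
    n_z = G_t.shape[1]
    G_tilde = np.hstack([G_t, W_t])                       # [G_t  W_t]
    # Orthogonal projector onto the null space of G_tilde (assumed form).
    proj = np.eye(G_tilde.shape[1]) - np.linalg.pinv(G_tilde) @ G_tilde
    xi = np.zeros(n_z)                                    # auxiliary variable for z
    eta = np.zeros_like(tau)                              # auxiliary variable for tau_hat
    z_new, tau_new = xi.copy(), eta.copy()
    for _ in range(num_iters):
        # Step 1: proximal step on the objective (shrinkage (24) and averaging).
        z_half = xi / (1.0 + 2.0 * d_a**2)
        tau_half = 0.5 * (tau + eta)
        # Step 2: project the reflected point onto {G_t z + W_t tau_hat = 0}.
        reflected = np.concatenate([2.0 * z_half - xi, 2.0 * tau_half - eta])
        projected = proj @ reflected
        z_new, tau_new = projected[:n_z], projected[n_z:]
        # Step 3: update the auxiliary variables.
        xi = xi + z_new - z_half
        eta = eta + tau_new - tau_half
    return z_new, tau_new


def prox_bhat_kkt(tau, d_a, G_t, W_t):
    """Reference solution of (22) from its KKT system (assumes full row rank)."""
    n_z, n_tau = G_t.shape[1], tau.size
    G_tilde = np.hstack([G_t, W_t])
    m_z = G_tilde.shape[0]
    Q = np.diag(np.concatenate([2.0 * d_a**2, np.ones(n_tau)]))
    kkt = np.block([[Q, G_tilde.T], [G_tilde, np.zeros((m_z, m_z))]])
    rhs = np.concatenate([np.zeros(n_z), tau, np.zeros(m_z)])
    sol = np.linalg.solve(kkt, rhs)
    return sol[:n_z], sol[n_z:n_z + n_tau]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    d_a = rng.uniform(0.5, 1.5, 8)                        # illustrative data only
    G_t = rng.standard_normal((3, 8))
    W_t = rng.standard_normal((3, 4))
    tau = rng.standard_normal(4)
    _, tau_drs = prox_bhat_drs(tau, d_a, G_t, W_t)
    _, tau_ref = prox_bhat_kkt(tau, d_a, G_t, W_t)
    print("max |DRS - KKT| =", np.max(np.abs(tau_drs - tau_ref)))
```

Each pass through the loop uses only element-wise scalings and fixed matrix products, which is what allows the iteration to be treated as a stack of differentiable layers.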

During the offline training phase, the internal optimization problem that defines the approximate scoring function $\hat{B}(τ)$ is reformulated as a differentiable computational module using algorithm unrolling. Specifically, the DRS iteration in (23) is expanded into a sequence of differentiable network layers, as illustrated in Figure 4. This transformation makes it possible to use existing deep learning frameworks to train the learnable parameters $d_{a}$, $G_{t}$, and $W_{t}$, so that the output of the module (i.e., the score value) closely approximates the true scoring function $B(τ)$.

The key steps in mapping the DRS iteration to network layers are: (i) initializing the input–output sequence τ along with the initial values of the variables $z$, $\hat{τ}$, $ξ_{a}$, and $η_{a}$; (ii) mapping each DRS iteration to a network layer, which consists of the shrinkage operation in (24) and affine transformations; and (iii) obtaining the output as the proximal operator result $\mathrm{Prox}_{\hat{B}}(τ)$. The learnable parameters $d_{a}$, $G_{t}$, and $W_{t}$ are optimized through end-to-end training.

To ensure that the proximal operator of the approximated scoring function, $\mathrm{Prox}_{\hat{B}}(τ)$, closely matches that of the true scoring function, $\mathrm{Prox}_{B}(τ)$, we define the mean squared error (MSE) loss function as:

L(θ_{e}; D) = \sum_{τ \in D} ‖\mathrm{Prox}_{\hat{B}}(τ) - \mathrm{Prox}_{B}(τ)‖_{2}^{2} .

(25)

In this formulation, $θ_{e} = (d_{a}, G_{t}, W_{t})$ represents the set of learnable parameters, and D denotes the set of input–output trajectory sequences. The learning objective is to adjust $θ_{e}$ such that the approximated proximal operator $\mathrm{Prox}_{\hat{B}}(τ)$ closely approximates the true proximal operator $\mathrm{Prox}_{B}(τ)$. Since (22) is solved through the DRS iterative process, and all iteration steps consist of differentiable operations, the gradient of the loss function with respect to $θ_{e}$ can be computed via backpropagation. In this work, the Adam optimization algorithm is employed to perform the gradient-based updates.
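The loss in (25) can be written as a short function that takes the two proximal maps as callables. The sketch below is illustrative only: the name `prox_sq_error_loss` and the two closed-form stand-in maps in the demo are hypothetical, and no optimizer step is included.

```python
import numpy as np


def prox_sq_error_loss(prox_hat, prox_true, dataset):
    """Sum of squared errors between two proximal maps over a dataset, as in (25)."""
    return float(
        sum(np.sum((prox_hat(tau) - prox_true(tau)) ** 2) for tau in dataset)
    )


if __name__ == "__main__":
    # Hypothetical stand-ins: proximal maps of c * ||x||^2 for two different weights c.
    prox_true = lambda tau: tau / (1.0 + 2.0 * 1.0)   # c = 1.0
    prox_hat = lambda tau: tau / (1.0 + 2.0 * 0.5)    # c = 0.5
    rng = np.random.default_rng(0)
    dataset = [rng.standard_normal(4) for _ in range(10)]
    print("loss =", prox_sq_error_loss(prox_hat, prox_true, dataset))
```

In a training loop, `prox_hat` would be the unrolled module parameterized by $θ_{e}$, and an automatic differentiation framework would supply the gradient of this loss.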

4.3. Data-Driven Voltage Control Model

We now elaborate on the application of this approach to the control problems discussed in Section 3. Consider a DN model constructed using the DistFlow branching equations, Equations (2)–(5), where the system parameters M* and N* are unavailable. The data collection process is defined as y(t) = V(t) ∈ ℝⁿ, representing the measured voltage magnitude, while the input variables are denoted as x(t) = [P(t), Q(t)] ∈ ℝ^m, representing the active and reactive power from the BESSs and PV generation. Given the assumption that the uncontrolled voltage V^par remains constant, i.e., Ꞷ^e(t) = 0, only the bus voltage, active power from the BESS, and reactive power from the PV generation are recorded during historical data collection. Consequently, the DeePC-based voltage control formulation in (8) is expressed as follows:

A_{f}(τ) = ‖V - r‖_{Ξ}^{2} + ‖P‖_{R}^{2} + ‖Q‖_{Ψ}^{2} + \mathbb{I}_{(P, Q)^{L} \times V^{L}}(τ) + \mathbb{I}_{\{= τ_{ini}\}}(F_{ini} τ)

(26)

\begin{array}{l} \hat{B}_{f}(τ) = \min_{z \in ℝ^{n_{z}}} ‖\mathrm{diag}(d_{b}) z‖_{2}^{2} \\ \mathrm{s.t.} \; G_{f} z + W_{f} τ = 0 \end{array}

(27)

where

d_{b} = [\underbrace{\sqrt{λ_{g}}, \dots, \sqrt{λ_{g}}}_{M \text{ times}}, \underbrace{\sqrt{λ_{Q_{ini}}}, \sqrt{λ_{P_{ini}}}, \dots, \sqrt{λ_{Q_{ini}}}, \sqrt{λ_{P_{ini}}}}_{m T_{ini} \text{ times}}, \underbrace{\sqrt{λ_{V_{ini}}}, \dots, \sqrt{λ_{V_{ini}}}}_{p T_{ini} \text{ times}}]^{T} .

The overall control problem then becomes:

\begin{array}{l} \min_{τ} A_{f}(τ) + \hat{B}_{f}(τ) \\ \mathrm{s.t.} \; (8b)-(8j) \end{array}

(28)

This formulation ensures optimal voltage control based on historical data while adhering to the network constraints. The parameterized model $\hat{B}(τ)$ is computed through offline training and obtained by solving the internal optimization problem using the DRS algorithm. The algorithm transforms this problem into a differentiable module, eliminating the dependency of $B(τ)$ on large-scale data matrices. Instead, it is computed using the pre-learned parameters $θ_{e}$ and a fixed-dimension differentiable convex optimization problem. Since $\hat{B}(τ)$ is a fixed-scale problem, its computation time remains independent of data volume, ensuring that the complexity of online control does not grow as more historical data are collected.

During the offline implementation stage, historical trajectory data (Q^d, P^d, V^d) of length T is first collected to construct the matrices Q_p, P_p, V_p, Q_f, P_f, and V_f. Each segment in H is augmented with random noise to form a dataset D. For each trajectory τ ∈ D, the proximal operator $\mathrm{Prox}_{B}(τ)$ of the true scoring function $B(τ)$ is computed. Next, the parameters of the approximated scoring function $\hat{B}(τ)$, denoted as $θ_{e}$, are randomly initialized.
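The construction of the past and future data matrices can be sketched as a block Hankel matrix split at T_ini block rows. This follows the usual DeePC convention and is an assumption about the exact layout used here. The function names and the two-signal demo data are hypothetical, while T = 600, T_ini = 6, and T_f = 12 are taken from Table 1.

```python
import numpy as np


def block_hankel(data, depth):
    """Build a block Hankel matrix with `depth` block rows.

    data: array of shape (T, n_signals), one row per sample.
    Returns an array of shape (depth * n_signals, T - depth + 1).
    """
    data = np.asarray(data, dtype=float)
    T, n_signals = data.shape
    n_cols = T - depth + 1
    if n_cols < 1:
        raise ValueError("trajectory is shorter than the requested depth")
    H = np.empty((depth * n_signals, n_cols))
    for i in range(depth):
        # Block row i holds samples i, i+1, ..., i+n_cols-1 (one sample per column).
        H[i * n_signals:(i + 1) * n_signals, :] = data[i:i + n_cols].T
    return H


def split_past_future(data, T_ini, T_f):
    """Split the Hankel matrix into past (T_ini) and future (T_f) block rows."""
    n_signals = np.asarray(data).shape[1]
    H = block_hankel(data, T_ini + T_f)
    return H[:T_ini * n_signals], H[T_ini * n_signals:]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Hypothetical voltage record: 600 samples of two bus voltages around 1 p.u.
    V_d = 1.0 + 0.01 * rng.standard_normal((600, 2))
    V_p, V_f = split_past_future(V_d, T_ini=6, T_f=12)
    print(V_p.shape, V_f.shape)
```

With these settings each matrix has T - (T_ini + T_f) + 1 = 583 columns, so the column count grows with the record length T while the fixed-size surrogate does not.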

During forward propagation, the module for computing $\hat{B}(τ)$ is executed for each τ ∈ D, generating $\mathrm{Prox}_{\hat{B}}(τ)$ and computing the loss function. In the backward propagation step, the gradient of the loss function with respect to the learnable parameters is computed via automatic differentiation, and the Adam optimizer is applied to update the parameters. Training stops when the change in the loss function falls below a predefined threshold or the maximum number of training iterations is reached. The learned approximation is then integrated into the DeePC framework and solved in a receding horizon manner. Finally, the control cycle is systematically managed to ensure effective operation. Figure 5 illustrates the flowchart of the voltage control process for the DN using the DeePC algorithm.

5. Case Study

In this section, we introduce the modified IEEE 34-bus test system and present the simulation results. The primary objective is to verify the effectiveness and advantages of the proposed DeePC-based voltage controller.

5.1. Experimental Settings and Offline Data Collection

Figure 6 presents a schematic of the IEEE 34-bus test system, from which we extracted raw parameters and power dispatch information. Detailed information regarding the raw line impedance and shunt conductance is available in the literature [26]. The system’s base power and power/voltage ratings follow the specifications in [32]. The proposed voltage control strategy aims to regulate the bus voltage magnitude within the range of 0.95 to 1.05 p.u.

In this system, each bus is equipped with PV generation. We assume that the active and reactive power from PV generation, as well as the load at each bus, fluctuate slightly around their nominal values due to random noise. The upper and lower reactive power limits of each PV unit are adjusted according to the inverter rating, which is set to 9 kVA, and the active power output. Additionally, seven BESSs are deployed at buses 3, 6, 10, 17, 21, 26, and 31.

For simulation purposes, substation bus voltages are fixed at 1 p.u. To ensure realistic conditions, we incorporate PV generation and load profiles at a 1 min resolution, as illustrated in Figure 7. Further details on the DeePC-based algorithm parameters can be found in Table 1, while Table 2 provides the parameters of the BESS.

The control parameters are carefully selected based on both physical insights and preliminary simulation studies. The prediction horizon T_f determines the time window over which future system behavior is optimized; a longer T_f improves long-term performance but increases computational burden. The historical horizon T_ini captures recent system dynamics for state initialization in the prediction model, and is set to match the dominant time scales of load and generation variations.

Regularization weights λ_g,

λ_{Q_{ini}}

,

λ_{v_{ini}}

, and

λ_{P_{ini}}

penalize deviations from nominal control inputs and outputs in the initial trajectory, enhancing robustness against measurement noise and model inaccuracies. Specifically, a larger λ_g strengthens noise suppression but may slow down dynamic response. The control weighting matrices R and Ξ balance control effort against tracking performance, where higher values reduce actuator wear at the cost of slower voltage regulation. The penalty coefficient κ enforces constraint satisfaction in the optimization; a sufficiently large κ ensures feasibility but may amplify numerical sensitivity.

In Section 4, the proposed control method involves generating historical input–output trajectories offline using continuously stimulated input data. To enhance realism, process and measurement noise are incorporated into the simulation. The sampling period is set to 1 min, and the controller executes 1440 iterations with distinct datasets of length T to construct the Hankel matrix.

At the current sampling instant t, the voltage, the active power of the BESS, and the reactive power of the PV are predicted over the prediction range [t, t + T_f] using the collected historical trajectories and the initial trajectory of length T_ini. The initial trajectory of the DeePC algorithm is updated at the next time instant t + t_s by applying the first control input signal to the system at the current time instant. The whole process is repeated after t_s time steps.
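The receding-horizon procedure described above can be sketched with a sliding window for the initial trajectory. This is a schematic only: `plant_step` and `solve_deepc` are hypothetical callables standing in for the network simulation and the DeePC optimization, and the toy plant and planner in the demo are invented to make the loop runnable.

```python
from collections import deque


def receding_horizon(plant_step, solve_deepc, u_init, y_init, n_steps):
    """Run a receding-horizon loop with a sliding initial trajectory.

    plant_step(u) -> y          : applies one input and returns the measurement.
    solve_deepc(u_ini, y_ini)   : returns a planned input sequence.
    u_init, y_init              : the most recent T_ini inputs and outputs.
    """
    T_ini = len(u_init)
    u_ini = deque(u_init, maxlen=T_ini)   # oldest sample is dropped automatically
    y_ini = deque(y_init, maxlen=T_ini)
    applied, measured = [], []
    for _ in range(n_steps):
        plan = solve_deepc(list(u_ini), list(y_ini))
        u_now = plan[0]                   # only the first planned input is applied
        y_now = plant_step(u_now)
        u_ini.append(u_now)               # slide the initial trajectory by one sample
        y_ini.append(y_now)
        applied.append(u_now)
        measured.append(y_now)
    return applied, measured


if __name__ == "__main__":
    # Toy scalar plant (hypothetical): voltage of 1.08 p.u. without control.
    def toy_plant(u):
        return 1.08 + 0.5 * u

    # Toy planner (hypothetical): integral-type correction toward 1.0 p.u.
    def toy_planner(u_ini, y_ini):
        u_next = u_ini[-1] - (y_ini[-1] - 1.0)
        return [u_next] * 3

    _, voltages = receding_horizon(toy_plant, toy_planner, [0.0, 0.0], [1.08, 1.08], 20)
    print("final voltage:", voltages[-1])
```

The fixed-length `deque` mirrors the update of the initial trajectory: after each control step the newest input and output enter the window and the oldest pair leaves it.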

5.2. Simulation Results

In our simulations, we consider three different scenarios to demonstrate the effectiveness and advantages of the proposed data-driven approach. The control objective is to maintain the voltage at each bus within the permissible range by controlling the power output of the controllable devices. Figures 8–10 and 12 show the collective voltage profiles of all load buses over time; each curve represents the voltage trajectory of one bus, illustrating the overall voltage distribution across the system.

Scenario 1 (Basic Effectiveness Test): The mismatch between PV generation and peak load demand makes the distribution network vulnerable to over-voltage at midday and under-voltage in the evening. This effect is depicted in Figure 8, where voltage deviations are highlighted. In the absence of a control strategy, the 34-bus test system undergoes significant voltage violations, with bus voltages exceeding acceptable operational thresholds. Figure 9 presents the daily voltage profile achieved with the proposed data-driven control scheme. The voltage control outcomes show that the proposed control strategy effectively maintains most bus voltages within the permissible range of 0.95 to 1.05 p.u.

Scenario 2 (Comparison with the Model-Based Method): To highlight the advantages of the proposed data-driven control methodology, its performance is compared with that of the model-based scheme in (8). Unlike model-based strategies, which depend on specific system parameters, the data-driven approach avoids the need for parameter identification. The comparison in Figure 9 and Figure 10 shows that both control methods yield similar outcomes, validating the effectiveness of the proposed control strategy. Furthermore, Figure 11 compares the daily RMS voltage averages of the two methods. The results indicate that the data-driven approach is largely comparable to the model-based method in terms of control performance. Notably, the proposed DeePC voltage control method effectively addresses the voltage violation issue, matching the traditional model-based strategy in its capacity to manage voltage levels. Moreover, the method regulates global voltage through local control without requiring detailed knowledge of network topology or line parameters, which improves the economic operation of the storage systems and the overall practicality of the scheme.

Scenario 3 (Robustness Test): To demonstrate the robustness of the proposed control strategy, the data-driven controller is tested under changed system conditions. In particular, the output power of PV generation at each bus is reduced by 5%, and the BESSs are relocated from buses 3, 6, 10, 17, 21, 26, and 31 to buses 4, 7, 11, 18, 22, 27, and 32. The resulting bus voltage profiles are presented in Figure 12. A comparison between Figure 9 and Figure 12 shows that the proposed data-driven control method still mitigates voltage violations, remaining robust to changes in system parameters and network topology.

Figure 8, Figure 9, Figure 10 and Figure 12 illustrate the dynamic voltage responses across all buses. To further quantify the performance differences, Table 3 summarizes the key voltage metrics, namely the maximum voltage deviation, the average voltage deviation, and the voltage violation duration ratio, under the three control strategies. Both the model-based and data-driven approaches significantly reduce the maximum voltage deviation compared to no control and greatly decrease the voltage violation duration ratio. Notably, the proposed data-driven control achieves performance on par with the model-based method, despite not relying on any explicit system model. For instance, the voltage violation duration ratio is reduced from approximately 18.1% in the no-control scenario to just 2.3% under the proposed data-driven control, which is very close to the 2.1% achieved by the model-based counterpart. This near-equivalent performance underscores the effectiveness of the proposed approach in learning accurate control policies directly from data, highlighting its potential as a practical and model-free solution for voltage regulation in active distribution networks.

5.3. Parameter Sensitivity Analysis

To evaluate the robustness of the proposed controller to parameter variations, we conduct a sensitivity analysis on three key parameters: the regularization weight λ_g, the prediction horizon T_f, and the control effort weighting matrix R. Each parameter is varied across a representative range while keeping others fixed, and the closed-loop performance is assessed under the same test scenario as in Section 5.2.

Table 4 illustrates the impact of parameter variation on voltage regulation performance, measured by the root mean square error (RMSE) of node voltages, the total duration of voltage violations, and the number of energy storage system (ESS) charging/discharging cycles. As shown, increasing λ_g from 1 to 10 reduces the voltage RMSE by 18% due to improved noise filtering, but further increasing it to 100 leads to a 12% performance degradation due to sluggish response. A prediction horizon T_f of 30–60 s yields the best trade-off between performance and computational load; shorter horizons result in reactive control, while longer ones offer diminishing returns.

Similarly, increasing R reduces ESS cycling by up to 30%, which is beneficial for battery longevity, but at the expense of a 20% increase in voltage RMSE. These results highlight the need for balanced parameter tuning in practical deployment. The current fixed-parameter design demonstrates acceptable robustness across tested ranges, though adaptive tuning could further enhance performance.
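The percentages quoted for the High λ_g and High R cases can be cross-checked against the corresponding rows of Table 4. The short script below does this arithmetic; the helper name `relative_change` is introduced here only for illustration.

```python
def relative_change(new, base):
    """Relative change of `new` with respect to `base`, in percent."""
    return 100.0 * (new - base) / base


# Rows copied from Table 4: (voltage RMSE in p.u., ESS cycles).
baseline = (0.012, 45)
high_lambda_g = (0.0135, 43)
high_R = (0.0144, 32)

rmse_high_lambda = relative_change(high_lambda_g[0], baseline[0])
rmse_high_R = relative_change(high_R[0], baseline[0])
cycles_high_R = relative_change(high_R[1], baseline[1])

print(f"RMSE change, high lambda_g: {rmse_high_lambda:+.1f}%")
print(f"RMSE change, high R:        {rmse_high_R:+.1f}%")
print(f"ESS cycle change, high R:   {cycles_high_R:+.1f}%")
```

The results are about +12.5% and +20.0% for the RMSE and about -28.9% for the ESS cycles, which agrees with the rounded figures of 12%, 20%, and "up to 30%" given in the text.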

5.4. Comparative Performance Analysis

To further validate the effectiveness and practical advantages of the proposed data-driven voltage control strategy, this section presents a comprehensive study comparing its performance with representative methods, summarized in Table 5.

While model-based MPC achieves good regulation accuracy, it relies on precise system models and suffers from high computational complexity. On the other hand, existing model-free reinforcement learning methods avoid modeling but require extensive training and exhibit limited generalization and real-time performance.

In contrast, the proposed method achieves a voltage violation duration ratio of 2.3% and a maximum deviation of 0.043 p.u., outperforming all listed learning-based approaches and closely matching the performance of model-based MPC—without requiring any system model or offline training. Moreover, its low online computational cost ensures high real-time feasibility. This demonstrates that the proposed strategy uniquely combines high accuracy, strong robustness, and ease of deployment, making it a superior choice for real-time voltage control in complex and uncertain distribution networks.

6. Conclusions and Future Works

In this paper, we present a novel DeePC-based control framework for bus voltage regulation in ADNs. The proposed framework leverages slack variables and regularization terms to effectively manage load uncertainties in net load conditions. Numerical case studies highlight the advantages of the data-driven voltage control approach, particularly in eliminating the need for precise physical modeling. Simulation results demonstrate that the data-driven controller reduces the voltage violation duration ratio from 18.1% (no control) to just 2.3%, with a maximum voltage deviation of 0.043 p.u.—performance that is comparable to model-based MPC (2.1%, 0.041 p.u.). Notably, the proposed data-driven control method effectively mitigates voltage violation issues, preserving control effectiveness despite variations in system parameters and network topology. Moreover, the reformulation using differentiable convex programming ensures low online computational burden, achieving high real-time feasibility.

Although the simulation validation in this work is based on the IEEE 34-bus system, the proposed scoring-function-based DeePC framework is inherently scalable and well-suited for larger distribution networks. The key lies in our reformulation of the original DeePC problem into a differentiable convex programming surrogate, which effectively mitigates the exponential growth of computational burden associated with large Hankel matrices in conventional DeePC. Specifically, the control policy is approximated by a fixed-dimensional parametric model trained offline, enabling real-time execution, for which the computational time is independent of the size of historical data. Furthermore, the architecture supports modular deployment; future extensions could leverage regional partitioning and coordinated local controllers to achieve distributed voltage regulation, thereby enhancing scalability. These features collectively ensure strong potential for application to larger systems, such as the IEEE 123-bus network or practical urban distribution grids.

For future work, we aim to investigate the integration of continuously stimulated large-scale input data and the application of the method to larger network topologies. Further research will also focus on validating the proposed solutions through practical implementation and testing on real power grids.

Author Contributions

Conceptualization, Y.L.; Methodology, Z.T.; Software, Q.L.; Validation, Y.L.; Formal analysis, Z.T. and Y.S.; Investigation, Y.Z.; Resources, Q.L.; Writing– original draft, Q.L. and Y.Z.; Writing– review and editing, Y.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data presented in this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

Author Yang Shi was employed by the company State Grid Ningbo Power Supply Company. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

1. Tonkoski, R.; Turcotte, D.; EL-Fouly, T.H.M. Impact of high PV penetration on voltage profiles in residential neighborhoods. IEEE Trans. Sustain. Energy 2012, 3, 518–527.
2. Ren, F.H.; Zhang, M.J.; Sutanto, D. A multi-agent solution to distribution system management by considering distributed generators. IEEE Trans. Power Syst. 2013, 28, 1442–1451.
3. Elkhatib, M.E.; El-Shatshat, R.; Salama, M.M.A. Novel coordinated voltage control for smart distribution networks with DG. IEEE Trans. Smart Grid 2011, 2, 598–605.
4. Von Appen, J.; Stetz, T.; Braun, M.; Schmiegel, A. Local voltage control strategies for PV storage systems in distribution grids. IEEE Trans. Smart Grid 2014, 5, 1002–1009.
5. Zeraati, M.; Golshan, M.E.H.; Guerrero, J. Distributed Control of Battery Energy Storage Systems for Voltage Regulation in Distribution Networks with High PV Penetration. IEEE Trans. Smart Grid 2017, 9, 3582–3593.
6. Wang, Y.; Tan, K.T.; Peng, X.Y.; So, P.L. Coordinated Control of Distributed Energy-Storage Systems for Voltage Regulation in Distribution Networks. IEEE Trans. Power Deliv. 2016, 31, 1132–1141.
7. Alam, M.J.E.; Muttaqi, K.M.; Sutanto, D. Mitigation of rooftop solar PV impacts and evening peak support by managing available capacity of distributed energy storage systems. IEEE Trans. Power Syst. 2013, 28, 3874–3884.
8. Kabir, M.N.; Mishra, Y.; Ledwich, G.; Dong, Z.Y.; Wong, K.P. Coordinated control of grid-connected photovoltaic reactive power and battery energy storage systems to improve the voltage profile of a residential distribution feeder. IEEE Trans. Ind. Inf. 2014, 10, 967–977.
9. Conti, S.; Greco, A.M.; Raiti, S. Local control of photovoltaic distributed generation for voltage regulation in LV distribution networks and simulation tools. Eur. Trans. Elect. Power 2009, 19, 798–813.
10. Hanif, A.; Choudhry, M. Dynamic voltage regulation and power export in a distribution system using distributed generation. J. Zhejiang Univ. Sci. A 2009, 10, 1523–1531.
11. Huo, Y.; Li, P.; Ji, H.; Yu, H.; Yan, J.; Wu, J.; Wang, C. Data-Driven Coordinated Voltage Control Method of Distribution Networks with High DG Penetration. IEEE Trans. Power Syst. 2023, 38, 1543–1557.
12. Hou, Z.; Jin, S. Data-driven model-free adaptive control for a class of MIMO nonlinear discrete-time system. IEEE Trans. Neural Netw. 2011, 22, 2173–2188.
13. Al-Saffar, M.; Musilek, P. Reinforcement Learning-Based Distributed BESS Management for Mitigating Overvoltage Issues in Systems with High PV Penetration. IEEE Trans. Smart Grid 2020, 11, 2980–2994.
14. Li, Y.; Wu, J.; Pan, Y. Deep Reinforcement Learning for Online Scheduling of Photovoltaic Systems with Battery Energy Storage Systems. Intell. Converg. Netw. 2024, 5, 28–41.
15. Pei, Y.; Zhao, J.; Yao, Y.; Ding, F. Multi-Task Reinforcement Learning for Distribution System Voltage Control with Topology Changes. IEEE Trans. Smart Grid 2023, 14, 2481–2484.
16. Cao, D.; Zhao, J.; Hu, W.; Yu, N.; Ding, F.; Huang, Q.; Chen, Z. Deep Reinforcement Learning Enabled Physical-Model-Free Two-Timescale Voltage Control Method for Active Distribution Systems. IEEE Trans. Smart Grid 2022, 13, 149–165.
17. Baran, M.; Wu, F.F. Optimal sizing of capacitors placed on a radial distribution system. IEEE Trans. Power Deliv. 1989, 4, 735–743.
18. Markovsky, I.; Huang, L.; Dörfler, F. Data-driven control based on the behavioral approach: From theory to applications in power systems. IEEE Control Syst. 2023, 43, 28–68.
19. Fiedler, F.; Lucia, S. On the relationship between data-enabled predictive control and subspace predictive control. In Proceedings of the 2021 European Control Conference (ECC), Delft, The Netherlands, 29 June–2 July 2021; pp. 222–229.
20. Huang, L.; Coulson, J.; Lygeros, J.; Dörfler, F. Data-Enabled Predictive Control for Grid-Connected Power Converters. In Proceedings of the 2019 IEEE 58th Conference on Decision and Control (CDC), Nice, France, 11–13 December 2019; pp. 8130–8135.
21. Coulson, J.; Lygeros, J.; Dörfler, F. Distributionally Robust Chance Constrained Data-Enabled Predictive Control. IEEE Trans. Autom. Control 2022, 67, 3289–3304.
22. Yu, W.; Tang, Z.; Xiong, W. Distributed robust data-enabled predictive control based voltage control for networked microgrid system. Electr. Power Syst. Res. 2024, 231, 110360.
23. Kekatos, V.; Zhang, L.; Giannakis, G.B.; Baldick, R. Voltage Regulation Algorithms for Multiphase Power Distribution Grids. IEEE Trans. Power Syst. 2016, 31, 3913–3923.
24. Willems, J.C.; Rapisarda, P.; Markovsky, I.; De Moor, B.L.M. A note on persistency of excitation. Syst. Control Lett. 2005, 54, 325–329.
25. Lou, G.; Gu, W.; Lu, X.; Xu, Y.; Hong, H. Distributed Secondary Voltage Control in Islanded Microgrids with Consideration of Communication Network and Time Delays. IEEE Trans. Smart Grid 2020, 11, 3702–3715.
26. Su, X.; Masoum, M.A.S.; Wolfs, P.J. Optimal PV Inverter Reactive Power Control and Real Power Curtailment to Improve Performance of Unbalanced Four-Wire LV Distribution Networks. IEEE Trans. Sustain. Energy 2014, 5, 967–977.
27. Guo, C.; Wang, X.; Zheng, Y.; Zhang, F. Optimal energy management of multi-microgrids connected to distribution system based on deep reinforcement learning. Int. J. Electr. Power Energy Syst. 2021, 131, 107048.
28. Zhou, Y.; Lu, Y.; Li, Z.; Yan, J.; Mo, Y. Learning-Based Efficient Approximation of Data-Enabled Predictive Control. In Proceedings of the 2024 IEEE 63rd Conference on Decision and Control (CDC), Milan, Italy, 15–18 December 2024; pp. 322–327.
29. Parikh, N.; Boyd, S. Proximal algorithms. Found. Trends Optim. 2014, 1, 127–239.
30. Monga, V.; Li, Y.; Eldar, Y.C. Algorithm unrolling: Interpretable, efficient deep learning for signal and image processing. IEEE Signal Process. Mag. 2021, 38, 18–44.
31. Eckstein, J.; Bertsekas, D.P. On the Douglas-Rachford splitting method and the proximal point algorithm for maximal monotone operators. Math. Program. 1992, 55, 293–318.
32. Xiong, W.; Tang, Z.; Cui, X. Distributed data-driven voltage control for active distribution networks with changing grid topologies. Control Eng. Pract. 2024, 147, 105933.

Figure 1. Structure of the distribution network.

Figure 2. Schematic illustration of the DeePC formulation of (13).

Figure 3. Schematic illustration of the overall framework.

Figure 4. Schematic illustration of the unrolled DRS iterations.

Figure 5. Flow chart of voltage control solution based on DeePC algorithm.

Figure 6. The IEEE 34-bus test system. The red indexes indicate the buses with BESSs.

Figure 7. Daily active power profiles of load and PV generation.

Figure 8. Daily voltage profile across all load buses without control. (Each curve represents the voltage trajectory of one bus. The green curve with larger fluctuations is Bus 34; the blue curve with smaller fluctuations near 1.0 p.u. is Bus 1.).

Figure 9. Daily voltage profile across all load buses with the data-driven approach. (Each curve represents the voltage trajectory of one bus. The green curve with larger fluctuations is Bus 34; the blue curve with smaller fluctuations near 1.0 p.u. is Bus 1.).

Figure 10. Daily voltage profile across all load buses with the model-based approach. (Each curve represents the voltage trajectory of one bus. The green curve with larger fluctuations is Bus 34; the blue curve with smaller fluctuations near 1.0 p.u. is Bus 1.).

Figure 11. Daily RMS voltage average with the model-based and data-driven approach.

Figure 12. Daily voltage profile across all load buses with the data-driven approach under changes in system parameters and topology. (Each curve represents the voltage trajectory of one bus. The green curve with larger fluctuations is Bus 34; the blue curve with smaller fluctuations near 1.0 p.u. is Bus 1.).

Table 1. Parameters of the test system.

T	T_ini	T_f	R	Ψ	Q	$λ_{Q_{ini}}$	$λ_{v_{ini}}$	$λ_{P_{ini}}$	λ_g	κ
600	6	12	100I	100I	I	1 × 10⁵	1 × 10⁵	1 × 10⁵	100	1 × 10⁵

Table 2. BESSs parameters.

$p_{dis}^{\max}$	$p_{bat}^{\max}$	SoC_max	SoC⁰	ρ_min	ρ_max	μ
6 kW	6 kW	15 kVA⸱h	4 kVA⸱h	0.2	0.8	0.96

Table 3. Quantitative Comparison of Voltage Performance.

Control Strategy	Max Voltage Deviation (p.u.)	Avg Voltage Deviation (p.u.)	Voltage Violation Duration Ratio (%)
No Control	0.142	0.068	18.1
Model-Based Control	0.041	0.012	2.1
Proposed Data-Driven Control	0.043	0.013	2.3

Table 4. Performance comparison under different parameter settings.

Parameter Config	λ_g	T_f	R	Voltage RMSE (p.u.)	Violation Time (min)	ESS Cycles
Baseline	10	60	1	0.012	8.2	45
High λ_g	100	60	1	0.0135	9.1	43
Low T_f	10	30	1	0.018	15.6	52
High R	10	60	10	0.0144	10.3	32

Table 5. Comparison of Voltage Control Performance with Recent Studies.

Control Method	Model-Free?	Offline Training Required?	Max Voltage Deviation (p.u.)	Voltage Violation Duration Ratio (%)	Real-Time Feasibility
Model-Based MPC	No	Yes	~0.040	~2.0	Moderate
Deep RL	Yes	Yes	~0.055	~4.5	Limited
Model-Free RL	Yes	Yes	~0.058	~3.8	Low
Proposed	Yes	No	0.043	2.3	High

Note: Performance values are aggregated from recent studies on voltage control in IEEE 34 bus systems under comparable loading and disturbance scenarios.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
