Model-Free Cooperative Control for Volt-Var Optimization in Power Distribution Systems

Yadav, Gaurav; Liao, Yuan; Cramer, Aaron M.

doi:10.3390/en18154061


by Gaurav Yadav, Yuan Liao * and Aaron M. Cramer

Department of Electrical and Computer Engineering, University of Kentucky, Lexington, KY 40506, USA

* Author to whom correspondence should be addressed.

Energies 2025, 18(15), 4061; https://doi.org/10.3390/en18154061

Submission received: 18 June 2025 / Revised: 24 July 2025 / Accepted: 29 July 2025 / Published: 31 July 2025


Abstract

Power distribution systems are witnessing a growing deployment of distributed, inverter-based renewable resources such as solar generation. This poses certain challenges such as rapid voltage fluctuations due to the intermittent nature of renewables. Volt-Var control (VVC) methods have been proposed to utilize the ability of inverters to supply or consume reactive power to mitigate fast voltage fluctuations. These methods usually require a detailed power network model including topology and impedance data. However, network models may be difficult to obtain. Thus, it is desirable to develop a model-free method that obviates the need for the network model. This paper proposes a novel model-free cooperative control method to perform voltage regulation and reduce inverter aging in power distribution systems. This method assumes the existence of time-series voltage and load data, from which the relationship between voltage and nodal power injection is derived using a feedforward artificial neural network (ANN). The node voltage sensitivity versus reactive power injection can then be calculated, based on which a cooperative control approach is proposed for mitigating voltage fluctuation. The results obtained for a modified IEEE 13-bus system using the proposed method have shown its effectiveness in mitigating fast voltage variation due to PV intermittency. Moreover, a comparative analysis between model-free and model-based methods is provided to demonstrate the feasibility of the proposed method.

Keywords:

artificial neural network; cooperative control; inverter-based resources; distributed generators; irradiance; solar PVs; voltage and var control

1. Introduction

Traditional distribution networks have been evolving into active distribution networks (ADNs) with the increasing deployment of distributed generators (DGs). Renewable DGs, when introduced into the system, can lead to challenges such as fast voltage fluctuations due to DG output intermittency. In traditional networks, feeder devices such as on-load tap changers (OLTCs) and capacitor banks (CBs) are used to mitigate these fluctuations. However, such devices, when used in renewable DG-integrated systems, might need frequent switching, which would lead to increased operation and maintenance (O&M) costs.

To address this issue, various Volt-Var control (VVC) techniques have been designed. These techniques can perform voltage regulation, peak load shaving, and various other functions with the help of smart inverters capable of operation in four quadrants. The authors of [1,2] developed a voltage regulation method focused on the coordinated operation of OLTCs and the reactive power of DGs. A deep reinforcement learning (DRL)-based method was devised for VVC in [3]. The authors of [4,5] developed a kernel-based model and distributed model predictive control, respectively, to address voltage fluctuations through optimal reactive power dispatch. A dual-layer DRL-based method was proposed in [6,7] to perform voltage regulation. Cooperative control-based voltage regulation methods that harness inverter reactive power capability have been proposed in [8,9,10].

Many of these previous methods are model-based, i.e., they require distribution network models including topology and impedance data, which may be difficult to obtain [11]. This calls for developing a VVC method agnostic to the network model, i.e., model-free VVC. The model-free method relies on the data obtained from metering devices such as advanced metering infrastructure (AMI), from which the voltage and power relationship can be derived for the feeder under control. A comprehensive review was provided by [12] focusing on the need to combine artificial intelligence (AI) with distributed energy resources (DERs) to create an efficient, economic, and adaptive energy system, along with adopting cybersecurity measures to strengthen the resilience and reliability of the power system.

Some model-free studies have been proposed in the field of VVC. The authors of [13] developed a strategy to perform hourly voltage regulation, maximize generated active power, and minimize active power loss using VVC and volt/watt control (VWC), selecting the optimal parameters with a parallel genetic algorithm (GA) or parallel particle swarm optimization (PSO) while considering various PV uncertainties. A neural network based on Local Outlier Factor (LOF), Long Short-Term Memory (LSTM), and Gate Recurrent Unit with Normalization (GN) was then developed to reduce the search time for the optimal control strategy. A model-free VVC was performed by [14,15] for wind farms using a Koopman operator-based method and zeroth-order feedback optimization, respectively. An extremum-seeking approach was developed by [16] for VVC, which required only local optimization to minimize power losses. A model-free VVC framework was developed by [17] based on statistical analysis of measurement data using the K-nearest neighbor (KNN) method with Principal Component Analysis (PCA). This method was effective even in the presence of measurement noise owing to its noise-filtering feature.

A DRL-based model-free VVC was proposed by the authors of [18,19,20,21,22,23], where Refs. [18,19] used a multi-agent DRL framework, Ref. [20] used a two-agent framework, and Refs. [21,22] relied on a single-agent framework. A multi-agent framework was used for the faster time scale, whereas a single agent was selected for the slower time scale optimization in [23]. Ref. [18] devised a cooperative bi-level framework for VVC between distribution system operators (DSOs) and customers. A reward-based function was developed by [19,21] to perform voltage regulation and minimize power loss, where [19] considered capacitors, voltage regulators, and smart inverters, whereas [21] considered only PVs. Ref. [22] proposed a Markov decision-based VVC scheme that used a safe deep reinforcement learning (SDRL) algorithm to ensure that safety constraints are followed during the learning process. Dual-layer optimization was proposed by [20,23] to regulate voltage, where the first layer obtained schedules for OLTCs and CBs, whereas the second layer focused on optimal dispatch for PV inverters. The authors of [24] presented a robust regression-based feedback optimization algorithm and a revised alternating direction method of multipliers (ADMM) to perform VVC for ADNs containing multiple virtual power plants (VPPs).

Refs. [13,14,15,16,17,18,19,20,21,22,23,24] focused on using PVs as DERs, Refs. [14,15] focused on wind energy, and Ref. [19] chose a hybrid model combining PV and wind. However, they did not consider electric vehicles (EVs) in their studies of volt-var control. Refs. [25,26] devised VVC strategies that consider EVs, aimed at minimizing power losses and voltage limit violations. A three-level VVC framework was developed by [27] to mitigate voltage violations and minimize real power loss and peak demand. A recurrent neural network was proposed by [28] to solve a VVC problem modeled as a Markov decision process for voltage regulation. Some studies have focused on power converter design and control using reinforcement learning [29,30], which can be used for various power sector applications such as volt-var control.

However, unlike the study proposed in this paper, the above-mentioned model-free volt-var control studies do not consider fast voltage regulation, i.e., on a time scale of seconds, or the aging of smart inverters. Hence, the paper makes the following contributions. A novel model-free VVC framework based on a cooperative control algorithm is proposed for voltage regulation on a time scale of seconds while considering PV inverter aging due to reactive power operation. This framework does not rely on feeder parameters and is based on the voltage and power relationship model derived from an ANN. Cooperative control is a distributed optimization method that focuses on reactive power optimization of DGs by using their local information and information from neighboring nodes. Moreover, an adaptive gain-based method [31] is used to calculate the gradient gains to help convergence.

The remainder of the paper is organized as follows. Section 2 presents the ANN that models the relationship between nodal power injections and node voltages. Section 3 describes the cooperative control algorithm and the VVC problem. Section 4 illustrates and discusses the results obtained using the proposed method. It also provides a comparative analysis between model-based and model-free control methods. Section 5 provides the conclusion.

2. ANN That Models Voltage and Power Relationship

This section describes the process of creating an ANN that models the relationship between node voltages and nodal power injections based on measurement data without using the power grid topology and impedance data.

2.1. Structure of the Neural Network

The power distribution network has been modeled on the phase level, where a neural network has been defined individually for each phase to derive the relationship between voltage and nodal power injection. The adopted neural network model is shown in Figure 1. The network contains one input layer, one hidden layer, and one output layer. The number of neurons in the input and output layers depends on the number of inputs and outputs, i.e., m and n, respectively, whereas the number of neurons in the hidden layer is defined by the user. The activation function chosen for the hidden layer is the sigmoid function (σ) shown in (1).

σ (x_{k}) = \frac{1}{1 + e^{- x_{k}}}

(1)

Here, $x_{k}$ is the output of the $k^{th}$ neuron in the hidden layer.
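As a minimal sketch of the structure described above (one input layer, one sigmoid hidden layer, one output layer), the forward pass can be written in a few lines of Python. The function names, the toy weights, and the linear output layer are assumptions made for illustration; the text only specifies the hidden-layer activation in (1).

```python
import math

def sigmoid(x):
    """Sigmoid activation from (1)."""
    return 1.0 / (1.0 + math.exp(-x))

def forward(x, w_hidden, b_hidden, w_out, b_out):
    """Forward pass of a one-hidden-layer feedforward network.

    x        : list of m inputs
    w_hidden : h rows of m weights, b_hidden : h biases
    w_out    : n rows of h weights, b_out    : n biases
    The linear output layer is an assumption, not stated in the text.
    """
    hidden = [sigmoid(sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(w_hidden, b_hidden)]
    return [sum(w * h for w, h in zip(row, hidden)) + b
            for row, b in zip(w_out, b_out)]

# Hypothetical toy network: m = 3 inputs, h = 2 hidden neurons, n = 1 output.
y = forward([1.0, 0.2, 0.1],
            w_hidden=[[0.0, 0.0, 0.0], [0.5, -0.5, 0.0]],
            b_hidden=[0.0, 0.0],
            w_out=[[1.0, 0.0]],
            b_out=[0.5])
```

In the study, m and n follow from the number of metered inputs and node voltages of each phase, and the hidden-layer size is chosen by the user.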

2.2. Training of the Neural Network

The ANN is trained using the Levenberg–Marquardt backpropagation algorithm. Under this algorithm, the gradient ($g$) is computed as follows:

g = J^{T} e

(2)

Here, $J$ is the Jacobian matrix containing the first derivatives of the network errors with respect to the weights and biases, and $e$ is a vector of network errors.

After calculating the Jacobian matrix and the gradient, the weights of the inputs ($W$) are updated as shown in (3).

W (k + 1) = W (k) - {[J^{T} J + μ I]}^{- 1} g

(3)

Here, $μ$ is a scalar value that dynamically changes to speed up convergence.
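To make the update in (2) and (3) concrete, the following sketch applies it to a hypothetical two-parameter line fit rather than to the ANN itself. The model, the data, and the fixed value of μ are assumptions chosen for illustration; in practice μ is adapted during training.

```python
def lm_step(w, xs, ys, mu):
    """One Levenberg-Marquardt update for the toy model y = a*x + b.

    Errors are e_i = a*x_i + b - y_i, so each Jacobian row is [x_i, 1].
    """
    a, b = w
    e = [a * x + b - y for x, y in zip(xs, ys)]
    # Gradient g = J^T e, as in (2)
    g0 = sum(x * ei for x, ei in zip(xs, e))
    g1 = sum(e)
    # Damped matrix J^T J + mu*I, as in (3)
    h00 = sum(x * x for x in xs) + mu
    h01 = sum(xs)
    h11 = len(xs) + mu
    # Solve the 2x2 system (J^T J + mu*I) * delta = g in closed form
    det = h00 * h11 - h01 * h01
    d0 = (h11 * g0 - h01 * g1) / det
    d1 = (h00 * g1 - h01 * g0) / det
    return [a - d0, b - d1]

xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]   # generated from y = 2x + 1
w = [0.0, 0.0]
for _ in range(20):
    w = lm_step(w, xs, ys, mu=0.01)
```

A small μ makes the step close to a Gauss-Newton step, while a large μ makes it behave like a short gradient-descent step, which is why μ is adjusted as training progresses.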

The training data for ANN was divided into three parts: 70% of the data was used for training the network, 15% was for network validation, and the remaining 15% was for testing the trained network. The training data was obtained by running power flow studies using the simulation software OpenDSS (version 10.1.0.1).

Different numbers of neurons in the hidden layer were tested to identify the most suitable neural network, with the mean squared error (MSE) defined in (4) used to compare them.

MSE = \frac{1}{C} \sum_{i = 1}^{C} {(Y_{i} - \hat{Y_{i}})}^{2}

(4)

Here, $C$ is the total number of outputs, and $Y_{i}$ and $\hat{Y_{i}}$ are the actual and predicted outputs, respectively.
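The 70/15/15 division of the data and the MSE in (4) can be sketched as follows. The helper names are hypothetical, and how individual samples are assigned to each subset is not specified in the text, so only the subset sizes are computed here.

```python
def split_counts(k, train=70, val=15):
    """Sizes of the training, validation, and test subsets for k samples."""
    n_train = k * train // 100
    n_val = k * val // 100
    return n_train, n_val, k - n_train - n_val

def mse(actual, predicted):
    """Mean squared error over C outputs, as in (4)."""
    c = len(actual)
    return sum((y - yh) ** 2 for y, yh in zip(actual, predicted)) / c

counts = split_counts(1000)
# Two per-unit voltages: one predicted exactly, one off by 0.02 p.u.
err = mse([1.0, 1.02], [1.0, 1.0])
```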

The cross-validation technique has been used here to avoid overfitting. Under this method, 15% of the training data is considered for the validation dataset which is used to ensure that the trained model performs well without causing overfitting. Moreover, one of the stopping criteria for the Levenberg–Marquardt algorithm (algorithm used for network training) is the validation check in which the network training stops if the number of validation checks exceeds 6.

2.3. Inputs and Outputs for Training the Neural Network

The inputs for training the neural network, $X_{train}$, include the substation voltage and active and reactive power of the load nodes in the power distribution network, which are assumed to be obtained through the metering system. The first row of the matrix is the substation voltage. The remaining rows include the net active and reactive power of load nodes as shown in (5).

X_{train} = {\begin{bmatrix} V_{S1} & V_{S2} & \dots & V_{SK} \\ P_{11}^{net} & P_{12}^{net} & \dots & P_{1K}^{net} \\ Q_{11}^{net} & Q_{12}^{net} & \dots & Q_{1K}^{net} \\ \vdots & \vdots & \ddots & \vdots \\ P_{N1}^{net} & P_{N2}^{net} & \dots & P_{NK}^{net} \\ Q_{N1}^{net} & Q_{N2}^{net} & \dots & Q_{NK}^{net} \end{bmatrix}}_{(2N + 1) \times K}

(5)

Here, $V_{Sk}$ is the substation voltage at time moment $k$ (from 1 to the total number of moments $K$). $P_{nk}^{net}$ and $Q_{nk}^{net}$ are the net active and reactive power at node $n$ (from 1 to the total number of nodes $N$) and moment $k$, which are obtained as

P_{nk}^{net} = P_{nk} - P_{nk}^{gen}

(6)

Q_{nk}^{net} = Q_{nk} - Q_{nk}^{gen}

(7)

$P_{nk}$ and $Q_{nk}$ are the load active and reactive power, whereas $P_{nk}^{gen}$ and $Q_{nk}^{gen}$ are the active and reactive power generated by the DG connected at node $n$.

The outputs, the ground truth, for training the neural network are the node voltages, which are assumed to be obtained through the metering system and are

Y_{train} = {\begin{bmatrix} V_{11} & V_{12} & \dots & V_{1K} \\ V_{21} & V_{22} & \dots & V_{2K} \\ \vdots & \vdots & \ddots & \vdots \\ V_{N1} & V_{N2} & \dots & V_{NK} \end{bmatrix}}_{(N \times K)}

(8)

All the inputs and outputs are in per unit, a commonly used normalization technique in power system analysis, and no additional normalization process is performed.
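The layout of the training matrices in (5)–(8) can be illustrated with a short sketch. The function name and the sample per-unit values are hypothetical; the row ordering (substation voltage first, then alternating net active and reactive power per node) follows (5).

```python
def build_training_matrices(v_sub, p_load, q_load, p_gen, q_gen, v_node):
    """Assemble X_train ((2N+1) x K) and Y_train (N x K) as nested lists.

    v_sub  : K substation voltages
    p_load, q_load, p_gen, q_gen, v_node : N lists of K values each
    """
    x_train = [list(v_sub)]
    for n in range(len(p_load)):
        # Net injections from (6) and (7)
        x_train.append([p - g for p, g in zip(p_load[n], p_gen[n])])
        x_train.append([q - g for q, g in zip(q_load[n], q_gen[n])])
    y_train = [list(row) for row in v_node]
    return x_train, y_train

# Hypothetical data: N = 2 nodes, K = 3 time moments, all in per unit.
x_train, y_train = build_training_matrices(
    v_sub=[1.0, 1.0, 1.0],
    p_load=[[0.5, 0.5, 0.5], [0.25, 0.25, 0.25]],
    q_load=[[0.25, 0.25, 0.25], [0.125, 0.125, 0.125]],
    p_gen=[[0.25, 0.0, 0.0], [0.0, 0.0, 0.0]],
    q_gen=[[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]],
    v_node=[[0.99, 0.98, 0.98], [0.98, 0.97, 0.97]],
)
```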

2.4. Trained Neural Network

When successfully trained, a power distribution network is represented by a neural network function called $net$. The function, when given the input values in the same order as given during network training, will yield outputs in the same order as the outputs obtained during network training. The $net$ function is shown in (9).

Y_{out} = net(X_{in})

(9)

Here, $X_{in}$ is the input and $Y_{out}$ is the output. The sensitivity of the output versus input change can be calculated based on the perturbation of the input. For this study, the ANN inputs were changed by 1% to calculate the sensitivity of each output with respect to the inputs.
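The perturbation-based sensitivity calculation can be sketched as a forward finite difference around the operating point. Here `toy_net` is a hypothetical stand-in for the trained network, and the fallback to an absolute step when an input is exactly zero is an assumption, since a 1% relative change is undefined there.

```python
def sensitivity(net, x_in, rel_step=0.01):
    """Estimate S[i][j] = dY_i / dX_j by perturbing each input by 1%."""
    y0 = net(x_in)
    cols = []
    for j, xj in enumerate(x_in):
        dx = rel_step * xj
        if dx == 0.0:
            dx = rel_step  # assumed absolute step for zero-valued inputs
        x_pert = list(x_in)
        x_pert[j] = xj + dx
        y1 = net(x_pert)
        cols.append([(a - b) / dx for a, b in zip(y1, y0)])
    # Transpose so that rows index outputs and columns index inputs
    return [[cols[j][i] for j in range(len(x_in))] for i in range(len(y0))]

def toy_net(x):
    """Hypothetical linear stand-in for the trained network in (9)."""
    return [1.0 + 0.5 * x[1], 2.0 * x[0] - x[2]]

S = sensitivity(toy_net, [1.0, 0.2, 0.1])
```

For the control algorithm, only the columns corresponding to the reactive power inputs of DG nodes are needed.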

3. Model-Free Cooperative Control

This section presents the VVC problem and describes the cooperative control algorithm using the voltage sensitivity versus power injection derived in Section 2 and without utilizing the power grid model.

3.1. Reactive Power Utilization Ratio ($α_{q_{i}}$)

The proportion of reactive power supplied by a DG is denoted by the reactive power utilization ratio ($α_{q_{i}}$), i.e., the decision variable, shown in (10).

α_{q_{i}} = \frac{Q_{G_{i}}}{\bar{Q_{G_{i}}}}

(10)

$Q_{G_{i}}$ is the reactive power generated by the $i^{th}$ DG. The available reactive power capacity of a DG is denoted by $\bar{Q_{G_{i}}}$, which is calculated using its rated apparent power and the active power determined by solar irradiance. Positive and negative values of $α_{q_{i}}$ denote generation and consumption of reactive power, respectively. The range of $α_{q_{i}}$ is from −1 to 1.
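A small numerical sketch of (10) may help. The function names and the 500 kVA / 300 kW figures are hypothetical, and returning zero when no reactive headroom remains is an assumption rather than something stated in the text.

```python
import math

def available_q(s_rated, p_active):
    """Available reactive capacity from rated apparent power and active power."""
    return math.sqrt(max(s_rated ** 2 - p_active ** 2, 0.0))

def utilization_ratio(q_gen, s_rated, p_active):
    """Reactive power utilization ratio from (10), limited to [-1, 1]."""
    q_bar = available_q(s_rated, p_active)
    if q_bar == 0.0:
        return 0.0  # assumed convention when the inverter has no headroom
    return max(-1.0, min(1.0, q_gen / q_bar))

# Hypothetical inverter: 500 kVA rating, 300 kW active output.
q_bar = available_q(500.0, 300.0)                    # 400 kvar available
alpha_gen = utilization_ratio(100.0, 500.0, 300.0)   # supplying 100 kvar
alpha_abs = utilization_ratio(-600.0, 500.0, 300.0)  # request beyond capacity
```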

3.2. Objective Function

The cooperative control method aims to minimize the objective function shown in (11), which includes the voltage deviation from the desired value (1 per unit is used here) at selected nodes and the reactive power generation/consumption by the DGs to reduce inverter aging. This is achieved by fairly utilizing the reactive power of the DGs. A node without a DG is referred to as a non-DG node.

F = \sum_{i = 1}^{N_{D G}} f_{i} + \sum_{k = 1}^{N_{N D G}} f_{k} = \sum_{i = 1}^{N_{D G}} [w_{v_{D G}} {(V_{i} - 1)}^{2} + w_{α} α_{q_{i}}^{2} {\bar{Q_{G_{i}}}}^{2}] + \sum_{k = 1}^{N_{N D G}} w_{v_{N D G}} {(V_{k} - 1)}^{2}

(11)

Here,

$V_{i}$ is the per unit voltage at DG node $i$;
$V_{k}$ is the per unit voltage at non-DG node $k$;
$w_{v_{D G}}$ is the weight associated with voltage deviation minimization at DG nodes;
$w_{v_{N D G}}$ is the weight associated with voltage deviation minimization at non-DG nodes;
$w_{α}$ is the weight associated with minimizing the generation/consumption level of the reactive power;
$N_{D G}$ is the total number of DGs;
$N_{N D G}$ is the total number of non-DG nodes.

In (11), the first part of the cost function $f_{i}$ relates to voltage deviation from the desired value, and the second part relates to the reactive power generation/consumption of DGs. In the second part of $f_{i}$, $α_{q_{i}}^{2} {\bar{Q_{G_{i}}}}^{2}$ has been used instead of $α_{q_{i}} \bar{Q_{G_{i}}}$ so that consumption of reactive power by a DG, for which $α_{q_{i}}$ is negative, is penalized in the same way as generation. For non-DG nodes, only the voltage deviation is considered.
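The objective in (11) can be evaluated with a few lines of code. This is only a sketch with hypothetical argument names and sample values; the weights used correspond to the pattern $w_{v_{DG}} = w_{v_{NDG}} = 1$, $w_{α} = 0.1$.

```python
def objective(v_dg, alpha, q_bar, v_ndg, w_v_dg, w_v_ndg, w_alpha):
    """Evaluate F in (11) for per-unit voltages and utilization ratios."""
    f_dg = sum(w_v_dg * (v - 1.0) ** 2 + w_alpha * a ** 2 * qb ** 2
               for v, a, qb in zip(v_dg, alpha, q_bar))
    f_ndg = sum(w_v_ndg * (v - 1.0) ** 2 for v in v_ndg)
    return f_dg + f_ndg

# Hypothetical single DG node and single non-DG node.
f_supply = objective([1.5], [0.5], [2.0], [0.5], 1.0, 1.0, 0.1)
f_absorb = objective([1.5], [-0.5], [2.0], [0.5], 1.0, 1.0, 0.1)
```

Because the utilization ratio enters squared, supplying and absorbing the same amount of reactive power contribute equally to the cost.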

3.3. Inputs and Outputs for the ANN

The inputs for the ANN at time moment $k$, i.e., $X_{k}$, comprise the substation voltage and net active and reactive power at load nodes, as shown in (12).

X_{k} = {\begin{bmatrix} V_{Sk} \\ P_{1k}^{net} \\ Q_{1k}^{net} \\ \vdots \\ P_{Nk}^{net} \\ Q_{Nk}^{net} \end{bmatrix}}_{(2N + 1) \times 1}; \quad k = 1, 2, 3, \dots, N_{sec}

(12)

Here, $N_{sec}$ is the total number of seconds considered for the study. $P_{nk}^{net}$ and $Q_{nk}^{net}$ are the net active and reactive power as obtained in (13) and (14), respectively.

P_{nk}^{net} = \begin{cases} P_{nk}, & \text{for non-DG nodes} \\ P_{nk} - λ_{k} P_{DGR_{i}}, & \text{for DG nodes} \end{cases}

(13)

All the DGs in this study are assumed to be solar PVs. $λ_{k}$ is the solar irradiance at moment $k$, scaled between 0 and 1. $P_{DGR_{i}}$ is the rated active power of the $i^{th}$ DG.

Q_{nk}^{net} = \begin{cases} Q_{nk}, & \text{for non-DG nodes} \\ Q_{nk} - α_{q_{i}} \sqrt{S_{DG_{i}}^{2} - {(λ_{k} P_{DGR_{i}})}^{2}}, & \text{for DG nodes} \end{cases}

(14)

$P_{nk}$ and $Q_{nk}$ are the active and reactive power of the load at node $n$ at moment $k$. $S_{DG_{i}}$ is the apparent power of the $i^{th}$ DG.

The outputs for the ANN are the node voltages as shown in (15).

Y_{k} = {\begin{bmatrix} V_{1k} \\ V_{2k} \\ \vdots \\ V_{Nk} \end{bmatrix}}_{(N \times 1)}; \quad k = 1, 2, 3, \dots, N_{sec}

(15)

3.4. Communication Topology

In cooperative control, the communication network between the participating nodes is defined using the communication topology matrices shown in (16) and (17).

The communication from DG node $j$ to $i$ is represented by a square matrix $d_{ij}$ of order ($N_{DG} \times N_{DG}$), shown in (16).

d_{ij} = \begin{cases} 0, & \text{no communication between nodes } i \text{ and } j \\ \frac{W_{ij} S_{ij}}{\sum_{l = 1}^{N_{DG}} (W_{il} S_{il})}, & \text{information flows from node } j \text{ to } i \end{cases}

(16)

Here, $W_{ij} > 0$ is the weight of the communication channel between nodes $i$ and $j$. For a symmetric system, where all channels have equal weight, $W_{ij}$ is one. $S_{ij}$ represents the communication status between nodes $i$ and $j$. $S_{ij}$ is one if there is communication between nodes $i$ and $j$ and is zero otherwise.

The information flowing from a non-DG node $k$ to a DG node $i$ is denoted by the matrix $d_{NDG\,ik}$, shown in (17), of order ($N_{DG} \times N_{NDG}$).

d_{NDG\,ik} = \begin{cases} 0, & \text{no communication between nodes } i \text{ and } k \\ \frac{W_{ik} S_{ik}}{\sum_{l = 1}^{N_{NDG}} (W_{il} S_{il})}, & \text{information flows from node } k \text{ to } i \end{cases}

(17)

Here, $W_{ik} > 0$ represents the weight of the communication channel between nodes $i$ and $k$. For a symmetric system, $W_{ik}$ is one. $S_{ik}$ represents the communication status between nodes $i$ and $k$. $S_{ik}$ is one if there is communication between nodes $i$ and $k$ and is zero otherwise.
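The row normalization in (16) and (17) can be sketched as below. The three-node status matrix is a hypothetical example in which each DG uses its own value and that of one other DG; it is not taken from the topology in the paper.

```python
def comm_matrix(weights, status):
    """Row-normalized communication matrix following (16) and (17).

    weights[i][j] > 0 is the channel weight and status[i][j] is 1 when
    node i receives information from node j, otherwise 0.
    """
    d = []
    for w_row, s_row in zip(weights, status):
        total = sum(w * s for w, s in zip(w_row, s_row))
        d.append([w * s / total if total > 0 else 0.0
                  for w, s in zip(w_row, s_row)])
    return d

# Hypothetical three-DG example with equal channel weights.
W = [[1.0, 1.0, 1.0]] * 3
S_status = [[1, 0, 1],
            [1, 1, 0],
            [0, 1, 1]]
d = comm_matrix(W, S_status)
```

Each row with at least one active link sums to one, so the first term of a consensus-type update is a weighted average of the received values.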

Although there are various possible communication topologies when considering the information flow direction and nature of interaction between DGs, a unidirectional intra-phase DG communication has been assumed for this study, as shown in Figure 2. This communication topology is called Unidirectional Individual Cooperative Control (UDICC). In the figure, DG1–DG3 belong to phase A, whereas DG4–DG6 and DG7–DG9 belong to phase B and C, respectively. In this study, selected non-DG nodes are communicating with each of the DG nodes.

3.5. Gradient Components

The change in objective function $f_{i}$ with respect to change in $α_{q_{i}}$, i.e., $g_{ii}$, is calculated in (18). The change in objective function $f_{k}$ with respect to the change in $α_{q_{i}}$, i.e., $g_{ki}$, is calculated as shown in (19).

g_{ii} = \frac{\partial f_{i}}{\partial α_{q_{i}}} = 2 w_{v_{DG}} (V_{i} - 1) \bar{Q_{G_{i}}} S_{V_{ii}} + 2 w_{α} α_{q_{i}} {\bar{Q_{G_{i}}}}^{2}

(18)

Here, $S_{V_{ii}}$ is the voltage sensitivity at node $i$ with respect to the reactive power at node $i$.

g_{ki} = \frac{\partial f_{k}}{\partial α_{q_{i}}} = 2 w_{v_{NDG}} (V_{k} - 1) \bar{Q_{G_{i}}} S_{V_{ki}}

(19)

Here, $S_{V_{ki}}$ is the voltage sensitivity at node $k$ with respect to the reactive power at node $i$.

The voltage sensitivity shown in (18) and (19) is calculated based on the ANN model as shown in (9).

Thus, the derivative of the objective function with respect to $α_{q_{i}}$ is

g_{i} = g_{ii} + \sum_{k = 1}^{N_{NDG}} d_{NDG\,ik} \, g_{ki}

(20)
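Equations (18)–(20) translate directly into code once the sensitivities are available. In the sketch below all argument names and numerical values are hypothetical; `s_dg[i]` plays the role of $S_{V_{ii}}$ and `s_ndg[k][i]` the role of $S_{V_{ki}}$.

```python
def gradient(i, v_dg, alpha, q_bar, v_ndg, s_dg, s_ndg, d_ndg,
             w_v_dg, w_v_ndg, w_alpha):
    """Gradient g_i of the objective with respect to alpha_i, per (18)-(20)."""
    # Local term (18)
    g_ii = (2.0 * w_v_dg * (v_dg[i] - 1.0) * q_bar[i] * s_dg[i]
            + 2.0 * w_alpha * alpha[i] * q_bar[i] ** 2)
    # Non-DG terms (19), weighted by the communication matrix as in (20)
    g_sum = sum(d_ndg[i][k]
                * 2.0 * w_v_ndg * (v_ndg[k] - 1.0) * q_bar[i] * s_ndg[k][i]
                for k in range(len(v_ndg)))
    return g_ii + g_sum

# Hypothetical case: one DG node at 1.04 p.u., one non-DG node at 0.98 p.u.
g = gradient(0,
             v_dg=[1.04], alpha=[0.5], q_bar=[0.3],
             v_ndg=[0.98], s_dg=[0.05], s_ndg=[[0.04]], d_ndg=[[1.0]],
             w_v_dg=1.0, w_v_ndg=1.0, w_alpha=0.1)
```

In this example the overvoltage at the DG node and the reactive power penalty push the gradient positive, while the undervoltage at the non-DG node contributes a small negative term.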

3.6. Gradient Gain

The gain for $g_{i}$ is calculated using the adaptive gradient method [31] as follows:

β_{i} = L_{i} {[g_{i}^{2} + ξ]}^{-\frac{1}{2}}

(21)

Here, $L_{i}$ is the learning rate for the $i^{th}$ DG, and $ξ$ is a small value employed to avoid division by zero.
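A direct transcription of (21) shows how the gain scales the step. The default learning rate of 0.04 matches the value reported in Section 4.3, whereas the value of ξ is an assumption, since the text does not give one.

```python
import math

def gradient_gain(g_i, learning_rate=0.04, xi=1e-8):
    """Adaptive gain from (21): beta_i = L_i / sqrt(g_i^2 + xi)."""
    return learning_rate / math.sqrt(g_i ** 2 + xi)

step_large = gradient_gain(0.5) * 0.5   # step for a large gradient
beta_zero = gradient_gain(0.0)          # finite gain thanks to xi
```

When the gradient magnitude is much larger than the square root of ξ, the product of gain and gradient is close to the learning rate in magnitude, so the step size is largely independent of the gradient scale.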

3.7. Updating Reactive Power Utilization Ratio Based on Consensus Algorithm

An optimal $α_{q_{i}}$ for the $i^{th}$ DG is achieved through a consensus algorithm-based iterative process, as shown in (22).

α_{q_{i}}(n + 1) = \left( \sum_{l = 1}^{N_{DG}} d_{il} \, α_{q_{l}}(n) \right) - β_{i} g_{i}

(22)

As per the consensus algorithm, cooperative control aims to regulate voltage by equitably utilizing each DG’s reactive power. The convergence criterion for the iterative process is that the utilization ratio between consecutive iterations no longer changes significantly, i.e., the difference between the utilization ratio at consecutive iterations is less than a predefined tolerance (0.001 in this study) [32].
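One iteration of (22) together with the stopping test can be sketched as follows. The function names are hypothetical, the value of ξ is assumed, and clipping the result to the range −1 to 1 inside the update is an assumption about how the limit on the utilization ratio is enforced.

```python
import math

def consensus_step(alpha, d, g, learning_rate=0.04, xi=1e-8):
    """Apply (22) once: weighted average of received ratios minus a gradient step."""
    new_alpha = []
    for i in range(len(alpha)):
        mix = sum(d[i][l] * alpha[l] for l in range(len(alpha)))
        beta = learning_rate / math.sqrt(g[i] ** 2 + xi)   # gain from (21)
        value = mix - beta * g[i]
        new_alpha.append(max(-1.0, min(1.0, value)))       # assumed clipping
    return new_alpha

def converged(alpha_new, alpha_old, tol=1e-3):
    """Stop when no utilization ratio changes by more than the tolerance."""
    return max(abs(a - b) for a, b in zip(alpha_new, alpha_old)) < tol

# Hypothetical two-DG example with full, equally weighted communication.
d = [[0.5, 0.5], [0.5, 0.5]]
averaged = consensus_step([0.2, 0.4], d, g=[0.0, 0.0])
stepped = consensus_step([0.2, 0.4], d, g=[1.0, -1.0])
```

With zero gradients the update reduces to averaging, which is the mechanism that drives the DGs toward equal utilization; in a full run the gradients would be recomputed from the ANN-based sensitivities at every iteration.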

4. Results and Discussions

4.1. Power Distribution Network

This study was performed for a modified IEEE 13-bus system shown in Figure 3. The system contains nine DGs: three single-phase DGs on each of buses 670 and 671 and one single-phase DG on each of buses 645, 646, and 692. The DGs considered in this study are assumed to be single-phase elements to address the phase voltage imbalance.

4.2. Inputs and Outputs for the Neural Networks for the Modified IEEE 13-Bus System

As mentioned in Section 2.1, a separate neural network has been created for each phase of the modified 13-bus system shown in Figure 3. This has been performed with the aim of implementing cooperative control on the phase level [8,9]. The nodes that have a DG and/or a load connected to them are the nodes providing inputs and outputs to the neural networks. The inputs and outputs are obtained as shown in (12)–(15). The substation bus for the modified IEEE 13-bus system is the bus RG60. Table 1 shows the list of buses to be included in the neural network.

4.3. Model-Free Cooperative Control

This study horizon is one hour, i.e., 3600 s. The solar PVs connected to the distribution system are rated at 300 kW each and follow the second-based irradiance profile, scaled between 0 and 1, as shown in Figure 4. The irradiance profile displaying significant fluctuations within an hour was chosen from the irradiance data released by the National Renewable Energy Laboratory (NREL) [33].

The number of neurons in the hidden layer is changed from 5 to 13 to find the optimal number of neurons in the hidden layer in the ANN. The MSE for 5 to 13 neurons for the phase A network was 1.59 × 10⁻⁷, 9.81 × 10⁻⁸, 8.41 × 10⁻⁸, 7.92 × 10⁻⁸, 7.79 × 10⁻⁸, 7.66 × 10⁻⁸, 7.54 × 10⁻⁸, 6.5 × 10⁻⁸, and 6.49 × 10⁻⁸, respectively. As can be seen, after 12 neurons, the MSE change is insignificant, and we choose 13 neurons. The MSEs for phases B and C reflect a similar trend.

Table 2 shows the weights considered for the objective function shown in (11) for implementing the method. Case 1 tries to minimize voltage deviation by harnessing the reactive power capacity of inverters. Case 2 limits the utilization of the inverters’ reactive power capability. Case 3 tries to reduce both voltage deviation and inverters’ reactive power operation. Note that the voltage-related terms and reactive power-related terms in the objective function (11) are not directly comparable, although we add them together to demonstrate a simple way of considering both factors. How we form a more practical objective function and choose appropriate weights for the voltage terms and reactive power terms in the objective function will depend on practical application considerations and warrant further research. A learning rate of 0.04 is used in this study. The typical iteration number required for convergence for all cases was between 19 and 25 for a tolerance of 0.001.

Selected DG and non-DG nodes are chosen to display the results for the studied cases. For the DG node, phase A of bus 692 is selected, whereas phase A of bus 675 is considered for the non-DG node. The results for phases B and C are similar to those of phase A.

The base voltage profile considered for all cases is obtained for the scenario where all DGs have a unity power factor, i.e., the reactive power of all the DGs is zero.

Figure 5 depicts the $α_{q_{i}}$ for all three cases described in Table 2. Case 1 focuses solely on minimizing the voltage deviation. As a result, the $α_{q_{i}}$ for the DGs can take any value deemed necessary to ideally achieve a unity voltage profile. Case 2 focuses solely on minimizing the generation/consumption of the reactive power of DGs. Thus, the $α_{q_{i}}$ will take a zero value. In case 3, both voltage deviation and reactive power utilization reduction are considered. As a result, the $α_{q_{i}}$ for case 3 is smaller than that of case 1 but higher than case 2.

Figure 6 shows the voltage profile for all three cases at phase A of bus 692, i.e., node 692.1. It can be observed that the best voltage profile is obtained for case 1, as utilization of inverter reactive power is not intentionally restricted apart from rating limits. The voltage profile for case 2 aligns with the base voltage profile because the inverter reactive power utilization is intentionally limited to zero. The voltage profile for case 3 is better than that of case 2 but inferior to that of case 1 due to the trade-off in the objective between voltage deviation and reactive power utilization.

Figure 7 shows the voltage profile at phase A of bus 675, namely node 675.1, being a non-DG node. It can be observed that the non-DG node also exhibits similar behavior to that shown in Figure 6.

4.4. Comparative Analysis of Model-Based and Model-Free Cooperative Control

This section presents the comparison between the model-free and model-based methods. This comparative study has been performed for the modified IEEE 13-bus system shown in Figure 3. The model-based method is implemented for the aforesaid distribution system using MATLAB R2023a and OpenDSS (version 10.1.0.1). The same irradiance profile as shown in Figure 4 and the same cases shown in Table 2 are studied. The same learning rate is adopted. The DG and non-DG nodes considered for the study shown in Section 4.3 are selected for this analysis as well. Moreover, the MSE of each node voltage profile has been calculated using 1.0 per unit as the target voltage.

4.4.1. Case 1—Minimizing the Voltage Deviation $(w_{v_{D G}} = w_{v_{N D G}} = 1, w_{α} = 0)$

Figure 8 and Figure 9 show the comparison between the voltage profile obtained for DG node 692.1 and non-DG node 675.1, respectively, using model-based and model-free methods for case 1. The MSE obtained for node 692.1 for the model-based and model-free systems is 2.87 × 10⁻⁵ and 2.18 × 10⁻⁵, respectively.

The voltage profiles for case 1 resulting from both methods are very similar, indicating that the model-free method can deliver effective voltage regulation as well. The MSE obtained for node 675.1 for model-based and model-free systems is 2.51 × 10⁻⁶ and 1.82 × 10⁻⁶, respectively.

4.4.2. Case 2—Minimizing the Generation/Consumption of Reactive Power at DG Nodes $(w_{v_{D G}} = w_{v_{N D G}} = 0, w_{α} = 1)$

Figure 10 and Figure 11 show the voltage profiles obtained for case 2, where the emphasis is on minimizing the supply/consumption of reactive power, leading to minimal voltage regulation. It can be observed that the voltage profiles for model-free and model-based approaches exhibit similarity with minor deviations observed across the time horizon.

The MSE obtained for node 692.1 for model-based and model-free systems is 4.50 × 10⁻⁴ and 9.80 × 10⁻⁴, respectively, and the MSE for node 675.1 is 6.76 × 10⁻⁴ and 13.00 × 10⁻⁴, respectively.

4.4.3. Case 3—Reducing Both the Reactive Power at DG Nodes and Voltage Deviation at DG and Non-DG Nodes $(w_{v_{D G}} = w_{v_{N D G}} = 1, w_{α} = 0.1)$

This section presents the voltage profiles for DG and non-DG nodes for case 3, where both voltage deviation and reactive power utilization limitation are considered. The voltage profiles shown in Figure 12 and Figure 13 display a high level of similarity with minor deviations between the methods. The results again show that the model-free method can improve voltage profiles and may be used as a potential alternative method when the power grid model is not available.

The MSE obtained for node 692.1 for model-based and model-free systems is 7.05 × 10⁻⁵ and 2.65 × 10⁻⁴, respectively, and the MSE for node 675.1 is 1.62 × 10⁻⁴ and 4.68 × 10⁻⁴, respectively.

5. Conclusions

This paper presented a model-free cooperative control method for regulating voltage and increasing the lifetime of the PV inverters through reactive power optimization. A feedforward neural network is designed to capture the relationship between the node voltage and nodal power injections for a modified IEEE 13-bus system. The results obtained using model-free cooperative control have shown improved voltage profiles in comparison to those without harnessing the reactive power control capability of inverters. Comparative analysis is also performed between model-based and model-free control methods. The model-free method performs similarly to the model-based method. The results have demonstrated that the proposed method may be used as a feasible alternative for voltage control in the absence of the distribution network model. It is noted that the proposed method requires real-world measurements obtained from the metering system to train the ANN. More studies may be performed in the future to examine the requirements on the number and location of meters and the impacts of measurement errors, and to further study the method on larger power grids.

Author Contributions

Conceptualization, Y.L. and A.M.C.; methodology, Y.L., G.Y. and A.M.C.; software, G.Y. and Y.L.; validation, G.Y. and Y.L.; formal analysis, G.Y., Y.L. and A.M.C.; writing—original draft preparation, G.Y. and Y.L.; writing—review and editing, G.Y., Y.L. and A.M.C.; supervision, Y.L.; funding acquisition, Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

This material is based upon work supported by the U.S. Department of Energy’s Office of Electricity under the award Number DE-OE0000989.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest. This paper was prepared as an account of work sponsored by an agency of the United States Government. Neither the United States Government nor any agency thereof, nor any of their employees, makes any warranty, express or implied, or assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or represents that its use would not infringe privately owned rights. Reference herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise does not necessarily constitute or imply its endorsement, recommendation, or favoring by the United States Government or any agency thereof. The views and opinions of authors expressed herein do not necessarily state or reflect those of the United States Government or any agency thereof.

References

Tunçel, S.; Gözel, T. The robust curve-fitting based method for coordinated voltage regulation with distributed generator and OLTC in distribution network. Alex. Eng. J. 2023, 75, 243–259.
Li, J.; Huo, Q.; Yin, J.; Liu, Q.; Sun, L.; Wei, T. Study on coordinated voltage regulation strategy of flexible on-load tap changer and distributed generator. Energy Rep. 2022, 8, 601–609.
Kabir, F.; Yu, N.; Gao, Y.; Wang, W. Deep reinforcement learning-based two-timescale Volt-VAR control with degradation-aware smart inverters in power distribution systems. Appl. Energy 2023, 335, 120629.
Hu, J.; Ye, C.; Ding, Y.; Tang, J.; Liu, S. A distributed MPC to exploit reactive power V2G for real-time voltage regulation in distribution networks. IEEE Trans. Smart Grid 2022, 13, 576–588.
Haghi, H.V.; Qu, Z. A Kernel-based predictive model of EV capacity for distributed voltage control and demand response. IEEE Trans. Smart Grid 2018, 9, 3180–3190.
Shang, C.; Fu, L.; Bao, X.; Xiao, H.; Xu, X.; Hu, Q. Dynamic joint optimization of power generation and voyage scheduling in ship power system based on deep reinforcement learning. Electr. Power Syst. Res. 2024, 229, 110165.
Xiong, M.; Yang, X.; Zhang, Y.; Wu, H.; Lin, Y.; Wang, G. Reactive power optimization in active distribution systems with soft open points based on deep reinforcement learning. Int. J. Electr. Power Energy Syst. 2024, 155, 109601.
Yadav, G.; Liao, Y.; Jewell, N.; Ionel, D.M. Cooperative control for mitigation of voltage fluctuations in power distribution systems. In Proceedings of the 2023 North American Power Symposium, NAPS 2023, Asheville, NC, USA, 15–17 October 2023.
Yadav, G.; Liao, Y.; Jewell, N.; Ionel, D.M. Dual-layer voltage and var control for power distribution systems. In Proceedings of the IEEE Power and Energy Society General Meeting, Seattle, WA, USA, 21–25 July 2024.
Maknouninejad, A.; Qu, Z. Realizing unified microgrid voltage profile and loss minimization: A cooperative distributed optimization and control approach. IEEE Trans. Smart Grid 2014, 5, 1621–1630.
Zhang, Y.; Wang, J.; Li, Z. Interval state estimation with uncertainty of distributed generation and line parameters in unbalanced distribution systems. IEEE Trans. Power Syst. 2020, 35, 762–772.
Cavus, M. Advancing power systems with renewable energy and intelligent technologies: A comprehensive review on grid transformation and integration. Electronics 2025, 14, 1159.
Qiu, W.; Yadav, A.; You, S.; Dong, J.; Kuruganti, T.; Liu, Y.; Yin, H. Neural networks-based inverter control: Modeling and adaptive optimization for smart distribution networks. IEEE Trans. Sustain. Energy 2024, 15, 1039–1049.
Guo, L.; Liu, Z.; Wang, Z.; Li, X.; Liu, Y.; Zhang, Y. Model-free optimal volt-var control of wind farm based on data-driven lift-dimension linear power flow. CSEE J. Power Energy Syst. 2025, 11, 91–101.
Li, S.; Wu, W.; Xu, J.; Dong, J.; Wang, F.; Yuan, Q. Zeroth-order feedback optimization for inverter-based volt-var control in wind farm. In Proceedings of the 2024 9th International Conference on Power and Renewable Energy, ICPRE 2024, Guangzhou, China, 20–23 September 2024; pp. 1384–1389.
Ren, H.; Jha, R.R.; Dubey, A.; Schulz, N.N. Extremum-seeking adaptive-droop for model-free and localized volt-var optimization. IEEE Trans. Power Syst. 2022, 37, 179–190.
Bagheri, P.; Xu, W. Model-free volt-var control based on measurement data analytics. IEEE Trans. Power Syst. 2019, 34, 1471–1482.
Hong, L.; Wu, M.; Wang, Y.; Shahidehpour, M.; Chen, Z.; Yan, Z. MADRL-based DSO-Customer coordinated bi-level volt/var optimization method for power distribution networks. IEEE Trans. Sustain. Energy 2024, 15, 1834–1846.
Zhang, Y.; Wang, X.; Wang, J.; Zhang, Y. Deep reinforcement learning based volt-var optimization in smart distribution systems. IEEE Trans. Smart Grid 2021, 12, 361–371.
Liu, H.; Wu, W.; Wang, Y. Bi-level off-policy reinforcement learning for two-timescale volt/var control in active distribution networks. IEEE Trans. Power Syst. 2023, 38, 385–395.
Glover, D.; Dubey, A. Centralized coordination of DER smart inverters using deep reinforcement learning. In Proceedings of the 2023 IEEE Industry Applications Society Annual Meeting, IAS 2023, Nashville, TN, USA, 29 October–2 November 2023.
Hua, D.; Peng, F.; Liu, S.; Lin, Q.; Fan, J.; Li, Q. Coordinated volt/var control in distribution networks considering demand response via safe deep reinforcement learning. Energies 2025, 18, 333.
Cao, D.; Zhao, J.; Hu, W.; Yu, N.; Ding, F.; Huang, Q.; Chen, Z. Deep reinforcement learning enabled physical-model-free two-timescale voltage control method for active distribution systems. IEEE Trans. Smart Grid 2022, 13, 149–165.
Li, S.; Wu, W.; Lin, Y. Robust data-driven and fully distributed volt/var control for active distribution networks with multiple virtual power plants. IEEE Trans. Smart Grid 2022, 13, 2627–2638.
Jeon, S.; Nguyen, H.T.; Choi, D.H. Safety-integrated online deep reinforcement learning for mobile energy storage system scheduling and volt/var control in power distribution networks. IEEE Access 2023, 11, 34440–34455.
Hernández-Gómez, O.M.; Vieira, J.P.A.; Tabora, J.M.; Sales e Silva, L.E. Mitigating voltage drop and excessive step-voltage regulator tap operation in distribution networks due to electric vehicle fast charging. Energies 2024, 17, 4378.
Nguyen, H.T.; Choi, D.H. Three-stage inverter-based peak shaving and volt-var control in active distribution networks using online safe deep reinforcement learning. IEEE Trans. Smart Grid 2022, 13, 3266–3277.
Wang, C.; Li, C.; Li, Y.; Liu, J.; Ling, F.; Liu, Q. A reinforcement learning based voltage regulation strategy for active distribution networks. In Proceedings of the 2023 2nd Asian Conference on Frontiers of Power and Energy, ACFPE 2023, Chengdu, China, 20–22 October 2023; pp. 433–437.
Zeng, Y.; Jiang, S.; Konstantinou, G.; Pou, J.; Zou, G.; Zhang, X. Multi-objective controller design for grid-following converters with easy transfer reinforcement learning. IEEE Trans. Power Electron. 2025, 40, 6566–6577.
Zeng, Y.; Xiao, Z.; Liu, Q.; Liang, G.; Rodriguez, E.; Zou, G.; Zhang, X.; Pou, J. Physics-informed deep transfer reinforcement learning method for the input-series output-parallel dual active bridge-based auxiliary power modules in electrical aircraft. IEEE Trans. Transp. Electrif. 2025, 11, 6629–6639.
Villarraga, D. AdaGrad—Cornell University Computational Optimization Open Textbook—Optimization Wiki. Available online: https://optimization.cbe.cornell.edu/index.php?title=AdaGrad (accessed on 17 November 2024).
Ren, W.; Beard, R.W.; Atkins, E.M. Information consensus in multivehicle cooperative control. IEEE Control Syst. Mag. 2007, 27, 71–82.
Sengupta, M.; Andreas, A. Oahu Solar Measurement Grid (1-Year Archive): 1-Second Solar Irradiance; Oahu, Hawaii (Data). NREL Report No. DA-5500-56506. Available online: https://midcdmz.nrel.gov/apps/sitehome.pl?site=OAHUGRID (accessed on 18 May 2025).

Figure 1. Artificial neural network representing the power distribution network.

Figure 2. UDICC communication topology.

Figure 3. Modified IEEE 13-bus system.

Figure 4. Irradiance profile for solar DGs.

Figure 5. Reactive power utilization ratio ($α_{q i}$) for all cases for phase A of bus 692.

Figure 6. Voltage profile for all cases for phase A of bus 692.

Figure 7. Voltage profile for all cases for phase A of bus 675.

Figure 8. Voltage profile for DG node 692.1 for case 1.

Figure 9. Voltage profile for non-DG node 675.1 for case 1.

Figure 10. Voltage profile for DG node 692.1 for case 2.

Figure 11. Voltage profile for non-DG node 675.1 for case 2.

Figure 12. Voltage profile for DG node 692.1 for case 3.

Figure 13. Voltage profile for non-DG node 675.1 for case 3.

Table 1. Buses considered in the neural network for each phase.

Phase	DG Buses	Non-DG Buses
A	670, 671, 692	634, 652, 675
B	645, 670, 671	634, 646, 675
C	646, 670, 671	611, 634, 675, 692

Table 2. Weights considered for model-free cooperative control.

Case	$w_{v_{D G}}$	$w_{v_{N D G}}$	$w_{α}$	Objective
1	1	1	0	Minimizes the voltage deviation
2	0	0	1	Minimizes the generation and consumption of reactive power
3	1	1	0.1	Reduces both voltage deviation and reactive power generation and consumption
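To make the role of the weights in Table 2 concrete, the sketch below encodes the three cases and evaluates a weighted objective. The quadratic form used here (a weighted sum of squared voltage deviations at DG and non-DG nodes plus squared utilization ratios) is an assumption chosen for illustration; the objective actually used is the one defined in Section 3.2 and may differ. The names `CASE_WEIGHTS` and `weighted_objective`, the 1.0 p.u. reference, and the sample inputs are all hypothetical.

```python
import numpy as np

# Weight settings from Table 2, stored as (w_v_DG, w_v_NDG, w_alpha).
CASE_WEIGHTS = {
    1: (1.0, 1.0, 0.0),  # minimize voltage deviation
    2: (0.0, 0.0, 1.0),  # minimize reactive power generation/consumption
    3: (1.0, 1.0, 0.1),  # reduce both
}


def weighted_objective(case, v_dg, v_ndg, alpha_q, v_ref=1.0):
    """ASSUMED quadratic objective; the paper's exact form may differ."""
    w_dg, w_ndg, w_alpha = CASE_WEIGHTS[case]
    v_dg = np.asarray(v_dg, dtype=float)
    v_ndg = np.asarray(v_ndg, dtype=float)
    alpha_q = np.asarray(alpha_q, dtype=float)
    return float(
        w_dg * np.sum((v_dg - v_ref) ** 2)
        + w_ndg * np.sum((v_ndg - v_ref) ** 2)
        + w_alpha * np.sum(alpha_q ** 2)
    )


# Illustrative inputs only, not data from the paper.
costs = {
    case: weighted_objective(case, v_dg=[1.02], v_ndg=[0.98], alpha_q=[0.5])
    for case in CASE_WEIGHTS
}
for case, cost in costs.items():
    print(f"case {case}: objective = {cost:.4f}")
```

Under this assumed form, case 1 ignores the utilization ratio entirely, case 2 ignores the voltages, and case 3 trades the two off, with the small $w_{α}$ keeping the voltage terms dominant unless the ratios are large.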

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

