Robust Voltage Control in Distribution Networks via CVaR-Based Bayesian Optimization

Tian, Ye-Ning

doi:10.3390/electronics15010154

Open AccessArticle

Robust Voltage Control in Distribution Networks via CVaR-Based Bayesian Optimization

by

Ye-Ning Tian

School of Computer Science and Engineering, South China University of Technology, Guangzhou 510006, China

Electronics 2026, 15(1), 154; https://doi.org/10.3390/electronics15010154 (registering DOI)

Submission received: 25 November 2025 / Revised: 21 December 2025 / Accepted: 26 December 2025 / Published: 29 December 2025

(This article belongs to the Special Issue Enhancing Power System Resilience: Advanced Algorithm, Control Strategies, and Topologies for a Sustainable Energy Future)

Download

Browse Figures

Versions Notes

Abstract

The rapid proliferation of distributed solar photovoltaic systems has intensified voltage fluctuations and uncertainty in distribution networks. Traditional Volt/VAR control strategies often struggle with robustness against extreme scenarios and impose high communication overheads. To address these challenges, this paper proposes a Bayesian Evolutionary Optimization with Conditional Value at Risk (BEO-CVaR) framework for optimizing Volt/VAR control rules. This novel approach integrates Conditional Value at Risk (CVaR) into the objective function to explicitly mitigate tail risks arising from grid uncertainties. Furthermore, it employs Bayesian Evolutionary Optimization (BEO) utilizing Gaussian process surrogate modeling to efficiently solve the computationally expensive, black-box optimization problem. Validation on a standard IEEE test feeder demonstrates that BEO-CVaR achieves superior voltage regulation, strict adherence to safety standards, and significantly reduced communication requirements compared to conventional decentralized strategies. Additionally, the framework’s scalability and robustness are verified through extensive experiments across varying dimensions of decision spaces, confirming its effectiveness in complex multi-inverter coordination scenarios.

Keywords:

distributed energy resources (DERs); Conditional Value at Risk (CVaR); Bayesian Evolutionary Optimization (BEO); Volt/VAR control; risk-averse optimization

1. Introduction

The global energy landscape is experiencing a profound transformation, driven by the urgent need to combat climate change and the volatility of fossil fuel prices. This transition has spurred a worldwide effort to integrate substantial amounts of renewable energy sources, particularly distributed energy resources (DERs) [1]. The rapid growth of DERs, exemplified by the significant increase in distributed photovoltaic (PV) inverters in China’s southern power grid, highlights the broader challenges and opportunities in modern power systems. As of September 2024, the total grid-connected installed capacity of distributed solar PV in the Southern China region reached 36.72 gigawatts, accounting for 34.2% of the total installed PV capacity in the entire grid. This represents a year-on-year increase of 114%, with an average growth rate of 104% over the past two years. From January to September 2024, the entire grid added 15.89 gigawatts of new distributed PV capacity, constituting 45.6% of the total newly installed PV capacity during this period. Consequently, the high penetration of DERs has introduced significant variability and uncertainty into the distribution grid, necessitating research into effective Volt/VAR power control rules [2,3]. To address the uncertainty from high DER penetration, recent studies have begun to explore advanced optimization and risk-aware approaches. For instance, hybrid policy-based reinforcement learning has been applied to adaptive energy management in constrained environments [4], while the chance-constrained optimization framework proposed in [5] provides probabilistic guarantees for voltage compliance.

This necessity arises because the inherent variability of renewable generation frequently drives distribution feeders away from their nominal operating states [6].

Maintaining bus voltages within their safety bounds (e.g.,

\pm 5 %

of the rated value) is critical, as operation outside these limits entails detrimental consequences for both utility assets and end-user equipment. Specifically, undervoltage conditions can cause induction motors to stall or draw excessive current, leading to thermal damage and potentially triggering localized voltage collapse due to the lack of reactive power support. Conversely, overvoltage accelerates the aging of equipment insulation, increases transformer core losses, and often forces the curtailment of distributed PV inverters to protect the grid, thereby reducing economic benefits. In severe scenarios, sustained voltage violations may even evolve into cascading failures, jeopardizing the stability of the entire power system.

To address these issues, effective voltage regulation through Volt/VAR control strategies has become essential. By utilizing inverters equipped with advanced power electronics to provide reactive power compensation, it is possible to dynamically stabilize voltage levels, provided these inverters are appropriately controlled. The IEEE 1547 standard [2], a set of technical specifications governing the interconnection of DERs to the grid, recommends various Volt/VAR control strategies that involve adjusting inverter setpoints to maintain grid stability and optimize voltage profiles.

Traditionally, inverter-based voltage regulation strategies can be categorized into three approaches: centralized, distributed, and localized methods. Centralized approaches are capable of determining optimal reactive power setpoints through optimal power flow solutions. However, they often encounter challenges related to high computational and communication overheads. Additionally, optimal power flow only identifies optimal reactive power setpoints. It cannot directly define the essential Volt/VAR control rules necessary for dynamic and adaptive voltage regulation [7]. Distributed strategies, on the other hand, allow limited communication between neighboring stations to enhance coordination. An example is the distributed voltage control approach, where stations exchange information locally to achieve a more balanced and optimized voltage profile, as introduced in [8]. Nevertheless, they can suffer from convergence delays. Localized control strategies rely solely on local data to adjust reactive power injections based on immediate voltage measurements. For example, local static feedback laws, such as those described in [9,10], directly link reactive power adjustments to local voltage levels, achieving fast response times but often lacking global optimality.

To address the challenges mentioned above, this study proposes Bayesian Evolutionary Optimization with Conditional Value at Risk (BEO-CVaR) for Volt/VAR control rule optimization. The contributions of this paper are summarized as follows:

This paper enhances the objective function with Conditional Value at Risk (CVaR). In our voltage regulation process, the regulation results are affected by uncertainties (environmental variables, noise, model errors, etc.). In the event of communication failures, applying decision variables derived from a specific load scenario to other scenarios may expose the system to extreme tail risks, resulting in insufficient robustness and vulnerability under high uncertainty. CVaR quantifies the risk measure of the average worst-case loss when the loss exceeds the Value at Risk (VaR) at a given confidence level [11].
This paper employs Bayesian Evolutionary Optimization (BEO) to address this issue. For Volt/VAR power control issues, the BEO framework dynamically optimizes decision vectors. By constructing a Gaussian process (GP) surrogate model to approximate computationally expensive multi-load-scenario power flow calculations, and actively selecting the most promising parameter points for evaluation based on acquisition functions such as Expected Improvement (EI), this approach significantly reduces the number of simulations required to accurately estimate the CVaR risk value. The method ensures optimization accuracy while substantially enhancing search efficiency, ultimately enabling effective solutions to high-dimensional, non-convex CVaR optimization problems with uncertain constraints under a limited computational budget.

The rest of this paper is organized as follows: In Section 2, the detailed power grid model is defined. In Section 3, the proposed BEO-CVaR is elaborated. In Section 4, this paper shows the details of the experiment. The conclusions are presented in Section 5.

2. The Formulation of Volt/VAR Control Rules Optimization

2.1. System Modeling

Consider a radial distribution network with

N + 1

buses, where bus 0 denotes the slack bus (substation) with a fixed voltage magnitude

v_{0} = 1

p.u. The remaining N buses are modeled as

P Q

nodes.

The exact AC power flow equations are non-convex and computationally expensive, posing a significant challenge for iterative optimization frameworks like BEO. To balance computational efficiency and modeling accuracy, this paper adopts the linearized power flow model proposed in [12]. This model provides a reliable approximation of voltage magnitudes under nominal operating conditions by exploiting the structural characteristics of distribution grids.

It is important to note that, while this linearized model serves as the theoretical basis for deriving the stability constraints (Section 2.2), the proposed BEO framework employs exact nonlinear AC power flow calculations (via MATPOWER) for the objective function evaluation. This ensures that the optimization accuracy is not compromised by linearization errors, and the solver interacts with the true physical model of the grid.

Let

v \in R^{N}

denote the vector of squared voltage magnitudes, while

p, q \in R^{N}

represent the vectors of active and reactive power injections, respectively. The relationship between nodal voltages and power injections is approximated as follows:

v \approx 1 + R p + X q

(1)

where

1

is a vector of all ones.

The sensitivity matrices

R, X \in R^{N \times N}

are strictly positive definite and are determined solely by the network topology and line impedances. Specifically, the element

X_{i j}

corresponds to the common reactance path shared by nodes i and j from the substation. This formulation effectively decouples the voltage deviation into active and reactive components, allowing us to explicitly quantify the impact of inverter reactive power support (

q

) on the voltage profiles. For the rigorous derivation of the sensitivity matrices and the error bound analysis of this linearization, we refer the readers to [12].

2.2. Volt/VAR Control Rule

The IEEE 1547 standard [13] allows DERs to regulate voltage by adjusting reactive power output based on local voltage measurements. This feedback mechanism creates a dynamical system where the inverter at node h updates its reactive power injection

q_{h}

at time step

t + 1

based on the local voltage

v_{h}

at time t:

q_{h} (t + 1) = g_{h} (v_{h} (t))

(2)

where

g_{h} (\cdot)

is the Volt/VAR control function.

As illustrated in Figure 1, this paper adopts the standard piecewise linear characteristic defined by four tunable parameters: reference voltage

\bar{v}

, deadband

δ

, saturation voltage

σ

, and maximum reactive power capacity

\bar{q}

. The control function

g_{h} (v)

dictates that the inverter injects reactive power to counteract voltage deviations. Specifically, in the linear regulation region (where the voltage deviation is between

δ

and

σ

), the control law is defined as follows:

q = - β (v - \bar{v} - δ \cdot sgn (v - \bar{v}))

(3)

where

sgn (\cdot)

is the sign function. The slope

β = \bar{q} / (σ - δ)

represents the control gain.

While a steeper slope

β

(high gain) provides stronger voltage support, it may lead to control loop instability. Therefore, the optimization of these parameters must strictly satisfy stability constraints.

The primary objective is to minimize voltage deviations across the network while satisfying operational constraints. The optimization problem is formulated as follows:

min_{z \in Z} F (z) = \sum_{n = 1}^{N} {[v_{n} (z) - 1]}^{2}

(4)

where

z = [\bar{v}, δ, σ, \bar{q}]

is the parameter vector defining the Volt/VAR control rules.

Stability constraints must be incorporated to prevent oscillations. Let

β

be the vector collecting all slope parameters, and define the diagonal control matrix

A : = diag (β)

. Theoretically, the system dynamics are stable if the spectral radius of the interaction matrix

AX

is strictly less than 1. To ensure robustness against modeling errors, this paper enforces a stricter condition with a safety margin

ε

:

{∥ AX ∥}_{2} \leq 1 - ε

(5)

Although the stability condition is derived based on the linearized approximation in Equation (1), the inherent modeling errors and higher-order terms neglected by linearization are compensated for by enforcing this stricter inequality with a safety margin

ε

. This margin acts as a buffer, ensuring that the control rules remain robust even when the actual grid dynamics deviates slightly from the linear approximation.

This condition is equivalently expressed as a Linear Matrix Inequality (LMI):

[\begin{matrix} (1 - ε) I & A X \\ X A & (1 - ε) I \end{matrix}] ⪰ 0

(6)

where

X

represents the grid topology matrix described in Section 2.

To ensure that the Volt/VAR control decision variables are robust against grid fluctuations, they must be screened using CVaR [14,15]. CVaR optimizes against severe tail risks and safeguards against system instability under worst-case scenarios.

At a given confidence level

α

(e.g.,

α = 95 %

or

α = 99 %

), the Value at Risk (VaR) represents the maximum possible loss. In other words, we have

α

certainty that the loss will not be greater than

{VaR}_{α}

. The standard definition of VaR at confidence level

α

is

{VaR}_{α} (L) = inf \{l \in R ∣ F_{L} (l) \geq α\}

(7)

where

L

is a random variable representing the loss (voltage deviation),

F_{L} (l) = P (L \leq l)

is the Cumulative Distribution Function (CDF) of the loss, and inf denotes the lower bound.

CVaR measures the average expected loss faced in a worst-case, small-probability, extreme loss scenario. It provides a more complete picture of the extent of tail risk than (7). Mathematically,

{CVaR}_{α}

is the conditional expectation that loss

L

exceeds

{VaR}_{α}

(

L \geq {VaR}_{α}

):

{CVaR}_{α} (L) = E [L ∣ L \geq {VaR}_{α} (L)]

(8)

Expressed in integral form, we can write CVaR as follows:

{CVaR}_{α} (L) = \frac{1}{1 - α} \int_{α}^{1} {VaR}_{γ} (L) d γ

(9)

Thus, the true objective function is the CVaR-robustified sum of squared voltage deviation:

min_{z \in Z} {CVaR}_{α} (F (z, ξ)) = min_{z \in Z} {CVaR}_{α} (\sum_{n = 1}^{N} {[v_{n} (z, ξ) - 1]}^{2})

(10)

By integrating active/reactive power profiles from historical/simulated data, applying stability constraints (5), and modeling voltage and reactive power through power flow equations, a baseline performance metric (4) can be calculated. However, optimizing this metric alone ignores extreme risks arising from grid uncertainties. Thus, the proposed CVaR-based objective function explicitly minimizes the average worst-case voltage deviations beyond a specified risk threshold (

α

), thereby ensuring robust voltage regulation under high uncertainty while preserving all operational constraints and the physical model.

3. The Proposed BEO-CVaR

3.1. BEO

Traditional BEO directly models the risk measure objective function (4), which requires numerous repeated evaluations of the expensive black-box function to estimate the risk value. If a single evaluation takes 30 min, accurately estimating the risk value may require several days. In contrast, the BEO-CVaR algorithm proposed in this paper directly models the underlying function and leverages a GP to capture its structure, thereby avoiding the prohibitively expensive direct evaluation of the risk measure.

Since the computation of voltage deviation is prohibitively expensive, this paper deems the voltage deviation calculation function as a black-box function. For such costly black-box functions, BEO can efficiently perform optimization within a limited number of computational iterations. It iterates through three core steps [16,17].

3.1.1. Training a GP Surrogate Model

Using a GP Surrogate Model, which is defined as

f (x) \sim GP (m (x), k (x, x^{'})),

(11)

the joint distribution of the observed data y follows a multivariate Gaussian distribution:

y \sim N (0, K_{θ} + σ_{n}^{2} I)

(12)

where

μ (x)

is the mean function, typically set to zero;

k (x_{i}, x_{j})

is the RBF kernel function/covariance function, used to define the similarity between points; and

K_{θ}

is the kernel matrix, and its elements are defined as

K_{i j} = k_{θ} (x_{i}, x_{j})

.

The RBF kernel function is given by

k_{θ} (x_{i}, x_{j}) = σ_{f}^{2} exp (- \frac{∥ x_{i} - x_{j} ∥^{2}}{2 l^{2}})

(13)

Here, the hyperparameter vector

θ

contains three parameters: the length scale l, the output scale

σ_{f}^{2}

, and the observation noise variance

σ_{n}^{2}

; l controls the smoothness of the function. The influence of a point

x_{i}

on another point

x_{j}

decays rapidly as the normalized distance

\frac{∥ x_{i} - x_{j} ∥}{l}

increases.

σ_{f}^{2}

controls the amplitude (vertical scale) of the function. A larger

σ_{f}^{2}

allows for a larger possible range of function values.

σ_{n}^{2}

measures the average magnitude of the deviation of the observed values from the true function values. A larger

σ_{n}^{2}

indicates more noise in the observations and higher uncertainty; a smaller

σ_{n}^{2}

indicates more precise observations and lower uncertainty.

The goal of training is to find the hyperparameters

θ

that best explain the current observed data D. The objective function used in this paper is the marginal log-likelihood (MLL):

log p (y | X, θ) = - \frac{1}{2} y^{T} {(K_{θ} + σ_{n}^{2} I)}^{- 1} y - \frac{1}{2} log | K_{θ} + σ_{n}^{2} I | - \frac{n}{2} log (2 π)

(14)

This formula consists of three components: The data-fitting term (

- \frac{1}{2} y^{T} K^{- 1} y

) measures how well the model matches the observed data. The complexity penalty term (

- \frac{1}{2} log | K |

) acts as a regularizer for the model complexity. The determinant

| K |

of the kernel matrix measures the model’s flexibility in a certain sense. A more complex model typically has a larger

| K |

, resulting in a smaller value for this term (i.e., a larger penalty). The constant term (

- \frac{n}{2} log (2 π)

) can be ignored during optimization.

This structure resembles the classic bias–variance trade-off in machine learning. Equation (14) automatically balances data fitting and model complexity.

Therefore, the goal of GP optimization is to find the optimal hyperparameters

θ^{*}

that maximize Equation (14).

The optimal hyperparameters

θ^{*}

are found by maximizing the MLL:

θ^{*} = arg max_{θ} log p (y ∣ X, θ)

(15)

This is inherently a non-convex optimization problem with continuous variables. Although it cannot be solved analytically, iterative methods based on the gradient descent principle—such as gradient descent itself, conjugate gradient, or L-BFGS—can be employed to find the optimal solution. This work derives the gradient of the objective function (14) with respect to the hyperparameters

θ = (l, σ_{f}^{2}, σ_{n}^{2})

:

\frac{\partial log p (y ∣ X, θ)}{\partial l} = \frac{1}{2} y^{T} K^{- 1} \frac{\partial K}{\partial l} K^{- 1} y - \frac{1}{2} tr (K^{- 1} \frac{\partial K}{\partial l})

(16)

The term

\frac{\partial K}{\partial l}

can be computed based on the definition of the RBF kernel function given in (13).

The parameters are then updated iteratively, where the “loss” refers to the negative MLL (the negative of (14), i.e.,

- log p (y ∣ X, θ)

):

θ_{new} = θ_{old} - η \frac{\partial loss}{\partial θ}

(17)

In summary, the training procedure for a GP is as follows:

Initialize the hyperparameters $θ = {l, σ_{f}^{2}, σ_{n}^{2}}$ .
Construct the kernel matrix $K$ using the current $θ$ .
Compute the current negative marginal log-likelihood (loss).
Compute the gradients of the loss with respect to $θ$ in (16).
Update the hyperparameters $θ$ using an optimizer (e.g., gradient descent as in (17)).
Repeat Steps 2–5 until a convergence criterion is met (e.g., the gradient norm is sufficiently small, the maximum number of iterations is reached, or the change in the loss function is negligible).

3.1.2. Constructing and Optimizing the Acquisition Function

Based on the mean

μ_{t} (x)

and standard deviation

σ_{t} (x)

provided by the current GP model, an acquisition function is defined to measure the expected utility of performing the next evaluation at point x. This function balances exploration (of regions with high uncertainty) and exploitation (of regions with a high predicted mean). The point

x_{t}

that corresponds to the maximum value of the acquisition function is considered to be the most promising candidate for locating a better solution in the next iteration.

This paper compares four distinct acquisition function strategies: EI, upper confidence bound (UCB), Max-Value Entropy Search (MES), and Thompson Sampling (TS). The acquisition function can be formally expressed as

a (x; D_{1 : t})

, where

D_{1 : t}

denotes the set of all observed data accumulated from the first to the t-th iteration. For acquisition functions in medium-to-high dimensions that are differentiable, multi-start gradient-based methods are commonly employed to efficiently obtain high-quality local maxima, which are then selected as the next sampling points.

EI: Measures the expected value of improvement of x over $y_{best}$ (improvement amount weighted by probability).

$a_{EI} (x; D_{1 : t}) = E [max (f (x) - f_{best}, 0)]$

(18)
UCB: Selects the next point based on the upper confidence bound.

$a_{UCB} (x; D_{1 : t}) = μ_{t} (x) + λ_{t} σ_{t} (x)$

(19)

where $λ_{t} \geq 0$ is a parameter controlling the degree of exploration (which can be time-varying or fixed).
MES: Aims to select an evaluation point x such that, after observing the function value y at this point, the uncertainty about the global maximum $f^{*}$ is maximally reduced, which can be expressed using conditional entropy as $H (f^{*} | D)$ . This reduction in uncertainty is the information gain:

$a_{MES} (x | D_{1 : t}) = I (f^{*}; y | D_{1 : t}, x)$

(20)

$a_{MES} (x | D_{1 : t}) = H (f^{*} | D_{1 : t}) - E_{p (y | x)} [H (f^{*} | D_{1 : t} \cup {(x, y)})]$

(21)

where $E_{p (y | x)} [H (f^{*} | D_{1 : t} \cup {(x, y)})]$ represents the remaining uncertainty about $f^{*}$ after sampling at candidate point x and obtaining observation y. This expectation is taken over possible values of y (which follows the predictive distribution $p (y | D_{1 : t}, x)$ of the current GP model at point x).
TS: Samples a function $f_{sample} \sim gp (μ_{t}, k_{t})$ from the GP posterior distribution, and then takes the maximum point on $f_{sample}$ as $x_{t + 1}$ . No explicit calculation of $a (x)$ is required.

3.1.3. Evaluation and Update

Evaluate the computationally expensive objective function at the selected point. Add the new (input, output) data pairs to the existing training dataset. Repeat this process until the evaluation budget is exhausted or a convergence criterion is met.

3.2. BEO-CVaR Algorithm Framework

The black-box function of the BEO-CVaR algorithm is used to solve for voltage deviation values. Figure 2 illustrates the framework of BEO-CVaR. It consists of two parts: (1) BEO-CVaR optimization and (2) computation of the voltage deviation via (4).

We refer to the second part of the function used to solve for the voltage deviation values as a black-box function for the BEO-CVaR algorithm. Algorithm 1 is the pseudo-code.

Algorithm 1: BEO-CVaR Framework

Input: Variable boundaries

B \in R^{s \times 2}

, initial sample size

N_{init}

, optimization iteration count T, environmental sample count K, risk level

α

Output: Optimal parameters

z_{b e s t}

, minimum CVaR

y_{b e s t}

1: Normalize the training data to the range

[0, 1]

   2: for iteration = 1 to max_iterations T do
   3:       Normalize training data to the range [0,1]
   4:       Train a GP model: Use the RBF kernel function from (13); train the model hyperparameters using MLL (neural network gradient method)
   5:       Construct the acquisition function: Use the acquisition function strategy from Section 3.1, calculate the current optimum as the reference point, and construct the acquisition function
   6:       Optimize the acquisition function: Acquisition function optimization is performed via multi-start random restarts and random sampling points, using gradient-based optimization methods within the 8-dimensional standardized parameter space, while employing boundary handling and convergence checks to ensure numerical stability
   7:       Denormalize the candidate point to the original parameter space
   8:       Evaluate the black-box function at the candidate point:
   9:       for environment sample = 1 to K do
   10:           Randomly select a load scenario
   11:           Calculate power system voltage deviation cost
   12:       end for
   13:       Calculate CVaR at

α

risk level
   14:       Update the training dataset with the new candidate point and its CVaR value
   15:       Periodically save the optimization progress
   16: end for

Figure 3 is the flowchart of the voltage deviation calculation function. This function is computationally expensive. This paper computes it based on (4).

The input is an 8-dimensional control decision vector to be evaluated, which defines the Volt/VAR power control rules for two inverters (including decision variables such as reference voltage, deadband voltage, saturation voltage, and maximum reactive power).

To fully capture grid uncertainties (such as fluctuations in load and photovoltaic output), the algorithm randomly selects K different operating scenarios. For each scenario, the following detailed calculation process is executed: (1) Initialize the grid model by loading the topology and initial parameters of the IEEE 123-bus test system using MATPOWER’s loadcase function. (2) Inject the load and photovoltaic data for the current scenario, modifying the active load, reactive load, and photovoltaic active power output in the model. (3) Call MATPOWER’s runpf function to perform a complete AC power flow calculation. This function iteratively solves the power balance equations using the Newton–Raphson method, directly outputting the precise voltage magnitudes for each node. (4) Based on the power flow results, calculate the sum of squared deviations between all node voltages and the nominal voltage for that scenario.

After iterating through all K scenarios, the process collects a set of voltage deviation samples and finally computes the CVaR at a given risk level

α

as the output. This process ensures the accuracy of performance evaluation through expensive multi-scenario power flow calculations, while also highlighting the necessity of employing the BEO algorithm for efficient optimization.

4. Experiment and Result Analysis

To comprehensively evaluate the effectiveness and robustness of the proposed BEO-CVaR framework, this section presents a series of numerical simulations on the IEEE 123-node test feeder. The experimental validation is organized as follows: First, the test system setup and algorithmic parameter configurations are detailed in Section 4.1 and Section 4.2. Subsequently, Section 4.3 analyzes the algorithm’s internal characteristics, including sensitivity to the risk confidence level (

α

) and convergence behavior. Following this, comparative studies against traditional decentralized strategies and meta-heuristic optimization (PSO) are presented in Section 4.4, while the performance of different acquisition functions is evaluated in Section 4.5. Finally, the framework’s scalability, communication efficiency, and robustness under extreme physical constraints are rigorously verified in Section 4.6, Section 4.7, and Section 4.8, respectively.

4.1. Test Case

The backbone network of the standard IEEE 123-node test grid proposed in [18] was adopted as the grid infrastructure, with a schematic diagram shown in Figure 4. To simulate realistic load conditions, the power demand profiles were derived from the DiSC simulation framework, which aggregates anonymized residential electricity consumption data from approximately 1200 households near Horsens, Denmark, provided by the local distribution system operator NRGi.

Two PV units were integrated into the grid model as micro-generators. Figure 5a illustrates the total load demand profiles across all nodes over a 12 h period (6:00–18:00), with one representative curve highlighted.

Figure 5b simultaneously displays the power generation curves of the two newly added PV units, exhibiting time-varying output patterns influenced by factors such as solar irradiance.

To validate the practical value of the BEO-CVaR algorithm in setting Volt/VAR control decision vectors, the optimization process was simulated and verified using MATPOWER on a quasi-steady-state nonlinear AC power flow model.

4.2. Experimental Setting

For the architecture in Section 3.2, its key parameters are shown in Table 1. This table defines the key algorithm parameters for BEO-CVaR optimization. While most parameters are determined by the physical constraints and computational budget, the risk confidence level

α

is a critical hyperparameter that dictates the trade-off between average efficiency and tail risk robustness. Its specific selection process and the resulting impact on system performance are investigated in detail in Section 4.3.1. There exists an 8-dimensional decision vector representing the Volt/VAR power control rule settings (

\bar{v}

,

δ

,

σ

,

\bar{q}

) of 2 inverters. Subsequently, 68 optimization iterations are carried out. In each iteration, the GP and an acquisition function are used to select the next most promising point for evaluation. The EI strategy is employed in this subsection, while the remainder of this paper will compare the results produced by different strategies. Unless otherwise stated, all computations were performed on a hardware platform equipped with an AMD64 Family 25 Model 97 processor and 16 GB of RAM.

To address the computationally expensive CVaR objective and ensure the robustness of local optima, a multi-restart strategy with a large number (50) of restarts was employed during the acquisition function optimization step. The 200 original samples represent the number of unique load/PV scenarios (K) used in each evaluation of the black-box function to compute the CVaR at the specified risk level. The risk confidence level

α

represents the proportion of the loss distribution that the optimizer accounts for when calculating risk. In this study, we explore a range of candidates

α \in {0.5, 0.6, 0.7, 0.8, 0.9}

to empirically identify the optimal operating point for the IEEE 123-node feeder. This ensures that the chosen control rules are not only robust against extreme scenarios but also maintain high performance under normal operating conditions.

Table 2 defines the feasible search space for the Volt/VAR power control decision vector and imposes physical and operational constraints on the two inverters.

To ensure reproducibility, the specific algorithmic settings and convergence strategies are defined as follows:

Hyperparameter Optimization: The GP kernel hyperparameters are optimized by maximizing the MLL using the L-BFGS-B algorithm. As a quasi-Newton method, this approximates the Hessian matrix to automatically adapt step sizes, eliminating the need for manual tuning of the learning rate and terminating the update when the gradient norm satisfies the convergence tolerance.
Convergence Strategy: The outer optimization loop employs a fixed evaluation budget ( $T = 68$ iterations) rather than an asymptotic convergence threshold. Given the computationally expensive nature of the CVaR objective, this budget strategy ensures a strictly bounded and predictable execution time (approx. 31 min) while sufficiently exploring the design space.
Acquisition Maximization: To address the non-convexity of the acquisition function, we employ a multi-start optimization strategy. The algorithm generates $N_{r a w} = 200$ random samples to identify high-potential regions, which are then used to initialize $N_{r e s t a r t s} = 50$ local gradient-based searches (via L-BFGS-B) to precisely locate the next sampling point.

4.3. Results and Analysis

4.3.1. Sensitivity Analysis and Selection of Risk Parameter $α$

THe CVaR confidence level,

α

, serves as a critical hyperparameter in the BEO-CVaR framework, determining the algorithm’s degree of risk aversion. A generic choice of

α

may not yield the optimal trade-off between operational efficiency (average performance) and grid safety (robustness against extreme events). To scientifically determine the optimal

α

for the IEEE 123-node test feeder, we conducted a sensitivity analysis by varying

α

across the set

{0.5, 0.6, 0.7, 0.8, 0.9}

.

Figure 6 illustrates the impact of

α

on two key performance metrics: the average voltage deviation (representing normal operating efficiency) and the extreme tail risk (quantified as the 99.9% worst-case voltage deviation). The results reveal a convex, “U-shaped” relationship, indicating that neither risk-neutral nor excessively risk-averse strategies are ideal:

Under-Protection Region ( $α < 0.7$ ): At $α = 0.5$ , the algorithm behaves similarly to a risk-neutral expectation minimization. While it achieves a moderate average deviation ( $0.0122$ p.u.), it fails to sufficiently penalize extreme violations. Consequently, the system exhibits the highest tail risk ( $0.0520$ p.u.), indicating vulnerability to rare but severe load fluctuations.
Over-Conservatism Region ( $α > 0.7$ ): As $α$ increases to $0.9$ , the optimizer focuses exclusively on the worst $10 %$ of scenarios. This excessive risk aversion leads to “over-regulation,” where inverters adopt aggressive control rules that are suboptimal for the majority of normal scenarios. As a result, the average voltage deviation degrades to $0.0115$ p.u. Furthermore, estimating the extreme tail with finite samples introduces higher variance into the GP surrogate, resulting in a slight increase in the measured risk ( $0.0510$ p.u.) compared to $α = 0.7$ .
Optimal Balance ( $α = 0.7$ ): The value of $α = 0.7$ is identified as the optimal trade-off point. It achieves the global minimum for the average voltage deviation (0.0102 p.u.), which prioritizes and guarantees the highest efficiency under day-to-day nominal conditions. While a marginally lower extreme tail risk is observed at $α = 0.8$ , this gain is minimal and comes at the cost of a significant degradation in average performance. Therefore, $α = 0.7$ is the superior choice, as it robustly constrains the tail risk (0.0478 p.u.) without sacrificing crucial operational efficiency. This indicates that focusing on the worst $30 %$ of scenarios provides sufficient data for the BEO algorithm to learn robust control boundaries without sacrificing performance in nominal states.

Based on these empirical findings,

α = 0.7

was identified as the optimal operating point for this specific network topology. Therefore, this value is adopted for all subsequent performance comparisons and scalability analyses in this study.

4.3.2. Optimization Process and Convergence Analysis

Figure 7 shows the step-by-step evaluation of the CVaR objective (blue dots) and the progression of the historical best CVaR (red line). Some points with high CVaR values represent exploratory samples evaluated by the BEO process. These high values originate from the algorithm intentionally testing potentially high-risk regions (decision vectors that could lead to system instability, e.g., load curtailment, in some scenarios). The graph demonstrates the progressive improvement in risk control during the BEO process, highlighted by the continuous downward trend of the historical best CVaR, which clearly indicates the algorithm’s convergence.

Figure 8 visualizes the evolution of all eight optimized decision variables throughout the evaluation process, revealing the search trajectory and convergence behavior of each variable during BEO. All of the parameters spend most of their time in stable “exploitation” phases, conducting local fine searches. The occasional abrupt changes represent crucial “exploration” behaviors, where the algorithm actively jumps out of the current region to probe new areas of the search space, avoiding local optima and seeking the global best solution.

Figure 9 tracks the dynamic changes of the noise variance (

σ_{n}^{2}

) and output scale (

σ_{f}^{2}

) hyperparameters of the GP model throughout the iterations, demonstrating the enhancement of the model’s stability in capturing the behavior of the objective function.

$σ_{n}^{2}$ reaches its peak around the 35th iteration, and then declines rapidly and stabilizes at a low value, indicating early active exploration of high-uncertainty regions and improved model reliability. The delayed convergence of $σ_{n}^{2}$ underscores its role as a key parameter requiring careful calibration.
$σ_{f}^{2}$ undergoes a sharp drop around the 10th iteration and quickly converges to a stable value, reflecting the algorithm’s rapid calibration of the overall variation amplitude of the target function in the initial phase (with early emphasis on exploring extreme regions), ultimately determining an appropriate scale.

Both parameters evolve independently but stabilize after their respective transitions, marking the shift of the BEO algorithm from exploration to exploitation.

Figure 10 displays the length scales l for the eight Volt/VAR power control decision variables defined in Table 2. In the early stages of optimization (when the number of iterations is low), the length scales across all dimensions typically undergo significant fluctuations and exploration. This occurs because the GP model has limited knowledge of the function’s structure initially and must explore different length scale configurations to identify the optimal model structure. The early peaks observed in some dimensions in the figure confirm this behavior. As the iterations progress (with increasing number of iterations), the BEO algorithm collects more data, and the GP model gains a clearer understanding of the target function. As a result, the length scales in all dimensions gradually stabilize, with reduced fluctuations, eventually converging to a relatively stable and very small value. This indicates that the target function being optimized is complex and finely detailed (highly nonlinear with intricate physical relationships), and that its output is highly sensitive to minor changes in the input decision variables. Through learning and modeling, the BEO algorithm successfully identifies this characteristic. It ultimately selects a high-resolution, high-precision model (with a small l value) to characterize the target function, enabling a detailed search within this complex function and accurately locating the global optimum or its vicinity.

4.3.3. Statistical Reproducibility and Computational Efficiency

To strictly evaluate the reproducibility and robustness of the proposed method, we conducted eight independent trials with different random seeds. The statistical distribution of key performance metrics and computational costs is illustrated in Figure 11.

Figure 11c–e display the distribution of the final optimized objectives. As detailed in Table 3, the BEO-CVaR framework demonstrates high stability. The standard deviation of the CVaR objective is extremely low (0.0012), indicating that the algorithm consistently converges to high-quality solutions regardless of the initial random sampling. Notably, the voltage violation rate remains minimal at

0.1030 % \pm 0.0834 %

, and the average voltage deviation is stable at

0.0113 \pm 0.0006

p.u. These results confirm the statistical significance of the method’s performance, proving that the optimization capability is robust against the randomness inherent in the BEO process.

Furthermore, to quantify the computational overhead, we recorded the execution time for both the initialization phase (20 samples) and the optimization phase (68 iterations), as shown in Figure 11a,b. The specific time consumption statistics are summarized in Table 4.

The average total execution time is approximately 31 min (

1869.75

s). Considering that the optimized control rules are intended for long-term operation (e.g., updated every 12 h, as discussed in Section 4.3.4) and eliminate the need for real-time communication, this offline training cost is highly acceptable for practical engineering applications.

4.3.4. Performance Validation on Standard Test Scenarios

Following the statistical analysis, we selected the representative decision vector to validate the control efficacy across the full set of test scenarios. We sampled every 5 s over 12 h (6 a.m. to 6 p.m.) to obtain the 8640 scenarios. When deployed across these 8640 scenarios, the results are as shown in Figure 12:

The simulation results can be summarized as follows:

Strict Safety Compliance: The proposed BEO-CVaR method achieved a remarkably low voltage violation rate of 0.026% (violations occurred in only a negligible fraction of the 8640 scenarios). This demonstrates that the algorithm effectively learned to mitigate tail risks, ensuring that the grid operates within the IEEE 1547 safety limits (≤1.05 p.u.) under almost all uncertainty realizations.
Coordinated Inverter Response: As shown in the bottom panel of Figure 12, the inverters exhibit a distinct coordinated behavior. The smaller Inverter 2 (Green) operates near its saturation limit to provide baseload reactive support, while the larger Inverter 1 (Orange) dynamically adjusts its output, reaching full saturation only during peak solar generation periods (scenario indices 4000–6000).

In contrast to the method in [1], which requires communication every two hours to update the control rules, the proposed approach relies on a robust, fixed decision vector derived from offline training. The results confirm that this fixed strategy can autonomously maintain grid stability over a 12 h period using only locally measured voltages, significantly reducing communication dependency.

4.4. Comparison with Traditional and Meta-Heuristic Strategies

4.4.1. Comparison with Traditional Decentralized Methods

There are three traditional methods:

Uncontrolled Scenario: In the uncontrolled case, the two PV inverters inject active power at a unitary power factor (i.e., zero reactive power). This results in voltage violations, with the magnitude exceeding the 1.05 p.u. upper limit at multiple buses (Figure 13, uncontrolled panel). These overvoltage issues highlight the necessity of reactive power control strategies to maintain grid voltage stability.
Fully Decentralized Control (Static Feedback): This scenario evaluates the static local feedback law proposed in [2,9,10,19]. The control law relies on purely local measurements. It adjusts reactive power injection based on a predefined control strategy with a reference voltage $\bar{v}$ of 1.0 p.u., a deadband voltage $δ$ of ±0.01 p.u., a saturation voltage $σ$ of ±0.04 p.u., and maximum reactive power injections $\bar{q}$ of 1.0 p.u. for one PV inverter and 0.2 p.u. for the other. As shown in Figure 14 (Fully decentralized control (static)), while some buses exhibit improved voltage profiles, violations persist at certain nodes, including generator buses. This indicates that static decentralized control strategies are insufficient for effectively mitigating overvoltage in all cases.
Fully Decentralized Control (Incremental Feedback): This paper also implements the incremental feedback strategy, which adjusts reactive power iteratively based on local voltage deviation [20,21,22]. The results, depicted in Figure 15, demonstrate that while incremental feedback offers some improvements, it fails to strictly regulate voltages below the 1.05 p.u. threshold. The persistent overvoltage issues at certain nodes underscore the limitations of purely decentralized strategies.

The experimental results of these three traditional methods are compared with those of the BEO-CVaR framework in Table 5. In summary, the comparative analysis clearly demonstrates the superiority of the proposed BEO-CVaR framework over traditional Volt/VAR power control strategies.

Specifically, to evaluate robustness under extreme conditions, we analyzed a high-risk operational interval (scenarios 4500–6000) characterized by high load or generation volatility. As shown in the “High-Risk Violation Rate” column of Table 5, traditional methods fail significantly in this interval, with violation rates soaring to 10.71% (uncontrolled), 8.64% (incremental), and 5.21% (static). This indicates that decentralized methods lack the coordination required to handle system stress.

In contrast, BEO-CVaR achieves consistent and robust voltage regulation. Crucially, across all 8640 real-world load scenarios, BEO-CVaR effectively maintains voltage deviation within safe margins, achieving near-perfect compliance (99.90%) with the IEEE 1547 safety standard. Furthermore, this enhanced stability and robustness are achieved with significantly reduced operational overhead: BEO-CVaR only requires infrequent decision vector updates (once every 12 h in the experiment), greatly diminishing reliance on communication reliability. This stands in sharp contrast to traditional decentralized methods, which necessitate continuous real-time communication or frequent updates to maintain their own relatively ineffective control, thereby imposing a substantial communication burden. The results demonstrate that BEO-CVaR effectively addresses the inherent key limitations of existing decentralized methods—namely, robustness, optimality, and communication efficiency—in managing distribution networks with DERs.

4.4.2. Comparison with Meta-Heuristic Optimization (PSO)

To rigorously evaluate the advanced nature of BEO-CVaR in handling black-box optimization problems, in addition to the aforementioned traditional methods, this study benchmarks the proposed framework against a classic meta-heuristic algorithm: Particle Swarm Optimization (PSO). PSO is widely recognized for its simplicity and effectiveness and has been extensively utilized in optimal power flow and Volt/VAR optimization tasks within power systems [23,24]

Under the exact same experimental environment, the PSO was configured with a population size of 30 and 50 iterations, targeting the minimization of the average voltage deviation. The comparative results are presented in Table 5 and Figure 16. The analysis reveals two key insights:

Superior Robustness in High-Risk Scenarios: While PSO achieves a reasonable overall compliance rate, it struggles significantly under stress. In the high-risk interval (scenarios 4500–6000), PSO exhibits a violation rate of 2.34%. This reveals a critical limitation: standard PSO optimizes for average performance (expectation minimization) and ignores tail risks. In sharp contrast, BEO-CVaR maintains a violation rate of only 0.50% in the same high-stress window—an improvement of nearly 80% over PSO. This demonstrates the effectiveness of the CVaR component, which explicitly penalizes voltage violations in worst-case scenarios, ensuring grid safety even during peak volatility.
Computational Efficiency: Regarding computational cost, the optimization process for PSO required approximately 54.5 min, whereas BEO-CVaR required only about 31 min. PSO relies on stochastic exploration of the search space by a population, necessitating a large number of function evaluations. Conversely, BEO-CVaR leverages a GP surrogate model to efficiently “learn” the grid behavior, achieving superior convergence with significantly fewer samples. This demonstrates the superior sample efficiency of the Bayesian optimization framework.

4.5. Comparison of Four Acquisition Function Strategies

Figure 17 illustrates the performance of four different acquisition strategies (EI, UCB, MES, and TS) in minimizing the objective function CVaR. The EI strategy, represented by the blue curve, demonstrates the fastest initial convergence speed. It shows extremely rapid performance improvement within the first few iterations, quickly catching up with and surpassing other strategies. This is a strategy that is biased towards exploitation, as it greedily searches near the currently known optimal point, enabling it to rapidly find local improvements. The UCB strategy, depicted by the red curve, exhibits very robust performance. From early to late stages, it maintains a steady declining trend without significant performance fluctuations. This indicates that it is a reliable and robust choice requiring minimal parameter tuning, with strong performance predictability. The MES strategy, shown by the purple curve, starts from the best initial point (the point most informative about the global optimum). However, it progresses slowly in the early stages, even with slight fluctuations. Nevertheless, during the mid-to-late iterations (after approximately 10 iterations), it demonstrates sustained optimization capability, ultimately achieving slightly leading or comparable performance relative to other strategies. The TS strategy, represented by the green curve, performs similarly to MES but possesses inherent randomness and a strong exploratory nature. The selection of its initial point highly depends on the form of the randomly sampled function, allowing it to explore any uncertain regions within the search space. It shows more pronounced fluctuations in the early stages before eventually converging to a good solution. The semi-transparent region surrounding each curve represents the average fluctuation range across multiple runs (95% confidence interval). The narrowest region observed for EI indicates very stable and highly reproducible performance. In contrast, MES and TS show wider regions, reflecting greater variability in results across different runs due to their more random, exploration-focused nature. This further confirms their core characteristic of being exploration-oriented.

4.6. Scalability Analysis in Multi-Inverter Scenarios

To rigorously evaluate the scalability of the proposed BEO-CVaR framework, a series of experiments were conducted on a standard distribution test system modified to incorporate a varying number of controllable inverters. Three distinct scenarios were established: a 2-inverter base case, a 5-inverter case, and a 10-inverter case. The inverters were progressively added to different buses to simulate increasing penetration of DERs. A large-scale inverter, rated at 1.0 Mvar, was situated at a designated node, while the remaining inverters, each with a 0.2 Mvar capacity, were deployed in a decentralized manner at remote locations within the network to provide fine-grained local voltage regulation. This configuration was consistently maintained across all test scenarios. For all scenarios, the risk-aversion parameter for the CVaR objective was set to

α = 0.95

, creating a stringent test condition that forces the algorithm to optimize against the worst 5% of potential outcomes, thereby providing a rigorous benchmark for system robustness under extreme uncertainty.

The performance was evaluated based on the voltage violation rate across 8640 load scenarios. The results are summarized in Table 6.

As illustrated in Table 6, the voltage violation rate remains exceptionally low (below 0.32%) across all cases, demonstrating the robust scalability of the BEO-CVaR method. The 5-inverter case achieved the lowest violation rate of 0.0062%, indicating an optimal balance between control authority and optimization complexity. At this moderate scale, the BEO effectively navigates the search space to find a coordination strategy that leverages the additional control points to near-perfect effect.

In the 2-inverter case (Figure 18), the dark blue curve represents a large-scale 1.0 Mvar inverter. In the majority of high-risk scenarios, it operates almost continuously at its full capacity of −1.0 Mvar. This indicates that the system relies heavily on this single inverter to maintain grid voltage, shouldering the vast majority of the voltage regulation task. The light blue curve, representing a small-scale 0.2 Mvar inverter, also actively absorbs reactive power, but its output is constrained by its 0.2 Mvar capacity.

In the 5-inverter case (Figure 19), the voltage profiles exhibit significant convergence. The maximum nodal voltage is effectively suppressed, establishing a safer margin from the 1.05 p.u. upper limit, which demonstrates a notable improvement in overall voltage quality. The operational saturation of the 1.0 Mvar primary inverter (dark blue curve) increases compared to the 2-inverter scenario, reaching full capacity in approximately half of the scenarios. Concurrently, the other four distributed inverters actively participate in voltage regulation. This case strikes an optimal balance between control degrees of freedom and optimization tractability. The 20-dimensional decision space is sufficiently rich to enable effective coordinated control. The BEO-CVaR algorithm successfully explores this space and identifies a high-quality solution approaching the global optimum within a finite computational budget. This solution is characterized by assigning the core task to the most capable unit (the primary inverter) while mobilizing all distributed units for auxiliary support and local optimization, ultimately achieving near-perfect risk mitigation. The high saturation level of the primary inverter is, in fact, an integral part of this efficient, coordinated strategy.

In the 10-inverter case (Figure 20), the 1.0 Mvar primary inverter operates at its full −1.0 Mvar capacity for over 80% of its operational time, a saturation level far exceeding the previous two cases. Among the remaining nine 0.2 Mvar inverters, only one operates in a relatively saturated state, while the other eight provide almost no reactive power output. Although the Voltage Violation Rate (0.1007%) remains at a favorable level, it is higher than in the 5-inverter case. This performance degradation in the 40-dimensional case (10 inverters) can be theoretically attributed to the “curse of dimensionality” inherent in non-parametric BEO. As the dimension D of the decision space increases, the volume of the search space grows exponentially, leading to extreme data sparsity when the sample size N (computation budget) remains fixed. From the perspective of the GP surrogate model, this sparsity specifically impacts the efficacy of the RBF kernel (13). The kernel relies on the Euclidean distance

∥ x_{i} - x_{j} ∥

to measure similarity. In high-dimensional spaces, the distance between any two randomly sampled points tends to converge, making it difficult for the GP to distinguish between ‘nearby’ and ‘distant’ points effectively. Consequently, the predictive variance

σ_{t} (x)

becomes less informative, and the acquisition function struggles to balance exploration and exploitation efficiently. The optimizer requires significantly more samples to cover the latent structure of the objective function, confirming that standard BO faces theoretical bottlenecks when

D > 20

without dimensionality reduction techniques.

In summary, the scalability analysis confirms that the BEO-CVaR framework is highly effective for systems with 2 to 10 inverters, consistently ensuring robust voltage regulation under a strict risk-aversion setting. The results clearly demonstrate that the method’s performance is subject to the curse of dimensionality, as evidenced by the non-monotonic trend in violation rates. The optimal performance at 5 inverters suggests a practical limit for the current setup under the given computational budget.

Nevertheless, it is crucial to emphasize that, even in the challenging 10-inverter case, the framework maintains a remarkably low violation rate (0.1007%), which drastically outperforms traditional decentralized methods (as shown in Table 5). This demonstrates that BEO-CVaR possesses significant robustness to the degradation effects of high dimensionality, making it a practical and powerful solution for real-world distribution grids. Future work will address the identified curse of dimensionality by incorporating high-dimensional BEO techniques. Potential strategies include using additive GPs (which decompose the objective function into lower-dimensional sub-problems) or embedding the search space into a lower-dimensional manifold (e.g., via random embeddings or autoencoders) to further enhance performance in very large-scale systems.

4.7. Quantitative Analysis of Communication Efficiency and Robustness

To evaluate the communication burden, we distinguish between the monitoring uplink (data collection) and the critical control downlink (dispatching commands). While the BEO-CVaR framework maintains a standard 3 min uplink for SCADA data acquisition, it fundamentally decouples voltage stability from real-time control instructions.

4.7.1. Reduction in Mandatory Control Updates

Traditional real-time strategies require continuous dispatching of reactive power setpoints (

Q_{s e t}

) to handle PV variability. While our simulations evaluate performance against grid dynamics evolving every 5 s, traditional centralized control is often constrained by communication bandwidth and computational latency, typically allowing for a 3 min control cycle. Under this common operational constraint, the number of mandatory control updates for a standard 12 h window is

N_{t r a d i t i o n a l} = \frac{12 \times 60 \min}{3 \min} = 240 updates

(22)

In contrast, BEO-CVaR optimizes control rules rather than specific setpoints. Although the communication channel supports frequent data exchange, the necessity of transmitting new decisions is minimized to a single initial update:

N_{B E O - C V a R} = 1 update

(23)

Consequently, the frequency of critical control interventions is reduced by approximately 99.6%. While the monitoring link remains active, the control bandwidth remains effectively idle for the vast majority of operation, significantly lowering the operational burden.

4.7.2. Robustness Against Communication Failures

The most critical advantage is the system’s resilience during outages.

Traditional Methods (High Real-Time Dependency): A downlink failure causes the loss of crucial 3 min optimal setpoint dispatches ( $N_{t r a d i t i o n a l}$ updates missed). Inverters, unable to receive these updates, must revert to unoptimized fallback modes.
BEO-CVaR (Asynchronous Robustness): In contrast, BEO-CVaR transmits a robust Volt/VAR curve, enabling local control execution. Even if communication is severed immediately after the single initial update (losing all subsequent monitoring data), inverters autonomously adjust reactive power based on local voltage.

In summary, BEO-CVaR converts communication from a synchronous, critical constraint to an asynchronous support function, greatly enhancing grid resilience.

4.8. Stress Testing Under Extreme Physical Conditions

While the previous sections have validated the scalability and communication robustness, it is equally critical to assess the control generality under complex grid characteristics and off-nominal operating states (e.g., weak grids). To this end, a rigorous stress test was conducted on the IEEE 123-node test feeder by introducing three simultaneous physical constraints that drastically compress the feasible solution space:

High Impedance (Weak Grid): All line impedances (Z) are scaled by a factor of $2.0$ , significantly exacerbating voltage sensitivity to power injections.
Heavy Loading: The active and reactive load demands are scaled by $1.5$ , pushing the system far beyond its nominal operating point.
Limited Capacity: The reactive power capacity ( $Q_{max}$ ) of all inverters is reduced to $50 %$ of the nominal value.

Furthermore, the high-impedance (weak grid) scenario serves as a scenario-based sensitivity analysis regarding the linearization assumption used in Section 2. Since linearization errors typically increase with grid impedance, the robust performance observed in this scenario validates that the stability constraints derived from the linear model remain effective even when the grid characteristics challenge the modeling assumptions.

Despite the severe degradation of the physical environment, statistical analysis across 10 independent experiments confirms that the BEO-CVaR framework maintains extremely high robustness. For the vast majority of time steps, nodal voltages were effectively maintained confined within safety bounds (0.95–1.05 p.u.), with the Voltage Violation Rate limited to

1.722 %

and an average voltage deviation of

0.145

p.u.

Figure 21 presents the results of the most representative trial among these 10 experiments, revealing the control mechanism under limit states in detail:

Voltage Profile (Top Panel): It can be observed that, despite severe physical constraints, the vast majority of voltage curves remain within the safety band. Only during the middle period with the most intense PV and load fluctuations (approximately scenarios 3500 to 7500) did a few specific nodes exhibit voltage violations.
Inverter Response (Bottom Panel): The bottom panel clearly explains the cause of these violations. During the violation periods, both the large-capacity Inverter 1 (orange line) and the small-capacity Inverter 2 (green line) demonstrated coordinated stress response behavior; both reached and sustained their physical output lower limits (saturation state) for extended periods.

This phenomenon indicates that the rare voltage violations do not stem from the algorithm’s failure to find an optimal solution, but rather that the algorithm accurately identified the risk and commanded all inverters to release their full available capacity. The residual violations can thus be attributed to the physical scarcity of hardware regulation resources. This result robustly proves that, even on the verge of physical infeasibility, BEO-CVaR can maximize the utilization of existing resources to maintain system stability through “best-effort” saturation control.

5. Conclusions

This paper presents BEO-CVaR, a robust data-driven framework for optimizing Volt/VAR control rules in distribution grids with high DER penetration. The key innovations are the integration of CVaR for risk-averse optimization against severe uncertainty-driven voltage deviation and the use of BEO for efficient global optimization of the non-convex, computation-heavy CVaR objective. BEO-CVaR dynamically tailors control parameters by learning grid behavior through GP regression and strategically exploring the parameter space via BEO.

Experimental results on the IEEE 123-node test system confirm that BEO-CVaR achieves the following:

Ensures Robust Voltage Stability: Maintains voltage deviation within strict safety limits under diverse uncertainty scenarios, achieving near-perfect voltage compliance across standard and high-risk tested conditions.
Mitigates Extreme Risks: Explicitly minimizes the impact of worst-case uncertainty realizations through CVaR, enhancing grid resilience.
Reduces Operational Burden: The framework requires only infrequent updates of decision variables (once every 12 h), reducing the critical control downlink frequency by 99.3% compared to standard 3 min real-time control cycles. This significantly lowers the communication overhead and ensures high robustness against communication outages.
Outperforms Comparative Baselines: Significantly improves voltage profiles compared to traditional decentralized strategies and achieves superior sample efficiency and risk control compared to the meta-heuristic benchmark (i.e., PSO).

In an evolving energy landscape with increasing renewable integration, BEO-CVaR provides a practical and risk-aware solution for optimizing voltage control and enhancing grid stability.

Funding

This research received no external funding.

Data Availability Statement

Publicly available datasets were analyzed in this study. The original IEEE 123 Node Test Feeder data can be found at http://ewh.ieee.org/soc/pes/dsacom/testfeeders/index.html (accessed on 19 February 2025) and are described in [25]. The specific symmetric balanced configuration used in our simulations follows the setup in [18], with data available at https://github.com/saveriob/approx-pf (accessed on 19 February 2025).

Conflicts of Interest

The author declares no conflicts of interest.

References

Murzakhanov, I.; Gupta, S.; Chatzivasileiadis, S.; Kekatos, V. Optimal Design of Volt/VAR Control Rules for Inverter-Interfaced Distributed Energy Resources. IEEE Trans. Smart Grid 2024, 15, 1234–1245. [Google Scholar] [CrossRef]
IEEE. IEEE Standard for Interconnection and Interoperability of Distributed Energy Resources with Associated Electric Power Systems Interfaces. [Online]. 2018. Available online: https://standards.ieee.org/ieee/1547/5915/ (accessed on 10 February 2025).
Iranpour Mobarakeh, S.; Sadeghi, R.; Saghafi, H.; Delshad, M. Hierarchical integrated energy system management considering energy market, demand response and uncertainties: A robust optimization approach. Comput. Electr. Eng. 2025, 123, 110138. [Google Scholar] [CrossRef]
Yang, L.; Li, X.; Sun, M.; Sun, C. Hybrid Policy-Based Reinforcement Learning of Adaptive Energy Management for the Energy Transmission-Constrained Island Group. IEEE Trans. Ind. Inform. 2023, 19, 10751–10762. [Google Scholar] [CrossRef]
Colot, A.; Perotti, E.; Glavic, M.; Dall’Anese, E. Incremental Volt/Var Control for Distribution Networks via Chance-Constrained Optimization. IEEE Trans. Power Syst. 2025, 40, 4561–4573. [Google Scholar] [CrossRef]
Hong, R.; Huang, H.; Guo, Z. Morphology, key technologies and prospects of active support type VSC-HVDC for enhancing the stability of receiving end power grid. Proc. CSEE 2024, 44, 6818–6830. [Google Scholar]
Farivar, M.; Neal, R.; Clarke, C.; Low, S. Optimal inverter VAR control in distribution systems with high PV penetration. In Proceedings of the 2012 IEEE Power and Energy Society General Meeting, San Diego, CA, USA, 22–26 July 2012. [Google Scholar]
Bolognani, S.; Carli, R.; Cavraro, G.; Zampieri, S. Distributed reactive power feedback control for voltage regulation and loss minimization. IEEE Trans. Autom. Control 2015, 60, 966–981. [Google Scholar] [CrossRef]
Turitsyn, K.; Šulc, P.; Backhaus, S.; Chertkov, M. Options for control of reactive power by distributed photovoltaic generators. Proc. IEEE 2011, 99, 1063–1073. [Google Scholar] [CrossRef]
Jahangiri, P.; Aliprantis, D.C. Distributed Volt/VAr control by PV inverters. IEEE Trans. Power Syst. 2013, 28, 3429–3439. [Google Scholar] [CrossRef]
Cicek, A.; Erdinc, O. Optimal Bidding Strategy Considering Bilevel Approach and Multistage Process for a Renewable Energy Portfolio Manager Managing RESs With ESS. IEEE Syst. J. 2022, 16, 6062–6073. [Google Scholar] [CrossRef]
Bolognani, S.; Carli, R.; Cavraro, G.; Zampieri, S. On the need for communication for voltage regulation of power distribution grids. IEEE Trans. Control Netw. Syst. 2019, 6, 1111–1123. [Google Scholar] [CrossRef]
Bolognani, S. Networked-Volt-Var: A Comparison of Volt/VAR Feedback Control Laws: Fully Decentralized vs. Networked Strategies. GitHub Repository. 2018. Available online: https://github.com/saveriob/networked-volt-var (accessed on 2 February 2025).
Nguyen, Q.P.; Dai, Z.; Low, B.K.H.; Jaillet, P. Optimizing Conditional Value-at-Risk of Black-Box Functions. In Proceedings of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Online, 6–14 December 2021; Volume 34, pp. 4170–4180. [Google Scholar]
Nguyen, Q.P.; Dai, Z.; Low, B.K.H.; Jaillet, P. Value-at-Risk Optimization with Gaussian Processes. In Proceedings of the International Conference on Machine Learning, Online, 18–24 July 2021; pp. 8063–8072. [Google Scholar]
Nguyen, Q. Bayesian Optimization in Action; Manning Publications Co.: Shelter Island, NY, USA, 2023; ISBN 978-1-61729-932-8. Available online: https://www.manning.com/books/bayesian-optimization-in-action (accessed on 1 February 2025).
Sain, R.; Mittal, V.; Gupta, V. A Comprehensive Review on Recent Advances in Variational Bayesian Inference. In Proceedings of the 2015 International Conference on Advances in Computer Engineering and Applications, Ghaziabad, India, 19–20 March 2015; pp. 488–492. [Google Scholar]
Bolognani, S.; Zampieri, S. On the existence and linear approximation of the power flow solution in power distribution networks. IEEE Trans. Power Syst. 2016, 31, 163–172. [Google Scholar] [CrossRef]
Vovos, P.N.; Kiprakis, A.E.; Wallace, A.R.; Harrison, G.P. Centralized and distributed voltage control: Impact on distributed generation penetration. IEEE Trans. Power Syst. 2007, 22, 476–483. [Google Scholar] [CrossRef]
Li, N.; Qu, G.; Dahleh, M. Real-time decentralized voltage control in distribution networks. In Proceedings of the Allerton Conference on Communication, Control, and Computing, Allerton, IL, USA, 30 September–3 October 2014; pp. 582–588. [Google Scholar]
Farivar, M.; Zhou, X.; Chen, L. Local voltage control in distribution systems: An incremental control algorithm. In Proceedings of the IEEE International Conference on Smart Grid Communications, Miami, FL, USA, 2–5 November 2015. [Google Scholar]
Kekatos, V.; Zhang, L.; Giannakis, G.B.; Baldick, R. Voltage regulation algorithms for multiphase power distribution grids. IEEE Trans. Power Syst. 2016, 31, 3913–3923. [Google Scholar] [CrossRef]
Shan, Y.; Hu, J.; Liu, H. A Holistic Power Management Strategy of Microgrids Based on Model Predictive Control and Particle Swarm Optimization. IEEE Trans. Ind. Inform. 2022, 18, 5115–5126. [Google Scholar] [CrossRef]
Zhang, L.; Zheng, H.; Hu, Q.; Su, B.; Lyu, L. An Adaptive Droop Control Strategy for Islanded Microgrid Based on Improved Particle Swarm Optimization. IEEE Access 2020, 8, 3579–3593. [Google Scholar] [CrossRef]
Kersting, W.H. Radial distribution test feeders. In Proceedings of the IEEE PES Winter Meeting, Columbus, OH, USA, 28 January–1 February 2001; Volume 2, pp. 908–912. [Google Scholar]

Figure 1. Illustration of the Volt/VAR control rule. The solid blue line represents the reactive power output curve, and the black dashed lines indicate the control parameters and limits.

Figure 2. Framework of BEO-CVaR.

Figure 3. The flowchart of the voltage deviation calculation function.

Figure 4. Standard IEEE 123-node test feeder. The numbers in the diagram represent the node indices.

Figure 5. (a) The aggregate load demand at all nodes during the 12 h simulation period (06:00–18:00). (b) The generation profiles of the two newly added PV units. The purple curve represents PV Unit 1, and the orange curve represents PV Unit 2.

Figure 6. Sensitivity analysis of control performance with respect to the risk confidence level

α

. The blue dashed line represents the average voltage deviation (efficiency), while the red solid line represents the extreme tail risk (99.9% worst deviation).

Figure 6. Sensitivity analysis of control performance with respect to the risk confidence level

α

. The blue dashed line represents the average voltage deviation (efficiency), while the red solid line represents the extreme tail risk (99.9% worst deviation).

Figure 7. CVaR variation during BEO calculation.

Figure 8. Eight optimization parameters across evaluations.

Figure 9. GP hyperparameter evolution.

Figure 10. Per-dimension evolution of lengthscales.

Figure 11. Statistical analysis of BEO-CVaR performance and computational cost across 8 independent runs: (a) Optimization phase duration (68 iterations). (b) Total execution time, including initialization. (c) Distribution of final CVaR values. (d) Distribution of voltage violation rates, showing consistently low violations. (e) Distribution of average voltage deviation. The boxplots indicate the median (line), interquartile range (box), and full data range (whiskers), with individual run results overlaid as points.

Figure 12. Performance validation using the representative decision vector across 8640 scenarios. (Top) Voltage profiles of 56 nodes. The red and blue dashed lines represent the upper (1.05 p.u.) and lower (0.95 p.u.) safety limits, respectively. (Bottom) Reactive power output of the two inverters. Inverter 1 (large capacity) handles dynamic fluctuations, while Inverter 2 (small capacity) provides consistent support.

Figure 13. Overvoltage contingency of uncontrolled strategy. The multiple curves represent the voltage profiles of the 56 monitored buses. The dashed lines indicate the upper (1.05 p.u.) and lower (0.95 p.u.) voltage limits. The orange curve represents Inverter 1, and the blue curve represents Inverter 2.

Figure 14. Overvoltage contingency of static strategy. The orange curve represents Inverter 1, and the blue curve represents Inverter 2. (a) Voltage profiles of 56 buses (in p.u.) versus time (in h). The gray lines represent individual bus voltages, and the dashed lines indicate the safety limits. (b) Reactive power output (in p.u.) of two PV inverters versus time (in h).

Figure 15. Overvoltage contingency of incremental strategy. The orange curve represents Inverter 1, and the blue curve represents Inverter 2. (a) Voltage profiles of 56 buses (in p.u.) versus time (in h). The gray lines represent individual bus voltages, and the dashed lines indicate the safety limits. (b) Reactive power output (in p.u.) of two PV inverters versus time (in h).

Figure 16. Overvoltage contingency of PSO strategy: (a) Voltage profiles of 56 nodes (in p.u.) versus scenario index. (b) Reactive power output (in p.u.) of two PV inverters versus scenario index.

Figure 17. Minimum CVaR values among all evaluated parameter combinations obtained using different acquisition functions. The solid lines represent the mean performance across multiple runs, and the colored shadow areas indicate the 95% confidence intervals.

Figure 18. Two-inverter case: 56-node voltage curve and inverter reactive power output curve.

Figure 19. Five-inverter case: 56-node voltage curve and inverter reactive power output curve.

Figure 20. Ten-inverter case: 56-node voltage curve and inverter reactive power output curve.

Figure 21. Performance validation under extreme physical stress testing (weak grid, heavy loading, and limited capacity). (Top) Voltage profiles of 56 nodes across 8640 scenarios. The vertical axis represents the voltage magnitude in per unit (p.u.), and the horizontal axis corresponds to the load scenario index (time steps). (Bottom) Corresponding reactive power outputs of the two inverters. The vertical axis denotes reactive power (Mvar/p.u.).

Table 1. BEO-CVaR parameters and their values.

Parameter	Value	Parameter	Value
Dimensions	8	Restarts	50
Initial samples	20	Raw samples	200
Optimization iterations	68	Risk level ( $α$ )	See Section 4.3.1

Table 2. Feasible search space for Volt/VAR control vector.

Parameter	Min	Max	Parameter	Min	Max
${\bar{v}}_{1}$	0.95	1.05	${\bar{v}}_{2}$	0.95	1.05
$δ_{1}$	0.00	0.03	$δ_{2}$	0.00	0.03
$σ_{1}$	0.03	0.180	$σ_{2}$	0.03	0.180
${\bar{q}}_{1}$	−1.0	1.0	${\bar{q}}_{2}$	−0.2	0.2

Table 3. Statistical performance metrics over 8 independent runs.

Metrics	Mean ± Std	Unit
CVaR	$0.0160 \pm 0.0012$	-
Voltage Violation Rate	$0.1030 \pm 0.0834$	%
Avg. Voltage Deviation	$0.0113 \pm 0.0006$	p.u.

Table 4. Computational time breakdown (Hardware: AMD64 Family 25 Model 97).

Metric	Mean ± Std	Unit
Optimization Time (68 Iterations)	$1522.74 \pm 138.00$	s
Initialization Time (20 Samples)	$347.01 \pm 45.00$	s
Total Execution Time	1869.75 ± 127.68	s

Table 5. Comparison of different strategies based on various performance metrics.

Strategy	Overall Compliance	Non-Compliant Bus Count	High-Risk Violation Rate	Comm. Frequency
Uncontrolled	95.75%	6	10.71%	–
Static Feedback	98.61%	4	5.21%	Real-time
Incremental Feedback	97.17%	5	8.64%	Real-time
PSO (Meta-heuristic)	99.47%	2	2.34%	12 h
BEO-CVaR (Proposed)	99.90%	1	0.50%	12 h

Table 6. Performance violation rates vs. number of inverters (CVaR = 0.95).

Number of Inverters	Voltage Violation Rate (%)
2	0.3121%
5	0.0062%
10	0.1007%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Tian, Y.-N. Robust Voltage Control in Distribution Networks via CVaR-Based Bayesian Optimization. Electronics 2026, 15, 154. https://doi.org/10.3390/electronics15010154

AMA Style

Tian Y-N. Robust Voltage Control in Distribution Networks via CVaR-Based Bayesian Optimization. Electronics. 2026; 15(1):154. https://doi.org/10.3390/electronics15010154

Chicago/Turabian Style

Tian, Ye-Ning. 2026. "Robust Voltage Control in Distribution Networks via CVaR-Based Bayesian Optimization" Electronics 15, no. 1: 154. https://doi.org/10.3390/electronics15010154

APA Style

Tian, Y.-N. (2026). Robust Voltage Control in Distribution Networks via CVaR-Based Bayesian Optimization. Electronics, 15(1), 154. https://doi.org/10.3390/electronics15010154

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Robust Voltage Control in Distribution Networks via CVaR-Based Bayesian Optimization

Abstract

1. Introduction

2. The Formulation of Volt/VAR Control Rules Optimization

2.1. System Modeling

2.2. Volt/VAR Control Rule

3. The Proposed BEO-CVaR

3.1. BEO

3.1.1. Training a GP Surrogate Model

3.1.2. Constructing and Optimizing the Acquisition Function

3.1.3. Evaluation and Update

3.2. BEO-CVaR Algorithm Framework

4. Experiment and Result Analysis

4.1. Test Case

4.2. Experimental Setting

4.3. Results and Analysis

4.3.1. Sensitivity Analysis and Selection of Risk Parameter α

4.3.2. Optimization Process and Convergence Analysis

4.3.3. Statistical Reproducibility and Computational Efficiency

4.3.4. Performance Validation on Standard Test Scenarios

4.4. Comparison with Traditional and Meta-Heuristic Strategies

4.4.1. Comparison with Traditional Decentralized Methods

4.4.2. Comparison with Meta-Heuristic Optimization (PSO)

4.5. Comparison of Four Acquisition Function Strategies

4.6. Scalability Analysis in Multi-Inverter Scenarios

4.7. Quantitative Analysis of Communication Efficiency and Robustness

4.7.1. Reduction in Mandatory Control Updates

4.7.2. Robustness Against Communication Failures

4.8. Stress Testing Under Extreme Physical Conditions

5. Conclusions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.3.1. Sensitivity Analysis and Selection of Risk Parameter $α$