1. Introduction
In recent years, the penetration of PV generation in DNs has increased markedly. The high penetration of renewable energy sources has introduced new technical challenges [
1]. The strong intermittency and stochasticity of PV generation have made the operating conditions of DNs increasingly complex, leading to more prominent issues such as voltage fluctuations, voltage exceeding limits, and degradation of power quality [
2]. Therefore, how to achieve reactive power optimization and voltage dynamic regulation in DNs with high PV penetration is critical for ensuring the safe, efficient, and stable operation of DNs. At present, a large number of researchers have proposed various reactive power and voltage optimization control methods to address the integration of large-scale distributed generation, achieving significant results. These methods are primarily categorized into two types: centralized and distributed reactive power optimization approaches.
Centralized reactive power optimization methods typically rely on a central control center, which collects data from the entire network and uses optimization algorithms to compute the optimal reactive power distribution. To address voltage fluctuations caused by the integration of distributed generation, researchers have developed various centralized reactive power optimization and control approaches. Reference [
3] introduces a convex relaxation-based centralized optimal power flow method using the branch flow model, offering a theoretical foundation for implementing Volt–var control in DNs. In reference [
4], comparative studies validated the effectiveness of centralized Volt–var control in maintaining voltage stability and meeting relevant constraints. A linear programming method based on sensitivity coefficients was proposed in [
5], enabling rapid real-time dispatch without the need for specific network parameters. Additionally, genetic algorithms, HGPSO, and MPC have been widely applied to centralized reactive power optimization [
6,
7,
8,
9]. For instance, to achieve multi-objective coordinated control, fuzzy sets and genetic algorithms were introduced in [
10] to design a centralized reactive power optimization model that simultaneously accounts for voltage deviation, power loss, and compensation costs, demonstrating strong adaptability and convergence performance. However, while centralized reactive power optimization methods are capable of achieving the global optimum, they require the continuous collection of measurement data from the entire network to perform centralized calculations. This leads to a significant reliance on communication infrastructure, with high demands for communication bandwidth, delay tolerance, and data integrity [
11]. In situations where communication quality is unstable or faults arise, the real-time performance and execution effectiveness of control strategies can be severely impacted. Moreover, as DNs grow in scale, the number of nodes and their state dimensions increase, causing a sharp rise in computational complexity for centralized approaches. Particularly when tackling non-convex optimal power flow problems in dynamic environments, these methods face challenges in adapting to scenarios characterized by high-frequency changes [
12,
13]. Therefore, the limitations of centralized approaches are particularly evident in rural distribution networks with high PV penetration, such as those in China. These systems often suffer from weak communication infrastructure, complex topologies, and frequent voltage fluctuations caused by intermittent PV output. To address the shortcomings of the centralized control, distributed reactive power optimization control methods have gradually become a research hotspot. Recent studies highlight the practical value of distributed coordination in such contexts. For instance, in Thailand’s Mae Sa Riang microgrid, distributed BESS optimization reduced voltage deviation by 46% using only local data and neighbor communication [
14]. Meanwhile, in European energy communities, day-ahead optimization incorporating distributed PV, storage, and flexible loads has shown potential for improving cost-effectiveness and adaptability to local demand patterns [
15]. These cases confirm that distributed strategies effectively resolve voltage violations and operational challenges in PV-dense networks with constrained communication, which is critical for energy communities, rural grids, and non-interconnected island systems.
These methods no longer rely on a central controller for the unified collection and centralized computation of entire network information. Instead, reactive power control decisions are made through multi-agent collaboration, with each node relying solely on local measurement data and limited communication with neighboring PV nodes. This ensures power quality while improving the scalability and control robustness of DNs [
16]. Various typical distributed reactive power control strategies have been proposed to date, one of which is the traditional droop control advocated by the IEEE 1547-2018 standard [
17,
18,
19]. It is a static feedback method based on local voltage measurements, where the local bus voltage is monitored and reactive power injection is adjusted accordingly. However, ref. [
20] shows that under certain conditions, droop controllers may fail to maintain a feasible voltage distribution. Since these static feedback rules can lead to oscillatory behavior, incremental control strategies have been proposed. This class of methods introduces historical reactive power set-point feedback based on traditional static control. Nodes adjust power injection according to voltage magnitude and past reactive power set-points [
21,
22]. In addition, new methods based on distributed primal–dual algorithms and push–sum algorithms have been proposed [
23,
24,
25]. For example, ref. [
25] investigates the general constraints of distributed online optimization, specifically coupled inequality constraints. A distributed primal–dual algorithm for online optimization was proposed, and it was proven that this method can achieve sublinear static regret, which indicates the algorithm’s capability of maintaining long-term optimization performance and gradually converging to near-optimal solutions. However, most existing distributed online voltage control methods fail to adequately account for real-world operating conditions in DNs, such as severe voltage fluctuations and the stochastic fluctuations of loads and distributed generation. Additionally, they neglect comprehensive research on voltage quality optimization.
In practical applications such as federated learning and wireless sensor networks, terminal devices are often subject to delays when executing computational tasks and transmitting data over networks [
26]. These factors collectively induce information delays. Thus, extensive literature has thoroughly investigated the impact of information delays on the performance of various optimization algorithms [
27,
28,
29,
30,
31,
32,
33,
34]. For example, ref. [
27] explores the feasibility of employing a dual averaging algorithm and mirror descent algorithm to handle delayed information in the context of distributed delayed stochastic optimization. In [
28], a distributed mirror descent algorithm with subgradient delay is presented for distributed convex optimization. The effect of delays to the consensus algorithm over a distributed system of agents having time-varying network graphs is studied in [
30]. Meanwhile, the feedback delay apparently existed in online optimization as well. To address unattainable real-time feedback, ref. [
31] proposes the online optimization algorithm utilizing delayed feedback information. Asynchronous online learning with nonlinear network constraints in multi-agent systems is explored in [
32,
33], wherein primal and dual variables are updated via delayed gradient information, respectively.
To address the aforementioned issues, this paper proposes a distributed online reactive power–voltage optimization control method for voltage regulation in DNs. Building on this foundation, distributed online optimization problems with time-varying constraints are further investigated. To align with practical communication environments where state feedback suffers from delays, a delayed feedback mechanism is introduced. Specifically, during each decision-making round, each node cannot immediately access accurate feedback from its current neighboring PV nodes but only receives time-lagged historical information. The paper focuses on analyzing the impact of delayed feedback on the performance and robustness of the proposed DOPP algorithm under asynchronous communications and limited feedback conditions.
Specifically, this paper first formulates an optimization model that aims to minimize voltage deviation in DNs. The model is subject to the upper and lower limits of node voltages and the reactive power output limits of distributed PV inverters. To overcome the challenges of high communication volume and computational burden associated with traditional centralized solving approaches, a DOPP method is introduced. This method coordinates limited neighbor-to-neighbor communication, incorporating a push–sum strategy to enable rapid consensus and global convergence of Lagrange multipliers under unbalanced grid topologies. Each PV inverter node can independently and concurrently update its reactive power decisions online using only local measurements and neighbor communication data, significantly reducing both communication overhead and computational complexity at individual nodes. At the same time, the method enables real-time tracking of load and PV output fluctuations, dynamically optimizing voltage levels and line losses. Particularly, with the integration of a delayed feedback mechanism, the DOPP method maintains effective and stable reactive power control even when each PV node operates under realistic communication conditions where only delayed feedback from local measurements and neighbors is available. This demonstrates enhanced robustness to delays and excellent dynamic adaptability.
To better illustrate the novelty of our method in comparison with related approaches,
Table 1 provides a structured comparison of representative algorithms, focusing on five dimensions: voltage constraint handling, reactive power constraint consideration, communication requirements, delay treatment, and handling of feedback delays. Unlike existing methods, the proposed strategy jointly considers strict voltage and reactive power limits, fixed feedback delays, and realistic unbalanced communication, which enhances its adaptability and robustness under practical distribution network conditions.
2. Distributed Voltage Control Framework Based on DOPP Algorithm with Feedback Delays
Figure 1 illustrates the overall framework of the proposed distributed voltage control strategy for distribution networks with high PV penetration. The framework integrates physical-layer modeling of the power system with a communication layer structure based on local information exchange. In particular, the communication layer only allows data exchange between neighboring PV nodes that are directly connected without any intermediate PV nodes. This framework reflects the overall approach and idea of this study, which aims to develop a distributed optimization and control strategy that explicitly accounts for communication delays and local information exchange among neighboring PV nodes.
Each PV unit is associated with a local agent responsible for measuring the voltage magnitude and reactive power at its bus and executing the distributed voltage control algorithm. A voltage optimization model is formulated to minimize voltage deviations across all PV nodes while satisfying the reactive power capacity limits of PV inverters and the acceptable voltage bounds at each node.
The proposed control architecture consists of two layers: the physical layer, which represents the basic electrical structure of the DN with PV integration, and the communication layer, which characterizes localized information exchange among agents through a neighbor-to-neighbor communication topology represented by a column-stochastic adjacency matrix.
To solve the optimization problem in a distributed manner, each agent implements a DOPP algorithm with feedback delays. At each iteration, delayed state information is exchanged between neighboring PV nodes, enabling each agent to update its local reactive power decisions based on local measurements and delayed feedback from neighboring PV nodes. This architecture ensures robust and stable voltage regulation even under communication constraints and asynchronous update conditions.
Specifically, the following sections detail the system modeling, the voltage optimization problem formulation, the algorithm design with communication delays, and the performance evaluation under multiple scenarios.
4. Distributed Online Voltage Control Framework Under Feedback Delay
4.1. Push–Sum Algorithm for Unbalanced Communication
The imbalance in communication weights challenges traditional distributed consensus algorithms, which fail to accurately normalize dual and state variables during distributed optimization. To address this issue, various strategies have been proposed in the literature, including the push–sum method [
38,
39,
40], the surplus-based adjustment method [
41], the row-stochastic matrix correction method [
42], and the epigraph method [
43]. In this work, the push–sum method is used to compensate for the normalization bias caused by unbalanced communication.
Specifically, the push–sum method introduces an additional weight variable combined with a normalization operation to achieve consensus updates of all state variables. The update process of the variables can be expressed as follows:
where
is the normalization weight variable initialized to
1, which serves to eliminate the normalization bias caused by unbalanced communication and mitigate the asymmetry of the communication topology.
and
represent the dual variable and auxiliary variable of node
i, respectively, where
quantifies the degree of voltage violation with respect to the upper and lower bounds at node
i, and
dynamically tracks the voltage violation behavior.
For PV inverters, although they can be connected to wireless communication networks at low cost, feedback delays are inevitable. Such delays may arise from various factors including network congestion, bandwidth limitations, limited computational capacity of agents, and data transmission latency. To better reflect the delay characteristics observed in practical DNs, a fixed-step feedback delay model is developed in the communication layer to simulate asynchronous interactions among PV inverter nodes. Specifically, there exists a fixed time lag in the exchange of information between PV nodes, meaning that each node can only access delayed historical feedback from its neighbors rather than real-time state information. In contrast, each node has immediate access to its own state variables (e.g., local voltage and reactive power output), which ensures the real-time responsiveness of the local control strategy.
Therefore, based on the push–sum method for unbalanced communication, a distributed online optimization algorithm is developed that can handle feedback delays in voltage and reactive power control of DNs. Specifically, based on the state update mechanism in (21), a fixed-step delay
τ is introduced to construct a new state update mechanism:
4.2. Framework of the DOPP Algorithm Under Feedback Delay
To address voltage violations in DNs with high PV penetration, especially under realistic conditions with inherent feedback delays, this paper proposes a DOPP algorithm incorporating a fixed-step delay model. The approach is developed based on the primal–dual method and the push–sum communication strategy. The main framework of the proposed DOPP algorithm is structured as follows:
A convex optimization model is formulated, aiming to minimize the voltage deviation at DN nodes while incorporating both the voltage bounds and reactive power capacity constraints of distributed PV inverters. Based on this model, each PV inverter node initializes local decision variables, dual variables, and auxiliary variables. The communication weight matrix is defined as column-stochastic, rather than row-stochastic, to accommodate unbalanced communication and employ the push–sum method to ensure accurate normalization of state variables under this asymmetric setting.
To address the commonly observed feedback delays in practical communication processes, a fixed-step delay model is introduced and embedded into the push–sum communication mechanism. The weight variable, dual variable, and auxiliary variable are updated using delayed push operations. This push mechanism adheres to the principles of asynchronous communication, where the communication weights depend on the out-degree of the transmitting nodes. Notably, each node consistently employs its most recent local state variables during the push process, thereby ensuring the timeliness of local information.
At each iteration t, upon receiving the delayed dual variable feedback from neighboring PV nodes, the composite gradient at node i is formed by combining the gradient of the objective function and the gradient of the coupled voltage inequality constraints, i.e., , where is a weight variable introduced to eliminate the imbalance of the communication graph. This composite gradient reflects the influence of both the optimization objective and the constraint penalty terms on the primal variable . Meanwhile, the primal variable is updated using the gradient descent method and projection operation, enabling the online optimization of reactive power injection at PV inverter nodes.
Based on the constructed Lagrangian function , the DOPP algorithm iteratively performs weight and dual variable push (considering delayed neighbor feedback), gradient computation, and updates of the primal, dual, and auxiliary variables. These steps collectively enable the online cooperative optimization of voltage violation problem in DNs.
4.3. Online Solution Procedure of the DOPP Algorithm for Distributed Voltage Control
For all nodes , initialize , , , , , .
At any iteration
, each node
i receives delayed state information from its neighboring PV nodes
(with a fixed delay
), and the weight variables are updated first as follows:
Then, the dual variables and auxiliary variables are pushed and summed separately, both affected by the feedback delay
, as follows:
where
represents the communication weight from neighboring PV node
j to node
i, which depends only on the out-degree of the transmitting node
j, ensuring the consistency of the push–sum method under unbalanced communication. It is worth emphasizing that for local variables such as
,
and
, which do not involve communication, the most recent state values are always used and are not affected by delays.
Each node computes its composite gradient based on the current gradient direction and the delayed dual feedback. Specifically, the subgradient at node
i is given by
where
is the value of
associated with node
i, and
and
correspond to the gradients of
and
, respectively, each scaled by
and
, and then added to the subgradient.
It is worth noting that to reduce computational complexity, the reactive power variable is incorporated into the projection step, where the computed reactive power is projected onto the interval
. Specifically,
where
denotes the step size, and the projection ensures that the reactive power
remains within the interval
.
The dual variable associated with the lower and upper voltage limits,
and
, are updated, respectively, as follows:
where
, and
is a parameter primarily used to suppress unbounded growth; [·]
+ represents the component-wise non-negative projection, ensuring that the dual variable
μ ≥ 0.
The above two equations are used to track the variations of the lower inequality constraints and upper voltage inequality constraints , respectively.
Let , and repeat steps 2 through 6 until convergence is achieved or the maximum number of iterations is reached. Upon termination, the result obtained in Step 4 corresponds to the reactive power output of the PV inverter node i.
5. Case Study
In this section, all simulations are implemented in MATLAB version R2022a using the MATPOWER toolbox, which provides a convenient environment for modeling distribution networks and solving optimal power flow problems. As shown in
Figure 2, the modified IEEE 123-bus DNis adopted to numerically evaluate the proposed distributed online voltage control algorithm, with a system base power of 1 MVA and a nominal voltage of 4.16 kV. Ten PV units, named PV1 to PV10, are installed at buses 24, 31, 38, 49, 56, 62, 73, 82, 95, and 102, respectively. To ensure a fair comparison of voltage regulation performance under different control methods, all PV units are assigned fixed upper limits for reactive power regulation throughout the simulation. Specifically, PV1 to PV6 are limited to a maximum reactive power of 0.5 Mvar, while PV7 to PV10 are limited to 0.25 Mvar. It is assumed that each PV node is equipped with a smart meter, enabling the measurement of active power, reactive power, and voltage magnitude at each node. Meanwhile, the acceptable voltage range across all nodes is set to [0.95, 1.05] p.u. to ensure compliance with voltage standards. The entire model is formulated in MATPOWER format and simulated in MATLAB, and can be fully reconstructed from the provided configuration and code.
5.1. Static Scenario Analysis
The paper investigates the impact of feedback delay length on the performance of the proposed voltage control method.
Figure 3 and
Figure 4, respectively, illustrate the node voltage profiles and reactive power outputs of the DN under feedback delays of 0, 2, 5, and 10 time steps. It is worth noting that the acceptable voltage range in this study is set to [0.95, 1.05] p.u., which is suitable for distributed implementation as it does not rely on global average voltage information.
As shown in
Figure 3, increasing the number of feedback delay steps leads to larger initial fluctuations in node voltages and a slower convergence rate. This performance degradation arises because longer delays cause nodes to update their states based on outdated information, increasing the estimation error and weakening the responsiveness of the controller to voltage deviations. Nevertheless, as the algorithm gradually converges, the proposed method effectively regulates voltages within the acceptable range [0.95, 1.05] p.u., achieving noticeable reductions in steady-state voltage deviation.
Similarly,
Figure 4 illustrates the variation trends of reactive power outputs at different nodes under various feedback delay settings. As the delay step increases, the magnitude of initial fluctuations becomes larger, and more iterations are required for the system to reach the final reactive power outputs. However, as the algorithm gradually converges, the reactive power outputs of all nodes eventually stabilize within their feasible ranges, effectively reducing the voltage deviations across the network.
Therefore, it can be concluded that under fixed feedback delays, the DOPP algorithm is still capable of effectively achieving voltage regulation and optimization. The reactive power outputs at all nodes can stably converge to feasible solutions, demonstrating good convergence and feasibility of the proposed method.
5.2. Dynamic Scenario Analysis
To evaluate the effectiveness of the proposed DOPP voltage control method under feedback delays in a dynamic environment, a time-varying distribution system is considered. The simulation is conducted on the IEEE 123-bus test system with high PV penetration. The daily load profile and PV generation curve are shown in
Figure 5 and
Figure 6, respectively. Notably, to test the regulation performance and robustness of the proposed algorithm under dynamic load disturbances, a non-standard load curve, as shown in
Figure 5, was designed to simulate load fluctuation and disturbance conditions.
Figure 7 illustrates the voltage variations at typical PV nodes from 6:00 to 18:00 without any voltage control strategy applied. When the load is low but PV generation is high, some node voltages exceed 1.09 p.u. due to the lack of effective reactive power regulation, which is far above the allowable upper voltage limit of 1.05 p.u. In contrast, during early morning and evening hours, when the distribution system is heavily loaded and PV generation is unavailable, some nodes’ minimum voltages drop to around 0.96 p.u., near the acceptable lower bound of 0.95 p.u. This indicates that without any voltage control measures, the system suffers from significant voltage fluctuations and potential overvoltage or undervoltage risks. Therefore, it is necessary to coordinate the reactive power output of distributed PV inverters to dynamically compensate for local voltage, thereby alleviating voltage violations and enhancing the system’s voltage stability and operational reliability.
To further validate the regulation performance and robustness of the proposed distributed optimization control method under dynamic operating conditions, simulations were conducted based on the previously defined daily load and PV generation profiles. The authors selected droop control and incremental control for comparison and conducted a comparative analysis of the voltage regulation performance of different control strategies under dynamic conditions. Moreover, the performance of the DOPP algorithm was compared under both delay-free and fixed feedback delay scenarios (with a delay
), in order to assess its robustness and adaptability in the presence of communication disturbances. The dynamic voltage profiles of key nodes under four control strategies are shown in
Figure 8, including droop control, incremental control, DOPP algorithm without delay, and DOPP algorithm with fixed feedback delay.
From
Figure 8, it can be observed that all four control scenarios are able to mitigate the voltage violation issues caused by PV output fluctuations to a certain extent, thereby achieving dynamic voltage regulation. Droop control performs local regulation based on a predefined linear voltage–reactive power (V–Q) characteristic. Although this method is simple and easy to implement, its flexibility is limited, resulting in noticeable voltage violations at certain nodes. Incremental control iteratively adjusts reactive power injections to progressively suppress voltage deviations. This improves control accuracy compared to droop control, yet the response speed remains insufficient when facing rapid changes in load or PV generation.
Compared with other strategies, the proposed DOPP control approach, which relies on a distributed optimization mechanism based on neighboring information, achieves faster and more accurate responses to system disturbances, exhibiting superior dynamic regulation capability and steady-state performance. Under zero feedback delay, the DOPP algorithm provides the best voltage control performance, with node voltages quickly converging and stably maintained within the acceptable range of [0.95, 1.05] p.u. When fixed feedback delay is introduced, the initial response speed of the algorithm slightly decreases, and the voltage fluctuations slightly increase. However, the system still achieves stable convergence within a limited number of iterations, and the final control outcome remains close to the no-delay scenario, demonstrating strong robustness.
Figure 9 shows the time evolution of reactive power outputs at PV nodes under the DOPP algorithm, with and without feedback delay. It can be observed that there are noticeable differences in the reactive power trajectories among PV nodes. The primary reason for these differences lies in the DOPP algorithm’s reliance on local voltage measurements and neighbor-based communication, rather than global system information. The varying electrical locations and local voltage fluctuations within the IEEE 123-bus system also result in varying reactive power responses across PV units. Meanwhile, a clear discrepancy is observed between the two cases, indicating that feedback delay impacts the reactive power regulation process.
Under the DOPP control without feedback delay, each PV node is able to quickly respond to local voltage deviations through distributed computation and coordination with neighboring PV nodes, enabling fast adjustment and dynamic optimization of reactive power. The reactive power output curves exhibit smooth transitions with good synchronization and convergence, indicating effective coordination capability of the system. In contrast, when fixed feedback delays are introduced, PV nodes can only obtain outdated neighboring state information during updates, resulting in short-term prediction errors and delayed responses in local control.
As shown in
Figure 9, some nodes experience short-term oscillations and overshoot during the regulation process due to state estimation errors caused by asynchronous information under feedback delays. Nevertheless, as time progresses, the DOPP algorithm with fixed feedback delays still achieves coordinated convergence of the reactive power outputs across all nodes, eventually stabilizing within their feasible control regions. These results demonstrate that the proposed method possesses strong adaptability and robustness, and is capable of effectively supporting the coordinated reactive power control of PV nodes in DNs.
Therefore, it can be concluded that although feedback delays may cause short-term lags and fluctuations in reactive power response, the DOPP algorithm still exhibits good long-term stability and convergence performance. It enables a more rational distribution of reactive power and enhances the dynamic coordination capability among nodes.
6. Conclusions
This paper proposes a distributed online reactive power optimization and control method for voltage regulation in DNs, and introduces a fixed-step feedback delay model to construct a more realistic asynchronous communication control scheme.
In terms of model formulation, based on a linearized power flow model, this paper formulates a quadratic objective to minimize voltage deviations by adjusting PV reactive power, subject to voltage and reactive power constraints.
Regarding communication topology modeling, and algorithm design, to address communication asymmetry in DNs, a column-stochastic matrix is used to form an asymmetric topology, and the push–sum method is applied for state normalization. Furthermore, a fixed-step feedback delay mechanism is introduced into the communication layer, enabling the design of a distributed online optimization algorithm that accommodates delayed information exchange.
Numerical simulations conducted on the modified IEEE 123-bus test system validate the effectiveness of the proposed method. Results show that (1) under static conditions, the DOPP algorithm with feedback delays exhibits stable convergence and effectively mitigates voltage violations within 50–100 iterations, regulating all nodal voltages into the permissible range of [0.95, 1.05] p.u.; (2) under dynamic conditions, the proposed method ensures effective voltage regulation and coordinated reactive power control, achieving 100% voltage compliance across all nodes throughout the simulation horizon. Compared with traditional droop and incremental control strategies, the proposed algorithm maintains all nodal voltages strictly within the permissible range of [0.95, 1.05] p.u., thereby improving voltage stability and demonstrating strong responsiveness and robustness under time-varying operating scenarios.
Considering the potential risk of communication interruptions in practical DN operations, future research will focus on analyzing and enhancing the adaptability and robustness of the proposed algorithm under partial communication failure scenarios, thereby improving its feasibility for real-world engineering applications.