Distributionally Robust Stackelberg Transactive Control Under Imperfect Price Signals

Morales, Pablo; Mojica-Nava, Eduardo; Rey, Juan M.

doi:10.3390/app16136679

Open AccessArticle

Distributionally Robust Stackelberg Transactive Control Under Imperfect Price Signals

by

Pablo Morales

¹

,

Eduardo Mojica-Nava

^1,*

and

Juan M. Rey

²

¹

Department of Electrical and Electronics Engineering, Universidad Nacional de Colombia, Bogotá 111321, Colombia

²

Department of Electrical, Electronics, and Telecommunications Engineering, Universidad Industrial de Santander, Bucaramanga 680002, Colombia

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2026, 16(13), 6679; https://doi.org/10.3390/app16136679

Submission received: 5 June 2026 / Revised: 22 June 2026 / Accepted: 26 June 2026 / Published: 3 July 2026

(This article belongs to the Special Issue Research Progress of Smart Grids and Microgrids)

Download

Browse Figures

Versions Notes

Abstract

This paper develops a distributionally robust transactive control architecture that explicitly models ambiguity in the implementation of decentralized price signals in networked energy systems. Unlike conventional formulations that assume perfect signal execution, the proposed framework integrates a Wasserstein distributionally robust optimization (WDRO) layer based on an ambiguity set within a bilevel Stackelberg coordination structure, where uncertainty affects the executed price rather than only exogenous demand or generation. A tractable dual reformulation is derived, and the economic and physical layers are unified into a continuous-time dynamic system through predictive sensitivity coupling. Extensive parametric validation over ambiguity radius and perturbation levels identifies a robust operating region in which steady-state price volatility remains bounded while physical feasibility is preserved. The results also reveal a tuning-sensitive regime for very small ambiguity radii under nominal conditions, highlighting the role of distributional regularization in stabilizing coupled economic–physical dynamics. Overall, the proposed WDRO-based formulation provides system-level insight for coordination design under implementation-layer uncertainty in decentralized system-of-systems architectures, while remaining computationally tractable for large distribution networks.

Keywords:

transactive control; distributionally robust optimization; Wasserstein distance; bilevel optimization; predictive sensitivity; smart grids

1. Introduction

The ongoing transformation of electric power systems, driven by the large-scale integration of distributed energy resources (DERs) and flexible demand, is fundamentally reshaping traditional tariff structures and operational paradigms [1,2,3,4]. Conventional pricing schemes, such as static tariffs, are increasingly inadequate to capture the temporal and spatial variability of modern power systems. As a result, dynamic pricing mechanisms, such as time-of-use tariffs and real-time pricing, have emerged as key enablers for improving system efficiency, reducing peak demand, and incentivizing consumer participation [5,6,7]. In this context, transactive energy systems (TESs) have emerged as a promising decentralized coordination paradigm, enabling multiple agents, including prosumers, aggregators, and system operators, to interact through market-based mechanisms [8,9,10]. Early works on transactive control and distributed coordination have focused on price-based demand response and decentralized dispatch, demonstrating scalability and flexibility in managing distributed resources [11,12]. More recent approaches incorporate advanced market designs and peer-to-peer trading mechanisms, allowing agents to exchange energy based on localized marginal costs and preferences [10,13,14,15].

A common modeling framework for TES is bilevel optimization, which captures the hierarchical interaction between distributed agents and system operators. In such formulations, an upper-level coordinator (e.g., a system operator or aggregator) anticipates the optimal response of distributed agents such as prosumers and flexible loads, while the lower-level models network-constrained economic dispatch and price formation [16,17]. Several works have successfully applied bilevel programming to energy markets and transactive systems, highlighting its ability to represent strategic interactions and enforce system-level constraints [18,19,20,21,22]. More recent contributions extend this paradigm to multi-agent and integrated energy systems, where one-leader multi-follower Stackelberg games are formulated as nonlinear or mixed-integer bilevel programs to capture discrete decisions and network-coupled constraints [23,24]. Despite these advances, most existing transactive control and bilevel optimization approaches rely on the assumption that price signals are perfectly implemented and accurately received by agents. In practice, however, this assumption is often violated due to communication delays, packet losses, measurement errors, and local disturbances in cyber-physical energy systems [25,26,27]. These factors introduce uncertainty at the implementation layer, leading to discrepancies between announced and executed prices, which may significantly affect agent decisions and overall system performance.

To address uncertainty in power systems, robust and stochastic optimization techniques have been widely studied [28]. For stochastic bilevel problems and Stackelberg models, several formulations have been proposed relying on scenario-based optimization or chance constraints to provide probabilistic guarantees on feasibility and performance [29]. In parallel, robust and risk-averse approaches have been introduced, leveraging techniques such as conditional value-at-risk (CVaR) and robust optimization to hedge against worst-case realizations and enhance system resilience [30]. Despite these advances, most existing methods rely on centralized reformulations, which can be computationally demanding and limit scalability in large-scale systems [31]. Consequently, there is a growing need for distributed and dynamically implementable bilevel optimization methods that explicitly account for uncertainty while preserving the hierarchical structure of transactive energy systems. In particular, distributionally robust optimization (DRO) has gained increasing attention due to its ability to provide performance guarantees under ambiguity in probability distributions [32,33]. Wasserstein-based DRO (WDRO) formulations have been successfully applied to data-driven optimization problems, offering tractable reformulations and strong theoretical guarantees [34]. Applications in power systems include look-ahead economic dispatch, chance-constrained energy management, voltage-regulation incentive design, and real-time scheduling of renewable-dominated energy systems [35,36,37,38,39]. In parallel, hierarchical decision-making architectures based on bilevel optimization and Stackelberg games that account for uncertainty have become increasingly important for transactive energy systems and distributed energy resource coordination. Recent studies have explored robust and learning-based mechanisms for coordinating distributed resources under uncertainty, demonstrating the effectiveness of combining optimization, market design, and robustness considerations [37]. However, existing DRO-based approaches typically model uncertainty in exogenous parameters, while assuming that control signals such as prices or local market costs are implemented without distortion.

Despite these advances, most existing WDRO formulations in power systems model uncertainty as an exogenous phenomenon affecting renewable generation, load forecasts, market conditions, or operational constraints. By contrast, the framework proposed in this paper considers uncertainty in the implementation of the coordination signal itself. Specifically, ambiguity affects the realized transactive price communicated to distributed agents rather than only external disturbances. Consequently, the resulting ambiguity set becomes directly coupled with the economic coordination mechanism, leading to a decision-dependent distributionally robust Stackelberg formulation. In Table 1, a comparison with representative recent literature is presented. To the best of the authors’ knowledge, explicit treatment of price-signal uncertainty within a WDRO-based transactive energy framework has received limited attention in the existing literature and constitutes the primary novelty of the proposed approach.

In this paper, we explicitly model the uncertainty in the implementation of price signals. Unlike prior works that focus on uncertainty in system inputs [26], we consider uncertainty in the economic coordination mechanism itself. This perspective is particularly relevant in decentralized and cyber-physical energy systems, where communication and execution layers play a critical role in system behavior. The proposed framework formulates the problem as a bilevel Stackelberg game [19], where distributed agents optimize their decisions under distributionally robust price signals, and the system operator ensures network feasibility through constrained dispatch. A tractable dual reformulation based on Wasserstein ambiguity sets is derived, enabling efficient computation of the resulting problem. Furthermore, we introduce a continuous-time dynamic coupling between the economic and physical layers using predictive sensitivity analysis, building on recent advances in distributed saddle-flow dynamics [40].

The main contributions of this paper are summarized as follows: First, we introduce a novel distributionally robust formulation of transactive control that explicitly captures implementation-layer uncertainty in price signals. We model uncertainty in the implemented price signals using a Wasserstein ambiguity set, leading to a decision-dependent distributionally robust Stackelberg formulation. Second, to avoid the computational burden of classical reformulations, we derive a single-timescale primal-dual dynamic algorithm that incorporates predictive sensitivity analysis, enabling real-time tracking of the lower-level optimal response without nested optimization. This yields a scalable and distributed solution method that preserves the hierarchical structure of the problem. Third, we establish stability and robustness properties of the proposed dynamics. Finally, we illustrate, through numerical simulations, how the Wasserstein radius induces a fundamental trade-off between robustness to uncertainty and coordination efficiency.

The remainder of the paper is organized as follows: Section 2 presents the problem formulation. Section 3 develops the distributionally robust reformulation. Section 4 presents simulation experiments, and Section 5 concludes the paper.

Preliminaries: Distributionally Robust Optimization with Wasserstein Ambiguity Sets

DRO provides a framework for decision-making under uncertainty when the underlying probability distribution is not known exactly but is assumed to lie within an ambiguity set. Instead of optimizing with respect to a single nominal distribution, DRO considers the worst-case expected value over a family of plausible distributions, thus offering robustness against model misspecification. We focus on Wasserstein DRO, where the ambiguity set is defined using the Wasserstein distance. Let

ξ \in R

denote a random variable with unknown distribution

P

. Given an empirical distribution

\hat{P}

constructed from available samples, the ambiguity set is defined as

P_{ϵ} (\hat{P}) : = \{P \in M (Ξ) : W (P, \hat{P}) \leq ϵ\}

(1)

where

M (Ξ)

denotes the set of probability measures supported on

Ξ \subseteq R

,

W (\cdot, \cdot)

is the Wasserstein distance, and

ϵ > 0

is a radius parameter controlling the level of conservativeness.

Given a decision variable

x \in X

and a loss function

ℓ (x, ξ)

, the WDRO problem is formulated as

min_{x \in X} sup_{P \in P_{ϵ}} E_{P} [ℓ (x, ξ)] .

(2)

This formulation seeks decisions that perform well under the worst-case distribution within the Wasserstein ball. A key advantage of WDRO is that, under mild conditions on ℓ, the inner supremum admits a tractable dual reformulation, which transforms the infinite-dimensional optimization over probability measures into a finite-dimensional convex problem. In the context of this work, uncertainty enters through perturbations in the price signal, and the loss function will be linear in

ξ

. This structure enables an explicit characterization of the robust counterpart, which will be derived in the next section.

2. Problem Statement

We consider a transactive energy system composed of the set of agents

N = 1, \dots, N

, interconnected through a distribution network operated by a system operator. Here,

p \in R^{N}

represents the vector of power injections, where

p_{j} \geq 0

for generators and

p_{k} \leq 0

for consumers;

λ \in Λ \subseteq R^{N}

is the vector of nominal prices; and

θ \in R^{M}

denotes the network state variables (e.g., voltage angles). The feasible set

Λ

is assumed to be convex and closed. The lower-level feasible set is defined by

Z : = {(p, θ) ∣ h (p, θ) = 0, g (p, θ) \leq 0}

(3)

which is assumed to be non-empty. Each agent represents a prosumer capable of consuming or producing energy, and coordination is achieved through price signals

λ

determined at the system level. In practical implementations, however, the price signal

λ

is not perfectly received by the agents. Instead, each agent observes a perturbed version of the price, modeled as

\tilde{λ} = λ + ξ

(4)

where

ξ \in R^{N}

represents uncertainty arising from communication imperfections, delays, or local disturbances. Unlike standard approaches that assume a known probability distribution for

ξ

, we consider an ambiguity set (1),

P

, of possible distributions. We define individual objective functions to model production costs based on the implemented price signal as

J_{j} (p_{j}, {\tilde{λ}}_{j}) = α_{j} p_{j}^{2} + β_{j} p_{j} - {\tilde{λ}}_{j} p_{j}

(5)

where

α_{j} p_{j}^{2}

is the quadratic production cost,

β_{j} p_{j}

is the linear operational cost, and

{\tilde{λ}}_{j} p_{j}

represents the revenue from selling energy at price

{\tilde{λ}}_{j}

. Consumption utilities are represented as follows:

J_{k} (p_{k}, {\tilde{λ}}_{k}) = - α_{k} p_{k}^{2} - β_{k} p_{k} + {\tilde{λ}}_{k} p_{k}

(6)

where

α_{k} p_{k}^{2}

models the convex utility of consumption,

β_{k} p_{k}

is a linear incentive for consuming more energy, and

λ_{k}^{*} p_{k}

represents the payment for consuming energy at price

{\tilde{λ}}_{k}

. This function corresponds to the negative of the consumer’s utility, written in minimization form. Equations (5) and (6) describe the individual cost functions, while (7) consolidates these terms into a total system cost function as follows

J (\tilde{λ}, p) = \sum_{j \in G} J_{j} (p_{j}, {\tilde{λ}}_{j}) + \sum_{k \in C} J_{k} (p_{k}, {\tilde{λ}}_{k})

(7)

where

G

and

C

denote the sets of generators and consumers, respectively. The uncertain price vector

\tilde{λ}

affects each agent differently. The total cost function

J (\tilde{λ}, p)

aggregates the production costs of generators and the (negative) utilities of consumers, all expressed in minimization form.

We formulate a bilevel optimization problem that captures the interaction between the system coordinator and the distributed agents under price uncertainty. In this framework, we aim to minimize the worst-case expected economic performance in response to possible deviations in the price signal, considering a statistically bounded ambiguity set such as the Wasserstein ambiguity set, which has been widely adopted due to its strong theoretical guarantees and computational tractability [34]. The robust objective considers the worst-case expected performance within the Wasserstein ambiguity set (1). The bilevel uncertain WDRO problem is defined as follows:

\begin{matrix} min_{λ \in Λ} & sup_{P \in P_{ϵ} (\hat{P})} E_{ξ \sim P} [J (λ + ξ, p^{*} (λ + ξ))] \\ s . t . & L λ = 0, \\ p^{*} (\tilde{λ}) : = \arg min_{p, θ} J (\tilde{λ}, p) (p, θ) \in Z \\ s . t . h (p, θ) = 0, g (p, θ) \leq 0 \end{matrix}

(8)

where

\tilde{λ} \sim P

is the random variable representing the actual implemented prices, drawn from a distribution

P

;

P \in P_{ϵ} (\hat{P})

is the ambiguity set defined as a Wasserstein ball of radius

ϵ

centered at an empirical nominal distribution

\hat{P}

;

J (\tilde{λ}, p^{*} (\tilde{λ}))

is the total system cost function evaluated at realized prices

\tilde{λ}

and optimal power decisions

p^{*} (\tilde{λ})

; and

L λ = 0

is the consensus constraint, where

L \in R^{N \times N}

is the Laplacian matrix of a connected communication graph. This condition enforces agreement among agents, implying

λ = c \cdot 1

at optimality. The power

p^{*} (\tilde{λ})

is the solution of the lower-level problem, i.e., the optimal response of agents to the implemented price

\tilde{λ}

;

h (p, θ) = 0

is the set of equality constraints representing physical system relations (e.g., power balance or network flow equations); and

g (p, θ) \leq 0

is the set of inequality constraints representing operational limits (e.g., generation/consumption bounds, voltage or flow limits).

Problem (8) models the implemented price as a random variable drawn from an uncertain distribution. We also describe the agents’ responses as optimal solutions conditioned on the realized price, and we enforce coherence among distributed signals through a consensus constraint and incorporate physical and operational constraints at the lower level. The proposed formulation differs from classical DRO settings in that uncertainty affects the control signal

λ

itself rather than exogenous parameters. As a result, the decision variable enters both the optimization argument and the ambiguity set through

λ + ξ

, leading to a decision-dependent distributional structure. Problem (8) can be interpreted as a Stackelberg game, where the upper-level coordinator (leader) selects

λ

, anticipating the optimal response

p^{*} (\tilde{λ})

of the agents (followers) under uncertain price realization.

Assumption 1.

The function

J (\tilde{λ}, p)

is strongly convex in p, and the feasible set

Z

(3) satisfies Slater’s condition. Then, for every

\tilde{λ}

, the lower-level problem admits a unique optimal solution

p^{*} (\tilde{λ})

, and the solution mapping is continuously differentiable.

Assumption 1 is consistent with standard economic dispatch and transactive energy formulations. Generator production costs are commonly represented by quadratic heat-rate models, while consumer utility functions are often approximated by concave quadratic functions. Consequently, the aggregate economic objective possesses a positive-definite Hessian with respect to power injections, yielding strong convexity and guaranteeing uniqueness of the lower-level optimal response.

The proposed formulation preserves the hierarchical structure of classical transactive control, but extends it by incorporating distributional robustness directly into the leader’s objective, while maintaining a parametric dependence of the follower’s response on the uncertain signal.

3. Robust Bilevel Formulation

In this section, we develop the proposed methodological framework to address uncertainty in the price signal within transactive control systems involving multiple agents. Inspired by previous work on timescale unification and predictive sensitivity, we propose a dynamic and distributed architecture that allows us to solve the power allocation problem jointly and determine the equilibrium price. We construct the approach using a bilevel formulation, preserving the functional hierarchy between consumption-production decisions and pricing mechanisms but eliminating temporal staggering by coupling both levels into a single continuous dynamic system. Furthermore, we strengthen the system’s response against deviations in the price signal by incorporating a distributionally robust optimization perspective based on ambiguity sets defined through the Wasserstein distance.

3.1. Robust Reformulation of the Bilevel Problem

Following the WDRO approach, we reformulate the bilevel problem to explicitly incorporate the ambiguity associated with the price signal. In this model, we consider that the effectively implemented prices may differ from the nominal value defined by the coordinator, remaining within a statistically bounded neighborhood defined by a Wasserstein ball. By applying duality results from convex optimization theory, we express the original problem as a minimization problem with structured constraints, protecting the system against the worst-case expected economic performance.

For notational simplicity, define the value function

Φ (\tilde{λ}) : = J (\tilde{λ}, p^{*} (\tilde{λ})) .

(9)

Then, the upper-level problem can be written as

min_{λ \in Λ} sup_{P \in P_{ϵ} (\hat{P})} E_{ξ \sim P} [Φ (λ + ξ)] s . t . L λ = 0 .

(10)

We consider the dual structure proposed in [28] to derive a tractable representation of the robust problem, which we present in (11) and use as the foundation for developing the distributed dynamics described later. The dual formulation is then obtained as

\begin{matrix} min_{λ \in Λ, α \geq 0} α ϵ + \frac{1}{N} \sum_{j = 1}^{N} sup_{ξ \in R^{N}} \{Φ (λ + ξ) - α ∥ ξ - {\hat{ξ}}_{j} ∥\} \\ s . t . L λ = 0 \end{matrix}

(11)

where

α \geq 0

is the dual variable associated with the Wasserstein ball constraint,

{\hat{ξ}}_{j}

is the j-th empirical sample from the nominal distribution

\hat{P}

of implemented prices, and

λ_{i}

is the candidate realization of the adversarial price in the inner minimization. To guarantee the existence and uniqueness of the solution of Problem (11), we assume the following Lipschitz condition.

Assumption 2.

Assume that the function

Φ (λ)

is Lipschitz continuous with constant

L_{ϕ}

. Then, the inner supremum admits a finite value for

α \geq L_{ϕ}

.

In practical transactive energy systems, prices are constrained within admissible operational ranges, and power injections remain bounded by generation and demand limits. Under these conditions, the optimal-response mapping remains bounded and locally Lipschitz continuous. Consequently, the value function

Φ (λ)

inherits Lipschitz continuity, which is a standard property of parametric convex optimization problems satisfying Slater’s condition.

Once the robust reformulation of the problem has been established, we proceed to describe how it can be solved through a distributed dynamic architecture based on predictive sensitivity in the next section.

3.2. Saddle-Point Reformulation of the Robust Problem

The dual reformulation in (11) expresses the distributionally robust objective as a finite-dimensional minimization problem involving the nominal price

λ

and the Wasserstein dual variable

α

. This reformulation enables a saddle-point interpretation by introducing dual variables associated with the consensus constraint. Specifically, the robust coordination problem can be written as

min_{λ \in Λ, α \geq 0} max_{γ} L_{1} (λ, α, γ)

(12)

where

γ

is the Lagrange multiplier associated with the consensus constraint

L λ = 0

, and the augmented Lagrangian

L_{1}

is defined as

L_{1} (λ, α, γ) = α ϵ + \frac{1}{N} \sum_{j = 1}^{N} sup_{ξ} \{Φ (λ + ξ) - α ∥ ξ - {\hat{ξ}}_{j} ∥\} + γ^{⊤} L λ .

(13)

Under convexity of the robust objective in

(λ, α)

and linearity of the constraint

L λ = 0

, the problem admits a saddle-point structure. This allows the use of primal–dual gradient dynamics to compute the solution. This saddle-point formulation provides the foundation for the dynamic system introduced in the next section. In particular, the evolution of

λ

and

α

corresponds to gradient descent on

L_{1}

, while

γ

evolves according to gradient ascent, enforcing the consensus constraint. The Wasserstein dual variable

α

plays the role of a robustness regularization parameter, penalizing sensitivity of the objective to deviations in the implemented price signal. Larger values of

α

correspond to more conservative coordination strategies.

3.3. Augmented Lagrangians and Predictive Sensitivity

While the saddle-point formulation above characterizes the upper-level dynamics, the lower-level problem remains implicitly defined through the optimal response

p^{*} (\tilde{λ})

. To avoid nested optimization, we introduce a predictive sensitivity framework that captures how the lower-level optimal solution varies with respect to

λ

, enabling a unified dynamic system. This construction preserves the bilevel structure while embedding both levels into a single continuous-time saddle-flow system. Hence, we define the Lagrangian function for the lower-level problem as follows:

L_{2} (λ, p, θ, μ) = J (λ, p) + μ^{⊤} h (p, θ) + τ^{⊤} log (- g (p, θ))

(14)

where

μ

is the Lagrange multiplier associated with the equality constraints

h (p, θ) = 0

,

τ

is the barrier parameter used in the logarithmic penalization of inequality constraints, and

log (- g (p, θ))

is the element-wise logarithmic barrier applied to enforce

g (p, θ) < 0

. The barrier formulation enforces

g (p, θ) < 0

and ensures smoothness of the lower-level problem.

L_{2} (λ, p, θ, μ)

represents the lower-level Lagrangian, which includes system costs, equality constraints, and a logarithmic penalization for inequality constraints, which facilitates a continuous and differentiable formulation. These expressions allow us to dynamically couple the two decision levels through a predictive sensitivity matrix, which forms the foundation of the proposed distributed approach. First, we assume a property for the Hessian of

L_{2}

.

Assumption 3.

The Hessian

\nabla_{z z}^{2} L_{2} (λ, z)

is positive definite for all feasible z. This guarantees that the predictive sensitivity matrix is well-defined.

The positive-definiteness requirement stated in Assumption 3 should be interpreted locally around the operating point. In practical distribution systems, dispatch decisions are performed near a nominal equilibrium satisfying operational limits. The combination of quadratic economic costs and logarithmic barrier terms introduces positive curvature in the optimization landscape, resulting in a nonsingular KKT matrix and a well-defined predictive-sensitivity operator.

To dynamically couple the upper-level decisions with the optimal response of the lower level, we introduce the predictive sensitivity matrix

S_{λ}^{p} (λ, z)

. This matrix captures how the optimal solution of the power allocation problem varies in response to changes in the nominal price signal by using second-order derivatives of the lower-level Lagrangian function. We construct this matrix by inverting the Hessian with respect to the lower-level variables and combining it with cross-derivatives with respect to the price, thus enabling efficient anticipation of the impact of the coordinator’s decisions on the system’s response. The predictive sensitivity matrix captures the system’s anticipated response to changes in prices by analyzing second-order derivatives of the lower-level Lagrangian function.

The predictive sensitivity term plays a central role in the proposed formulation, since it enables both decision levels to be embedded into a unified continuous-time dynamic system. Instead of solving the lower-level problem independently at each upper-level iteration, the sensitivity matrix anticipates the effect of price changes on the agent responses and incorporates that effect directly into the coupled dynamics. This construction preserves the hierarchical structure of the original bilevel problem while avoiding explicit nested optimization loops during dynamic implementation. First, we define a vector z to stack all the variables involved in the lower-level problem as follows:

z : = {[p, θ, μ]}^{⊤} .

(15)

The predictive-sensitivity matrix

S_{λ}^{p} (λ, z)

is defined as

S_{λ}^{p} (λ, z) : = - {(\nabla_{z z}^{2} L_{2} (λ, z))}^{- 1} \nabla_{z λ}^{2} L_{2} (λ, z)

(16)

where

L_{2} (λ, z)

is the lower-level Lagrangian function,

\nabla_{z z}^{2} L_{2}

is the Hessian matrix of the lower-level Lagrangian with respect to z, and

\nabla_{z λ}^{2} L_{2}

is the cross-derivative matrix of the lower-level Lagrangian with respect to z and

λ

.

S_{λ}^{p} (λ, z)

captures how the optimal lower-level solution z changes in response to variations in

λ

. Optimality of the lower-level problem can be characterized through the Karush–Kuhn–Tucker (KKT) conditions. At the optimum

z^{*} (λ)

, the following conditions hold

\nabla_{p} J (λ, p^{*}) + \nabla_{p} h {(p^{*}, θ^{*})}^{⊤} μ^{*} = 0,

(17)

h (p^{*}, θ^{*}) = 0, g (p^{*}, θ^{*}) \leq 0 .

(18)

Under the barrier formulation adopted in (14), the inequality constraints are incorporated into the objective, and the optimality condition reduces to

\nabla_{z} L_{2} (λ, z^{*}) = 0 .

(19)

Recall that the upper-level objective can be expressed as

Φ (λ) : = J (λ, p^{*} (λ))

(20)

where

p^{*} (λ)

is implicitly defined by the KKT conditions of the lower-level problem. The gradient of

Φ

with respect to

λ

can be characterized using the implicit function theorem. Differentiating the KKT condition

\nabla_{z} L_{2} (λ, z^{*} (λ)) = 0

with respect to

λ

yields

\nabla_{z z}^{2} L_{2} (λ, z^{*}) \frac{d z^{*}}{d λ} + \nabla_{z λ}^{2} L_{2} (λ, z^{*}) = 0 .

(21)

Solving for the sensitivity, we obtain

\frac{d z^{*}}{d λ} = - {(\nabla_{z z}^{2} L_{2} (λ, z^{*}))}^{- 1} \nabla_{z λ}^{2} L_{2} (λ, z^{*}) .

(22)

This expression coincides with the predictive sensitivity matrix defined in (16), i.e.,

S_{λ}^{p} (λ, z^{*}) = \frac{d z^{*}}{d λ} .

(23)

Using the chain rule, the gradient of

Φ

is given by

\nabla_{λ} Φ (λ) = \nabla_{λ} J (λ, p^{*}) + \nabla_{p} J (λ, p^{*}) \frac{d p^{*}}{d λ} .

(24)

Substituting the sensitivity expression, we obtain

\nabla_{λ} Φ (λ) = \nabla_{λ} J (λ, p^{*}) - \nabla_{p} J (λ, p^{*}) {(\nabla_{z z}^{2} L_{2})}^{- 1} \nabla_{z λ}^{2} L_{2} .

(25)

At optimality, the stationarity condition implies

\nabla_{p} J (λ, p^{*}) = - \nabla_{p} h {(p^{*}, θ^{*})}^{⊤} μ^{*} .

(26)

Thus, the gradient

\nabla_{λ} Φ

incorporates both direct price effects and indirect effects through the equilibrium constraints. This characterization shows that the gradient of the upper-level objective accounts for the implicit response of the lower-level problem, which is precisely captured by the predictive sensitivity matrix used in the proposed dynamics. The characterization of

\nabla_{λ} Φ (λ)

derived above directly determines the structure of the upper-level dynamics. In particular, the gradient appearing in the saddle-point system is not computed explicitly through nested optimization, but instead approximated using the predictive sensitivity matrix

S_{λ}^{p} (λ, z)

, which encodes the implicit dependence of the lower-level solution on

λ

. Specifically, the gradient

\nabla_{λ} Φ (λ)

can be written as

\nabla_{λ} Φ (λ) = \nabla_{λ} J (λ, p) + \nabla_{z} L_{2} {(λ, z)}^{⊤} S_{λ}^{p} (λ, z) .

(27)

At optimality,

\nabla_{z} L_{2} (λ, z) = 0

, and the sensitivity term captures first-order variations away from equilibrium. Substituting this expression into the gradient flow of the upper-level Lagrangian, the price dynamics can be interpreted as

\dot{λ} = - \nabla_{λ} L_{1} (λ, α, γ) = - \nabla_{λ} Φ (λ) - L^{⊤} γ

(28)

where the term

\nabla_{λ} Φ (λ)

implicitly includes the sensitivity correction through

S_{λ}^{p}

. Similarly, the lower-level dynamics

\dot{z} = - \nabla_{z} L_{2} (λ, z) - S_{λ}^{p} (λ, z) \nabla_{λ} L_{1} (λ, α, γ)

(29)

can be interpreted as a correction of the standard primal–dual dynamics, where the second term anticipates the effect of price updates on the optimal solution

z^{*} (λ)

. Therefore, both levels of the bilevel problem are coupled through the same sensitivity operator

S_{λ}^{p}

, ensuring that the gradient used in the upper-level dynamics is consistent with the implicit dependence of the lower-level solution. This guarantees that the overall system follows a coherent saddle-flow trajectory associated with the robust bilevel formulation.

3.4. Continuous Distributed Predictive-Sensitivity Wasserstein Dynamics

In this section, we introduce the proposed predictive-sensitivity Wasserstein joint dynamic system that describes the temporal evolution of all variables in the system. This continuous dynamic system allows us to solve the robust bilevel problem without explicit temporal hierarchies. The differential equations govern the update of the nominal price vector

λ

, the dual variable

α

associated with the Wasserstein ambiguity set, and the multiplier

γ

enforcing price consensus. Additionally, the evolution of the lower-level state vector

z = {[p, θ, μ]}^{⊤}

incorporates a coupling term provided by the predictive sensitivity matrix

S_{λ}^{p} (λ, z)

, enabling efficient anticipation of the system’s response to variations in the price signal. This formulation supports a distributed real-time implementation of robust control in transactive systems. The coupled dynamics between both levels are schematically depicted in Figure 1, where the predictive sensitivity matrix serves as a forward-looking link between price evolution and power decisions. This interaction enables the system to respond in real time to disturbances, maintaining consistency between economic coordination and physical feasibility.

First, we assume a connected communication graph between generators and prosumers.

Assumption 4.

The communication graph associated with the Laplacian matrix L is connected.

To obtain a tractable gradient expression, we explicitly introduce the maximizer of the inner supremum in the WDRO reformulation. Let

ξ_{j}^{*} (λ, α) \in \arg max_{ξ} \{Φ (λ + ξ) - α ∥ ξ - {\hat{ξ}}_{j} ∥\} .

(30)

Note that

ξ_{j}^{*}

depends on

(λ, α)

, and is recomputed implicitly along the system trajectory.

Assumption 5.

The upper-level robust objective induced by the Wasserstein reformulation is convex and continuously differentiable in

(λ, α)

, and

\nabla L_{1} (λ, α, γ)

is globally Lipschitz continuous.

The uniqueness of the Wasserstein adversarial perturbation is guaranteed whenever the dual robustness parameter

α

exceeds the Lipschitz constant of the value function

Φ

. Under this condition, the penalty term associated with the Wasserstein distance dominates local variations of

Φ

, yielding a unique maximizer of the inner robust optimization problem. Operationally, this corresponds to selecting a robustness level sufficiently large relative to the expected variability of the implemented price signal.

Assumption 5 ensures regularity of the Wasserstein robust reformulation, while Assumption 4 guarantees consensus feasibility through the Laplacian constraint. Then, by the envelope theorem, the gradient of the upper-level objective is given by

\nabla_{λ} L_{1} = \frac{1}{N} \sum_{j = 1}^{N} \nabla_{λ^{*}} J (λ + ξ_{j}^{*}, p^{*} (λ + ξ_{j}^{*})) + L^{⊤} γ

(31)

which implies that

\nabla_{λ} Φ (λ + ξ_{j}^{*}) = \nabla_{λ^{*}} J (λ^{*}, p^{*} (λ^{*})) |_{λ^{*} = λ + ξ_{j}^{*}}

. We obtain the distributed Wasserstein saddle-point dynamics by replacing the gradient (31) with the maximizer (30) in the lower (28) and upper dynamics (29), respectively, as follows

\begin{matrix} \dot{λ} & = - \nabla_{λ} L_{1} = - \frac{1}{N} \sum_{j = 1}^{N} \nabla_{λ} J (λ + ξ_{j}^{*}, p^{*} (λ + ξ_{j}^{*})) - L^{⊤} γ \end{matrix}

(32)

\begin{matrix} \dot{α} & = \nabla_{α} L_{1} = Π_{R_{+}} (- ϵ + \frac{1}{N} \sum_{j = 1}^{N} ∥ ξ_{j}^{*} - {\hat{ξ}}_{j} ∥) \end{matrix}

(33)

\begin{matrix} \dot{γ} & = \nabla_{γ} L_{1} = L λ \end{matrix}

(34)

\begin{matrix} \dot{z} & = - \nabla_{z} L_{2} (λ, z) - S_{λ}^{p} (λ, z) \nabla_{λ} L_{1} (λ, α, γ) \end{matrix}

(35)

where

Π_{R_{+}}

denotes projection onto the nonnegative orthant. In the dynamic implementation,

λ

represents the current estimate of the implemented price signal, consistent with the single-timescale formulation.

The predictive sensitivity term modifies the standard primal-dual dynamics by incorporating the first-order variation of the lower-level optimizer with respect to the upper-level decision. This eliminates the need for explicit timescale separation and enables simultaneous convergence of both levels. The lower-level dynamics correspond to a gradient flow associated with the KKT conditions of the lower-level problem. The additional predictive sensitivity term accounts for variations in

λ

, allowing the system to track the moving optimal solution without requiring explicit re-optimization. The proposed dynamics generalize classical saddle-flow methods by incorporating distributionally robust corrections and predictive sensitivity coupling. This results in a single-timescale algorithm that solves a decision-dependent WDRO bilevel problem without nested optimization.

From a systems perspective, the resulting dynamics can be interpreted as a closed-loop coordination mechanism in which economic signals, physical constraints, and robustness corrections interact continuously. The sensitivity-based coupling ensures that price updates are informed by their expected effect on agent responses, leading to improved stability and convergence properties under implementation uncertainty. The observed convergence behavior across all simulations suggests that the proposed dynamics exhibit stable trajectories under the considered operating conditions. To provide an intuitive representation of the coupled dynamics, Figure 1 illustrates the interaction between upper-level price updates and lower-level agent responses. The proposed formulation extends conventional dynamic transactive control by incorporating a distributionally robust correction that accounts for ambiguity in price implementation. In the next section, we analyze the stability and convergence of the proposed coupled dynamics.

3.5. Convergence Analysis of the Predictive-Sensitivity Dynamics

In this section, we establish convergence guarantees for the proposed distributionally robust bilevel dynamics. We begin by stating the assumptions required for convergence. We first establish well-posedness of the proposed dynamic system.

Lemma 1.

Under Assumptions 1–5, the dynamical system (32)–(35) admits a unique maximal solution for every initial condition.

Proof.

From Assumptions 1–5, the mappings

\nabla_{λ} L_{1}

,

\nabla_{α} L_{1}

,

\nabla_{z} L_{2}

are locally Lipschitz. Moreover, Assumption 3 guarantees invertibility of

\nabla_{z z}^{2} L_{2}

, implying that the predictive sensitivity matrix

S_{λ}^{p}

is locally Lipschitz. Hence, the overall vector field defining (32)–(35) is locally Lipschitz. Existence and uniqueness of solutions therefore follow from the Picard–Lindelöf theorem [41]. □

Next, we establish the relation between equilibrium points of the dynamics and optimality conditions of the original bilevel problem.

Lemma 2.

A point

(λ^{*}, α^{*}, γ^{*}, z^{*})

is an equilibrium point of (32)–(35) if and only if it satisfies the KKT conditions of the Wasserstein distributionally robust bilevel optimization problem.

Proof.

At equilibrium,

\dot{λ} = \dot{α} = \dot{γ} = \dot{z} = 0 .

(36)

From (32),

\nabla_{λ} L_{1} (λ^{*}, α^{*}, γ^{*}) = 0

. From (35), the projection dynamics imply the complementarity condition

α^{*} \geq 0

,

\nabla_{α} L_{1} (λ^{*}, α^{*}, γ^{*}) \geq 0

(37)

together with

α^{*} \nabla_{α} L_{1} (λ^{*}, α^{*}, γ^{*}) = 0 .

(38)

From (34),

L λ^{*} = 0

, which corresponds to the consensus feasibility constraint. Finally, from (32),

\nabla_{z} L_{2} (λ^{*}, z^{*}) = - S_{λ}^{p} \nabla_{λ} L_{1} .

(39)

Since equilibrium also implies

\nabla_{λ} L_{1} = 0

, we obtain

\nabla_{z} L_{2} (λ^{*}, z^{*}) = 0

(40)

which corresponds to the stationarity conditions of the lower-level problem. Collecting stationarity, primal feasibility, dual feasibility, and complementarity conditions yields the KKT system of the robust bilevel optimization problem. □

Although the convergence analysis relies on standard regularity assumptions from bilevel and distributionally robust optimization, these conditions are consistent with normal operating regimes of distribution systems. The assumptions are primarily local in nature and are expected to hold around economically optimal dispatch points where generation costs, demand utilities, and network constraints exhibit smooth behavior. Consequently, the theoretical analysis should be interpreted as characterizing the local stability properties of the proposed transactive control architecture rather than asserting global convergence under arbitrary operating conditions.

We now state the main convergence result.

Theorem 1.

Suppose Assumptions 1–4 hold, and the maximizer

ξ_{j}^{*} (λ, α)

exists and is unique for all j. Then, every trajectory of the coupled dynamic system (32)–(35) converges asymptotically to an equilibrium point

(λ^{*}, α^{*}, γ^{*}, z^{*})

that satisfies the KKT conditions of the Wasserstein distributionally robust bilevel problem.

Proof.

We begin with the construction of the Lyapunov function. First, we define the vector x and let

x^{*} = (λ^{*}, α^{*}, γ^{*}, z^{*})

(41)

be an equilibrium point of (32)–(35), whose existence follows from convexity and feasibility assumptions. Consider the Lyapunov candidate

V (x) = \frac{1}{2} {∥ λ - λ^{*} ∥}^{2} + \frac{1}{2} {(α - α^{*})}^{2} + \frac{1}{2} {∥ γ - γ^{*} ∥}^{2} + \frac{1}{2} {∥ z - z^{*} ∥}^{2} .

(42)

Clearly, since we take every term to be quadratic, we have that

V (x) \geq 0

, with equality if and only if

x = x^{*}

. Moreover,

V (x)

is radially unbounded. We proceed with the time derivative of the Lyapunov function. Differentiating (42) along system trajectories yields

\begin{matrix} \dot{V} & = {(λ - λ^{*})}^{⊤} \dot{λ} + (α - α^{*}) \dot{α} + {(γ - γ^{*})}^{⊤} \dot{γ} + {(z - z^{*})}^{⊤} \dot{z} . \end{matrix}

(43)

Substituting (32)–(35), we obtain

\begin{matrix} \dot{V} = & - {(λ - λ^{*})}^{⊤} (\frac{1}{N} \sum_{j = 1}^{N} \nabla_{λ} J + L^{⊤} γ) + (α - α^{*}) Π_{R_{+}} (- \nabla_{α} L_{1}) + {(γ - γ^{*})}^{⊤} L λ \\ - {(z - z^{*})}^{⊤} \nabla_{z} L_{2} - {(z - z^{*})}^{⊤} S_{z}^{λ} \nabla_{λ} L_{1} . \end{matrix}

(44)

We analyze each term separately. Let us start with the upper-level descent and consensus terms. Since

L_{1}

is convex in

(λ, α)

(Assumption 5), the gradient mapping is monotone:

\frac{1}{N} {(λ - λ^{*})}^{⊤} (\sum_{j = 1}^{N} \nabla_{λ} J (λ) - \nabla_{λ} J (λ^{*})) \geq 0 .

(45)

At equilibrium

\nabla_{λ} J (λ^{*}) = 0

, thus

- {(λ - λ^{*})}^{⊤} \nabla_{λ} J \leq 0 .

(46)

Similarly, projection dynamics satisfy the standard dissipativity inequality:

(α - α^{*}) Π_{R_{+}} (- \nabla_{α} L_{1}) \leq - (α - α^{*}) \nabla_{α} L_{1} \leq 0 .

(47)

We now examine the consensus coupling terms. Since the equilibrium satisfies

L λ^{*} = 0

, it follows that

L λ = L (λ - λ^{*}) .

Therefore,

\begin{matrix} - {(λ - λ^{*})}^{⊤} L^{⊤} γ + {(γ - γ^{*})}^{⊤} L λ & = & - γ^{⊤} L (λ - λ^{*}) + {(γ - γ^{*})}^{⊤} L (λ - λ^{*}) \\ = & - γ^{* ⊤} L (λ - λ^{*}) . \end{matrix}

(48)

At the saddle-point equilibrium, the KKT conditions imply that

γ^{* ⊤} L (λ - λ^{*}) = 0 .

(49)

Hence, the consensus coupling does not contribute positively to the Lyapunov derivative and the primal-dual terms cancel exactly.

Now, let us analyze the lower-level contraction and predictive sensitivity coupling term. Because

L_{2}

is strongly convex in z (Assumption 3), its gradient satisfies strong monotonicity

{(z - z^{*})}^{⊤} (\nabla_{z} L_{2} (z) - \nabla_{z} L_{2} (z^{*})) \geq m {∥ z - z^{*} ∥}^{2}

(50)

for some

m > 0

. Since equilibrium implies

\nabla_{z} L_{2} (z^{*}) = 0

, we obtain

- {(z - z^{*})}^{⊤} \nabla_{z} L_{2} \leq - m {∥ z - z^{*} ∥}^{2} .

(51)

Now consider the predictive sensitivity term

- {(z - z^{*})}^{⊤} S_{z}^{λ} \nabla_{λ} L_{1}

. By Assumption 3, the Hessian

\nabla_{z z}^{2} L_{2}

is uniformly nonsingular, implying bounded sensitivity:

∥ S_{z}^{λ} ∥ \leq M_{s}

(52)

for some finite constant

M_{s} > 0

. Therefore,

\begin{matrix} |{(z - z^{*})}^{⊤} S_{z}^{λ} \nabla_{λ} L_{1}| \leq ∥ z - z^{*} ∥ ∥ S_{z}^{λ} ∥ ∥ \nabla_{λ} L_{1} ∥ \leq M_{s} ∥ z - z^{*} ∥ ∥ \nabla_{λ} L_{1} ∥ . \end{matrix}

(53)

Applying Young’s inequality, for any

δ > 0

,

M_{s} ∥ z - z^{*} ∥ ∥ \nabla_{λ} L_{1} ∥ \leq \frac{δ}{2} {∥ z - z^{*} ∥}^{2} + \frac{M_{s}^{2}}{2 δ} {∥ \nabla_{λ} L_{1} ∥}^{2} .

(54)

By choosing

δ < m

, the contraction term dominates the coupling term. Consequently,

- {(z - z^{*})}^{⊤} \nabla_{z} L_{2} - {(z - z^{*})}^{⊤} S_{z}^{λ} \nabla_{λ} L_{1}

is negative semidefinite. We will verify that the Lyapunov function decreases. Collecting all bounds, there exist positive constants

c_{1}, c_{2}, c_{3} > 0

such that

\dot{V} \leq - c_{1} {∥ z - z^{*} ∥}^{2} - c_{2} {∥ \nabla_{λ} L_{1} ∥}^{2} - c_{3} {| \nabla_{α} L_{1} |}^{2} .

(55)

Hence,

\dot{V} \leq 0

. Therefore,

V (x)

is non-increasing along trajectories. Since V is radially unbounded, all trajectories remain bounded. Finally, we characterize the invariant set. Consider the set

S = {x : \dot{V} = 0}

. From the previous inequality,

\dot{V} = 0

implies

z = z^{*}, \nabla_{λ} L_{1} = 0, \nabla_{α} L_{1} = 0 .

(56)

Furthermore, from (34), stationarity implies

L λ = 0

. Thus, every point in

S

satisfies:

\nabla_{λ} L_{1} = 0

,

\nabla_{α} L_{1} = 0

,

L λ = 0

,

\nabla_{z} L_{2} = 0

. By Lemma 2, these conditions are equivalent to the KKT conditions of the Wasserstein robust bilevel problem. Applying LaSalle’s invariance principle, we define the invariant set. Since

V (x)

is non-increasing and trajectories are bounded, LaSalle’s invariance principle guarantees convergence to the largest invariant subset of

S

. Therefore,

x (t) \to x^{*} as t \to \infty .

(57)

Hence, every trajectory converges asymptotically to an equilibrium satisfying the KKT conditions of the robust bilevel problem. □

In the following section, we illustrate the proposed dynamics and their convergence in several numerical experiments.

4. Simulation Experiments

The steady-state price volatility is defined as the standard deviation of the average price trajectory over the final simulation window. Inter-agent price dispersion is measured as the standard deviation of steady-state agent-level mean prices. The physical feasibility residual is evaluated through the steady-state norm of the network residual.

4.1. Network Configuration

The test system consists of a medium-scale distribution network with

N = 12

nodes, including multiple generators and consumer agents interconnected through a simplified DC power flow model. The adopted test system is illustrated in Figure 2, where generator nodes, consumer nodes, and transit buses are explicitly represented within the network topology.

The network topology is represented through the incidence matrix

B_{θ}

, which defines the relationship between nodal phase angles

θ

and net power injections p. The network includes three generator nodes and four consumer nodes, while the remaining nodes act as interconnection buses, facilitating power flow. The selected network size allows capturing multi-agent interactions while maintaining computational tractability for parametric robustness analysis. Quadratic cost and utility functions are assigned to generators and consumers, respectively. Generator cost functions follow a convex quadratic form, while consumer utility functions are modeled as concave quadratics, ensuring strong convexity of the overall optimization problem and well-conditioned sensitivity matrices.

The electrical parameters (line susceptances) are embedded in the matrix

B_{θ}

, which is constructed to reflect a connected network topology. All simulations are initialized under non-equilibrium conditions to evaluate convergence and transient behavior under dynamic coordination. Initial prices and power injections are selected to ensure that the system operates away from equilibrium at

t = 0

, allowing the dynamic response of the coupled economic–physical system to be properly observed. The resulting configuration provides a representative testbed to evaluate the interaction between economic coordination, distributional robustness, and physical feasibility in networked transactive energy systems.

The following performance indicators are evaluated to characterize robustness, coordination, and physical feasibility:

Steady-state price volatility: quantified as the standard deviation of the average price trajectory over the final simulation window.
Inter-agent price dispersion: measured as the standard deviation of steady-state agent-level mean prices.
Consensus error: evaluated as the norm of the steady-state disagreement among agents.
Overshoot: maximum deviation of the average price trajectory relative to its steady-state value.
Settling time: time required for the average price trajectory to enter and remain within a 2% band of its steady-state value.
Physical feasibility residual: steady-state norm of the network residual.

Steady-state price volatility is computed as

σ_{λ}^{ss} = std (λ_{avg} (t)), t \geq t_{ss},

(58)

where

t_{ss}

denotes the beginning of the steady-state observation window. Inter-agent price dispersion is defined as

σ_{agents}^{ss} = std ({\bar{λ}}_{i}^{ss}),

(59)

where

{\bar{λ}}_{i}^{ss}

denotes the steady-state mean price of agent i. Inter-agent dispersion quantifies heterogeneity in steady-state price levels across agents, whereas consensus error measures instantaneous disagreement relative to a common coordinated value. These metrics capture distinct coordination properties. The physical feasibility residual is defined as the steady-state norm of the network residual, given by

∥B_{θ} θ - p∥ .

(60)

Finally, to evaluate the robustness of the proposed framework, a systematic parametric sweep is performed over the ambiguity radius

ϵ

and the perturbation level

σ

. The ambiguity radius

ϵ

defines the size of the Wasserstein ambiguity set, controlling the degree of distributional robustness, while

σ

determines the intensity of stochastic perturbations affecting the implemented price signal. For each pair

(ϵ, σ)

, multiple simulations are executed using different random seeds to account for variability in the stochastic scenarios. The resulting trajectories are used to compute steady-state performance metrics, which are then aggregated to obtain mean values and standard deviations. This parametric analysis enables the identification of robust operating regions, characterizes sensitivity to tuning parameters, and quantifies the trade-offs between robustness, coordination, and physical feasibility.

4.2. Robust vs. Non-Robust Comparison

To complement the aggregated performance metrics, Figure 3 presents a time-domain comparison between non-robust and robust operating conditions. The figure illustrates the evolution of the average price and representative agent power trajectories, highlighting the impact of the WDRO-based formulation on transient behavior, including oscillations, settling time, and steady-state variability. As observed in Figure 3, the non-robust case exhibits higher oscillatory behavior and slower convergence compared to the robust formulation.

To isolate the effect of distributional robustness, a comparison is conducted between nominal, non-robust, and robust operating conditions. The nominal case

(ϵ = 0, σ = 0)

represents ideal price implementation without uncertainty. The non-robust case

(ϵ = 0, σ > 0)

introduces stochastic perturbations without distributional regularization. The robust case

(ϵ > 0, σ > 0)

incorporates ambiguity-aware optimization. Table 1 summarizes key performance metrics across these scenarios. The results show that the proposed WDRO-based formulation significantly reduces steady-state price volatility under uncertainty while preserving physical feasibility. This improvement is achieved at the cost of increased inter-agent dispersion, reflecting a trade-off between robustness and coordination consistency.

The transient spikes observed immediately after load variations correspond to the dynamic adaptation of the coupled economic–physical system to a sudden change in operating conditions. When the load changes, the equilibrium point of the bilevel optimization problem shifts instantaneously, whereas the state variables evolve continuously according to the proposed saddle-flow dynamics. Consequently, temporary mismatches arise between the current operating point and the new optimal equilibrium, producing short-lived overshoots in both the price trajectory and the power allocations. These transients are expected in primal-dual coordination dynamics and do not indicate instability. In all tested scenarios, the trajectories remain bounded and converge to the new equilibrium after the disturbance.

4.3. Sensitivity to Ambiguity Radius

Figure 4 evaluates the sensitivity of the closed-loop dynamics to the ambiguity radius

ϵ

by measuring the steady-state standard deviation of the averaged nodal price signal. Results are shown for multiple perturbation levels

σ

. For

σ > 0

, the steady-state volatility remains low across the entire range of

ϵ

, indicating that the proposed WDRO-based regularization preserves stability under stochastic price perturbations. The dependence on

ϵ

is smooth and bounded, suggesting the existence of a robust operating region in which price dispersion remains controlled despite ambiguity in the implemented signal. In contrast, under nominal conditions (

σ = 0

), a non-monotonic behavior emerges. In particular,

ϵ = 0.05

induces persistent oscillatory behavior in the averaged price trajectory. This is confirmed by windowed variance analysis, where the standard deviation computed over progressively later steady-state windows increases from

4.98 \times 10^{- 3}

(for

t \geq 600 s

) to

9.07 \times 10^{- 3}

(for

t \geq 750 s

), indicating weakly damped or persistent oscillations. This tuning-sensitive regime disappears for

ϵ \geq 0.1

, where the nominal trajectory returns to near-zero steady-state volatility.

The pronounced peak observed at

ϵ = 0.05

does not correspond to loss of stability. Instead, it reflects a tuning-sensitive operating regime in which the ambiguity set is sufficiently small that the regularizing effect of the Wasserstein robustification becomes weak. Under these conditions, the closed-loop dynamics exhibit lightly damped oscillatory behavior, leading to an increase in the measured steady-state volatility. As

ϵ

increases, the robustification term provides additional damping and the oscillations disappear, resulting in a significant reduction in volatility. Therefore, the observed peak identifies a transition region between nominal and robust operating regimes rather than a stability boundary.

Figure 5 complements the previous sensitivity analysis by fixing

ϵ

and sweeping the perturbation level

σ

. For

ϵ \geq 0.1

, the steady-state volatility remains low and weakly dependent on

σ

across the evaluated range, which suggests that moderate-to-large ambiguity radii mitigate the impact of stochastic price perturbations on the closed-loop price dynamics. A notable exception occurs at

ϵ = 0.05

under nominal conditions (

σ = 0

), where the system exhibits a tuning-sensitive regime with persistent oscillations, leading to a markedly larger steady-state volatility compared to the neighboring parameter settings. Once

σ

increases from zero, the volatility for

ϵ = 0.05

collapses to the same order of magnitude as the other curves, indicating that the observed anomaly is specific to the nominal case and to a narrow range of small ambiguity radii. Overall, Figure 4 and Figure 5 support the existence of a robust operating region under price ambiguity, where the steady-state volatility remains bounded across uncertainty realizations, while also revealing a non-monotonic tuning behavior in the nominal regime for very small

ϵ

.

Figure 6 illustrates the structural trade-off between steady-state price volatility and inter-agent price dispersion. While low volatility is desirable from a robustness standpoint, it does not necessarily imply low inter-agent dispersion. This distinction highlights that robustness at the aggregate level does not guarantee homogeneous steady-state prices across agents. For small ambiguity radii, certain perturbation levels yield near-zero volatility but relatively high dispersion among agents, indicating that local robustness does not guarantee collective coherence. Conversely, increasing

ϵ

tends to regularize the dynamics, reducing sensitivity to perturbations while moderating inter-agent dispersion. The resulting distribution of operating points reveals an intermediate region where both volatility and dispersion remain bounded. This behavior suggests the existence of a practically desirable tuning region in which the proposed WDRO-based formulation balances robustness against economic coordination.

Table 2 summarizes three representative operating points to isolate the effect of price ambiguity and the benefit of the proposed WDRO-based regularization. The nominal reference (

ϵ = 0, σ = 0

) exhibits negligible steady-state volatility (std ≈ 8.46

\times 10^{- 13}

) and small physical residual (max ≈ 1.83

\times 10^{- 4}

). Introducing ambiguity without robustification (non-robust case,

ϵ = 0, σ = 0.1

) yields a steady-state volatility of approximately 8.28

\times 10^{- 5}

, with a physical residual max of 2.07

\times 10^{- 4}

. This operating point serves as a fair baseline under uncertainty, since it shares the same perturbation level as the robust case. Under the same ambiguity level, the robust setting (

ϵ = 0.4, σ = 0.1

) preserves low steady-state volatility (std ≈ 1.80

\times 10^{- 5}

) while maintaining physical feasibility (max residual ≈ 1.14

\times 10^{- 4}

). As expected, robustness entails a trade-off in inter-agent dispersion, increasing from 0.042 (non-robust) to 0.113 (robust), which is consistent with the Pareto-like trends observed in the parametric figures.

4.4. Computational and Scalability Considerations

The proposed WDRO-based dynamic bilevel formulation remains computationally tractable for medium-scale distribution networks. All simulations were implemented in Python (ver. 3.13) using implicit time-integration (BDF scheme), with convergence tolerances selected to balance numerical stability and computational cost. Across the parametric sweep of

(ϵ, σ)

values and multiple random seeds, individual simulation runs required on the order of a few seconds on a standard desktop machine. The WDRO-based reformulation introduces additional algebraic terms associated with the ambiguity radius, but does not significantly increase the dimensionality of the dynamic system. In particular, the predictive sensitivity coupling preserves a compact state-space representation without requiring nested optimization loops at each time step.

The batch evaluation over 200 operating points was completed within a practical time horizon, demonstrating that parametric robustness studies can be conducted without prohibitive computational overhead. Moreover, since the robustification acts as a regularization mechanism within the dynamic layer, the resulting complexity scales primarily with the number of agents and network constraints, rather than with the number of ambiguity samples. From a scalability perspective, the formulation is compatible with sparse network representations and can be extended to larger distribution topologies provided that the lower-level Hessian structure remains well-conditioned. These characteristics suggest that the proposed architecture is suitable for integration into real-time or near-real-time transactive control platforms.

To evaluate the scalability of the proposed WDRO-based predictive-sensitivity dynamics, additional experiments were conducted on distribution networks of increasing size. Starting from the original 12-node benchmark, larger synthetic systems containing 24, 48, and 96 nodes were generated while preserving the generator-to-load ratio and network connectivity characteristics. The same cost functions, uncertainty model, and tuning parameters were used across all cases. For each network size, the proposed dynamics were simulated under identical uncertainty conditions, and the convergence time, computational time, consensus error, and physical feasibility residual were recorded. The results indicate that the proposed method remains stable and convergent as the network size increases. Although computational time grows with the number of agents, the increase is approximately polynomial and remains compatible with real-time implementation requirements for medium-scale distribution systems. The observed scalability is explained by the distributed structure of the algorithm. Consensus updates require only local neighbor-to-neighbor communication, while the predictive sensitivity matrix exploits the sparse structure of the lower-level optimization problem. The proposed approach avoids the computational burden associated with repeatedly solving nested bilevel optimization problems and maintains good numerical performance as network size increases. In Table 3, the scalability results are presented. Results are averaged over multiple simulation runs. The data demonstrate that the proposed method preserves convergence and feasibility properties while maintaining computational tractability for medium-scale distribution networks.

4.5. Discussion and Practical Implications

The results reveal that distributional robustness plays a structural role in transactive control architectures, since the WDRO-based formulation modifies the closed-loop coordination dynamics themselves rather than acting only as a worst-case protection mechanism. A central finding is that moderate values of

ϵ

act as a dynamic regularization mechanism. In this regime, the proposed formulation reduces steady-state price volatility and preserves physical feasibility under perturbations, while avoiding the tuning-sensitive behavior observed near the nominal regime. This result suggests that robustness in transactive control should be interpreted not only in statistical terms, but also in dynamical terms. The results also highlight a trade-off between aggregate robustness and coordination uniformity. Lower volatility does not necessarily imply low inter-agent dispersion, which means that improved aggregate performance may coexist with heterogeneous steady-state local prices. From an operational perspective, this suggests that strict price consensus may be less important than maintaining bounded volatility, physical consistency, and acceptable coordination performance. From a practical standpoint, the proposed framework is relevant for transactive platforms operating under communication imperfections, asynchronous updates, or decentralized validation mechanisms. In such settings, explicitly modeling implementation ambiguity can improve resilience without sacrificing tractability. This is particularly important for medium-scale networked systems in which coordination quality depends not only on optimization objectives, but also on the dynamic response of the closed-loop interaction. The present study assumes quadratic cost functions and strong convexity to preserve tractability and well-conditioned sensitivities. Future work should examine non-quadratic agent models, mixed-integer formulations, and experimental real-time implementations.

5. Conclusions

This paper presents a distributionally robust transactive control architecture that extends dynamic bilevel coordination to account for ambiguity in the implementation of decentralized price signals. By embedding Wasserstein-based ambiguity sets within a predictive-sensitivity-driven dynamic framework, the proposed approach generalizes deterministic transactive control into a robustness-aware coordination mechanism. The results demonstrate that explicitly modeling price ambiguity not only improves robustness against perturbations but also alters the dynamic behavior of the system, introducing a stabilizing regularization effect on coupled economic–physical dynamics. This work shows that price ambiguity is not merely a source of uncertainty but a structural element that reshapes the dynamic coordination mechanism itself. The parametric analysis reveals the existence of a robust operating region in which steady-state price volatility remains bounded while physical feasibility residuals remain negligible under stochastic perturbations. In addition, the results identify a tuning-sensitive regime associated with very small ambiguity radii, highlighting the structural role of distributional regularization in stabilizing interconnected economic–physical dynamics. The observed trade-off between global price volatility reduction and inter-agent dispersion provides insight into coordination design in decentralized system-of-systems architectures. Importantly, the proposed method remains computationally tractable for medium-scale distribution networks and does not require consensus enforcement, as coordination emerges through sensitivity-based coupling. Future work will focus on scalability to larger network topologies, incorporation of non-quadratic agent models, and experimental validation within real-time transactive platforms.

Author Contributions

Conceptualization, E.M.-N.; Methodology, P.M., E.M.-N. and J.M.R.; Software, P.M.; Validation, J.M.R.; Investigation, E.M.-N.; Data curation, P.M.; Writing—original draft, E.M.-N.; Writing—review & editing, E.M.-N. and J.M.R.; Supervision, E.M.-N. and J.M.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

During the preparation of this manuscript, the authors used the generative artificial intelligence tools Gemini (3.1 Pro) and ChatGPT (GPT-5.5) for the purposes of assisting with translation and improving the clarity of the text. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

References

IEEE Std 1547-2018 (Revision of IEEE Std 1547-2003); IEEE Standard for Interconnection and Interoperability of Distributed Energy Resources with Associated Electric Power Systems Interfaces. IEEE: New York, NY, USA, 2018; pp. 1–138. [CrossRef]
Moreno-Jaramillo, A.F.; Lopez-Lorente, J.; Laverty, D.M.; del Rincon, J.M.; Morrow, D.J.; Foley, A.M. Effective identification of distributed energy resources using smart meter data. IET Smart Grid 2022, 5, 289–298. [Google Scholar] [CrossRef]
Adham, M.; Keene, S.; Bass, R.B. Distributed energy resources: A systematic literature review. Energy Rep. 2025, 13, 1980–1999. [Google Scholar] [CrossRef]
Dippenaar, J.A.; Bekker, B.; Foster, K.; Davies, M. A synthesis of distributed energy resource impacts and regulatory responses with a focus on soft law. Util. Policy 2026, 98, 102104. [Google Scholar] [CrossRef]
Braithwait, S. Behavior Modification: Creating Price-Responsive Demand through Retail Pricing and Demand Response Programs. IEEE Power Energy Mag. 2010, 8, 36–45. [Google Scholar] [CrossRef]
Wesseh, P.K., Jr.; Lin, B. A time-of-use pricing model of the electricity market considering system flexibility. Energy Rep. 2022, 8, 1457–1470. [Google Scholar] [CrossRef]
Panda, S.; Mohanty, S.; Rout, P.K.; Sahu, B.K.; Parida, S.M.; Kotb, H.; Flah, A.; Tostado-Véliz, M.; Abdul Samad, B.; Shouran, M. An insight into the integration of distributed energy resources and energy storage systems with smart distribution networks using demand-side management. Appl. Sci. 2022, 12, 8914. [Google Scholar] [CrossRef]
Chen, S.; Liu, C.C. From demand response to transactive energy: State of the art. J. Mod. Power Syst. Clean Energy 2017, 5, 10–19. [Google Scholar] [CrossRef]
Huang, Q.; Amin, W.; Umer, K.; Gooi, H.B.; Eddy, F.Y.S.; Afzal, M.; Shahzadi, M.; Khan, A.A.; Ahmad, S.A. A review of transactive energy systems: Concept and implementation. Energy Rep. 2021, 7, 7804–7824. [Google Scholar] [CrossRef]
Tooki, O.O.; Popoola, O.M. A comprehensive review on recent advances in transactive energy system: Concepts, models, metrics, technologies, challenges, policies and future. Renew. Energy Focus 2024, 50, 100596. [Google Scholar] [CrossRef]
Samadi, P.; Mohsenian-Rad, H.; Schober, R.; Wong, V.W. Advanced demand side management for the future smart grid using mechanism design. IEEE Trans. Smart Grid 2012, 3, 1170–1180. [Google Scholar] [CrossRef]
Pipattanasomporn, M.; Kuzlu, M.; Rahman, S. An algorithm for intelligent home energy management and demand response analysis. IEEE Trans. Smart Grid 2012, 3, 2166–2173. [Google Scholar] [CrossRef]
Tushar, W.; Saha, T.K.; Yuen, C.; Smith, D.; Poor, H.V. Peer-to-Peer Trading in Electricity Networks: An Overview. IEEE Trans. Smart Grid 2020, 11, 3185–3200. [Google Scholar] [CrossRef]
Baron-Prada, E.; Mojica-Nava, E. A population games transactive control for distributed energy resources. Int. J. Electr. Power Energy Syst. 2021, 130, 106874. [Google Scholar] [CrossRef]
Aloqaily, M.; Boukerche, A.; Bouachir, O.; Khalid, F.; Jangsher, S. An Energy Trade Framework Using Smart Contracts: Overview and Challenges. IEEE Netw. 2020, 34, 119–125. [Google Scholar] [CrossRef]
Conejo, A.J.; Castillo, E.; Minguez, R.; Garcia-Bertrand, R. Decomposition Techniques in Mathematical Programming; Springer: Berlin/Heidelberg, Germany, 2006. [Google Scholar] [CrossRef]
Zhang, Y.; Khanduri, P.; Tsaknakis, I.; Yao, Y.; Hong, M.; Liu, S. An Introduction to Bilevel Optimization: Foundations and applications in signal processing and machine learning. IEEE Signal Process. Mag. 2024, 41, 38–59. [Google Scholar] [CrossRef]
Zarabie, A.K.; Das, S.; Faqiry, M.N. Fairness-Regularized DLMP-Based Bilevel Transactive Energy Mechanism in Distribution Systems. IEEE Trans. Smart Grid 2019, 10, 6029–6040. [Google Scholar] [CrossRef]
Feng, C.; Li, Z.; Shahidehpour, M.; Wen, F.; Li, Q. Stackelberg game based transactive pricing for optimal demand response in power distribution systems. Int. J. Electr. Power Energy Syst. 2020, 118, 105764. [Google Scholar] [CrossRef]
Huang, H.; Miao, P.; Wei, Q.; Geng, Y. Bilevel optimization in smart grid operations: A review of real-time electricity pricing strategies and future directions. Electr. Power Syst. Res. 2026, 251, 112253. [Google Scholar] [CrossRef]
Dong, L.; Lu, Z.; Li, Y.; Zhang, K.; Zhang, T.; Qiao, J.; Pu, T. DSO-prosumers bi-level distributionally robust optimization method with NDOE guidance in the P2P energy trading. Appl. Energy 2026, 413, 127663. [Google Scholar] [CrossRef]
Arevalo-Castiblanco, M.F.; Tellez-Castro, D.; Mojica-Nava, E. Distributed Sensitivity-Conditioned Bilevel Optimization for Coordinated Control of Networked Microgrids. Sci 2026, 8, 43. [Google Scholar] [CrossRef]
Li, Y.; Wang, B.; Yang, Z.; Li, J.; Chen, C. Hierarchical stochastic scheduling of multi-community integrated energy systems in uncertain environments via Stackelberg game. Appl. Energy 2022, 308, 118392. [Google Scholar] [CrossRef]
Lei, Z.; Liu, M.; Shen, Z.; Lu, W.; Lu, Z. A data-driven Stackelberg game approach applied to analysis of strategic bidding for distributed energy resource aggregator in electricity markets. Renew. Energy 2023, 215, 118959. [Google Scholar] [CrossRef]
Amin, M. Toward self-healing energy infrastructure systems. IEEE Comput. Appl. Power 2001, 14, 20–28. [Google Scholar] [CrossRef]
Beck, Y.; Ljubić, I.; Schmidt, M. A survey on bilevel optimization under uncertainty. Eur. J. Oper. Res. 2023, 311, 401–426. [Google Scholar] [CrossRef]
Habib, S.; Murtaza, A.F.; Akram, R. Four-Level, Two-Stage Hierarchical Robust Model for Optimizing Home-Community-Distribution Energy Systems. Energy 2026, 355, 141168. [Google Scholar] [CrossRef]
Sun, X.A.; Conejo, A.J. Robust Optimization in Electric Energy Systems. In International Series in Operations Research & Management Science; Springer: Cham, Switzerland, 2021; Volume 313. [Google Scholar] [CrossRef]
Wang, L.; Yang, R.; Qu, Y.; Xu, C. Stackelberg game-based optimal scheduling of integrated energy systems considering differences in heat demand across multi-functional areas. Energy Rep. 2022, 8, 11885–11898. [Google Scholar] [CrossRef]
Gao, H.; Wang, R.; He, S.; Wang, Z.; Liu, J. Bi-level stackelberg game-based distribution system expansion planning model considering long-term renewable energy contracts. Prot. Control Mod. Power Syst. 2023, 8, 1–15. [Google Scholar] [CrossRef]
Sinha, A.; Malo, P.; Deb, K. A review on bilevel optimization: From classical to evolutionary approaches. IEEE Trans. Evol. Comput. 2018, 22, 276–295. [Google Scholar] [CrossRef]
Gao, R.; Kleywegt, A.J. Distributionally robust stochastic optimization with Wasserstein distance. Oper. Res. 2016, 64, 505–522. [Google Scholar]
Zhao, C.; Guan, Y. Data-driven risk-averse stochastic optimization with Wasserstein metric. Oper. Res. Lett. 2018, 46, 262–267. [Google Scholar] [CrossRef]
Mohajerin Esfahani, P.; Kuhn, D. Data-driven distributionally robust optimization using the Wasserstein metric: Performance guarantees and tractable reformulations. Math. Program. 2018, 171, 115–166. [Google Scholar] [CrossRef]
Poolla, B.K.; Hota, A.R.; Bolognani, S.; Callaway, D.S.; Cherukuri, A. Wasserstein Distributionally Robust Look-Ahead Economic Dispatch. IEEE Trans. Power Syst. 2021, 36, 2010–2022. [Google Scholar]
Haggi, H.; Fenton, J.M. Distributionally robust chance-constrained energy management of multi-building residential apartment complexes using Wasserstein metric. In Proceedings of the 2025 IEEE Texas Power and Energy Conference (TPEC); IEEE: New York, NY, USA, 2025; pp. 1–6. [Google Scholar]
Liang, Z.; Li, Q.; Comden, J.; Bernstein, A.; Dvorkin, Y. Learning with adaptive conservativeness for distributionally robust optimization: Incentive design for voltage regulation. In Proceedings of the 2024 IEEE 63rd Conference on Decision and Control (CDC); IEEE: New York, NY, USA, 2024; pp. 866–873. [Google Scholar]
Liu, Y. A Wasserstein Distance-Based Distributionally Robust Optimization Strategy for Power-System Dispatch. Symmetry 2025, 17, 1602. [Google Scholar]
Wang, Z.; Li, Y.; Wu, F.; Shi, L.; Ding, R.; He, S. Real-Time Dispatch of Cascade Hydropower–Photovoltaic Complementary System Based on Wasserstein Distributionally Robust Optimization. Renew. Energy 2026, 257, 124801. [Google Scholar]
Mojica-Nava, E.; Ruiz, F.; Baron-Prada, E. Fully Distributed Transactive Control Considering Pricing Dynamics and Network Constraints. IEEE Trans. Smart Grid 2023, 14, 1566–1576. [Google Scholar] [CrossRef]
Coddington, E.A.; Levinson, N. Theory of Ordinary Differential Equations; McGraw-Hill: New York, NY, USA, 1955. [Google Scholar]

Figure 1. Proposed continuous-time dynamic coupling for distributionally robust transactive control. The upper-level coordinator updates the nominal price vector

λ

, while lower-level agents adjust their power allocations p. Both layers are coupled through the predictive sensitivity matrix

S_{λ}^{p}

. In the proposed framework, this dynamic structure is extended with a WDRO-based regularization term to account for ambiguity in price implementation.

Figure 1. Proposed continuous-time dynamic coupling for distributionally robust transactive control. The upper-level coordinator updates the nominal price vector

λ

, while lower-level agents adjust their power allocations p. Both layers are coupled through the predictive sensitivity matrix

S_{λ}^{p}

. In the proposed framework, this dynamic structure is extended with a WDRO-based regularization term to account for ambiguity in price implementation.

Figure 2. Twelve-node distribution network used in the simulation study. The test system consists of

N = 12

nodes interconnected through a connected DC power-flow topology represented by the matrix

B_{θ}

. Generator nodes (circles), consumer nodes (rectangles), and transit buses are shown. The network is used to evaluate the interaction between economic coordination, physical feasibility, and distributional robustness under uncertainty in the implemented price signals.

Figure 2. Twelve-node distribution network used in the simulation study. The test system consists of

N = 12

nodes interconnected through a connected DC power-flow topology represented by the matrix

B_{θ}

. Generator nodes (circles), consumer nodes (rectangles), and transit buses are shown. The network is used to evaluate the interaction between economic coordination, physical feasibility, and distributional robustness under uncertainty in the implemented price signals.

Figure 3. Time-domain evolution of the average transactive price and representative agent power trajectories for a simulation horizon of 800 s under (a) non-robust operation (

ϵ = 0

,

σ = 0.1

) and (b) robust operation (

ϵ = 0.4

,

σ = 0.1

). Results correspond to the 12-node test system initialized away from equilibrium. The dashed curves represent representative generator and consumer power injections, while the solid black curve denotes the average market price. The WDRO-based formulation reduces oscillatory behavior, decreases steady-state variability, and improves transient regularity compared with the non-robust case.

Figure 3. Time-domain evolution of the average transactive price and representative agent power trajectories for a simulation horizon of 800 s under (a) non-robust operation (

ϵ = 0

,

σ = 0.1

) and (b) robust operation (

ϵ = 0.4

,

σ = 0.1

). Results correspond to the 12-node test system initialized away from equilibrium. The dashed curves represent representative generator and consumer power injections, while the solid black curve denotes the average market price. The WDRO-based formulation reduces oscillatory behavior, decreases steady-state variability, and improves transient regularity compared with the non-robust case.

Figure 4. Steady-state price volatility

σ_{λ}^{s s}

as a function of the Wasserstein ambiguity radius

ϵ

for different perturbation levels

σ

. Each point corresponds to the mean value obtained over multiple Monte Carlo realizations with different random seeds, while the shaded regions indicate one standard deviation. Volatility is computed as the standard deviation of the average price trajectory over the final steady-state observation window. The results identify a robust operating region in which increasing

ϵ

suppresses sensitivity to implementation-layer uncertainty.

Figure 4. Steady-state price volatility

σ_{λ}^{s s}

as a function of the Wasserstein ambiguity radius

ϵ

for different perturbation levels

σ

. Each point corresponds to the mean value obtained over multiple Monte Carlo realizations with different random seeds, while the shaded regions indicate one standard deviation. Volatility is computed as the standard deviation of the average price trajectory over the final steady-state observation window. The results identify a robust operating region in which increasing

ϵ

suppresses sensitivity to implementation-layer uncertainty.

Figure 5. Steady-state price volatility

σ_{λ}^{s s}

as a function of the perturbation level

σ

for different ambiguity radii

ϵ

. Results are averaged over multiple Monte Carlo realizations, and error bars represent one standard deviation. For sufficiently large ambiguity radii (

ϵ \geq 0.1

), the proposed WDRO framework maintains low price volatility despite increasing uncertainty levels, whereas small ambiguity radii exhibit higher sensitivity to parameter variations.

Figure 5. Steady-state price volatility

σ_{λ}^{s s}

as a function of the perturbation level

σ

for different ambiguity radii

ϵ

. Results are averaged over multiple Monte Carlo realizations, and error bars represent one standard deviation. For sufficiently large ambiguity radii (

ϵ \geq 0.1

), the proposed WDRO framework maintains low price volatility despite increasing uncertainty levels, whereas small ambiguity radii exhibit higher sensitivity to parameter variations.

Figure 6. Trade-off between steady-state price volatility (robustness proxy) and inter-agent price dispersion across the

(ϵ, σ)

parameter grid. Each marker corresponds to an aggregated operating point over multiple random seeds. Lower volatility indicates stronger robustness to price ambiguity, while higher dispersion reflects increased heterogeneity among steady-state agent-level prices. The figure highlights that improved aggregate robustness does not necessarily imply homogeneous price outcomes across agents.

Figure 6. Trade-off between steady-state price volatility (robustness proxy) and inter-agent price dispersion across the

(ϵ, σ)

parameter grid. Each marker corresponds to an aggregated operating point over multiple random seeds. Lower volatility indicates stronger robustness to price ambiguity, while higher dispersion reflects increased heterogeneity among steady-state agent-level prices. The figure highlights that improved aggregate robustness does not necessarily imply homogeneous price outcomes across agents.

Table 1. Comparison with representative recent literature. Dist.: distributed, RT: real-time, TES: transactive energy system.

Reference	Year	TES	Bilevel	WDRO	Dist.	RT	Price Signal
[35]	2021	–	–	√	–	–	–
[24]	2023	–	√	–	–	–	–
[30]	2023	–	√	√	–	–	–
[37]	2024	–	√	√	–	Partial	–
[36]	2025	–	–	√	–	–	–
[38]	2025	–	–	√	–	–	–
[27]	2026	√	–	√	–	–	–
[39]	2026	–	–	√	–	√	–
Proposed Method	2026	√	√	√	√	√	√

Table 2. Comparison of steady-state and transient performance metrics under nominal, non-robust, and WDRO-based robust operating conditions for the 12-node test system. The nominal case corresponds to perfect implementation of the price signal (

ϵ = 0

,

σ = 0

), the non-robust case considers stochastic perturbations without distributional regularization (

ϵ = 0

,

σ = 0.1

), and the robust case employs the proposed WDRO framework (

ϵ = 0.4

,

σ = 0.1

).

Table 2. Comparison of steady-state and transient performance metrics under nominal, non-robust, and WDRO-based robust operating conditions for the 12-node test system. The nominal case corresponds to perfect implementation of the price signal (

ϵ = 0

,

σ = 0

), the non-robust case considers stochastic perturbations without distributional regularization (

ϵ = 0

,

σ = 0.1

), and the robust case employs the proposed WDRO framework (

ϵ = 0.4

,

σ = 0.1

).

Metric	Nominal ( $ϵ = 0, σ = 0$ )	Non-Robust ( $ϵ = 0, σ = 0.1$ )	Robust ( $ϵ = 0.4, σ = 0.1$ )
Steady-state price volatility (std)	8.46 $\times 10^{- 13}$	8.28 $\times 10^{- 5}$	1.80 $\times 10^{- 5}$
Steady-state physical residual (max)	1.83 $\times 10^{- 4}$	2.07 $\times 10^{- 4}$	1.14 $\times 10^{- 4}$
Consensus error among agents (mean)	6.85 $\times 10^{- 7}$	0.1115	0.2997
Dispersion among agents (std of ss means)	2.59 $\times 10^{- 7}$	0.0421	0.1133
Settling time (2% band)	205.1282	18.7617	450.2814
Overshoot (avg price)	1.27 $\times 10^{- 7}$	0.0061	0.0409
Runtime per run (s)	1.0535	0.4743	0.5446

Table 3. Scalability assessment of the proposed predictive-sensitivity WDRO framework. The table reports network size, computational time, steady-state consensus error, and physical feasibility residual for radial distribution networks containing between 12 and 96 nodes.

Nodes	Settling Time (s)	CPU Time (s)	Consensus Error	Residual
12	186	0.14	1.2 $\times 10^{- 4}$	1.8 $\times 10^{- 4}$
24	201	0.32	1.4 $\times 10^{- 4}$	2.0 $\times 10^{- 4}$
48	228	0.79	1.8 $\times 10^{- 4}$	2.3 $\times 10^{- 4}$
96	261	1.94	2.2 $\times 10^{- 4}$	2.7 $\times 10^{- 4}$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Morales, P.; Mojica-Nava, E.; Rey, J.M. Distributionally Robust Stackelberg Transactive Control Under Imperfect Price Signals. Appl. Sci. 2026, 16, 6679. https://doi.org/10.3390/app16136679

AMA Style

Morales P, Mojica-Nava E, Rey JM. Distributionally Robust Stackelberg Transactive Control Under Imperfect Price Signals. Applied Sciences. 2026; 16(13):6679. https://doi.org/10.3390/app16136679

Chicago/Turabian Style

Morales, Pablo, Eduardo Mojica-Nava, and Juan M. Rey. 2026. "Distributionally Robust Stackelberg Transactive Control Under Imperfect Price Signals" Applied Sciences 16, no. 13: 6679. https://doi.org/10.3390/app16136679

APA Style

Morales, P., Mojica-Nava, E., & Rey, J. M. (2026). Distributionally Robust Stackelberg Transactive Control Under Imperfect Price Signals. Applied Sciences, 16(13), 6679. https://doi.org/10.3390/app16136679

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Distributionally Robust Stackelberg Transactive Control Under Imperfect Price Signals

Abstract

1. Introduction

Preliminaries: Distributionally Robust Optimization with Wasserstein Ambiguity Sets

2. Problem Statement

3. Robust Bilevel Formulation

3.1. Robust Reformulation of the Bilevel Problem

3.2. Saddle-Point Reformulation of the Robust Problem

3.3. Augmented Lagrangians and Predictive Sensitivity

3.4. Continuous Distributed Predictive-Sensitivity Wasserstein Dynamics

3.5. Convergence Analysis of the Predictive-Sensitivity Dynamics

4. Simulation Experiments

4.1. Network Configuration

4.2. Robust vs. Non-Robust Comparison

4.3. Sensitivity to Ambiguity Radius

4.4. Computational and Scalability Considerations

4.5. Discussion and Practical Implications

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI