Statistical Privacy-Preserving Distributed Online Aggregative Games via Mirror Descent with Correlated Perturbations

Yuan, Meng; Yu, Rui

doi:10.3390/math14101731

Open AccessArticle

Statistical Privacy-Preserving Distributed Online Aggregative Games via Mirror Descent with Correlated Perturbations

by

Meng Yuan

¹ and

Rui Yu

^2,*

¹

Department of Control Science and Engineering, Tongji University, Shanghai 201804, China

²

The Provincial Key Laboratory of Multimodal Perceiving and Intelligent Systems, The Engineering Research Center of Intelligent Human Health Situation Awareness of Zhejiang Province, Jiaxing University, Jiaxing 314001, China

^*

Author to whom correspondence should be addressed.

Mathematics 2026, 14(10), 1731; https://doi.org/10.3390/math14101731

Submission received: 8 April 2026 / Revised: 9 May 2026 / Accepted: 10 May 2026 / Published: 18 May 2026

(This article belongs to the Special Issue AI in Game Theory: Theory and Applications)

Download

Browse Figures

Versions Notes

Abstract

Distributed online aggregative games are widely used to model sequential decision-making problems in dynamic networked systems. However, the repeated information exchange required by distributed algorithms may disclose players’ sensitive local data. This paper investigates a privacy-preserving distributed online aggregative game over multi-agent networks. A distributed online mirror descent algorithm with correlated perturbations is developed to protect local private information. Under standard assumptions, an expected dynamic regret bound and a statistical privacy guarantee are established for the proposed algorithm. Numerical results demonstrate the effectiveness of the proposed algorithm and reveal the tradeoff between privacy protection and algorithmic performance.

Keywords:

online aggregative games; distributed mirror descent; Bregman divergence; dynamic regret; statistical privacy

MSC:

91A10; 90C25; 68W15

1. Introduction

Sequential decision making over networks arises in resource allocation, energy coordination, communication systems, and other large-scale cyber-physical platforms. In such settings, each agent acts online: future objective functions are unknown, decisions are updated repeatedly, and only local information is available at each stage. Online optimization and online game models therefore provide a natural mathematical language for describing system evolution, while regret serves as the main performance criterion over a finite horizon [1,2,3,4]. When centralized computation is impractical, distributed algorithms are preferable because they require only local processing and neighbor-to-neighbor exchanges [5,6]. Recent application-oriented studies further highlight the broad relevance of online and dynamic decision making in networked engineering systems, including online knowledge seeking in technical R&D teams [7], online computation offloading for collaborative space/aerial-aided edge computing [8], and fine-grained air traffic flow prediction [9]. These studies show that online decision making, dynamic information processing, and networked coordination have found applications in a wide range of fields.

Among distributed interaction models, aggregative games are especially important because each player’s cost depends on its own decision and on an aggregate term generated by the entire population. This structure appears in charging coordination, congestion management, and shared-resource allocation. Distributed equilibrium computation for aggregative games has been widely studied [10,11,12,13]. The online case is more subtle: objective functions and equilibria may both vary with time, so the main issue is equilibrium tracking rather than convergence to a fixed point. Recent contributions published in Mathematics reflect this development from several complementary viewpoints. Yang et al. considered distributed online aggregative optimization in an unknown dynamic environment, Huo et al. studied sampled data average consensus tracking, He et al. proposed event-triggered Nash equilibrium seeking schemes, and Cao et al. investigated privacy-preserving distributed learning based on a Newton mechanism [14,15,16,17]. These results motivate a unified treatment of online game dynamics, consensus tracking, and privacy protection.

In distributed implementations, each player repeatedly communicates local estimates or auxiliary states. Even if a single message carries limited information, the entire communication history may reveal sensitive features of local objectives, decision trajectories, or underlying datasets. Differential privacy and noise injection mechanisms have therefore become standard ingredients in distributed learning, optimization, and consensus [18,19,20,21,22,23,24]. For online aggregative games, however, one still needs a perturbation mechanism that hides transmitted information without destroying the aggregate-tracking structure used by the distributed algorithm.

Among the existing studies on privacy-preserving online aggregative games, the work of Lin et al. [25] is relevant to our setting. In [25], a statistical privacy-preserving online distributed Nash equilibrium tracking algorithm was developed for aggregative games by using correlated perturbations and a privacy criterion based on the Kullback–Leibler divergence. The present paper follows the same statistical privacy viewpoint, but differs from [25] in the algorithmic framework and the corresponding theoretical analysis. Specifically, ref. [25] is based on a Euclidean projected gradient update, whereas this paper develops a distributed online mirror descent method with a general Bregman divergence. Moreover, many existing algorithms rely on Euclidean projection updates, which may be less suitable for structured feasible sets such as simplex or probability constraints [26,27,28]. Therefore, it is still necessary to develop a privacy-preserving distributed online aggregative-game algorithm that can simultaneously handle dynamic equilibrium tracking, non-Euclidean decision geometry, and statistical privacy protection. To further clarify the relationship between the present paper and representative related studies, Table 1 provides a structured comparison from the perspectives of problem setting, algorithmic method, privacy mechanism, and main distinction.

Motivated by the above considerations, this paper studies a distributed online aggregative game in which a subset of players may be compromised and the remaining players seek protection of their local information. We adopt a statistical privacy criterion based on the Kullback–Leibler divergence and design a distributed online mirror descent algorithm equipped with correlated perturbations. The perturbations mask the exchanged aggregate estimates while preserving a balancing identity that is crucial for dynamic average tracking. Compared with Euclidean projection-based methods, the proposed mirror-descent framework allows a general Bregman divergence and is therefore better suited to structured feasible sets.

The main contributions are summarized as follows.

We formulate a privacy-preserving distributed online aggregative game with corrupted players, where the goal is to track the time-varying Nash equilibrium while protecting the local information of the uncorrupted players. Statistical privacy is quantified by a Kullback–Leibler divergence criterion.
We propose a privacy-preserving distributed online mirror descent algorithm with correlated perturbations. The correlated perturbations mask the exchanged aggregate estimates, while their balancing property preserves the dynamic average-tracking structure required by the distributed algorithm.
We establish an expected dynamic regret bound for the proposed algorithm under a general Bregman geometry. Unlike Euclidean projection-based analyses, the proof relies on mirror descent updates and Bregman divergence arguments, and the resulting bound explicitly reflects the effects of the equilibrium path variation, the stepsizes, and the perturbation magnitudes.
We prove a statistical privacy guarantee for the proposed algorithm by bounding the Kullback–Leibler divergence between the observation distributions generated by different datasets. Numerical simulations further demonstrate the tradeoff between privacy level and performance, and the simplex-constrained experiment illustrates the advantage of using a KL-divergence mirror update.

The remainder of this paper is organized as follows. Section 2 introduces the notation, privacy model, game formulation, and standing assumptions. Section 3 presents the privacy-preserving distributed online mirror descent algorithm. Section 4 develops the regret and privacy analysis. Section 5 reports numerical examples. Section 6 discusses the proposed algorithm, and Section 7 concludes the paper.

2. Notation, Privacy Model, and Game Formulation

Consider an undirected communication graph

G = (V, E),

where

V = {1, \dots, N}

is the player set and

E \subseteq V \times V

is the edge set. If

e_{i j} \in E

, then players i and j can exchange information. For each player i, let

N_{i} = {j \in V : e_{i j} \in E}

be its neighbor set. Throughout this paper, the graph is assumed to be connected. Let

W \in R^{N \times N}

be the mixing matrix associated with

G

; thus,

{[W]}_{i j} > 0

if

e_{i j} \in E

and

{[W]}_{i j} = 0

otherwise.

For player

i \in V

at time k, the decision variable is denoted by

x_{i, k} \in X_{i} \subseteq R^{n}

, and the stacked action vector is

x_{k} = col (x_{1, k}, \dots, x_{N, k}) .

The aggregate term is written as

σ (x_{k})

. For notational convenience, we also use

x_{- i, k} = col (x_{1, k}, \dots, x_{i - 1, k}, x_{i + 1, k}, \dots, x_{N, k})

for the profile of all players except player i.

2.1. Statistical Privacy

We begin with two graph-theoretic objects that will be used in the privacy analysis.

Definition 1.

A subset

S \subset V

is called a vertex cut if removing the nodes in

S

together with their incident edges disconnects

G

.

Fix an arbitrary orientation of the edges of

G

and index them as

e_{q}

,

q = 1, \dots, | E |

.

Definition 2.

The oriented incidence matrix

D \in R^{N \times | E |}

is defined by

{[D]}_{i, q} = \{\begin{matrix} 1, & if node i is the head of e_{q}, \\ - 1, & if node i is the tail of e_{q}, \\ 0, & otherwise . \end{matrix}

The graph Laplacian is

L = D D^{⊤}

. Its spectral decomposition can be written as

L = U diag (λ_{1}, λ_{2}, \dots, λ_{N}) U^{⊤},

where U is orthogonal and

0 = λ_{1} < λ_{2} \leq \dots \leq λ_{N}

. The Moore–Penrose inverse is therefore

L^{†} = U diag (0, \frac{1}{λ_{2}}, \dots, \frac{1}{λ_{N}}) U^{⊤} .

Let

C

and

H

denote the compromised-player set and the honest-player set, respectively. Over a time horizon T, the adversary collects an observation record

O = {O_{k}}_{k = 1}^{T}

from the compromised players, where

O_{k}

is the information available at time k. The underlying dataset is denoted by

D

. Privacy is interpreted as indistinguishability: if two candidate datasets can generate the same observation record, then the adversary should find them difficult to distinguish. We measure this indistinguishability with the Kullback–Leibler divergence. For two probability measures P and Q, the Kullback–Leibler divergence is defined by

D_{KL} (P ∥ Q) = \{\begin{matrix} \int log (\frac{d P}{d Q}) d P, & P ≪ Q, \\ + \infty, & otherwise . \end{matrix}

When P and Q admit densities

h_{p}

and

h_{p^{'}}

with respect to a common dominating measure, we also write

D_{KL} (h_{p}, h_{p^{'}}) = \int h_{p} (z) log \frac{h_{p} (z)}{h_{p^{'}} (z)} d z .

Definition 3

([25]). Given two datasets

D_{1}

and

D_{2}

, an algorithm

A

is said to preserve the statistical privacy of the honest players if

O = A (D_{1}) = A (D_{2})

and the quantity

D_{KL} (h_{O ∣ D_{1}}, h_{O ∣ D_{2}})

is bounded.

Remark 1.

The private information considered in this paper is the local information of the honest players. This information may determine their local cost functions, gradients, unmasked aggregate estimates, and local decision trajectories. Public system parameters, such as the communication graph, the stepsizes, and the perturbation magnitudes, are not regarded as private. The mechanism is also not intended to protect the corrupted players’ own data or internal states. Its purpose is to prevent the communication transcript from uniquely revealing the honest players’ local information.

Remark 2.

The equality condition in Definition 3 should be understood as an observational compatibility condition rather than as equality of two output distributions. It means that the same realized observation record O available to the adversary can be generated by two different candidate datasets, possibly under different realizations of the hidden perturbations. The KL divergence then quantifies the statistical distinguishability of these two candidate explanations from the adversary’s viewpoint. For example, in an additive-noise mechanism

O = θ + ξ

, the same observed value can be compatible with two different values of θ, provided that the hidden noise takes different values. The privacy leakage is not zero in general; it is measured by the likelihood ratio, or equivalently by the KL divergence between the induced observation distributions.

Adversary model: The adversary is assumed to be honest-but-curious. The compromised players follow the prescribed algorithm, but share all their available information. At each time k, they know the algorithm and public parameters, observe their own local data and states, and observe the messages and perturbation variables on edges incident to them. However, they do not observe the honest players’ unmasked variables, nor the perturbation variables generated on edges whose two endpoints are both honest. Hence, after conditioning on the compromised players’ observations, the hidden perturbations among honest players provide the residual uncertainty used in the statistical privacy analysis.

2.2. Game Formulation

At each stage k, player i minimizes a time-varying cost

f_{i, k}

that depends on its own action and on an aggregate of all players’ actions. Because the environment is dynamic, the stage game and its Nash equilibrium may change with k. The goal is to construct a distributed online algorithm that uses only local communication, tracks the time-varying equilibrium, and protects the information of the honest players.

For player i, we measure online performance by the expected dynamic regret over the horizon T,

{Reg}_{i} (T) ≜ \sum_{k = 1}^{T} E [f_{i, k} (x_{i, k}, σ (x_{i, k}, x_{- i, k}^{*})) - f_{i, k} (x_{i, k}^{*}, σ (x_{k}^{*}))],

where

x_{k}^{*}

is the Nash equilibrium of the stage-k game and

x_{- i, k}^{*}

is the equilibrium profile of all players except player i.

2.3. Assumptions

Let

ω

be the mirror map, and let

D (ξ, ζ) ≜ ω (ξ) - ω (ζ) - 〈 \nabla ω (ζ), ξ - ζ 〉

denote the associated Bregman divergence. For each player

i \in V

and each time k, define

\begin{matrix} g_{i, k} (x_{i, k}, y_{i, k}) ≜ (\nabla_{x_{i, k}} f_{i, k} (\cdot, β) + \frac{1}{N} \nabla_{β} f_{i, k} (x_{i, k}, \cdot)) |_{β = y_{i, k}}, \\ ϕ_{i, k} (x_{k}) ≜ \nabla_{x_{i, k}} f_{i, k} (x_{i, k}, σ (x_{k})), \\ ϕ_{k} (x_{k}) ≜ col (ϕ_{1, k} (x_{k}), \dots, ϕ_{N, k} (x_{k})) . \end{matrix}

When

y_{i, k} = σ (x_{k})

, the two quantities coincide; i.e.,

ϕ_{i, k} (x_{k}) = g_{i, k} (x_{i, k}, σ (x_{k})) .

The next assumptions are standard in analyses of aggregative games, mirror descent, and distributed online optimization [10,25,29].

Assumption 1.

The mirror map ω is

σ_{ω}

-strongly convex with respect to

∥ \cdot ∥

, namely,

ω (x) \geq ω (y) + 〈 \nabla ω (y), x - y 〉 + \frac{σ_{ω}}{2} {∥ x - y ∥}^{2} .

Assumption 2.

There exists

G > 0

such that

∥ g_{i, k} (x_{i, k}, y_{i, k}) ∥ \leq G, \forall y_{i, k} \in R^{n},

for all players i and all stages k.

Assumption 3.

For every player i and every fixed

x_{i} \in X_{i}

, the mapping

z \mapsto g_{i, k} (x_{i}, z)

is

L_{i}

-Lipschitz on

R^{n}

; i.e.,

∥ g_{i, k} (x_{i}, z_{1}) - g_{i, k} (x_{i}, z_{2}) ∥ \leq L_{i} ∥ z_{1} - z_{2} ∥, \forall z_{1}, z_{2} \in R^{n} .

Assumption 4.

The pseudo-gradient mapping

ϕ_{k}

is μ-strongly monotone on

X_{1} \times \dots \times X_{N}

for some

μ > 0

, namely,

{(ϕ_{k} (x) - ϕ_{k} (y))}^{⊤} (x - y) \geq μ {∥ x - y ∥}^{2}

for all

x, y \in X_{1} \times \dots \times X_{N}

.

Assumption 5.

For each

i \in [N]

and each

ζ \in X_{i}

, the function

D (\cdot, ζ)

is K-Lipschitz on

X_{i}

; i.e.,

| D (ξ_{1}, ζ) - D (ξ_{2}, ζ) | \leq K ∥ ξ_{1} - ξ_{2} ∥, \forall ξ_{1}, ξ_{2} \in X_{i} .

Assumption 6.

The graph

G

is undirected and connected, and the weight matrix W is doubly stochastic:

W 1_{N} = 1_{N}, 1_{N}^{⊤} W = 1_{N}^{⊤} .

3. Our Proposed Algorithm

To track the time-varying Nash equilibrium while preserving the statistical privacy of the uncorrupted players, we propose a privacy-preserving distributed online mirror descent algorithm. The proposed method combines online mirror descent with dynamic average consensus, and incorporates a correlated perturbation mechanism to mask the exchanged aggregate estimates.

The detailed procedure is presented in Algorithm 1. For each player

i \in V

, let the initial action be

x_{i, 0} \in R^{n}

and let the initial aggregate estimate satisfy

y_{i, 0} = x_{i, 0}

. At each iteration k, player i first generates independent Gaussian noises for all its neighbors and constructs a correlated perturbation by aggregating the differences between the received and transmitted noises. This perturbation is then added to the local aggregate estimate before communication, so that the exchanged message is masked from the adversary’s viewpoint.

Algorithm 1 Privacy-Preserving Distributed Online Algorithm

Initialization: For each player

i \in V

, initialize

x_{i, 0} \in R^{n}

and

y_{i, 0} = x_{i, 0}

.

Iterations: For

k = 0, 1, \dots, T

, each player i performs:

1: For each neighbor

j \in N_{i}

, generate an independent Gaussian noise

η_{i j, k} \sim N (0_{n}, M_{k}^{2} I_{n}),

and compute the correlated perturbation

η_{i, k} = \sum_{j \in N_{i}} (η_{j i, k} - η_{i j, k}) .

2: Update the masked aggregate estimate:

{\tilde{y}}_{i, k} = y_{i, k} + η_{i, k} .

3: Update the local action:

x_{i, k + 1} = \underset{x \in X_{i}}{arg min} \{〈x, g_{i, k} (x_{i, k}, y_{i, k})〉 + \frac{1}{α_{k}} D (x, x_{i, k})\} .

4: Exchange

{\tilde{y}}_{i, k}

with neighbors and update the aggregate estimate:

y_{i, k + 1} = \sum_{j = 1}^{N} {[W]}_{i j} {\tilde{y}}_{j, k} + x_{i, k + 1} - x_{i, k} .

More specifically, for each neighbor

j \in N_{i}

, player i generates an independent Gaussian noise

η_{i j, k} \sim N (0_{n}, M_{k}^{2} I_{n})

and computes the correlated perturbation

η_{i, k} = \sum_{j \in N_{i}} (η_{j i, k} - η_{i j, k}) .

The masked aggregate estimate is then updated as

{\tilde{y}}_{i, k} = y_{i, k} + η_{i, k} .

Based on the current local estimate

y_{i, k}

, player i updates its action via the mirror-descent step

x_{i, k + 1} = \underset{x \in X_{i}}{arg min} \{〈x, g_{i, k} (x_{i, k}, y_{i, k})〉 + \frac{1}{α_{k}} D (x, x_{i, k})\} .

After exchanging

{\tilde{y}}_{i, k}

with its neighbors, player i updates its aggregate estimate according to

y_{i, k + 1} = \sum_{j = 1}^{N} {[W]}_{i j} {\tilde{y}}_{j, k} + x_{i, k + 1} - x_{i, k} .

The above mechanism preserves privacy by masking the communicated aggregate estimates, while still allowing each player to maintain an accurate estimate of the aggregate action through dynamic average consensus. In particular, the correlated perturbation is globally balanced, since

\sum_{i = 1}^{N} η_{i, k} = \sum_{i = 1}^{N} \sum_{j \in N_{i}} (η_{j i, k} - η_{i j, k}) = 0_{n} .

Therefore, although the individual aggregate estimates are perturbed before transmission, the average of the aggregate estimates remains unchanged. This property is essential for achieving both privacy preservation and aggregate tracking performance.

For better readability, Figure 1 provides a visual summary of the proposed privacy-preserving distributed online algorithm. At each iteration, each player first generates pairwise Gaussian noises and constructs a correlated perturbation. The perturbation is added to the local aggregate estimate before communication, so that the exchanged message is masked. After forming the masked aggregate estimate, each player first computes the next decision by the mirror-descent update. Then the masked aggregate estimates are exchanged with neighboring players, and the local aggregate estimate is updated through the dynamic average consensus step. This process is repeated over the time horizon and jointly achieves online equilibrium tracking and privacy protection.

Remark 3.

The pairwise noise-exchange step is an auxiliary protocol for implementing the correlated perturbation. For each ordered pair

(i, j)

and each time k, the variable

η_{i j, k}

is sampled independently of all local private data, cost functions, gradients, decisions, and aggregate estimates. Hence, revealing

η_{i j, k}

to its neighboring endpoint does not directly disclose private information. In the adversarial model considered in this paper, corrupted players may observe all messages and all pairwise noise variables on edges incident to them. These observed variables are therefore conditioned on in the privacy analysis. The privacy guarantee relies only on the residual randomness of pairwise noises on edges whose two endpoints are both honest.

4. Main Results

This section derives two theoretical properties of the proposed algorithm, including the regret bound and the statistical privacy guarantee.

4.1. Regret Analysis

We first give the deviation of the local action by mirror descent update.

Lemma 1.

Under Assumptions 1 and 2, each player

i \in V

satisfies

∥ x_{i, k + 1} - x_{i, k} ∥ \leq \frac{α_{k} G}{σ_{ω}}, k \geq 0 .

Proof.

Optimality of the mirror update implies that, for any

x \in X_{i}

,

〈g_{i, k} (x_{i, k}, y_{i, k}) + \frac{1}{α_{k}} (\nabla ω (x_{i, k + 1}) - \nabla ω (x_{i, k})), x - x_{i, k + 1}〉 \geq 0 .

Choosing

x = x_{i, k}

and using the

σ_{ω}

-strong convexity of

ω

gives

\frac{σ_{ω}}{α_{k}} ∥ x_{i, k + 1} - x_{i, k} ∥^{2} \leq ∥ g_{i, k} (x_{i, k}, y_{i, k}) ∥ ∥ x_{i, k + 1} - x_{i, k} ∥ .

Assumption 2 then yields the claim. □

The next lemma shows that the balancing property of the perturbations preserves the network average.

Lemma 2.

Under Assumptions 1–6,

{\bar{y}}_{k} = {\bar{x}}_{k}, \forall k \geq 0,

where

{\bar{y}}_{k} ≜ \frac{1}{N} \sum_{i = 1}^{N} y_{i, k}, {\bar{x}}_{k} ≜ \frac{1}{N} \sum_{i = 1}^{N} x_{i, k} .

Proof.

From Algorithm 1,

y_{i, k + 1} = \sum_{j = 1}^{N} w_{i j} (y_{j, k} + η_{j, k}) + x_{i, k + 1} - x_{i, k} .

Summing over i and using the double stochasticity of W gives

\sum_{i = 1}^{N} y_{i, k + 1} = \sum_{i = 1}^{N} y_{i, k} + \sum_{i = 1}^{N} η_{i, k} + \sum_{i = 1}^{N} x_{i, k + 1} - \sum_{i = 1}^{N} x_{i, k} .

Because

\sum_{i = 1}^{N} η_{i, k} = 0

, we obtain

{\bar{y}}_{k + 1} = {\bar{y}}_{k} + {\bar{x}}_{k + 1} - {\bar{x}}_{k} .

Since

y_{i, 0} = x_{i, 0}

for all i, we have

{\bar{y}}_{0} = {\bar{x}}_{0}

, and induction completes the proof. □

We next estimate the disagreement between a local aggregate estimate and the network average.

Lemma 3.

Under Assumptions 1–6, for every player

i \in V

and every

k \geq 1

,

E ∥ y_{i, k} - {\bar{x}}_{k} ∥ \leq \sqrt{N} γ^{k} P_{1} + \frac{\sqrt{N} G}{σ_{ω}} \sum_{l = 0}^{k - 1} γ^{k - l - 1} α_{l} + \sqrt{N} \sum_{l = 0}^{k - 1} γ^{k - l} E {∥ η_{l} ∥}_{\infty},

where

P_{1} ≜ {max}_{j \in V} ∥ y_{j, 0} ∥

and

γ \in (0, 1)

is the contraction factor associated with W.

Proof.

Denote

\begin{matrix} y_{k} & = col (y_{1, k}, \dots, y_{N, k}), \\ x_{k} & = col (x_{1, k}, \dots, x_{N, k}), \\ η_{k} & = col (η_{1, k}, \dots, η_{N, k}) . \end{matrix}

In vector form, the update of the aggregate estimate can be written as

y_{k + 1} = W y_{k} + W η_{k} + x_{k + 1} - x_{k} .

By recursion,

y_{k} = W^{k} y_{0} + \sum_{l = 0}^{k - 1} W^{k - l} η_{l} + \sum_{l = 0}^{k - 1} W^{k - l - 1} (x_{l + 1} - x_{l}) .

On the other hand, by Lemma 2,

{\bar{x}}_{k} = {\bar{y}}_{k} = \frac{1}{N} 1_{N}^{⊤} y_{k} .

Hence,

\begin{matrix} y_{i, k} - {\bar{x}}_{k} = & \sum_{j = 1}^{N} ({[W^{k}]}_{i j} - \frac{1}{N}) y_{j, 0} + \sum_{j = 1}^{N} \sum_{l = 0}^{k - 1} ({[W^{k - l}]}_{i j} - \frac{1}{N}) η_{j, l} \\ + \sum_{j = 1}^{N} \sum_{l = 0}^{k - 1} ({[W^{k - l - 1}]}_{i j} - \frac{1}{N}) (x_{j, l + 1} - x_{j, l}) . \end{matrix}

Taking norms and expectations, and using the standard mixing estimate

\sum_{j = 1}^{N} |{[W^{s}]}_{i j} - \frac{1}{N}| \leq \sqrt{N} γ^{s}, s \geq 0,

we obtain

E ∥ y_{i, k} - {\bar{x}}_{k} ∥ \leq \sqrt{N} γ^{k} P_{1} + \sum_{l = 0}^{k - 1} \sqrt{N} γ^{k - l} E ∥ η_{l} ∥_{\infty} + \sum_{l = 0}^{k - 1} \sqrt{N} γ^{k - l - 1} E {∥ x_{l + 1} - x_{l} ∥}_{\infty} .

By Lemma 1,

E ∥ x_{l + 1} - x_{l} ∥_{\infty} \leq \frac{G}{σ_{ω}} α_{l} .

Substituting this estimate into the above inequality yields the desired result. □

The following identity of the Bregman divergence will be used repeatedly.

Lemma 4.

For any vectors

a, b, c

,

〈 a - b, \nabla ω (b) - \nabla ω (c) 〉 = D (a, c) - D (a, b) - D (b, c) .

Lemma 5.

Under Assumption 5, for each player

i \in V

and each

k > 0

,

D (x_{i, k}^{*}, x_{i, k}) - D (x_{i, k}^{*}, x_{i, k + 1}) \leq D (x_{i, k}^{*}, x_{i, k}) - D (x_{i, k + 1}^{*}, x_{i, k + 1}) + K ∥ x_{i, k + 1}^{*} - x_{i, k}^{*} ∥ .

Proof.

Insert and subtract

D (x_{i, k + 1}^{*}, x_{i, k + 1})

:

\begin{matrix} D (x_{i, k}^{*}, x_{i, k}) - D (x_{i, k}^{*}, x_{i, k + 1}) \\ = & D (x_{i, k}^{*}, x_{i, k}) - D (x_{i, k + 1}^{*}, x_{i, k + 1}) + (D (x_{i, k + 1}^{*}, x_{i, k + 1}) - D (x_{i, k}^{*}, x_{i, k + 1})) . \end{matrix}

The second term is bounded by the Lipschitz property of

D (\cdot, x_{i, k + 1})

from Assumption 5, which gives the result. □

We then give the regret bound.

Theorem 1.

Suppose Assumptions 1–6 hold. Then the expected dynamic regret of Algorithm 1 satisfies

{Reg}_{i} (T) = O (\sqrt{T (\frac{V_{T} + 1}{α_{T}} + \sum_{k = 1}^{T} α_{k} + \sum_{k = 1}^{T} M_{k})}),

where

V_{T} ≜ \sum_{k = 1}^{T - 1} ∥ x_{k + 1}^{*} - x_{k}^{*} ∥

denotes the path variation of the equilibrium sequence.

Proof.

Denote

δ_{i, k} = y_{i, k} - σ (x_{k}) .

For the average aggregative game considered in this paper,

σ (x_{k}) = {\bar{x}}_{k}

, and hence Lemma 3 can be applied to

δ_{i, k}

.

By the convexity of

f_{i, k}

and the boundedness of the corresponding gradient in Assumption 2, we have

\begin{matrix} f_{i, k} (x_{i, k}, σ (x_{i, k}, x_{- i, k}^{*})) - f_{i, k} (x_{i, k}^{*}, σ (x_{k}^{*})) \leq G ∥ x_{i, k} - x_{i, k}^{*} ∥ . \end{matrix}

Therefore,

{Reg}_{i} (T) \leq G \sum_{k = 1}^{T} E ∥ x_{i, k} - x_{i, k}^{*} ∥ \leq G \sqrt{T \sum_{k = 1}^{T} E {∥ x_{k} - x_{k}^{*} ∥}^{2}} .

Thus, it remains to bound

\sum_{k = 1}^{T} E {∥ x_{k} - x_{k}^{*} ∥}^{2}

.

From the optimality condition of the mirror descent update, for any

x \in X_{i}

,

〈g_{i, k} (x_{i, k}, y_{i, k}) + \frac{1}{α_{k}} (\nabla ω (x_{i, k + 1}) - \nabla ω (x_{i, k})), x - x_{i, k + 1}〉 \geq 0 .

Taking

x = x_{i, k}^{*}

gives

α_{k} 〈g_{i, k} (x_{i, k}, y_{i, k}), x_{i, k + 1} - x_{i, k}^{*}〉 \leq 〈x_{i, k}^{*} - x_{i, k + 1}, \nabla ω (x_{i, k + 1}) - \nabla ω (x_{i, k})〉 .

By the three-point identity of the Bregman divergence in Lemma 4,

\begin{matrix} α_{k} 〈g_{i, k} (x_{i, k}, y_{i, k}), x_{i, k + 1} - x_{i, k}^{*}〉 \leq D (x_{i, k}^{*}, x_{i, k}) - D (x_{i, k}^{*}, x_{i, k + 1}) - D (x_{i, k + 1}, x_{i, k}) . \end{matrix}

Since

x_{i, k + 1} - x_{i, k}^{*} = x_{i, k} - x_{i, k}^{*} + x_{i, k + 1} - x_{i, k},

we obtain

\begin{matrix} α_{k} 〈g_{i, k} (x_{i, k}, y_{i, k}), x_{i, k} - x_{i, k}^{*}〉 \\ \leq & D (x_{i, k}^{*}, x_{i, k}) - D (x_{i, k}^{*}, x_{i, k + 1}) + α_{k} 〈g_{i, k} (x_{i, k}, y_{i, k}), x_{i, k} - x_{i, k + 1}〉 . \end{matrix}

Using Assumption 2 and Lemma 1, the last term satisfies

α_{k} 〈g_{i, k} (x_{i, k}, y_{i, k}), x_{i, k} - x_{i, k + 1}〉 \leq \frac{α_{k}^{2} G^{2}}{σ_{ω}} .

Hence,

\begin{matrix} α_{k} 〈g_{i, k} (x_{i, k}, y_{i, k}), x_{i, k} - x_{i, k}^{*}〉 \\ \leq & D (x_{i, k}^{*}, x_{i, k}) - D (x_{i, k}^{*}, x_{i, k + 1}) + \frac{α_{k}^{2} G^{2}}{σ_{ω}} . \end{matrix}

Applying Lemma 5 further gives

\begin{matrix} α_{k} 〈g_{i, k} (x_{i, k}, y_{i, k}), x_{i, k} - x_{i, k}^{*}〉 \\ \leq & D (x_{i, k}^{*}, x_{i, k}) - D (x_{i, k + 1}^{*}, x_{i, k + 1}) + K ∥ x_{i, k + 1}^{*} - x_{i, k}^{*} ∥ + \frac{α_{k}^{2} G^{2}}{σ_{ω}} . \end{matrix}

Next, we lower bound the left-hand side. Since

ϕ_{i, k} (x_{k}) = g_{i, k} (x_{i, k}, σ (x_{k})),

we have

\begin{matrix} \sum_{i = 1}^{N} 〈g_{i, k} (x_{i, k}, y_{i, k}), x_{i, k} - x_{i, k}^{*}〉 \\ = & 〈ϕ_{k} (x_{k}), x_{k} - x_{k}^{*}〉 + \sum_{i = 1}^{N} 〈g_{i, k} (x_{i, k}, y_{i, k}) - g_{i, k} (x_{i, k}, σ (x_{k})), x_{i, k} - x_{i, k}^{*}〉 . \end{matrix}

Since

x_{k}^{*}

is the Nash equilibrium of the stage-k game, it satisfies the variational inequality

〈ϕ_{k} (x_{k}^{*}), x - x_{k}^{*}〉 \geq 0, \forall x \in X_{1} \times \dots \times X_{N} .

Taking

x = x_{k}

and using the strong monotonicity of

ϕ_{k}

yields

〈ϕ_{k} (x_{k}), x_{k} - x_{k}^{*}〉 \geq μ {∥ x_{k} - x_{k}^{*} ∥}^{2} .

Moreover, by Assumption 3,

∥g_{i, k} (x_{i, k}, y_{i, k}) - g_{i, k} (x_{i, k}, σ (x_{k}))∥ \leq L_{i} ∥ δ_{i, k} ∥ .

Let

L_{max} = {max}_{i \in V} L_{i}

. By Assumption 2,

∥ ϕ_{k} (x) ∥ \leq \sqrt{N} G

for all feasible

x

. Combining this bound with the strong monotonicity of

ϕ_{k}

gives a uniform tracking bound

∥ x_{k} - x_{k}^{*} ∥ \leq \frac{2 \sqrt{N} G}{μ} ≜ B_{e} .

Therefore,

\begin{matrix} \sum_{i = 1}^{N} 〈g_{i, k} (x_{i, k}, y_{i, k}), x_{i, k} - x_{i, k}^{*}〉 \geq μ ∥ x_{k} - x_{k}^{*} ∥^{2} - L_{max} B_{e} \sum_{i = 1}^{N} ∥ δ_{i, k} ∥ . \end{matrix}

Combining the preceding upper and lower bounds and summing over all players gives, for

k = 1, \dots, T - 1

,

\begin{matrix} μ ∥ x_{k} - x_{k}^{*} ∥^{2} \leq & \frac{A_{k} - A_{k + 1}}{α_{k}} + \frac{K}{α_{k}} \sum_{i = 1}^{N} ∥ x_{i, k + 1}^{*} - x_{i, k}^{*} ∥ \\ + \frac{N G^{2}}{σ_{ω}} α_{k} + L_{max} B_{e} \sum_{i = 1}^{N} ∥ δ_{i, k} ∥, \end{matrix}

where

A_{k} = \sum_{i = 1}^{N} D (x_{i, k}^{*}, x_{i, k}) .

For the terminal index

k = T

, we do not introduce

x_{T + 1}^{*}

. Instead, using the inequality before applying Lemma 5 and the nonnegativity of the Bregman divergence, we obtain

μ ∥ x_{T} - x_{T}^{*} ∥^{2} \leq \frac{A_{T}}{α_{T}} + \frac{N G^{2}}{σ_{ω}} α_{T} + L_{max} B_{e} \sum_{i = 1}^{N} ∥ δ_{i, T} ∥ .

We now sum the above inequalities and take expectations. Since the stepsize sequence is nonincreasing,

\sum_{k = 1}^{T - 1} \frac{A_{k} - A_{k + 1}}{α_{k}} + \frac{A_{T}}{α_{T}} = \frac{A_{1}}{α_{1}} + \sum_{k = 2}^{T} A_{k} (\frac{1}{α_{k}} - \frac{1}{α_{k - 1}}) .

By the uniform boundedness of the Bregman divergence, there exists a constant

B_{D} > 0

such that

A_{k} \leq B_{D}

for all k. Therefore,

\sum_{k = 1}^{T - 1} \frac{A_{k} - A_{k + 1}}{α_{k}} + \frac{A_{T}}{α_{T}} \leq \frac{B_{D}}{α_{T}} .

Moreover,

\sum_{k = 1}^{T - 1} \frac{1}{α_{k}} \sum_{i = 1}^{N} ∥ x_{i, k + 1}^{*} - x_{i, k}^{*} ∥ \leq \frac{\sqrt{N}}{α_{T}} \sum_{k = 1}^{T - 1} ∥ x_{k + 1}^{*} - x_{k}^{*} ∥ = \frac{\sqrt{N} V_{T}}{α_{T}} .

It remains to bound the aggregate-estimation error term. By Lemma 3,

E ∥ δ_{i, k} ∥ = E ∥ y_{i, k} - {\bar{x}}_{k} ∥ \leq \sqrt{N} γ^{k} P_{1} + \frac{\sqrt{N} G}{σ_{ω}} \sum_{l = 0}^{k - 1} γ^{k - l - 1} α_{l} + \sqrt{N} \sum_{l = 0}^{k - 1} γ^{k - l} E {∥ η_{l} ∥}_{\infty} .

Since each perturbation component is Gaussian with standard deviation proportional to

M_{l}

, there exists a constant

C_{η} > 0

, independent of l and T, such that

E ∥ η_{l} ∥_{\infty} \leq C_{η} M_{l} .

Using the geometric summability of

γ^{k}

, we obtain

\sum_{k = 1}^{T} \sum_{i = 1}^{N} E ∥ δ_{i, k} ∥ = O (1 + \sum_{k = 1}^{T} α_{k} + \sum_{k = 1}^{T} M_{k}) .

Combining the above estimates yields

\sum_{k = 1}^{T} E {∥ x_{k} - x_{k}^{*} ∥}^{2} = O (\frac{V_{T} + 1}{α_{T}} + \sum_{k = 1}^{T} α_{k} + \sum_{k = 1}^{T} M_{k}) .

Substituting this bound into the regret estimate at the beginning of the proof gives

{Reg}_{i} (T) = O (\sqrt{T (\frac{V_{T} + 1}{α_{T}} + \sum_{k = 1}^{T} α_{k} + \sum_{k = 1}^{T} M_{k})}) .

This completes the proof. □

Remark 4.

The effectiveness of Algorithm 1 in a rapidly changing environment can be interpreted through the path variation term

V_{T} = \sum_{k = 1}^{T - 1} ∥ x_{k + 1}^{*} - x_{k}^{*} ∥

in Theorem 1. This quantity measures the cumulative movement of the time-varying Nash equilibrium. A slowly varying environment corresponds to a small value of

V_{T}

, whereas a rapidly changing environment leads to a larger

V_{T}

.

From Theorem 1, the normalized dynamic regret satisfies

\frac{{Reg}_{i} (T)}{T} = O (\sqrt{\frac{1}{T} (\frac{V_{T} + 1}{α_{T}} + \sum_{k = 1}^{T} α_{k} + \sum_{k = 1}^{T} M_{k})}) .

Therefore, the proposed algorithm can achieve vanishing average dynamic regret when

\frac{V_{T} + 1}{α_{T}} + \sum_{k = 1}^{T} α_{k} + \sum_{k = 1}^{T} M_{k} = o (T) .

For example, if

α_{k} = O (k^{- 1 / 2})

and

M_{k} = O (k^{- β})

with

0 < β < 1

, then

\frac{{Reg}_{i} (T)}{T} = O (\sqrt{\frac{V_{T} + 1}{\sqrt{T}} + T^{- 1 / 2} + T^{- β}}) .

Hence, when

V_{T} = o (T)

, the average dynamic regret converges to zero even though the equilibrium is time-varying. However, if the environment changes very rapidly so that

V_{T} = Θ (T)

, the bound predicts a non-vanishing average regret. This is consistent with the intrinsic difficulty of tracking a fast-moving equilibrium using only causal online information. In such cases, the algorithm can still operate in a distributed and privacy-preserving manner, but the tracking error and regret are expected to increase.

4.2. Statistical Privacy Guarantee

Assumption 7.

The set of corrupted players

C

is not a vertex cut of the communication graph

G

. Hence, the subgraph induced by the uncorrupted players, denoted by

G_{H}

, is connected.

We first characterize the distribution of the correlated perturbation.

Lemma 6.

Suppose that the communication graph

G

is undirected and connected. For each edge

e_{q} = (i, j)

with

i < j

, define

η_{e_{q}, k} = η_{j i, k} - η_{i j, k} .

Let

η_{E, k} = col (η_{e_{1}, k}, \dots, η_{e_{| E |}, k})

, and let D be the oriented incidence matrix of

G

. Then, for each coordinate

ℓ \in [n]

,

{[η_{k}]}_{ℓ} \sim N^{†} (0_{N}, 2 M_{k}^{2} L),

where

η_{k} = col (η_{1, k}, \dots, η_{N, k}),

L = D D^{⊤}

is the Laplacian matrix of

G

, and

N^{†}

denotes the degenerate Gaussian distribution.

Proof.

For each edge

e_{q} = (i, j)

with

i < j

, since

η_{i j, k}

and

η_{j i, k}

are independent and both follow

N (0_{n}, M_{k}^{2} I_{n})

, one has

η_{e_{q}, k} = η_{j i, k} - η_{i j, k} \sim N (0_{n}, 2 M_{k}^{2} I_{n}) .

Hence, for each coordinate

ℓ \in [n]

,

{[η_{e_{q}, k}]}_{ℓ} \sim N (0, 2 M_{k}^{2}) .

By the definition of the correlated perturbation,

η_{i, k} = \sum_{j \in N_{i}} (η_{j i, k} - η_{i j, k}),

and thus, in vector form,

η_{k} = D η_{E, k} .

Therefore,

{[η_{k}]}_{ℓ} = D {[η_{E, k}]}_{ℓ} .

Since all edge noises are independent, it follows that

E [{[η_{k}]}_{ℓ}] = 0_{N},

and

Cov ({[η_{k}]}_{ℓ}) = D Cov ({[η_{E, k}]}_{ℓ}) D^{⊤} = 2 M_{k}^{2} D D^{⊤} = 2 M_{k}^{2} L .

Hence,

{[η_{k}]}_{ℓ} \sim N^{†} (0_{N}, 2 M_{k}^{2} L) .

This completes the proof. □

The next lemma shows that the Kullback–Leibler divergence between two candidate observations can be bounded by the distance between the corresponding aggregate-estimate vectors.

Lemma 7.

Let

y_{k}

and

y_{k}^{'}

be two candidate aggregate-estimate vectors that lead to the same masked observation

{\tilde{y}}_{k}

, that is,

{\tilde{y}}_{k} = y_{k} + η_{k} = y_{k}^{'} + η_{k}^{'} .

Then, for each coordinate

ℓ \in [n]

,

D_{KL} (h_{{[{\tilde{y}}_{k}]}_{ℓ} ∣ {[y_{k}]}_{ℓ}}, h_{{[{\tilde{y}}_{k}]}_{ℓ} ∣ {[y_{k}^{'}]}_{ℓ}}) \leq \frac{∥ {[y_{k}]}_{ℓ} - {[y_{k}^{'}]}_{ℓ} ∥^{2}}{4 M_{k}^{2} \underset{̲}{λ} (L)},

where

\underset{̲}{λ} (L)

denotes the smallest nonzero eigenvalue of L. Consequently,

D_{KL} (h_{{\tilde{y}}_{k} ∣ y_{k}}, h_{{\tilde{y}}_{k} ∣ y_{k}^{'}}) \leq \frac{∥ y_{k} - y_{k}^{'} ∥^{2}}{4 M_{k}^{2} \underset{̲}{λ} (L)} .

Proof.

Since G is connected, we have

R (L) = {z \in R^{N} : 1_{N}^{⊤} z = 0} .

By Lemma 6, for each coordinate

ℓ \in [n]

, both

{[η_{k}]}_{ℓ}

and

{[η_{k}^{'}]}_{ℓ}

are supported on

R (L)

. Hence the conditional laws of

{[{\tilde{y}}_{k}]}_{ℓ}

given

{[y_{k}]}_{ℓ}

and

{[y_{k}^{'}]}_{ℓ}

are supported on

{[y_{k}]}_{ℓ} + R (L) and {[y_{k}^{'}]}_{ℓ} + R (L),

respectively. Since the two candidate aggregate-estimate vectors lead to the same masked observation,

{\tilde{y}}_{k} = y_{k} + η_{k} = y_{k}^{'} + η_{k}^{'},

we have, for each

ℓ \in [n]

,

{[y_{k}]}_{ℓ} - {[y_{k}^{'}]}_{ℓ} = {[η_{k}^{'}]}_{ℓ} - {[η_{k}]}_{ℓ} \in R (L) .

Therefore,

{[y_{k}]}_{ℓ} + R (L) = {[y_{k}^{'}]}_{ℓ} + R (L),

which means that the two singular Gaussian measures have the same affine support. The densities below are understood with respect to the

(N - 1)

-dimensional Lebesgue measure on this common affine support.

By Lemma 6, for each

ℓ \in [n]

, the random vector

{[η_{k}]}_{ℓ}

follows the degenerate Gaussian distribution

N^{†} (0_{N}, 2 M_{k}^{2} L) .

Hence, the conditional density of

{[{\tilde{y}}_{k}]}_{ℓ}

given

{[y_{k}]}_{ℓ}

is

h_{{[{\tilde{y}}_{k}]}_{ℓ} ∣ {[y_{k}]}_{ℓ}} (z) = \frac{1}{\sqrt{{det}^{*} (4 π M_{k}^{2} L)}} exp (- \frac{{(z - {[y_{k}]}_{ℓ})}^{⊤} L^{†} (z - {[y_{k}]}_{ℓ})}{4 M_{k}^{2}}),

and similarly,

h_{{[{\tilde{y}}_{k}]}_{ℓ} ∣ {[y_{k}^{'}]}_{ℓ}} (z) = \frac{1}{\sqrt{{det}^{*} (4 π M_{k}^{2} L)}} exp (- \frac{{(z - {[y_{k}^{'}]}_{ℓ})}^{⊤} L^{†} (z - {[y_{k}^{'}]}_{ℓ})}{4 M_{k}^{2}}) .

Substituting the above densities into the definition of KLD yields

D_{KL} (h_{{[{\tilde{y}}_{k}]}_{ℓ} ∣ {[y_{k}]}_{ℓ}}, h_{{[{\tilde{y}}_{k}]}_{ℓ} ∣ {[y_{k}^{'}]}_{ℓ}}) = \frac{1}{4 M_{k}^{2}} {({[y_{k}]}_{ℓ} - {[y_{k}^{'}]}_{ℓ})}^{⊤} L^{†} ({[y_{k}]}_{ℓ} - {[y_{k}^{'}]}_{ℓ}) .

Since

{[y_{k}]}_{ℓ} - {[y_{k}^{'}]}_{ℓ} \in R (L)

and the eigenvalues of

L^{†}

on

R (L)

are

1 / λ_{2} (L), \dots, 1 / λ_{N} (L)

, we have

{({[y_{k}]}_{ℓ} - {[y_{k}^{'}]}_{ℓ})}^{⊤} L^{†} ({[y_{k}]}_{ℓ} - {[y_{k}^{'}]}_{ℓ}) \leq \frac{∥ {[y_{k}]}_{ℓ} - {[y_{k}^{'}]}_{ℓ} ∥^{2}}{λ_{2} (L)} .

Then, we obtain

D_{KL} (h_{{[{\tilde{y}}_{k}]}_{ℓ} ∣ {[y_{k}]}_{ℓ}}, h_{{[{\tilde{y}}_{k}]}_{ℓ} ∣ {[y_{k}^{'}]}_{ℓ}}) \leq \frac{∥ {[y_{k}]}_{ℓ} - {[y_{k}^{'}]}_{ℓ} ∥^{2}}{4 M_{k}^{2} \underset{̲}{λ} (L)} .

Summing over all coordinates

ℓ \in [n]

gives

D_{KL} (h_{{\tilde{y}}_{k} ∣ y_{k}}, h_{{\tilde{y}}_{k} ∣ y_{k}^{'}}) \leq \frac{∥ y_{k} - y_{k}^{'} ∥^{2}}{4 M_{k}^{2} \underset{̲}{λ} (L)} .

This completes the proof. □

We are now ready to establish the statistical privacy guarantee of Algorithm 1.

Theorem 2 (Statistical Privacy).

Suppose Assumptions 1–7 hold. Then, during the time horizon T, Algorithm 1 preserves the statistical privacy of the uncorrupted players. Specifically, for any

k \leq T

,

D_{KL} (h_{{\tilde{y}}_{H, k} ∣ y_{H, k}}, h_{{\tilde{y}}_{H, k} ∣ y_{H, k}^{'}}) \leq \frac{N G^{2} α_{k}^{2}}{σ_{ω}^{2} \underset{̲}{λ} (L_{H}) M_{k}^{2}},

where

L_{H}

is the Laplacian matrix of the subgraph

G_{H}

induced by the uncorrupted players, and

\underset{̲}{λ} (L_{H})

denotes its smallest nonzero eigenvalue.

Proof.

For each honest player

i \in H

, decompose its perturbation as

η_{i, k} = b_{i, k} + ξ_{i, k},

where

b_{i, k} = \sum_{j \in N_{i} \cap C} (η_{j i, k} - η_{i j, k})

is the contribution from edges incident to corrupted players, and

ξ_{i, k} = \sum_{j \in N_{i} \cap H} (η_{j i, k} - η_{i j, k})

is the contribution from edges whose two endpoints are honest.

The variables

b_{i, k}

are observed by the adversary and are therefore conditioned on in the privacy analysis. They act only as known deterministic shifts. Let

{\hat{y}}_{H, k} = {\tilde{y}}_{H, k} - b_{H, k} .

Then

{\hat{y}}_{H, k} = y_{H, k} + ξ_{H, k} .

Since Assumption 7 guarantees that the honest-player subgraph

G_{H}

is connected, Lemma 6 applied to

G_{H}

gives, for each coordinate

ℓ \in [n]

,

{[ξ_{H, k}]}_{ℓ} \sim N^{†} (0, 2 M_{k}^{2} L_{H}),

and

{[ξ_{H, k}]}_{ℓ} \in R (L_{H}) .

Now consider two candidate honest-player datasets that generate the same observation record. Since

b_{H, k}

is part of the conditioned adversarial observation, the same realization of

b_{H, k}

is used for both candidates. Hence

{\hat{y}}_{H, k} = y_{H, k} + ξ_{H, k} = y_{H, k}^{'} + ξ_{H, k}^{'} .

Therefore, by Lemma 7 applied to the honest-player subgraph

G_{H}

,

D_{KL} (h_{{\hat{y}}_{H, k} | y_{H, k}}, h_{{\hat{y}}_{H, k} | y_{H, k}^{'}}) \leq \frac{∥ y_{H, k} - y_{H, k}^{'} ∥^{2}}{4 M_{k}^{2} λ_{2} (L_{H})} .

Since

{\hat{y}}_{H, k} = {\tilde{y}}_{H, k} - b_{H, k}

differs from

{\tilde{y}}_{H, k}

only by a known translation, the same KL bound holds for the conditional laws of

{\tilde{y}}_{H, k}

.

It remains to bound

∥ y_{H, k} - y_{H, k}^{'} ∥

.

From Algorithm 1, the aggregate-estimate update is

y_{k + 1} = W {\tilde{y}}_{k} + x_{k + 1} - x_{k} .

Hence, for two candidate datasets generating the same masked estimate

{\tilde{y}}_{k}

, one has

y_{k + 1} - y_{k + 1}^{'} = (x_{k + 1} - x_{k}) - (x_{k + 1}^{'} - x_{k}^{'}) .

Restricting the above equality to the uncorrupted players gives

y_{H, k + 1} - y_{H, k + 1}^{'} = (x_{H, k + 1} - x_{H, k}) - (x_{H, k + 1}^{'} - x_{H, k}^{'}) .

Therefore,

∥ y_{H, k + 1} - y_{H, k + 1}^{'} ∥ \leq ∥ x_{H, k + 1} - x_{H, k} ∥ + ∥ x_{H, k + 1}^{'} - x_{H, k}^{'} ∥ .

By Lemma 1, for each

i \in H

,

∥ x_{i, k + 1} - x_{i, k} ∥ \leq \frac{α_{k} G}{σ_{ω}}, ∥ x_{i, k + 1}^{'} - x_{i, k}^{'} ∥ \leq \frac{α_{k} G}{σ_{ω}} .

Thus,

∥ x_{H, k + 1} - x_{H, k} ∥^{2} \leq | H | \frac{α_{k}^{2} G^{2}}{σ_{ω}^{2}} \leq N \frac{α_{k}^{2} G^{2}}{σ_{ω}^{2}},

and similarly,

∥ x_{H, k + 1}^{'} - x_{H, k}^{'} ∥^{2} \leq N \frac{α_{k}^{2} G^{2}}{σ_{ω}^{2}} .

Hence,

∥ y_{H, k + 1} - y_{H, k + 1}^{'} ∥ \leq 2 \sqrt{N} \frac{α_{k} G}{σ_{ω}},

which implies

∥ y_{H, k + 1} - y_{H, k + 1}^{'} ∥^{2} \leq 4 N \frac{α_{k}^{2} G^{2}}{σ_{ω}^{2}} .

Replacing

k + 1

by k in the above bound, we obtain

∥ y_{H, k} - y_{H, k}^{'} ∥^{2} \leq 4 N \frac{α_{k}^{2} G^{2}}{σ_{ω}^{2}} .

Substituting this estimate into the KLD inequality yields

D_{KL} (h_{{\tilde{y}}_{H, k} ∣ y_{H, k}}, h_{{\tilde{y}}_{H, k} ∣ y_{H, k}^{'}}) \leq \frac{N α_{k}^{2} G^{2}}{M_{k}^{2} σ_{ω}^{2} \underset{̲}{λ} (L_{H})} .

According to Definition 3, Algorithm 1 preserves the statistical privacy of the uncorrupted players. □

Remark 5.

The quantity bounded in Theorem 2 has a direct statistical interpretation. For two candidate honest-player datasets, the KL divergence can be written as

D_{KL} (P_{O | D_{H}} ∥ P_{O | D_{H}^{'}}) = E_{O \sim P_{O | D_{H}}} [log \frac{d P_{O | D_{H}}}{d P_{O | D_{H}^{'}}}],

which is the expected log-likelihood ratio available to the adversary for distinguishing the two candidate datasets. Therefore, a small KL divergence means that the observation record provides limited statistical evidence for distinguishing these two candidates. Moreover, by Pinsker’s inequality, if the KL divergence is bounded by ρ, then the total variation distance between the two observation distributions is at most

\sqrt{ρ / 2}

.

For each stage k, Theorem 2 gives the privacy leakage bound

ρ_{k} = \frac{N G^{2} α_{k}^{2}}{σ_{ω}^{2} λ (L_{H}) M_{k}^{2}} .

Thus, increasing the perturbation magnitude

M_{k}

or improving the algebraic connectivity

λ (L_{H})

of the honest subgraph reduces the statistical leakage, while larger stepsizes may increase it. This interpretation also explains the privacy-performance tradeoff: larger perturbations improve privacy but may increase the estimation error and regret.

The above KL-based notion is different from the standard

(ϵ, δ)

-differential privacy [19,20]. Differential privacy imposes a worst-case output likelihood-ratio bound for adjacent datasets. In contrast, the present criterion controls the average log-likelihood ratio, namely the KL divergence, between two candidate observation distributions.

5. Numerical Case Study: Privacy-Preserving EV Charging Coordination

In this section, we validate the proposed privacy-preserving distributed online algorithm through a practical electric vehicle (EV) charging coordination case study. The case study is motivated by residential or workplace charging networks, where a group of EV users repeatedly update their charging powers according to time-varying electricity prices, charging preferences, and the aggregate load of the network. In such systems, the charging profile of an individual user may reveal private information, such as daily routines, travel habits, and energy consumption patterns. Therefore, privacy-preserving distributed coordination is important for enabling cooperative charging control without directly exposing users’ local information.

In the considered EV charging network, each player corresponds to one EV user. The decision variable represents the charging power, the aggregate term represents the average charging load, and the local cost captures both the price-related payment and the deviation from the user’s preferred charging demand. This application naturally fits the online aggregative game model because the charging environment varies over time and each user’s cost depends on both its own charging action and the aggregate behavior of the charging population.

Consider a charging network with

N = 20

users. For each user

i \in V

, let

x_{i} (k) \in R

denote the charging power at iteration k, and let the local feasible set be

X_{i} = [0, {\bar{x}}_{i}], i = 1, \dots, N,

where

{\bar{x}}_{i} = 6

. The aggregative term is defined as

σ (x (k)) = \frac{1}{N} \sum_{j = 1}^{N} x_{j} (k) .

For each user i, the time-varying local cost function is given by

J_{i, k} (x_{i} (k), σ (x (k))) = (a_{k} σ (x (k)) + b_{k}) x_{i} (k) + c_{i} {(x_{i} (k) - r_{i, k})}^{2},

where

a_{k} = 0.35 + 0.10 sin (k / 90)

,

b_{k} = 0.40 + 0.08 cos (k / 110)

,

r_{i, k} = 2.2 + \frac{i}{25} + 0.40 sin (\frac{k}{120} + \frac{i}{7})

,

c_{i} = 1 + \frac{i}{40} .

The communication graph is chosen as an undirected ring graph, and the mixing matrix W is constructed by the Metropolis rule. Since each node in the ring graph has degree two, the nonzero weights are

w_{i i} = 1 / 3

and

w_{i j} = 1 / 3

for

j \in N_{i}

. The proposed algorithm is implemented under the Euclidean Bregman divergence

D (x, z) = \frac{1}{2} {∥ x - z ∥}^{2}

. The simulation horizon is

T = 800

, and the iteration index is

k = 1, \dots, T

. The stepsize is

α_{k} = \frac{α_{0}}{\sqrt{k + 1}},

where

α_{0} = 0.35

. For the Gaussian perturbation mechanism, the perturbation magnitude is

M_{k} = \frac{M_{0}}{{(k + 1)}^{β}},

with

β = 0.25

and

M_{0} = 0.45

. For the Laplace-noise benchmark, the Laplace scale is

M_{k}^{L} = \frac{1.60}{{(k + 1)}^{0.25}} .

The privacy-free baseline corresponds to

M_{k} = 0

. The initial action of each user is sampled uniformly from

[0, 6]

, and the initial aggregate estimate is chosen as

y_{i} (0) = x_{i} (0)

.

All reported curves are averaged over 20 independent Monte Carlo runs. For Figure 2 and Figure 3, the random seeds are 1000–1019 for the privacy-free baseline, 2000–2019 for the Gaussian perturbation mechanism, and 3000–3019 for the Laplace-noise benchmark. The corrupted-player set used for the privacy interpretation is fixed as

C = {1, 2},

and hence

| C | = 2

and

H = {3, \dots, 20}

. Removing nodes 1 and 2 from the ring graph leaves the honest-player subgraph connected, so Assumption 7 is satisfied.

For the experiment with different privacy calibrations in Figure 4, we set a calibration constant

Δ = 7.5

. For positive

ϵ \in {5, 10, 15}

, the Gaussian perturbation parameter is chosen as

M_{0} = \frac{Δ}{ϵ},

which gives

M_{0} = 1.50, 0.75, 0.50

, respectively. The case

ϵ = 0

denotes the privacy-free no-noise baseline. The random seeds are 4000–4019, 4100–4119, 4200–4219, and 4300–4319 for

ϵ = 0, 5, 10, 15

, respectively. The plotted curves are Monte Carlo means. For a plotted quantity

z (k)

, the corresponding

95 %

confidence interval is computed as

\bar{z} (k) \pm 1.96 \frac{s_{z} (k)}{\sqrt{20}},

where

\bar{z} (k)

and

s_{z} (k)

are the sample mean and sample standard deviation over the 20 Monte Carlo runs.

The practical interpretation of the main variables and parameters in the EV charging case study is summarized in Table 2. This interpretation shows how the abstract online aggregative game model corresponds to a concrete charging coordination problem.

To evaluate the algorithm performance, we consider the maximum average estimate error

e_{est} (k) = max_{i \in V} |y_{i} (k) - \bar{x} (k)|, \bar{x} (k) = \frac{1}{N} \sum_{j = 1}^{N} x_{j} (k),

and the normalized regret

Reg (T) / T,

where

Reg (T) = \sum_{k = 1}^{T} (\sum_{i = 1}^{N} J_{i, k} (x_{i} (k), σ (x (k))) - \sum_{i = 1}^{N} J_{i, k} (x_{i}^{*} (k), σ (x^{*} (k)))),

and

x^{*} (k)

is the time-varying Nash equilibrium at iteration k.

Figure 2 compares the maxima of average estimate errors for three algorithms, namely, the privacy-free distributed online algorithm, the proposed algorithm with correlated Gaussian perturbation, and the proposed algorithm with correlated Laplace perturbation. It can be observed that the privacy-free algorithm achieves the smallest estimate error, while both privacy-preserving algorithms lead to slightly larger errors due to the injected perturbations. Moreover, the Gaussian perturbation yields slightly better tracking accuracy than the Laplace perturbation.

Figure 3 shows the corresponding curves of

Reg (T) / T

for the above three algorithms. The privacy-free algorithm gives the smallest normalized regret. After introducing privacy-preserving perturbations, the regret becomes slightly larger, but all curves still decrease as the iteration proceeds. This shows that the proposed method can preserve satisfactory online decision performance while protecting the exchanged aggregate estimates.

To further evaluate the robustness of the proposed algorithm, we provide a statistical summary of the numerical results over 20 independent Monte Carlo runs. For each run, we compute the time-averaged maximum estimation error and the time-averaged normalized regret over the horizon

T = 800

. Table 3 reports the mean, sample standard deviation, and median of these quantities. The results show that Algorithm 1 has a slightly larger estimation error and regret than the privacy-free baseline due to the injected perturbations, but it remains stable across different random trials. Compared with the Laplace-noise benchmark, the proposed correlated Gaussian perturbation achieves smaller average estimation error and smaller normalized regret, which further supports the reliability of the proposed mechanism.

To further illustrate the influence of privacy levels, Figure 4 plots the curves of

Reg (T) / T

for different privacy values. It is seen that higher privacy level magnitudes generally lead to larger regret, which reveals the tradeoff between privacy protection and online performance.

Finally, we modify the feasible set from an interval to a simplex set and compare the Euclidean-distance-based algorithm with the KL-divergence-based algorithm. Specifically, for each user i, the decision variable is changed to

x_{i} (k) = {[x_{i}^{1} (k), x_{i}^{2} (k), \dots, x_{i}^{d} (k)]}^{⊤} \in R^{d},

with

d = 3

, and the feasible set is

Δ = \{x \in R^{d} | x_{ℓ} \geq 0, \sum_{ℓ = 1}^{d} x_{ℓ} = 1\} .

To examine the influence of network topology, we compare the proposed algorithm under three connected communication graphs: a ring graph, a ring graph with additional chordal edges, and a star graph. For the star graph, the hub node is selected outside the corrupted-player set to keep the honest-player subgraph connected. Figure 5 compares the aggregate-estimation error under these communication graphs. The results show that the proposed algorithm remains effective under all tested topologies and is not restricted to a single ring graph. The graph with additional chordal edges generally gives a smaller estimation error, indicating that better network connectivity improves the information-mixing ability of the dynamic average consensus step.

Figure 6 presents the corresponding curves of

Reg (T) / T

. It is observed that both methods can solve the problem over the simplex set, while the KL-divergence algorithm achieves better regret performance, which indicates that the induced non-Euclidean geometry is more suitable for simplex constrained charging allocation problems.

6. Discussion

The simulation results show that the proposed method can balance privacy protection and online decision performance in distributed online aggregative games. After introducing perturbations into the exchanged aggregate estimates, the estimate errors and normalized regret become slightly larger than those of the privacy-free algorithm. However, the overall performance remains stable, which indicates that the correlated perturbation mechanism does not destroy the online tracking ability of the algorithm.

The results also show that the perturbation distribution affects performance. In the considered setting, the correlated Gaussian perturbation achieves slightly better estimate accuracy and regret performance than the correlated Laplace perturbation. In addition, the experiments under different privacy levels reveal a clear relationship between privacy strength and online performance. Stronger privacy protection requires larger perturbations, which improves privacy but also increases the regret.

The simplex constrained experiment provides another useful observation. Compared with the Euclidean distance based update, the KL-divergence update achieves better performance. This result suggests that when the feasible set has a simplex structure, choosing a geometry that matches the constraint set is beneficial. Overall, the proposed method provides an effective way to protect privacy while maintaining satisfactory online performance.

The EV charging case study also illustrates the practical relevance of the proposed framework. In a distributed charging network, users may be unwilling to share their charging states or aggregate-load estimates directly, since such information can be linked to personal mobility and energy consumption habits. Therefore, the method is suitable for privacy-sensitive networked resource allocation problems in which local decisions are coupled through an aggregate quantity. Beyond EV charging, similar structures appear in demand response, shared energy management, communication resource allocation, congestion control, and cloud resource allocation. In these applications, each agent optimizes its own online decision while being affected by the aggregate behavior of the whole population, and privacy protection is needed during repeated information exchange.

7. Conclusions

This paper studied statistical privacy preservation in distributed online aggregative games with time-varying costs. To protect sensitive local information during repeated communication, we proposed a privacy-preserving distributed online mirror descent algorithm with correlated perturbations. The correlated perturbations mask the exchanged aggregate estimates while preserving a global balancing property, which allows the dynamic average-tracking mechanism to be maintained. Under the stated assumptions, we established an expected dynamic regret bound and a Kullback–Leibler divergence statistical privacy guarantee. The numerical simulations on electric vehicle charging further illustrate the effectiveness of the proposed algorithm, the tradeoff between privacy protection and online performance, and the advantage of using a KL-divergence-based mirror geometry for simplex-constrained problems.

The proposed framework can be applied to broader distributed online decision-making problems, such as energy management, communication networks, traffic control, and resource allocation. Several directions remain open for future research. First, it would be meaningful to extend the method to directed, time-varying, lossy, or asynchronous communication networks. Second, adaptive choices of stepsizes and perturbation magnitudes may further improve the tradeoff between privacy and regret performance. Finally, stronger adversarial or collusion models, extensions to coupled constraints, stochastic feedback, and large-scale practical applications deserve further investigation.

Author Contributions

Conceptualization, M.Y.; methodology, M.Y.; software, M.Y.; validation, M.Y.; formal analysis, M.Y.; investigation, M.Y.; writing—original draft, M.Y.; writing—review & editing, R.Y.; visualization, R.Y.; supervision, R.Y.; project administration, R.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Hazan, E. Introduction to Online Convex Optimization. Found. Trends Optim. 2016, 2, 157–325. [Google Scholar] [CrossRef]
Shalev-Shwartz, S. Online Learning and Online Convex Optimization. Found. Trends Mach. Learn. 2012, 4, 107–194. [Google Scholar] [CrossRef]
Li, X.; Xie, L.; Li, N. A Survey on Distributed Online Optimization and Game. arXiv 2023, arXiv:2205.00473. [Google Scholar] [CrossRef]
Yi, X.; Li, X.; Xie, L.; Johansson, K.H. Distributed Online Convex Optimization with Time-Varying Coupled Inequality Constraints. IEEE Trans. Signal Process. 2020, 68, 731–746. [Google Scholar] [CrossRef]
Nedić, A.; Ozdaglar, A. Distributed Subgradient Methods for Multi-Agent Optimization. IEEE Trans. Autom. Control 2009, 54, 48–61. [Google Scholar] [CrossRef]
Nedić, A.; Olshevsky, A. Stochastic Gradient-Push for Strongly Convex Functions on Time-Varying Directed Graphs. IEEE Trans. Autom. Control 2016, 61, 3936–3947. [Google Scholar] [CrossRef]
Zhang, W.; Jiang, Y.; Zhang, W. Antecedents of Online Knowledge Seeking of Employees in Technical R&D Team: An Empirical Study in China. IEEE Trans. Eng. Manag. 2023, 70, 523–532. [Google Scholar] [CrossRef]
Liu, Y.; Jiang, L.; Qi, Q.; Xie, K.; Xie, S. Online Computation Offloading for Collaborative Space/Aerial-Aided Edge Computing Toward 6G System. IEEE Trans. Veh. Technol. 2024, 73, 2495–2505. [Google Scholar] [CrossRef]
Liu, H.; Hu, M.; Yang, L. A Directional Flow Pattern-Driven Framework for Fine-Grained Air Traffic Flow Prediction. Aerosp. Sci. Technol. 2026, 168, 111029. [Google Scholar] [CrossRef]
Koshal, J.; Nedić, A.; Shanbhag, U.V. Distributed Algorithms for Aggregative Games on Graphs. Oper. Res. 2016, 64, 680–704. [Google Scholar] [CrossRef]
Belgioioso, G.; Nedić, A.; Grammatico, S. Distributed Generalized Nash Equilibrium Seeking in Aggregative Games on Time-Varying Networks. IEEE Trans. Autom. Control 2021, 66, 2061–2075. [Google Scholar] [CrossRef]
Salehisadaghiani, F.; Pavel, L. Distributed Nash Equilibrium Seeking: A Gossip-Based Algorithm. Automatica 2016, 72, 209–216. [Google Scholar] [CrossRef]
Pavel, L. Game Theory for Control of Optical Networks; Birkhäuser: Cham, Switzerland, 2012. [Google Scholar]
Yang, C.; Wang, S.; Zhang, S.; Lin, S.; Huang, B. A Class of Distributed Online Aggregative Optimization in Unknown Dynamic Environment. Mathematics 2024, 12, 2460. [Google Scholar] [CrossRef]
Huo, B.; Ma, J.; Du, M.; Yin, L. Average Consensus Tracking of Weight-Balanced Multi-Agent Systems via Sampled Data. Mathematics 2024, 12, 674. [Google Scholar] [CrossRef]
He, L.; Cheng, H.; Zhang, Y. Centralized and Decentralized Event-Triggered Nash Equilibrium-Seeking Strategies for Heterogeneous Multi-Agent Systems. Mathematics 2025, 13, 419. [Google Scholar] [CrossRef]
Cao, Z.; Guo, X.; Zhang, H. Privacy-Preserving Distributed Learning via Newton Algorithm. Mathematics 2023, 11, 3807. [Google Scholar] [CrossRef]
Duchi, J.C.; Jordan, M.I.; Wainwright, M.J. Privacy-Aware Learning. J. ACM 2014, 61, 38. [Google Scholar] [CrossRef]
Dwork, C.; Roth, A. The Algorithmic Foundations of Differential Privacy. Found. Trends Theor. Comput. Sci. 2014, 9, 211–407. [Google Scholar] [CrossRef]
Kairouz, P.; Oh, S.; Viswanath, P. The Composition Theorem for Differential Privacy. IEEE Trans. Inf. Theory 2017, 63, 4037–4049. [Google Scholar] [CrossRef]
Huang, Z.; Mitra, S.; Dullerud, G.E. Differentially Private Iterative Synchronous Consensus. In Proceedings of the 2012 ACM Workshop on Privacy in the Electronic Society, Raleigh, NC, USA, 15 October 2012; pp. 81–90. [Google Scholar]
Nozari, E.; Tallapragada, P.; Cortés, J. Differentially Private Average Consensus: Obstructions, Trade-Offs, and Optimal Algorithm Design. Automatica 2017, 81, 221–231. [Google Scholar] [CrossRef]
Zhang, K.; Li, Z.; Wang, Y.; Louati, A.; Chen, J. Privacy-Preserving Dynamic Average Consensus via State Decomposition: Case Study on Multi-Robot Formation Control. Automatica 2022, 136, 110182. [Google Scholar] [CrossRef]
Wang, Y. A Robust Dynamic Average Consensus Algorithm that Ensures Both Differential Privacy and Accurate Convergence. In Proceedings of the 62nd IEEE Conference on Decision and Control (CDC), Singapore, 13–15 December 2023; pp. 1130–1137. [Google Scholar]
Lin, Y.; Liu, K.; Han, D.; Xia, Y. Statistical Privacy-Preserving Online Distributed Nash Equilibrium Tracking in Aggregative Games. IEEE Trans. Autom. Control 2024, 69, 323–330. [Google Scholar]
Yuan, D.; Hong, Y.; Ho, D.W.C.; Xu, S. Distributed Mirror Descent for Online Composite Optimization. IEEE Trans. Autom. Control 2021, 66, 714–729. [Google Scholar] [CrossRef]
Beck, A.; Teboulle, M. Mirror Descent and Nonlinear Projected Subgradient Methods for Convex Optimization. Oper. Res. Lett. 2003, 31, 167–175. [Google Scholar] [CrossRef]
Bubeck, S. Convex Optimization: Algorithms and Complexity. Found. Trends Mach. Learn. 2015, 8, 231–357. [Google Scholar] [CrossRef]
Yuan, M.; Lei, J.; Hong, Y. Differentially Private Distributed Online Mirror Descent Algorithm. Neurocomputing 2023, 551, 126531. [Google Scholar] [CrossRef]

Figure 1. Flow chart of the proposed privacy-preserving distributed online mirror descent algorithm.

Figure 2. Comparison of maxima of average estimate errors for the privacy-free algorithm, the proposed algorithm with correlated Gaussian perturbation, and the proposed algorithm with correlated Laplace perturbation.

Figure 3. Comparison of

Reg (T) / T

for the privacy-free algorithm, the proposed algorithm with correlated Gaussian perturbation, and the algorithm with correlated Laplace perturbation.

Figure 3. Comparison of

Reg (T) / T

for the privacy-free algorithm, the proposed algorithm with correlated Gaussian perturbation, and the algorithm with correlated Laplace perturbation.

Figure 4. Comparison of

Reg (T) / T

for the proposed algorithm under different privacy protection levels.

Figure 4. Comparison of

Reg (T) / T

for the proposed algorithm under different privacy protection levels.

Figure 5. Maximum aggregate-estimation error under different communication graphs.

Figure 6. Comparison of

Reg (T) / T

under simplex constraints for the Euclidean distance algorithm and the KL-divergence algorithm.

Figure 6. Comparison of

Reg (T) / T

under simplex constraints for the Euclidean distance algorithm and the KL-divergence algorithm.

Table 1. Comparison with representative related works.

Work	Problem Setting	Method and Privacy Feature	Main Distinction
Yang et al. [14]	Distributed online aggregative optimization	Distributed online update with performance analysis; privacy is not the main focus	Does not address privacy-preserving online aggregative games.
Cao et al. [17]	Privacy-preserving distributed learning	Newton method with a privacy-preserving mechanism	Does not study online game equilibrium tracking.
Lin et al. [25]	Privacy-preserving online aggregative games	Euclidean projected gradient update; privacy with correlated perturbations	Closest to this paper, but only relies on Euclidean projection.
This paper	Privacy-preserving distributed online aggregative games	Distributed online mirror descent; privacy with correlated perturbations and KL-divergence-based statistical privacy	Integrates dynamic equilibrium tracking, non-Euclidean decision geometry, and statistical privacy protection.

Table 2. Practical interpretation of the EV charging case study.

Symbol or Parameter	Practical Meaning
$i \in V$	EV user in the distributed charging network.
$x_{i} (k)$	Charging power selected by user i at iteration k.
$X_{i} = [0, {\bar{x}}_{i}]$	Physical charging-power limit of user i.
$σ (x (k))$	Average charging load of all users, reflecting the aggregate demand level.
$a_{k}$ and $b_{k}$	Time-varying price or grid-condition coefficients.
$r_{i, k}$	Preferred charging demand of user i at time k.
$c_{i}$	User penalty weight for deviating from the preferred charging demand.

Table 3. Statistical summary over 20 independent Monte Carlo runs.

Algorithm	Time-Averaged Estimation Error		Time-Averaged $Reg (T) / T$
Algorithm	Mean ± Std.	Median	Mean ± Std.	Median
Privacy-free	$0.0657 \pm 0.0013$	$0.0655$	$0.1430 \pm 0.0124$	$0.1408$
Gaussian perturbation	$0.1897 \pm 0.0055$	$0.1899$	$0.1555 \pm 0.0105$	$0.1544$
Laplace perturbation	$0.8633 \pm 0.0215$	$0.8674$	$0.2246 \pm 0.0279$	$0.2174$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yuan, M.; Yu, R. Statistical Privacy-Preserving Distributed Online Aggregative Games via Mirror Descent with Correlated Perturbations. Mathematics 2026, 14, 1731. https://doi.org/10.3390/math14101731

AMA Style

Yuan M, Yu R. Statistical Privacy-Preserving Distributed Online Aggregative Games via Mirror Descent with Correlated Perturbations. Mathematics. 2026; 14(10):1731. https://doi.org/10.3390/math14101731

Chicago/Turabian Style

Yuan, Meng, and Rui Yu. 2026. "Statistical Privacy-Preserving Distributed Online Aggregative Games via Mirror Descent with Correlated Perturbations" Mathematics 14, no. 10: 1731. https://doi.org/10.3390/math14101731

APA Style

Yuan, M., & Yu, R. (2026). Statistical Privacy-Preserving Distributed Online Aggregative Games via Mirror Descent with Correlated Perturbations. Mathematics, 14(10), 1731. https://doi.org/10.3390/math14101731

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Statistical Privacy-Preserving Distributed Online Aggregative Games via Mirror Descent with Correlated Perturbations

Abstract

1. Introduction

2. Notation, Privacy Model, and Game Formulation

2.1. Statistical Privacy

2.2. Game Formulation

2.3. Assumptions

3. Our Proposed Algorithm

4. Main Results

4.1. Regret Analysis

4.2. Statistical Privacy Guarantee

5. Numerical Case Study: Privacy-Preserving EV Charging Coordination

6. Discussion

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI