On the Communication–Key Rate Region of Hierarchical Vector Linear Secure Aggregation

Lv, Jiawen; Zhang, Xiang; Li, Zhou

doi:10.3390/e28030352

Open AccessArticle

On the Communication–Key Rate Region of Hierarchical Vector Linear Secure Aggregation

by

Jiawen Lv

¹,

Xiang Zhang

²

and

Zhou Li

^1,*

¹

Guangxi Key Laboratory of Multimedia Communications and Network Technology, Guangxi University, Nanning 530004, China

²

Department of Electrical Engineering and Computer Science, Technical University of Berlin, 10623 Berlin, Germany

^*

Author to whom correspondence should be addressed.

Entropy 2026, 28(3), 352; https://doi.org/10.3390/e28030352

Submission received: 9 February 2026 / Revised: 18 March 2026 / Accepted: 19 March 2026 / Published: 20 March 2026

(This article belongs to the Special Issue Secure Aggregation for Federated Learning and Distributed Computation)

Download

Browse Figure

Versions Notes

Abstract

Motivated by heterogeneous data distributions and task-dependent aggregation requirements in federated learning, we study information-theoretic secure aggregation of linear functions over a two-hop hierarchical network. The system comprises an aggregation server, an intermediate layer of U relays, and

U V

users, where each relay serves a disjoint cluster of V users. Each relay observes all uplink transmissions within its cluster and forwards a coded message to the server. The server is authorized to compute a prescribed linear function F of the users’ inputs with zero error, while being prevented from learning any additional information about an unauthorized linear function G. Moreover, each relay must obtain no information about any non-trivial linear function

B_{u}

of the inputs in its own cluster. We define the communication rates on both hops as the number of transmitted symbols per input symbol. By deriving matching information-theoretic converse and achievability bounds, we fully characterize the optimal communication rates and propose an explicit linear coding scheme that achieves the resulting optimal region. Our results demonstrate that hierarchical architectures can attain optimal communication rates while substantially reducing the server-side masking burden, thereby enabling scalable secure aggregation of authorized linear functions.

Keywords:

hierarchical secure aggregation; vector linear; information-theoretic security; federated learning

1. Introduction

With the rapid proliferation of machine learning and data analytics applications, massive amounts of data are continuously generated by geographically distributed users and devices. In many practical scenarios, such as healthcare analytics, intelligent transportation, and personalized recommendation systems, these data are highly sensitive. Directly collecting or centrally storing raw user data therefore poses significant privacy risks and regulatory challenges. Secure aggregation has emerged as a fundamental primitive to address this tension, enabling an aggregator to compute desired statistics over distributed data while preventing the disclosure of individual user information.

From an information-theoretic perspective, secure aggregation inherently incurs simultaneous costs in communication efficiency and randomness consumption. A classical starting point is secure aggregation, where each of K users holds a private input and transmits a masked message to a server. The server is required to recover the sum of all inputs with zero error while learning no additional information. Prior work [1] has shown that achieving perfect secure inevitably requires a nontrivial amount of randomness, and that reducing communication cost typically increases the required key rate. This communication and randomness relationship has been fully characterized for secure aggregation and several of its variants, establishing randomness as a fundamental resource rather than a mere implementation detail.

As distributed learning systems evolve, secure aggregation alone is insufficient to capture practical requirements. First, the desired computation is often more general than a scalar sum and can be modeled as an arbitrary linear transformation of the users’ data. Second, security requirements are frequently function-specific: while the server is authorized to learn a prescribed linear function F of the users’ data, it must be prevented from inferring other sensitive linear functions, denoted by G. This motivates the formulation of vector linear secure aggregation, in which the security cost is no longer determined solely by the number of users, but also by the algebraic relationship between the authorized function F and the protected functions G. In particular, the additional information contained in G beyond what is revealed by F is quantified by the conditional rank

rank (G ∣ F)

, which directly determines the minimum amount of randomness required for security.

Most existing information-theoretic results on vector linear secure aggregation focus on single-hop network architectures [2,3], where all users communicate directly with a central server. While such models are analytically convenient, they do not fully reflect the structure of large-scale practical systems. In real deployments, direct communication between a server and a massive number of users can lead to scalability and access limitations. As a result, hierarchical or edge-assisted architectures are widely adopted, in which users first communicate with nearby relays or gateways, and the relays subsequently forward aggregated messages to the server.

Introducing a hierarchical architecture fundamentally changes the secure aggregation problem [4]. Unlike the classical single-hop setting, where only the server’s inference needs to be controlled, a two-hop network creates an additional inference layer: each relay observes all transmissions from users in its cluster and may infer extra intra-cluster linear information unless properly constrained. Meanwhile, the server should recover only a prescribed global linear function of the cluster aggregates and remain ignorant of other unauthorized linear combinations.

Information-theoretic secure aggregation has been extended to a variety of settings, including user dropout [5,6], secure aggregation with user selection [7], designs resilient to user collusion [8,9,10], schemes employing groupwise keys [11,12], secure aggregation with oblivious servers [13], secure aggregation under unreliable communication [14], and hierarchical secure aggregation [15,16,17,18,19]. Other related works on secure aggregation from different perspectives can be found in [20,21,22,23].

However, existing works do not characterize the vector linear two-hop hierarchical setting within a unified information-theoretic framework, where relay-side protection against unauthorized intra-cluster linear inference and server-side recovery of only a prescribed global function must be enforced simultaneously. Our contribution is not only to unify hierarchical secure aggregation and vector linear secure aggregation within a single information-theoretic model, but also to show that the resulting two-hop formulation exhibits genuinely coupled relay-side and server-side security constraints, leading to a new optimal key-rate characterization and requiring a joint algebraic coding design.

To further clarify the distinction from prior single-hop vector linear secure aggregation schemes, Table 1 summarizes the main differences between those formulations and the proposed two-hop hierarchical setting.

In this work, we study information-theoretic vector linear secure aggregation over a two-hop hierarchical network consisting of U relays, each serving a disjoint cluster of V users. The server is required to recover, with zero error, a prescribed linear function F of the cluster aggregates while learning no additional information about an unauthorized linear function G. At the same time, each relay may assist local aggregation but must remain ignorant of the unauthorized intra-cluster linear functions characterized by

B_{u}

within its own cluster. Our goal is to completely characterize the fundamental communication and randomness limits of this problem.

We prove that, in the unified hierarchical vector linear secure aggregation model, the communication optimality remains unchanged compared with the single-hop setting: the first-hop rate still satisfies

R_{X} = 1

, and the second-hop rate can still achieve

R_{Y} = 1

even after introducing an additional relay layer. However, the minimum source key rate changes from depending only on

rank (G ∣ F)

in the single-hop model to being jointly determined by the relay-side intra-cluster protection requirement

K_{u}

and the server-side protection constraint

rank (G ∣ F)

. This shows that, although the hierarchical structure does not increase the communication cost, it introduces a coupling between relay-side security and server-side function security in the key design.

From a technical standpoint, establishing the fundamental limits is challenging because both the converse and the achievability must simultaneously account for relay-side intra-cluster secrecy and server-side function authorization. In particular, the converse requires a joint information-theoretic argument for the two levels of security, while the achievability calls for a unified linear coding design that preserves local privacy, enables authorized global recovery, and maintains optimal communication rates over both hops.

We further provide an explicit linear coding scheme that achieves these fundamental limits.

2. Problem Statement

Consider a three-layer hierarchical secure aggregation system consisting of an aggregation server, an intermediate layer of

U \geq 2

relays, and a bottom layer of

U V

users. The network operates over two hops, where the server communicates with all relays and each relay serves a disjoint cluster of exactly V users, as illustrated in Figure 1. All communication links are assumed to be error-free. We consider a static system model with fixed cluster size, where no user dropout occurs during the protocol. We further assume that no collusion takes place among users, relays, and the server, and that all entities follow the prescribed protocol without adversarial or Byzantine behavior. The v-th user associated with the u-th relay is indexed by

(u, v) \in [U] \times [V]

. Each user

(u, v)

holds a private input

W_{u, v}

over a finite field

F_{q}

with entropy

H (W_{u, v}) = L

measured in q-ary units, and the inputs are assumed to be independent and uniformly distributed across users. In addition, each user

(u, v)

is equipped with a local key variable

Z_{u, v}

, satisfying

H (Z_{u, v}) = L_{Z}

. The collection of individual keys

Z_{[U] \times [V]} ≜ {Z_{u, v}}_{u \in [U], v \in [V]}

is deterministically generated from a common source key variable

Z_{Σ}

, where

H (Z_{Σ}) = L_{Z_{Σ}}

. The source key

Z_{Σ}

is generated and securely distributed by a trusted third-party entity. The key variables

Z_{[U] \times [V]}

are statistically independent of the user inputs

W_{[U] \times [V]} ≜ {W_{u, v}}_{u \in [U], v \in [V]}

.

\begin{matrix} H (Z_{[U] \times [V]}, W_{[U] \times [V]}) = H (Z_{[U] \times [V]}) + \sum_{u \in [U], v \in [V]} H (W_{u, v}), \end{matrix}

(1)

\begin{matrix} H (Z_{[U] \times [V]} | Z_{Σ}) = 0 . \end{matrix}

(2)

The system adopts a two-hop communication protocol. In the first hop, User

(u, v)

transmits a message

X_{u, v}

to its associated relay. The message

X_{u, v}

is generated as a function of the local input

W_{u, v}

and the local key

Z_{u, v}

, and consists of

L_{X}

symbols. In the second hop, relay u transmits a message

Y_{u}

to the aggregation server. The message

Y_{u}

consists of

L_{Y}

symbols and is generated as a function of the received messages

{X_{u, v}}_{v \in [V]}

from Users in cluster u.

\begin{matrix} H (X_{u, v} ∣ W_{u, v}, Z_{u, v}) = 0, \forall (u, v) \in [U] \times [V], \end{matrix}

(3)

\begin{matrix} H (Y_{u} ∣ {X_{u, v}}_{v \in [V]}) = 0, \forall u \in [U] . \end{matrix}

(4)

We define the cluster aggregate at relay u as the sum of the users’ inputs within cluster u, i.e.,

\begin{matrix} S_{u} ≜ \sum_{v \in [V]} W_{u, v}, u \in [U] . \end{matrix}

(5)

In general, the relay message

Y_{u}

can be an arbitrary function of the received messages

{X_{u, v}}_{v \in [V]}

. Specifically, in this work, we restrict attention to schemes in which the relay message

Y_{u}

is a deterministic function of the cluster aggregate

S_{u}

and the local keys

{Z_{u, v}}_{v \in [V]}

, i.e.,

\begin{matrix} H (Y_{u} ∣ S_{u}, {Z_{u, v}}_{v \in [V]}) = 0, \forall u \in [U] . \end{matrix}

(6)

From the relay messages, the aggregation server aims to recover an authorized linear function F while revealing no information about an unauthorized linear function G in the information-theoretic sense. Define

S ≜ [S_{1}; \dots; S_{U}] \in F_{q}^{U \times L} .

The functions F and G are given by

\begin{matrix} F = F S \in F_{q}^{M \times L}, G = G S \in F_{q}^{N \times L}, \end{matrix}

(7)

where

F \in F_{q}^{M \times U}

and

G \in F_{q}^{N \times U}

are assumed to have full row rank, i.e.,

M = rank (F)

and

N = rank (G)

, without loss of generality.

To prevent trivial cases, we assume that

F

contains no zero columns. A zero column associated with

S_{u}

would indicate that

S_{u}

has no effect on the computation of F and could thus be excluded without affecting the problem.

From the relay’s messages, the server should be able to recover the desired linear function F, i.e.,

\begin{matrix} [Correctness] H (F | {Y_{u}}_{u \in [U]}) = 0 . \end{matrix}

(8)

The security constraints require that each relay should not gain any information about any unauthorized linear function

B_{u}

from the messages transmitted by its associated users. Specifically, let

W_{u} ≜ [W_{u, 1}; \dots; W_{u, V}] \in F_{q}^{V \times L},

and define the unauthorized function

B_{u} = B_{u} W_{u} \in F_{q}^{K_{u} \times L},

(9)

where

B_{u} \in F_{q}^{K_{u} \times V}

is assumed to have full row rank without loss of generality, i.e.,

K_{u} = rank (B_{u}), u \in [U]

. The relay security constraint can then be expressed as

\begin{matrix} I (B_{u}; {X_{u, v}}_{v \in [V]}) = 0, \forall u \in [U] . \end{matrix}

(10)

In addition, the server must not learn any information about the unauthorized function G beyond what is already contained in the authorized function F. This server security constraint is written as

\begin{matrix} I (G; {Y_{u}}_{u \in [U]} ∣ F) = 0 . \end{matrix}

(11)

The communication rates

R_{X}

and

R_{Y}

are defined as the numbers of symbols in each transmitted message

X_{u, v}

and

Y_{u}

, respectively, normalized by the input length L. Similarly, the source key rate

R_{Z_{Σ}}

represents the number of symbols in the key variable

Z_{Σ}

per input symbol. Formally,

\begin{matrix} R_{X} ≜ \frac{L_{X}}{L}, R_{Y} ≜ \frac{L_{Y}}{L}, R_{Z_{Σ}} ≜ \frac{L_{Z_{Σ}}}{L} . \end{matrix}

(12)

A rate tuple

(R_{X}, R_{Y}, R_{Z_{Σ}})

is said to be achievable if there exists a secure aggregation scheme, specified by the key variable

Z_{Σ}

, and the transmitted messages

{X_{u, v}}_{(u, v) \in [U] \times [V]}

and

{Y_{u}}_{u \in [U]}

, satisfying (3) and (4), such that the communication and key rates are

(R_{X}, R_{Y}, R_{Z_{Σ}})

and the correctness constraint (8) together with the security constraints (10) and (11) are all met. The optimal rate region

R^{*}

is defined as the closure of the set of all achievable rate tuples.

3. Main Results

In this section, we present the main results of this work. The optimal vector linear communication and key rate region for the hierarchical vector linear secure aggregation problem is characterized in Theorem 1.

Theorem 1.

For the hierarchical vector linear secure aggregation problem described above, the optimal vector linear communication and key rate region is

\begin{matrix} R^{*} = & \{(R_{X}, R_{Y}, R_{Z_{Σ}}) | R_{X} \geq 1, R_{Y} \geq 1, R_{Z_{Σ}} \geq max \{max_{u \in [U]} K_{u}, rank (G ∣ F)\}\}, \end{matrix}

(13)

where

rank (G ∣ F) = rank ([F; G]) - rank (F) .

(14)

Moreover, the converse holds under the stated model, and the above region is achievable by a vector linear coding scheme over sufficiently large finite fields.

4. Motivating Example (U = 4, V = 3, M = 2, N = 1)

Prior to describing the general achievability scheme in Theorem 1, we introduce a representative example to convey the key principles behind the proposed hierarchical vector linear secure aggregation problem. These examples serve to build intuition for the design, after which the complete construction is presented.

Consider a two-hop hierarchical system with

U = 4

relays and

V = 3

users per cluster. In the first hop, each relay aggregates the messages from users in its corresponding cluster while being prevented from learning any information about the linear function

B_{u} W

, where

W ≜ {[W_{u, 1}, W_{u, 2}, W_{u, 3}]}^{T} \in F_{7}^{3 \times 1},

and

B_{u}

is specified as follows.

\begin{matrix} B_{1} & = [\begin{matrix} 2 & 4 & 6 \end{matrix}], B_{2} = [\begin{matrix} 3 & 5 & 1 \end{matrix}], \\ B_{3} & = [\begin{matrix} 1 & 3 & 2 \\ 3 & 6 & 1 \end{matrix}] \overset{row / column operations}{\to} [\begin{matrix} 1 & 0 & 4 \\ 0 & 1 & 4 \end{matrix}], \\ B_{4} & = [\begin{matrix} 1 & 2 & 3 \\ 4 & 5 & 6 \\ 5 & 0 & 2 \end{matrix}] \overset{row / column operations}{\to} [\begin{matrix} 1 & 0 & 6 \\ 0 & 1 & 2 \\ 0 & 0 & 0 \end{matrix}] . \end{matrix}

(15)

In the second hop, the server aims to recover

F S

from the messages uploaded by all relays with zero error, where

S ≜ {[S_{1}, S_{2}, S_{3}, S_{4}]}^{T} \in F_{7}^{4 \times 1}, S_{u} = \sum_{v \in [3]} W_{u, v}, u \in [4] .

Moreover, the server must not obtain any additional information about

G S

beyond what is implied by

F S

.

F = [\begin{matrix} 1 & 2 & 3 & 4 \\ 0 & 1 & 2 & 3 \end{matrix}], G = [\begin{matrix} 3 & 2 & 0 & 1 \end{matrix}] .

(16)

F = [\begin{matrix} 1 & 2 & 3 & 4 \\ 0 & 1 & 2 & 3 \end{matrix}] \overset{row / column operations}{\to} [\begin{matrix} 1 & 0 & 6 & 5 \\ 0 & 1 & 2 & 3 \end{matrix}] .

(17)

Consequently, we have

\begin{matrix} F S & = [\begin{matrix} S_{1} + 2 S_{2} + 3 S_{3} + 4 S_{4} \\ S_{2} + 2 S_{3} + 3 S_{4} \end{matrix}] \\ = [\begin{matrix} \sum_{v \in [3]} W_{1, v} + 2 \sum_{v \in [3]} W_{2, v} + 3 \sum_{v \in [3]} W_{3, v} + 4 \sum_{v \in [3]} W_{4, v} \\ \sum_{v \in [3]} W_{2, v} + 2 \sum_{v \in [3]} W_{3, v} + 3 \sum_{v \in [3]} W_{4, v} \end{matrix}], \end{matrix}

(18)

\begin{matrix} G S & = 3 S_{1} + 2 S_{2} + S_{4} \\ = 3 \sum_{v \in [3]} W_{1, v} + 2 \sum_{v \in [3]} W_{2, v} + \sum_{v \in [3]} W_{4, v} . \end{matrix}

(19)

where

G S

is a scalar linear combination of the components of

S

.

Consider the second hop and set

L = 1

. Based on (17), suppose we have two independent and uniformly distributed noise variables

T_{1}, T_{2}

over

F_{7}

. Then we have

\begin{matrix} Y_{1} & = S_{1} - 6 T_{1} - 5 T_{2} = W_{1, 1} + W_{1, 2} + W_{1, 3} - 6 T_{1} - 5 T_{2}, \\ Y_{2} & = S_{2} - 2 T_{1} - 3 T_{2} = W_{2, 1} + W_{2, 2} + W_{2, 3} - 2 T_{1} - 3 T_{2}, \\ Y_{3} & = S_{3} + T_{1} = W_{3, 1} + W_{3, 2} + W_{3, 3} + T_{1}, \\ Y_{4} & = S_{4} + T_{2} = W_{4, 1} + W_{4, 2} + W_{4, 3} + T_{2} . \end{matrix}

(20)

For the server security constraint, only 1 key symbol is required. It turns out that

T_{1}

and

T_{2}

need not be independent; introducing correlation between them in the next step is the most technical part of the proof.

We then seek a

1 \times 2

matrix

Q \in F_{q}^{1 \times 2}

that characterizes the correlation between

(T_{1}, T_{2})

, such that

[\begin{matrix} F \\ G \\ 0 & 0 & Q \end{matrix}] has full rank 4 .

(21)

Note that such a matrix

Q

exists since

rank ([F; G]) = 3

. Consequently, there always exists a nonzero

1 \times 2

vector

Q

that completes (21) to full rank. Any valid choice of

Q

suffices for our purpose.

We then compute the right null space of

Q

, denoted by

Q^{⊥} \in F_{q}^{2 \times 1}

, which satisfies

Q = [\begin{matrix} 1 & 3 \end{matrix}], Q^{⊥} = [\begin{matrix} 4 \\ 1 \end{matrix}] .

(22)

Then the key symbols

T_{1}

and

T_{2}

can be generated from a single uniformly distributed key symbol

P_{1}

by precoding with

Q^{⊥}

,

[\begin{matrix} T_{1} \\ T_{2} \end{matrix}] = Q^{⊥} P_{1} = [\begin{matrix} 4 P_{1} \\ P_{1} \end{matrix}] .

(23)

We may write out the final message assignment using the single key symbol

P_{1}

:

\begin{matrix} Y_{1} & = W_{1, 1} + W_{1, 2} + W_{1, 3} - P_{1}, Y_{2} = W_{2, 1} + W_{2, 2} + W_{2, 3} - 4 P_{1}, \\ Y_{3} & = W_{3, 1} + W_{3, 2} + W_{3, 3} + 4 P_{1}, Y_{4} = W_{4, 1} + W_{4, 2} + W_{4, 3} + P_{1} . \end{matrix}

(24)

The signal observed at relay u can be expressed as

Y_{u} ≜ S_{u} + Z_{u}^{Y},

(25)

where

Z_{u}^{Y}

denotes the key component embedded in

Y_{u}

.

In Example 1, this decomposition admits an explicit representation:

[\begin{matrix} Z_{1}^{Y} \\ Z_{2}^{Y} \\ Z_{3}^{Y} \\ Z_{4}^{Y} \end{matrix}] = [\begin{matrix} - 1 \\ - 4 \\ 4 \\ 1 \end{matrix}] P_{1} .

(26)

Next, we investigate the security of relay 4 under the proposed key assignment. Since

rank (B_{4}) = 2

, relay 4 requires at least

K_{4} = 2

independent keys, denoted by

N_{1}

and

N_{2}

. There exists a matrix

A \in F_{q}^{1 \times 2}

such that

P_{1} = A N = a_{11} N_{1} + a_{12} N_{2} .

Since the coefficients of

A

can be any nonzero values in

F_{q}

, for simplicity we set

a_{11} = a_{12} = 1

, yielding

P_{1} = N_{1} + N_{2} .

Therefore, the relay messages can be written as

\begin{matrix} Y_{1} & = S_{1} - (N_{1} + N_{2}), & Y_{2} & = S_{2} - 4 (N_{1} + N_{2}), \\ Y_{3} & = S_{3} + 4 (N_{1} + N_{2}), & Y_{4} & = S_{4} + (N_{1} + N_{2}) . \end{matrix}

(27)

To prevent relay 4 from obtaining any information regarding the linear function

B_{4} X

, we require

B_{4} [\begin{matrix} X_{1} \\ X_{2} \\ X_{3} \end{matrix}] = B_{4} [\begin{matrix} W_{4, 1} \\ W_{4, 2} \\ W_{4, 3} \end{matrix}] + \underset{\neq 0}{\underset{︸}{B_{4} [\begin{matrix} Z_{4, 1} \\ Z_{4, 2} \\ Z_{4, 3} \end{matrix}]}} .

(28)

Specifically, the interference term is constructed using the keys as

B_{4} [\begin{matrix} Z_{4, 1} \\ Z_{4, 2} \\ Z_{4, 3} \end{matrix}] = \underset{\neq 0}{\underset{︸}{B_{4} R_{4}}} [\begin{matrix} N_{1} \\ N_{2} \end{matrix}], R_{4} \in F_{q}^{3 \times 2} .

(29)

Equivalently, we can write

[\begin{matrix} Z_{4, 1} \\ Z_{4, 2} \\ Z_{4, 3} \end{matrix}] = R_{4} [\begin{matrix} N_{1} \\ N_{2} \end{matrix}], R_{4} \in F_{q}^{3 \times 2} .

(30)

To ensure that the noise term fully masks the signal space and cannot be nullified via linear projection, the product

B_{4} R_{4}

must have full rank, i.e.,

rank (B_{4} R_{4}) = 2 .

Specifically, we construct the first two rows of

R_{4}

as a full-rank block to ensure linear independence, and utilize the last row to satisfy the aggregation coefficient constraints. Consequently, as shown in (26), the aggregated interference term

Z_{4}^{Y}

yields the summation of the keys:

Z_{4}^{Y} = 1 \cdot P_{1} = [\begin{matrix} 1 & 1 \end{matrix}] [\begin{matrix} N_{1} \\ N_{2} \end{matrix}] = N_{1} + N_{2} .

(31)

The corresponding precoding matrix is

R_{4} = [\begin{matrix} - 1 & 0 \\ 0 & - 1 \\ 2 & 2 \end{matrix}] \in F_{q}^{3 \times 2} .

(32)

The left null space of

R_{4}

is

R_{4}^{⊥} = [\begin{matrix} 2 & 2 & 1 \end{matrix}] \in F_{q}^{1 \times 3}, with R_{4}^{⊥} R_{4} = 0 .

(33)

This construction ensures that

R_{4}

has full column rank,

rank (R_{4}) = 2

, satisfying the required rank condition.

Similarly, for relay 1, since

rank (B_{1}) = 1

, it requires only one independent key, namely

(N_{1} + N_{2})

. To satisfy the condition

rank (B_{1} R_{1}) = 1

, we construct

R_{1}

as follows:

Z_{1}^{Y} = - 1 \cdot P_{1} = - 1 [N_{1} + N_{2}] .

R_{1} = [\begin{matrix} 1 \\ 1 \\ - 3 \end{matrix}] .

(34)

Consequently, the left null space of

R_{1}

is given by

R_{1}^{⊥} = [\begin{matrix} 6 & 1 & 0 \\ 3 & 0 & 1 \end{matrix}], satisfying R_{1}^{⊥} R_{1} = 0 .

(35)

Similarly, for the other relays 2 and 3, we construct

R_{2}

and

R_{3}

, from which the individual user keys are obtained as follows:

\begin{matrix} Z_{1, 1} & = N_{1} + N_{2}, & Z_{1, 2} & = N_{1} + N_{2}, & Z_{1, 3} & = - 3 N_{1} - 3 N_{2}, \\ Z_{2, 1} & = N_{1} + N_{2}, & Z_{2, 2} & = N_{1} + N_{2}, & Z_{2, 3} & = - 6 N_{1} - 6 N_{2}, \\ Z_{3, 1} & = N_{1}, & Z_{3, 2} & = N_{2}, & Z_{3, 3} & = 3 N_{1} + 3 N_{2}, \\ Z_{4, 1} & = - N_{1}, & Z_{4, 2} & = - N_{2}, & Z_{4, 3} & = 2 N_{1} + 2 N_{2} . \end{matrix}

(36)

Since

L_{X} = L_{Y} = 1

and

L_{Z_{Σ}} = 2

, the resulting rates are

R_{X} = R_{Y} = 1, R_{Z_{Σ}} = 2,

which match the converse bound established in Theorem 1.

Correctness: From the received signals $Y_{1}, Y_{2}, Y_{3}, Y_{4}$ , the server applies the linear transform $F$ and successfully recovers

$F = F S$

with zero error.
Relay security: From the transformation in (15), it follows that

$K_{u} = 1 for u \in {1, 2}, K_{u} = 2 for u \in {3, 4} .$

Since relays whose

B_{u}

have the same rank require the same total number of independent masking key symbols, it suffices to establish the security proof for relays 1 and 4; the cases of relays 2 and 3 follow by analogous arguments.

Consider relay 4, for example:

\begin{matrix} I ({B_{4}}; {X_{4, v}}_{v \in [3]}) \end{matrix}

(37)

\begin{matrix} = H (X_{4, 1}, X_{4, 2}, X_{4, 3}) - H (X_{4, 1}, X_{4, 2}, X_{4, 3} ∣ B_{4}) \end{matrix}

(38)

\begin{matrix} \leq 3 - H (X_{4, 1}, X_{4, 2}, X_{4, 3}, R_{4}^{⊥} [X_{4, 1}, X_{4, 2}, X_{4, 3}] ∣ B_{4}) \end{matrix}

(39)

\begin{matrix} = 3 - H (X_{4, 1}, X_{4, 2}, X_{4, 3}, R_{4}^{⊥} [W_{4, 1}, W_{4, 2}, W_{4, 3}] ∣ B_{4}) \end{matrix}

(40)

\begin{matrix} = 3 - H (R_{4}^{⊥} [W_{4, 1}, W_{4, 2}, W_{4, 3}] ∣ B_{4}) - H (X_{4, 1}, X_{4, 2}, X_{4, 3} ∣ R_{4}^{⊥} [W_{4, 1}, W_{4, 2}, W_{4, 3}], B_{4}) \end{matrix}

(41)

\begin{matrix} = 3 - rank (R_{4}^{⊥}) - H (N_{1}, N_{2}) \end{matrix}

(42)

\begin{matrix} = 3 - 1 - 2 = 0 . \end{matrix}

(43)

In (40), we adopt a zero-forcing strategy by constructing the precoding matrix

R_{4}^{⊥}

so that the key components are perfectly eliminated in its left null space, i.e.,

R_{4}^{⊥} R_{4} = 0 .

In (42), the second term holds because

R_{4}^{⊥} R_{4} = 0,

and

R_{4}^{⊥} [W_{4, 1}, W_{4, 2}, W_{4, 3}]

is independent of

B_{4}

. Moreover, the matrix formed by

R_{4}^{⊥} [W_{4, 1}, W_{4, 2}, W_{4, 3}]

and

B_{4}

has full rank, and hence is invertible with respect to

W_{4, 1}, W_{4, 2}, W_{4, 3}

.

Consider relay 1, for example:

\begin{matrix} I ({B_{1}}; {X_{1, v}}_{v \in [3]}) \\ = H (X_{1, 1}, X_{1, 2}, X_{1, 3}) - H (X_{1, 1}, X_{1, 2}, X_{1, 3} ∣ B_{1}) \end{matrix}

(44)

\begin{matrix} \leq 3 - H (X_{1, 1}, X_{1, 2}, X_{1, 3}, R_{1}^{⊥} [X_{1, 1}, X_{1, 2}, X_{1, 3}] ∣ B_{1}) \end{matrix}

(45)

\begin{matrix} = 3 - H (X_{1, 1}, X_{1, 2}, X_{1, 3}, R_{1}^{⊥} [W_{1, 1}, W_{1, 2}, W_{1, 3}] ∣ B_{1}) \end{matrix}

(46)

\begin{matrix} \overset{(35)}{=} 3 - H (R_{1}^{⊥} [W_{1, 1}, W_{1, 2}, W_{1, 3}] ∣ B_{1}) - H (X_{1, 1}, X_{1, 2}, X_{1, 3} ∣ R_{1}^{⊥} [W_{1, 1}, W_{1, 2}, W_{1, 3}], B_{1}) \end{matrix}

(47)

\begin{matrix} = 3 - rank (R_{1}^{⊥}) - H (N_{1} + N_{2}) \end{matrix}

(48)

\begin{matrix} = 3 - 2 - 1 = 0 . \end{matrix}

(49)

In (46), we adopt a zero-forcing strategy by constructing the precoding matrix

R_{1}^{⊥}

so that the key components are perfectly eliminated in its left null space, i.e.,

R_{1}^{⊥} R_{1} = 0 .

In (48), the second term holds because

R_{1}^{⊥} R_{1} = 0,

and

R_{1}^{⊥} [W_{1, 1}, W_{1, 2}, W_{1, 3}]

is independent of

B_{1}

. Moreover, the matrix formed by

R_{1}^{⊥} [W_{1, 1}, W_{1, 2}, W_{1, 3}]

and

B_{1}

has full rank, and hence is invertible with respect to

W_{1, 1}, W_{1, 2}, W_{1, 3}

.

We now proceed to present the security proof for the server.

\begin{matrix} I (G; Y_{1}, Y_{2}, Y_{3}, Y_{4} ∣ F) \end{matrix}

\begin{matrix} = H (Y_{1}, Y_{2}, Y_{3}, Y_{4} ∣ F) - H (Y_{1}, Y_{2}, Y_{3}, Y_{4} ∣ G, F) \end{matrix}

(50)

\begin{matrix} = [H (Y_{1}, Y_{2}, Y_{3}, Y_{4}, F) - H (F)] - H (Y_{1}, Y_{2}, Y_{3}, Y_{4}, Q [Y_{3}, Y_{4}] ∣ G, F) \end{matrix}

(51)

\begin{matrix} = [H (Y_{1}, Y_{2}, Y_{3}, Y_{4}, F) - H (F)] - H (Y_{1}, Y_{2}, Y_{3}, Y_{4}, Q [S_{3}, S_{4}] ∣ G, F) \end{matrix}

(52)

\begin{matrix} \leq (4 - 2) - H (Q [S_{3}, S_{4}] ∣ G, F) - H (Y_{1}, Y_{2}, Y_{3}, Y_{4} ∣ Q [S_{3}, S_{4}], G, F) \end{matrix}

(53)

\begin{matrix} = 2 - 1 - H (P_{1}) \end{matrix}

(54)

\begin{matrix} = 2 - 1 - H (N_{1} + N_{2}) = 2 - 1 - 1 = 0, \end{matrix}

(55)

where (52) follows from the orthogonality

Q Q^{⊥} = 0

, which implies that the noise components precoded by

Q^{⊥}

are completely eliminated (zero-forced) when left-multiplied by

Q

, cf. (22) and (23). Concerning (54), we leverage the full-rank properties of

Q [S_{3}; S_{4}]

,

G S

, and

F S

, which ensure the unique solvability of

S_{1}, \dots, S_{4}

(see (21)).

5. General Achievability Proof of Theorem 1

5.1. Conceptual Overview of the Construction

Before introducing the detailed algebraic construction, we briefly explain the guiding idea of the scheme. Transforming

F

into systematic form makes its right null space explicit. For

F = [\begin{matrix} I_{M} & {\tilde{F}}_{M \times (U - M)} \end{matrix}], F [\begin{matrix} - {\tilde{F}}_{M \times (U - M)} \\ I_{U - M} \end{matrix}] = 0 .

Therefore, the systematic form of

F

explicitly characterizes all key-injection directions that preserve the authorized function

F S

, thereby ensuring correctness. The role of

Q

is then to further restrict key injection to a smaller effective subspace within

Null (F)

, containing only the minimum number of directions needed to perfectly hide the unauthorized function

G S

conditioned on

F S

.

At this stage, the aggregate noise

Z_{u}^{Y}

at relay u has already been specified by the server-side design. It remains to assign user-level keys such that their aggregate equals

Z_{u}^{Y}

, while satisfying the relay-side privacy and aggregation constraints.

For the relay-side privacy requirement, writing

B_{u}

in systematic form enables a compatible canonical parametrization of the key-assignment matrix

R_{u}

, under which the full-rank condition on

B_{u} R_{u}

is reduced to an invertibility constraint on

L_{u}

, as shown in (68)–(70); such a constraint is always feasible over a sufficiently large finite field. Meanwhile, enforcing

1_{V}^{⊤} R_{u} = e_{1}^{⊤}

in (73), equivalently (74), ensures that the user-level noise aggregates precisely into the prescribed cluster-level noise

Z_{u}^{Y}

. Meanwhile,

R_{u}^{⊥}

is not part of the construction of

R_{u}

itself, but is introduced for the relay privacy proof, where its left-null-space property is used to zero-force the injected keys.

5.2. General Construction

We now present the general achievability scheme for the two-hop hierarchical vector linear secure aggregation problem. Building on the intuition provided by the motivating example, we construct a unified linear coding scheme and show that it simultaneously guarantees correctness, relay-side security, and server-side function authorization while achieving the claimed communication and key rates.

Given that

F

has full row rank, we may, without loss of generality, transform it into the following systematic form via column permutations and invertible row operations:

F = [\begin{matrix} I_{M} & {\tilde{F}}_{M \times (U - M)} \end{matrix}],

(56)

where

I_{M}

denotes the

M \times M

identity matrix, and

\tilde{F} \in F_{q}^{M \times (U - M)}

represents the remaining submatrix.

The rows of

Q

are constructed to be linearly independent of the row space of

[F; G]

, thereby completing a basis of

F_{q}^{U}

. Specifically, we select any

U - rank ([F; G])

row vectors that are linearly independent of

[F; G]

, and then use the identity submatrix in the first M columns of

F

to linearly eliminate their first M components, yielding

Q

.

The resulting

(U - rank ([F; G])) \times (U - M)

matrix

Q

satisfies

[\begin{matrix} F \\ G \\ 0_{(U - rank ([F; G])) \times M} & Q \end{matrix}] has full rank U,

(57)

which guarantees that the row spaces of

F

,

G

, and

Q

together span the entire ambient space

F_{q}^{U}

.

Intuitively,

Q

selects and compresses the residual degrees of freedom that are linearly independent of the row space of

F

into lower-dimensional injection directions. This enables key injection without affecting the

F

-related structure and avoids using degrees of freedom observable through

G

. By reordering the columns if necessary,

Q

can be written in the following block form:

\begin{matrix} Q & = [\begin{matrix} I_{U - rank ([F; G])} & \tilde{Q} \end{matrix}], \\ Q^{⊥} & = {[\begin{matrix} - \tilde{Q} \\ I_{rank ([F; G]) - M} \end{matrix}]}_{(U - M) \times (rank ([F; G]) - M)}, \\ Q Q^{⊥} & = 0 . \end{matrix}

(58)

We are now ready to describe the secure aggregation protocol. Set

L = 1

and define

L_{Z_{Σ}} ≜ max \{{max}_{u \in [U]} K_{u}, rank (G ∣ F)\} .

Let

N ≜ [N_{1}; \dots; N_{L_{Z_{Σ}}}]

consist of mutually independent and uniformly distributed key symbols.

We generate the key vector

P = A N, P \in F_{q}^{rank (G ∣ F) \times 1},

where

A \in F_{q}^{rank (G ∣ F) \times L_{Z_{Σ}}}

is chosen to be full row rank over

F_{q}

. The injected key symbols are then defined as

T ≜ [T_{1}; \dots; T_{U - M}] = Q^{⊥} P = Q^{⊥} A N .

(59)

The transmitted symbols are constructed as

\begin{matrix} [Y_{1}; \dots; Y_{M}] & = [S_{1}; \dots; S_{M}] - \tilde{F} [T_{1}; \dots; T_{U - M}], \\ Y_{M + 1}; \dots; Y_{U}] & = [S_{M + 1}; \dots; S_{U}] + [T_{1}; \dots; T_{U - M}] . \end{matrix}

(60)

Based on (60), the key design for each relay

Z_{u}^{Y}

is constructed as follows:

\begin{matrix} T ≜ [T_{1}; \dots; T_{U - M}] & = Q^{⊥} P = Q^{⊥} A N \\ [Z_{1}^{Y}; \dots; Z_{M}^{Y}] & = - \tilde{F} Q^{⊥} P = - \tilde{F} Q^{⊥} A N \\ [Z_{M + 1}^{Y}; \dots; Z_{U}^{Y}] & = [T_{1}; \dots; T_{U - M}] = Q^{⊥} P = Q^{⊥} A N . \end{matrix}

(61)

[\begin{matrix} Z_{1}^{Y} \\ ⋮ \\ Z_{U}^{Y} \end{matrix}] = \underset{≜ Φ \in F_{q}^{U \times L_{Z_{Σ}}}}{\underset{︸}{[\begin{matrix} - \tilde{F} \\ I_{U - M} \end{matrix}] Q^{⊥} A}} N .

(62)

Let

β_{u}^{⊤} \in F_{q}^{1 \times L_{Z_{Σ}}}

denote the u-th row of

Φ

.

Thus, for each

u \in [U]

, we have

Z_{u}^{Y} = β_{u}^{⊤} N .

(63)

Next, we extend the achievability to the general relay case. Without loss of generality, let

B_{u}

be represented in its systematic form:

B_{u} = [\begin{matrix} I_{K_{u}} & {\tilde{B}}_{K_{u} \times (V - K_{u})} \end{matrix}],

(64)

as any

B_{u}

can be transformed into this form via column permutations and invertible row operations.

Let

N^{(u)} ≜ D_{u} N

, where

D_{u} \in F_{q}^{K_{u} \times L_{Z_{Σ}}}

is a full-row-rank matrix that maps the global key vector to a relay-specific key vector (

rank (D_{u}) = K_{u}

).

At this stage, the noise

Z_{u}^{Y}

has already been fixed by the server-side design. Moreover,

D_{u}

is chosen such that

e_{1}^{⊤} D_{u} = β_{u}^{⊤},

(65)

where

e_{1} = {[1, 0, \dots, 0]}^{⊤} \in F_{q}^{K_{u}}

selects the first row of

D_{u}

. Hence,

e_{1}^{⊤} D_{u} N = β_{u}^{⊤} N,

(66)

so that the first component of

N^{(u)}

coincides with the prescribed cluster-level aggregate noise, namely,

Z_{u}^{Y} = β_{u}^{⊤} N = e_{1}^{⊤} D_{u} N .

(67)

With

B_{u} = [I_{K_{u}} {\tilde{B}}_{u}] \in F_{q}^{K_{u} \times V}

, where

{\tilde{B}}_{u} \in F_{q}^{K_{u} \times (V - K_{u})}

. Choose

R_{u} = [\begin{matrix} I_{K_{u}} \\ L_{u} \end{matrix}], L_{u} \in F_{q}^{(V - K_{u}) \times K_{u}} .

(68)

Then

B_{u} R_{u} = [I_{K_{u}} {\tilde{B}}_{u}] [\begin{matrix} I_{K_{u}} \\ L_{u} \end{matrix}] = I_{K_{u}} + {\tilde{B}}_{u} L_{u} .

(69)

Since the right-hand side is a

K_{u} \times K_{u}

square matrix, we have

rank (B_{u} R_{u}) = K_{u} \Leftrightarrow det (I_{K_{u}} + {\tilde{B}}_{u} L_{u}) \neq 0 .

(70)

Among all solutions of the linear constraint, we choose

L_{u}

such that

I_{K_{u}} + {\tilde{B}}_{u} L_{u}

is nonsingular; such a choice exists over a sufficiently large finite field

F_{q}

.

Using a common user-level encoding matrix

R_{u}

for the V users in cluster u, define

[\begin{matrix} Z_{u, 1} \\ ⋮ \\ Z_{u, V} \end{matrix}] ≜ R_{u} N^{(u)} = R_{u} D_{u} N .

(71)

To ensure correctness after relay aggregation, we impose

1_{V}^{⊤} [\begin{matrix} Z_{u, 1} \\ ⋮ \\ Z_{u, V} \end{matrix}] = Z_{u}^{Y} = 1_{V}^{⊤} R_{u} N^{(u)} .

(72)

From (67) and (72), it follows that:

1_{V}^{⊤} R_{u} = e_{1}^{⊤} .

(73)

Substituting

R_{u} = [\begin{matrix} I_{K_{u}} \\ L_{u} \end{matrix}]

yields the equivalent condition

1_{V - K_{u}}^{⊤} L_{u} = e_{1}^{⊤} - 1_{K_{u}}^{⊤} .

(74)

Since rank

(R_{u}) = K_{u}

, the left null space of

R_{u}

has dimension

V - K_{u}

. Thus there exists a full-row-rank

R_{u}^{⊥} \in F_{q}^{(V - K_{u}) \times V}

with

R_{u}^{⊥} R_{u} = 0 .

Let us prove the above scheme is correct and secure. For correctness (refer to (8)), we have

\begin{matrix} F = F S & \overset{(56)}{=} [S_{1}; \dots; S_{M}] + \tilde{F} [S_{M + 1}; \dots; S_{U}] \\ \overset{(60)}{=} [Y_{1}; \dots; Y_{M}] + \tilde{F} [Y_{M + 1}; \dots; Y_{U}] . \end{matrix}

(75)

so that

F

can be decoded correctly from

{(Y_{u})}_{u \in [U]}

.

For relay security (refer to (10)), we have

\begin{matrix} I ({B_{u}}; {X_{u, v}}_{v \in [V]}) \end{matrix}

(76)

\begin{matrix} = H ({X_{u, v}}_{v \in [V]}) - H ({X_{u, v}}_{v \in [V]} ∣ B_{u}) \end{matrix}

(77)

\begin{matrix} \leq V - H ({X_{u, v}}_{v \in [V]}, R_{u}^{⊥} [X_{u, 1}; \dots; X_{u, V}] ∣ B_{u}) \end{matrix}

(78)

\begin{matrix} = V - H (R_{u}^{⊥} [W_{u, 1}; \dots; W_{u, V}] ∣ B_{u}) - H ({X_{u, v}}_{v \in [V]} ∣ R_{u}^{⊥} [W_{u, 1}; \dots; W_{u, V}], B_{u}) \end{matrix}

(79)

\begin{matrix} = V - (V - K_{u}) - H (N^{(u)}) \end{matrix}

(80)

\begin{matrix} = V - V + K_{u} - K_{u} = 0 . \end{matrix}

(81)

In (79),

R_{u}^{⊥} R_{u} = 0

guarantees that the injected keys are zero-forced in

R_{u}^{⊥} [X_{u, 1}; \dots; X_{u, V}]

, and the last equality follows from

rank (R_{u}^{⊥}) = V - K_{u}

and

rank ([B_{u}; R_{u}^{⊥}]) = V

.

For server security (refer to (11)), we have

\begin{matrix} I (G; {(Y_{u})}_{u \in [U]} ∣ F) & = H ({(Y_{u})}_{u \in [U]} ∣ F) - H ({(Y_{u})}_{u \in [U]} ∣ G, F) \end{matrix}

(82)

\begin{matrix} = H ({(Y_{u})}_{u \in [U]}, F) - H (F) - H ({(Y_{u})}_{u \in [U]}, Q [Y_{M + 1}; \dots; Y_{U}] ∣ G, F) \end{matrix}

(83)

\begin{matrix} \overset{(8) (60)}{=} H ({(Y_{u})}_{u \in [U]}) - H (F) - H ({(Y_{u})}_{u \in [U]}, Q [S_{M + 1}; \dots; S_{U}] ∣ G, F) \\ \leq (U - M) - H (Q [S_{M + 1}; \dots; S_{U}] ∣ G, F) \end{matrix}

(84)

\begin{matrix} - H ({(Y_{u})}_{u \in [U]} ∣ Q [S_{M + 1}; \dots; S_{U}], G, F) \end{matrix}

(85)

\begin{matrix} \overset{(57)}{=} (U - M) - [U - rank ([F; G])] - H ({(Y_{u})}_{u \in [U]} ∣ {(S_{u})}_{u \in [U]}) \end{matrix}

(86)

\begin{matrix} \overset{(60)}{=} rank ([F; G]) - M - H (T ∣ {(S_{u})}_{u \in [U]}) \end{matrix}

(87)

\begin{matrix} \overset{(60)}{=} rank ([F; G]) - M - H (Q^{⊥} P ∣ {(S_{u})}_{u \in [U]}) \end{matrix}

(88)

\begin{matrix} = rank ([F; G]) - M - H (P ∣ {(S_{u})}_{u \in [U]}) \end{matrix}

(89)

\begin{matrix} = rank ([F; G]) - M - H (A N ∣ {(S_{u})}_{u \in [U]}) \end{matrix}

(90)

\begin{matrix} = rank ([F; G]) - M - rank (G ∣ F) \end{matrix}

(91)

\begin{matrix} = rank ([F; G]) - M - [rank ([F; G]) - M] = 0 . \end{matrix}

(92)

Fundamentally, our design methodology reconstructs the solution by working backward from the security requirement in (11). The condition of vanishing mutual information implies that the conditional entropy

H ({(Y_{u})}_{u \in [U]} ∣ G, F)

must saturate the value

U - M

. We accomplish this by utilizing

Q

to isolate a signal-bearing subspace independent of channel realizations and maximizing its rank. Consequently, the keys injection is projected exclusively onto

Q^{⊥}

. This geometric arrangement ensures that the security threshold is satisfied with the minimum necessary keys dimensions.

6. Converse

We begin with a useful lemma. It states that each user message

X_{u, v}

must contain at least L symbols of information, even when all other inputs are revealed. Similarly, each relay message

Y_{u}

must carry at least L symbols whenever there exists at least one connected input

X_{u, v}

that remains unknown.

Lemma 1.

For any

u \in [U], v \in [V]

, we have

\begin{matrix} H (X_{u, v} | {W_{i, j}, Z_{i, j}}_{(i, j) \in [U] \times [V] ∖ {(u, v)}}) & \geq L, \end{matrix}

(93)

\begin{matrix} H (Y_{u} | {W_{i, j}, Z_{i, j}}_{(i, j) \in [U] \times [V] ∖ {(u, v)}}) & \geq L . \end{matrix}

(94)

Proof.

Consider (93), we have

\begin{matrix} H (X_{u, v} | {W_{i, j}, Z_{i, j}}_{(i, j) \in [U] \times [V] ∖ {(u, v)}}) \end{matrix}

\begin{matrix} \geq I (X_{u, v}; F | {W_{i, j}, Z_{i, j}}_{(i, j) \in [U] \times [V] ∖ {(u, v)}}) \\ = H (F | {W_{i, j}, Z_{i, j}}_{(i, j) \in [U] \times [V] ∖ {(u, v)}}) \end{matrix}

(95)

\begin{matrix} - H (F | X_{u, v}, {W_{i, j}, Z_{i, j}}_{(i, j) \in [U] \times [V] ∖ {(u, v)}}) \\ \overset{(3), (4)}{=} H (W_{u, v} | {W_{i, j}, Z_{i, j}}_{(i, j) \in [U] \times [V] ∖ {(u, v)}}) \end{matrix}

(96)

\begin{matrix} - \underset{\overset{(8)}{=} 0}{\underset{︸}{H (F | X_{u, v}, {W_{i, j}, Z_{i, j}}_{(i, j) \in [U] \times [V] ∖ {(u, v)}}, Y_{[U]})}} \end{matrix}

(97)

\begin{matrix} \overset{(1)}{=} H (W_{u, v}) = L . \end{matrix}

(98)

where the last step is due to the independence of the inputs and the keys.

The proof of (94) is similar to that of (93):

\begin{matrix} H (Y_{u} | {W_{i, j}, Z_{i, j}}_{(i, j) \in [U] \times [V] ∖ {(u, v)}}) \end{matrix}

\begin{matrix} = I (Y_{u}; F | {W_{i, j}, Z_{i, j}}_{(i, j) \in [U] \times [V] ∖ {(u, v)}}) \\ = H (F | {W_{i, j}, Z_{i, j}}_{(i, j) \in [U] \times [V] ∖ {(u, v)}}) \end{matrix}

(99)

\begin{matrix} - \underset{\overset{(3), (4), (8)}{=} 0}{\underset{︸}{H (F | Y_{u}, {W_{i, j}, Z_{i, j}}_{(i, j) \in [U] \times [V] ∖ {(u, v)}})}} \end{matrix}

(100)

\begin{matrix} = H (W_{u, v}) = L . \end{matrix}

(101)

Note that in the proof of (93) and (94), only the correctness constraint (8) is imposed and the security constraints (10) and (11) are not used.

Lemma 2.

For any

u \in [U]

, we demonstrate that the messages must not disclose excessive information regarding the inputs, as doing so would violate the security constraint (10).

\begin{matrix} I ({X_{u, v}}_{v \in [V]}; {W_{u, v}}_{v \in [V]}) \leq (V - rank (B_{u})) L . \forall u \in [U] . \end{matrix}

(102)

Proof.

\begin{matrix} I ({X_{u, v}}_{v \in [V]}; {W_{u, v}}_{v \in [V]}) \end{matrix}

\begin{matrix} \leq I ({X_{u, v}}_{v \in [V]}; {W_{u, v}}_{v \in [V]}, B_{u}) \end{matrix}

(103)

\begin{matrix} = \underset{\overset{(10)}{=} 0}{\underset{︸}{I ({X_{u, v}}_{v \in [V]}; B_{u})}} + I ({X_{u, v}}_{v \in [V]}; {W_{u, v}}_{v \in [V]} ∣ B_{u}) \end{matrix}

(104)

\begin{matrix} \leq H ({W_{u, v}}_{v \in [V]} ∣ B_{u}) \end{matrix}

(105)

\begin{matrix} \leq H ({W_{u, v}}_{v \in [V]}, B_{u}) - H (B_{u}) \end{matrix}

(106)

\begin{matrix} = (V - rank (B_{u})) L . \end{matrix}

(107)

The first term in (104) is zero due to the relay security constraint (10).

Lemma 3.

Consider any G and F, the received signals must not reveal any information about the individual inputs beyond the aggregated result, as otherwise, the server security constraint (11) would be violated. we have

\begin{matrix} I ({Y_{u}}_{u \in [U]}; {S_{u}}_{u \in [U]}) \leq (U - rank (G ∣ F)) L . \end{matrix}

(108)

Proof.

\begin{matrix} I ({Y_{u}}_{u \in [U]}; {S_{u}}_{u \in [U]}) \end{matrix}

(109)

\begin{matrix} = I ({Y_{u}}_{u \in [U]}; {S_{u}}_{u \in [U]}, G) \end{matrix}

(110)

\begin{matrix} = I ({Y_{u}}_{u \in [U]}; G) + I ({Y_{u}}_{u \in [U]}; {S_{u}}_{u \in [U]} | G) \end{matrix}

(111)

\begin{matrix} \leq I ({Y_{u}}_{u \in [U]}, F; G) + H ({S_{u}}_{u \in [U]} | G) \end{matrix}

(112)

\begin{matrix} = I (F; G) + \underset{\overset{(11)}{=} 0}{\underset{︸}{I (G; {Y_{u}}_{u \in [U]} | F)}} + H ({S_{u}}_{u \in [U]}, G) - H (G) \end{matrix}

(113)

\begin{matrix} = H (G) - H (G | F) + H ({S_{u}}_{u \in [U]}, G) - H (G) \end{matrix}

(114)

\begin{matrix} \leq (rank (G) - rank (G ∣ F)) L + (U - rank (G)) L \end{matrix}

(115)

\begin{matrix} = (U - rank (G ∣ F)) L . \end{matrix}

(116)

The third term in (113) is zero due to the server security constraint (11).

Proof of

R_{Z_{Σ}} \geq max \{{max}_{u \in [U]} K_{u}, rank (G ∣ F)\} .

Building on the above lemmas, we complete the proof of the converse.

First, we show that $R_{Z_{Σ}} \geq {max}_{u \in [U]} K_{u}$ . By Lemma 2, we have

\begin{matrix} L_{Z_{Σ}} & \geq H (Z_{Σ}) \end{matrix}

(117)

\begin{matrix} \geq H ({Z_{u, v}}_{v \in [V]}) \end{matrix}

(118)

\begin{matrix} \geq I ({Z_{u, v}}_{v \in [V]}; {X_{u, v}}_{v \in [V]} | {W_{u, v}}_{v \in [V]}) \end{matrix}

(119)

\begin{matrix} = H ({X_{u, v}}_{v \in [V]} | {W_{u, v}}_{v \in [V]}) \end{matrix}

(120)

\begin{matrix} = H ({X_{u, v}}_{v \in [V]}) - I ({X_{u, v}}_{v \in [V]}; {W_{u, v}}_{v \in [V]}) \end{matrix}

(121)

\begin{matrix} \geq \sum_{v \in [V]} H (X_{u, v} | {W_{i, j}, Z_{i, j}}_{(i, j) \in [U] \times [V] ∖ {(u, v)}}) - I ({X_{u, v}}_{v \in [V]}; {W_{u, v}}_{v \in [V]}) \end{matrix}

(122)

\begin{matrix} \overset{(93) (102)}{\geq} V L - (V - rank (B_{u})) L = K_{u} L . \end{matrix}

(123)

Therefore,

R_{Z_{Σ}} = \frac{L_{Z_{Σ}}}{L} \geq max_{u \in [U]} K_{u} .

(124)

Second, we show that

R_{Z_{Σ}} \geq rank (G ∣ F)

. By Lemma 3, we have

\begin{matrix} L_{Z_{Σ}} & \geq H (Z_{Σ}) \end{matrix}

(125)

\begin{matrix} \geq H (Z_{[U] \times [V]}) \end{matrix}

(126)

\begin{matrix} \geq I (Z_{[U] \times [V]}; {Y_{u}}_{u \in [U]} | {S_{u}}_{u \in [U]}) \end{matrix}

(127)

\begin{matrix} = H ({Y_{u}}_{u \in [U]} | {S_{u}}_{u \in [U]}) - \underset{\overset{(6)}{=} 0}{\underset{︸}{H ({Y_{u}}_{u \in [U]} | {S_{u}}_{u \in [U]}, Z_{[U] \times [V]})}} \end{matrix}

(128)

\begin{matrix} = H ({Y_{u}}_{u \in [U]}) - I ({S_{u}}_{u \in [U]}; {Y_{u}}_{u \in [U]}) \end{matrix}

(129)

\begin{matrix} \geq \sum_{u = 1}^{U} H (Y_{u} | {W_{i, j}, Z_{i, j}}_{(i, j) \in [U] \times [V] ∖ {(u, j) : j \in [V]}}) - I ({S_{u}}_{u \in [U]}; {Y_{u}}_{u \in [U]}) \end{matrix}

(130)

\begin{matrix} \overset{(94) (108)}{\geq} U L - (U - rank (G ∣ F)) L \end{matrix}

(131)

\begin{matrix} = rank (G ∣ F) L . \end{matrix}

(132)

Hence,

R_{Z_{Σ}} = \frac{L_{Z_{Σ}}}{L} \geq rank (G ∣ F) .

(133)

Combining (124) and (133), we obtain

R_{Z_{Σ}} \geq max \{max_{u \in [U]} K_{u}, rank (G ∣ F)\} .

7. Conclusions

This paper investigates information theoretic secure aggregation of linear functions over a two hop hierarchical network with relay-assisted communication. By jointly accounting for relay-level privacy constraints and server-side function-specific security requirements, we establish a unified framework for hierarchical vector linear secure aggregation.

Our main contribution is a complete characterization of the optimal communication key rate region. We show that both hops achieve the minimum possible communication rate of one symbol per input symbol, while the required source key rate is governed by the maximum of the intra-cluster security requirement and the conditional rank rank

(G ∣ F)

. This result demonstrates that hierarchical architectures incur no additional communication cost compared to single hop systems, while substantially reducing the masking burden at the server through structured key injection.

To achieve these fundamental limits, we propose an explicit linear coding scheme based on systematic precoding, subspace alignment, and zero forcing. The scheme exploits the algebraic structure of the authorized and unauthorized functions to inject randomness exclusively into dimensions that do not interfere with the authorized computation. The achievability and converse proofs together establish that the derived rate region is information theoretically tight.

Overall, this work clarifies the fundamental role of hierarchy in secure aggregation and provides theoretical guidance for the design of scalable privacy preserving distributed learning systems. Future work includes extending the framework to scenarios with collusion among servers, relays, and users, as well as investigating robustness under user dropouts, heterogeneous cluster sizes, and asymmetric communication constraints.

Author Contributions

Conceptualization, J.L. and Z.L.; methodology, J.L., X.Z. and Z.L.; formal analysis, J.L. and Z.L.; investigation, J.L. and X.Z.; writing—original draft preparation, J.L. and Z.L.; writing—review and editing, J.L., X.Z. and Z.L.; funding acquisition, Z.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Key Research and Development Program of Guangxi (No. AD25069071), the Guangxi Natural Science Foundation (Grant No. 2025GXNSFBA069315), the National Natural Science Foundation of China (Grant No. 62401266), and the Jiangsu Natural Science Foundation (Grant No. BK20241452).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Zhao, Y.; Sun, H. Secure Summation: Capacity Region, Groupwise Key, and Feasibility. IEEE Trans. Inf. Theory 2023, 70, 1376–1387. [Google Scholar] [CrossRef]
Yuan, X.; Sun, H. Vector Linear Secure Aggregation. arXiv 2025, arXiv:2502.09817. [Google Scholar] [CrossRef]
Hu, L.; Ulukus, S. On the Capacity Region of Individual Key Rates in Vector Linear Secure Aggregation. arXiv 2026, arXiv:2601.03241. [Google Scholar] [CrossRef]
Zhang, X.; Wan, K.; Sun, H.; Wang, S.; Ji, M.; Caire, G. Optimal Communication and Key Rate Region for Hierarchical Secure Aggregation with User Collusion. arXiv 2024, arXiv:2410.14035. [Google Scholar] [CrossRef]
Zhao, Y.; Sun, H. Information Theoretic Secure Aggregation with User Dropouts. IEEE Trans. Inf. Theory 2022, 68, 7471–7484. [Google Scholar] [CrossRef]
Zhang, Z.; Liu, J.; Wan, K.; Sun, H.; Ji, M.; Caire, G. On Secure Aggregation with Uncoded Groupwise Keys Against User Dropouts and User Collusion. IEEE Trans. Inf. Theory 2025, 71, 8391–8413. [Google Scholar] [CrossRef]
Zhao, Y.; Sun, H. The Optimal Rate of MDS Variable Generation. In Proceedings of the 2023 IEEE International Symposium on Information Theory (ISIT); IEEE: Piscataway, NJ, USA, 2023; pp. 832–837. [Google Scholar]
Jahani-Nezhad, T.; Maddah-Ali, M.A.; Li, S.; Caire, G. Swiftagg: Communication-efficient and dropout-resistant secure aggregation for federated learning with worst-case security guarantees. In Proceedings of the 2022 IEEE International Symposium on Information Theory (ISIT); IEEE: Piscataway, NJ, USA, 2022; pp. 103–108. [Google Scholar]
Jahani-Nezhad, T.; Maddah-Ali, M.A.; Li, S.; Caire, G. SwiftAgg+: Achieving asymptotically optimal communication loads in secure aggregation for federated learning. IEEE J. Sel. Areas Commun. 2023, 41, 977–989. [Google Scholar] [CrossRef]
Li, Z.; Zhao, Y.; Sun, H. Weakly secure summation with colluding users. IEEE Trans. Inf. Theory 2025, 71, 5672–5683. [Google Scholar] [CrossRef]
Wan, K.; Sun, H.; Ji, M.; Mi, T.; Caire, G. The Capacity Region of Information Theoretic Secure Aggregation with Uncoded Groupwise Keys. IEEE Trans. Inf. Theory 2024, 70, 6932–6949. [Google Scholar] [CrossRef]
Wan, K.; Yao, X.; Sun, H.; Ji, M.; Caire, G. On the information theoretic secure aggregation with uncoded groupwise keys. IEEE Trans. Inf. Theory 2024, 70, 6596–6619. [Google Scholar] [CrossRef]
Sun, H. Secure Aggregation with an Oblivious Server. arXiv 2023, arXiv:2307.13474. [Google Scholar] [CrossRef]
Weng, S.; Ren, C.; Zhao, Y.; Xiao, M.; Skoglund, M. Coding-Enforced Robust Secure Aggregation for Federated Learning Under Unreliable Communication. arXiv 2025, arXiv:2507.07565. [Google Scholar]
Zhang, X.; Li, Z.; Wan, K.; Sun, H.; Ji, M.; Caire, G. Communication-Efficient Hierarchical Secure Aggregation with Cyclic User Association. In Proceedings of the 2025 IEEE International Symposium on Information Theory (ISIT); IEEE: Piscataway, NJ, USA, 2025; pp. 1–6. [Google Scholar] [CrossRef]
Egger, M.; Hofmeister, C.; Wachter-Zeh, A.; Bitar, R. Private aggregation in wireless federated learning with heterogeneous clusters. In Proceedings of the 2023 IEEE International Symposium on Information Theory (ISIT); IEEE: Piscataway, NJ, USA, 2023; pp. 54–59. [Google Scholar]
Xu, M.; Han, X.; Wan, K.; Ge, G. On hierarchical secure aggregation against relay and user collusion. arXiv 2025, arXiv:2511.20117. [Google Scholar] [CrossRef]
Li, Z.; Zhang, X.; Lv, J.; Chen, H.; Fan, J.; Caire, G. Hierarchical Secure Aggregation with Heterogeneous Security Constraints and Arbitrary User Collusion. arXiv 2025, arXiv:2507.14768. [Google Scholar]
Lu, Q.; Cheng, J.; Kang, W.; Liu, N. Capacity of Hierarchical Secure Coded Gradient Aggregation with Straggling Communication Links. arXiv 2024, arXiv:2412.11496. [Google Scholar] [CrossRef]
Zhang, X.; Luo, Y.; Li, T. A Review of Research on Secure Aggregation for Federated Learning. Future Internet 2025, 17, 308. [Google Scholar] [CrossRef]
Xing, L.; Luo, Z.; Deng, K.; Wu, H.; Ma, H.; Lu, X. FedHSQA: Robust Aggregation in Hierarchical Federated Learning via Anomaly Scoring-Based Adaptive Quantization for IoV. Electronics 2025, 14, 1661. [Google Scholar] [CrossRef]
Gao, Q.; Sun, Y.; Chen, X.; Yang, F.; Wang, Y. An Efficient Multi-Party Secure Aggregation Method Based on Multi-Homomorphic Attributes. Electronics 2024, 13, 671. [Google Scholar] [CrossRef]
Park, S.; Chi, J. V-MHESA: A Verifiable Masking and Homomorphic Encryption-Combined Secure Aggregation Strategy for Privacy-Preserving Federated Learning. Mathematics 2025, 13, 3687. [Google Scholar] [CrossRef]

Figure 1. Illustration of hierarchical secure linear aggregation. The aggregation server is permitted to compute information about the linear function F, but is not permitted to compute any information about the linear function G. In addition, each relay is required to be unable to compute any intra-cluster information about the linear function

B_{u}

(intra-cluster privacy constraint).

Figure 1. Illustration of hierarchical secure linear aggregation. The aggregation server is permitted to compute information about the linear function F, but is not permitted to compute any information about the linear function G. In addition, each relay is required to be unable to compute any intra-cluster information about the linear function

B_{u}

(intra-cluster privacy constraint).

Table 1. Comparison between representative single-hop vector linear schemes and the proposed two-hop hierarchical scheme.

Aspect	Representative Single-Hop Vector Linear Schemes	Proposed Two-Hop Hierarchical Scheme
Architecture	Single-hop	Two-hop hierarchical
Trust model	Honest-but-curious server	Semi-trusted relays and honest-but-curious server
Security target	Server-side target-function privacy	Relay-side $B_{u}$ -function privacy and server-side target-function privacy
Communication efficiency	Optimal communication rate: $R_{X} = 1$	Optimal communication rates on both hops: $R_{X} = 1, R_{Y} = 1$
Technical challenge	Single-layer code design	Unified linear design under coupled relay/server security constraints

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lv, J.; Zhang, X.; Li, Z. On the Communication–Key Rate Region of Hierarchical Vector Linear Secure Aggregation. Entropy 2026, 28, 352. https://doi.org/10.3390/e28030352

AMA Style

Lv J, Zhang X, Li Z. On the Communication–Key Rate Region of Hierarchical Vector Linear Secure Aggregation. Entropy. 2026; 28(3):352. https://doi.org/10.3390/e28030352

Chicago/Turabian Style

Lv, Jiawen, Xiang Zhang, and Zhou Li. 2026. "On the Communication–Key Rate Region of Hierarchical Vector Linear Secure Aggregation" Entropy 28, no. 3: 352. https://doi.org/10.3390/e28030352

APA Style

Lv, J., Zhang, X., & Li, Z. (2026). On the Communication–Key Rate Region of Hierarchical Vector Linear Secure Aggregation. Entropy, 28(3), 352. https://doi.org/10.3390/e28030352

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

On the Communication–Key Rate Region of Hierarchical Vector Linear Secure Aggregation

Abstract

1. Introduction

2. Problem Statement

3. Main Results

4. Motivating Example (U = 4, V = 3, M = 2, N = 1)

5. General Achievability Proof of Theorem 1

5.1. Conceptual Overview of the Construction

5.2. General Construction

6. Converse

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI