On the Storage–Communication Trade-Off in Graph-Based X-Secure T-Private Linear Computation

Liu, Yueyang; Jia, Haobo; Jia, Zhuqing

doi:10.3390/e27090975


by Yueyang Liu, Haobo Jia and Zhuqing Jia *

School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing 100876, China

* Author to whom correspondence should be addressed.

Entropy 2025, 27(9), 975; https://doi.org/10.3390/e27090975

Submission received: 10 August 2025 / Revised: 14 September 2025 / Accepted: 15 September 2025 / Published: 18 September 2025

(This article belongs to the Special Issue Information-Theoretic Security and Privacy)


Abstract

The problem of graph-based X-secure T-private linear computation (GXSTPLC) is to allow a user to retrieve a linear combination of K messages from a set of N distributed servers that store the messages in a graph-based fashion, i.e., each message is restricted to be distributed among a subset of servers. T-privacy requires that the coefficients of the linear combination are not revealed to any group of up to T colluding servers, and X-security guarantees that any set of up to X colluding servers learns nothing about the messages. In this paper, we propose an achievability scheme for GXSTPLC that enables a storage–communication trade-off by exploiting non-replicated storage codes. Novel aspects of our achievability scheme include the usage of the idea of cross-subspace alignment null shaper that addresses various challenges posed by the graph-based storage structure. In addition, unlike previous works, our scheme allows a direct transformation into a quantum one to achieve a superdense coding gain by leveraging the idea of N-Sum Box abstraction of quantum “over-the-air” computing.

Keywords: linear computation; communication efficiency; storage efficiency; cross subspace alignment; private information retrieval

1. Introduction

Escalating concerns regarding security and privacy in distributed systems motivate the problem of private linear computation (PLC). PLC considers a scenario where K messages are stored (possibly replicated or coded) across N distributed servers. The user aims to retrieve a linear combination of these messages without revealing the coefficients of the linear combination to any group of up to T colluding servers, where T, representing the maximum number of tolerable colluding servers, is referred to as the privacy threshold. Notably, PLC is a non-trivial generalization of private information retrieval (PIR), as PIR corresponds to the special case where the user uses a one-hot coefficient vector to retrieve a single message. Recent advancements in the study of PIR and PLC from an information-theoretic perspective have yielded a series of capacity (i.e., the reciprocal of the minimum possible normalized download cost across N servers) characterizations and novel coding schemes for these problems and their variants [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45].

Existing paradigms like PIR and PLC often assume global data availability, where each message can be stored across all servers. This assumption is restrictive in real-world applications: constraints such as geographic blocking, network connectivity, and data security mean that data are not uniformly available at all servers, and data must be secure against storage eavesdroppers. Driven by this problem, a PLC variant known as graph-based X-secure T-private linear computation (GXSTPLC) incorporates two key constraints. The first is non-uniform data availability, or graph-based PLC, where each message is restricted to be stored among a specific subset of servers, referred to as the storage pattern. This pattern can be naturally represented by a hypergraph where servers are nodes and the storage subset for each message forms a hyperedge. The second is X-security of the storage, a standard requirement ensuring that no information about the messages is revealed to any set of up to X colluding servers. While several prior works address graph-based PIR/PLC [35,36,37,38], and the asymptotic (i.e., in the limit as the number of messages K approaches infinity) capacity of GXSTPLC was fully characterized in [38], a subtle observation arises. To the best of our knowledge, the storage codes in related prior graph-based PIR/PLC works are essentially repetition codes (meaning each stored portion of a message is at least as large as the original message). Furthermore, the achievability scheme in [38] may necessitate storage exceeding simple replication, which can be highly storage-inefficient. Our work is motivated by this potential storage inefficiency; i.e., we are interested in the trade-off between the download cost and storage efficiency by leveraging coding techniques in the storage construction. We note that deriving the tight converse bound (i.e., the full capacity characterization) for the storage–download trade-off of GXSTPLC remains a challenging open problem. The focus of this work is to establish the first achievability scheme that enables such a trade-off.

The main result of this work is an achievability scheme for GXSTPLC that allows a trade-off between the download cost and storage cost. Specifically, we first propose an achievability scheme for a closely related problem, namely Asymmetric MDS-GXSTPLC, where security and privacy requirements are non-uniform across the messages; i.e., each message and the corresponding coefficient are specified with individual security and privacy thresholds. In addition, the storage code for each message is an MDS code that results in a reduction in storage cost (by trading off the communication efficiency). Then, based on the achievability scheme for Asymmetric MDS-GXSTPLC, our achievability scheme is finalized by adapting the idea of the augmented system in [38]. The key novelty that distinguishes our scheme from previous state-of-the-art is summarized as follows:

The idea of exploiting MDS codes for the storage in graph-based PIR/PLC: Ref. [38] achieves a single point (minimum download cost) using replication codes that turn out to be storage-inefficient, while our scheme leverages MDS-coded storage to allow a storage–download trade-off. To the best of our knowledge, this is the first scheme to incorporate MDS codes in graph-based PIR/PLC.
The technique used to handle the challenges introduced by the graph-based storage structure: Our work introduces a novel technique centered around the idea of a cross-subspace alignment (CSA) null shaper introduced in [27] to address the challenges introduced by the graph-based storage structure. The CSA null shaper was originally designed for storage-consistent private updates with unavailable servers. However, in this work, this idea is adapted to ensure that the overall storage conforms to valid CSA codewords under the graph-based storage constraints. This distinguishes our scheme from the scheme in [38], where the PLC under graph-based storage structure is enabled by a combination of techniques including CSA codes, dual Generalized Reed–Solomon (GRS) codes, and a Vandermonde decomposition of Cauchy matrices. Intuitively, CSA codes can be viewed as evaluation codes, and the CSA null shaper carefully places zeros at certain evaluation points, which correspond precisely to the servers prohibited by the graph-based storage pattern from storing codewords of a particular message. Consequently, the codewords for these servers are explicitly set to zero, requiring no storage at all, and the overall codewords (including zeros) remain valid CSA codewords. It should be noted that the idea of placing zeros in the storage construction for graph-based PIR/PLC may be profound, as the storage code of many known PIR/PLC schemes can be viewed as evaluation codes (e.g., polynomial codes based PIR/PLC in [12,24,46,47,48,49]). This idea may transform known PIR/PLC schemes into graph-based ones.
Reduced decoding complexity and quantum adaptability: Unlike schemes based on dual GRS codes properties, where a pre-processing step of interference cancellation during decoding is generally necessary, in our scheme, the user can recover the desired linear combination by merely solving linear systems defined by Cauchy–Vandermonde matrices, hence the reduction in decoding complexity. Moreover, our scheme is compatible with the N-Sum Box abstraction of quantum “over-the-air” computing [44,50], enabling a direct transformation of our scheme into a quantum one to achieve the superdense coding gain.

Notations: $\mathbb{Z}_{> 0}$ denotes the set of positive integers. The set of rational numbers is denoted as $\mathbb{Q}$, while $\mathbb{Q}_{\geq 0}$ denotes the set of non-negative rational numbers. For any two positive integers $M < N$, $[M : N]$ denotes the set $\{M, M + 1, \dots, N\}$, and $[N]$ denotes $[1 : N]$. For any set of random variables $X_{1}, X_{2}, \dots, X_{N}$ indexed by $[N]$, and an index set $\mathcal{I} \subset [N]$, $X_{\mathcal{I}}$ denotes $\{X_{i} \mid i \in \mathcal{I}\}$. For any $x \in \mathbb{R}$, $(x)^{+}$ denotes $\max (x, 0)$.

2. Problem Statement

Let us consider a private linear computation problem with N servers, denoted as Server $n$, $n \in [N]$, and K messages. As depicted in Figure 1, the messages are partitioned into M disjoint sets, i.e., $W = \bigcup_{m \in [M]} W_{m}$, $W_{i} \cap W_{j} = \emptyset$, $\forall i \neq j$, $i, j \in [M]$, where $W_{m}$ is the $m^{th}$ message set and $W$ is the set of all messages. For each $m \in [M]$, we define $W_{m} = \{W_{m, 1}, W_{m, 2}, \dots, W_{m, K_{m}}\}$; i.e., the message set $W_{m}$ consists of $K_{m}$ messages, and each of the messages comprises L i.i.d. symbols from a finite field $\mathbb{F}_{q}$. Formally, for all $m \in [M], k \in [K_{m}]$, we have $W_{m, k} = [W_{m, k} (1), W_{m, k} (2), \dots, W_{m, k} (L)]^{\top}$, and

\begin{matrix} H ({(W_{m, k})}_{m \in [M], k \in [K_{m}]}) = K L, \end{matrix}

(1)

in q-ary units. Due to non-uniform data availability, each message set is only allowed to be stored among a subset of the N servers. To characterize this graph-based storage fashion, let us define

\begin{matrix} R & = {R_{1}, R_{2}, \dots, R_{M}}, \end{matrix}

(2)

\begin{matrix} R_{m} & = {R_{m} (1), R_{m} (2), \dots, R_{m} (ρ_{m})} \subset [N], \end{matrix}

(3)

where $R_{m}$ corresponds to the $m^{th}$ message set $W_{m}$, and for all $m \in [M]$, $R_{m}$ represents a subset of servers; i.e., the message set $W_{m}$ is allowed to be stored among Server $n$, $n \in R_{m}$. The collection of server subsets $R$ is referred to as the storage pattern. Note that it is occasionally more convenient to consider the dual representation of the storage pattern,

\begin{matrix} M & = {M_{1}, M_{2}, \dots, M_{N}}, \end{matrix}

(4)

\begin{matrix} M_{n} & = {m \in [M] ∣ R_{m} ∋ n}, \end{matrix}

(5)

i.e., for all $n \in [N]$, Server n is allowed to store a securely coded codeword of the messages $W_{m}, m \in M_{n}$. In other words, denoting the codeword of the message $W_{m, k}$, $m \in [M]$, $k \in [K_{m}]$ stored at Server $n$, $n \in R_{m}$ as $\tilde{W}_{m, k}^{(n)}$, the storage at Server $n$, $n \in [N]$, denoted as $S_{n}$, is, thus,

\begin{matrix} S_{n} = {{\tilde{W}}_{m, k}^{(n)} ∣ m \in M_{n}, k \in [K_{m}]} . \end{matrix}

(6)

Moreover, the X-secure storage constraint guarantees that the collusion of any up to X servers discloses nothing about the messages, i.e.,

\begin{matrix} I (S_{\mathcal{X}}; W) = 0, \forall \mathcal{X} \subset [N], | \mathcal{X} | = X . \end{matrix}

(7)

Let us further elaborate on the above notations via the following example. Assume that we have $M = 2$ message sets $W_{1}, W_{2}$ that are stored at $N = 8$ servers, as shown in the following table.

Server 1	Server 2	Server 3	Server 4	Server 5	Server 6	Server 7	Server 8
$W_{2}$	$W_{1}$	$W_{1}$	$W_{2}$	$W_{2}$	$W_{1}$	$W_{1}, W_{2}$	$W_{1}$

For this example, we have

\begin{matrix} R_{1} & = {2, 3, 6, 7, 8}, & ρ_{1} & = 5, & R_{2} & = {1, 4, 5, 7}, & ρ_{2} & = 4, \end{matrix}

(8a)

\begin{matrix} M_{1} & = {2}, & M_{2} & = {1}, & M_{3} & = {1}, & M_{4} & = {2}, \end{matrix}

(8b)

\begin{matrix} M_{5} & = {2}, & M_{6} & = {1}, & M_{7} & = {1, 2}, & M_{8} & = {1} . \end{matrix}

(8c)
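As a quick illustration of how the dual representation follows from the storage pattern, the minimal Python sketch below derives $M_{n}$ from $R_{m}$ for the pattern read off the table above. The function name `dual_pattern` and the dictionary layout are assumptions made purely for illustration.

```python
def dual_pattern(R, N):
    """Return {n: set of m with n in R[m]} for servers 1..N (the dual representation)."""
    return {n: {m for m, servers in R.items() if n in servers}
            for n in range(1, N + 1)}

# Storage pattern read off the table (message-set index -> servers storing it).
R = {1: {2, 3, 6, 7, 8}, 2: {1, 4, 5, 7}}

M_dual = dual_pattern(R, N=8)                         # M_n for every server n
rho = {m: len(servers) for m, servers in R.items()}   # rho_m = |R_m|
```

Server 7 is the only server in both subsets, so it is the only one whose dual set contains both indices.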

In our private linear computation problem, the user is interested in a linear combination of the K messages of the following form

\begin{matrix} λ_{Λ} (W) ≜ \sum_{m \in [M]} \sum_{k \in [K_{m}]} λ_{m, k} W_{m, k}, \end{matrix}

(9)

where $Λ = (λ_{m, k})_{m \in [M], k \in [K_{m}]}$ is the coefficient of the linear combination and $λ_{m, k} \in \mathbb{F}_{q}$, $m \in [M], k \in [K_{m}]$ are generated by the user privately, uniformly i.i.d. over the finite field $\mathbb{F}_{q}$. For this purpose, the user generates a total of N queries, $Q_{n}^{(Λ)}, n \in [N]$, where $Q_{n}^{(Λ)}$ is intended for Server n. The queries are generated without prior awareness of messages or server storage, i.e.,

\begin{matrix} I (S_{[N]}; Λ, Q_{1}^{(Λ)}, \dots, Q_{N}^{(Λ)}) = 0 . \end{matrix}

(10)

T-privacy guarantees that any set of up to T colluding servers learns nothing about the coefficients $Λ$. Formally, we have

\begin{matrix} I (Q_{\mathcal{T}}^{(Λ)}; Λ) = 0, \forall \mathcal{T} \subset [N], | \mathcal{T} | = T . \end{matrix}

(11)

Once the query $Q_{n}^{(Λ)}$ is available at Server n, $n \in [N]$, an answer string $A_{n}^{(Λ)}$ is generated as a function of $Q_{n}^{(Λ)}$ and its storage $S_{n}$, i.e.,

\begin{matrix} H (A_{n}^{(Λ)} ∣ Q_{n}^{(Λ)}, S_{n}) = 0 . \end{matrix}

(12)

A private linear computation scheme is correct if and only if the desired linear combination is resolvable from the collection of downloaded server answers, i.e.,

\begin{matrix} H (λ_{Λ} (W) ∣ A_{[N]}^{(Λ)}, Q_{[N]}^{(Λ)}, Λ) = 0 . \end{matrix}

(13)

To characterize the communication efficiency of a private linear computation scheme per server, we define the normalized download cost $D_{n}$ as the expected number of q-ary symbols downloaded from the n-th server, normalized by L, that is, $D_{n} = \frac{H (A_{n}^{(Λ)})}{L}$. In addition, to measure the storage efficiency per server, we define the normalized storage cost for all $n \in [N]$ as follows:

\begin{matrix} C_{n} = \frac{H (S_{n})}{K L} . \end{matrix}

(14)

3. Main Result

The main result of this work is an achievability scheme for the problem of GXSTPLC that allows a trade-off between the download cost and storage cost, formally presented in the following theorem.

Theorem 1. Let $η = (η_{1}, η_{2}, \dots, η_{M}) \in \mathbb{Q}^{M}$ be a vector of rational numbers such that for all $m \in [M]$, $η_{m} \geq 1$, and each $η_{m} = p_{m}^{'} / q_{m}^{'}$, with $p_{m}^{'}, q_{m}^{'}$ being co-prime. Consider a target vector of per-server normalized download costs $(D_{1}, D_{2}, \dots, D_{N}) \in \mathbb{Q}_{\geq 0}^{N}$, where each $D_{n} = p_{n} / q_{n}$, with $p_{n}, q_{n}$ being co-prime. If the vector $(D_{1}, D_{2}, \dots, D_{N})$ lies within the achievable region $\mathcal{D} (η)$, defined as

\begin{matrix} \mathcal{D} (η) ≜ \left\{ (D_{1}, \dots, D_{N}) \in \mathbb{Q}_{\geq 0}^{N} \;\middle|\; \forall m \in [M], \forall R_{m}^{'} \subseteq R_{m} \text{ with } | R_{m}^{'} | = | R_{m} | - X - T, \ \sum_{n \in R_{m}^{'}} D_{n} \geq η_{m} \right\}, \end{matrix}

(15)

then this vector of download costs is achievable by the proposed GXSTPLC scheme. The corresponding normalized storage cost $C_{n}, n \in [N]$ is given by

\begin{matrix} C_{n} = \frac{1}{K} \sum_{m \in M_{n}} \min (τ_{n}, γ_{m}) \frac{K_{m}}{q_{0} (η_{m} - 1) + 1}, \end{matrix}

(16)

where the integers $τ_{n}$ are defined as $τ_{n} = q_{0} D_{n}$ with $q_{0} = \mathrm{lcm} (q_{1}, q_{2}, \dots, q_{N}, q_{1}^{'}, q_{2}^{'}, \dots, q_{M}^{'})$, and for each $m \in [M]$, $γ_{m}$ is the $(X + T)$-th largest value in the multiset $\{τ_{n} \mid n \in R_{m}\}$ when its elements are sorted in non-increasing order.
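To make the statement concrete, the following minimal Python sketch checks membership in the region (15) and evaluates the storage cost (16) with exact rational arithmetic. The function names, the dictionary-based inputs, and the toy parameters are assumptions for illustration only. The sketch also assumes $X + T \geq 1$ and treats a message set with $| R_{m} | \leq X + T$ as infeasible. Since the smallest sum over all subsets of a fixed size is attained by the smallest entries, sorting replaces the enumeration of every $R_{m}^{'}$.

```python
from fractions import Fraction
from math import lcm  # Python 3.9+

def in_region(D, R, eta, X, T):
    """Check (15): for each m, the |R_m|-X-T smallest D_n in R_m must sum to >= eta_m."""
    for m, servers in R.items():
        size = len(servers) - X - T
        if size <= 0:  # assumption: treated as infeasible
            return False
        if sum(sorted(D[n] for n in servers)[:size]) < eta[m]:
            return False
    return True

def storage_costs(D, R, eta, K_sets, X, T):
    """Evaluate (16) for every server; D and eta hold Fractions, assumes X + T >= 1."""
    q0 = lcm(*[d.denominator for d in D.values()],
             *[e.denominator for e in eta.values()])
    tau = {n: int(q0 * d) for n, d in D.items()}
    K = sum(K_sets.values())
    # gamma_m: the (X+T)-th largest tau_n among servers in R_m, as stated in the theorem
    gamma = {m: sorted((tau[n] for n in servers), reverse=True)[X + T - 1]
             for m, servers in R.items()}
    C = {}
    for n in D:
        total = Fraction(0)
        for m, servers in R.items():
            if n in servers:
                total += Fraction(min(tau[n], gamma[m]) * K_sets[m]) / (q0 * (eta[m] - 1) + 1)
        C[n] = total / K
    return C

# Toy setting (hypothetical): one message set of 3 messages on 4 servers, X = T = 1.
R = {1: {1, 2, 3, 4}}
K_sets = {1: 3}
D_half = {n: Fraction(1, 2) for n in range(1, 5)}
D_one = {n: Fraction(1) for n in range(1, 5)}
C_replicated = storage_costs(D_half, R, {1: Fraction(1)}, K_sets, 1, 1)
C_coded = storage_costs(D_one, R, {1: Fraction(2)}, K_sets, 1, 1)
```

In this toy setting, moving from $η_{1} = 1$ to $η_{1} = 2$ halves the per-server storage cost while doubling the per-server download cost needed to stay inside the region.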

Remark 1. It is remarkable that by setting $η_{m} = 1$ for all $m \in [M]$, our result reduces to that of [38], which is capacity-achieving; i.e., it achieves the minimum possible normalized total download cost $D = \sum_{n \in [N]} D_{n}$ via the optimal configuration of the $D_{n}$'s. It is also worth mentioning that by setting $η_{m} = 1$ for all $m \in [M]$, the closure of our achievable region $\mathcal{D} (1)$ in the real space $\mathbb{R}^{N}$ exactly matches the converse region established in [38] (Theorem 1).

The Storage–Communication Trade-Off in the Proposed GXSTPLC Scheme

In this section, we investigate the fundamental trade-off between the normalized storage cost $C_{n}$ and the normalized download cost $D_{n}$ of our proposed GXSTPLC scheme. To facilitate this analysis, we can bound the expression for $C_{n}$ as follows.

\begin{matrix} C_{n} & = \frac{1}{K} \sum_{m \in M_{n}} min (τ_{n}, γ_{m}) \frac{K_{m}}{q_{0} (η_{m} - 1) + 1} \end{matrix}

(17)

\begin{matrix} \leq \frac{1}{K} \sum_{m \in M_{n}} τ_{n} \frac{K_{m}}{q_{0} (η_{m} - 1 + \frac{1}{q_{0}})} \end{matrix}

(18)

\begin{matrix} = \frac{D_{n}}{K} \sum_{m \in M_{n}} \frac{K_{m}}{η_{m} - 1 + \frac{1}{q_{0}}}, \end{matrix}

(19)

where (18) holds by noting that $\min (τ_{n}, γ_{m}) \leq τ_{n}$. To further simplify our analysis, let us consider a symmetric scenario where $K_{m} = K / M$ for all $m \in [M]$. In addition, $q_{0}$ is a least common multiple of denominators and thus tends to be large, so when $η_{m} > 1$ for all $m \in M_{n}$, the term $1 / q_{0}$ can be neglected and the bound can be approximated as

\begin{matrix} C_{n} ⪅ \frac{D_{n}}{M} \sum_{m \in M_{n}} \frac{1}{η_{m} - 1} . \end{matrix}

(20)

This approximation explicitly shows that for operating points within the achievable region $\mathcal{D} (η)$, for a given server n, the bound on its storage cost $C_{n}$ is directly proportional to its own download cost $D_{n}$ and inversely related to the term $η_{m} - 1$ for each message set m it stores. This relationship suggests that one potential strategy to reduce $C_{n}$ is to increase the corresponding values of $η_{m}$ while keeping $D_{n}$ fixed. However, according to the definition of the achievable region $\mathcal{D} (η)$, increasing any $η_{m}$ tightens the feasibility constraint associated with the message set m, requiring a larger sum of download costs from the relevant servers. To satisfy this stricter condition, the download costs of other servers (i.e., those in $R_{m}$) must collectively increase. This increase in another server's download cost, namely $D_{n^{'}}$, is likely to elevate its own storage cost $C_{n^{'}}$. Therefore, an attempt to locally optimize storage cost on one server can inadvertently shift the burden, increasing both download and storage costs elsewhere in the system.
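The bound (19) and its approximation (20) can be compared numerically with a short sketch. The helper names and the sample values ($D_{n} = 1/2$, two message sets of two messages each stored at server n, $η = (2, 3)$, $q_{0} = 10$) are hypothetical and chosen only to show how close the two expressions are.

```python
from fractions import Fraction

def storage_bound(D_n, K, K_sets, etas, q0):
    """Right-hand side of (19); K_sets and etas list the message sets stored at server n."""
    return Fraction(D_n) / K * sum(
        Fraction(K_m) / (eta - 1 + Fraction(1, q0)) for K_m, eta in zip(K_sets, etas))

def storage_approx(D_n, M, etas):
    """Approximation (20) for the symmetric case K_m = K/M; requires every eta > 1."""
    return Fraction(D_n) / M * sum(1 / (Fraction(eta) - 1) for eta in etas)

bound = storage_bound(Fraction(1, 2), K=4, K_sets=[2, 2], etas=[2, 3], q0=10)
approx = storage_approx(Fraction(1, 2), M=2, etas=[2, 3])
```

Dropping the $1 / q_{0}$ term can only enlarge the expression, so the approximation stays above the bound (19) whenever every $η_{m}$ exceeds 1.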

On the other hand, it is also of interest to explore the trade-off between the total normalized storage cost, denoted by $C = \sum_{n = 1}^{N} C_{n}$, and the total normalized download cost, $D = \sum_{n = 1}^{N} D_{n}$. To construct the optimal trade-off curve of D versus C, for a given total download cost $D_{target}$, our goal is to find the minimum possible total storage cost $C_{min}$. This is formulated as an optimization problem:

\begin{matrix} C_{min} (D_{target}) = min_{η, {(D_{n})}_{n \in [N]}} & \sum_{n = 1}^{N} C_{n} \end{matrix}

(21)

\begin{matrix} s . t . & \sum_{n = 1}^{N} D_{n} \leq D_{target}, \end{matrix}

(22)

\begin{matrix} (D_{1}, D_{2}, \dots, D_{N}) \in D (η), \end{matrix}

(23)

\begin{matrix} η_{m} \geq 1, \forall m \in [M] . \end{matrix}

(24)

By solving this optimization problem for a range of $D_{target}$ values, we obtain a set of optimal operating points $(D_{target}, C_{min} (D_{target}))$. Figure 2 presents such a storage–communication trade-off curve for a specific setting. It is crucial to note that this figure is plotted by numerically solving the optimization problem (21)–(24) via an exhaustive search-based strategy. In particular, for the specific system setting, we first discretize and sweep over a range of plausible values for the parameters $η_{1}$ and $η_{2}$ (subject to $η_{1}, η_{2} \geq 1$). Then, for each candidate pair $(η_{1}, η_{2})$, we find an achievable download cost vector $(D_{1}, D_{2}, \dots, D_{N})$ that minimizes the total download cost $D_{target} = \sum_{n} D_{n}$ for the specific $η$. For each feasible $(η_{1}, η_{2})$ and its associated optimal download vector $(D_{1}, D_{2}, \dots, D_{N})$, the corresponding total storage cost $C = \sum_{n = 1}^{N} C_{n}$ is calculated. This results in a point $(D_{target}, C)$ on the D-C plane for the given $η$. The set of all points $(D_{target}, C)$ obtained from all feasible $(η_{1}, η_{2})$ pairs forms a set of achievable $(D_{target}, C)$ points, and the trade-off curve is obtained by finding its lower convex hull. Although this numerical method does not guarantee the exploration of the entire feasible region, the resulting curve provides a conservative approximation of the optimal solution. In other words, the true optimal trade-off curve can only lie on or to the lower left of the curve presented in the figure, which serves as an achievable upper bound for the $(D_{target}, C_{min} (D_{target}))$ curve.
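The last step of this procedure, extracting the lower convex hull from the sampled $(D_{target}, C)$ points, can be sketched as follows. This is a generic monotone-chain routine and not necessarily the implementation used to produce Figure 2; the sample points are hypothetical.

```python
def lower_convex_hull(points):
    """Return the lower convex hull of 2-D points, sorted by the first coordinate."""
    def cross(o, a, b):
        # Positive when o -> a -> b makes a counter-clockwise (left) turn.
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    hull = []
    for p in sorted(set(points)):
        # Drop the last hull point while it lies on or above the segment hull[-2] -> p.
        while len(hull) >= 2 and cross(hull[-2], hull[-1], p) <= 0:
            hull.pop()
        hull.append(p)
    return hull

# Hypothetical (D_target, C) samples obtained from different eta pairs.
samples = [(0, 3), (1, 1), (1, 3), (2, 2), (3, 0)]
curve = lower_convex_hull(samples)
```

Points such as (2, 2) are discarded because time-sharing between (1, 1) and (3, 0) already yields a lower storage cost at the same total download cost.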

4. An Achievability Scheme for Asymmetric Setting

As discussed in the introduction, our achievability scheme construction begins with a scheme for Asymmetric MDS-GXSTPLC. Specifically, in this asymmetric variant, the security and privacy constraints are parameterized by two tuples $X = (X_{1}, X_{2}, \dots, X_{M})$ and $T = (T_{1}, T_{2}, \dots, T_{M})$, and for each message set $m \in [M]$, the corresponding constraints are $X_{m}$-security and $T_{m}$-privacy, i.e.,

\begin{matrix} I (S_{\mathcal{X}}; W_{m}) = 0, & \forall \mathcal{X} \subset [N], | \mathcal{X} | = X_{m}, \end{matrix}

(25)

\begin{matrix} I (Q_{\mathcal{T}}^{(Λ)}; {(λ_{m, k})}_{k \in [K_{m}]}) = 0, & \forall \mathcal{T} \subset [N], | \mathcal{T} | = T_{m} . \end{matrix}

(26)

Moreover, for each message set $m \in [M]$, the codewords distributed among the servers $n \in R_{m}$ must form a $(ρ_{m}, X_{m} + η_{m})$-MDS code; i.e., the messages $W_{m}$ must be recoverable from any $X_{m} + η_{m}$ of its codewords, and the size of each codeword is at most $L / η_{m}$ in q-ary units, where $η_{m} \in \mathbb{Z}_{> 0}$ for all $m \in [M]$. In particular, we show that the following (uniform) normalized download cost is achievable for Asymmetric MDS-GXSTPLC for all $n \in [N]$:

\begin{matrix} D_{n} = \frac{1}{{(\min_{m \in [M]} (ρ_{m} - X_{m} - T_{m} - η_{m} + 1))}^{+}} . \end{matrix}

(27)

Once the achievability scheme for the asymmetric setting is established, we then employ the idea of the augmented system adapted from [38] to show that the normalized download cost in Theorem 1 is achievable. The remainder of this section is devoted to the presentation of the achievability scheme for the problem of Asymmetric MDS-GXSTPLC.

4.1. Preliminaries

Recall that the security and privacy levels are parameterized by two tuples $X = (X_{1}, X_{2}, \dots, X_{M})$ and $T = (T_{1}, T_{2}, \dots, T_{M})$. Also, the MDS code constraint is parameterized by a set of M positive integers $η_{1}, η_{2}, \dots, η_{M}$. Moreover, let us assume that for all $m \in [M]$, we have $ρ_{m} > X_{m} + T_{m} + η_{m} - 1$; otherwise, our scheme is infeasible. Let us define $μ = \min_{m \in [M]} (ρ_{m} - X_{m} - T_{m} - η_{m} + 1)$, and set $L = \mathrm{lcm} (μ, η_{1}, η_{2}, \dots, η_{M})$; i.e., each of the K messages consists of L i.i.d. symbols from the finite field $\mathbb{F}_{q}$. Note that this choice of L allows us to define a positive integer $J_{m} = L / η_{m}$ for all $m \in [M]$, since $η_{m} \mid L$. Now for all $m \in [M]$, let us define a mapping $ϕ_{m} : [J_{m}] \times [η_{m}] \to [L]$ as $ϕ_{m} (ℓ, κ) = ℓ + (κ - 1) J_{m}$, i.e., the column-major order reshaping of $[L]$. It is obvious that $ϕ_{m}$ is invertible, and its inverse is denoted as $ϕ_{m}^{- 1}$. Note that the mapping $ϕ_{m}$ allows us to reshape message vectors $(W_{m, k})_{k \in [K_{m}]}$ into $J_{m} \times η_{m}$ matrices for all $m \in [M]$. In other words, for all $m \in [M], k \in [K_{m}], ℓ \in [J_{m}], κ \in [η_{m}]$, we define $W_{m, k} (ℓ, κ) = W_{m, k} (ϕ_{m} (ℓ, κ))$. Also, we need a total of $N + L$ distinct elements from $\mathbb{F}_{q}$, denoted as $α_{1}, α_{2}, \dots, α_{N}, f_{1}, f_{2}, \dots, f_{L}$. The existence of such distinct elements is guaranteed by selecting a sufficiently large field with $q \geq N + L$. For all $m \in [M], ℓ \in [J_{m}], κ \in [η_{m}]$, let us define $f_{ℓ, κ}^{(m)} = f_{ϕ_{m} (ℓ, κ)}$. Finally, for all $m \in [M], ℓ \in [J_{m}], κ \in [η_{m}]$, let us define $W_{m, ℓ, κ} = [W_{m, 1} (ℓ, κ), W_{m, 2} (ℓ, κ), \dots, W_{m, K_{m}} (ℓ, κ)]^{\top}$ and for all $m \in [M]$, define $λ_{m} = [λ_{m, 1}, λ_{m, 2}, \dots, λ_{m, K_{m}}]^{\top}$. Clearly, the desired linear combination can be equivalently written in the following form

\begin{matrix} λ_{Λ} (W) = {(\sum_{m \in [M]} W_{m, ℓ, κ}^{⊤} λ_{m})}_{ℓ \in [J_{m}], κ \in [η_{m}]} . \end{matrix}

(28)
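The quantities defined in this subsection are easy to compute. The sketch below, whose function names are assumptions for illustration, derives $μ$, $L$, and $J_{m}$ and implements the column-major mapping $ϕ_{m}$ with its inverse, using the thresholds of the example in Section 4.5 ($ρ = (12, 10)$, $X = T = (3, 2)$, $η = (2, 2)$).

```python
from math import lcm  # Python 3.9+

def preliminaries(rho, X, T, eta):
    """Return (mu, L, J) with J[m] = L / eta[m], following the definitions above."""
    mu = min(r - x - t - e + 1 for r, x, t, e in zip(rho, X, T, eta))
    if mu <= 0:
        raise ValueError("infeasible: need rho_m > X_m + T_m + eta_m - 1 for all m")
    L = lcm(mu, *eta)
    return mu, L, [L // e for e in eta]

def phi(l, kappa, J_m):
    """Column-major index: (l, kappa) in [J_m] x [eta_m] -> position in [L] (1-based)."""
    return l + (kappa - 1) * J_m

def phi_inv(idx, J_m):
    """Inverse of phi: position in [L] -> (l, kappa), both 1-based."""
    return (idx - 1) % J_m + 1, (idx - 1) // J_m + 1

mu, L, J = preliminaries(rho=[12, 10], X=[3, 2], T=[3, 2], eta=[2, 2])
```

For these thresholds both message sets give $ρ_{m} - X_{m} - T_{m} - η_{m} + 1 = 5$, so each message is split into $L = 10$ symbols arranged as a $5 \times 2$ matrix.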

4.2. Construction of the Storage

For all $m \in [M]$, let us define the following null-shaper polynomial in $α$

\begin{matrix} N_{m} (α) = \prod_{n \in [N] ∖ R_{m}} (α - α_{n}), \end{matrix}

(29)

and for all $m \in [M], ℓ \in [J_{m}]$, let us define the following (vector-valued) rational function in $α$

\begin{matrix} {\tilde{W}}_{m, ℓ} (α) = N_{m} (α) (\sum_{κ \in [η_{m}]} \frac{N_{m} {(f_{ℓ, κ}^{(m)})}^{- 1}}{α - f_{ℓ, κ}^{(m)}} W_{m, ℓ, κ} + \sum_{x \in [X_{m}]} α^{x - 1} Z_{m, ℓ, x}) \end{matrix}

(30)

where for all $m \in [M], ℓ \in [J_{m}], x \in [X_{m}]$, $Z_{m, ℓ, x}$ are uniformly i.i.d. column vectors from $\mathbb{F}_{q}^{K_{m}}$, independent of the messages. Note that by the definition of $N_{m} (α)$, for all $m \in [M], n \in [N] ∖ R_{m}$, we have $\tilde{W}_{m, ℓ} (α_{n}) = 0$, i.e., the rational function $\tilde{W}_{m, ℓ} (α)$, evaluated at the points corresponding to the servers that are prohibited from storing codewords of the $m^{th}$ message set, is zero. Moreover, by partial fraction decomposition, we can also write

\begin{matrix} {\tilde{W}}_{m, ℓ} (α) = & \sum_{κ \in [η_{m}]} \frac{1}{α - f_{ℓ, κ}^{(m)}} W_{m, ℓ, κ} + \sum_{i \in [X_{m} + N - ρ_{m}]} α^{i - 1} Y_{m, ℓ, i}, \end{matrix}

(31)

where for all $m \in [M], ℓ \in [J_{m}]$, $(Y_{m, ℓ, i})_{i \in [X_{m} + N - ρ_{m}]}$ are various linear combinations of $(W_{m, ℓ, κ})_{κ \in [η_{m}]}$ and $(Z_{m, ℓ, x})_{x \in [X_{m}]}$. Now, for all $n \in [N]$, the storage at Server n is constructed as follows

\begin{matrix} S_{n} = \{{\tilde{W}}_{m, ℓ} (α_{n}) ∣ m \in [M], ℓ \in [J_{m}]\} . \end{matrix}

(32)

Again, while evaluating the rational function $\tilde{W}_{m, ℓ} (α)$ at all $α_{n}, n \in [N]$ might seem to violate the graph-based storage pattern, this apparent conflict is resolved because our construction guarantees that for each message set $m \in [M]$, the evaluations for servers that refrain from storing any codeword of that message set are explicitly set to zero; hence, no storage is necessary. Moreover, for all $m \in [M], k \in [K_{m}]$, the size of the codeword of $W_{m, k}$ for Server $n$, $n \in R_{m}$ is $J_{m} = L / η_{m}$. In addition, for all $m \in [M], ℓ \in [J_{m}]$ and $\mathcal{K} \subset R_{m}$ such that $| \mathcal{K} | = η_{m} + X_{m}$, given $(\tilde{W}_{m, ℓ} (α_{n}))_{n \in \mathcal{K}}$, the message symbols $(W_{m, ℓ, κ})_{κ \in [η_{m}]}$ are recoverable. This is because for all $n \in R_{m}$, we have $N_{m} (α_{n}) \neq 0$, so from $(N_{m} (α_{n})^{- 1} \tilde{W}_{m, ℓ} (α_{n}))_{n \in \mathcal{K}}$, $(W_{m, ℓ, κ})_{κ \in [η_{m}]}$ are recoverable by inverting the following (scaled) Cauchy–Vandermonde matrix

\begin{matrix} [\begin{matrix} \frac{N_{m} {(f_{ℓ, 1}^{(m)})}^{- 1}}{α_{i_{1}} - f_{ℓ, 1}^{(m)}} & \dots & \frac{N_{m} {(f_{ℓ, η_{m}}^{(m)})}^{- 1}}{α_{i_{1}} - f_{ℓ, η_{m}}^{(m)}} & 1 & \dots & α_{i_{1}}^{X_{m} - 1} \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ \frac{N_{m} {(f_{ℓ, 1}^{(m)})}^{- 1}}{α_{i_{η_{m} + X_{m}}} - f_{ℓ, 1}^{(m)}} & \dots & \frac{N_{m} {(f_{ℓ, η_{m}}^{(m)})}^{- 1}}{α_{i_{η_{m} + X_{m}}} - f_{ℓ, η_{m}}^{(m)}} & 1 & \dots & α_{i_{η_{m} + X_{m}}}^{X_{m} - 1} \end{matrix}] \end{matrix}

(33)

where $\mathcal{K} = \{i_{1}, i_{2}, \dots, i_{η_{m} + X_{m}}\}$, and for any distinct $α_{i_{1}}, α_{i_{2}}, \dots, α_{i_{η_{m} + X_{m}}}, f_{ℓ, 1}^{(m)}, f_{ℓ, 2}^{(m)}, \dots, f_{ℓ, η_{m}}^{(m)}$, the Cauchy–Vandermonde matrix must be invertible. Finally, guaranteed by the $(N, X_{m})$-MDS coded uniform noise terms, for all $m \in [M]$, the storage at any $X_{m}$ servers is independent of $W_{m}$, i.e., the storage is $X_{m}$-secure with respect to the $m^{th}$ message set.
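The storage construction can be checked numerically on a toy instance. The sketch below works over a small prime field with scalar symbols ($K_{m} = 1$) and a single row $ℓ$. The field size 101, the evaluation points, the server subset, and all helper names are assumptions for illustration. It verifies that the null shaper zeroes out the prohibited servers and that any $η_{m} + X_{m}$ permitted servers recover the message symbols.

```python
import random

P = 101  # prime field size (assumed; the scheme needs q >= N + L)

def inv(a):
    return pow(a % P, -1, P)  # modular inverse, Python 3.8+

def null_shaper(alpha, alphas, allowed):
    """N_m(alpha): product of (alpha - alpha_n) over servers n NOT in `allowed`."""
    out = 1
    for n, a_n in alphas.items():
        if n not in allowed:
            out = out * (alpha - a_n) % P
    return out

def encode(alpha, alphas, allowed, f, W, Z):
    """Evaluate the storage function (30) at alpha for scalar symbols W and noise Z."""
    cauchy = sum(inv(null_shaper(fk, alphas, allowed)) * inv(alpha - fk) * w
                 for fk, w in zip(f, W))
    noise = sum(pow(alpha, x, P) * z for x, z in enumerate(Z))
    return null_shaper(alpha, alphas, allowed) * (cauchy + noise) % P

def solve_mod(A, b):
    """Solve A x = b over F_P by Gauss-Jordan elimination (A assumed invertible)."""
    n = len(A)
    M = [[v % P for v in row] + [bi % P] for row, bi in zip(A, b)]
    for c in range(n):
        piv = next(r for r in range(c, n) if M[r][c])
        M[c], M[piv] = M[piv], M[c]
        s = inv(M[c][c])
        M[c] = [v * s % P for v in M[c]]
        for r in range(n):
            if r != c and M[r][c]:
                t = M[r][c]
                M[r] = [(vr - t * vc) % P for vr, vc in zip(M[r], M[c])]
    return [M[r][n] for r in range(n)]

alphas = {n: n for n in range(1, 7)}   # alpha_n = n for servers 1..6
allowed = {1, 2, 3, 4}                 # R_m: servers 5 and 6 must store nothing
f, W = [10, 11], [7, 42]               # eta_m = 2 message symbols
Z = [random.randrange(P)]              # X_m = 1 noise symbol
stored = {n: encode(a, alphas, allowed, f, W, Z) for n, a in alphas.items()}

subset = [1, 3, 4]                     # any eta_m + X_m = 3 servers in R_m
A = [[inv(null_shaper(fk, alphas, allowed)) * inv(alphas[n] - fk) % P for fk in f]
     + [pow(alphas[n], x, P) for x in range(len(Z))] for n in subset]
b = [inv(null_shaper(alphas[n], alphas, allowed)) * stored[n] % P for n in subset]
recovered = solve_mod(A, b)
```

The matrix `A` is the scaled Cauchy–Vandermonde matrix of (33) for this toy instance, and the last entry of `recovered` is the noise symbol.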

4.3. Construction of the Queries

For all $m \in [M], ℓ \in [J_{m}], κ \in [η_{m}]$, let us define the following polynomial in $α$

\begin{matrix} Q_{m, ℓ, κ} (α) = \prod_{κ^{'} \in [η_{m}] ∖ {κ}} \frac{α - f_{ℓ, κ^{'}}^{(m)}}{f_{ℓ, κ}^{(m)} - f_{ℓ, κ^{'}}^{(m)}} λ_{m} + \prod_{κ^{'} \in [η_{m}]} (α - f_{ℓ, κ^{'}}^{(m)}) (\sum_{t \in [T_{m}]} α^{t - 1} Z_{m, ℓ, κ, t}^{'}), \end{matrix}

(34)

where for all $m \in [M], ℓ \in [J_{m}], κ \in [η_{m}], t \in [T_{m}]$, $Z_{m, ℓ, κ, t}^{'}$ are uniformly i.i.d. column vectors from $\mathbb{F}_{q}^{K_{m}}$, independent of the coefficients. Now the query for Server n is constructed as

\begin{matrix} Q_{n}^{(Λ)} = \{Q_{m, ℓ, κ} (α_{n}) ∣ m \in [M], ℓ \in [J_{m}], κ \in [η_{m}]\} . \end{matrix}

(35)

Similarly, due to the fact that the coefficients are protected by the $(N, T_{m})$-MDS coded uniform noise terms, for all $m \in [M]$, the queries are $T_{m}$-private with respect to the $m^{th}$ message set.
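The structure of the query polynomial (34) can be illustrated with a scalar coefficient over a small prime field. In the sketch below, the field size, the points $f$, the noise values, and the function name `query` are assumptions for illustration. The first term is a Lagrange basis polynomial that equals 1 at $f_{ℓ, κ}^{(m)}$ and 0 at the other $f$ points, while the noise term vanishes at every $f$ point.

```python
P = 101  # prime field size (assumed)

def inv(a):
    return pow(a % P, -1, P)  # modular inverse, Python 3.8+

def query(alpha, kappa, f, lam, Zp):
    """Evaluate Q_{m,l,kappa}(alpha) from (34) for a scalar coefficient `lam`.

    `kappa` is a 0-based index into f; Zp holds the T_m noise symbols.
    """
    lagrange = 1
    for j, fj in enumerate(f):
        if j != kappa:
            lagrange = lagrange * (alpha - fj) * inv(f[kappa] - fj) % P
    vanish = 1
    for fj in f:
        vanish = vanish * (alpha - fj) % P
    noise = sum(pow(alpha, t, P) * z for t, z in enumerate(Zp))
    return (lagrange * lam + vanish * noise) % P

f = [10, 11]          # eta_m = 2 evaluation points
lam, Zp = 9, [3, 4]   # coefficient and T_m = 2 noise symbols
at_own_point = query(f[0], 0, f, lam, Zp)
at_other_point = query(f[1], 0, f, lam, Zp)
```

This interpolation property is what leaves exactly one desired term with a pole at $f_{ℓ, κ}^{(m)}$ when the query is multiplied with the stored codeword in the answer construction.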

4.4. Construction of the Answers

Let us define $V = L / μ$; since $μ \mid L$, V is a positive integer. Once the query $Q_{n}^{(Λ)}$ is available at Server n, for all $n \in [N]$, the answer $A_{n}^{(Λ)}$ is constructed as follows

\begin{matrix} A_{n}^{(Λ)} = \{\sum_{\begin{matrix} m \in [M] \\ (ℓ, κ) \in ϕ_{m}^{- 1} [(v - 1) μ + 1 : v μ] \end{matrix}} {\tilde{W}}_{m, ℓ} {(α_{n})}^{⊤} Q_{m, ℓ, κ} (α_{n}) | v \in [V]\} . \end{matrix}

(36)

To see the correctness of our scheme, note that the term $\tilde{W}_{m, ℓ} (α_{n})^{\top} Q_{m, ℓ, κ} (α_{n})$ can be viewed as the evaluation of the rational function $\tilde{W}_{m, ℓ} (α)^{\top} Q_{m, ℓ, κ} (α)$ at $α_{n}$, where, exploiting the equivalent form of $\tilde{W}_{m, ℓ} (α)$ in (31), we can write

\begin{matrix} {\tilde{W}}_{m, ℓ} {(α)}^{⊤} Q_{m, ℓ, κ} (α) = \frac{1}{α - f_{ℓ, κ}^{(m)}} W_{m, ℓ, κ}^{⊤} λ_{m} + \sum_{i \in [X_{m} + N - ρ_{m} + η_{m} + T_{m} - 1]} α^{i - 1} Y_{m, ℓ, κ, i}^{'} \end{matrix}

(37)

where for all $m \in [M], ℓ \in [J_{m}], κ \in [η_{m}], i \in [X_{m} + N - ρ_{m} + η_{m} + T_{m} - 1]$, $Y_{m, ℓ, κ, i}^{'}$ are various linear combinations of $(W_{m, ℓ, κ})_{κ \in [η_{m}]}$, $(Y_{m, ℓ, i})_{i \in [X_{m} + N - ρ_{m}]}$, $λ_{m}$ and $(Z_{m, ℓ, κ, t}^{'})_{t \in [T_{m}]}$. Note that by the definition of $μ$, for all $m \in [M]$, we have $μ \leq ρ_{m} - X_{m} - T_{m} - η_{m} + 1$. Therefore $X_{m} + N - ρ_{m} + η_{m} + T_{m} - 1 = N - (ρ_{m} - X_{m} - T_{m} - η_{m} + 1) \leq N - μ$; i.e., we can equivalently write

\begin{matrix} {\tilde{W}}_{m, ℓ} {(α)}^{⊤} Q_{m, ℓ, κ} (α) = \frac{1}{α - f_{ℓ, κ}^{(m)}} W_{m, ℓ, κ}^{⊤} λ_{m} + \sum_{i \in [N - μ]} α^{i - 1} Y_{m, ℓ, κ, i}^{'} \end{matrix}

(38)

where for all $N - (ρ_{m} - X_{m} - T_{m} - η_{m} + 1) < i \leq N - μ$, we simply set $Y_{m, ℓ, κ, i}^{'} = 0$. Plugging this form into the definition of the answer, and recalling that for all $m \in [M]$, $ϕ_{m}$ represents the column-major order reshaping of $[L]$, we obtain

\begin{matrix} \sum_{m \in [M], (ℓ, κ) \in ϕ_{m}^{- 1} [(v - 1) μ + 1 : v μ]} {\tilde{W}}_{m, ℓ} {(α_{n})}^{⊤} Q_{m, ℓ, κ} (α_{n}) \\ = \sum_{l \in [(v - 1) μ + 1 : v μ]} \frac{1}{α_{n} - f_{l}} (\sum_{m \in [M]} W_{m, (ϕ_{m}^{- 1} (l))}^{⊤} λ_{m}) + \sum_{i \in [N - μ]} α_{n}^{i - 1} Y_{v, i}^{″} \end{matrix}

(39)

where for all $v \in [V]$,

\begin{matrix} Y_{v, i}^{″} = \sum_{m \in [M], (ℓ, κ) \in ϕ_{m}^{- 1} [(v - 1) μ + 1 : v μ]} Y_{m, ℓ, κ, i}^{'} . \end{matrix}

(40)

Now it is clear that for all $v \in [V]$, the desired linear combination symbols $(\sum_{m \in [M]} W_{m, ℓ, κ}^{\top} λ_{m})_{(ℓ, κ) \in ϕ_{m}^{- 1} [(v - 1) μ + 1 : v μ]}$ can be decoded by inverting the following Cauchy–Vandermonde matrix

\begin{matrix} \underset{{C S A}_{N, μ}^{q} (α, f)}{\underset{︸}{[\begin{matrix} \frac{1}{α_{1} - f_{(v - 1) μ + 1}} & \dots & \frac{1}{α_{1} - f_{v μ}} & 1 & \dots & α_{1}^{N - μ - 1} \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ \frac{1}{α_{N} - f_{(v - 1) μ + 1}} & \dots & \frac{1}{α_{N} - f_{v μ}} & 1 & \dots & α_{N}^{N - μ - 1} \end{matrix}]}}, \end{matrix}

(41)

whose invertibility is ensured by the fact that

f_{1}, f_{2}, \dots, f_{L}, α_{1}, α_{2}, \dots, α_{N}

are distinct. Finally, note that for each server n, a total of V symbols are downloaded, and the normalized download cost of the scheme is, thus,

\begin{matrix} D_{n} = \frac{V}{L} = \frac{1}{μ} = \frac{1}{{min}_{m \in [M]} ρ_{m} - X_{m} - T_{m} - η_{m} + 1}, \forall n \in [N] . \end{matrix}

(42)
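As a quick numerical illustration of (42), the following minimal Python sketch evaluates μ and the per-server normalized download cost. The function name `compute_mu` is hypothetical, and the parameter values are those of the example in Section 4.5; exact rational arithmetic is used so that the cost is not rounded.

```python
from fractions import Fraction

def compute_mu(rho, X, T, eta):
    """mu = min over m of (rho_m - X_m - T_m - eta_m + 1), as used in (42)."""
    return min(r - x - t - e + 1 for r, x, t, e in zip(rho, X, T, eta))

# Parameters of the example in Section 4.5 (two message sets).
rho = [12, 10]   # rho_m = |R_m|
X = [3, 2]       # security thresholds X_m
T = [3, 2]       # privacy thresholds T_m
eta = [2, 2]     # MDS parameters eta_m

mu = compute_mu(rho, X, T, eta)
D_n = Fraction(1, mu)   # normalized download cost of every server, D_n = 1/mu
print("mu =", mu, "D_n =", D_n)
```

Both message sets attain the minimum here, so neither one alone limits the download cost in this example.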

4.5. Motivating Example

Let us elaborate on our scheme via an illustrative example with

N = 20

servers that store

M = 2

message sets according to the following storage pattern.

\begin{matrix} R_{1} & = {3, 4, 5, 6, 7, 8, 13, 14, 15, 16, 17, 18}, & ρ_{1} & = 12, \end{matrix}

(43a)

\begin{matrix} R_{2} & = {1, 2, 9, 10, 11, 12, 16, 17, 19, 20}, & ρ_{2} & = 10 . \end{matrix}

(43b)

Equivalently, in terms of the message sets stored at each server, we can write

\begin{matrix} \begin{matrix} M_{1} = {2}, & M_{2} = {2}, & M_{3} = {1}, & M_{4} = {1}, \\ M_{5} = {1}, & M_{6} = {1}, & M_{7} = {1}, & M_{8} = {1}, \\ M_{9} = {2}, & M_{10} = {2}, & M_{11} = {2}, & M_{12} = {2}, \\ M_{13} = {1}, & M_{14} = {1}, & M_{15} = {1}, & M_{16} = {1, 2}, \\ M_{17} = {1, 2}, & M_{18} = {1}, & M_{19} = {2}, & M_{20} = {2} . \end{matrix} \end{matrix}

(44)

Moreover, let us set the asymmetric security and privacy thresholds

X_{1} = 3, X_{2} = 2

,

T_{1} = 3, T_{2} = 2

. Also, the MDS code constraint is parameterized by two positive integers

η_{1} = 2, η_{2} = 2

. According to the definition of Section 4.1, we have

μ = 5, L = 10

. Recall that the K messages are partitioned into two disjoint sets

W_{1}

and

W_{2}

, where

\begin{matrix} W_{1} = {W_{1, 1}, W_{1, 2}, \dots, W_{1, K_{1}}}, \end{matrix}

(45a)

\begin{matrix} W_{2} = {W_{2, 1}, W_{2, 2}, \dots, W_{2, K_{2}}} . \end{matrix}

(45b)

Each message comprises

L = 10

symbols from

F_{q}

, where

q \geq 30

, i.e.,

\begin{matrix} W_{1, k} = {[W_{1, k} (1), W_{1, k} (2), \dots, W_{1, k} (10)]}^{⊤}, k \in [K_{1}], \end{matrix}

(46)

\begin{matrix} W_{2, k} = {[W_{2, k} (1), W_{2, k} (2), \dots, W_{2, k} (10)]}^{⊤}, k \in [K_{2}] . \end{matrix}

(47)

Let

α_{1}, α_{2}, \dots, α_{20}, f_{1, 1}, \dots, f_{5, 1}, f_{1, 2}, \dots, f_{5, 2}

be 30 distinct constants from the finite field

F_{q}

. According to the definition of Section 4.1, we have

J_{1} = L / η_{1} = 5

and

J_{2} = L / η_{2} = 5

. For all

ℓ \in [5], κ \in [2]

, let us define

\begin{matrix} W_{1, ℓ, κ} = {[W_{1, 1} (ℓ + 5 (κ - 1)), W_{1, 2} (ℓ + 5 (κ - 1)), \dots, W_{1, K_{1}} (ℓ + 5 (κ - 1))]}^{⊤} \end{matrix}

(48)

\begin{matrix} W_{2, ℓ, κ} = {[W_{2, 1} (ℓ + 5 (κ - 1)), W_{2, 2} (ℓ + 5 (κ - 1)), \dots, W_{2, K_{2}} (ℓ + 5 (κ - 1))]}^{⊤}, \end{matrix}

(49)

and define

\begin{matrix} λ_{1} = {[λ_{1, 1}, λ_{1, 2}, \dots, λ_{1, K_{1}}]}^{⊤}, \end{matrix}

(50)

\begin{matrix} λ_{2} = {[λ_{2, 1}, λ_{2, 2}, \dots, λ_{2, K_{2}}]}^{⊤} . \end{matrix}

(51)

Then the desired linear combination of the K messages, which are partitioned into the two message sets, can be written in the following form.

\begin{matrix} λ_{Λ} (W) = {[\sum_{m \in [2]} W_{m, 1, 1}^{⊤} λ_{m}, \dots, \sum_{m \in [2]} W_{m, 5, 1}^{⊤} λ_{m}, \sum_{m \in [2]} W_{m, 1, 2}^{⊤} λ_{m}, \dots, \sum_{m \in [2]} W_{m, 5, 2}^{⊤} λ_{m}]}^{⊤} . \end{matrix}

(52)

For all

m \in [2]

, let us define the following null-shaper polynomial in

α

,

\begin{matrix} N_{m} (α) = \prod_{n \in [N] ∖ R_{m}} (α - α_{n}) . \end{matrix}

(53)

Let us define the following (vector-valued) rational function in

α

,

\begin{matrix} {\tilde{W}}_{1, ℓ} (α) ≜ & N_{1} (α) (\frac{N_{1} {(f_{ℓ, 1})}^{- 1}}{α - f_{ℓ, 1}} W_{1, ℓ, 1} + \frac{N_{1} {(f_{ℓ, 2})}^{- 1}}{α - f_{ℓ, 2}} W_{1, ℓ, 2} + \sum_{x \in [3]} α^{x - 1} Z_{1, ℓ, x}) \end{matrix}

(54)

\begin{matrix} = & \frac{1}{α - f_{ℓ, 1}} W_{1, ℓ, 1} + \frac{1}{α - f_{ℓ, 2}} W_{1, ℓ, 2} + \sum_{i \in [11]} α^{i - 1} Y_{1, ℓ, i} . \end{matrix}

(55)

Note that since for all

n \in [N] ∖ R_{1}

,

N_{1} (α_{n}) = 0

, we have

{\tilde{W}}_{1, ℓ} (α_{n}) = 0

for all

ℓ \in [5]

. In (55), for all

ℓ \in [5]

,

{(Y_{1, ℓ, i})}_{i \in [11]}

are various linear combinations of

{(W_{1, ℓ, κ})}_{κ \in [2]}

and

{(Z_{1, ℓ, x})}_{x \in [3]}

. Since

| R_{1} | = 12

, the degree of the last term (viewed as a polynomial in

α

) in (55) is 10.

Let us define the following (vector-valued) rational function in

α

,

\begin{matrix} {\tilde{W}}_{2, ℓ} (α) & ≜ N_{2} (α) (\frac{N_{2} {(f_{ℓ, 1})}^{- 1}}{α - f_{ℓ, 1}} W_{2, ℓ, 1} + \frac{N_{2} {(f_{ℓ, 2})}^{- 1}}{α - f_{ℓ, 2}} W_{2, ℓ, 2} + \sum_{x \in [2]} α^{x - 1} Z_{2, ℓ, x}) \end{matrix}

(56)

\begin{matrix} = \frac{1}{α - f_{ℓ, 1}} W_{2, ℓ, 1} + \frac{1}{α - f_{ℓ, 2}} W_{2, ℓ, 2} + \sum_{i \in [12]} α^{i - 1} Y_{2, ℓ, i} . \end{matrix}

(57)

Note that since for all

n \in [N] ∖ R_{2}

,

N_{2} (α_{n}) = 0

, we have

{\tilde{W}}_{2, ℓ} (α_{n}) = 0

for all

ℓ \in [5]

. In (57), for all

ℓ \in [5]

,

{(Y_{2, ℓ, i})}_{i \in [12]}

are various linear combinations of

{(W_{2, ℓ, κ})}_{κ \in [2]}

and

{(Z_{2, ℓ, x})}_{x \in [2]}

. Since

| R_{2} | = 10

, the degree of the last term (viewed as a polynomial in

α

) in (57) is 11.
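The role of the null shaper in (53) can be checked numerically. The following minimal Python sketch evaluates the polynomial at every server's evaluation point and confirms that it vanishes exactly at the servers outside the storage set, and it recomputes the degrees of the polynomial parts of (55) and (57). The field size 31, the choice of evaluation points α_n = n, and the function name `null_shaper` are illustrative assumptions; the scheme only requires distinct constants from a sufficiently large field.

```python
P = 31                      # assumed prime field size (the example needs q >= 30)
N = 20
R = {1: {3, 4, 5, 6, 7, 8, 13, 14, 15, 16, 17, 18},
     2: {1, 2, 9, 10, 11, 12, 16, 17, 19, 20}}
X = {1: 3, 2: 2}
alpha = {n: n for n in range(1, N + 1)}   # assumed evaluation points alpha_n = n

def null_shaper(m, a):
    """Evaluate N_m(a), the product of (a - alpha_n) over servers n outside R_m, mod P."""
    value = 1
    for n in range(1, N + 1):
        if n not in R[m]:
            value = value * (a - alpha[n]) % P
    return value

degrees = {}
for m in (1, 2):
    vanishing = {n for n in alpha if null_shaper(m, alpha[n]) == 0}
    # The shaper vanishes exactly at the servers that must not store message set m.
    assert vanishing == set(alpha) - R[m]
    # Degree of the polynomial part: deg N_m = N - |R_m|, plus X_m - 1 from the noise.
    degrees[m] = (N - len(R[m])) + X[m] - 1

print(degrees)
```

Because the evaluation points are distinct, the shaper is nonzero at every server in the storage set, so the codewords stored there are not affected by the zero-forcing.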

For all

n \in [20]

, the storage at Server n is

S_{n} = {{\tilde{W}}_{1, ℓ} (α_{n}), {\tilde{W}}_{2, ℓ} (α_{n}) ∣ ℓ \in [5]}

. The MDS(

20, 3

) coded random noise term in (54) guarantees that the messages

{(W_{1, k})}_{k \in [K_{1}]}

are

X_{1} = 3

-secure. Similarly, the MDS(

20, 2

) coded random noise term in (56) guarantees that the messages

{(W_{2, k})}_{k \in [K_{2}]}

are

X_{2} = 2

-secure. Thus we have

\begin{matrix} I (S_{X}; W_{1}) = 0, & \forall X \subset [20], | X | = 3, \end{matrix}

(58a)

\begin{matrix} I (S_{X}; W_{2}) = 0, & \forall X \subset [20], | X | = 2 . \end{matrix}

(58b)

Note that for all

n \in [20], m \in [2] ∖ M_{n}, ℓ \in [5]

,

{\tilde{W}}_{m, ℓ} (α_{n}) = 0

holds, i.e., if the graph-based storage pattern prohibits a server from storing a certain message, the corresponding codeword is explicitly set to zero.

For all

ℓ \in [5], κ \in [2], t \in [3]

, let

Z_{1, ℓ, κ, t}^{'}

be uniformly i.i.d. column vectors from

F_{q}^{K_{1}}

, independent of the coefficients. For all

ℓ \in [5]

, let us define the following rational functions in

α

,

\begin{matrix} Q_{1, ℓ, 1} (α) = \frac{α - f_{ℓ, 2}}{f_{ℓ, 1} - f_{ℓ, 2}} λ_{1} + (α - f_{ℓ, 1}) (α - f_{ℓ, 2}) (\sum_{t \in [3]} α^{t - 1} Z_{1, ℓ, 1, t}^{'}), \end{matrix}

(59)

\begin{matrix} Q_{1, ℓ, 2} (α) = \frac{α - f_{ℓ, 1}}{f_{ℓ, 2} - f_{ℓ, 1}} λ_{1} + (α - f_{ℓ, 1}) (α - f_{ℓ, 2}) (\sum_{t \in [3]} α^{t - 1} Z_{1, ℓ, 2, t}^{'}) . \end{matrix}

(60)

For all

ℓ \in [5], κ \in [2], t \in [2]

, let

Z_{2, ℓ, κ, t}^{'}

be uniformly i.i.d. column vectors from

F_{q}^{K_{2}}

, independent of the coefficients. For all

ℓ \in [5]

, let us define the following rational functions in

α

,

\begin{matrix} Q_{2, ℓ, 1} (α) = \frac{α - f_{ℓ, 2}}{f_{ℓ, 1} - f_{ℓ, 2}} λ_{2} + (α - f_{ℓ, 1}) (α - f_{ℓ, 2}) (\sum_{t \in [2]} α^{t - 1} Z_{2, ℓ, 1, t}^{'}), \end{matrix}

(61)

\begin{matrix} Q_{2, ℓ, 2} (α) = \frac{α - f_{ℓ, 1}}{f_{ℓ, 2} - f_{ℓ, 1}} λ_{2} + (α - f_{ℓ, 1}) (α - f_{ℓ, 2}) (\sum_{t \in [2]} α^{t - 1} Z_{2, ℓ, 2, t}^{'}) . \end{matrix}

(62)

For all

n \in [20]

, the query sent to Server n is constructed as

Q_{n}^{(Λ)} = {Q_{1, ℓ, κ} (α_{n}),

Q_{2, ℓ, κ} (α_{n}) ∣ ℓ \in [5], κ \in [2]}

. The MDS(

20, 3

) coded random noise terms in (59) and (60) guarantee that the coefficient

λ_{1}

is

T_{1} = 3

-private. Similarly, the MDS(

20, 2

) coded random noise terms in (61) and (62) guarantee that the coefficient

λ_{2}

is

T_{2} = 2

-private. Thus we have

\begin{matrix} I (Q_{T}^{(Λ)}; {(λ_{1, k})}_{k \in [K_{1}]}) = 0, & \forall T \subset [20], | T | = 3, \end{matrix}

(63a)

\begin{matrix} I (Q_{T}^{(Λ)}; {(λ_{2, k})}_{k \in [K_{2}]}) = 0, & \forall T \subset [20], | T | = 2 . \end{matrix}

(63b)
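The structure of the queries in (59) and (60) can be illustrated with a small computation. Each query polynomial evaluates to the coefficient vector at its own constant and to zero at the other constant of the same index ℓ, regardless of the noise. The following minimal Python sketch checks this over an assumed prime field of size 31; the function names `inv` and `query`, the constants 21 and 26, and the choice of four messages in the first set are hypothetical and serve only as an illustration.

```python
import random

P = 31  # assumed prime field size

def inv(a):
    """Multiplicative inverse in F_P via Fermat's little theorem."""
    return pow(a % P, P - 2, P)

def query(a, f_own, f_other, lam, Z):
    """Evaluate a query of the form (59)/(60) at the point a.

    lam is the coefficient vector, and Z is a list of T noise vectors.
    """
    coeff = (a - f_other) * inv(f_own - f_other) % P
    mask = (a - f_own) * (a - f_other) % P
    out = []
    for k in range(len(lam)):
        noise = sum(pow(a, t, P) * Z[t][k] for t in range(len(Z))) % P
        out.append((coeff * lam[k] + mask * noise) % P)
    return out

random.seed(0)
K1, T1 = 4, 3            # assumed number of messages in the set, and T_1 = 3
f1, f2 = 21, 26          # assumed values of f_{l,1} and f_{l,2} for one index l
lam = [random.randrange(P) for _ in range(K1)]
Z = [[random.randrange(P) for _ in range(K1)] for _ in range(T1)]

at_own = query(f1, f1, f2, lam, Z)     # value of Q_{1,l,1} at f_{l,1}
at_other = query(f2, f1, f2, lam, Z)   # value of Q_{1,l,1} at f_{l,2}
print(at_own == lam, at_other == [0] * K1)
```

This is the property that, in the product of a stored codeword and a query, leaves a single Cauchy term carrying the desired symbol and turns the other Cauchy term into a polynomial in α.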

According to the definition of Section 4.1,

V = L / μ = 2

. The answer returned by Server n,

n \in [20]

is constructed as follows.

\begin{matrix} A_{n}^{(Λ)} = \{\sum_{m \in [2], ℓ \in [5]} {\tilde{W}}_{m, ℓ} {(α_{n})}^{⊤} Q_{m, ℓ, 1} (α_{n}), \sum_{m \in [2], ℓ \in [5]} {\tilde{W}}_{m, ℓ} {(α_{n})}^{⊤} Q_{m, ℓ, 2} (α_{n})\}, \end{matrix}

(64)

where

\begin{matrix} \sum_{m \in [2], ℓ \in [5]} {\tilde{W}}_{m, ℓ} {(α_{n})}^{⊤} Q_{m, ℓ, 1} (α_{n}) \\ = & \sum_{ℓ \in [5]} {(\frac{1}{α_{n} - f_{ℓ, 1}} W_{1, ℓ, 1} + \frac{1}{α_{n} - f_{ℓ, 2}} W_{1, ℓ, 2} + \sum_{i \in [11]} α_{n}^{i - 1} Y_{1, ℓ, i})}^{⊤} \\ \times [\frac{α_{n} - f_{ℓ, 2}}{f_{ℓ, 1} - f_{ℓ, 2}} λ_{1} + (α_{n} - f_{ℓ, 1}) (α_{n} - f_{ℓ, 2}) (\sum_{t \in [3]} α_{n}^{t - 1} Z_{1, ℓ, 1, t}^{'})] \\ + {(\frac{1}{α_{n} - f_{ℓ, 1}} W_{2, ℓ, 1} + \frac{1}{α_{n} - f_{ℓ, 2}} W_{2, ℓ, 2} + \sum_{i \in [12]} α_{n}^{i - 1} Y_{2, ℓ, i})}^{⊤} \end{matrix}

\begin{matrix} \times [\frac{α_{n} - f_{ℓ, 2}}{f_{ℓ, 1} - f_{ℓ, 2}} λ_{2} + (α_{n} - f_{ℓ, 1}) (α_{n} - f_{ℓ, 2}) (\sum_{t \in [2]} α_{n}^{t - 1} Z_{2, ℓ, 1, t}^{'})] \end{matrix}

(65)

\begin{matrix} = & \sum_{ℓ \in [5]} (\frac{1}{α_{n} - f_{ℓ, 1}} W_{1, ℓ, 1}^{⊤} λ_{1} + \frac{1}{α_{n} - f_{ℓ, 1}} W_{2, ℓ, 1}^{⊤} λ_{2} + \sum_{s \in [15]} α_{n}^{s - 1} Y_{ℓ, 1, s}^{'}) \end{matrix}

(66)

\begin{matrix} = & \sum_{ℓ \in [5]} \frac{1}{α_{n} - f_{ℓ, 1}} (\sum_{m \in [2]} W_{m, ℓ, 1}^{⊤} λ_{m}) + \sum_{s \in [15]} α_{n}^{s - 1} {\bar{Y}}_{1, s} \end{matrix}

(67)

where for all

ℓ \in [5], s \in [15]

,

Y_{ℓ, 1, s}^{'}

are various linear combinations of

W_{1, ℓ, 2}^{⊤} λ_{1}

,

{(Y_{1, ℓ, i}^{⊤} λ_{1})}_{i \in [11]}

,

{(W_{1, ℓ, 2}^{⊤} Z_{1, ℓ, 1, t}^{'})}_{t \in [3]}

,

{(Y_{1, ℓ, i}^{⊤} Z_{1, ℓ, 1, t}^{'})}_{t \in [3], i \in [11]}

,

W_{2, ℓ, 2}^{⊤} λ_{2}

,

{(Y_{2, ℓ, i}^{⊤} λ_{2})}_{i \in [12]}

,

{(W_{2, ℓ, 2}^{⊤} Z_{2, ℓ, 1, t}^{'})}_{t \in [2]}

,

{(Y_{2, ℓ, i}^{⊤} Z_{2, ℓ, 1, t}^{'})}_{t \in [2], i \in [12]}

, whose exact form is not relevant. Similarly, we have

\begin{matrix} \sum_{m \in [2], ℓ \in [5]} {\tilde{W}}_{m, ℓ} {(α_{n})}^{⊤} Q_{m, ℓ, 2} (α_{n}) \\ = & \sum_{ℓ \in [5]} \frac{1}{α_{n} - f_{ℓ, 2}} (\sum_{m \in [2]} W_{m, ℓ, 2}^{⊤} λ_{m}) + \sum_{s \in [15]} α_{n}^{s - 1} {\bar{Y}}_{2, s} . \end{matrix}

(68)

Note that for all

n \in [20], m \in [2] ∖ M_{n}, ℓ \in [5]

,

{\tilde{W}}_{m, ℓ} (α_{n}) = 0

always holds; thus, there is no need for the user to upload

Q_{m, ℓ, 1} (α_{n})

and

Q_{m, ℓ, 2} (α_{n})

to Server n for such m.

Now, we can write the symbols contained in

{(A_{n}^{(Λ)})}_{n \in [20]}

in the following matrix form,

\begin{matrix} [\begin{matrix} \sum_{m \in [2], ℓ \in [5]} {\tilde{W}}_{m, ℓ} {(α_{1})}^{⊤} Q_{m, ℓ, 1} (α_{1}) \\ \sum_{m \in [2], ℓ \in [5]} {\tilde{W}}_{m, ℓ} {(α_{2})}^{⊤} Q_{m, ℓ, 1} (α_{2}) \\ ⋮ \\ \sum_{m \in [2], ℓ \in [5]} {\tilde{W}}_{m, ℓ} {(α_{20})}^{⊤} Q_{m, ℓ, 1} (α_{20}) \end{matrix}] \\ = & [\begin{matrix} \frac{1}{α_{1} - f_{1, 1}} & \dots & \frac{1}{α_{1} - f_{5, 1}} & 1 & α_{1} & \dots & α_{1}^{14} \\ \frac{1}{α_{2} - f_{1, 1}} & \dots & \frac{1}{α_{2} - f_{5, 1}} & 1 & α_{2} & \dots & α_{2}^{14} \\ ⋮ \\ \frac{1}{α_{20} - f_{1, 1}} & \dots & \frac{1}{α_{20} - f_{5, 1}} & 1 & α_{20} & \dots & α_{20}^{14} \end{matrix}] \\ \times {[\sum_{m \in [2]} W_{m, 1, 1}^{⊤} λ_{m} \dots \sum_{m \in [2]} W_{m, 5, 1}^{⊤} λ_{m} {\bar{Y}}_{1, 1} {\bar{Y}}_{1, 2} \dots {\bar{Y}}_{1, 15}]}^{⊤} \end{matrix}

(69)

\begin{matrix} [\begin{matrix} \sum_{m \in [2], ℓ \in [5]} {\tilde{W}}_{m, ℓ} {(α_{1})}^{⊤} Q_{m, ℓ, 2} (α_{1}) \\ \sum_{m \in [2], ℓ \in [5]} {\tilde{W}}_{m, ℓ} {(α_{2})}^{⊤} Q_{m, ℓ, 2} (α_{2}) \\ ⋮ \\ \sum_{m \in [2], ℓ \in [5]} {\tilde{W}}_{m, ℓ} {(α_{20})}^{⊤} Q_{m, ℓ, 2} (α_{20}) \end{matrix}] \\ = & [\begin{matrix} \frac{1}{α_{1} - f_{1, 2}} & \dots & \frac{1}{α_{1} - f_{5, 2}} & 1 & α_{1} & \dots & α_{1}^{14} \\ \frac{1}{α_{2} - f_{1, 2}} & \dots & \frac{1}{α_{2} - f_{5, 2}} & 1 & α_{2} & \dots & α_{2}^{14} \\ ⋮ \\ \frac{1}{α_{20} - f_{1, 2}} & \dots & \frac{1}{α_{20} - f_{5, 2}} & 1 & α_{20} & \dots & α_{20}^{14} \end{matrix}] \\ \times {[\sum_{m \in [2]} W_{m, 1, 2}^{⊤} λ_{m} \dots \sum_{m \in [2]} W_{m, 5, 2}^{⊤} λ_{m} {\bar{Y}}_{2, 1} {\bar{Y}}_{2, 2} \dots {\bar{Y}}_{2, 15}]}^{⊤} \end{matrix}

(70)

Now it is clear that the desired linear combination symbols

{[\sum_{m \in [M]} W_{m, ℓ, κ}^{⊤} λ_{m}]}_{ℓ \in [5], κ \in [2]}

can be decoded by inverting the Cauchy–Vandermonde matrices in (69) and (70), respectively, whose invertibility is ensured by the fact that

f_{1, 1}, \dots, f_{5, 1}, f_{1, 2}, \dots, f_{5, 2}, α_{1}, α_{2}, \dots, α_{20}

are distinct. Finally, note that for each server n, a total of two symbols are downloaded, and the normalized download cost of the scheme is, thus,

D_{n} = 2 / 10 = 1 / 5

for all

n \in [20]

.
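To make the decoding step concrete, the following minimal Python sketch builds a 20 × 20 Cauchy–Vandermonde matrix of the form (41) with N = 20 and μ = 5 over a small prime field, forms the 20 answers from randomly chosen symbols, and recovers the five desired symbols by Gaussian elimination. The field size 31, the constants α_n = n and f_{ℓ,1} = 20 + ℓ, the random symbol values, and the helper name `solve_mod` are illustrative assumptions and not part of the scheme's specification.

```python
import random

P = 31          # assumed prime field size (q >= 30)
N, MU = 20, 5   # number of servers and number of Cauchy columns

def solve_mod(A, b, p):
    """Solve A x = b over F_p by Gauss-Jordan elimination (A must be invertible)."""
    n = len(A)
    M = [row[:] + [bi] for row, bi in zip(A, b)]
    for col in range(n):
        # next() raises StopIteration if no pivot exists, i.e., A is singular.
        piv = next(r for r in range(col, n) if M[r][col] % p != 0)
        M[col], M[piv] = M[piv], M[col]
        inv_piv = pow(M[col][col], p - 2, p)
        M[col] = [v * inv_piv % p for v in M[col]]
        for r in range(n):
            if r != col and M[r][col]:
                factor = M[r][col]
                M[r] = [(vr - factor * vc) % p for vr, vc in zip(M[r], M[col])]
    return [M[r][n] for r in range(n)]

alpha = list(range(1, N + 1))   # assumed alpha_1, ..., alpha_20
f = list(range(21, 21 + MU))    # assumed f_{1,1}, ..., f_{5,1}

# Row n: five Cauchy entries 1/(alpha_n - f_l), then 1, alpha_n, ..., alpha_n^14.
C = [[pow((a - fl) % P, P - 2, P) for fl in f] +
     [pow(a, i, P) for i in range(N - MU)] for a in alpha]

random.seed(1)
desired = [random.randrange(P) for _ in range(MU)]           # desired symbols
interference = [random.randrange(P) for _ in range(N - MU)]  # aligned interference
unknowns = desired + interference

# One answer symbol per server: the n-th row of C applied to the unknown vector.
answers = [sum(c * x for c, x in zip(row, unknowns)) % P for row in C]

recovered = solve_mod(C, answers, P)
print(recovered[:MU] == desired)
```

The user only needs the first five entries of the solution; the remaining fifteen entries are the aligned interference symbols and are discarded.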

Remark 2.

It is of interest to compare the decoding complexity of our scheme with that of [38]. For the decoding procedure, our scheme directly inverts a Cauchy–Vandermonde matrix, which can be conducted in

O (N^{3})

via standard methods or

\tilde{O} (N {log}^{2} N)

using fast algorithms. In contrast, the scheme in [38] requires an additional pre-processing step for interference cancellation before a similar inversion, incurring an extra complexity of

O (N^{2})

above the complexity of inverting a Vandermonde matrix (which is also

O (N^{3})

via standard methods or

\tilde{O} (N {log}^{2} N)

using fast algorithms).

5. Proof of Theorem 1

In this section, adapting the idea of the augmented system in [38], we show that the download cost and the corresponding storage cost defined in Theorem 1 are achievable. The augmented system is an Asymmetric MDS-GXSTPLC instance with the same message sets, which is eventually reduced to the original GXSTPLC setting by merging servers. Specifically, the augmented system consists of

\bar{N} = \sum_{n \in [N]} τ_{n}

servers, denoted as Server

(1, 1), (1, 2), \dots, (1, τ_{1}), \dots, (N, 1), (N, 2), \dots, (N, τ_{N})

. The storage pattern

\bar{R} = {{\bar{R}}_{1}, {\bar{R}}_{2}, \dots, {\bar{R}}_{M}}

and security/privacy thresholds for each message set

\bar{X} = ({\bar{X}}_{1}, {\bar{X}}_{2}, \dots, {\bar{X}}_{M})

,

\bar{T} = ({\bar{T}}_{1}, {\bar{T}}_{2}, \dots, {\bar{T}}_{M})

are defined as follows.

\begin{matrix} {\bar{X}}_{m} & = X γ_{m}, \forall m \in [M] \end{matrix}

(71)

\begin{matrix} {\bar{T}}_{m} & = T γ_{m}, \forall m \in [M] \end{matrix}

(72)

\begin{matrix} {\bar{R}}_{m} & = \{(n, i) ∣ n \in R_{m}, i \in [min (γ_{m}, τ_{n})]\} \end{matrix}

(73)

In addition, for all

m \in [M]

, we set the MDS coding parameter for the augmented system

{\bar{η}}_{m} = q_{0} (η_{m} - 1) + 1

. Recall that by the definition of

q_{0}

,

{\bar{η}}_{m}

must be an integer.

Our GXSTPLC scheme is obtained by merging servers

(n, 1), (n, 2), \dots, (n, τ_{n})

into Server n for all

n \in [N]

, i.e., Server n is assigned the storage, queries, and corresponding answers of these servers. Note that this recovers the original storage pattern

R

because for all

n \in [N]

, we have

⋃_{i \in [τ_{n}]} {\bar{M}}_{(n, i)} \subseteq M_{n}

, where

{\bar{M}}_{(n, i)} = {m \in [M] ∣ {\bar{R}}_{m} ∋ (n, i)}

is the dual representation of

\bar{R}

. Moreover, for all

m \in [M]

,

n \in R_{m}

, the total number of pairs of the form

(n, *)

in

{\bar{R}}_{m}

cannot be greater than

{\bar{X}}_{m} / X

and

{\bar{T}}_{m} / T

. Hence, any X (or T) colluding servers hold at most

{\bar{X}}_{m}

(or

{\bar{T}}_{m}

) codewords (or queries) of the

m^{th}

message set. By the

{\bar{X}}_{m}

-security (or

{\bar{T}}_{m}

-privacy) of the augmented system, these colluding servers learn nothing about the messages (or coefficients); i.e., the scheme is X-secure and T-private. Finally, according to Section 4, for the augmented system, the normalized download cost of

\begin{matrix} D_{n, i} = \frac{1}{{min}_{m \in [M]} {\bar{ρ}}_{m} - {\bar{X}}_{m} - {\bar{T}}_{m} - {\bar{η}}_{m} + 1} \end{matrix}

(74)

for each server

(n, i), n \in [N], i \in [τ_{n}]

is achievable, where

{\bar{ρ}}_{m} = | {\bar{R}}_{m} |, m \in [M]

. According to the definition, for all

m \in [M]

,

{\bar{ρ}}_{m} = {\bar{X}}_{m} + {\bar{T}}_{m} + ν_{m}

, where

ν_{m}

is the summation of the smallest

(ρ_{m} - X - T)

elements in

{(τ_{n})}_{n \in R_{m}}

. Since

(D_{1}, D_{2}, \dots, D_{N}) \in D

, we must have

ν_{m} \geq q_{0} η_{m}

for all

m \in [M]

. Now, since reducing the augmented system to the original setting does not increase the download cost, we can verify that the desired normalized download cost

D_{n}

for each Server

n, n \in [N]

is achievable by our scheme, as follows.

\begin{matrix} \sum_{i \in [τ_{n}]} D_{n, i} = & \frac{τ_{n}}{{min}_{m \in [M]} {\bar{ρ}}_{m} - {\bar{X}}_{m} - {\bar{T}}_{m} - {\bar{η}}_{m} + 1} \end{matrix}

(75)

\begin{matrix} = & \frac{τ_{n}}{{min}_{m \in [M]} ν_{m} - {\bar{η}}_{m} + 1} \end{matrix}

(76)

\begin{matrix} \leq & \frac{τ_{n}}{{min}_{m \in [M]} q_{0} η_{m} - q_{0} (η_{m} - 1)} \end{matrix}

(77)

\begin{matrix} = & \frac{τ_{n}}{q_{0}} \end{matrix}

(78)

\begin{matrix} = & D_{n} \end{matrix}

(79)

Moreover, the normalized storage cost of Server n,

C_{n}

, can be calculated as follows.

\begin{matrix} C_{n} & = \frac{\sum_{i \in [τ_{n}], m \in {\bar{M}}_{(n, i)}} K_{m} / {\bar{η}}_{m}}{K} \end{matrix}

(80)

\begin{matrix} = \frac{1}{K} \sum_{i = 1}^{τ_{n}} \sum_{m = 1}^{M} I ((n, i) \in {\bar{R}}_{m}) \frac{K_{m}}{{\bar{η}}_{m}} \end{matrix}

(81)

\begin{matrix} = \frac{1}{K} \sum_{m \in M_{n}} min (τ_{n}, γ_{m}) \frac{K_{m}}{q_{0} (η_{m} - 1) + 1} \end{matrix}

(82)

Remark 3.

Recall that our achievability scheme is constructed by reducing the augmented system (i.e., an Asymmetric MDS-GXSTPLC instance) to the original setting. Consequently, the decoding procedure for the scheme consists of inverting a series of Cauchy–Vandermonde matrices defined in (41). Indeed, this can be viewed as a series of instances of the CSA scheme, where the desired linear combination symbols are carried by the Cauchy terms, and the interference symbols are aligned within the Vandermonde terms. Then, according to the N-Sum Box abstraction of CSA codes [44,50], in a quantum setting where servers can send entangled qudits via separate quantum channels to the user, representing encoded classical answer symbols through local quantum operations, the superdense coding gain is achievable. In particular, for any total normalized download cost of

D = \sum_{n \in [N]} D_{n} > 2

in the classical setting, the corresponding quantum scheme achieves a total download cost of

\frac{D}{2}

. For any classical total normalized download cost of

D < 2

, the quantum scheme achieves the (obviously) optimal normalized total download cost of

D = 1

. Note that this direct application of the CSA structure and the subsequent quantum gain is not directly applicable to schemes exploiting properties of dual GRS codes [26,38], since their decoding requires a pre-processing procedure of interference cancellation, and the resulting structure does not follow that of CSA codes.
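The classical-to-quantum cost relation stated in this remark can be summarized in one line of code. The following minimal Python sketch uses the hypothetical function name `quantum_total_cost`; it treats the boundary case D = 2, which the remark does not state explicitly, as an assumption under which both expressions coincide at 1.

```python
from fractions import Fraction

def quantum_total_cost(D):
    """Total normalized download cost of the quantum scheme for a classical cost D.

    Returns D/2 when D > 2 and 1 when D < 2; at D = 2 (assumed) both agree.
    """
    return max(Fraction(D) / 2, Fraction(1))

# Example of Section 4.5: 20 servers, each with normalized download cost 1/5.
D_classical = 20 * Fraction(1, 5)
print(D_classical, quantum_total_cost(D_classical))
```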

Motivating Example

Consider another motivating example where we have

N = 8

servers and K messages,

X = 1

and

T = 1

. The K messages are partitioned into two disjoint sets

W_{1}

and

W_{2}

. Let us set

K_{1} = K_{2} = K / 2

. The storage pattern for this example is as follows.

\begin{matrix} R_{1} & = {2, 3, 6, 7}, \end{matrix}

(83a)

\begin{matrix} R_{2} & = {1, 4, 5, 7, 8} . \end{matrix}

(83b)

Let us set

η = (η_{1}, η_{2}) = (1.2, 1.2)

. Moreover, let us also set the target vector of per-server normalized download costs as

(D_{1}, D_{2}, \dots, D_{8}) = (2 / 5, 3 / 5, 3 / 5, 2 / 5, 2 / 5, 3 / 5, 3 / 5, 2 / 5)

. It can be easily verified that the selected vector of per-server normalized download costs lies in the feasible region

D (η)

defined in (15). Following the notations defined in Theorem 1 and above, we have

q_{0} = 5

,

(τ_{1}, τ_{2}, \dots, τ_{8}) = (2, 3, 3, 2, 2, 3, 3, 2)

and

\bar{N} = 20

. The 20 servers in the augmented system are listed as

((1, 1), (1, 2), (2, 1), (2, 2), (2, 3), (3, 1), (3, 2), (3, 3), (4, 1), (4, 2), (5, 1), (5, 2), (6, 1), (6, 2), (6, 3), (7, 1), (7, 2), (7, 3), (8, 1), (8, 2))

.

Now we can generate the augmented system. Specifically, for the augmented system, we have

{\bar{X}}_{1} = {\bar{T}}_{1} = 3

,

{\bar{X}}_{2} = {\bar{T}}_{2} = 2

,

{\bar{η}}_{1} = {\bar{η}}_{2} = 2

, and the storage pattern is shown in Table 1.
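The construction of the augmented system for this example can be sketched in a few lines of Python. The sketch enumerates the augmented servers, applies (71)–(73), and relabels the servers as 1, 2, ..., 20 in the listed order. The values γ_1 = 3 and γ_2 = 2 are an assumption inferred from the thresholds of the augmented system stated above, and all variable names are hypothetical.

```python
from fractions import Fraction

# Original system: N = 8 servers, X = T = 1, two message sets.
R = {1: {2, 3, 6, 7}, 2: {1, 4, 5, 7, 8}}
tau = {1: 2, 2: 3, 3: 3, 4: 2, 5: 2, 6: 3, 7: 3, 8: 2}
gamma = {1: 3, 2: 2}   # assumed, so that X_bar_m = X * gamma_m gives (3, 2)
X, T, q0 = 1, 1, 5
eta = {1: Fraction(6, 5), 2: Fraction(6, 5)}

# Augmented servers (n, i), listed in lexicographic order, and their labels 1..20.
servers = [(n, i) for n in sorted(tau) for i in range(1, tau[n] + 1)]
psi = {s: idx for idx, s in enumerate(servers, start=1)}

X_bar = {m: X * gamma[m] for m in R}                 # (71)
T_bar = {m: T * gamma[m] for m in R}                 # (72)
eta_bar = {m: q0 * (eta[m] - 1) + 1 for m in R}      # MDS parameter of the augmented system
R_bar = {m: {(n, i) for n in R[m]                    # (73)
             for i in range(1, min(gamma[m], tau[n]) + 1)} for m in R}

# Storage pattern of the augmented system after relabeling the servers.
R_flat = {m: sorted(psi[s] for s in R_bar[m]) for m in R}
print(len(servers), X_bar, T_bar, eta_bar)
print(R_flat)
```

Under these assumptions, the relabeled storage sets coincide with (43a) and (43b), and the thresholds and MDS parameters coincide with those of the example in Section 4.5.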

This is exactly the setting presented in the motivating example in Section 4.5 with server mapping

Ψ : ((1, 1), (1, 2), (2, 1), (2, 2), (2, 3), (3, 1), (3, 2), (3, 3), (4, 1), (4, 2), (5, 1), (5, 2), (6, 1), (6, 2), (6, 3), (7, 1), (7, 2), (7, 3), (8, 1), (8, 2)) \to (1, 2, \dots, 20)

, and for each server

(n, i), n \in [8], i \in [τ_{n}]

, the normalized download cost of

D_{n, i} = 1 / 5

is achievable. After reducing the augmented system to the original setting, we can calculate that the normalized download costs

(D_{1}, D_{2}, \dots, D_{8}) = (2 / 5, 3 / 5, 3 / 5, 2 / 5, 2 / 5, 3 / 5, 3 / 5, 2 / 5)

are achievable, and the normalized storage costs are

(C_{1}, C_{2}, \dots, C_{8}) = (1 / 2, 3 / 4, 3 / 4, 1 / 2, 1 / 2, 3 / 4, 5 / 4, 1 / 2)

.
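The stated cost vectors can be reproduced with exact rational arithmetic. The following minimal Python sketch evaluates the download cost of each server as τ_n / q_0, following (78) and (79), and the storage cost according to (82). The values γ_1 = 3 and γ_2 = 2 are an assumption inferred from the thresholds of the augmented system, and the variable names are hypothetical.

```python
from fractions import Fraction

R = {1: {2, 3, 6, 7}, 2: {1, 4, 5, 7, 8}}
tau = [2, 3, 3, 2, 2, 3, 3, 2]          # tau_1, ..., tau_8
gamma = {1: 3, 2: 2}                    # assumed values of gamma_1, gamma_2
q0 = 5
eta = {1: Fraction(6, 5), 2: Fraction(6, 5)}
K_share = {1: Fraction(1, 2), 2: Fraction(1, 2)}   # K_m / K with K_1 = K_2 = K / 2

# Download cost per server: tau_n augmented servers, each with cost 1 / q0.
D = [Fraction(t, q0) for t in tau]

# Storage cost per server, following (82).
C = [sum(min(tau[n - 1], gamma[m]) * K_share[m] / (q0 * (eta[m] - 1) + 1)
         for m in R if n in R[m])
     for n in range(1, 9)]

print([str(d) for d in D])
print([str(c) for c in C])
```

Server 7 is the only server that stores both message sets, which is why its storage cost is the largest.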

Now let us see why the resulting GXSTPLC scheme is

X = 1

-secure and

T = 1

-private. Note that for all

n \in {2, 3, 6}

,

\begin{matrix} I (S_{n}; W) = & I ({{\tilde{W}}_{1, ℓ} (α_{Ψ ((n, i))}), {\tilde{W}}_{2, ℓ} (α_{Ψ ((n, i))}) ∣ i \in [3], ℓ \in [5]}; W_{1}, W_{2}) \end{matrix}

(84)

\begin{matrix} = & I ({{\tilde{W}}_{1, ℓ} (α_{Ψ ((n, i))}) ∣ i \in [3], ℓ \in [5]}; W_{1}) \end{matrix}

(85)

\begin{matrix} = & 0, \end{matrix}

(86)

\begin{matrix} I (Q_{n}^{[Λ]}; Λ) = & I ({Q_{1, ℓ, κ} (α_{Ψ ((n, i))}), Q_{2, ℓ, κ} (α_{Ψ ((n, i))}) ∣ i \in [3], ℓ \in [5], κ \in [2]}; & {\{λ_{m, k}\}}_{m \in [2], k \in [K_{m}]}) \end{matrix}

(87)

\begin{matrix} = & I ({Q_{1, ℓ, κ} (α_{Ψ ((n, i))}) ∣ i \in [3], ℓ \in [5], κ \in [2]}; {\{λ_{1, k}\}}_{k \in [K_{1}]}) \end{matrix}

(88)

\begin{matrix} = & 0, \end{matrix}

(89)

For all

n \in {1, 4, 5, 8}

,

\begin{matrix} I (S_{n}; W) = & I ({{\tilde{W}}_{1, ℓ} (α_{Ψ ((n, i))}), {\tilde{W}}_{2, ℓ} (α_{Ψ ((n, i))}) ∣ i \in [2], ℓ \in [5]}; W_{1}, W_{2}) \end{matrix}

(90)

\begin{matrix} = & I ({{\tilde{W}}_{2, ℓ} (α_{Ψ ((n, i))}) ∣ i \in [2], ℓ \in [5]}; W_{2}) \end{matrix}

(91)

\begin{matrix} = & 0, \end{matrix}

(92)

\begin{matrix} I (Q_{n}^{[Λ]}; Λ) = & I ({Q_{1, ℓ, κ} (α_{Ψ ((n, i))}), Q_{2, ℓ, κ} (α_{Ψ ((n, i))}) ∣ i \in [2], ℓ \in [5], κ \in [2]}; & {\{λ_{m, k}\}}_{m \in [2], k \in [K_{m}]}) \end{matrix}

(93)

\begin{matrix} = & I ({Q_{2, ℓ, κ} (α_{Ψ ((n, i))}) ∣ i \in [2], ℓ \in [5], κ \in [2]}; {\{λ_{2, k}\}}_{k \in [K_{2}]}) \end{matrix}

(94)

\begin{matrix} = & 0, \end{matrix}

(95)

and

\begin{matrix} I (S_{7}; W) = & I ({{\tilde{W}}_{1, ℓ} (α_{Ψ ((7, i))}), {\tilde{W}}_{2, ℓ} (α_{Ψ ((7, i))}) ∣ i \in [3], ℓ \in [5]}; W_{1}, W_{2}) \\ = & I ({{\tilde{W}}_{1, ℓ} (α_{Ψ ((7, i))}) ∣ i \in [3], ℓ \in [5]}; W_{1}) \end{matrix}

(96)

\begin{matrix} + I ({{\tilde{W}}_{2, ℓ} (α_{Ψ ((7, i))}) ∣ i \in [2], ℓ \in [5]}; W_{2}) \end{matrix}

(97)

\begin{matrix} = & 0, \end{matrix}

(98)

\begin{matrix} I (Q_{7}^{[Λ]}; Λ) = & I ({Q_{1, ℓ, κ} (α_{Ψ ((7, i))}), Q_{2, ℓ, κ} (α_{Ψ ((7, i))}) ∣ i \in [3], ℓ \in [5], κ \in [2]}; & {\{λ_{m, k}\}}_{m \in [2], k \in [K_{m}]}) \\ = & I ({Q_{1, ℓ, κ} (α_{Ψ ((7, i))}) ∣ i \in [3], ℓ \in [5], κ \in [2]}; {\{λ_{1, k}\}}_{k \in [K_{1}]}) \end{matrix}

(99)

\begin{matrix} + I ({Q_{2, ℓ, κ} (α_{Ψ ((7, i))}) ∣ i \in [2], ℓ \in [5], κ \in [2]}; {\{λ_{2, k}\}}_{k \in [K_{2}]}) \end{matrix}

(100)

\begin{matrix} = & 0, \end{matrix}

(101)

Here, (86), (92), and (98) hold due to (58), while (89), (95), and (101) hold due to (63).

6. Conclusions

We explored the problem of GXSTPLC by proposing an achievability scheme that establishes a trade-off between communication cost and storage cost. Notably, our scheme demonstrates the generalization of the CSA null shaper idea beyond its application in storage-consistent private updates. The application of CSA null shaper preserves the standard CSA decoding structure while offering significant advantages, including reduced decoding complexity and a direct framework for quantum transformation. Promising avenues for future research include the complete capacity characterization for the MDS-GXSTPLC problem. Additionally, investigating the potential applicability of the CSA null shaper to other relevant problems is of considerable interest.

Author Contributions

Conceptualization, Y.L., Z.J., and H.J.; methodology, Y.L. and Z.J.; investigation, Y.L. and H.J.; writing—original draft preparation, Y.L., Z.J., and H.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded in part by the National Natural Science Foundation of China under grant number 62201080.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

CSA	Cross-Subspace Alignment
GXSTPLC	Graph-Based X-Secure T-Private Linear Computation
GRS	Generalized Reed–Solomon
PIR	Private Information Retrieval
PLC	Private Linear Computation

References

Sun, H.; Jafar, S.A. The Capacity of Private Information Retrieval. IEEE Trans. Inf. Theory 2017, 63, 4075–4088.
Sun, H.; Jafar, S.A. The Capacity of Robust Private Information Retrieval with Colluding Databases. IEEE Trans. Inf. Theory 2018, 64, 2361–2370.
Sun, H.; Jafar, S.A. The Capacity of Private Computation. IEEE Trans. Inf. Theory 2019, 65, 3880–3897.
Tahmasebi, B.; Maddah-Ali, M.A. Private Function Computation. In Proceedings of the 2020 IEEE International Symposium on Information Theory (ISIT), Los Angeles, CA, USA, 21–26 June 2020; pp. 1118–1123.
Lu, Y.; Jia, Z.; Jafar, S.A. Double blind T-private information retrieval. IEEE J. Sel. Areas Inf. Theory 2021, 2, 428–440.
Raviv, N.; Karpuk, D.A. Private Polynomial Computation From Lagrange Encoding. IEEE Trans. Inf. Forensics Secur. 2020, 15, 553–563.
Chen, Z.; Wang, Z.; Jafar, S.A. The Asymptotic Capacity of Private Search. IEEE Trans. Inf. Theory 2020, 66, 4709–4721.
Yao, X.; Liu, N.; Kang, W. The Capacity of Private Information Retrieval Under Arbitrary Collusion Patterns for Replicated Databases. IEEE Trans. Inf. Theory 2021, 67, 6841–6855.
Banawan, K.; Ulukus, S. The Capacity of Private Information Retrieval From Coded Databases. IEEE Trans. Inf. Theory 2018, 64, 1945–1956.
Freij-Hollanti, R.; Gnilke, O.; Hollanti, C.; Karpuk, D. Private information retrieval from coded databases with colluding servers. SIAM J. Appl. Algebra Geom. 2017, 1, 647–664.
Sun, H.; Jafar, S.A. Private information retrieval from MDS coded data with colluding servers: Settling a conjecture by Freij-Hollanti et al. IEEE Trans. Inf. Theory 2018, 64, 1000–1022.
Tajeddine, R.; Gnilke, O.W.; Karpuk, D.; Freij-Hollanti, R.; Hollanti, C. Private Information Retrieval From Coded Storage Systems with Colluding, Byzantine, and Unresponsive Servers. IEEE Trans. Inf. Theory 2019, 65, 3898–3906.
Tajeddine, R.; Gnilke, O.W.; El Rouayheb, S. Private Information Retrieval From MDS Coded Data in Distributed Storage Systems. IEEE Trans. Inf. Theory 2018, 64, 7081–7093.
Wang, Q.; Skoglund, M. Symmetric Private Information Retrieval from MDS Coded Distributed Storage with Non-Colluding and Colluding Servers. IEEE Trans. Inf. Theory 2019, 65, 5160–5175.
Obead, S.A.; Lin, H.Y.; Rosnes, E.; Kliewer, J. Private Linear Computation for Noncolluding Coded Databases. IEEE J. Sel. Areas Commun. 2022, 40, 847–861.
Wang, Z.; Banawan, K.; Ulukus, S. Private Set Intersection: A Multi-Message Symmetric Private Information Retrieval Perspective. IEEE Trans. Inf. Theory 2022, 68, 2001–2019.
Kadhe, S.; Garcia, B.; Heidarzadeh, A.; El Rouayheb, S.; Sprintson, A. Private Information Retrieval with Side Information. IEEE Trans. Inf. Theory 2020, 66, 2032–2043.
Wei, Y.P.; Banawan, K.; Ulukus, S. Fundamental Limits of Cache-Aided Private Information Retrieval with Unknown and Uncoded Prefetching. IEEE Trans. Inf. Theory 2019, 65, 3215–3232.
Wei, Y.P.; Banawan, K.; Ulukus, S. The Capacity of Private Information Retrieval with Partially Known Private Side Information. IEEE Trans. Inf. Theory 2019, 65, 8222–8231.
Chen, Z.; Wang, Z.; Jafar, S.A. The capacity of T-private information retrieval with private side information. IEEE Trans. Inf. Theory 2020, 66, 4761–4773.
Sun, H.; Jafar, S.A. Multiround private information retrieval: Capacity and storage overhead. IEEE Trans. Inf. Theory 2018, 64, 5743–5754.
Wang, Q.; Sun, H.; Skoglund, M. The Capacity of Private Information Retrieval with Eavesdroppers. IEEE Trans. Inf. Theory 2019, 65, 3198–3214.
Yang, H.; Shin, W.; Lee, J. Private Information Retrieval for Secure Distributed Storage Systems. IEEE Trans. Inf. Forensics Secur. 2018, 13, 2953–2964.
Jia, Z.; Sun, H.; Jafar, S.A. Cross subspace alignment and the asymptotic capacity of X-secure T-private information retrieval. IEEE Trans. Inf. Theory 2019, 65, 5783–5798.
Jia, Z.; Jafar, S.A. X-secure T-private information retrieval from MDS coded storage with byzantine and unresponsive servers. IEEE Trans. Inf. Theory 2020, 66, 7427–7438.
Jia, Z.; Jafar, S.A. On the asymptotic capacity of X-secure T-private information retrieval with graph-Based replicated storage. IEEE Trans. Inf. Theory 2020, 66, 6280–6296.
Jia, Z.; Jafar, S.A. X-secure T-private federated submodel learning with elastic dropout resilience. IEEE Trans. Inf. Theory 2022, 68, 5418–5439.
Sun, H.; Jafar, S.A. The Capacity of Symmetric Private Information Retrieval. IEEE Trans. Inf. Theory 2019, 65, 322–329.
Zhu, J.; Yan, Q.; Tang, X.; Li, S. Symmetric private polynomial computation from Lagrange encoding. IEEE Trans. Inf. Theory 2022, 68, 2704–2718.
Banawan, K.; Ulukus, S. The Capacity of Private Information Retrieval from Byzantine and Colluding Databases. IEEE Trans. Inf. Theory 2019, 65, 1206–1219.
Banawan, K.; Ulukus, S. Asymmetry hurts: Private information retrieval under asymmetric traffic constraints. IEEE Trans. Inf. Theory 2019, 65, 7628–7645.
Sun, H.; Jafar, S.A. Optimal Download Cost of Private Information Retrieval for Arbitrary Message Length. IEEE Trans. Inf. Forensics Secur. 2017, 12, 2920–2932.
Jia, Z.; Sun, H.; Jafar, S.A. The Capacity of Private Information Retrieval with Disjoint Colluding Sets. In Proceedings of the GLOBECOM 2017—2017 IEEE Global Communications Conference, Singapore, 4–8 December 2017; pp. 1–6.
Tian, C.; Sun, H.; Chen, J. Capacity-Achieving Private Information Retrieval Codes with Optimal Message Size and Upload Cost. IEEE Trans. Inf. Theory 2019, 65, 7613–7627.
Raviv, N.; Tamo, I.; Yaakobi, E. Private Information Retrieval in Graph-Based Replication Systems. IEEE Trans. Inf. Theory 2020, 66, 3590–3602.
Sadeh, B.; Gu, Y.; Tamo, I. Bounds on the Capacity of Private Information Retrieval Over Graphs. IEEE Trans. Inf. Forensics Secur. 2023, 18, 261–273.
Banawan, K.; Ulukus, S. Private Information Retrieval from Non-Replicated Databases. In Proceedings of the 2019 IEEE International Symposium on Information Theory (ISIT), Paris, France, 7–12 July 2019; pp. 1272–1276.
Jia, H.; Jia, Z. The Asymptotic Capacity of X-Secure T-Private Linear Computation with Graph Based Replicated Storage. IEEE Trans. Inf. Theory 2024, 70, 5269–5288.
Nomeir, M.; Vithana, S.; Ulukus, S. Asymmetric X-Secure T-Private Information Retrieval: More Databases is Not Always Better. In Proceedings of the 2024 58th Annual Conference on Information Sciences and Systems (CISS), Princeton, NJ, USA, 13–15 March 2024; pp. 1–6.
Aytekin, A.; Nomeir, M.; Vithana, S.; Ulukus, S. Quantum Symmetric Private Information Retrieval with Secure Storage and Eavesdroppers. In Proceedings of the 2023 IEEE Globecom Workshops (GC Wkshps), Kuala Lumpur, Malaysia, 4–8 December 2023; pp. 1057–1062.
Song, S.; Hayashi, M. Capacity of Quantum Private Information Retrieval with Multiple Servers. IEEE Trans. Inf. Theory 2021, 67, 452–463.
Song, S.; Hayashi, M. Capacity of Quantum Private Information Retrieval with Colluding Servers. IEEE Trans. Inf. Theory 2021, 67, 5491–5508.
Allaix, M.; Song, S.; Holzbaur, L.; Pllaha, T.; Hayashi, M.; Hollanti, C. On the Capacity of Quantum Private Information Retrieval From MDS-Coded and Colluding Servers. IEEE J. Sel. Areas Commun. 2022, 40, 885–898.
Lu, Y.; Jafar, S.A. Quantum Cross Subspace Alignment Codes via the N-sum Box Abstraction. In Proceedings of the 2023 57th Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, USA, 29 October–1 November 2023; pp. 670–674.
Lu, Y.; Jafar, S.A. Quantum X-Secure T-Private Information Retrieval From MDS Coded Storage with Unresponsive and Byzantine Servers. IEEE J. Sel. Areas Inf. Theory 2025, 6, 59–73. [Google Scholar] [CrossRef]
Karpuk, D. Private Computation of Systematically Encoded Data with Colluding Servers. In Proceedings of the 2018 IEEE International Symposium on Information Theory (ISIT), Vail, CO, USA, 17–22 June 2018; pp. 2112–2116. [Google Scholar] [CrossRef]
Obead, S.A.; Lin, H.Y.; Rosnes, E.; Kliewer, J. Private Polynomial Function Computation for Noncolluding Coded Databases. IEEE Trans. Inf. Forensics Secur. 2022, 17, 1800–1813. [Google Scholar] [CrossRef]
Wang, Q.; Skoglund, M. On PIR and Symmetric PIR From Colluding Databases with Adversaries and Eavesdroppers. IEEE Trans. Inf. Theory 2019, 65, 3183–3197. [Google Scholar] [CrossRef]
Yu, Q.; Li, S.; Raviv, N.; Kalan, S.M.M.; Soltanolkotabi, M.; Avestimehr, S.A. Lagrange coded computing: Optimal design for resiliency, security, and privacy. In Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, PMLR, Naha, Japan, 16–18 April 2019; Volume 89, pp. 1215–1225. [Google Scholar]
Allaix, M.; Lu, Y.; Yao, Y.; Pllaha, T.; Hollanti, C.; Jafar, S.A. N-Sum Box: An Abstraction for Linear Computation Over Many-to-One Quantum Networks. IEEE Trans. Inf. Theory 2025, 71, 1121–1139. [Google Scholar] [CrossRef]

Figure 1. The $K$ messages are partitioned into $M$ disjoint message sets, where the $m$-th message set consists of $K_{m}$ messages.

Figure 2. An illustrative example of the storage–communication trade-off in the proposed GXSTPLC scheme, where $N = 8$, $X = 1$, $T = 1$, $K / K_{m} = M = 2$, and the storage pattern is given by $R_{1} = \{2, 3, 6, 7\}$, $R_{2} = \{1, 4, 5, 7, 8\}$. Notably, the top-left point $(D, C) = (\frac{10}{3}, 11)$ corresponds to the achievability scheme in [38], i.e., it achieves the minimum possible download cost. On the other hand, the achievability of the point $(D, C) = (4, 5.5)$ is illustrated as a motivating example in Section 4.5 and Section 5.

Table 1. Storage pattern of the augmented system in Example 1.

Server	$(1, 1)$	$(1, 2)$	$(2, 1)$	$(2, 2)$	$(2, 3)$	$(3, 1)$	$(3, 2)$	$(3, 3)$	$(4, 1)$
${\bar{M}}_{(n, i)}$	2	2	1	1	1	1	1	1	2
Server	$(4, 2)$	$(5, 1)$	$(5, 2)$	$(6, 1)$	$(6, 2)$	$(6, 3)$	$(7, 1)$	$(7, 2)$	$(7, 3)$
${\bar{M}}_{(n, i)}$	2	2	2	1	1	1	1, 2	1, 2	1
Server	$(8, 1)$	$(8, 2)$
${\bar{M}}_{(n, i)}$	2	2

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
