Robust Secure Authentication and Data Storage with Perfect Secrecy

Baur, Sebastian; Boche, Holger

doi:10.3390/cryptography2020008

Open AccessFeature PaperArticle

Robust Secure Authentication and Data Storage with Perfect Secrecy

by

Sebastian Baur

^* and

Holger Boche

Institute of Theoretical Information Technology, Technical University of München, 80333 München, Germany

^*

Author to whom correspondence should be addressed.

Cryptography 2018, 2(2), 8; https://doi.org/10.3390/cryptography2020008

Submission received: 29 January 2018 / Revised: 23 March 2018 / Accepted: 6 April 2018 / Published: 10 April 2018

(This article belongs to the Special Issue Physical Security in a Cryptographic Enviroment)

Download

Browse Figures

Versions Notes

Abstract

:

We consider an authentication process that makes use of biometric data or the output of a physical unclonable function (PUF), respectively, from an information theoretical point of view. We analyse different definitions of achievability for the authentication model. For the secrecy of the key generated for authentication, these definitions differ in their requirements. In the first work on PUF based authentication, weak secrecy has been used and the corresponding capacity regions have been characterized. The disadvantages of weak secrecy are well known. The ultimate performance criteria for the key are perfect secrecy together with uniform distribution of the key. We derive the corresponding capacity region. We show that, for perfect secrecy and uniform distribution of the key, we can achieve the same rates as for weak secrecy together with a weaker requirement on the distribution of the key. In the classical works on PUF based authentication, it is assumed that the source statistics are known perfectly. This requirement is rarely met in applications. That is why the model is generalized to a compound model, taking into account source uncertainty. We also derive the capacity region for the compound model requiring perfect secrecy. Additionally, we consider results for secure storage using a biometric or PUF source that follow directly from the results for authentication. We also generalize known results for this problem by weakening the assumption concerning the distribution of the data that shall be stored. This allows us to combine source compression and secure storage.

Keywords:

authentication; secure storage; perfect secrecy; privacy leakage

1. Introduction

The present work addresses two essential practical problems concerning secrecy in information systems. The first problem is authentication in order to manage access to the system. The second problem is secure storage in public databases. Both problems are of essential importance for further development of future communication systems. The goal of this work is to derive a fundamental characterization of the possible performance of such communication systems that meets very strict secrecy requirements. We show that these strict requirements can be met without loss in performance compared to known results with weaker secrecy requirements.

Information theoretic security has become a very active field of research in information theory in the past ten years, with a large number of promising approaches. For a current presentation, see [1]. In [2], the paper first introducing information theoretic security, the authors suggest requiring perfect secrecy [3] to guarantee security in communication. This means the data available to an attacker should be stochastically independent of the message that should be kept secret (the data and the message are modeled using random variables (RVs)). Thus, an attacker does not benefit from learning these data. In [4], this notion of security is weakened. The authors use weak secrecy [3] instead of perfect secrecy to guarantee secure communication. In many of the works on information theoretic security following [4], one considers weak secrecy or strong secrecy [3], which is yet another security requirement that is also weaker than perfect secrecy. As the name suggests, perfect secrecy is the desired ideal situation in cryptographic applications where an attacker does not get any information about the secret. Considering the roots of information theoretic security and its intuitive motivation, it suggests itself to require perfect secrecy for secure communications. Additionally, in [3], the recommendation is to not use weak secrecy as a secrecy measure. In [5], there is an example of a protocol that is obviously not secure, but meets the weak secrecy requirement.

The authors of the landmark paper [6] derive the capacity for secret key generation requiring perfect secrecy. A different model in information theoretic security has as an essential feature a biometric source or a PUF source. The outputs of biometric sources and the outputs of PUF sources both uniquely characterize a person [7], or a device, respectively [8]. This property qualifies them for being used for authentication as well as for secure storage. In [7,9], the authors consider a model for authentication using the output of a biometric source. They also consider a model that can be interpreted as a model for secure storage using a biometric source. Both of these models are very similar to the model for secret key generation and for both of the models the authors require weak secrecy to hold when defining achievability.

In [6,7,9], the authors assume that the statistics of the (PUF) source are perfectly known. A simple analysis of [6,7,9] shows that the protocols for authentication constructed there heavily depend on the knowledge of the source statistics. Particularly, it is possible that small variations of the source statistics influence the reliability and secrecy of the protocols for authentication or storage, respectively. The assumption that the source statistics are perfectly known is too optimistic in applications. That is why we are interested in considering the uncertainty of the source or PUF source. We assume that we do not know the statistics of the source, but that we know a set of source statistics that contains the actual source statistic. Thus, we consider a compound version of the source model. We want to develop robust protocols that work for all source statistics in a given set. The compound model also allows us to describe an attack scenario where the attacker is able to alter the source statistics. There are relatively few results concerning compound sources. The compound version of the source model from [6] is considered in [10].

One of our contributions in the present work is the generalization of the model for authentication from [7], by considering authentication using a compound PUF source (or equivalently a biometric source). Additionally, our work differs from the state of the art as we consider protocols for authentication that achieve perfect secrecy.

We also consider secure data storage making use of a PUF source (or equivalently a biometric source). The corresponding information theoretic model is very similar to the second model presented in [7], but, in contrast to [7], we define achievability requiring perfect secrecy and we consider source uncertainty of the PUF source. Our considerations concerning perfect secrecy in this work answer the question posed in the conclusion of [11].

Some of the results for secure authentication described in this work have already been published in [12]. Here, we additionally present the proofs that have been omitted in [12], i.e., the proofs of Theorem 4 and Theorem 5 and some more discussion. The results concerning secure storage have been presented in [13,14]. As these results heavily depend on [12], we briefly state them here (as well as the corresponding definitions).

In Section 2, we describe the authentication process and define the corresponding information theoretic model. We discuss different definitions of achievability for the model in Section 3. In this context, protocols that achieve perfect secrecy are of special interest. We develop the corresponding definition of achievability in this section. In Section 4, we prove capacity results for the model with respect to the various definitions of achievability. The main result in this section is Theorem 2. In Section 5, we generalize the model for authentication to the case with source uncertainty and define achievability for this model in Section 6. In Section 7, we derive the capacity region for the compound storage model. In Section 8, we consider some results for secure storage that follow from our results for authentication. The key result from authentication that we use for secure storage with perfect secrecy is Theorem 2. In Section 9, we further discuss our results.

For the most part, we use the notation introduced in [3].

2. Authentication Model

At first, we consider authentication using biometric or PUF data. This means we consider a scenario where a user enrolls in a system by giving a certain amount of biometric or PUF data to the system. Later, when the user wants to be authenticated, he again gives biometric or PUF data to the system. The system then decides if the user is accepted, i.e., if it is the same user that is enrolled in the system. In our considerations, we assume that the system can store some data in a public database.

Figure 1 depicts the authentication process as described in [7]. The process consists of two phases. In the first phase, the enrollment phase, the authentication system receives

X^{n}

from the PUF source and the

I D

of a user. It generates a helper message M and a secret-key K from

X^{n}

. It then uses a one-way function f on K and stores the result and M in a public database together with the user’s

I D

. The second phase is the authentication phase. In this phase, the system receives

Y^{n}

from the PUF source and the

I D

of a user. It reads the corresponding helper message M and

f (K)

from the database. From M and

Y^{n}

, it generates a secret-key

\hat{K}

. Then, the system compares

f (K)

and

f (\hat{K})

. If they are equal, the user is accepted; otherwise, the user is rejected.

Now, we define an information theoretic model of the authentication process. We use random variables (RVs) to model the data. In the first chapters of this work, we assume that the distribution of the RVs is perfectly known. We drop this assumption in Section 5.

Definition 1.

Let

n \in N

. The authentication model consists of a discrete memoryless multiple source (DMMS) with generic variables

X Y

[3], the (possibly randomized) encoders [3]

Φ : X^{n} \to M

,

Θ : X^{n} \to K

and the deterministic decoder

ψ : Y^{n} \times M \to \hat{K}

. Let

X^{n}

and

Y^{n}

be the output of the DMMS. The RVs M and K are generated from

X^{n}

using Φ and Θ. The RV

\hat{K}

is generated from

Y^{n}

and M using ψ. We use the term authentication protocol for

(Φ, Θ, ψ)

.

Remark 1.

It is possible to define the authentication protocol in a more general way by permitting randomized decoders Ψ, but one can argue that in our definition of achievability a randomized Ψ does not improve the performance of the protocols ([3], Problem 17.11). For convenience, we use the less general definition.

Remark 2.

We model the PUF source as a DMMS. Due to physically induced distortions, we model the biometric/PUF data read in the two phases as jointly distributed RVs.

Remark 3.

The distribution of

X Y

is assumed to be known and can be used for the generation of the RVs. Thus, the encoders and the decoder are allowed to depend on the distribution.

3. Various Definitions of Achievability

For the authentication model, we define achievable secret-key rate versus privacy-leakage rate pairs. Intuitively, we want the probability that a legitimate user is rejected in the authentication phase to be small. Thus,

\Pr (K = \hat{K})

should be large to fulfill this reliability condition. Additionally, the probability that an attacker is accepted in the authentication phase should be as small as possible. Thus, we consider the maximum false acceptance probability (mFAP) [15], which is the probability that an attacker using the best possible attack strategy is accepted in the authentication phase averaged over all public messages

m \in M

. As we want the mFAP to be as small as possible, we are interested in the largest possible set of secret keys

K

. This reasoning is explained below. The system uses the output of a PUF source as input so it should leak as little information about

X^{n}

as possible [7]. This motivates the following definition of achievable rate pairs.

Definition 2.

A tuple

(R, L)

,

R, L \geq 0

, is an achievable secret-key rate versus privacy-leakage rate pair for the authentication model if for every

δ > 0

there is an

n_{0} = n_{0} (δ)

such that for all

n \geq n_{0}

there exists an authentication protocol such that

\begin{matrix} \Pr (K = \hat{K}) & \geq 1 - δ, \\ mFAP & \leq \frac{1}{| K |}, \\ \frac{1}{n} log | K | & \geq R - δ, \\ \frac{1}{n} I (M; X^{n}) & \leq L + δ . \end{matrix}

(1)

We denote the corresponding authentication protocols by FAP-Protocols (False-Acceptance- Probability-Protocols).

Remark 4.

In [15], a very similar definition of achievability is used. Instead of considering the relation between the mFAP and the set of secret-keys (1), the authors define the false-acceptance exponent that describes the exponential decrease of the mFAP in n. A rate pair

(R, L)

that is achievable using FAP-protocols is also achievable according to the definition in [15], R playing the role of the false-acceptance exponent.

We now clarify the bound on the mFAP in Inequality (1) and our interest in large secret-key rates. For this purpose, we consider the following observation.

Lemma 1.

For a communication protocol fulfilling the reliability condition, it holds that

\begin{matrix} mFAP \geq \frac{1 - δ}{| K |} . \end{matrix}

Proof.

Introduce the RV E, setting

E = 1

for

K \neq \hat{K}

and

E = 0

, otherwise. Thus,

\begin{matrix} mFAP & = \sum_{m \in M} P_{M} (m) max_{y^{n} \in Y^{n}} P_{K | M} (ψ (y^{n}, m) | m) \\ \geq \sum_{m \in M} P_{M} (m) max_{y^{n} \in Y^{n}} P_{K | M E} (ψ (y^{n}, m) | m, 0) P_{E | M} (0 | m) \\ \overset{(a)}{=} \sum_{m \in M} P_{M E} (m, 0) max_{k \in K} P_{K | M E} (k | m, 0) \\ \geq \sum_{m \in M} P_{M E} (m, 0) \frac{1}{| K |} \\ \overset{(b)}{\geq} (1 - δ) \frac{1}{| K |} . \end{matrix}

Here, (a) follows as

P_{K | M E} (k | m, 0) = 0

if there is no

y^{n} \in Y^{n}

such that

ψ (y^{n}, m) = k

and (b) follows from the

δ

-recoverability of K from

\hat{K}

. ☐

Thus, Lemma 1 shows that requiring Inequality (1) is in fact equivalent to requiring the mFAP to be as small as possible. It also justifies our interest in a large set

K

.

There is another way to define achievable secret-key rate versus privacy-leakage rate pairs for the authentication model. Here, we want to keep the key secret from the attacker.

H (K | M)

can be interpreted as the average information required to specify k when m is known ([16], Chapter 2). Thus, we want

H (K | M)

to be as large as possible instead of requiring a small mFAP. This means we require

log | K | = H (K | M)

. This condition is equivalent to the combination of the perfect secrecy condition

I (K; M) = 0

[5] and the uniform distribution of the key, i.e.,

H (K) = log | K |

. Thus, we define achievability as follows.

Definition 3.

A tuple

(R, L)

,

R, L \geq 0

, is an achievable secret-key rate versus privacy-leakage rate pair for the authentication model if for every

δ > 0

there is an

n_{0} = n_{0} (δ)

such that for all

n \geq n_{0}

there exists an authentication protocol such that

\begin{matrix} \Pr (K = \hat{K}) & \geq 1 - δ, \\ (2) & H (K) & = log | K |, \\ (3) & I (M; K) & = 0, \\ \frac{1}{n} log | K | & \geq R - δ, \\ \frac{1}{n} I (M; X^{n}) & \leq L + δ . \end{matrix}

We denote the corresponding authentication protocols by PSA-Protocols (Perfect-Secrecy-Authentication-Protocols).

Remark 5.

In [6], the authors derive the secret-key capacity for the source model. They define achievability requiring perfect secrecy and uniform distribution of the key. They do not consider the privacy-leakage in contrast to our definition of achievability.

It is interesting to compare the rate pairs achievable with respect to the restrictive Definition 3 with commonly used weaker requirements. In ([7], Definition 3.1), the authors give a different definition of achievable secret-key rate versus privacy-leakage rate pairs. Instead of Eqation (2), they require

\begin{matrix} H (K) \geq log | K | - δ \end{matrix}

and instead of Equation (3) they require

\begin{matrix} \frac{1}{n} I (M; K) \leq δ, \end{matrix}

which is called the weak secrecy condition [5]. Thus, we get a third definition of achievability.

Definition 4

([7]). A tuple

(R, L)

,

R, L \geq 0

, is an achievable secret-key rate versus privacy-leakage rate pair for the authentication model if for every

δ > 0

there is an

n_{0} = n_{0} (δ)

such that for all

n \geq n_{0}

there exists an authentication protocol such that

\begin{matrix} \Pr (K = \hat{K}) & \geq 1 - δ, \\ H (K) & \geq log | K | - δ, \\ \frac{1}{n} I (M; K) & \leq δ, \\ \frac{1}{n} log | K | & \geq R - δ, \\ \frac{1}{n} I (M; X^{n}) & \leq L + δ . \end{matrix}

We denote the corresponding authentication protocols by WSA-Protocols (Weak-Secrecy-Authentication-Protocols).

Definition 5.

The set of achievable rate pairs that are achievable using PSA-Protocols is called the capacity region

R_{P S A}

. The set of achievable rate pairs that are achievable using WSA-Protocols is called the capacity region

R_{W S A}

and the set of achievable rate pairs that are achievable using FAP-Protocols is called the capacity region

R_{F A P}

.

Now, we look at some straightforward relations between these capacity regions. We can directly see that Definition 3 is more restrictive than Definition 4 so a PSA-Protocol is also a WSA-Protocol and thus

\begin{matrix} R_{P S A} \subset R_{W S A} . \end{matrix}

(4)

We now show that a PSA-Protocol is also a FAP-Protocol.

Lemma 2.

It holds that

\begin{matrix} R_{P S A} \subset R_{F A P} . \end{matrix}

Proof.

As Equations (2) and (3) imply,

P_{K | M} (k | m) = \frac{1}{| K |}

for all

(k, m) \in K \times M

, we have

\begin{matrix} mFAP & = \sum_{m \in M} P_{M} (m) max_{y^{n} \in Y^{n}} P_{K | M} (ψ (y^{n}, m) | m) \\ \leq \sum_{m \in M} P_{M} (m) max_{k \in K} P_{K | M} (k | m) = \frac{1}{| K |} . \end{matrix}

☐

4. Capacity Regions for the Authentication Model

In ([7], Theorem 3.1), the authors derive the capacity region

R_{W S A}

.

Theorem 1

([7]). It holds that

\begin{matrix} R_{W S A} = ⋃_{U} {(R, L) : & 0 \leq R \leq I (U; Y), L \geq I (U; X) - I (U; Y)} . \end{matrix}

The union is over all RVs U such that

U - X - Y

. We only have to consider RVs U with

| U | \leq | X | + 1

.

Remark 6.

The authors of [7] do not consider randomized encoders. In contrast, we permit randomization of the encoders in the enrollment phase. Using the strategy described in ([3], Problem 17.15), one can use the converse for deterministic encoders to prove the converse for randomized encoders with the same bounds on the secret-key rate and the privacy-leakage rate. Thus, the converse in [7] also holds true when randomization is permitted.

The following theorem is one of our main results.

Theorem 2.

It holds that

\begin{matrix} R_{P S A} = R_{W S A} . \end{matrix}

Proof.

We do not prove Theorem 2 here but prove a more general result in the remainder of the text. This result is Theorem 5. It is more general as it is concerned with a compound version of the authentication model. The authentication model is a special case of the compound authentication model where the compound set consists of a single DMMS. ☐

We now strengthen Lemma 2.

Theorem 3.

It holds that

\begin{matrix} R_{P S A} = R_{F A P} . \end{matrix}

Proof.

The achievability result is implied by Lemma 2. For the converse, we use a result of [15]. As discussed in Remark 4, a rate pair

(R, L),

which is achievable according to Definition 2 is also achievable according to the definition of achievability used in [15], where R plays the role of the false acceptance exponent E. Thus, we use ([15], Theorem 4), which says that a rate pair

(E, L) \notin R_{W S A}

is not achievable. This implies our converse. ☐

5. Compound Authentication Model

We now consider authentication when the data source is not perfectly known. Figure 2 shows the corresponding authentication process. The only difference to the authentication process in Section 2 is the source uncertainty. As one can see in Figure 2, we even assume that an attacker can influence the source in the sense that the state of the source is altered, i.e., it generates another statistic. If the protocol for authentication is not robust, then authentication will not work.

We define the following information theoretic model for this authentication process with source uncertainty.

Definition 6.

Let

n \in N

. The compound authentication model consists of a set

S

of DMMSs with generic variables

X_{s} Y_{s}

,

s \in S

, (all on the same alphabets

X

and

Y

), the (possibly randomized) encoders

Φ : X^{n} \to M

,

Θ : X^{n} \to K

and the (possibly randomized) decoder

Ψ : Y^{n} \times M \to \hat{K}

. Let

X^{n}

and

Y^{n}

be the output of one of the DMMSs in

S

, i.e.,

P_{X Y} = P_{X_{s} Y_{s}}

for an

s \in S

, but s is not known. The RVs M and K are generated from

X^{n}

using Φ and Θ. The RV

\hat{K}

is generated from

Y^{n}

and M using Ψ. We use the term compound authentication protocol for

(Φ, Θ, Ψ)

.

Remark 7.

The uncertainty of the data source is modeled making use of a compound DMMS, that is, the DMMS modeling the PUF source is not known, but we know a set of DMMSs to which the actual DMMS belongs.

Remark 8.

S

is assumed to be known and can be used for the generation of the RVs, that is, the encoder and the decoder can depend on these distributions.

Definition 7.

Given

S

, we define the set

\begin{matrix} I (\hat{s}) = {s \in S : \sum_{y \in Y} P_{X_{s} Y_{s}} (x, y) = P_{X_{\hat{s}}} (x) \forall x \in X} \end{matrix}

for

\hat{s} \in S

. The sets

I (\hat{s})

,

\hat{s} \in S

, form a partition of

S

, as they form the equivalence classes for the corresponding equivalence relation. We denote a set of representatives by

\hat{S}

.

6. Achievability for the Compound Model

For the compound authentication model, we define achievable secret-key rate versus privacy-leakage rate pairs.

Definition 8.

A tuple

(R, L)

,

R, L \geq 0

, is an achievable secret-key rate versus privacy-leakage rate pair for the compound authentication model if for every

δ > 0

there is an

n_{0} = n_{0} (δ)

such that, for all

n \geq n_{0}

, there exists a compound authentication protocol such that, for all

s \in S,

\begin{matrix} (5) & \Pr (K & = \hat{K}) \geq 1 - δ, \\ (6) & H (K) & = log | K |, \\ (7) & I (M; K) & = 0, \\ \frac{1}{n} log | K | & \geq R - δ, \\ \frac{1}{n} I (M; X^{n}) & \leq L + δ, \end{matrix}

where

P_{X Y} = P_{X_{s} Y_{s}}

. We denote the corresponding authentication protocols by PSCA-Protocols (Perfect-Secrecy-Compound-Authentication-Protocols).

Definition 9.

The set of achievable secret-key versus privacy-leakage rate pairs that are achievable using PSCA-Protocols is called the compound capacity region

R_{P S C A} (S)

.

7. Capacity Regions for the Compound Authentication Model

We now derive the compound capacity region

R_{P S C A} (S)

for the compound authentication model. We only consider compound sets

S

such that

| \hat{S} | < \infty

. For the proof, we need the following theorem, which is a generalization of ([3], Theorem 6.10).

Theorem 4.

Given a (possibly infinite) set

W

of channels

W : X \to Y

, a set

A \subset X^{n}

with

P^{n} (A) > η

,

P \in P (X)

,

η > 0

and

ϵ > 0

. Then, for every

τ > 0

and all n large enough, there is a pair of mappings

(f, ϕ)

,

f : M_{f} \to X^{n}

,

ϕ : Y^{n} \to M_{f}

, such that

(f, ϕ)

is an

(n, ϵ)

-code for all

W \in W

with codewords in A and

\begin{matrix} \frac{1}{n} log | M_{f} | \geq inf_{W \in W} I (P; W) - τ . \end{matrix}

We call this pair of mappings a compound

(n, ϵ)

-code for

W

.

Even though the proof of Theorem 4 is very similar to the proof of ([3], Theorem 6.10), the proof of ([17], Theorem 4.3) and the proof of the results in [18], we prove Theorem 4 for the sake of completeness. The proof can be found in Appendix A.

Theorem 5.

It holds that

\begin{matrix} R_{P S C A} (S) & = ⋂_{\hat{s} \in \hat{S}} ⋃_{U_{\hat{s}}} {(R, L) : 0 \leq R \leq inf_{s \in I (\hat{s})} I (U_{\hat{s}}; Y_{s}), L \geq sup_{s \in I (\hat{s})} I (U_{\hat{s}}; X_{s}) - I (U_{\hat{s}}; Y_{s})} \\ \overset{(a)}{=} ⋂_{\hat{s} \in \hat{S}} ⋃_{U_{\hat{s}}} R_{\hat{s}}^{(P S C A)} (S, U_{\hat{s}}), \end{matrix}

where, for (

a)

, we define

R_{\hat{s}}^{(P S C A)} (S, U_{\hat{s}})

appropriately. For all

\hat{s} \in \hat{S},

the union is over all RVs

U_{\hat{s}}

such that, for all

s \in I (\hat{s}),

we have

U_{\hat{s}} - X_{s} - Y_{s}

. For

| S | < \infty

, we only have to consider RVs

U_{\hat{s}}

with

| U_{\hat{s}} | \leq | X | + | I (\hat{s}) |

.

Proof.

For all

\hat{s} \in \hat{S}

and all

s \in I (\hat{s})

, let

U_{\hat{s}}

,

X_{s}

and

Y_{s}

be RVs where

X_{s} Y_{s}

are the output of the DMMS in

S

with index s and

X_{s}

and

U_{\hat{s}}

are connected by the channel

V_{\hat{s}} : X \to U_{\hat{s}}

. Thus, we have the Markov chains

U_{\hat{s}} - X_{s} - Y_{s}

for all

s \in I (\hat{s})

. Let

U = ⋃_{\hat{s} \in \hat{S}} U_{\hat{s}}

. We now show that, given

δ > 0

, for n large enough we can choose a set

C \subset U^{n}

that consists of

| M |

disjoint subsets

C_{m}

with the following properties.

We consider a partition of the set of all sets $C_{m}$ in $| \hat{S} |$ subsets. Thus, we denote the sets $C_{m}$ by $C_{m, \hat{s}}$ , $\hat{s} \in \hat{S}$ , indicating to which subset they belong. We denote the set of indices m corresponding to $\hat{s}$ by $M_{\hat{s}}$ . For each $C_{m, \hat{s}}$ , we have

$\begin{matrix} | C_{m, \hat{s}} | = ⌈ inf_{s \in I (\hat{s})} exp (n (I (U_{\hat{s}}; Y_{s}) - δ)) ⌉ . \end{matrix}$
Each $C_{m, \hat{s}}$ consists of sequences of the same type.
It holds that

$\begin{matrix} P_{U_{\hat{s}}}^{n} (C) > 1 - η \end{matrix}$

(8)

for $η > 0$ and all $\hat{s} \in \hat{S}$ .
For each $\hat{s} \in \hat{S}$ , one can define pairs of mappings that are compound $(n, ϵ)$ -codes, $ϵ > 0$ , for the channels $W_{s} : U \to Y$ , $W_{s} = P_{Y_{s} | U_{\hat{s}}}$ for all $s \in I (\hat{s})$ in the following way. Define an (arbitrary) bijective mapping $f_{m} : {1 \dots | C_{m, \hat{s}} |} \to C_{m, \hat{s}}$ and an appropriate mapping $ϕ_{m} : Y^{n} \to {1 \dots | C_{m, \hat{s}} |}$ . Then, $(f_{m}, ϕ_{m})$ is such a code. This means

$\begin{matrix} W_{s}^{n} (ϕ_{m}^{- 1} (f_{m}^{- 1} (u^{n})) | u^{n}) \geq 1 - ϵ \end{matrix}$

(9)

for all $s \in I (\hat{s})$ and for all codewords $u^{n}$ in $C_{m, \hat{s}}$ . This is possible for all $m \in M_{\hat{s}}$ .

Let

δ^{'} > 0

. We denote the elements of

\hat{S}

by

{\hat{s}}_{1}, {\hat{s}}_{2}, \dots, {\hat{s}}_{| \hat{S} |}

. We consider

T_{P_{U_{{\hat{s}}_{1}}}, ξ}^{n}, T_{P_{U_{{\hat{s}}_{2}}}, ξ}^{n}, \dots, T_{P_{U_{{\hat{s}}_{| \hat{S} |}}}, ξ}^{n}

,

ξ > 0

, which are disjoint subsets of

U^{n}

. We show that they are in fact disjoint subsets of

U^{n}

for

ξ

small enough. This can be seen as follows. For

{\hat{s}}_{i}, {\hat{s}}_{j} \in \hat{S}

,

{\hat{s}}_{i} \neq {\hat{s}}_{j}

, it holds that

P_{U_{{\hat{s}}_{i}}} (u) \neq P_{U_{{\hat{s}}_{j}}} (u)

for at least one

u \in U

. Thus, there is a

u \in U

with

\begin{matrix} | P_{U_{{\hat{s}}_{i}}} (u) - P_{U_{{\hat{s}}_{j}}} (u) | > α \end{matrix}

for some

α > 0

.

Now, assume that there is a

u^{n} \in T_{P_{U_{{\hat{s}}_{i}}}, ξ}^{n} \cap T_{P_{U_{{\hat{s}}_{j}}}, ξ}^{n}

. Denote the type of

u^{n}

by

p_{u^{n}}

. Thus, there is a

u \in U

with

\begin{matrix} α & < | P_{U_{{\hat{s}}_{i}}} (u) - P_{U_{{\hat{s}}_{j}}} (u) | \\ = | P_{U_{{\hat{s}}_{i}}} (u) - P_{u^{n}} (u) + P_{u^{n}} (u) - P_{U_{{\hat{s}}_{j}}} (u) | \\ \leq | P_{U_{{\hat{s}}_{i}}} (u) - P_{u^{n}} (u) | + | P_{U_{{\hat{s}}_{j}}} (u) - P_{u^{n}} (u) | \leq 2 ξ, \end{matrix}

where the last inequality follows from the assumption that

u^{n} \in T_{P_{U_{{\hat{s}}_{i}}}, ξ}^{n} \cap T_{P_{U_{{\hat{s}}_{j}}}, ξ}^{n}

. Thus, for

ξ < α / 2

, this is a contradiction and we know

T_{P_{U_{{\hat{s}}_{i}}}, ξ}^{n}

and

T_{P_{U_{{\hat{s}}_{j}}}, ξ}^{n}

are disjoint.

We start the construction of

C

by choosing a set

A_{1, {\hat{s}}_{1}} \subset T_{P_{U_{{\hat{s}}_{1}}}, ξ}^{n}

with

P_{U_{{\hat{s}}_{1}}}^{n} (A_{1, {\hat{s}}_{1}}) \geq η^{'}

with

η > η^{'} > 0

. According to Theorem 4, there is a compound

(n, ϵ)

-code for the channels

W_{s}

,

s \in I ({\hat{s}}_{1})

with at least

\begin{matrix} ⌈ inf_{s \in I ({\hat{s}}_{1})} exp (n (I (U_{{\hat{s}}_{1}}; Y_{s}) - δ^{'})) ⌉ \end{matrix}

codewords

u^{n} \in A_{1, {\hat{s}}_{1}}

for n large enough. We denote the set of these codewords by

C_{1, {\hat{s}}_{1}}^{'}

. As there are less than

{(n + 1)}^{| U |}

types, we know that there is a set of at least

\begin{matrix} ⌈\frac{⌈ {inf}_{s \in I ({\hat{s}}_{1})} exp (n (I (U_{{\hat{s}}_{1}}; Y_{s}) - δ^{'})) ⌉}{{(n + 1)}^{| U |}}⌉ \end{matrix}

codewords in

C_{1, {\hat{s}}_{1}}^{'}

with the same type. We only pick these codewords. There are at least

\begin{matrix} ⌈inf_{s \in I ({\hat{s}}_{1})} exp (n (I (U_{{\hat{s}}_{1}}; Y_{s}) - δ^{'} - | U | \frac{log (n + 1)}{n}))⌉ \geq ⌈ inf_{s \in I ({\hat{s}}_{1})} exp (n (I (U_{{\hat{s}}_{1}}; Y_{s}) - δ)) ⌉ \end{matrix}

of them for n large enough. We now pick exactly

\begin{matrix} ⌈ inf_{s \in I ({\hat{s}}_{1})} exp (n (I (U_{{\hat{s}}_{1}}; Y_{s}) - δ)) ⌉ \end{matrix}

of these codewords and we denote this set by

C_{1, {\hat{s}}_{1}}

. Now, we choose a set

A_{2, {\hat{s}}_{1}} \subset T_{P_{U_{{\hat{s}}_{1}}}, ξ}^{n} \ C_{1, {\hat{s}}_{1}}

with

P_{U}^{n} (A_{2, {\hat{s}}_{1}}) \geq η^{'}

. We construct the set

C_{2, {\hat{s}}_{1}}

in the same way as

C_{1, {\hat{s}}_{1}}

. Thus,

C_{2, {\hat{s}}_{1}}

is a set of

\begin{matrix} ⌈ inf_{s \in I ({\hat{s}}_{1})} exp (n (I (U_{{\hat{s}}_{1}}; Y_{s}) - δ)) ⌉ \end{matrix}

codewords of the same type corresponding to an

(n, ϵ)

-code. We continue this process until we can not find a set

\begin{matrix} A_{| M_{{\hat{s}}_{1}} | + 1, {\hat{s}}_{1}} \subset T_{P_{U_{{\hat{s}}_{1}}}, ξ}^{n} \ ⋃_{i \in M_{{\hat{s}}_{1}}} C_{i, {\hat{s}}_{1}} \end{matrix}

with

\begin{matrix} P_{U_{{\hat{s}}_{1}}}^{n} (A_{| M_{{\hat{s}}_{1}} | + 1, {\hat{s}}_{1}}) \geq η^{'} . \end{matrix}

This means

\begin{matrix} P_{U_{{\hat{s}}_{1}}}^{n} ({(⋃_{i \in M_{{\hat{s}}_{1}}} C_{i, {\hat{s}}_{1}})}^{c} \cap T_{P_{U_{{\hat{s}}_{1}}}, ξ}^{n}) < η^{'} . \end{matrix}

We repeat this process for all

\hat{s} \neq {\hat{s}}_{1}

,

\hat{s} \in \hat{S}

. Thus, we have for all

\hat{s} \in \hat{S}

\begin{matrix} P_{U_{\hat{s}}}^{n} (C) & \geq P_{U_{\hat{s}}}^{n} (⋃_{i \in M_{\hat{s}}} C_{i, \hat{s}}) \\ = 1 - P_{U_{\hat{s}}}^{n} ({(⋃_{i \in M_{\hat{s}}} C_{i, \hat{s}})}^{c}) \\ = 1 - P_{U_{\hat{s}}}^{n} ({(⋃_{i \in M_{\hat{s}}} C_{i, \hat{s}})}^{c} \cap T_{P_{U_{\hat{s}}}, ξ}^{n}) - P_{U_{\hat{s}}}^{n} ({(⋃_{i \in M_{\hat{s}}} C_{i, \hat{s}})}^{c} \cap {(T_{P_{U_{\hat{s}}}, ξ}^{n})}^{c}) \\ \geq 1 - P_{U_{\hat{s}}}^{n} ({(⋃_{i \in M_{\hat{s}}} C_{i, \hat{s}})}^{c} \cap T_{P_{U_{\hat{s}}}, ξ}^{n}) - P_{U_{\hat{s}}}^{n} ({(T_{P_{U_{\hat{s}}}, ξ}^{n})}^{c}) . \end{matrix}

Thus, we have Inequality (8) for n large enough.

We now can define the encoders/decoders

Φ

,

Θ

and

Ψ

.

We define $Φ$ and $Θ$ as follows. The system gets a sequence $x^{n}$ . It checks if $x^{n} \in T_{P_{X_{\hat{s}}}, ξ^{'}}^{n}$ , $ξ^{'} > 0$ , for an $\hat{s} \in \hat{S}$ (We can choose $ξ^{'}$ small enough and n large enough such that the $T_{P_{X_{\hat{s}}}, ξ^{'}}^{n}$ are disjoint). If this is true for $\hat{s}$ , the channel $V_{\hat{s}}$ is used n times to generate $u^{n}$ from $x^{n}$ . For $Φ$ , the system looks in $C$ for $u^{n}$ . If $u^{n} \in C$ the system chooses for m the index of the subset $C_{m}$ containing $u^{n}$ . If $u^{n} \notin C$ it chooses an arbitrary $m \in M$ . In addition, if $x^{n} \notin ⋃_{\hat{s} \in \hat{S}} T_{P_{X_{\hat{s}}}, ξ^{'}}^{n}$ , it chooses an arbitrary $m \in M$ . For $Θ$ , the system looks in $C$ for $u^{n}$ . If $u^{n} \in C$ , it considers the compound $(n, ϵ)$ -code corresponding to the subset $C_{m, \hat{s}}$ containing $u^{n}$ . If

$\begin{matrix} | C_{m, \hat{s}} | > min_{\hat{s} \in \hat{S}} ⌈ inf_{s \in I (\hat{s})} exp (n (I (U_{\hat{s}}; Y_{s}) - δ)) ⌉, \end{matrix}$

we consider the following deterministic mapping $h_{m} : f_{m}^{- 1} (C_{m}) \to K \cup {\tilde{k}}$ . Here,

$\begin{matrix} K = {1 \dots min_{\hat{s} \in \hat{S}} ⌈ inf_{s \in I (\hat{s})} exp (n (I (U_{\hat{s}}; Y_{s}) - δ)) ⌉} . \end{matrix}$

The preimage of any $k \in K$ under $h_{m}$ is a subset of $f_{m}^{- 1} (C_{m})$ of size

$\begin{matrix} ⌊\frac{| C_{m, \hat{s}} |}{{min}_{\hat{s} \in \hat{S}} ⌈ {inf}_{s \in I (\hat{s})} exp (n (I (U_{\hat{s}}; Y_{s}) - δ)) ⌉}⌋ . \end{matrix}$

The rest of the $k^{'} \in f_{m}^{- 1} (C_{m})$ is mapped on $\tilde{k} \notin K$ . If

$\begin{matrix} | C_{m, \hat{s}} | = min_{\hat{s} \in \hat{S}} ⌈ inf_{s \in I (\hat{s})} exp (n (I (U_{\hat{s}}; Y_{s}) - δ)) ⌉, \end{matrix}$

the system chooses $k = f_{m}^{- 1} (u^{n})$ . In this case, we also define $h_{m} : f_{m}^{- 1} (C_{m}) \to K \cup {\tilde{k}}$ where $h_{m}$ is injective. If $u^{n} \notin C$ , k is chosen at random according to a uniform distribution on the alphabet. The same holds if $u^{n}$ is mapped on $\tilde{k}$ or if $x^{n} \notin ⋃_{\hat{s} \in \hat{S}} T_{P_{X, \hat{s}}, ξ^{'}}^{n}$ .
We define $Ψ$ as follows. The system gets a sequence $y^{n}$ and m. It decodes $y^{n}$ using the code corresponding to $C_{m, \hat{s}}$ . Then, $h_{m}$ is used on the result. The result is $\hat{k}$ if it differs from $\tilde{k}$ . Otherwise, an arbitrary $\hat{k} \in K$ is chosen.

Using the properties of the communication protocol, we analyse the achievability conditions. We denote the outputs of the DMMS by

X^{n}

and

Y^{n}

and the output of the channel used on

X^{n}

by

U^{n}

. Assume the index of the DMMS is

s \in I (\hat{s})

,

\hat{s} \in S

. Thus,

P_{X^{n} Y^{n}} = P_{X_{s} Y_{s}}^{n}

.

We define the following events:

$\begin{matrix} E_{1} = & {(x^{n}, y^{n}, u^{n}) \in X^{n} \times Y^{n} \times U^{n} : (x^{n}, u^{n}) \notin T_{P_{X_{s} U_{\hat{s}}}, ξ^{″}}^{n}}, \\ E_{2} = & {(x^{n}, y^{n}, u^{n}) \in X^{n} \times Y^{n} \times U^{n} : u^{n} \notin C}, \\ E_{3} = & ⋃_{m \in M} {(x^{n}, y^{n}, u^{n}) \in X^{n} \times Y^{n} \times U^{n} : u^{n} \in C_{m} \land h_{m} (f_{m}^{- 1} (u^{n})) = \tilde{k}}, \\ E_{4} = & ⋃_{m \in M} {(x^{n}, y^{n}, u^{n}) \in X^{n} \times Y^{n} \times U^{n} : u^{n} \in C_{m} \land f_{m}^{- 1} (u^{n}) \neq ϕ_{m} (y^{n})} . \end{matrix}$

According to ([3], Lemma 2.10), we can choose $ξ^{″}$ small enough such that $(x^{n}, u^{n}) \in T_{P_{X_{s} U_{\hat{s}}}, ξ^{″}}^{n}$ implies $x^{n} \in T_{P_{X_{s}}, ξ^{'}}^{n}$ and $u^{n} \in T_{P_{U_{\hat{s}}}, ξ}^{n}$ . We have

$\begin{matrix} P_{X^{n} Y^{n} U^{n}} (E_{1}) & = 1 - P_{X^{n} Y^{n} U^{n}} (E_{1}^{c}) \\ \overset{(a)}{=} 1 - P_{X_{s} Y_{s} U_{\hat{s}}}^{n} (E_{1}^{c}) = 1 - P_{X_{s} U_{\hat{s}}}^{n} (T_{P_{X_{s} U_{\hat{s}}}, ξ^{″}}^{n}) \\ = P_{X_{s} U_{\hat{s}}}^{n} ({(T_{P_{X_{s} U_{\hat{s}}}, ξ^{″}}^{n})}^{c}) . \end{matrix}$

Here, $(a)$ follows as for $x^{n} \in T_{P_{X_{s}}, ξ^{'}}^{n}$ the system uses $V_{\hat{s}}$ to generate $u^{n}$ from $x^{n}$ . Thus,

$\begin{matrix} \Pr (K \neq \hat{K}) & \leq P_{X^{n} Y^{n} U^{n}} (E_{1} \cup E_{2} \cup E_{3} \cup E_{4}) \\ = P_{X^{n} Y^{n} U^{n}} (E_{1}) + P_{X^{n} Y^{n} U^{n}} ((E_{2} \cup E_{3} \cup E_{4}) \cap E_{1}^{c}) \\ = P_{X_{s} U_{\hat{s}}}^{n} ({(T_{P_{X_{s} U_{\hat{s}}}, ξ^{″}}^{n})}^{c}) + P_{X^{n} Y^{n} U^{n}} (E_{2} \cap E_{1}^{c}) + P_{X^{n} Y^{n} U^{n}} ((E_{3} \cup E_{4}) \cap E_{1}^{c} \cap (E_{2}^{c} \cup E_{1})) . \end{matrix}$

Now, we use

$\begin{matrix} P_{X^{n} Y^{n} U^{n}} (E_{2} \cap E_{1}^{c}) & \leq \sum_{\begin{matrix} (x^{n}, u^{n}) : \\ x^{n} \in T_{P_{X_{s}}, ξ^{'}}^{n} \\ \land u^{n} \in C^{c} \end{matrix}} P_{X^{n} U^{n}} (x^{n}, u^{n}) \\ = \sum_{\begin{matrix} (x^{n}, u^{n}) : \\ x^{n} \in T_{P_{X_{s}}, ξ^{'}}^{n} \\ \land u^{n} \in C^{c} \end{matrix}} P_{X_{s} U_{\hat{s}}}^{n} (x^{n}, u^{n}) \\ \leq \sum_{\begin{matrix} (x^{n}, u^{n}) : \\ x^{n} \in X^{n} \\ \land u^{n} \in C^{c} \end{matrix}} P_{X_{s} U_{\hat{s}}}^{n} (x^{n}, u^{n}) = P_{U_{\hat{s}}}^{n} (C^{c}) \end{matrix}$

and get

$\begin{matrix} \Pr (K \neq \hat{K}) & \leq P_{X_{s} U_{\hat{s}}}^{n} ({(T_{P_{X_{s} U_{\hat{s}}}, ξ^{″}}^{n})}^{c}) + P_{U_{\hat{s}}}^{n} (C^{c}) + P_{X^{n} Y^{n} U^{n}} ((E_{3} \cup E_{4}) \cap E_{1}^{c} \cap E_{2}^{c}) \\ \leq P_{X_{s} U_{\hat{s}}}^{n} ({(T_{P_{X_{s} U_{\hat{s}}}, ξ^{″}}^{n})}^{c}) + P_{U_{\hat{s}}}^{n} (C^{c}) + P_{X^{n} Y^{n} U^{n}} (E_{4} \cap E_{1}^{c} \cap E_{2}^{c}) + P_{X^{n} Y^{n} U^{n}} (E_{3} \cap E_{1}^{c} \cap E_{2}^{c}) . \end{matrix}$

Now, we define the RV $E = e (X^{n}, U^{n})$ with $e : X^{n} \times U^{n} \to {0, 1}$

$\begin{matrix} e (x^{n}, u^{n}) = \{\begin{matrix} 0, & for u^{n} \in C \land x^{n} \in ⋃_{\hat{s} \in \hat{S}} T_{P_{X_{\hat{s}}}, ξ^{'}}^{n}, \\ 1, & otherwise . \end{matrix} \end{matrix}$

We have

$\begin{matrix} \Pr (K \neq \hat{K}) & \leq P_{X_{s} U_{\hat{s}}}^{n} ({(T_{P_{X_{s} U_{\hat{s}}}, ξ^{″}}^{n})}^{c}) + P_{U_{\hat{s}}}^{n} (C^{c}) \\ + \sum_{m \in M} P_{M} (m) P_{X^{n} Y^{n} U^{n} | M} (E_{4} \cap E_{1}^{c} \cap E_{2}^{c} | m) \\ + \sum_{m \in M} P_{M E} (m, 0) P_{X^{n} Y^{n} U^{n} | M E} (E_{3} \cap E_{1}^{c} \cap E_{2}^{c} | m, 0) \end{matrix}$

as for all $m \in M$

$\begin{matrix} P_{X^{n} Y^{n} U^{n} | M E} (E_{3} \cap E_{1}^{c} \cap E_{2}^{c} | m, 1) = 0 . \end{matrix}$

As $u^{n} \in C$ and $u^{n} \in T_{P_{U_{\hat{s}}}, ξ}^{n}$ imply $u^{n} \in C_{m}$ for an $m \in M_{\hat{s}}$ , we know

$\begin{matrix} P_{X^{n} Y^{n} U^{n} | M} (E_{4} \cap E_{1}^{c} \cap E_{2}^{c} | m) = 0 \end{matrix}$

and

$\begin{matrix} P_{X^{n} Y^{n} U^{n} | M} (E_{3} \cap E_{1}^{c} \cap E_{2}^{c} | m) = 0 \end{matrix}$

for $m \notin M_{\hat{s}}$ . Thus, we have

$\begin{matrix} \Pr (K \neq \hat{K}) & \leq P_{X_{s} U_{\hat{s}}}^{n} ({(T_{P_{X_{s} U_{\hat{s}}}, ξ^{″}}^{n})}^{c}) + P_{U_{\hat{s}}}^{n} (C^{c}) \\ + \sum_{m \in M_{\hat{s}}} P_{M} (m) P_{X^{n} Y^{n} U^{n} | M} (E_{4} \cap E_{1}^{c} \cap E_{2}^{c} | m) \\ + \sum_{m \in M_{\hat{s}}} P_{M E} (m, 0) P_{X^{n} Y^{n} U^{n} | M E} (E_{3} \cap E_{1}^{c} \cap E_{2}^{c} | m, 0) . \end{matrix}$

We know for $m \in M_{\hat{s}}$

$\begin{matrix} P_{X^{n} Y^{n} U^{n} | M} (E_{4} \cap E_{1}^{c} \cap E_{2}^{c} | m) \\ \leq \sum_{\begin{matrix} (x^{n}, y^{n}, u^{n}) : \\ f_{m}^{- 1} (u^{n}) \neq ϕ_{m} (y^{n}) \\ \land u^{n} \in C_{m} \land x^{n} \in T_{P_{X_{s}}, ξ^{'}}^{n} \end{matrix}} P_{X^{n} Y^{n} U^{n} | M} (x^{n}, y^{n}, u^{n} | m) \\ = \sum_{\begin{matrix} (x^{n}, y^{n}, u^{n}) : \\ f_{m}^{- 1} (u^{n}) \neq ϕ_{m} (y^{n}) \\ \land u^{n} \in C_{m} \land x^{n} \in T_{P_{X_{s}}, ξ^{'}}^{n} \end{matrix}} P_{X^{n} | U^{n} Y^{n} M} (x^{n} | u^{n}, y^{n}, m) P_{Y^{n} | U^{n} M} (y^{n} | u^{n}, m) P_{U^{n} | M} (u^{n} | m) . \end{matrix}$

Using $M - U^{n} - Y^{n}$ , we have

$\begin{matrix} P_{X^{n} Y^{n} U^{n} | M} (E_{4} \cap E_{1}^{c} \cap E_{2}^{c} | m) \\ \leq \sum_{\begin{matrix} (x^{n}, y^{n}, u^{n}) : \\ f_{m}^{- 1} (u^{n}) \neq ϕ_{m} (y^{n}) \\ \land u^{n} \in C_{m} \land x^{n} \in T_{P_{X_{s}}, ξ^{'}}^{n} \end{matrix}} P_{X^{n} | U^{n} Y^{n} M} (x^{n} | u^{n}, y^{n}, m) P_{Y^{n} | U^{n}} (y^{n} | u^{n}) P_{U^{n} | M} (u^{n} | m) \\ = \sum_{\begin{matrix} (x^{n}, y^{n}, u^{n}) : \\ f_{m}^{- 1} (u^{n}) \neq ϕ_{m} (y^{n}) \\ \land u^{n} \in C_{m} \land x^{n} \in T_{P_{X_{s}}, ξ^{'}}^{n} \end{matrix}} P_{X^{n} | U^{n} Y^{n} M} (x^{n} | u^{n}, y^{n}, m) W_{s}^{n} (y^{n} | u^{n}) P_{U^{n} | M} (u^{n} | m) \\ \leq \sum_{\begin{matrix} (x^{n}, y^{n}, u^{n}) : \\ f_{m}^{- 1} (u^{n}) \neq ϕ_{m} (y^{n}) \\ \land u^{n} \in C_{m} \end{matrix}} P_{X^{n} | U^{n} Y^{n} M} (x^{n} | u^{n}, y^{n}, m) W_{s}^{n} (y^{n} | u^{n}) P_{U^{n} | M} (u^{n} | m) \\ = \sum_{\begin{matrix} (y^{n}, u^{n}) : \\ f_{m}^{- 1} (u^{n}) \neq ϕ_{m} (y^{n}) \\ \land u^{n} \in C_{m} \end{matrix}} W_{s}^{n} (y^{n} | u^{n}) P_{U^{n} | M} (u^{n} | m) \\ = \sum_{u^{n} \in C_{m}} W_{s}^{n} ({(ϕ_{m}^{- 1} (f_{m}^{- 1} (u^{n})))}^{c} | u^{n}) P_{U^{n} | M} (u^{n} | m) . \end{matrix}$

Thus, using Inequality (9), we have

$\begin{matrix} \sum_{m \in M_{\hat{s}}} P_{M} (m) P_{X^{n} Y^{n} U^{n} | M} (E_{4} \cap E_{1}^{c} \cap E_{2}^{c} | m) \leq ϵ \end{matrix}$

for n large enough. Now, consider $u^{n} \in C_{m}$ , $m \in M$ . We get

$\begin{matrix} P_{U^{n} | M E} (u^{n} | m, 0) & = \sum_{x^{n} \in X^{n}} P_{U^{n} X^{n} | M E} (u^{n}, x^{n} | m, 0) \\ = \sum_{\hat{s} \in \hat{S}} \sum_{x^{n} \in T_{P_{X_{\hat{s}}}, ξ^{'}}^{n}} P_{U^{n} X^{n} | M E} (u^{n}, x^{n} | m, 0) \end{matrix}$

as

$\begin{matrix} P_{U^{n} X^{n} | M E} (u^{n}, x^{n} | m, 0) = 0 \end{matrix}$

for $x^{n} \notin ⋃_{\hat{s} \in \hat{S}} T_{P_{X_{\hat{s}}}, ξ^{'}}^{n}$ . We realize that, for $u^{n} \in C_{m}$ and $x^{n} \in ⋃_{\hat{s} \in \hat{S}} T_{P_{X_{\hat{s}}}, ξ^{'}}^{n},$

$\begin{matrix} P_{U^{n} X^{n} | M E} (u^{n}, x^{n} | m, 0) & = \frac{P_{U^{n} X^{n} M E} (u^{n}, x^{n}, m, 0)}{P_{M E} (m, 0)} \\ = \frac{P_{U^{n} X^{n}} (u^{n}, x^{n})}{P_{M E} (m, 0)} P_{M E | U^{n} X^{n}} (m, 0 | u^{n}, x^{n}) = \frac{P_{U^{n} X^{n}} (u^{n}, x^{n})}{P_{M E} (m, 0)}, \end{matrix}$

where the last step follows as

$\begin{matrix} P_{M E | U^{n} X^{n}} (m, 0 | u^{n}, x^{n}) = 1 . \end{matrix}$

Thus, we get

$\begin{matrix} P_{U^{n} | M E} (u^{n} | m, 0) & = \sum_{\hat{s} \in \hat{S}} \sum_{x^{n} \in T_{P_{X_{\hat{s}}}, ξ^{'}}^{n}} \frac{P_{U^{n} X^{n}} (u^{n}, x^{n})}{P_{M E} (m, 0)} \\ = \sum_{\hat{s} \in \hat{S}} \sum_{x^{n} \in T_{P_{X_{\hat{s}}}, ξ^{'}}^{n}} \frac{P_{X_{s}}^{n} (x^{n}) V_{\hat{s}}^{n} (u^{n} | x^{n})}{P_{M E} (m, 0)} \\ = \sum_{\hat{s} \in \hat{S}} \sum_{\begin{matrix} p \in P (n, X) : \\ | p (x) - p_{X_{\hat{s}}} (x) | \leq ξ^{'} \\ \forall x \in X \end{matrix}} \sum_{x^{n} \in T_{p}^{n}} \frac{\prod_{i = 1}^{n} P_{X_{s}} (x_{i}) V_{\hat{s}} (u_{i} | x_{i})}{P_{M E} (m, 0)} . \end{matrix}$

The last term is constant for all $u^{n}$ of the same type. Thus,

$\begin{matrix} P_{U^{n} | M E} (u^{n} | m, 0) = p_{C_{m}} \end{matrix}$

is constant for $u^{n} \in C_{m}$ . As

$\begin{matrix} P_{U^{n} | M E} (u^{n} | m, 0) = 0 \end{matrix}$

for $u^{n} \notin C_{m}$ , we have

$\begin{matrix} P_{U^{n} | M E} (u^{n} | m, 0) = \frac{1}{| C_{m} |} \end{matrix}$

for $u^{n} \in C_{m}$ . Now, we get

$\begin{matrix} P_{X^{n} Y^{n} U^{n} | M E} (E_{3} \cup E_{1}^{c} \cup E_{2}^{c} | m, 0) & \leq \sum_{\begin{matrix} (x^{n}, y^{n}, u^{n}) : \\ \land u^{n} \in C_{m} \land x^{n} \in T_{P_{X_{s}}, ξ^{'}}^{n} \\ \land h_{m} (f_{m}^{- 1} (u^{n})) = \tilde{k} \end{matrix}} P_{X^{n} Y^{n} U^{n} | M E} (x^{n}, y^{n}, u^{n} | m, 0) \\ \leq \sum_{\begin{matrix} u^{n} \in C_{m} \\ \land h_{m} (f_{m}^{- 1} (u^{n})) = \tilde{k} \end{matrix}} P_{U^{n} | M E} (u^{n} | m, 0) = | h_{m}^{- 1} (\tilde{k}) | p_{C_{m}} . \end{matrix}$

We have

$\begin{matrix} | h_{m}^{- 1} (\tilde{k}) | = | C_{m} | - min_{\hat{s} \in \hat{S}} ⌈ inf_{s \in I (\hat{s})} exp (n (I (U_{\hat{s}}; Y_{s}) - δ)) ⌉ ⌊\frac{| C_{m} |}{{min}_{\hat{s} \in \hat{S}} ⌈ {inf}_{s \in I (\hat{s})} exp (n (I (U_{\hat{s}}; Y_{s}) - δ)) ⌉}⌋ \end{matrix}$

and get

$\begin{matrix} P_{X^{n} Y^{n} U^{n} | M E} (E_{3} \cup E_{1}^{c} \cup E_{2}^{c} | m, 0) \\ \leq \frac{1}{| C_{m} |} (| C_{m} | - min_{\hat{s} \in \hat{S}} ⌈ inf_{s \in I (\hat{s})} exp (n (I (U_{\hat{s}}; Y_{s}) - δ)) ⌉ (\frac{| C_{m} |}{{min}_{\hat{s} \in \hat{S}} ⌈ {inf}_{s \in I (\hat{s})} exp (n (I (U_{\hat{s}}; Y_{s}) - δ)) ⌉} - 1)) \\ = \frac{{min}_{\hat{s} \in \hat{S}} ⌈ {inf}_{s \in I (\hat{s})} exp (n (I (U_{\hat{s}}; Y_{s}) - δ)) ⌉}{| C_{m} |} \leq \frac{2}{exp (n \tilde{ϵ})} \end{matrix}$

or

$\begin{matrix} P_{X^{n} Y^{n} U^{n} | M E} (E_{3} \cup E_{1}^{c} \cup E_{2}^{c} | m, 0) = 0 \end{matrix}$

respectively, if, for the source state s, it holds that $s \in I (\hat{s})$ for the $\hat{s}$ corresponding to the smallest $C_{m, \hat{s}}$ . Here,

$\begin{matrix} \tilde{ϵ} = inf_{s \in I (\hat{s})} I (U_{\hat{s}}; Y_{s}) - min_{{\hat{s}}^{'} \in \hat{S}} inf_{s \in I ({\hat{s}}^{'})} I (U_{{\hat{s}}^{'}}; Y_{s}) . \end{matrix}$

Thus, for n large enough,

$\begin{matrix} P_{e} \leq P_{X_{s} U_{\hat{s}}}^{n} ({(T_{P_{X_{s} U_{\hat{s}}}, ξ^{″}}^{n})}^{c}) + η + ϵ + \frac{2}{exp (n \tilde{ϵ})} \end{matrix}$

and Inequality (5) is fulfilled for small enough constants and n large enough.
We define $\tilde{k} : U^{n} \times M \to {0, 1}$

$\begin{matrix} \tilde{k} (u^{n}, m) = \{\begin{matrix} 1, & for u^{n} \in f_{m} (h_{m}^{- 1} (\tilde{k})), \\ 0, & otherwise, \end{matrix} \end{matrix}$

and the RV $\tilde{K} = \tilde{k} (U^{n}, M)$ . We have

$\begin{matrix} P_{K | M E \tilde{K}} (k | m, 0, 0) = P_{U^{n} | M E \tilde{K}} (f_{m} (h_{m}^{- 1} (k)) | m, 0, 0) . \end{matrix}$

Now, consider $u^{n} \in C_{m}$ . It holds that

$\begin{matrix} P_{U^{n} | M E \tilde{K}} (u^{n} | m, 0, 0) = \frac{P_{U^{n} | M E} (u^{n} | m, 0)}{P_{\tilde{K} | M E} (0 | m, 0)} P_{\tilde{K} | M E U^{n}} (0 | m, 0, u^{n}) . \end{matrix}$

We know

$\begin{matrix} P_{\tilde{K} | M E U^{n}} (0 | m, 0, u^{n}) = 1 \end{matrix}$

for $u^{n} \notin f_{m} (h_{m}^{- 1} (\tilde{k}))$ . Thus,

$\begin{matrix} P_{K | M E \tilde{K}} (k | m, 0, 0) = \frac{P_{U^{n} | M E} (h_{m}^{- 1} (k) | m, 0)}{P_{\tilde{K} | M E} (0 | m, 0)} = \frac{p_{C_{m}} | h_{m}^{- 1} (k) |}{P_{\tilde{K} | M E} (0 | m, 0)} \end{matrix}$

for all $k \in K$ . This means

$\begin{matrix} P_{K | M E \tilde{K}} (k | m, 0, 0) = \frac{1}{| K |}, \end{matrix}$

as $| h_{m}^{- 1} (k) |$ is constant for all $k \in K$ . We also know

$\begin{matrix} H (K | M = m, E = e, \tilde{K} = \tilde{k}) = log | K | \end{matrix}$

for $P_{M E \tilde{K}} (m, e, \tilde{k}) > 0$ , $(e, \tilde{k}) \neq (0, 0)$ as k is chosen according to a uniform distribution on $K$ in this case. Thus,

$\begin{matrix} log | K | & \geq H (K | M) \geq H (K | M E \tilde{K}) \\ = \sum_{\begin{matrix} (m, e, \tilde{k}) \\ \in M \times {0, 1} \times {0, 1} \end{matrix}} P_{M E \tilde{K}} (m, e, \tilde{k}) H (K | M = m, E = e, \tilde{K} = \tilde{k}) = log | K | . \end{matrix}$

This means Equations (6) and (7) are fulfilled.
For the secret-key rate, we have

$\begin{matrix} \frac{1}{n} log | K | \geq min_{\hat{s} \in \hat{S}} inf_{s \in I (\hat{s})} I (U_{\hat{s}}; Y_{s}) - δ . \end{matrix}$

(10)
Finally, we analyse the privacy-leakage rate. We have

$\begin{matrix} I (X^{n}; M) & = H (M) - H (M | X^{n}) - H (M | U^{n}) + H (M | U^{n}) \\ = I (U^{n}; M) - H (M | X^{n}), \end{matrix}$

where we use $H (M | U^{n}) = 0$ for the second equality (see ([3], Problem 3.1)). Now, we use

$\begin{matrix} P_{M E} (M_{\hat{s}}, 0) & \geq P_{X^{n} Y^{n} U^{n}} (E_{1}^{c} \cup E_{2}^{c}) = P_{X_{s} Y_{s} U_{\hat{s}}}^{n} (E_{1}^{c} \cup E_{2}^{c}) \\ \geq P_{X_{s} Y_{s} U_{\hat{s}}}^{n} (E_{1}^{c}) + P_{X_{s} Y_{s} U_{\hat{s}}}^{n} (E_{2}^{c}) - 1 \\ = P_{X_{s} U_{\hat{s}}}^{n} (T_{P_{X_{s} U_{\hat{s}}}, ξ^{″}}^{n}) + P_{U_{\hat{s}}}^{n} (C) - 1 \\ \geq 1 - η - P_{X_{s} U_{\hat{s}}}^{n} ({(T_{P_{X_{s} U_{\hat{s}}}, ξ^{″}}^{n})}^{c}) \geq 1 - ζ \end{matrix}$

for $ζ > 0$ and n large enough. We also use $P_{U^{n} | M E} (u^{n} | m, 0) = \frac{1}{| C_{m} |}$ for $u^{n} \in C_{m}$ and get

$\begin{matrix} H (U^{n} | M) & \geq H (U^{n} | M E) \\ \geq \sum_{m \in M_{\hat{s}}} P_{M E} (m, 0) H (U^{n} | M = m, E = 0) \\ \geq (1 - ζ) (min_{s \in I (\hat{s})} I (U_{\hat{s}}; Y_{s}) - δ) n . \end{matrix}$

Thus,

$\begin{matrix} I (X^{n}; M) \leq H (U^{n}) - H (M | X^{n}) - n min_{s \in I (\hat{s})} I (U_{\hat{s}}; Y_{s}) + n δ + ζ n min_{s \in I (\hat{s})} I (U_{\hat{s}}; Y_{s}) . \end{matrix}$

We now use

$\begin{matrix} I (X^{n}; U^{n}) & = H (X_{s}^{n}) - H (X^{n} | U^{n}) \\ \leq H (X_{s}^{n}) - H (X^{n} | U^{n} T) \\ \leq H (X_{s}^{n}) - H (X^{n} | U^{n} T = 1) (1 - ϵ^{'}) \\ = H (X_{s}^{n}) - H (X_{s}^{n} | U_{\hat{s}}^{n} T = 1) (1 - ϵ^{'}) \\ = H (X_{s}^{n}) - H (X_{s}^{n} | U_{\hat{s}}^{n} T = 1) (1 - ϵ^{'}) - H (X_{s}^{n} | U_{\hat{s}}^{n} T = 0) ϵ^{'} + H (X_{s}^{n} | U_{\hat{s}}^{n} T = 0) ϵ^{'} \\ \leq I (X_{s}^{n}; U_{\hat{s}}^{n} T) + ϵ^{'} log | X | n \\ = ϵ^{'} log | X | n + I (X_{s}^{n}; U_{\hat{s}}^{n}) + I (T; X_{s}^{n} | U_{\hat{s}}^{n}) \\ \leq ϵ^{'} log | X | n + I (X_{s}^{n}; U_{\hat{s}}^{n}) + log 2, \end{matrix}$

where $T = t (X^{n})$ , $t : X^{n} \to {0, 1}$

$\begin{matrix} t (x^{n}) = \{\begin{matrix} 1, & x^{n} \in T_{P_{X_{s}}, ξ^{'}}^{n}, \\ 0, & else . \end{matrix} \end{matrix}$

Thus, $ϵ^{'}$ is arbitrarily small for large n.
Thus, we get

$\begin{matrix} I (X^{n}; M) \leq & H (U^{n}) - H (M | X^{n}) - n inf_{s \in I (\hat{s})} I (U_{\hat{s}}; Y_{s}) \\ + n δ + ζ n inf_{s \in I (\hat{s})} I (U_{\hat{s}}; Y_{s}) + ϵ^{'} log | X | n + I (X_{s}^{n}; U_{\hat{s}}^{n}) + log 2 - I (X^{n}; U^{n}) . \end{matrix}$

(11)

Again, using ([3], Problem 3.1), we get

$\begin{matrix} H (U^{n}) - H (M | X^{n}) - I (X^{n}; U^{n}) & = H (U^{n} | X^{n}) - H (M | X^{n}) \\ = H (U^{n} M | X^{n}) - H (M | X^{n}) \\ = H (U^{n} | M X^{n}) . \end{matrix}$

We also know that

$\begin{matrix} 0 & \leq I (U^{n}; Y^{n} | X^{n} M) \\ = H (Y^{n} | X^{n} M) - H (Y^{n} | X^{n} U^{n} M) \\ = H (Y^{n} | X^{n} M) - H (Y^{n} | X^{n} U^{n}) \\ \leq H (Y^{n} | X^{n}) - H (Y^{n} | X^{n} U^{n}) \\ = I (Y^{n}; U^{n} | X^{n}) = 0 . \end{matrix}$

Here, we use ([3], Problem 3.1) and $M - X^{n} - Y^{n}$ . Thus,

$\begin{matrix} I (U^{n}; X^{n} Y^{n} M) = I (U^{n}; X^{n} M) = I (U^{n}; Y^{n} M) + I (U^{n}; X^{n} | Y^{n} M) . \end{matrix}$

Thus,

$\begin{matrix} I (U^{n}; X^{n} M) \geq I (U^{n}; Y^{n} M) . \end{matrix}$

It follows that

$\begin{matrix} H (U^{n} | M X^{n}) \leq H (U^{n} | M Y^{n}) . \end{matrix}$

(12)

Now, we bound the right hand side of Inequality (11) using Inequality (12) and use Fano’s inequality. Thus, we have

$\begin{matrix} \frac{1}{n} I (X^{n}; M) \leq sup_{s \in I (\hat{s})} I (X_{s}; U_{\hat{s}}) - I (U_{\hat{s}}; Y_{s}) \\ (13) & + δ + ζ I (U_{\hat{s}}; Y_{s}) + ϵ^{'} log | X | + \frac{1}{n} log 2 + P_{e} log (| U | - 1) + \frac{h (P_{e})}{n} . \end{matrix}$

Here, we use

$\begin{matrix} I (X_{s}; U_{\hat{s}}) - inf_{s \in I (\hat{s})} I (U_{\hat{s}}; Y_{s}) = sup_{s \in I (\hat{s})} I (X_{s}; U_{\hat{s}}) - I (U_{\hat{s}}; Y_{s}) \end{matrix}$

as $I (X_{s}; U_{\hat{s}})$ is constant for all $s \in I (\hat{s})$ .

Using these results, we conclude from Inequalities (10) and (13) that

\begin{matrix} R^{(P S C A)} (S) \supseteq ⋃_{U_{{\hat{s}}_{1}}, \dots, U_{{\hat{s}}_{| \hat{S} |}}} ⋂_{\hat{s} \in \hat{S}} R_{\hat{s}}^{(P S C A)} (S, U_{\hat{s}}) . \end{matrix}

Using the distributive law for sets, we can see that this is equivalent to

\begin{matrix} R^{(P S C A)} (S) \supseteq ⋂_{\hat{s} \in \hat{S}} ⋃_{U_{\hat{s}}} R_{\hat{s}}^{(P S C A)} (S, U_{\hat{s}}) \end{matrix}

(see Appendix B). We now consider the converse. Assume

X^{n} Y^{n}

are distributed i.i.d. according to

P_{X_{s} Y_{s}}

for an arbitrary

s \in S

. The following calculations hold for all

s \in S

. Similarly to the converse part of the proof of ([7], Theorem 3.1), we have

\begin{matrix} log | K | & \overset{(a)}{=} H (K) = I (K; \hat{K}) + H (K | \hat{K}) \\ \overset{(b)}{\leq} I (K; M Y^{n}) + F = I (K; M) + I (K; Y^{n} | M) + F \\ \overset{(c)}{\leq} I (Y^{n}; M K) + F = \sum_{i = 1}^{n} I (K M Y^{i - 1}; Y_{i}) + F, \end{matrix}

where we use Equation (6) for (a), Fano’s inequality with

F = δ n log | K | + 1

and the data processing inequality in combination with

K - M Y^{n} - \hat{K}

, which follows from the definition of the compound authentication protocol for (b) and Equation (7) for (c). From the definition of the compound authentication protocol, we also know that

Y^{n} - X^{n} - M K

. Using the definition of Markov chains, this implies

Y^{i - 1} - X^{i - 1} - M K Y_{i}

for all

i \in {2 \dots n}

(see Appendix C). (From

Y^{n} - X^{i - 1} X_{i}^{n} - M K,

we get

Y^{i - 1} Y_{i} - X^{i - 1} - M K X_{i}^{n}

using Implications (A11) and (A13). Then, we use Implication (A12) to get

Y^{i - 1} - X^{i - 1} Y_{i} - M K

and from this we get the desired result using Implication (A13).)

The equation

\begin{matrix} I (Y_{i} K M; X^{i - 1} Y^{i - 1}) = I (Y_{i} K M; X^{i - 1}) \end{matrix}

is equivalent to

Y^{i - 1} - X^{i - 1} - M K Y_{i}

([3], Definition 3.9). This is equivalent to

\begin{matrix} H (Y_{i} | K M X^{i - 1} Y^{i - 1}) = H (Y_{i} | K M X^{i - 1}) + (H (K M | X^{i - 1}) - H (K M | X^{i - 1} Y^{i - 1})) . \end{matrix}

Thus,

H (Y_{i} | K M Y^{i - 1}) \geq H (Y_{i} | K M X^{i - 1})

. Thus, we have

\begin{matrix} I (K M Y^{i - 1}; Y_{i}) \leq I (K M X^{i - 1}; Y_{i}), \end{matrix}

(14)

so

\begin{matrix} log | K | \leq \sum_{i = 1}^{n} I (K M X^{i - 1}; Y_{i}) + F . \end{matrix}

Now, we define

U_{i} = K M X^{i - 1}

for all

i \in {1 \dots n}

. This implies

U_{i} - X_{i} - Y_{i}

for all

i \in {1 \dots n}

, which can again be seen using the results from Appendix C. Let Q be a time sharing RV independent of all others and uniformly distributed on

Q = {1 \dots n}

and let

U = Q U_{Q}

,

X = X_{Q}

and

Y = Y_{Q}

. Then,

\begin{matrix} P_{U X Y} ((u, q), x, y) = P_{Q U_{q} X_{q} Y_{q}} (q, u, x, y) \overset{(a)}{=} P_{Q U_{q} | X_{q}} (u, q | x) P_{X_{q} Y_{q}} (x, y) \end{matrix}

for all

(u, q, x, y) \in U_{q} \times Q \times X \times Y

, where (

a)

follows from

U_{q} - X_{q} - Y_{q}

and the independence of Q. We have

\begin{matrix} P_{X Y} (x, y) = \sum_{q, u} P_{Q U_{q} X_{q} Y_{q}} (q, u, x, y) = \sum_{i = 1}^{n} \frac{1}{n} P_{X_{i} Y_{i}} (x, y) \overset{(a)}{=} P_{X_{s} Y_{s}} (x, y) = P_{X_{q} Y_{q}} (x, y) \end{matrix}

(15)

for an arbitrary

q \in Q

and

(x, y) \in X \times Y

, where

(a)

follows as

P_{X_{i} Y_{i}} = P_{X_{s} Y_{s}}

for all

i \in Q

as the RVs

X^{n} Y^{n}

are generated i.i.d. We also have for all

(u, q, x) \in U_{q} \times Q \times X

\begin{matrix} P_{U | X} (u, q | x) = \frac{\sum_{y \in Y} P_{Q U_{q} X_{q} Y_{q}} (q, u, x, y)}{P_{X} (x)} = \frac{P_{Q U_{q} X_{q}} (q, u, x)}{P_{X_{q}} (x)} = P_{Q U_{q} | X_{q}} (q, u | x) . \end{matrix}

Thus,

P_{U X Y} ((u, q), x, y) = P_{X Y} (x, y) P_{U | X} (u, q | x),

which means

U - X - Y

. We also have

\begin{matrix} log | K | & \leq \sum_{i = 1}^{n} I (U_{i}, Y_{i}) + F = n \sum_{i = 1}^{n} \frac{1}{n} I (U_{Q}, Y | Q = i) + F \\ = n I (U_{Q}; Y | Q) + F = n H (Y | Q) - H (Y | U_{Q} Q) + F \\ \leq n (H (Y) - H (Y | U_{Q} Q)) + F = n I (U_{Q} Q; Y) + F = n I (U; Y) + F . \end{matrix}

Thus, using the definition of F, we get

\begin{matrix} \frac{1}{n} log | K | \leq {(1 - δ)}^{- 1} (I (U; Y) + \frac{1}{n}), \end{matrix}

which implies

\begin{matrix} \frac{1}{n} log | K | \leq I (U; Y) + δ \end{matrix}

(16)

for

δ > 0

and n large enough. We also consider

\begin{matrix} I (X^{n}; M) & = H (M) - H (M | X^{n}) \\ \geq H (M | Y^{n}) - H (K M | X^{n}) \\ = H (K M | Y^{n}) - H (K | Y^{n} M) - H (K M | X^{n}) . \end{matrix}

From the definition of the compound storage model, we know

K - M Y^{n} - \hat{K}

. Using the data processing inequality, we get

I (K; M Y^{n}) \geq I (K; \hat{K}),

which means

H (K | M Y^{n}) \leq H (K | \hat{K}) \leq F

, where the last inequality follows from Fano’s inequality. Thus,

\begin{matrix} I (X^{n}; M) & \geq H (K M | Y^{n}) - H (K M | X^{n}) - F \\ = I (K M; X^{n}) - I (K M; Y^{n}) - F \\ = \sum_{i = 1}^{n} I (K M; X_{i} | X^{i - 1}) - \sum_{i = 1}^{n} I (K M; Y_{i} | Y^{i - 1}) - F \\ \overset{(a)}{=} \sum_{i = 1}^{n} I (K M X^{i - 1}; X_{i}) - \sum_{i = 1}^{n_{k}} I (K M Y^{i - 1}; Y_{i}) - F \\ \overset{(b)}{\geq} \sum_{i = 1}^{n} I (K M X^{i - 1}; X_{i}) - \sum_{i = 1}^{n} I (K M X^{i - 1}; Y_{i}) - F, \end{matrix}

where (a) follows as

X_{i}

and

Y_{i}

are i.i.d. and (b) follows from Inequality (14). With our definition of U, X and Y and the same argumentation as before, we get

\begin{matrix} \frac{1}{n} I (X^{n}; M) & \geq I (U; X) - I (U; Y) - \frac{F}{n} \\ \overset{(a)}{\geq} I (U; X) - I (U; Y) - δ \end{matrix}

(17)

for n large enough, where, for (

(a)

, we use the definition of F and Inequality (16). We have for all

(u, q, x) \in U_{q} \times Q \times X

\begin{matrix} P_{U X} ((q, u), x) & = P_{Q} (q) P_{U_{q} X_{q}} (u, x) \\ = P_{K M X^{q - 1} X_{q}} (k, m, x^{q - 1}, x_{q}) P_{Q} (q) \\ = P_{Q} (q) \sum_{x_{q + 1}^{n}} P_{K M X^{n}} (k, m, x^{n}) \\ \overset{(a)}{=} P_{Q} (q) \sum_{x_{q + 1}^{n}} P_{X^{n}} (x^{n}) P_{M | X^{n}} (m | x^{n}) P_{K | X^{n}} (k | x^{n}) \\ = P_{Q} (q) \sum_{x_{q + 1}^{n}} P_{X^{n}} (x^{n}) Θ (x^{n}) Φ (x^{n}), \end{matrix}

(18)

where (a) follows from

M - X^{n} - K

, which follows from the definition of the compound authentication protocol. As

P_{X^{n}}

is the same for all

s \in I (\hat{s})

,

\hat{s} \in \hat{S}

, this result implies that

P_{U X}

is the same for all

s \in I (\hat{s})

,

\hat{s} \in \hat{S}

. We get the bounds (16) and (17) for each

s \in S

. We denote the corresponding RVs

U X Y

by

U_{s} X_{s} Y_{s}

for all

s \in S

. The joint distribution of

X_{s} Y_{s}

is

P_{X_{s} Y_{s}} \in S

as we see from Equation (15). Thus, Equation (18) and the Inequalities (16) and (17) for all

s \in S

imply

\begin{matrix} R_{P S C S} (S) \subseteq ⋃_{U_{{\hat{s}}_{1}}, \dots, U_{{\hat{s}}_{| \hat{S} |}}} ⋂_{\hat{s} \in \hat{S}} R_{\hat{s}}^{(P S C A)} (S, U_{\hat{s}}) . \end{matrix}

We again use the distributive law for sets to get our result. The bounds on the cardinality of the alphabet of the auxiliary random variables can be derived as in [19]. ☐

Remark 9.

This result implies Theorem 2 as we use a deterministic decoder for the achievability proof.

Remark 10.

In [19], the authors also derive the compound capacity region for

| S | < \infty

, but, in contrast to this work, they consider deterministic protocols and require strong secrecy instead of perfect secrecy when defining achievability. This compound capacity region equals

R_{P S C A} (S)

.

8. Secure Storage

We now discuss some other applications of the already proven results apart from authentication. For this purpose, we take a look at some results for secure storage from [13,14], which follow directly from our results for authentication. Here, we again consider compound sets

S

with

| \hat{S} | < \infty

.

In [13], we consider the following model for secure storage with source uncertainty, where the corresponding scenario is depicted in Figure 3.

Definition 10.

Let

n \in N

. The compound storage model consists of a set

S \subseteq P (X \times Y)

of DMMSs with generic variables

X_{s} Y_{s}

,

s \in S

, (all on the same alphabets

X

and

Y

), a source

P_{D_{n}} \in P (D_{n})

that puts out a RV

D_{n}

, the (possibly randomized) encoder

Φ_{n} : X^{n} \times D_{n} \to M

and the (possibly randomized) decoder

Ψ_{n} : Y^{n} \times M \to {\hat{D}}_{n}

. Let

X^{n}

and

Y^{n}

be the output of one of the DMMSs in

S

, i.e.,

P_{X Y} = P_{X_{s} Y_{s}}

for an

s \in S

, but s is not known.

D_{n}

is independent of

X^{n} Y^{n}

. The RV M is generated from

X^{n}

and

D_{n}

using

Φ_{n}

. The RV

{\hat{D}}_{n}

is generated from

Y^{n}

and M using

Ψ_{n}

. We use the term compound storage protocol for

(Φ_{n}, Ψ_{n})

. Additionally, it holds that, for all

δ > 0

, there is an

n_{0} = n_{0} (δ)

such that for all

n \geq n_{0}

\begin{matrix} \frac{1}{n} D (P_{D_{n}} ∥ U_{D_{n}}) < δ . \end{matrix}

We define achievability for this model.

Definition 11.

A tuple

(R, L)

,

R, L \geq 0

, is an achievable storage rate versus privacy-leakage rate pair for the compound storage model if for every

δ > 0

there is an

n_{0} = n_{0} (δ)

such that for all

n \geq n_{0}

there exists a compound storage protocol such that for all

s \in S

\begin{matrix} \Pr (D_{n} & = {\hat{D}}_{n}) \geq 1 - δ, \\ I (M; D_{n}) & = 0, \\ \frac{1}{n} log | D_{n} | & \geq R - δ, \\ \frac{1}{n} I (M; X^{n}) & \leq L + δ, \end{matrix}

where

P_{X Y} = P_{X_{s} Y_{s}}

. We denote the corresponding storage protocols by PSCS-Protocols (Perfect-Secrecy- Compound-Storage-Protocols).

Definition 12.

The set of achievable rate pairs that are achievable using PSCS-Protocols is called the compound capacity region

R_{P S C S} (S)

.

We then can prove the following result.

Theorem 6.

It holds that

\begin{matrix} R_{P S C S} (S) = R_{P S C A} (S) . \end{matrix}

Remark 11.

The compound storage model is essentially equivalent to a compound version of the chosen secret system in [7]. For this reason, Theorem 6 follows using the same approach as the authors of [7].

We combine source compression and secure storage in [14] by considering the following model, which models the scenario depicted in Figure 4.

Definition 13.

Let

k, n_{k} \in N

. The compound source storage model consists of a set

S \subseteq P (X \times Y)

of DMMSs with generic variables

X_{s} Y_{s}

,

s \in S

, (all on the same alphabets

X

and

Y

), a general source

V

[20] that fulfills the strong converse property, the (possibly randomized) encoder

Φ_{k} : X^{n_{k}} \times V^{k} \to M

and the (possibly randomized) decoder

Ψ_{k} : Y^{n_{k}} \times M \to {\hat{V}}^{k}

. Let

X^{n_{k}}

and

Y^{n_{k}}

be the output of one of the DMMSs in

S

, i.e.,

P_{X Y} = P_{X_{s} Y_{s}}

for an

s \in S

, but s is not known. The RV M is generated from

X^{n_{k}}

and

V^{k}

using

Φ_{k}

. The RV

{\hat{V}}^{k}

is generated from

Y^{n_{k}}

and M using

Ψ_{k}

. We use the term compound source storage protocol for

(Φ_{k}, Ψ_{k})

.

For this model, we define achievability where we consider the output of the PUF source as a resource.

Definition 14.

A tuple

(B, L)

,

B, L \geq 0

, is an achievable performance pair for the compound source storage model if, for every

δ > 0

, there is a

k_{0} = k_{0} (δ)

such that, for all

k \geq k_{0},

there exists a compound source storage protocol such that, for all

s \in S,

\begin{matrix} \Pr (V^{k} = {\hat{V}}^{k}) \geq 1 - δ, \\ I (M; V^{k}) = 0, \\ \frac{n_{k}}{k} \leq B + δ, \\ \frac{1}{n_{k}} I (M; X^{n_{k}}) \leq L + δ, \end{matrix}

where

P_{X Y} = P_{X_{s} Y_{s}}

. We denote the corresponding compound source storage protocols by PSCSS-Protocols (Perfect-Secrecy-Compound-Source-Storage-Protocols).

Definition 15.

The set of achievable performance pairs that are achievable using PSCSS-Protocols is called the optimal performance region

R_{P S C S S} (S, V)

.

We then can prove the following results.

Theorem 7.

It holds that

\begin{matrix} R_{P S C S S} (S, V) \supseteq ⋂_{\hat{s} \in \hat{S}} ⋃_{U_{\hat{s}}} {(B, L) : B \geq \frac{\bar{H} (V)}{{inf}_{s \in I (\hat{s})} I (U_{\hat{s}}; Y_{s})}, L \geq sup_{s \in I (\hat{s})} I (U_{\hat{s}}; X_{s}) - I (U_{\hat{s}}; Y_{s})} \\ \overset{(a)}{=} ⋂_{\hat{s} \in \hat{S}} ⋃_{U_{\hat{s}}} R_{\hat{s}}^{(P S C S S)} (S, V, U_{\hat{s}}), \end{matrix}

where for

(a)

we define

R_{\hat{s}}^{(P S C S S)} (S, V, U_{\hat{s}})

appropriately. For all

\hat{s} \in \hat{S}

, the union is over all RVs

U_{\hat{s}}

such that, for all

s \in I (\hat{s}),

we have

U_{\hat{s}} - X_{s} - Y_{s}

.

Theorem 8.

For stationary ergodic sources

V

, it holds that

\begin{matrix} R_{P S C S S} (S, V) = ⋂_{\hat{s} \in \hat{S}} ⋃_{U_{\hat{s}}} R_{\hat{s}}^{(P S C S S)} (S, V, U_{\hat{s}}) . \end{matrix}

For all

\hat{s} \in \hat{S}

, the union is over all RVs

U_{\hat{s}}

such that, for all

s \in I (\hat{s})

, we have

U_{\hat{s}} - X_{s} - Y_{s}

. For

| S | < \infty

, we only have to consider RVs

U_{\hat{s}}

with

| U_{\hat{s}} | \leq | X | + | I (\hat{s}) |

.

9. Conclusions

We derived the capacity region for the (compound) authentication model requiring perfect secrecy and uniform distribution of the key generated for authentication and compared the result to existing results where only strong secrecy and a weaker condition on the key distribution is required. The two capacity regions are the same. We could prove this result by allowing for randomized encoders, which are not necessarily used when deriving the capacity region corresponding to the weaker definition of achievability. We saw that we can use the results for authentication to prove corresponding results for secure storage.

As already mentioned, compound sources do not only model source uncertainty but also model attacks where an attacker can influence parameters of the source while the legitimate parties do not know which parameters the attacker chose. It is essential that in this scenario the parameter is constant for all symbols read from the source. An attack where the parameter can be varied while the source is used is fundamentally stronger. A characterization of achievable rates for this attack scenario is not known, except for the source model for secret key generation, which has been derived in [21]. For an overview of these types of attacks, see [22]. Recently, the corresponding problem for wiretap channels could be solved [23,24]. For the source model, the attacker can choose his strategy depending on the public data, which is a difficulty that does not appear for wiretap channels. Nevertheless the authors hope that, using techniques from the works concerning the wiretap channel, the open problem for the source model can be solved.

Acknowledgments

Funding is acknowledged from the German Research Foundation (DFG) via grant BO 1734/20-1 and from the Federal Ministry of Education and Research (BMBF) via grant 16KIS0118K. Holger Boche would like to thank Rainer Plaga, Federal Office for Information Security (BSI), for the discussion on PUFs and issues concerning different secrecy measures.

Author Contributions

Sebastian Baur and Holger Boche conceived this study and derived the results. Sebastian Baur wrote the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Theorem 4

Proof.

We prove the result for compound codes with the additional constraint on the decoding sets that, for

ζ > 0,

it holds that

\begin{matrix} ϕ^{- 1} (m) \subset ⋃_{W \in W} T_{W, ζ}^{n} (f (m)) \end{matrix}

(A1)

for all messages

m \in M_{f}

. Additionally, for

ζ^{'} > 0,

we require

\begin{matrix} f (m) \in \tilde{A} = A \cap T_{P, ζ^{'}}^{n} \end{matrix}

(A2)

for all

m \in M_{f}

. First, consider the case that

W

is a finite set. Let

(f, ϕ)

be such a code that can not be extended. Thus, for all

x^{n} \in \tilde{A}

, there is a

W \in W

such that

\begin{matrix} W^{n} (⋃_{\tilde{W} \in W} T_{\tilde{W}, ζ}^{n} (x^{n}) \ B | x^{n}) < 1 - ϵ, \end{matrix}

(A3)

where

B = ⋃_{m \in M_{f}} ϕ^{- 1} (m)

. It also holds that

\begin{matrix} P^{n} (\tilde{A}) \geq P^{n} (A) + P^{n} (T_{P, ζ^{'}}^{n}) - 1 \geq η / 2 \end{matrix}

for n large enough. We now consider the set

\begin{matrix} {\tilde{A}}_{W} = {x^{n} \in \tilde{A} : W^{n} (⋃_{\tilde{W} \in W} T_{\tilde{W}, ζ}^{n} (x^{n}) \ B | x^{n}) < 1 - ϵ} . \end{matrix}

We know

⋃_{W \in W} {\tilde{A}}_{W} = \tilde{A}

, as for all

x^{n} \in \tilde{A}

there is at least one

W \in W

with Inequality (A3). Thus,

\begin{matrix} η / 2 \leq P^{n} (⋃_{W \in W} {\tilde{A}}_{W}) & \leq \sum_{W \in W} P^{n} ({\tilde{A}}_{W}) \leq | W | max_{W \in W} P^{n} ({\tilde{A}}_{W}) . \end{matrix}

Thus, there is a

\bar{W} \in W

such that for all

x^{n} \in {\tilde{A}}_{\bar{W}}

\begin{matrix} {\bar{W}}^{n} (⋃_{\tilde{W} \in W} T_{\tilde{W}, ζ}^{n} (x^{n}) \ B | x^{n}) < 1 - ϵ \end{matrix}

and

\begin{matrix} P^{n} ({\tilde{A}}_{\bar{W}}) \geq \frac{η}{2 | W |} . \end{matrix}

Thus,

\begin{matrix} {\bar{W}}^{n} (B^{c} | x^{n}) + {\bar{W}}^{n} (⋃_{\tilde{W} \in W} T_{\tilde{W}, ζ}^{n} (x^{n}) | x^{n}) - {\bar{W}}^{n} (B^{c} \cup ⋃_{\tilde{W} \in W} T_{\tilde{W}, ζ}^{n} (x^{n}) | x^{n}) \\ = {\bar{W}}^{n} (⋃_{\tilde{W} \in W} T_{\tilde{W}, ζ}^{n} (x^{n}) \ B | x^{n}) < 1 - ϵ, \end{matrix}

which means

\begin{matrix} {\bar{W}}^{n} (B | x^{n}) > ϵ - δ \end{matrix}

for all

x^{n} \in {\tilde{A}}_{\bar{W}}

as

{\bar{W}}^{n} (B^{c} | x^{n}) = 1 - {\bar{W}}^{n} (B | x^{n})

,

{\bar{W}}^{n} (⋃_{\tilde{W} \in W} T_{\tilde{W}, ζ}^{n} (x^{n}) | x^{n}) \geq 1 - δ

for

δ > 0

and n large enough and

{\bar{W}}^{n} (B^{c} \cup ⋃_{\tilde{W} \in W} T_{\tilde{W}, ζ}^{n} (x^{n}) | x^{n}) \leq 1

. Thus, we have

\begin{matrix} {\bar{W}}^{n} (B \cap ⋃_{{\bar{x}}^{n} \in T_{P, ζ^{'}}^{n}} T_{\bar{W}, ζ}^{n} ({\bar{x}}^{n}) | x^{n}) & \geq {\bar{W}}^{n} (B | x^{n}) + {\bar{W}}^{n} (⋃_{{\bar{x}}^{n} \in T_{P, ζ^{'}}^{n}} T_{\bar{W}, ζ}^{n} ({\bar{x}}^{n}) | x^{n}) - 1 \\ \geq ϵ - δ + (1 - ξ) - 1 = ϵ - ξ - δ \end{matrix}

for all

x^{n} \in {\tilde{A}}_{\bar{W}}

,

ξ > 0

and n large enough. (We choose

ϵ

,

δ

and

ξ

such that

ϵ - ξ - δ > 0

.) Thus,

B^{'} = B \cap ⋃_{{\bar{x}}^{n} \in T_{P, ζ^{'}}^{n}} T_{\bar{W}, ζ}^{n} ({\bar{x}}^{n})

is an

ϵ - ξ - δ

image of

{\tilde{A}}_{\bar{W}}

(see [3]). Thus,

\begin{matrix} | B \cap ⋃_{{\bar{x}}^{n} \in T_{P, ζ^{'}}^{n}} T_{\bar{W}, ζ}^{n} ({\bar{x}}^{n}) | \geq g_{{\bar{W}}^{n}} ({\tilde{A}}_{\bar{W}}, ϵ - ξ - δ), \end{matrix}

where

g_{{\bar{W}}^{n}} ({\tilde{A}}_{\bar{W}}, ϵ - ξ - δ)

is defined as in [3]. We have

\begin{matrix} {(P \bar{W})}^{n} (B^{'}) & = \sum_{y^{n} \in B^{'}} \prod_{i = 1}^{n} \sum_{a \in X} P (a) \bar{W} (y_{i} | a) \\ \overset{(a)}{=} \sum_{y^{n} \in B^{'}} \sum_{x^{n} \in X^{n}} \prod_{i = 1}^{n} P (x_{i}) \bar{W} (y_{i} | x_{i}) \\ \geq \sum_{y^{n} \in B^{'}} \sum_{x^{n} \in {\tilde{A}}_{\bar{W}}} P^{n} (x^{n}) {\bar{W}}^{n} (y^{n} | x^{n}) \\ = \sum_{x^{n} \in {\tilde{A}}_{\bar{W}}} P^{n} (x^{n}) \sum_{y^{n} \in B^{'}} {\bar{W}}^{n} (y^{n} | x^{n}) \\ \geq (ϵ - δ - ξ) P^{n} ({\tilde{A}}_{\bar{W}}) \geq η / 2 (ϵ - δ - ξ) \frac{1}{| W |}, \end{matrix}

where (a) can be shown with induction. Using ([3], Lemma 2.14), we get for n large enough

\begin{matrix} \frac{1}{n} log | B^{'} | \geq H (P \bar{W}) - (γ + \frac{1}{n} log | W |) \end{matrix}

(A4)

with

γ > 0

. Additionally, we have

\begin{matrix} | B^{'} | & \overset{(a)}{=} | ⋃_{m \in M_{f}} ϕ^{- 1} (m) \cap ⋃_{x^{n} \in T_{P, ζ^{'}}^{n}} T_{\bar{W}, ζ}^{n} (x^{n}) | \\ = | ⋃_{m \in M_{f}} [ϕ^{- 1} (m) \cap ⋃_{x^{n} \in T_{P, ζ^{'}}^{n}} T_{\bar{W}, ζ}^{n} (x^{n})] | \\ \leq \sum_{m \in M_{f}} | ϕ^{- 1} (m) \cap ⋃_{x^{n} \in T_{P, ζ^{'}}^{n}} T_{\bar{W}, ζ}^{n} (x^{n}) | \\ \overset{(b)}{\leq} \sum_{m \in M_{f}} | ⋃_{W \in W} T_{W, ζ}^{n} (f (m)) \cap ⋃_{x^{n} \in T_{P, ζ^{'}}^{n}} T_{\bar{W}, ζ}^{n} (x^{n}) |, \end{matrix}

where (a) follows from the definition of B and (b) follows from Subset Relationship (A1). We now define

\begin{matrix} W_{m}^{*} = {W \in W : T_{W, ζ}^{n} (f (m)) \cap ⋃_{x^{n} \in T_{P, ζ^{'}}^{n}} T_{\bar{W}, ζ}^{n} (x^{n}) \neq \emptyset} . \end{matrix}

As

\begin{matrix} T_{\bar{W}, ξ}^{n} (f (m)) \cap ⋃_{x^{n} \in T_{P, ζ^{'}}^{n}} T_{\bar{W}, ζ}^{n} (x^{n}) \neq \emptyset \end{matrix}

for all

m \in M_{f}

, which follows form Relation (A2), we have

\begin{matrix} | B^{'} | \leq \sum_{m \in M_{f}} max_{W \in W_{m}^{*}} | T_{W, ζ}^{n} (f (m)) | \cdot | W | . \end{matrix}

Let

\begin{matrix} W^{*} = arg max_{W \in ⋃_{m \in M_{f}} W_{m}^{*}} | T_{W, ζ}^{n} (f (m)) | . \end{matrix}

Thus, we get the upper bound

\begin{matrix} | B^{'} | \leq | M_{f} | exp (n (H (W^{*} | P) + γ^{'} + \frac{log | W |}{n})), \end{matrix}

(A5)

γ^{'} > 0

([3], Lemma 2.13).

For all

W \in W_{m}^{*}

and all

m \in M_{f}

there is a

y^{n} \in Y^{n}

such that

y^{n} \in T_{W, ζ}^{n} (f (m))

and

y^{n} \in T_{\bar{W}, ζ}^{n} (x^{n})

for a

x^{n} \in T_{P, ζ^{'}}^{n}

. Using Relation (A2), we have

y^{n} \in T_{P W, (ζ + ζ^{'}) | X |}^{n}

and

y^{n} \in T_{P \bar{W}, (ζ + ζ^{'}) | X |}^{n}

(see ([3], Lemma 2.10)). Let

ζ^{″} = (ζ + ζ^{'}) | X |

. Thus,

\begin{matrix} ∥ P W - P \bar{W} ∥_{1} & = \sum_{b \in Y} | P W (b) - P \bar{W} (b) | \\ = \sum_{b \in Y} | P W (b) - N (b | y^{n}) / n + N (b | y^{n}) / n - P \bar{W} (b) | \\ \leq \sum_{b \in Y} | P W (b) - N (b | y^{n}) / n | + | N (b | y^{n}) / n - P \bar{W} (b) | \leq 2 | Y | ζ^{″} . \end{matrix}

Using ([3], Lemma 2.7), we have

| H (P W) - H (P \bar{W}) | \leq 2 | Y | ζ^{″} log \frac{1}{2 ζ^{″}}

for all

W \in W_{m}^{*}

and all

m \in M_{f}

. Using Inequalities (A4), (A5) and the fact that

W^{*} \in W_{m}^{*}

for a

m \in M_{f}

, we get for

γ

,

γ^{'}

,

ζ

and

ζ^{'}

small enough and n large enough

\begin{matrix} \frac{1}{n} log | M_{f} | & \geq H (P \bar{W}) - H (W^{*} | P) - γ - γ^{'} - 2 \frac{log | W |}{n} \\ \geq H (P W^{*}) - 2 | Y | ζ^{″} log \frac{1}{2 ζ^{″}} - H (W^{*} | P) - γ - γ^{'} - 2 \frac{log | W |}{n} \\ \geq I (P; W^{*}) - τ \geq min_{W \in W} I (P; W) - τ . \end{matrix}

(A6)

Now, consider the case of an infinite set

W

. Let

M \in N

,

M \geq {2 | Y |}^{2}

. We construct the set

W^{*}

of channels

W^{*} : X \to Y

with the following properties. For all

W \in W

, there is a

W^{*} \in W^{*}

with

\begin{matrix} | W (y | x) - W^{*} (y | x) | \leq \frac{| Y |}{M} \end{matrix}

(A7)

for all

(x, y) \in X \times Y

,

\begin{matrix} W (y | x) \leq W^{*} (y | x) e^{{2 | Y |}^{2} / M} \end{matrix}

(A8)

for all

(x, y) \in X \times Y

and

\begin{matrix} | W^{*} {| \leq (1 + M)}^{| X | | Y |} . \end{matrix}

(A9)

Such a construction is possible as described in [18]. Using Inequalities (A9) and (A6), we know that there is a compound

(n, ϵ^{'})

-code,

ϵ > ϵ^{'} > 0

, for

W^{*}

with

\begin{matrix} \frac{1}{n} log | M_{f} | \geq min_{W \in W^{*}} I (P; W) - τ \end{matrix}

if M depends on n polynomially. We now show that this code is a compound

(n, ϵ)

-code for

W

with

\begin{matrix} \frac{1}{n} log | M_{f} | \geq inf_{W \in W} I (P; W) - τ . \end{matrix}

Let

W^{*} = arg {min}_{W \in W^{*}} I (P; W)

and let

W \in W

be the W corresponding to

W^{*}

. Then, we have

\begin{matrix} inf_{W \in W} I (P; W) \overset{(a)}{\leq} I (P; W) \overset{(b)}{\leq} I (P; W^{*}) + β \overset{(c)}{=} min_{W \in W^{*}} I (P; W) + β, \end{matrix}

β > 0

, where (a) follows from the definition of the infimum, (b) follows as Inequality (A7) implies

\begin{matrix} ∥ W (\cdot | a) - W^{*} {(\cdot | a) ∥}_{1} \leq \frac{{| Y |}^{2}}{M} \end{matrix}

for all

a \in X

. Thus, using ([3], Lemma 2.7), we have

\begin{matrix} | I (P; W) - I (P; W^{*}) | & = | H (W | P) - H (W^{*} | P) | \\ = | \sum_{a \in X} P (a) (H (W (\cdot | a)) - H (W^{*} (\cdot | a))) | \\ \leq \sum_{a \in X} P (a) | H (W (\cdot | a)) - H (W^{*} (\cdot | a)) | \leq \frac{{| Y |}^{2}}{M} log \frac{M}{| Y |} . \end{matrix}

For

M = n^{2}

, we get (b) for n large enough. Finally, (c) follows from the choice of

W^{*}

. Additionally, it holds that for each

W \in W

there is a

W^{*} \in W^{*}

with

\begin{matrix} W^{n} (y^{n} | x^{n}) \leq e^{{2 | Y |}^{2} n / M} {(W^{*})}^{n} (y^{n} | x^{n}), \end{matrix}

which follows from Inequality (A8). Thus, for all

m \in M_{f}

, we have

\begin{matrix} W^{n} ({(ϕ^{- 1} (m))}^{c} | f (m)) \leq {(W^{*})}^{n} ({(ϕ^{- 1} (m))}^{c} | f (m)) e^{{2 | Y |}^{2} n / M} \overset{a)}{\leq} e^{{2 | Y |}^{2} / n} ϵ^{'}, \end{matrix}

where (a) follows from our choice of M. Thus, for n large enough and

ϵ^{'}

small enough, we have

\begin{matrix} W^{n} ({(ϕ^{- 1} (m))}^{c} | f (m)) \leq ϵ . \end{matrix}

☐

Appendix B. Equivalence of Rate Regions

We have

\begin{matrix} ⋃_{U_{{\hat{s}}_{1}}, \dots, U_{{\hat{s}}_{| \hat{S} |}}} ⋂_{\hat{s} \in \hat{S}} R_{\hat{s}}^{(P S C A)} (S, U_{\hat{s}}) \overset{(a)}{=} ⋃_{U_{{\hat{s}}_{1}}} ⋃_{U_{{\hat{s}}_{2}}, \dots, U_{{\hat{s}}_{| \hat{S} |}}} (⋂_{\hat{s} \in \hat{S} \ {{\hat{s}}_{1}}} R_{\hat{s}} (S, U_{\hat{s}}) \cap R_{{\hat{s}}_{1}} (S, U_{{\hat{s}}_{1}})), \end{matrix}

where we drop the

(P S C A)

for a shorter notation in (a). We now use the distributive law for sets and get

\begin{matrix} ⋃_{U_{{\hat{s}}_{1}}} (⋃_{U_{{\hat{s}}_{2}}, \dots, U_{{\hat{s}}_{| \hat{S} |}}} ⋂_{\hat{s} \in \hat{S} \ {{\hat{s}}_{1}}} R_{\hat{s}} (S, U_{\hat{s}}) \cap R_{{\hat{s}}_{1}} (S, U_{{\hat{s}}_{1}})) . \end{matrix}

Now, we use the distributive law again and get

\begin{matrix} ⋃_{U_{{\hat{s}}_{2}}, \dots, U_{{\hat{s}}_{| \hat{S} |}}} ⋂_{\hat{s} \in \hat{S} \ {{\hat{s}}_{1}}} R_{\hat{s}} (S, U_{\hat{s}}) \cap ⋃_{U_{{\hat{s}}_{1}}} R_{{\hat{s}}_{1}} (S, U_{{\hat{s}}_{1}}) . \end{matrix}

Following these steps for all

\hat{s} \in \hat{S}

, we get

\begin{matrix} ⋂_{\hat{s} \in \hat{S}} ⋃_{U_{\hat{s}}} R_{\hat{s}}^{(P S C S)} (S, U_{\hat{s}}) . \end{matrix}

Appendix C. Modifying Markov Chains

Theorem A1.

Let A, B, C and D be jointly distributed RVs. It holds that

\begin{matrix} (A 10) & A - B - C \Leftrightarrow C - B - A, \\ (A 11) & A B - C - D \Rightarrow B - C - D, \\ (A 12) & A B - C - D \Rightarrow A - B C - D, \\ P_{A B C} (a, b, c) = P_{A B} (a, b) P_{C} (c) \forall (a, b, c) \in A \times B \times C, \\ (A 13) & \land A - B C - D \Rightarrow A - B - C D . \end{matrix}

Proof.

We give a proof for each of the statements.

We have

$\begin{matrix} P_{A B C} (a, b, c) & \overset{(a)}{=} P_{A | B} (a | b) P_{B C} (b, c) \\ = P_{A | B} (a | b) P_{C | B} (c | b) P_{B} (b) = P_{A B} (a, b) P_{C | B} (c | b) \end{matrix}$

for all $(a, b, c) \in A \times B \times C$ . Here, $(a)$ follows from $A - B - C$ . Thus, we see that Equivalence (A10) is true.
We have $P_{A B C D} (a, b, c, d) = P_{A B | C} (a, b | c) P_{C D} (c, d)$ for all $(a, b, c, d) \in A \times B \times C \times D$ from $A B - C - D$ . Summing both sides over all $b \in B$ , we get Implication (A11).
We have

$\begin{matrix} P_{A B C D} (a, b, c, d) & \overset{(a)}{=} P_{A B | C} (a, b | c) P_{C D} (c, d) \\ = P_{B | C} (b, c) P_{A | B C} (a | b, c) P_{C D} (c, d) \\ \overset{(b)}{=} P_{A | B C} (a | b, c) P_{B | C D} (b | c, d) P_{C D} (c, d) \\ = P_{A | B C} (a | b, c) P_{B C D} (b, c, d) \end{matrix}$

for all $(a, b, c, d) \in A \times B \times C \times D$ , where $(a)$ follows from $A B - C - D$ and $(b)$ from Implication (A11). This means Implication (A12) is true.
We have

$\begin{matrix} P_{A B C D} (a, b, c, d) & \overset{(a)}{=} P_{A | B C} (a | b, c) P_{B C D} (b, c, d) \\ = P_{A | B C} (a | b, c) P_{D | B C} (d | b, c) P_{B C} (b, c) \\ \overset{(b)}{=} P_{A B} (a, b) P_{C} (c) P_{D | B C} (d | b, c) \\ = P_{A | B} (a | b) P_{B} (b) P_{C} (c) P_{D | B C} (d | b, c) \\ \overset{(c)}{=} P_{A | B} (a | b) P_{B C} (b, c) P_{D | B C} (d | b, c) \\ = P_{A | B} (a | b) P_{B C D} (b, c, d) \end{matrix}$

for all $(a, b, c, d) \in A \times B \times C \times D$ , where $(a)$ follows from $A - B C - D$ and ( $b)$ and ( $c)$ follow as C is independent of $A B$ . Thus, we have Implication (A13).

☐

References

Schaefer, R.F.; Boche, H.; Khisti, A.; Poor, H.V. Information Theoretic Security and Privacy of Information Systems; Cambridge University Press: Cambridge, UK, 2017. [Google Scholar]
Shannon, C.E. Communication theory of secrecy systems. Bell Syst. Tech. J. 1949, 28, 656–715. [Google Scholar] [CrossRef]
Csiszár, I.; Körner, J. Information Theory: Coding Theorems for Discrete Memoryless Systems; Cambridge University Press: Cambridge, UK, 2011. [Google Scholar]
Wyner, A.D. The wire-tap channel. Bell Syst. Tech. J. 1975, 54, 1355–1387. [Google Scholar] [CrossRef]
Bloch, M.; Barros, J. Physical-Layer Security: From Information Theory to Security Engineering; Cambridge University Press: Cambridge, UK, 2011. [Google Scholar]
Ahlswede, R.; Csiszár, I. Common randomness in information theory and cryptography. Part I: Secret sharing. IEEE Trans. Inf. Theory 1993, 39, 1121–1132. [Google Scholar] [CrossRef]
Ignatenko, T.; Willems, F.M. Biometric security from an information theoretical perspective. Found. Trends Commun. Inf. Theory 2012, 7, 135–316. [Google Scholar] [CrossRef]
Grigorescu, A.; Boche, H.; Schaefer, R.F. Robust PUF based authentication. In Proceedings of the IEEE International Workshop on Information Forensics and Security (WIFS), Rome, Italy, 16–19 November 2015; pp. 1–6. [Google Scholar]
Lai, L.; Ho, S.-W.; Poor, H.V. Privacy-security tradeoffs in biometric security systems. In Proceedings of the 46th Annual Allerton Conference on Communication, Control, and Computing, Urbana-Champaign, IL, USA, 23–26 September 2008; pp. 268–273. [Google Scholar]
Boche, H.; Wyrembelski, R.F. Secret key generation using compound sources-optimal key-rates and communication costs. In Proceedings of the 2013 9th International ITG Conference on Systems, Communication and Coding (SCC), München, Germany, 21–24 January 2013. [Google Scholar]
Grigorescu, A.; Boche, H.; Schaefer, R.F. Robust Biometric Authentication from an Information Theoretic Perspective. Entropy 2017, 19, 480. [Google Scholar] [CrossRef]
Baur, S.; Boche, H. Robust authentication and data storage with perfect secrecy. In Proceedings of the 2017 IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS), Atlanta, GA, USA, 1–4 May 2017; pp. 553–558. [Google Scholar]
Baur, S.; Boche, H. Robust Secure Storage of Data Sources with Perfect Secrecy. In Proceedings of the IEEE Workshop on Information Forensics and Security, Rennes, France, 4–7 December 2017. [Google Scholar]
Baur, S.; Boche, H. Storage of general data sources on a public database with security and privacy constraints. In Proceedings of the 2017 IEEE Conference on Communications and Network Security (CNS), Las Vegas, NV, USA, 9–11 October 2017; pp. 555–559. [Google Scholar]
Willems, F.; Ignatenko, T. Authentication based on secret-key generation. In Proceedings of the 2012 IEEE International Symposium on Information Theory Proceedings (ISIT), Cambridge, MA, USA, 1–6 July 2012; pp. 1792–1796. [Google Scholar]
Gallager, R. Information Theory and Reliable Communication; Springer: Berlin, Germany, 1968. [Google Scholar]
Wolfowitz, J. Coding Theorems of Information Theory; Springer: Berlin, Germany, 1978. [Google Scholar]
Blackwell, D.; Breiman, L.; Thomasian, A.J. The capacity of a class of channels. Ann. Math. Stat. 1959, 30, 1229–1241. [Google Scholar] [CrossRef]
Tavangaran, N.; Baur, S.; Grigorescu, A.; Boche, H. Compound biometric authentication systems with strong secrecy. In Proceedings of the 2017 11th International ITG Conference on Systems, Communication and Coding (SCC), Hamburg, Germany, 6–9 February 2017. [Google Scholar]
Han, T.S. Information-Spectrum Methods in Information Theory; Springer Science & Business Media: New York, NY, USA, 2013; Volume 50. [Google Scholar]
Boche, H.; Cai, N. Common Random Secret Key Generation on Arbitrarily Varying Source. In Proceedings of the 23rd International Symposium on Mathematical Theory of Networks and Systems (MTNS2018), Hong Kong, China, 16–20 July 2018. in press. [Google Scholar]
Schaefer, R.F.; Boche, H.; Poor, H.V. Secure Communication Under Channel Uncertainty and Adversarial Attacks. Proc. IEEE 2015, 103, 1796–1813. [Google Scholar] [CrossRef]
Wiese, M.; Nötzel, J.; Boche, H. A Channel Under Simultaneous Jamming and Eavesdropping Attack—Correlated Random Coding Capacities Under Strong Secrecy Criteria. IEEE Trans. Inf. Theory 2016, 62, 3844–3862. [Google Scholar] [CrossRef]
Nötzel, J.; Wiese, M.; Boche, H. The Arbitrarily Varying Wiretap Channel—Secret Randomness, Stability, and Super-Activation. IEEE Trans. Inf. Theory 2016, 62, 3504–3531. [Google Scholar] [CrossRef]

Figure 1. Authentication process considered in [7].

Figure 2. Authentication process with source uncertainty (as considered in [12]).

Figure 3. Secure storage process with source uncertainty (as considered in [13]).

Figure 4. Secure storage of a source (as considered in [14]).

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Baur, S.; Boche, H. Robust Secure Authentication and Data Storage with Perfect Secrecy. Cryptography 2018, 2, 8. https://doi.org/10.3390/cryptography2020008

AMA Style

Baur S, Boche H. Robust Secure Authentication and Data Storage with Perfect Secrecy. Cryptography. 2018; 2(2):8. https://doi.org/10.3390/cryptography2020008

Chicago/Turabian Style

Baur, Sebastian, and Holger Boche. 2018. "Robust Secure Authentication and Data Storage with Perfect Secrecy" Cryptography 2, no. 2: 8. https://doi.org/10.3390/cryptography2020008

Article Menu

Robust Secure Authentication and Data Storage with Perfect Secrecy

Abstract

1. Introduction

2. Authentication Model

3. Various Definitions of Achievability

4. Capacity Regions for the Authentication Model

5. Compound Authentication Model

6. Achievability for the Compound Model

7. Capacity Regions for the Compound Authentication Model

8. Secure Storage

9. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

Appendix A. Proof of Theorem 4

Appendix B. Equivalence of Rate Regions

Appendix C. Modifying Markov Chains

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI