Article

Information Theoretic Security for Shannon Cipher System under Side-Channel Attacks

University of Electro-Communications, 1-5-1 Chofugaoka, Tokyo 182-8585, Japan
*
Author to whom correspondence should be addressed.
This paper is an extended version of our paper published in Oohama, Y.; Santoso, B. Information theoretical analysis of side-channel attacks to the Shannon cipher system. In Proceedings of the 2018 IEEE International Symposium on Information Theory (ISIT), Vail, CO, USA, 17–22 June 2018; pp. 581–585.
These authors contributed equally to this work.
Entropy 2019, 21(5), 469; https://doi.org/10.3390/e21050469
Submission received: 11 March 2019 / Revised: 24 April 2019 / Accepted: 29 April 2019 / Published: 5 May 2019
(This article belongs to the Special Issue Multiuser Information Theory II)

Abstract:
In this paper, we propose a new theoretical security model for Shannon cipher systems under side-channel attacks, where the adversary is not only allowed to collect ciphertexts by eavesdropping on the public communication channel but is also allowed to collect the physical information leaked by the devices on which the cipher system is implemented, such as running time, power consumption, electromagnetic radiation, etc. Our model is very robust as it does not depend on the kind of physical information leaked by the devices. We also prove that in the case of one-time pad encryption, we can strengthen the secrecy/security of the cipher system by using an appropriate affine encoder. More precisely, we prove that for any distribution of the secret keys and any measurement device used for collecting the physical information, we can derive an achievable rate region for reliability and security such that if we compress the ciphertext using an affine encoder with a rate within the achievable rate region, then: (1) anyone with a secret key will be able to decrypt and decode the ciphertext correctly, but (2) any adversary who obtains the ciphertext and also the side physical information will not be able to obtain any information about the hidden source as long as the leaked physical information is encoded with a rate within the rate region. We derive our result by adapting the framework of the one helper source coding problem posed and investigated by Ahlswede and Körner (1975) and Wyner (1975). For reliability and security, we obtain our result by combining the result of Csiszár (1982) on universal coding for a single source using linear codes and the exponential strong converse theorem of Oohama (2015) for the one helper source coding problem.

1. Introduction

In most theoretical security models for encryption schemes, the adversary only obtains information from the public communication channel. In such models, an adversary is often treated as an entity that tries to obtain information about the hidden source only from the ciphertexts that are sent through the public communication channel. However, in the real world, encryption schemes are implemented on physical electronic devices, and it is widely known that any process executed in an electronic circuit generates certain correlated physical phenomena as "side" effects, according to the type of process. For example, differences in inputs to a process in an electronic circuit can induce differences in the heat, power consumption, and electromagnetic radiation generated as byproducts by the devices. Therefore, an adversary who has a certain degree of physical access to the devices may obtain some information on very sensitive hidden data, such as the keys used for the encryption, just by measuring the generated physical phenomena using appropriate measurement devices. More precisely, an adversary may deduce the values of the bits of the key by measuring the differences in the timing of the encryption process or the differences in the power consumption, electromagnetic radiation, and other physical phenomena. The information channel through which the adversary obtains data in the form of physical phenomena is called the side-channel, and attacks using the side-channel are known as side-channel attacks.
In the literature, there have been many works showing that adversaries have succeeded in breaking the security of cryptographic systems by exploiting side-channel information such as running time, power consumption, and electromagnetic radiation in the real physical world [1,2,3,4,5].

1.1. Our Contributions

1.1.1. Security Model for Side-Channel Attacks

In this paper, we propose a security model where the adversary attempts to obtain information about the hidden source by collecting data from (1) the public communication channel in the form of ciphertexts, and (2) the side-channel in the form of some physical data related to the encryption keys. Our proposed security model is illustrated in Figure 1.
Based on the security model illustrated above, we formulate a security problem of strengthening the security of Shannon cipher system where the encryption is implemented on a physical encryption device and the adversary attempts to obtain some information on the hidden source by collecting ciphertexts and performing side-channel attacks.
We describe our security model in a more formal way as follows. The source X is encrypted using an encryption device with secret key K installed. The result of the encryption, i.e., ciphertext C, is sent through a public communication channel to a data center where C is decrypted back into the source X using the same key K. The adversary A is allowed to obtain C from the public communication channel and is also equipped with an encoding device φ A that encodes and processes the noisy large alphabet data Z, i.e., the measurement result of the physical information obtained from the side-channel, into the appropriate binary data M A . It should be noted that in our model, we do not put any limitation on the kind of physical information measured by the adversary. Hence, any theoretical result based on this model automatically applies to any kind of side-channel attack, including timing analysis, power analysis, and electromagnetic (EM) analysis. In addition, the measurement device may just be a simple analog-to-digital converter that converts the analog data representing physical information leaked from the device into “noisy” digital data Z. In our model, we represent the measurement process as a communication channel W.

1.1.2. Main Result

As the main theoretical result, we show that we can strengthen the secrecy/security of the Shannon cipher implemented on a physical device against an adversary who collects the ciphertexts and launches side-channel attacks by a simple method of compressing the ciphertext C from a Shannon cipher using an affine encoder φ into C ˜ before releasing it into the public communication channel.
We prove that in the case of one-time pad encryption, we can strengthen the secrecy/security of the cipher system by using an appropriate affine encoder. More precisely, we prove that for any distribution of the secret key K and any measurement device (used to convert the physical information from a side-channel into the noisy large alphabet data Z), we can derive an achievable rate region for ( R A , R ) such that if we compress the ciphertext C into C ˜ using the affine encoder φ , which has an encoding rate R inside the achievable region, then we can achieve reliability and security in the following sense:
  • anyone with the secret key K can construct an appropriate decoder that decrypts and decodes C ˜ with exponentially decaying error probability, but
  • the amount of information gained by any adversary A who obtains the compressed ciphertext C ˜ and encoded physical information M A is exponentially decaying to zero as long as the encoding device φ A encodes the side physical information into M A with a rate R A within the achievable rate region.
By utilizing the homomorphic property of the one-time pad and affine encoding, we are able to separate the theoretical analysis of reliability and security such that we can deal with each issue independently. For reliability, we mainly obtain our result by using the result of Csiszár [6] on universal coding for a single source using linear codes. For the security analysis, we derive our result by adapting the framework of the one helper source coding problem posed and investigated by Ahlswede and Körner [7] and Wyner [8]. Specifically, in order to derive the secrecy exponent, we utilize the exponential strong converse theorem of Oohama [9] for the one helper source coding problem. In [10], Watanabe and Oohama deal with a similar source coding problem, but their result is insufficient for deriving the lower bound of the secrecy exponent. We will explain the relation between our method and previous related works in more detail in Section 4.

1.2. Comparison to Existing Models of Side-Channel Attacks

The most important feature of our model is that we do not make any assumption about the type or characteristics of the physical information that is measured by the adversary. Several theoretical models analyzing the security of a cryptographic system against side-channel attacks have been proposed in the literature. However, most of the existing works are applicable only for specific characteristics of the leaked physical information. For example, Brier et al. [1] and Coron et al. [11] propose a statistical model for side-channel attacks using the information from power consumption and the running time, whereas Agrawal et al. [5] propose a statistical model for side-channel attacks using electromagnetic (EM) radiations. A more general model for side-channel attacks is proposed by Köpf et al. [12] and Backes et al. [13], but they are heavily dependent upon implementation on certain specific devices. Micali et al. [14] propose a very general security model to capture the side-channel attacks, but they fail to offer any hint of how to build a concrete countermeasure against the side-channel attacks. The closest existing model to ours is the general framework for analyzing side-channel attacks proposed by Standaert et al. [15]. The authors of [15] propose a countermeasure against side-channel attacks that is different from ours, i.e., noise insertion on implementation. It should be noted that the noise insertion countermeasure proposed by [15] is dependent on the characteristics of the leaked physical information. On the other hand, our countermeasure, i.e., compression using an affine encoder, is independent of the characteristics of the leaked physical information.

1.3. Comparison to Encoding before Encryption

In this paper, our proposed solution is to perform additional encoding in the form of compression after the encryption process. Our aim is that by compressing the ciphertext, we compress the key “indirectly” and increase the “flatness” of the key used in the compressed ciphertext ( C ˜ ) such that the adversary will not get much additional information from eavesdropping on the compressed ciphertext ( C ˜ ). Instead of performing the encoding after encryption, one may consider performing the encoding before encryption, i.e., encoding the source and the key “directly” before performing the encryption. However, since we need to apply two separate encodings on the source and the key, we can expect that the implementation cost is more expensive than our proposed solution, i.e., approximately double the cost of applying our proposed solution. Moreover, it is not completely clear whether our security analysis still applies for this case. For example, if the adversary performs the side-channel attacks on the key after it is encoded (before encryption), we need a complete remodeling of the security problem.

1.4. Organization of this Paper

This paper is structured as follows. In Section 2, we show the basic notations and definitions that we use throughout this paper, and we also describe the formal formulations of our model and the security problem. In Section 3, we explain the idea and the formulation of our proposed solution. In Section 4, we explain the relation between our formulation and previous related works. Based on this, we explain the theoretical challenge which we have to overcome to prove that our proposed solution is sound. In Section 5, we state our main theorem on the reliability and security of our solution. In Section 6, we show the proof of our main theorem. We put the proofs of other related propositions, lemmas, and theorems in the appendix.

2. Problem Formulation

In this section, we will introduce the general notations used throughout this paper and provide a description of the basic problem we are focusing on, i.e., side-channel attacks on Shannon cipher systems. We also explain the basic framework of the solution that we consider to solve the problem. Finally, we state the formulation of the reliability and security problem that we consider and aim to solve in this paper.

2.1. Preliminaries

In this subsection, we show the basic notations and related consensus used in this paper.
Random Source of Information and Key: Let X be a random variable taking values in a finite set $\mathcal{X}$. Let $\{X_t\}_{t=1}^{\infty}$ be a stationary discrete memoryless source (DMS) such that for each $t = 1, 2, \ldots$, $X_t$ takes values in the finite set $\mathcal{X}$ and obeys the same distribution as X, denoted by $p_X = \{p_X(x)\}_{x \in \mathcal{X}}$. The stationary DMS $\{X_t\}_{t=1}^{\infty}$ is specified by $p_X$. In addition, let K be a random variable taking values in the same finite set $\mathcal{X}$ and representing the key used for encryption. Similarly, let $\{K_t\}_{t=1}^{\infty}$ be a stationary DMS such that for each $t = 1, 2, \ldots$, $K_t$ takes values in $\mathcal{X}$ and obeys the same distribution as K, denoted by $p_K = \{p_K(k)\}_{k \in \mathcal{X}}$. The stationary DMS $\{K_t\}_{t=1}^{\infty}$ is specified by $p_K$. In this paper, we assume that $p_K$ is the uniform distribution over $\mathcal{X}$.
Random Variables and Sequences: We write a sequence of random variables with length n from the information source as $X^n := X_1 X_2 \cdots X_n$. Similarly, strings with length n are written as $x^n := x_1 x_2 \cdots x_n \in \mathcal{X}^n$. For $x^n \in \mathcal{X}^n$, $p_{X^n}(x^n)$ stands for the probability of the occurrence of $x^n$. When the information source is memoryless and specified by $p_X$, the following equation holds:
$$p_{X^n}(x^n) = \prod_{t=1}^{n} p_X(x_t).$$
In this case, we also write $p_{X^n}(x^n)$ as $p_X^n(x^n)$. Similar notations are used for other random variables and sequences.
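As a small numerical illustration of the product formula above (with an assumed binary $p_X$; the distribution and function name are our own, not from the paper):

```python
import math

# Memoryless product formula: p_{X^n}(x^n) = prod_t p_X(x_t).
# p_X below is an illustrative Bernoulli(0.3) source on {0, 1}.
p_X = {0: 0.7, 1: 0.3}

def p_seq(xs):
    """Probability of the sequence xs under the DMS specified by p_X."""
    return math.prod(p_X[x] for x in xs)

assert abs(p_seq([0, 1, 0]) - 0.7 * 0.3 * 0.7) < 1e-12
```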
Consensus and Notations: Without loss of generality, throughout this paper, we assume that $\mathcal{X}$ is a finite field. The notation ⊕ denotes the field addition operation, while ⊖ denotes the field subtraction operation, i.e., $a \ominus b = a \oplus (-b)$ for any elements $a, b \in \mathcal{X}$. Throughout this paper, all logarithms are taken to the natural base.

2.2. Basic System Description

In this subsection, we explain the basic system setting and the basic adversarial model we consider in this paper. First, let the information source and the key be generated independently by different parties S gen and K gen , respectively. In our setting, we assume the following:
  • The random key K n is generated by K gen from a uniform distribution.
  • The source is generated by S gen and is independent of the key.
Next, let the random source X n from S gen be sent to the node L , and let the random key K n from K gen also be sent to L . Further settings of our system are described as follows and are also shown in Figure 2.
  • Source Processing: At the node $L$, $X^n$ is encrypted with the key $K^n$ using the encryption function Enc. The ciphertext $C^n$ of $X^n$ is given by
    $$C^n := \mathrm{Enc}(X^n) = X^n \oplus K^n.$$
  • Transmission: Next, the ciphertext C n is sent to the information processing center D through a public communication channel. Meanwhile, the key K n is sent to D through a private communication channel.
  • Sink Node Processing: In $D$, we decrypt the ciphertext $C^n$ using the key $K^n$ through the corresponding decryption procedure Dec defined by $\mathrm{Dec}(C^n) = C^n \ominus K^n$. It is obvious that we can correctly reproduce the source output $X^n$ from $C^n$ and $K^n$ with the decryption function Dec.
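The encryption and decryption steps above can be sketched over the binary field GF(2), where field addition ⊕ is bitwise XOR and subtraction coincides with addition (the function names are illustrative, not from the paper):

```python
import secrets

def otp_encrypt(x: bytes, k: bytes) -> bytes:
    """C^n = X^n (+) K^n, computed bytewise; XOR is addition in GF(2)."""
    assert len(x) == len(k)
    return bytes(a ^ b for a, b in zip(x, k))

def otp_decrypt(c: bytes, k: bytes) -> bytes:
    """Dec(C^n) = C^n (-) K^n; over GF(2), subtraction equals addition."""
    return otp_encrypt(c, k)

x = b"hidden source"
k = secrets.token_bytes(len(x))   # uniform key, as generated by K_gen
c = otp_encrypt(x, k)
assert otp_decrypt(c, k) == x     # D reproduces X^n from C^n and K^n
```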
Side-Channel Attacks by Eavesdropper Adversary: An (eavesdropper) adversary $\mathcal{A}$ eavesdrops on the public communication channel in the system. The adversary $\mathcal{A}$ also uses side information obtained by side-channel attacks. In this paper, we introduce a new theoretical model of side-channel attacks, described as follows. Let $\mathcal{Z}$ be a finite set and let $W : \mathcal{X} \to \mathcal{Z}$ be a noisy channel. Let Z be a channel output from W for the random input variable K. We consider the discrete memoryless channel specified by W. Let $Z^n \in \mathcal{Z}^n$ be the random variable obtained as the channel output by connecting $K^n \in \mathcal{X}^n$ to the input of the channel. We write the conditional distribution of $Z^n$ given $K^n$ as
$$W^n = \{W^n(z^n \mid k^n)\}_{(k^n, z^n) \in \mathcal{X}^n \times \mathcal{Z}^n}.$$
Since the channel is memoryless, we have
$$W^n(z^n \mid k^n) = \prod_{t=1}^{n} W(z_t \mid k_t).$$
On the above output Z n of W n for the input K n , we assume the following:
  • The three random variables X, K, and Z satisfy $X \perp (K, Z)$, which implies that $X^n \perp (K^n, Z^n)$.
  • W is given in the system and the adversary A cannot control W.
  • Through side-channel attacks, the adversary A can access Z n .
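As a toy illustration of the memoryless measurement channel W (our own choice, not from the paper), the sketch below simulates W as a binary symmetric channel applied independently to each key symbol, matching the product form $W^n(z^n \mid k^n) = \prod_t W(z_t \mid k_t)$:

```python
import random

def observe(k_seq, eps=0.1, rng=random):
    """Sample Z^n from W^n given K^n, where W is a BSC with
    crossover probability eps, applied independently per symbol."""
    return [k ^ (1 if rng.random() < eps else 0) for k in k_seq]

rng = random.Random(0)
k_seq = [rng.randint(0, 1) for _ in range(10)]   # key symbols in GF(2)
z_seq = observe(k_seq, eps=0.1, rng=rng)         # noisy side-channel view
assert len(z_seq) == len(k_seq)
```

With eps = 0 the measurement is lossless (Z^n = K^n); larger eps models a noisier measurement device.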
We next formulate the side information the adversary $\mathcal{A}$ obtains by side-channel attacks. For each $n = 1, 2, \ldots$, let $\varphi_{\mathcal{A}}^{(n)} : \mathcal{Z}^n \to \mathcal{M}_{\mathcal{A}}^{(n)}$ be an encoder function. Set $\varphi_{\mathcal{A}} := \{\varphi_{\mathcal{A}}^{(n)}\}_{n=1,2,\ldots}$. Let
$$R_{\mathcal{A}}^{(n)} := \frac{1}{n} \log \|\varphi_{\mathcal{A}}^{(n)}\| = \frac{1}{n} \log |\mathcal{M}_{\mathcal{A}}^{(n)}|$$
be the rate of the encoder function $\varphi_{\mathcal{A}}^{(n)}$. For $R_{\mathcal{A}} > 0$, we set
$$\mathcal{F}_{\mathcal{A}}^{(n)}(R_{\mathcal{A}}) := \{\varphi_{\mathcal{A}}^{(n)} : R_{\mathcal{A}}^{(n)} \le R_{\mathcal{A}}\}.$$
For the encoded side information the adversary A obtains, we assume the following.
  • The adversary A , having accessed Z n , obtains the encoded additional information φ A ( n ) ( Z n ) . For each n = 1 , 2 , , the adversary A can design φ A ( n ) .
  • The sequence $\{R_{\mathcal{A}}^{(n)}\}_{n=1}^{\infty}$ must be upper-bounded by a prescribed value. In other words, the adversary $\mathcal{A}$ must use $\varphi_{\mathcal{A}}^{(n)}$ such that for some $R_{\mathcal{A}}$ and for any sufficiently large n, $\varphi_{\mathcal{A}}^{(n)} \in \mathcal{F}_{\mathcal{A}}^{(n)}(R_{\mathcal{A}})$.
On the Scope of Our Theoretical Model: When $|\mathcal{Z}|$ is not so large, the adversary $\mathcal{A}$ may directly access $Z^n$. In contrast, in a real situation of side-channel attacks, the noisy version $Z^n$ of $K^n$ can often be regarded as very close to an analog random signal. In this case, $|\mathcal{Z}|$ is sufficiently large that the adversary $\mathcal{A}$ cannot obtain $Z^n$ in a lossless form. Our theoretical model can address such situations of side-channel attacks.

2.3. Solution Framework

As the basic solution framework, we consider applying a post-encryption-compression coding system. The application of this system is illustrated in Figure 3.
  • Encoding at Source Node $L$: We first use $\varphi^{(n)}$ to encode the ciphertext $C^n = X^n \oplus K^n$, where $\varphi^{(n)} : \mathcal{X}^n \to \mathcal{X}^m$. Let $\widetilde{C}^m = \varphi^{(n)}(C^n)$. Instead of sending $C^n$, we send $\widetilde{C}^m$ over the public communication channel.
  • Decoding at Sink Node $D$: $D$ receives $\widetilde{C}^m$ from the public communication channel. Using the common key $K^n$ and the decoder function $\Psi^{(n)} : \mathcal{X}^m \times \mathcal{X}^n \to \mathcal{X}^n$, $D$ outputs an estimation $\widehat{X}^n = \Psi^{(n)}(\widetilde{C}^m, K^n)$ of $X^n$.
On Reliability and Security: From the description of our system in the previous section, the decoding process in our system above is successful if $\widehat{X}^n = X^n$ holds. Hence, the decoding error probability $p_e$ is as follows:
$$p_e = p_e(\varphi^{(n)}, \Psi^{(n)} \mid p_{X^n}) := \Pr[\Psi^{(n)}(\widetilde{C}^m, K^n) \ne X^n].$$
Set $M_{\mathcal{A}}^{(n)} = \varphi_{\mathcal{A}}^{(n)}(Z^n)$. The information leakage $\Delta^{(n)}$ on $X^n$ from $(\widetilde{C}^m, M_{\mathcal{A}}^{(n)})$ is measured by the mutual information between $X^n$ and $(\widetilde{C}^m, M_{\mathcal{A}}^{(n)})$. This quantity is formally defined by
$$\Delta^{(n)} = \Delta^{(n)}(\varphi^{(n)}, \varphi_{\mathcal{A}}^{(n)} \mid p_{X^n}, p_{K^n}, W^n) := I(X^n; \widetilde{C}^m, M_{\mathcal{A}}^{(n)}).$$
Reliable and Secure Framework:
Definition 1.
A quantity R is achievable under $R_{\mathcal{A}} > 0$ for the system Sys if there exists a sequence $\{(\varphi^{(n)}, \Psi^{(n)})\}_{n \ge 1}$ such that $\forall \epsilon > 0$, $\exists n_0 = n_0(\epsilon) \in \mathbb{N}$, $\forall n \ge n_0$, we have
$$\frac{1}{n} \log |\mathcal{X}^m| = \frac{m}{n} \log |\mathcal{X}| \le R, \quad p_e(\varphi^{(n)}, \Psi^{(n)} \mid p_{X^n}) \le \epsilon,$$
and for any eavesdropper $\mathcal{A}$ with $\varphi_{\mathcal{A}}$ satisfying $\varphi_{\mathcal{A}}^{(n)} \in \mathcal{F}_{\mathcal{A}}^{(n)}(R_{\mathcal{A}})$,
$$\Delta^{(n)}(\varphi^{(n)}, \varphi_{\mathcal{A}}^{(n)} \mid p_{X^n}, p_{K^n}, W^n) \le \epsilon.$$
Definition 2.
[Reliable and Secure Rate Region] Let R Sys ( p X , p K , W ) denote the set of all ( R A , R ) such that R is achievable under R A . We call R Sys ( p X , p K , W ) the reliable and secure rate region.
Definition 3.
A triple $(R, E, F)$ is achievable under $R_{\mathcal{A}} > 0$ for the system Sys if there exists a sequence $\{(\varphi^{(n)}, \Psi^{(n)})\}_{n \ge 1}$ such that $\forall \epsilon > 0$, $\exists n_0 = n_0(\epsilon) \in \mathbb{N}$, $\forall n \ge n_0$, we have
$$\frac{1}{n} \log |\mathcal{X}^m| = \frac{m}{n} \log |\mathcal{X}| \le R, \quad p_e(\varphi^{(n)}, \Psi^{(n)} \mid p_{X^n}) \le e^{-n(E - \epsilon)},$$
and for any eavesdropper $\mathcal{A}$ with $\varphi_{\mathcal{A}}$ satisfying $\varphi_{\mathcal{A}}^{(n)} \in \mathcal{F}_{\mathcal{A}}^{(n)}(R_{\mathcal{A}})$, we have
$$\Delta^{(n)}(\varphi^{(n)}, \varphi_{\mathcal{A}}^{(n)} \mid p_{X^n}, p_{K^n}, W^n) \le e^{-n(F - \epsilon)}.$$
Definition 4 (Rate, Reliability, and Security Region).
Let D Sys ( p X , p K , W ) denote the set of all ( R A , R , E , F ) such that ( R , E , F ) is achievable under R A . We call D Sys ( p X , p K , W ) the rate, reliability and security region.
Our aim in this paper is to find the explicit inner bounds of R Sys ( p X , p K , W ) and D Sys ( p X , p K , W ) .

3. Proposed Idea: Affine Encoder as a Privacy Amplifier

In order to instantiate the basic solution framework mentioned in the previous section, we propose the use of an affine encoder as the compression function $\varphi^{(n)}$. We show in this section that we can easily construct an affine encoder suitable for our solution framework from a linear encoder. The instantiation of the solution framework with an affine encoder is illustrated in Figure 4.
Construction of the Affine Encoder: For each $n = 1, 2, \ldots$, let $\phi^{(n)} : \mathcal{X}^n \to \mathcal{X}^m$ be a linear mapping. We define the mapping $\phi^{(n)}$ by
$$\phi^{(n)}(x^n) = x^n A \quad \text{for } x^n \in \mathcal{X}^n,$$
where A is a matrix with n rows and m columns whose entries are from $\mathcal{X}$. We fix $b^m \in \mathcal{X}^m$. Define the mapping $\varphi^{(n)} : \mathcal{X}^n \to \mathcal{X}^m$ by
$$\varphi^{(n)}(k^n) := \phi^{(n)}(k^n) \oplus b^m = k^n A \oplus b^m \quad \text{for } k^n \in \mathcal{X}^n.$$
The mapping $\varphi^{(n)}$ is called the affine mapping induced by the linear mapping $\phi^{(n)}$ and the constant vector $b^m \in \mathcal{X}^m$. By this definition of $\varphi^{(n)}$, the following affine structure holds:
$$\varphi^{(n)}(x^n \oplus k^n) = (x^n \oplus k^n) A \oplus b^m = x^n A \oplus (k^n A \oplus b^m) = \phi^{(n)}(x^n) \oplus \varphi^{(n)}(k^n) \quad \text{for } x^n, k^n \in \mathcal{X}^n.$$
Next, let $\psi^{(n)} : \mathcal{X}^m \to \mathcal{X}^n$ be the corresponding decoder for $\phi^{(n)}$. Note that $\psi^{(n)}$ does not have a linear structure in general.
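The affine structure above can be checked numerically. The following sketch (illustrative dimensions, GF(2) arithmetic via NumPy; not code from the paper) verifies $\varphi^{(n)}(x^n \oplus k^n) = \phi^{(n)}(x^n) \oplus \varphi^{(n)}(k^n)$:

```python
import numpy as np

# Affine encoder over GF(2): phi(x) = x A (matrix product mod 2) and
# varphi(k) = k A + b^m.  A, b, and the sizes n, m are illustrative.
rng = np.random.default_rng(0)
n, m = 8, 4
A = rng.integers(0, 2, size=(n, m))   # n x m matrix over GF(2)
b = rng.integers(0, 2, size=m)        # constant vector b^m

def phi(v):       # linear map phi(v) = vA
    return (v @ A) % 2

def varphi(v):    # affine map varphi(v) = vA + b
    return (v @ A + b) % 2

x = rng.integers(0, 2, size=n)
k = rng.integers(0, 2, size=n)
# Affine structure: varphi(x + k) = phi(x) + varphi(k) over GF(2).
assert np.array_equal(varphi((x + k) % 2), (phi(x) + varphi(k)) % 2)
```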
Description of Proposed Procedure: We describe the procedure of our privacy amplified system as follows.
  • Encoding of Ciphertext: First, we use $\varphi^{(n)}$ to encode the ciphertext $C^n = X^n \oplus K^n$. Let $\widetilde{C}^m = \varphi^{(n)}(C^n)$. Then, instead of sending $C^n$, we send $\widetilde{C}^m$ over the public communication channel. By the affine structure of the encoder $\varphi^{(n)}$, we have
    $$\widetilde{C}^m = \varphi^{(n)}(X^n \oplus K^n) = \phi^{(n)}(X^n) \oplus \varphi^{(n)}(K^n) = \widetilde{X}^m \oplus \widetilde{K}^m,$$
    where we set $\widetilde{X}^m := \phi^{(n)}(X^n)$ and $\widetilde{K}^m := \varphi^{(n)}(K^n)$.
  • Decoding at Sink Node $D$: First, using the affine encoder $\varphi^{(n)}$, $D$ encodes the key $K^n$ received through the private channel into $\widetilde{K}^m = \varphi^{(n)}(K^n)$. Receiving $\widetilde{C}^m$ from the public communication channel, $D$ can obtain $\widetilde{X}^m = \phi^{(n)}(X^n)$ by subtracting $\widetilde{K}^m = \varphi^{(n)}(K^n)$ from $\widetilde{C}^m$. Finally, $D$ outputs $\widehat{X}^n$ by applying the decoder $\psi^{(n)}$ to $\widetilde{X}^m$ as follows:
    $$\widehat{X}^n = \psi^{(n)}(\widetilde{X}^m) = \psi^{(n)}(\phi^{(n)}(X^n)).$$
Our concrete privacy-amplified system described above is illustrated in Figure 4.
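The encoding and decoding steps of the procedure can be traced end to end. This sketch (again over GF(2), with hypothetical dimensions, and omitting the nonlinear decoder $\psi^{(n)}$, which is out of scope here) checks that $D$ recovers $\widetilde{X}^m = \phi^{(n)}(X^n)$ from $\widetilde{C}^m$ and the shared key:

```python
import numpy as np

rng = np.random.default_rng(1)
n, m = 8, 4
A = rng.integers(0, 2, size=(n, m))   # illustrative encoder matrix
b = rng.integers(0, 2, size=m)        # illustrative constant vector

phi = lambda v: (v @ A) % 2           # linear part
varphi = lambda v: (v @ A + b) % 2    # affine encoder

X = rng.integers(0, 2, size=n)        # source block
K = rng.integers(0, 2, size=n)        # shared uniform key

C_tilde = varphi((X + K) % 2)         # sent over the public channel
K_tilde = varphi(K)                   # computed by D from the key
X_tilde = (C_tilde - K_tilde) % 2     # subtraction over GF(2)
assert np.array_equal(X_tilde, phi(X))  # D then applies psi to X_tilde
```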

Splitting of Reliability and Security

By the affine structure of the encoder function φ ( n ) , the proposed privacy amplified system can be split into two coding problems. One is a source coding problem using a linear encoder ϕ ( n ) . We hereafter call this Problem 0. The other is a privacy amplification problem using the affine encoder φ ( n ) . We call this Problem 1. These two problems are shown in Figure 5.
On Reliability (Problem 0): From the description of our system in the previous section, the decoding process in our system above is successful if $\widehat{X}^n = X^n$ holds. Hence, the decoding error probability $p_e$ is as follows:
$$p_e = p_e(\phi^{(n)}, \psi^{(n)} \mid p_{X^n}) = \Pr[\psi^{(n)}(\phi^{(n)}(X^n)) \ne X^n].$$
In Problem 0, we discuss the minimum rate R such that there exists $\{(\phi^{(n)}, \psi^{(n)})\}_{n \ge 1}$ such that $\forall \epsilon > 0$, $\exists n_0 = n_0(\epsilon) \in \mathbb{N}$, $\forall n \ge n_0$, we have
$$\frac{1}{n} \log |\mathcal{X}^m| = \frac{m}{n} \log |\mathcal{X}| \le R + \epsilon, \quad p_e(\phi^{(n)}, \psi^{(n)} \mid p_{X^n}) \le \epsilon.$$
It is well known that this minimum is equal to H(X) when $\{\phi^{(n)}\}_n$ is a sequence of general (nonlinear) encoders. Csiszár [6] proved the existence of a sequence of linear encoders and nonlinear decoders $\{(\phi^{(n)}, \psi^{(n)})\}_{n \ge 1}$ such that for any $p_X$ satisfying $R > H(X)$, the error probability $p_e(\phi^{(n)}, \psi^{(n)} \mid p_{X^n})$ decays exponentially as $n \to \infty$. His result is stated in the next section.
On Security (Problem 1): We assume that the adversary $\mathcal{A}$ knows $(A, b^m)$ defining the affine encoder $\varphi^{(n)}$. When $\varphi^{(n)}$ has the affine structure shown above, the information leakage $\Delta^{(n)}$, measured by the mutual information between $X^n$ and $(\widetilde{C}^m, M_{\mathcal{A}}^{(n)})$, has the following form:
$$\Delta^{(n)} = \Delta^{(n)}(\varphi^{(n)}, \varphi_{\mathcal{A}}^{(n)} \mid p_{X^n}, p_{K^n}, W^n) = I(X^n; \widetilde{C}^m, M_{\mathcal{A}}^{(n)}) = I(X^n; \varphi^{(n)}(X^n \oplus K^n), M_{\mathcal{A}}^{(n)}) \stackrel{(a)}{=} I(X^n; \phi^{(n)}(X^n) \oplus \varphi^{(n)}(K^n) \mid M_{\mathcal{A}}^{(n)}) = I(X^n; \widetilde{X}^m \oplus \widetilde{K}^m \mid M_{\mathcal{A}}^{(n)}).$$
Step (a) follows from $X^n \perp M_{\mathcal{A}}^{(n)}$. Using this, we upper bound $\Delta^{(n)} = I(X^n; \widetilde{C}^m, M_{\mathcal{A}}^{(n)})$ to obtain the following lemma.
Lemma 1.
$$\Delta^{(n)} = I(X^n; \widetilde{C}^m, M_{\mathcal{A}}^{(n)}) \le D\left(p_{\widetilde{K}^m \mid M_{\mathcal{A}}^{(n)}} \,\middle\|\, p_{V^m} \,\middle|\, p_{M_{\mathcal{A}}^{(n)}}\right),$$
where $p_{V^m}$ represents the uniform distribution over $\mathcal{X}^m$.
Proof. 
We have the following chain of inequalities:
$$\Delta^{(n)} = I(X^n; \widetilde{C}^m, M_{\mathcal{A}}^{(n)}) \stackrel{(a)}{=} I(X^n; \widetilde{X}^m \oplus \widetilde{K}^m \mid M_{\mathcal{A}}^{(n)}) \le \log |\mathcal{X}^m| - H(\widetilde{X}^m \oplus \widetilde{K}^m \mid X^n, M_{\mathcal{A}}^{(n)}) \stackrel{(b)}{=} \log |\mathcal{X}^m| - H(\widetilde{K}^m \mid X^n, M_{\mathcal{A}}^{(n)}) \stackrel{(c)}{=} \log |\mathcal{X}^m| - H(\widetilde{K}^m \mid M_{\mathcal{A}}^{(n)}) = D\left(p_{\widetilde{K}^m \mid M_{\mathcal{A}}^{(n)}} \,\middle\|\, p_{V^m} \,\middle|\, p_{M_{\mathcal{A}}^{(n)}}\right).$$
Step (a) follows from the affine structure of $\varphi^{(n)}$ and $X^n \perp M_{\mathcal{A}}^{(n)}$. Step (b) follows from $\widetilde{X}^m = \phi^{(n)}(X^n)$. Step (c) follows from $(\widetilde{K}^m, M_{\mathcal{A}}^{(n)}) \perp X^n$. □
We set
$$\xi_D^{(n)} = \xi_D^{(n)}(\varphi^{(n)}, R_{\mathcal{A}} \mid p_{K^n}, W^n) := \max_{\varphi_{\mathcal{A}}^{(n)} \in \mathcal{F}_{\mathcal{A}}^{(n)}(R_{\mathcal{A}})} D\left(p_{\widetilde{K}^m \mid M_{\mathcal{A}}^{(n)}} \,\middle\|\, p_{V^m} \,\middle|\, p_{M_{\mathcal{A}}^{(n)}}\right).$$
Then we have the following lemma.
Lemma 2.
For any affine encoder $\varphi^{(n)} : \mathcal{X}^n \to \mathcal{X}^m$ and any $\varphi_{\mathcal{A}}^{(n)} \in \mathcal{F}_{\mathcal{A}}^{(n)}(R_{\mathcal{A}})$, we have
$$\Delta^{(n)}(\varphi^{(n)}, \varphi_{\mathcal{A}}^{(n)} \mid p_{X^n}, p_{K^n}, W^n) \le \xi_D^{(n)}(\varphi^{(n)}, R_{\mathcal{A}} \mid p_{K^n}, W^n).$$
The quantity $\xi_D^{(n)}(\varphi^{(n)}, R_{\mathcal{A}} \mid p_{K^n}, W^n)$ will play an important role in deriving an explicit upper bound of $\Delta^{(n)}(\varphi^{(n)}, \varphi_{\mathcal{A}}^{(n)} \mid p_{X^n}, p_{K^n}, W^n)$. In Problem 1, we consider the privacy amplification problem using the quantity $\xi_D^{(n)}(\varphi^{(n)}, R_{\mathcal{A}} \mid p_{K^n}, W^n)$ as a security criterion. In this problem, we study an explicit characterization of the region denoted by $\mathcal{R}_{P1}(p_K, W)$, which consists of all pairs $(R, R_{\mathcal{A}})$ such that there exists $\{\varphi^{(n)}\}_{n \ge 1}$ such that $\forall \epsilon > 0$, $\exists n_0 = n_0(\epsilon) \in \mathbb{N}$, $\forall n \ge n_0$,
$$\frac{1}{n} \log \|\varphi^{(n)}\| = \frac{m}{n} \log |\mathcal{X}| \ge R - \epsilon \quad \text{and} \quad \xi_D^{(n)}(\varphi^{(n)}, R_{\mathcal{A}} \mid p_{K^n}, W^n) \le \epsilon.$$
In the next section, we discuss two previous works related to Problem 1.

4. Previous Related Works

In this section, we introduce approaches from previous existing work related to Problem 0 (reliability) and Problem 1 (security). Our goal is that by showing these previous approaches, it will be easier to understand our approach to analyzing reliability and security. In particular, for Problem 1 (security), we explain approaches used in similar problems in previous works and highlight their differences from Problem 1.
We first state a previous result related to Problem 0. Let $\varphi^{(n)}$ be an affine encoder and $\phi^{(n)}$ the linear encoder that induces it. We define a function related to an exponential upper bound of $p_e(\phi^{(n)}, \psi^{(n)} \mid p_{X^n})$. Let $\bar{X}$ be an arbitrary random variable over $\mathcal{X}$ with probability distribution $p_{\bar{X}}$. Let $\mathcal{P}(\mathcal{X})$ denote the set of all probability distributions on $\mathcal{X}$. For $R \ge 0$ and $p_X \in \mathcal{P}(\mathcal{X})$, we define the following function:
$$E(R \mid p_X) := \min_{p_{\bar{X}} \in \mathcal{P}(\mathcal{X})} \left\{ [R - H(\bar{X})]^{+} + D(p_{\bar{X}} \,\|\, p_X) \right\},$$
where $[a]^{+} := \max\{0, a\}$.
By simple computation, we can prove that E ( R | p X ) takes positive values if and only if R > H ( X ) . We have the following result.
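For intuition, $E(R \mid p_X)$ can be evaluated numerically. The sketch below (our illustration, binary alphabet, natural logarithms as in the paper's convention, grid search over binary $p_{\bar{X}}$) shows the exponent vanishing for $R < H(X)$ and becoming positive for $R > H(X)$:

```python
import numpy as np

def H(q):
    """Binary entropy (nats) of Bernoulli(q)."""
    q = np.clip(q, 1e-12, 1 - 1e-12)
    return -q * np.log(q) - (1 - q) * np.log(1 - q)

def D(q, p):
    """Binary KL divergence D(Bern(q) || Bern(p)) in nats."""
    q = np.clip(q, 1e-12, 1 - 1e-12)
    return q * np.log(q / p) + (1 - q) * np.log((1 - q) / (1 - p))

def E(R, p):
    """Grid search for min over p_Xbar of [R - H(Xbar)]^+ + D(Xbar || X)."""
    qs = np.linspace(1e-6, 1 - 1e-6, 100001)
    return float(np.min(np.maximum(R - H(qs), 0.0) + D(qs, p)))

p = 0.11                      # illustrative Bernoulli(p) source
print(E(0.9 * H(p), p))       # R < H(X): exponent is (numerically) 0
print(E(1.1 * H(p), p))       # R > H(X): strictly positive exponent
```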
Theorem 1. 
(Csiszár [6]). There exists a sequence $\{(\phi^{(n)}, \psi^{(n)})\}_{n \ge 1}$ such that for any $p_X$, we have
$$\frac{1}{n} \log |\mathcal{X}^m| = \frac{m}{n} \log |\mathcal{X}| \le R, \quad p_e(\phi^{(n)}, \psi^{(n)} \mid p_{X^n}) \le e^{-n[E(R \mid p_X) - \delta_n]},$$
where $\delta_n$ is defined by
$$\delta_n := \frac{1}{n} \log \left[ e (n+1)^{3|\mathcal{X}|} \right].$$
Note that $\delta_n \to 0$ as $n \to \infty$.
It follows from Theorem 1 that if R > H ( X ) , then the error probability of decoding p e ( ϕ ( n ) , ψ ( n ) | p X n ) decays exponentially, and its exponent is lower bounded by the quantity E ( R | p X ) . Furthermore, the code { ( ϕ ( n ) , ψ ( n ) ) } n 1 is a universal code that depends only on the rate R and not on the value of p X P ( X ) .
We next state two coding problems related to Problem 1. One is a problem on privacy amplification for the bounded storage eavesdropper posed and investigated by Watanabe and Oohama [10]. The other is the one helper source coding problem posed and investigated by Ahlswede and Körner [7] and Wyner [16]. We hereafter call the former and latter problems, respectively, Problem 2 and Problem 3. Problems 1–3 are shown in Figure 6. As we can see from this figure, these three problems are based on the same communication scheme. The classes of encoder functions and the security criteria on $\mathcal{A}$ differ between these three problems. In Problem 1, the sequence of encoding functions $\{\varphi^{(n)}\}_{n \ge 1}$ is restricted to the class of affine encoders to satisfy the homomorphic property. On the other hand, in Problems 2 and 3, we have no such restriction on the class of encoder functions. In the descriptions of Problems 2 and 3, we state the difference in security criteria between Problems 1, 2, and 3. A comparison of the three problems in terms of $\{\varphi^{(n)}\}_{n \ge 1}$ and security criteria is summarized in Table 1.
In Problem 2, Alice and Bob share a random variable $K^n$ of block length n, and an eavesdropper adversary $\mathcal{A}$ has a random variable $Z^n$ that is correlated to $K^n$. In such a situation, Alice and Bob try to distill a secret key that is as long as possible. In [10], Watanabe and Oohama considered the situation where the adversary's random variable $Z^n$ is stored in a storage as a function value of $Z^n$, and the rate of the storage size is bounded. This situation makes sense when the alphabet size of the adversary's observation $Z^n$ is too large for $Z^n$ to be stored directly. In such a situation, they obtained an explicit characterization of the region $\mathcal{R}_{\mathrm{WO}}(p_K, W)$ indicating the trade-off between the key rate $R = (m/n) \log |\mathcal{X}|$ and the rate $R_{\mathcal{A}} = (1/n) \log |\mathcal{M}_{\mathcal{A}}^{(n)}|$ of the storage size. In Problem 2, the variational distance $d(p_{V^m} \times p_{M_{\mathcal{A}}^{(n)}}, p_{\widetilde{K}^m M_{\mathcal{A}}^{(n)}})$ between $p_{V^m} \times p_{M_{\mathcal{A}}^{(n)}}$ and $p_{\widetilde{K}^m M_{\mathcal{A}}^{(n)}}$ is used as the security criterion instead of $D(p_{\widetilde{K}^m \mid M_{\mathcal{A}}^{(n)}} \| p_{V^m} \mid p_{M_{\mathcal{A}}^{(n)}})$ in Problem 1. Define
ξ d ( n ) = ξ d ( n ) ( φ ( n ) , R A | p K n , W n ) : = max φ A ( n ) F ( n ) ( R A ) d ( p V m × p M A ( n ) , p K ˜ m M A ( n ) ) .
Then the formal definition of the region R WO ( p K , W ) is given by the following:
R WO ( p K , W ) : = { ( R A , R ) : ∃ { φ ( n ) } n ≥ 1   such   that   ∀ ε > 0 , ∃ n 0 = n 0 ( ε ) ∈ N 0 , ∀ n ≥ n 0 , ( m / n ) log | X | ≥ R − ε   and   ξ d ( n ) ( φ ( n ) , R A | p K n , W n ) ≤ ε } .
In Problem 3, the adversary outputs an estimation K ^ n of K n from K ˜ m = φ ( n ) ( K n ) and M A ( n ) = φ A ( n ) ( Z n ) . Let ψ A ( n ) : M A ( n ) × X m → X n be a decoder function of the adversary. Then K ^ n is given by K ^ n = ψ A ( n ) ( M A ( n ) , K ˜ m ) = ψ A ( n ) ( φ A ( n ) ( Z n ) , φ ( n ) ( K n ) ) . Let
p e , A ( n ) = p e , A ( n ) φ ( n ) , φ A ( n ) ψ A ( n ) | p K n , W n : = Pr K n ψ A ( n ) ( φ A ( n ) ( Z n ) , φ ( n ) ( K n ) )
be the error probability of decoding for Problem 3. The quantity M A ( n ) serves as a helper for the decoding of K n from K ˜ m . In Problem 3, Ahlswede and Körner [7] and Wyner [16] investigated an explicit characterization of the rate region R AKW ( p K , W ) indicating the trade-off between R A and R under the condition that p e , A ( n ) = Pr { K n K ^ n } vanishes asymptotically. The region R AKW ( p K , W ) is formally defined by
R AKW ( p K , W ) : = { ( R A , R ) : ∃ { ( φ ( n ) , φ A ( n ) , ψ A ( n ) ) } n ≥ 1   such   that   ∀ ε > 0 , ∃ n 0 = n 0 ( ε ) ∈ N 0 , ∀ n ≥ n 0 , ( m / n ) log | X | ≤ R + ε , φ A ( n ) ∈ F A ( n ) ( R A + ε ) , and   p e , A ( n ) φ ( n ) , φ A ( n ) , ψ A ( n ) | p K n , W n ≤ ε } .
The region R AKW ( p K , W ) was determined by Ahlswede and Körner [7] and Wyner [16]. To state their result, we define several quantities. Let U be an auxiliary random variable taking values in a finite set U . We assume that the joint distribution of ( U , Z , K ) is
p U Z K ( u , z , k ) = p U ( u ) p Z | U ( z | u ) p K | Z ( k | z ) .
The above condition is equivalent to U Z K . Define the set of probability distributions p = p U Z K by
P ( p K , W ) : = { p U Z K : | U | | Z | + 1 , U Z K } .
Set
R ( p ) : = { ( R A , R ) : R A , R 0 , R A I ( Z ; U ) , R H ( K | U ) } , R ( p K , W ) : = p P ( p K , W ) R ( p ) .
We can show that the region R ( p K , W ) satisfies the following property.
Property 1.
(a) 
The region R ( p K , W ) is a closed convex subset of R + 2 : = { ( R A , R ) : R A ≥ 0 , R ≥ 0 } .
(b) 
For any ( p K , W ) , we have
min ( R A , R ) R ( p K , W ) ( R A + R ) = H ( K ) .
The minimum is attained by ( R A , R ) = ( 0 , H ( K ) ) . This result implies that
R ( p K , W ) { ( R A , R ) : R A + R H ( K ) } R + 2 .
Furthermore, the point ( 0 , H ( K ) ) always belongs to R ( p K , W ) .
Property 1 part (a) is a well-known property, and the proof of part (b) is easy; we omit the proofs of both parts. A typical shape of the region R ( p K , W ) is shown in Figure 7.
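As a numerical illustration of Property 1 part (b), the following sketch evaluates the pair ( I ( Z ; U ) , H ( K | U ) ) for auxiliary channels p Z | U whose Z-marginal matches the source. The source parameters p Z and W below are hypothetical, chosen only for illustration: a constant U attains ( 0 , H ( K ) ) , and every admissible pair satisfies the sum-rate bound R A + R ≥ H ( K ) .

```python
import math

def H(dist):
    """Shannon entropy in nats of a probability vector."""
    return -sum(p * math.log(p) for p in dist if p > 0)

# Hypothetical binary source: Z ~ p_Z, and K is the output of the channel W.
p_Z = [0.6, 0.4]
W = [[0.9, 0.1], [0.2, 0.8]]          # W[z][k] = p_{K|Z}(k|z)
p_K = [sum(p_Z[z] * W[z][k] for z in range(2)) for k in range(2)]
H_K = H(p_K)

def rate_pair(p_U, channels):
    """Return (I(Z;U), H(K|U)) for the Markov chain U -> Z -> K, p_{Z|U} = channels."""
    pZ = [sum(p_U[u] * channels[u][z] for u in range(len(p_U))) for z in range(2)]
    I_ZU = sum(p_U[u] * channels[u][z] * math.log(channels[u][z] / pZ[z])
               for u in range(len(p_U)) for z in range(2) if channels[u][z] > 0)
    H_K_U = sum(p_U[u] * H([sum(channels[u][z] * W[z][k] for z in range(2))
                            for k in range(2)])
                for u in range(len(p_U)))
    return I_ZU, H_K_U

# Constant U: I(Z;U) = 0 and H(K|U) = H(K), so (0, H(K)) lies in R(p_K, W).
RA0, R0 = rate_pair([1.0], [p_Z])
assert abs(RA0) < 1e-12 and abs(R0 - H_K) < 1e-9

# Binary U with p_{Z|U} chosen so that the Z-marginal matches p_Z; by the data
# processing inequality every such pair satisfies R_A + R >= H(K).
grid = [i / 10 for i in range(1, 10)]
for w in grid:
    for a in grid:
        c = (p_Z[0] - w * a) / (1 - w)   # forces the Z-marginal to equal p_Z
        if not 0.0 <= c <= 1.0:
            continue
        RA, R = rate_pair([w, 1 - w], [[a, 1 - a], [c, 1 - c]])
        assert RA + R >= H_K - 1e-9
```

The check relies only on the Markov structure U → Z → K; the specific numbers are placeholders.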
As stated above, the rate region R AKW ( p K , W ) was determined by Ahlswede and Körner [7] and Wyner [16]. Their result is the following.
Theorem 2.
(Ahlswede and Körner [7] and Wyner [16])
R AKW ( p K , W ) = R ( p K , W ) .
Watanabe and Oohama [10] derived an explicit form of R WO ( p K , W ) , showing that it is equal to R c ( p K , W ) ; that is, we have the following result.
Theorem 3.
(Watanabe and Oohama [10])
R WO ( p K , W ) = R AKW c ( p K , W ) = R c ( p K , W ) .
In the remaining part of this section, we investigate a relationship between Problems 2 and 3 to give an outline of the proof of this theorem. Let
p c , A ( n ) = p c , A ( n ) φ ( n ) , φ A ( n ) , ψ A ( n ) | p K n , W n : = Pr K n = ψ A ( n ) ( φ A ( n ) ( Z n ) , φ ( n ) ( K n ) )
be the correct probability of decoding for Problem 3. The following lemma provides an important inequality to examine a relationship between these two problems.
Lemma 3.
For any ( φ ( n ) , φ A ( n ) , ψ A ( n ) ) , we have the following:
p c , A ( n ) φ ( n ) , φ A ( n ) , ψ A ( n ) | p K n , W n 1 | X | m + d p V m × p M A ( n ) , p K ˜ m M A ( n ) .
Proof of this lemma is given in Appendix A. Using Lemma 3, we can easily prove the inclusion R WO ( p K , W ) ⊆ R AKW c ( p K , W ) , which corresponds to the converse part of Theorem 3.
Proof of R WO ( p K , W ) ⊆ R AKW c ( p K , W ) :
We assume that ( R A , R ) ∈ R AKW ( p K , W ) . Then there exists { ( φ ( n ) , φ A ( n ) , ψ A ( n ) ) } n ≥ 1 such that for any ε > 0 , there exists n 0 = n 0 ( ε ) ∈ N 0 such that for all n ≥ n 0 ,
m n log | X | ≤ R + ε , φ A ( n ) ∈ F A ( n ) ( R A + ε ) ,
and   p e , A ( n ) φ ( n ) , φ A ( n ) , ψ A ( n ) | p K n , W n ≤ ε .
From the above sequence { ( φ ( n ) , φ A ( n ) , ψ A ( n ) ) } n ≥ 1 , we can construct the sequence { ( φ ^ ( n ) , φ A ( n ) , ψ A ( n ) ) } n ≥ 1 such that
R + ε ≥ 1 n log | | φ ^ ( n ) | | = m ^ n log | X | ≥ max { R − ε , m n log | X | } , φ A ( n ) ∈ F A ( n ) ( R A + ε ) ,
p e , A ( n ) φ ^ ( n ) , φ A ( n ) , ψ A ( n ) | p K n , W n ≤ p e , A ( n ) φ ( n ) , φ A ( n ) , ψ A ( n ) | p K n , W n ≤ ε .
Set K ˜ m ^ : = φ ^ ( n ) ( K n ) . Then from (14) and Lemma 3, we have
d p V m ^ × p M A ( n ) , p K ˜ m ^ M A ( n ) ≥ 1 − ε − 1 | X | m ^ ,
from which we have
d p V m ^ × p M A ( n ) , p K ˜ m ^ M A ( n ) ≥ 1 − 2 ε ,
for sufficiently large n. From (13), (15), and the definition of R WO ( p K , W ) , we can see that ( R A + ε , R ) ∉ R WO ( p K , W ) , or equivalently,
( R A + ε , R ) ∈ R WO c ( p K , W ) ⇔ ( R A , R ) ∈ R WO c ( p K , W ) − ε ( 1 , 0 ) ,
where we set R − ( a , b ) : = { ( u , v ) : ( u + a , v + b ) ∈ R } . Since ( R A , R ) ∈ R AKW ( p K , W ) is arbitrary, we have that
R AKW ( p K , W ) ⊆ R WO c ( p K , W ) − ε ( 1 , 0 ) ⇔ R AKW ( p K , W ) + ε ( 1 , 0 ) ⊆ R WO c ( p K , W ) ⇔ R WO ( p K , W ) ⊆ R AKW c ( p K , W ) + ε ( 1 , 0 ) ⇒ R WO ( p K , W ) ⊆ R c ( p K , W ) + ε ( 1 , 0 ) .
By letting ε → 0 in (17) and considering that R c ( p K , W ) is an open set, we have that R WO ( p K , W ) ⊆ R c ( p K , W ) . □
To prove R WO ( p K , W ) ⊇ R AKW c ( p K , W ) , we examine an upper bound of ξ d ( n ) ( φ ( n ) , R A | p K n , W n ) . For η > 0 , we define
η ( n ) = η ( n ) ( R | p K n , W n ) : = p M A ( n ) Z n K n R 1 n log 1 p K n | M A ( n ) ( K n | M A ( n ) ) η , Φ d , η ( n ) ( R A , R | p K n W n ) : = max φ A ( n ) F ( n ) ( R A ) η ( n ) ( R | p K n , W n ) + e n η .
According to Watanabe and Oohama [10], we have the following two propositions.
Proposition 1.
(Watanabe and Oohama [10]). Fix any η > 0 . Then, for any φ ( n ) : X n → X m satisfying ( m / n ) log | X | R 2 η , we have
ξ d ( n ) ( φ ( n ) , R A | p K n , W n ) Φ d , η ( n ) ( R A , R | p K n W n ) .
Proposition 2.
(Watanabe and Oohama [10]). If ( R A , R ) ∉ R ( p K , W ) , then for any η > 0 and any φ A ( n ) ∈ F A ( n ) ( R A ) , we have
lim n η ( n ) ( R | p K n , W ) = 0 ,
which implies that
lim n Φ d , η ( n ) ( R A , R | p K n W n ) = 0 .
The inclusion R WO ( p K , W ) ⊇ R AKW c ( p K , W ) immediately follows from Propositions 1 and 2.

5. Reliability and Security Analysis

In this section, we state our main results. We use the affine encoder φ ( n ) defined in the previous section. We upper bound p e = p e ( φ ( n ) , ψ ( n ) | p X n ) and Δ ( n ) = Δ ( n ) ( φ ( n ) , φ A ( n ) | p X n , p K n , W n ) to obtain inner bounds of R Sys ( p X , p K , W ) and D Sys ( p X , p K , W ) .
Let
Φ D , η ( n ) ( R A , R | p K n W n ) : = max φ A ( n ) F ( n ) ( R A ) n R η ( n ) ( R | p K n , W n ) + e n η , Φ D ( n ) ( R A , R | p K n W n ) : = inf η > 0 Φ D , η ( R A , R | p K n W n ) .
Then we have the following proposition.
Proposition 3.
For any R A , R > 0 and any ( p K , W ) , there exists a sequence of mappings { ( φ ( n ) , ψ ( n ) ) } n = 1 such that for any p X P ( X ) , we have
R − 1 n ≤ 1 n log | X m | = m n log | X | ≤ R , p e ( ϕ ( n ) , ψ ( n ) | p X n ) ≤ e ( n + 1 ) 2 | X | { ( n + 1 ) | X | + 1 } e − n E ( R | p X ) ,
and for any eavesdropper A with φ A satisfying φ A ( n ) F A ( n ) ( R A ) , we have
Δ ( n ) ( φ ( n ) , φ A ( n ) | p X n , p K n , W n ) ≤ { ( n + 1 ) | X | + 1 } Φ D ( n ) ( R A , R | p K n W n ) .
This proposition can be proved with several tools developed in previous works; the details of the proof are given in the next section. As we stated in Proposition 2, Watanabe and Oohama [10] proved that if ( R A , R ) ∉ R ( p K , W ) , then for any η > 0 and any φ A ( n ) ∈ F A ( n ) ( R A ) , the quantity η ( n ) ( R | p K n , W n ) vanishes as n → ∞ . Their method cannot be applied to the analysis of Φ D ( n ) ( R A , R | p K n W n ) , since the quantity n R is multiplied by η ( n ) ( R | p K n , W n ) in the definition of Φ D ( n ) ( R A , R | p K n W n ) . In this paper, we derive an upper bound of Φ D ( n ) ( R A , R | p K n W n ) that decays exponentially as n → ∞ if ( R A , R ) ∉ R ( p K , W ) . To derive this upper bound, we use a new method developed by Oohama to prove strong converse theorems for multi-terminal source and channel networks [9,17,18,19,20].
We define several functions and sets to describe the upper bound of Φ D ( n ) ( R A , R | p K n W n ) . Set
Q ( p K | Z ) : = { q = q U Z K : | U | | Z | , U Z K , p K | Z = q K | Z } .
For ( μ , α ) [ 0 , 1 ] 2 and for q = q U Z K Q ( p K | Z ) , define
ω q | p Z ( μ , α ) ( z , k | u ) : = α ¯ log q Z ( z ) p Z ( z ) + α μ log q Z | U ( z | u ) p Z ( z ) + μ ¯ log 1 q K | U ( k | u ) , Ω ( μ , α ) ( q | p Z ) : = log E q exp ω q | p Z ( μ , α ) ( Z , K | U ) , Ω ( μ , α ) ( p K , W ) : = min q Q ( p K | Z ) Ω ( μ , α ) ( q | p Z ) , F ( μ , α ) ( μ R A + μ ¯ R | p K , W ) : = Ω ( μ , α ) ( p K , W ) α ( μ R A + μ ¯ R ) 2 + α μ ¯ , F ( R A , R | p K , W ) : = sup ( μ , α ) [ 0 , 1 ] 2 F ( μ , α ) ( μ R A + μ ¯ R | p K , W ) .
We next define a function serving as a lower bound of F ( R A , R | p K , W ) . For each p U Z K P sh ( p K , W ) , define
ω ˜ p ( μ ) ( z , k | u ) : = μ log p Z | U ( z | u ) p Z ( z ) + μ ¯ log 1 p K | U ( K | U ) , Ω ˜ ( μ , λ ) ( p ) : = log E p exp λ ω ˜ p ( μ ) ( Z , K | U ) , Ω ˜ ( μ , λ ) ( p K , W ) : = min p P sh ( p K , W ) Ω ˜ ( μ , λ ) ( p ) .
Furthermore, set
F ˜ ( μ , λ ) ( μ R A + μ ¯ R | p K , W ) : = Ω ˜ ( μ , λ ) ( p K , W ) λ ( μ R A + μ ¯ R ) 2 + λ ( 5 μ ) , F ˜ ( R A , R | p K , W ) : = sup λ 0 , μ [ 0 , 1 ] F ˜ ( μ , λ ) ( μ R A + μ ¯ R | p K , W ) .
We can show that the above functions satisfy the following property.
Property 2.
(a) 
The cardinality bound | U | ≤ | Z | in Q ( p K | Z ) is sufficient to describe the quantity Ω ( μ , α ) ( p K , W ) . Furthermore, the cardinality bound | U | ≤ | Z | in P sh ( p K , W ) is sufficient to describe the quantity Ω ˜ ( μ , λ ) ( p K , W ) .
(b) 
For any R A , R 0 , we have
F ( R A , R | p K , W ) F ˜ ( R A , R | p K , W ) .
(c) 
For any p = p U Z K P sh ( p Z , W ) and any ( μ , λ ) [ 0 , 1 ] 2 , we have
0 Ω ˜ ( μ , λ ) ( p ) μ log | Z | + μ ¯ log | K | .
(d) 
Fix any p = p U Z K P sh ( p K , W ) and μ [ 0 , 1 ] . For λ [ 0 , 1 ] , we define a probability distribution p ( λ ) = p U Z K ( λ ) by
p ( λ ) ( u , z , k ) : = p ( u , z , k ) exp λ ω ˜ p ( μ ) ( z , k | u ) E p exp λ ω ˜ p ( μ ) ( Z , K | U ) .
Then for λ [ 0 , 1 / 2 ] , Ω ˜ ( μ , λ ) ( p ) is twice differentiable. Furthermore, for λ [ 0 , 1 / 2 ] , we have
d d λ Ω ˜ ( μ , λ ) ( p ) = E p ( λ ) ω ˜ p ( μ ) ( Z , K | U ) , d 2 d λ 2 Ω ˜ ( μ , λ ) ( p ) = Var p ( λ ) ω ˜ p ( μ ) ( Z , K | U ) .
The second equality implies that Ω ˜ ( μ , λ ) ( p ) is a concave function of λ ≥ 0 .
(e) 
For ( μ , λ ) [ 0 , 1 ] × [ 0 , 1 / 2 ] , define
ρ ( μ , λ ) ( p K , W ) : = max ( ν , p ) [ 0 , λ ] × P sh ( p K , W ) : Ω ˜ ( μ , λ ) ( p ) = Ω ˜ ( μ , λ ) ( p K , W ) Var p ( ν ) ω ˜ p ( μ ) ( Z , K | U ) ,
and set
ρ = ρ ( p K , W ) : = max ( μ , λ ) [ 0 , 1 ] × [ 0 , 1 / 2 ] ρ ( μ , λ ) ( p K , W ) .
Then we have ρ ( p K , W ) < . Furthermore, for any ( μ , λ ) [ 0 , 1 ] × [ 0 , 1 / 2 ] , we have
Ω ˜ ( μ , λ ) ( p K , W ) λ R ( μ ) ( p K , W ) λ 2 2 ρ ( p K , W ) .
(f) 
For every τ ∈ ( 0 , ( 1 / 2 ) ρ ( p K , W ) ) , the condition ( R A , R + τ ) ∉ R ( p K , W ) implies
F ˜ ( R A , R | p K , W ) > ρ ( p K , W ) 4 · g 2 τ ρ ( p K , W ) > 0 ,
where g is the inverse function of ϑ ( a ) : = a + ( 5 / 4 ) a 2 , a 0 .
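Property 2 part (d) admits a direct numerical check. The sketch below assumes the sign convention Ω ˜ ( μ , λ ) ( p ) = − log E p [ exp ( − λ ω ˜ p ( μ ) ( Z , K | U ) ) ] with the tilted distribution p ( λ ) proportional to p exp ( − λ ω ˜ p ( μ ) ) , under which the first derivative equals the tilted expectation and the second derivative equals minus the tilted variance, giving concavity; the joint distribution used is purely illustrative.

```python
import math

# Toy joint distribution p_{UZK} on {0,1}^3 satisfying U -> Z -> K (illustrative).
p_U = [0.5, 0.5]
p_Z_given_U = [[0.8, 0.2], [0.3, 0.7]]
p_K_given_Z = [[0.9, 0.1], [0.25, 0.75]]

triples = [(u, z, k) for u in range(2) for z in range(2) for k in range(2)]
p = {(u, z, k): p_U[u] * p_Z_given_U[u][z] * p_K_given_Z[z][k]
     for (u, z, k) in triples}
p_Z = [sum(p[(u, z, k)] for u in range(2) for k in range(2)) for z in range(2)]
p_K_given_U = [[sum(p_Z_given_U[u][z] * p_K_given_Z[z][k] for z in range(2))
                for k in range(2)] for u in range(2)]

mu = 0.4
def omega(u, z, k):
    """omega~_p^{(mu)}(z,k|u) from the text (assumed sign convention)."""
    return (mu * math.log(p_Z_given_U[u][z] / p_Z[z])
            + (1 - mu) * math.log(1.0 / p_K_given_U[u][k]))

def Omega(lam):
    """Omega~^{(mu,lam)}(p) = -log E_p[exp(-lam * omega)] (assumed convention)."""
    return -math.log(sum(p[t] * math.exp(-lam * omega(*t)) for t in triples))

def tilted(lam):
    """The tilted distribution p^{(lam)}, proportional to p * exp(-lam * omega)."""
    w = {t: p[t] * math.exp(-lam * omega(*t)) for t in triples}
    s = sum(w.values())
    return {t: w[t] / s for t in w}

lam, h = 0.25, 1e-5
q = tilted(lam)
mean_q = sum(q[t] * omega(*t) for t in triples)
var_q = sum(q[t] * (omega(*t) - mean_q) ** 2 for t in triples)

d1 = (Omega(lam + h) - Omega(lam - h)) / (2 * h)                  # first derivative
d2 = (Omega(lam + h) - 2 * Omega(lam) + Omega(lam - h)) / h ** 2  # second derivative

assert abs(d1 - mean_q) < 1e-6   # derivative = tilted expectation of omega~
assert abs(d2 + var_q) < 1e-3    # second derivative = -Var  =>  concavity in lam
```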
Proof of this property is found in Oohama [9] (extended version). On the upper bound of Φ D ( n ) ( R A , R | p K n W n ) , we have the following:
Proposition 4.
For any n ≥ 1 / R , we have
Φ D ( n ) ( R A , R | p K n W n ) ≤ 5 n R e − n F ( R A , R | p K , W ) .
Proof of this proposition is given in the next section. Proposition 4 has a close connection with the one helper source coding problem, which is explained as Problem 3 in the previous section. In fact, in the proof we use the explicit lower bound that Oohama [9] obtained for the optimal exponent of the exponential decay of p c , A ( n ) φ ( n ) , φ A ( n ) , ψ A ( n ) | p K n , W n for ( R A , R ) ∉ R AKW ( p K , W ) . By Propositions 3 and 4, we obtain our main result shown below.
Theorem 4.
For any R A , R > 0 and any ( p K , W ) , there exists a sequence of mappings { ( φ ( n ) , ψ ( n ) ) } n = 1 such that for any p X P ( X ) , we have
R − 1 n ≤ 1 n log | X m | = m n log | X | ≤ R , p e ( ϕ ( n ) , ψ ( n ) | p X n ) ≤ e − n [ E ( R | p X ) − δ 1 , n ]
and for any eavesdropper A with φ A satisfying φ A ( n ) F A ( n ) ( R A ) , we have
Δ ( n ) ( φ ( n ) , φ A ( n ) | p X n , p K n , W n ) ≤ e − n [ F ( R A , R | p K , W ) − δ 2 , n ] ,
where δ i , n , i = 1 , 2 are defined by
δ 1 , n : = 1 n log [ e ( n + 1 ) 2 | X | { ( n + 1 ) | X | + 1 } ] , δ 2 , n : = 1 n log [ 5 n R { ( n + 1 ) | X | + 1 } ] .
Note that for i = 1 , 2 , δ i , n 0 as n .
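Since δ 1 , n and δ 2 , n are normalized logarithms of quantities that grow only polynomially in n, their convergence to zero can be verified directly from the definitions. A small sketch with the illustrative values | X | = 2 and R = 1 (natural logarithms):

```python
import math

def delta_1(n, alphabet_size):
    # delta_{1,n} = (1/n) log[ e (n+1)^{2|X|} ((n+1)^{|X|} + 1) ]
    return (1.0 + 2 * alphabet_size * math.log(n + 1)
            + math.log((n + 1) ** alphabet_size + 1)) / n

def delta_2(n, alphabet_size, R):
    # delta_{2,n} = (1/n) log[ 5 n R ((n+1)^{|X|} + 1) ]
    return (math.log(5 * n * R) + math.log((n + 1) ** alphabet_size + 1)) / n

ns = [10, 100, 1000, 10000]
d1s = [delta_1(n, 2) for n in ns]
d2s = [delta_2(n, 2, R=1.0) for n in ns]

assert all(a > b for a, b in zip(d1s, d1s[1:]))  # strictly decreasing in n
assert all(a > b for a, b in zip(d2s, d2s[1:]))
assert d1s[-1] < 0.01 and d2s[-1] < 0.01          # both tend to zero
```

The exponents in Theorem 4 therefore approach E ( R | p X ) and F ( R A , R | p K , W ) as n grows.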
The functions E ( R | p X ) and F ( R A , R | p K , W ) take positive values if and only if ( R A , R ) belongs to the set
{ R > H ( X ) } R c ( p K , W ) : = R Sys ( in ) ( p X , p K , W ) .
Thus, by Theorem 4, under ( R A , R ) R Sys ( in ) ( p X , p K , W ) , we have the following:
  • In terms of reliability, p e ( ϕ ( n ) , ψ ( n ) | p X n ) goes to zero exponentially as n tends to infinity, and its exponent is lower bounded by the function E ( R | p X ) .
  • In terms of security, for any φ A satisfying φ A ( n ) F A ( n ) ( R A ) , the information leakage Δ ( n ) ( φ ( n ) , φ A ( n ) | p X n , p K n , W n ) on X n goes to zero exponentially as n tends to infinity, and its exponent is lower bounded by the function F ( R A , R | p K , W ) .
  • The code that attains the exponent function E ( R | p X ) is a universal code that depends only on R and not on the value of the distribution p X .
Define
D Sys ( in ) ( p X , p K , W ) : = { ( R A , R , E ( R | p X ) , F ( R A , R | p K , W ) ) : ( R A , R ) ∈ R Sys ( in ) ( p X , p K , W ) } .
From Theorem 4, we immediately obtain the following corollary.
Corollary 1.
R Sys ( in ) ( p X , p K , W ) R Sys ( p X , p K , W ) , D Sys ( in ) ( p X , p K , W ) D Sys ( p X , p K , W ) .
A typical shape of { R > H ( X ) } R c ( p K , W ) is shown in Figure 8.

6. Proofs of the Results

In this section, we prove our main theorem, i.e., Theorem 4.

6.1. Types of Sequences and Their Properties

In this subsection, we present basic results on types of sequences. These results are basic tools for our analysis of several bounds related to the error probability of decoding and to security.
Definition 5.
For any n-sequence x n = x 1 x 2 ⋯ x n ∈ X n , n ( x | x n ) denotes the number of t such that x t = x . The empirical distribution { n ( x | x n ) / n } x ∈ X of the components of x n is called the type of x n and is denoted by P x n . The set that consists of all the types on X is denoted by P n ( X ) . Let X ¯ denote an arbitrary random variable whose distribution P X ¯ belongs to P n ( X ) . For p X ¯ ∈ P n ( X ) , set T X ¯ n : = { x n : P x n = p X ¯ } .
For sets of types and joint types, the following lemma holds. For details of the proof, see Csiszár and Körner [21].
Lemma 4.
(a)
| P n ( X ) | ( n + 1 ) | X | .
(b)
For P X ¯ P n ( X ) ,
( n + 1 ) | X | e n H ( X ¯ ) | T X ¯ n | e n H ( X ¯ ) .
(c)
For x n T X ¯ n ,
p X n ( x n ) = e n [ H ( X ¯ ) + D ( p X ¯ | | p X ) ] .
By Lemma 4 parts (b) and (c), we immediately obtain the following lemma:
Lemma 5.
For p X ¯ P n ( X ) ,
p X n ( T X ¯ n ) e n D ( p X ¯ | | p X ) .
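For small block lengths, the bounds of Lemma 4 can be verified by brute-force enumeration. A sketch for the binary alphabet (the i.i.d. source distribution p X is illustrative):

```python
import math
from collections import Counter
from itertools import product

X, n = (0, 1), 8
p_X = (0.3, 0.7)          # illustrative i.i.d. source distribution

def H(dist):
    """Entropy in nats."""
    return -sum(p * math.log(p) for p in dist if p > 0)

def D(t, q):
    """Divergence D(t || q) in nats."""
    return sum(ti * math.log(ti / qi) for ti, qi in zip(t, q) if ti > 0)

def type_of(xn):
    c = Counter(xn)
    return tuple(c[x] / n for x in X)

# Group all of X^n into type classes T_Xbar^n.
classes = Counter(type_of(xn) for xn in product(X, repeat=n))

# Lemma 4 part (a): |P_n(X)| <= (n+1)^{|X|}.
assert len(classes) <= (n + 1) ** len(X)

# Lemma 4 part (b): (n+1)^{-|X|} e^{n H(Xbar)} <= |T_Xbar^n| <= e^{n H(Xbar)}.
for t, size in classes.items():
    assert (n + 1) ** (-len(X)) * math.exp(n * H(t)) <= size <= math.exp(n * H(t))

# Lemma 4 part (c): p_{X^n}(x^n) = e^{-n[H(Xbar) + D(p_Xbar || p_X)]} on T_Xbar^n.
xn = (0, 0, 1, 1, 1, 0, 1, 1)
t = type_of(xn)
prob = math.prod(p_X[x] for x in xn)
assert math.isclose(prob, math.exp(-n * (H(t) + D(t, p_X))))
```

Lemma 5 then follows by combining parts (b) and (c), since p X n ( T X ¯ n ) = | T X ¯ n | p X n ( x n ) for any x n in the class.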

6.2. Upper Bounds of p e ( ϕ ( n ) , ψ ( n ) | p X n ) , and  Δ n ( φ ( n ) , φ A ( n ) | p X n , p K n , W n )

In this subsection, we evaluate upper bounds of p e ( ϕ ( n ) , ψ ( n ) | p X n ) and Δ n ( φ ( n ) , φ A ( n ) | p X n , p K n , W n ) . For p e ( ϕ ( n ) , ψ ( n ) | p X n ) , we derive an upper bound that can be characterized with a quantity depending on ( ϕ ( n ) , ψ ( n ) ) and type P x n of sequences x n X n . We first evaluate p e ( ϕ ( n ) , ψ ( n ) | p X n ) . For x n X n and p X ¯ P n ( X ) , we define the following functions:
Ξ x n ( ϕ ( n ) , ψ ( n ) ) : = 1 if ψ ( n ) ϕ ( n ) ( x n ) x n , 0 otherwise , Ξ X ¯ ( ϕ ( n ) , ψ ( n ) ) : = 1 | T X ¯ n | x n T X ¯ n Ξ x n ( ϕ ( n ) , ψ ( n ) ) .
Then we have the following lemma.
Lemma 6.
In the proposed system, for any pair of ( ϕ ( n ) , ψ ( n ) ) , we have
p e ( ϕ ( n ) , ψ ( n ) | p X n ) p X ¯ P n ( X ) Ξ X ¯ ( ϕ ( n ) , ψ ( n ) ) e n D ( p X ¯ | | p X ) .
Proof. 
We have the following chain of inequalities:
p e ( ϕ ( n ) , ψ ( n ) | p X n ) = ( a ) p X ¯ P n ( X ) x n T X ¯ n Ξ x n ( ϕ ( n ) , ψ ( n ) ) p X n ( x n ) = p X ¯ P n ( X ) 1 | T X ¯ n | x n T X ¯ n Ξ x n ( ϕ ( n ) , ψ ( n ) ) | T X ¯ n | p X n ( x n ) = ( b ) p X ¯ P n ( X ) 1 | T X ¯ n | x n T X ¯ n Ξ x n ( ϕ ( n ) , ψ ( n ) ) p X n ( T X ¯ n ) = ( c ) p X ¯ P n ( X ) Ξ X ¯ ( ϕ ( n ) , ψ ( n ) ) p X n ( T X ¯ n ) ( d ) p X ¯ P n ( X ) Ξ X ¯ ( ϕ ( n ) , ψ ( n ) ) e n D ( p X ¯ | | p X ) .
Step (a) follows from the definition of Ξ x n ( ϕ ( n ) , ψ ( n ) ) . Step (b) follows from the fact that the probabilities p X n ( x n ) , x n ∈ T X ¯ n , take an identical value. Step (c) follows from the definition of Ξ X ¯ ( ϕ ( n ) , ψ ( n ) ) . Step (d) follows from Lemma 5. □

6.3. Random Coding Arguments

We construct a linear encoder ϕ ( n ) and an affine encoder φ ( n ) using the random coding method. For the decoder ψ ( n ) , we propose the minimum entropy decoder used in Csiszár [6] and Oohama and Han [22].
Random Construction of Affine Encoders: We first choose m such that
m : = ⌊ n R / log | X | ⌋ ,
where ⌊ a ⌋ stands for the integer part of a real number a. It is obvious that
R − 1 n ≤ m n log | X | ≤ R .
By definition (2) of ϕ ( n ) , we have that for x n X n ,
ϕ ( n ) ( x n ) = x n A ,
where A is a matrix with n rows and m columns. By definition (3) of φ ( n ) , we have that for k n X n ,
φ ( n ) ( k n ) = k n A + b m ,
where b m is a row vector with m components. Entries of A and b m are from the finite field X . These entries are selected at random, independently of each other, and with a uniform distribution. The randomly constructed linear encoder ϕ ( n ) and affine encoder φ ( n ) have the three properties shown in the following lemma.
Lemma 7 (Properties of Linear/Affine Encoders).
(a) 
For any x n , v n X n with x n v n , we have
Pr [ ϕ ( n ) ( x n ) = ϕ ( n ) ( v n ) ] = Pr [ ( x n v n ) A = 0 m ] = | X | m .
(b) 
For any s n X n and for any s ˜ m X m , we have
Pr [ φ ( n ) ( s n ) = s ˜ m ] = Pr [ s n A b m = s ˜ m ] = | X | m .
(c) 
For any s n , t n X n with s n t n , and for any s ˜ m X m , we have
Pr [ φ ( n ) ( s n ) = φ ( n ) ( t n ) = s ˜ m ] = Pr [ s n A b m = t n A b m = s ˜ m ] = | X | 2 m .
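Before turning to the proof, the random construction can be made concrete. The sketch below works over the binary field (the smallest admissible X ): it checks one form of the homomorphic property, ϕ ( n ) ( x n ) + φ ( n ) ( k n ) = φ ( n ) ( x n + k n ) , which motivates the restriction to affine encoders, and it verifies the uniformity of Lemma 7 part (b) exhaustively for a tiny ( n , m ) .

```python
import random
from itertools import product

random.seed(0)
n, m = 12, 5   # illustrative block lengths

# Random matrix A (n rows, m columns) and vector b^m over GF(2).
A = [[random.randrange(2) for _ in range(m)] for _ in range(n)]
b = [random.randrange(2) for _ in range(m)]

def phi(xn):
    """Linear encoder  phi(x^n) = x^n A  (all arithmetic mod 2)."""
    return tuple(sum(xn[l] * A[l][j] for l in range(n)) % 2 for j in range(m))

def varphi(kn):
    """Affine encoder  varphi(k^n) = k^n A + b^m."""
    return tuple((y + bj) % 2 for y, bj in zip(phi(kn), b))

# Homomorphic property: phi(x^n) + varphi(k^n) = varphi(x^n + k^n) mod 2.
for _ in range(200):
    x = [random.randrange(2) for _ in range(n)]
    k = [random.randrange(2) for _ in range(n)]
    lhs = tuple((u + v) % 2 for u, v in zip(phi(x), varphi(k)))
    assert lhs == varphi([(xi + ki) % 2 for xi, ki in zip(x, k)])

# Lemma 7 part (b), checked exhaustively for a tiny (n, m): over all choices
# of (A, b^m), the output is uniform, i.e. Pr[varphi(s^n) = s~^m] = |X|^{-m}.
n2, m2 = 2, 2
s, s_target = (1, 0), (1, 1)
hits = total = 0
for bits in product(range(2), repeat=n2 * m2 + m2):
    A2 = [bits[i * m2:(i + 1) * m2] for i in range(n2)]
    b2 = bits[n2 * m2:]
    out = tuple((sum(s[l] * A2[l][j] for l in range(n2)) + b2[j]) % 2
                for j in range(m2))
    total += 1
    hits += (out == s_target)
assert hits / total == 2 ** (-m2)
```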
Proof of this lemma is given in Appendix B. We next define the decoder function ψ ( n ) : X m X n . To this end, we define the following quantities.
Definition 6.
For x n X n , we denote the entropy calculated from the type P x n by H ( x n ) . In other words, for a type P X ¯ P n ( X ) such that P X ¯ = P x n , we define H ( x n ) = H ( X ¯ ) .
Minimum Entropy Decoder: For ϕ ( n ) ( x n ) = x ˜ m , we define the decoder function ψ ( n ) : X m X n as follows:
ψ ( n ) ( x ˜ m ) : = x ^ n if   ϕ ( n ) ( x ^ n ) = x ˜ m , and   H ( x ^ n ) < H ( x ˇ n ) for   all   x ˇ n such   that   ϕ ( n ) ( x ˇ n ) = x ˜ m , and   x ˇ n x ^ n , arbitrary if   there   is   no   such   x ^ n X n .
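A brute-force sketch of the minimum entropy decoder over the binary field (parameters are illustrative; ties are reported as None, standing in for the "arbitrary" clause of the definition). The final loop checks the structural property exploited in the error analysis of Appendix C: a decoding error can only output a sequence whose type entropy does not exceed that of the true source word.

```python
import math
import random
from collections import Counter
from itertools import product

random.seed(1)
n, m = 6, 4    # illustrative block lengths over GF(2)
A = [[random.randrange(2) for _ in range(m)] for _ in range(n)]

def phi(xn):
    """Linear encoder over GF(2)."""
    return tuple(sum(xn[l] * A[l][j] for l in range(n)) % 2 for j in range(m))

def type_entropy(xn):
    """H(x^n): entropy (in nats) of the type P_{x^n}."""
    return -sum((v / n) * math.log(v / n) for v in Counter(xn).values())

def psi(xm):
    """Minimum entropy decoder: the preimage of x~^m with strictly smallest
    type entropy; None stands in for the 'arbitrary' case (tie or no preimage)."""
    pre = sorted((xn for xn in product(range(2), repeat=n) if phi(xn) == xm),
                 key=type_entropy)
    if not pre:
        return None
    if len(pre) > 1 and math.isclose(type_entropy(pre[0]), type_entropy(pre[1])):
        return None
    return pre[0]

# A decoding error can only move to a sequence with no larger type entropy --
# exactly the event counted by the set B(x^n) in Appendix C.
for xn in product(range(2), repeat=n):
    d = psi(phi(xn))
    assert d is None or type_entropy(d) <= type_entropy(xn) + 1e-12
```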
Error Probability Bound: In the following arguments, we let expectations based on the random choice of the affine encoder φ ( n ) be denoted by E [ · ] . Define
Λ X ¯ ( R ) : = e − n [ R − H ( X ¯ ) ] + .
Then we have the following lemma.
Lemma 8.
For any n and for any P X ¯ P n ( X ) ,
E Ξ X ¯ ( ϕ ( n ) , ψ ( n ) ) ≤ e ( n + 1 ) | X | Λ X ¯ ( R ) .
Proof of this lemma is given in Appendix C.
Estimation of Approximation Error: Define
Θ ( R , φ A ( n ) | p K n , W n ) : = ( a , k n ) M A ( n ) × X n p M A ( n ) K n ( a , k n ) log 1 + ( e n R 1 ) p K n | M A ( n ) ( k n | a ) .
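The quantity Θ ( R , φ A ( n ) | p K n , W n ) can be computed directly for small alphabets. The sketch below uses a hypothetical joint distribution of ( M A ( n ) , K n ) and checks the elementary termwise bound 0 ≤ Θ ≤ n R used later in the proof of Lemma 11, together with monotonicity in n R .

```python
import math

# Hypothetical joint distribution of (M_A^(n), K^n): 2 storage values, 4 key values.
p_joint = {
    (0, 0): 0.30, (0, 1): 0.10, (0, 2): 0.05, (0, 3): 0.05,
    (1, 0): 0.05, (1, 1): 0.05, (1, 2): 0.15, (1, 3): 0.25,
}
p_a = {a: sum(v for (aa, _), v in p_joint.items() if aa == a) for a in (0, 1)}

def Theta(nR):
    """Theta(R, phi_A | p) = E log[1 + (e^{nR} - 1) p_{K^n|M_A}(K^n|M_A)]."""
    total = 0.0
    for (a, k), v in p_joint.items():
        p_cond = v / p_a[a]                       # p_{K^n|M_A^(n)}(k|a)
        total += v * math.log(1.0 + (math.exp(nR) - 1.0) * p_cond)
    return total

# Termwise, log(1 + (e^{nR}-1) p) <= log(e^{nR}) = nR, and each term is >= 0.
for nR in (0.5, 1.0, 2.0, 4.0):
    assert 0.0 <= Theta(nR) <= nR + 1e-12

# Theta is monotone in nR: compressing a longer key worsens the divergence bound.
assert Theta(0.5) <= Theta(1.0) <= Theta(2.0)
```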
Then we have the following lemma.
Lemma 9.
For any n , m satisfying ( m / n ) log | X | R , we have
E D p K ˜ m | M A ( n ) p V m p M A ( n ) Θ ( R , φ A ( n ) | p K n , W n ) .
Proof of this lemma is given in Appendix D. From the bound (28) in Lemma 9, we know that the quantity Θ ( R , φ A ( n ) | p K n , W n ) serves as an upper bound of the ensemble average of the conditional divergence D ( p K ˜ m | M A ( n ) | | p V m | p M A ( n ) ) . Hayashi [23] obtained the same upper bound on the ensemble average of the conditional divergence for an ensemble of universal₂ functions. In this paper, we prove the bound (28) for an ensemble of affine encoders. To derive this bound, we need Lemma 7 parts (b) and (c), the two important properties that the class of random affine encoders satisfies. From Lemmas 1 and 9, we have the following corollary.
Corollary 2.
E Δ n ( φ ( n ) , φ A ( n ) | p X n , p K n , W n ) Θ ( R , φ A ( n ) | p K n , W n ) .
Existence of Good Universal Code ( φ ( n ) , ψ ( n ) ) :
From Lemma 8 and Corollary 2, we have the following lemma stating the existence of a good universal code ( φ ( n ) , ψ ( n ) ) .
Lemma 10.
There exists at least one deterministic code ( φ ( n ) , ψ ( n ) ) satisfying ( m / n ) log | X | R , such that for any p X ¯ P n ( X ) ,
Ξ X ¯ ( ϕ ( n ) , ψ ( n ) ) e ( n + 1 ) | X | { ( n + 1 ) | X | + 1 } Λ X ¯ ( R ) .
Furthermore, for any φ A ( n ) F A ( n ) ( R A ) , we have
Δ n ( φ ( n ) , φ A ( n ) | p X n , p K n , W n ) { ( n + 1 ) | X | + 1 } Θ ( R , φ A ( n ) | p K n , W n ) .
Proof. 
We have the following chain of inequalities:
E p X ¯ P n ( X ) Ξ X ¯ ( ϕ ( n ) , ψ ( n ) ) e ( n + 1 ) | X | Λ X ¯ ( R ) + Δ n ( φ ( n ) , φ A ( n ) | p X n , p K n , W n ) Θ ( R , φ A ( n ) | p K n , W n ) = p X ¯ P n ( X ) E Ξ X ¯ ( ϕ ( n ) , ψ ( n ) ) e ( n + 1 ) | X | Λ X ¯ ( R ) + E Δ n ( φ ( n ) , φ A ( n ) | p X n , p K n , W n ) Θ ( R , φ A ( n ) | p K n , W n ) ( a ) p X ¯ P n ( X ) 1 + 1 = | P n ( X ) | + 1 ( b ) ( n + 1 ) | X | + 1 .
Step (a) follows from Lemma 8 and Corollary 2. Step (b) follows from Lemma 4 part (a). Hence, there exists at least one deterministic code ( φ ( n ) , ψ ( n ) ) such that
p X ¯ P n ( X ) Ξ X ¯ ( ϕ ( n ) , ψ ( n ) ) e ( n + 1 ) | X | Λ X ¯ ( R ) + Δ n ( φ ( n ) , φ A ( n ) | p X n , p K n , W n ) Θ ( R , φ A ( n ) | p K n , W n ) ( n + 1 ) | X | + 1 ,
from which we have that
Ξ X ¯ ( ϕ ( n ) , ψ ( n ) ) e ( n + 1 ) | X | Λ X ¯ ( R ) ( n + 1 ) | X | + 1 ,
for any p X ¯ P n ( X ) . Furthermore, we have that for any φ A ( n ) F A ( n ) ( R A ) ,
Δ n ( φ ( n ) , φ A ( n ) | p X n , p K n , W n ) Θ ( R , φ A ( n ) | p K n , W n ) ( n + 1 ) | X | + 1 ,
completing the proof. □
Proposition 5.
For any R A , R > 0 and any ( p K , W ) , there exists a sequence of mappings { ( φ ( n ) , ψ ( n ) ) } n = 1 such that for any p X P ( X ) , we have
R − 1 n ≤ 1 n log | X m | = m n log | X | ≤ R , p e ( ϕ ( n ) , ψ ( n ) | p X n ) ≤ e ( n + 1 ) 2 | X | { ( n + 1 ) | X | + 1 } e − n [ E ( R | p X ) ]
and for any eavesdropper A with φ A satisfying φ A ( n ) F A ( n ) ( R A ) , we have
Δ ( n ) ( φ ( n ) , φ A ( n ) | p X n , p K n , W n ) ≤ { ( n + 1 ) | X | + 1 } Θ ( R , φ A ( n ) | p K n , W n ) .
Proof. 
By Lemma 10, there exists ( φ ( n ) , ψ ( n ) ) satisfying ( m / n ) log | X | R such that for any p X ¯ P n ( X ) ,
Ξ X ¯ ( ϕ ( n ) , ψ ( n ) ) e ( n + 1 ) | X | { ( n + 1 ) | X | + 1 } Λ X ¯ ( R ) .
Furthermore, for any φ A ( n ) F A ( n ) ( R A ) ,
Δ n ( φ ( n ) , φ A ( n ) | p X n , p K n , W n ) { ( n + 1 ) | X | + 1 } Θ ( R , φ A ( n ) | p K n , W n ) .
The bound (30) in Proposition 5 has already been proven in (32). Hence, it suffices to prove the bound (29) in Proposition 5 to complete the proof. On an upper bound of p e ( ϕ ( n ) , ψ ( n ) | p X n ) , we have the following chain of inequalities:
p e ( ϕ ( n ) , ψ ( n ) | p X n ) ( a ) e ( n + 1 ) | X | { ( n + 1 ) | X | + 1 } p X ¯ P n ( X ) Λ X ¯ ( R ) e n D ( p X ¯ | | p X ) e ( n + 1 ) | X | { ( n + 1 ) | X | + 1 } | P n ( X ) | e n [ E ( R | p X ) ] ( b ) e ( n + 1 ) 2 | X | { ( n + 1 ) | X | + 1 } e n E ( R | p X ) .
Step (a) follows from Lemma 6 and (31). Step (b) follows from Lemma 4 part (a). □

6.4. Explicit Upper Bound of Θ ( R , φ A ( n ) | p K n , W n )

In this subsection, we derive an explicit upper bound of Θ ( R , φ A ( n ) | p K n , W n ) that holds for any eavesdropper A with φ A satisfying φ A ( n ) F A ( n ) ( R A ) . Here we recall the following definitions:
η ( n ) = η ( n ) ( R | p K n , W n ) : = p M A ( n ) Z n K n { R 1 n log 1 p K n | M A ( n ) ( K n | M A ( n ) ) η , Φ D , η ( n ) ( R A , R | p K n W n ) : = max φ A ( n ) F ( n ) ( R A ) n R η ( n ) ( R | p K n , W n ) + e n η , Φ D ( n ) ( R A , R | p K n W n ) : = inf η > 0 Φ D , η ( n ) ( R A , R | p K n W n ) .
Then we have the following lemma.
Lemma 11.
For any η > 0 and for any eavesdropper A with φ A satisfying φ A ( n ) F A ( n ) ( R A ) , we have
Θ ( R , φ A ( n ) | p K n , W n ) Φ D , η ( n ) ( R A , R | p K n W n ) ,
which implies that
Θ ( R , φ A ( n ) | p K n , W n ) Φ D ( n ) ( R A , R | p K n W n ) .
Proof. 
We first observe that
Θ ( R , φ A ( n ) | p K n , W n ) = E log 1 + ( e n R 1 ) p K n | M A ( n ) ( K n | M A ( n ) ) .
We further observe the following:
R < 1 n log 1 p K n | M A ( n ) ( K n | M A ( n ) ) η e n R p K n | M A ( n ) ( K n | M A ( n ) ) < e n η log 1 + e n R p K n | M A ( n ) ( K n | M A ( n ) ) log 1 + e n η ( a ) log 1 + e n R p K n | M A ( n ) ( K n | M A ( n ) ) e n η log 1 + ( e n R 1 ) p K n | M A ( n ) ( K n | M A ( n ) ) e n η .
Step (a) follows from log ( 1 + a ) a . We also note that
log 1 + ( e n R 1 ) p K n | M A ( n ) ( K n | M A ( n ) ) log [ e n R ] = n R .
From (35), (36), and (37) we have the bound (33) in Lemma 11. □
Proof of Proposition 3:
This proposition immediately follows from Proposition 5 and Lemma 11. □
For the upper bound of η ( n ) , we have the following lemma.
Lemma 12.
For any η > 0 and for any eavesdropper A with φ A satisfying φ A ( n ) ∈ F A ( n ) ( R A ) , we have η ( n ) ≤ ˜ η ( n ) + 3 e − n η , where
˜ η ( n ) : = p M A ( n ) Z n K n {
0 1 n log q ^ M A ( n ) Z n K n ( M A ( n ) , Z n , K n ) p M A ( n ) Z n K n ( M A ( n ) , Z n , K n ) η ,
0 1 n log q Z n ( Z n ) p Z n ( Z n ) η ,
R A 1 n log p Z n | M A ( n ) ( Z n | M A ( n ) ) p Z n ( Z n ) η ,
R 1 n log 1 p K n | M A ( n ) ( K n | M A ( n ) ) η } .
The probability distributions appearing in the two inequalities (38) and (39) in the right member of (40) can be chosen arbitrarily. In (38), we can choose any probability distribution q ^ M A ( n ) Z n K n on M A ( n ) × Z n × X n . In (39), we can choose any distribution q Z n on Z n .
Proof of this lemma is given in Appendix E.
Proof of Proposition 4:
The claim of Proposition 4 is that for n ≥ 1 / R ,
Φ D ( n ) ( R A , R | p K n W n ) ≤ 5 n R e − n F ( R A , R | p K , W ) .
By Lemma 12 and the definition of Φ D , η ( n ) ( R A , R | p K n W n ) , we have that for n ≥ 1 / R ,
Φ D , η ( n ) ( R A , R | p K n W n ) ≤ n R ( ˜ η ( n ) + 4 e − n η ) .
The quantity ˜ η ( n ) + 4 e − n η is the same as the upper bound on the correct probability of decoding for the one helper source coding problem in Lemma 1 of Oohama [9] (extended version). In a manner similar to the derivation of the exponential upper bound of the correct probability of decoding for the one helper source coding problem, we can prove that for any φ A ( n ) ∈ F A ( n ) ( R A ) and for some η * = η * ( n , R A , R ) , we have
˜ η * ( n ) + 4 e − n η * ≤ 5 e − n F ( R A , R | p K , W ) .
From (42), (43), and the definition of Φ D ( n ) ( R A , R | p K n W n ) , we have (41). □

7. Conclusions

In this paper, we have proposed a novel security model for analyzing the security of Shannon cipher systems against an adversary that not only eavesdrops on the public communication channel to collect ciphertexts but also obtains, through side-channel attacks, physical information leaked by the devices on which the cipher system is implemented. We have also presented a countermeasure against such an adversary in the case of one-time pad encryption by using an affine encoder with certain properties. The main distinguishing feature of our countermeasure is that it is independent of the characteristics and types of physical information leaked from the devices.

Author Contributions

Both the first and the second authors contributed to the writing of the original draft of this paper. Other contributions of the first author include (but are not limited to): the conceptualization of the research goals and aims, the validation of the results, the visualization/presentation of the works, and the review and editing. Other contributions of the second author include (but are not limited to): the conceptualization of the ideas, research goals, and aims, the formal analysis, and the supervision.

Funding

This research was funded by Japan Society for the Promotion of Science (JSPS) Kiban (B) 18H01438 and Japan Society for the Promotion of Science (JSPS) Kiban (C) 18K11292.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Appendix A. Correct Probability of Decoding and Variational Distance

In this appendix, we prove Lemma 3.
For a M A ( n ) , we set
D ( a ) = { k ˜ m : k ˜ m = φ ( n ) ( k n )   and   ψ A ( n ) ( k ˜ m , a ) = k n   for   some   k n ∈ X n } .
Then we have the following chain of inequalities:
d p V m × p M A ( n ) , p K ˜ m M A ( n ) = a M A ( n ) p M A ( n ) ( a ) k ˜ m X m p K ˜ m | M A ( n ) ( k ˜ m | a ) 1 | X | m a M A ( n ) p M A ( n ) ( a ) p K ˜ m | M A ( n ) D ( a ) | a | D ( a ) | | X | m = a M A ( n ) p M A ( n ) ( a ) p K ˜ m | M A ( n ) D ( a ) | a 1 | X | m = p c , A ( n ) φ ( n ) , φ A ( n ) , ψ A ( n ) | p K n , W n 1 | X | m ,
completing the proof. □

Appendix B. Proof of Lemma 7

Let a l m be the l-th row vector of the matrix A. For each l = 1 , 2 , … , n , let A l m ∈ X m be a random vector that represents the randomness of the choice of a l m ∈ X m . Let B m ∈ X m be a random vector that represents the randomness of the choice of b m ∈ X m . We first prove part (a). Without loss of generality, we may assume x 1 ≠ v 1 . Under this assumption, we have the following:
( x n v n ) A = 0 m l = 1 n ( x l v l ) a l m = 0 m a 1 m = l = 2 n v l x l x 1 v 1 a l m .
Computing Pr [ ϕ ( x n ) = ϕ ( v n ) ] , we have the following chain of equalities:
Pr [ ϕ ( x n ) = ϕ ( v n ) ] = Pr [ ( x n v n ) A = 0 m ] = ( a ) Pr a 1 m = l = 2 n v l x l x 1 v 1 a l m = ( b ) a l m l = 2 n X ( n 1 ) m l = 2 n P A l m ( a l m ) P A 1 m l = 2 n v l x l x 1 v 1 a l m = | X | m a l m l = 2 n X ( n 1 ) m l = 2 n P A l m ( a l m ) = | X | m .
Step (a) follows from (A1). Step (b) follows from the fact that the n random vectors A l m , l = 1 , 2 , … , n , are independent. We next prove part (b). We have the following:
s n A b m = s ˜ m b m = s ˜ m l = 1 n s l a l m .
Computing Pr [ s n A b m = s ˜ m ] , we have the following chain of equalities:
Pr [ s n A b m = s ˜ m ] = ( a ) Pr b m = s ˜ m l = 1 n s l a l m = ( b ) a l m l = 1 n X n m l = 1 n P A l m ( a l m ) P B m s ˜ m l = 1 n s l a l m = ( c ) | X | m a l m l = 1 n X n m l = 1 n P A l m ( a l m ) = | X | m .
Step (a) follows from (A2). Step (b) follows from the fact that the n random vectors A l m , l = 1 , 2 , … , n , and B m are independent. We finally prove part (c). We first observe that s n ≠ t n is equivalent to s i ≠ t i   for   some   i ∈ { 1 , 2 , … , n } . Without loss of generality, we may assume that s 1 ≠ t 1 . Under this assumption, we have the following:
s n A b m = t n A b m = s ˜ m ( s n t n ) A = 0 , b m = s ˜ m l = 1 n s l a l m a 1 m = l = 2 n t l s l s 1 t 1 a l m , b m = s ˜ m l = 1 n s l a l m a 1 m = l = 2 n t l s l s 1 t 1 a l m , b m = s ˜ m l = 2 n t 1 s l s 1 t l s 1 t 1 a l m .
Computing Pr [ s n A b m = t n A b m = s ˜ m ] , we have the following chain of equalities:
Pr [ s n A b m = t n A b m = s ˜ m ] = ( a ) Pr a 1 m = l = 2 n t l s l s 1 t 1 a l m b m = s ˜ m l = 2 n t 1 s l s 1 t l s 1 t 1 a l m = ( b ) a l m l = 2 n X ( n 1 ) m l = 2 n P A l m ( a l m ) P A 1 m l = 2 n t l s l s 1 t 1 a l m P B m s ˜ m l = 2 n t 1 s l s 1 t l s 1 t 1 a l m = | X | 2 m a l m l = 2 n X ( n 1 ) m l = 2 n P A l m ( a l m ) = | X | 2 m .
Step (a) follows from (A3). Step (b) follows from the independent property on A l m , l = 1 , 2 , , n and B m .  □
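Lemma 7 can be sanity-checked by exhaustive enumeration on a toy instance. The sketch below is our illustration (the choices $q = 2$, $n = 3$, $m = 2$ and the particular $x^n$, $v^n$, $s^n$, $t^n$, $\tilde s^m$ are arbitrary); it enumerates every matrix $A$ and vector $b$ over GF(2) and verifies the collision probabilities $|X|^{-m}$ and $|X|^{-2m}$:

```python
from itertools import product

q, n, m = 2, 3, 2          # field GF(2); A is n x m, b has length m

def affine(x, A, b):       # x*A + b over GF(q)
    return tuple((sum(x[i] * A[i][j] for i in range(n)) + b[j]) % q
                 for j in range(m))

matrices = list(product(product(range(q), repeat=m), repeat=n))
vectors = list(product(range(q), repeat=m))

x, v = (1, 0, 1), (0, 0, 1)                 # x != v, for part (a)
s, t, s_t = (1, 1, 0), (0, 1, 1), (1, 0)    # s != t and a target s~, parts (b), (c)

# Part (a): Pr[phi(x) = phi(v)]; b cancels, so set b = 0
hits_a = sum(affine(x, A, (0,) * m) == affine(v, A, (0,) * m)
             for A in matrices) / len(matrices)
# Part (b): Pr[sA + b = s~]
hits_b = sum(affine(s, A, b) == s_t
             for A in matrices for b in vectors) / (len(matrices) * len(vectors))
# Part (c): Pr[sA + b = tA + b = s~]
hits_c = sum(affine(s, A, b) == s_t and affine(t, A, b) == s_t
             for A in matrices for b in vectors) / (len(matrices) * len(vectors))

print(hits_a, hits_b, hits_c)   # expected: 1/4, 1/4, 1/16
```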

Appendix C. Proof of Lemma 8

In this appendix, we provide the proof of Lemma 8.
For simplicity of notation, we write $M = |X|^m$. For $x^n \in X^n$ we set

$$B(x^n) := \bigl\{\check x^n \in X^n : H(\check x^n) \le H(x^n)\bigr\},$$

where $H(x^n)$ denotes the empirical entropy of $x^n$. Using parts (a) and (b) of Lemma 4, we have the following inequality:

$$|B(x^n)| \le (n+1)^{|X|}\,\mathrm e^{nH(x^n)}.\tag{A4}$$

To upper bound $\mathbf E[\Xi_{x^n}(\phi^{(n)},\psi^{(n)})]$, we have the following chain of inequalities:

$$
\mathbf E\bigl[\Xi_{x^n}(\phi^{(n)},\psi^{(n)})\bigr]
\le \sum_{\substack{\check x^n \in B(x^n)\\ \check x^n \ne x^n}}
\Pr\bigl[\phi^{(n)}(\check x^n) = \phi^{(n)}(x^n)\bigr]
\overset{(a)}{\le} \sum_{\check x^n \in B(x^n)} \frac{1}{M}
= \frac{|B(x^n)|}{M}
\overset{(b)}{\le} \mathrm e\,(n+1)^{|X|}\,\mathrm e^{-n[R - H(x^n)]}.
$$

Step (a) follows from Lemma 7 part (a) and the random construction of the linear encoder $\phi^{(n)}$. Step (b) follows from (A4) and $M \ge \mathrm e^{nR}$. On the other hand, we have the obvious bound $\mathbf E[\Xi_{x^n}(\phi^{(n)},\psi^{(n)})] \le 1$. Hence we have

$$\mathbf E\bigl[\Xi_{x^n}(\phi^{(n)},\psi^{(n)})\bigr] \le \mathrm e\,(n+1)^{|X|}\,\mathrm e^{-n[R - H(x^n)]^+}.$$

Hence, noting that $H(x^n) = H(\bar X)$ for every $x^n \in T_{\bar X}^n$, we have

$$
\begin{aligned}
\mathbf E\bigl[\Xi_{\bar X}(\phi^{(n)},\psi^{(n)})\bigr]
&= \mathbf E\Bigl[\frac{1}{|T_{\bar X}^n|} \sum_{x^n \in T_{\bar X}^n} \Xi_{x^n}(\phi^{(n)},\psi^{(n)})\Bigr]
= \frac{1}{|T_{\bar X}^n|} \sum_{x^n \in T_{\bar X}^n} \mathbf E\bigl[\Xi_{x^n}(\phi^{(n)},\psi^{(n)})\bigr]\\
&\le \mathrm e\,(n+1)^{|X|}\,\mathrm e^{-n[R - H(\bar X)]^+},
\end{aligned}
$$

completing the proof. □
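The counting bound (A4) used in step (b) can be checked numerically. The sketch below is our illustration (the binary alphabet, $n = 10$, and the three test sequences are arbitrary choices); it compares $|B(x^n)|$ with $(n+1)^{|X|}\mathrm e^{nH(x^n)}$, with $H(x^n)$ the empirical entropy in nats:

```python
from itertools import product
from math import exp, log

def emp_entropy(x):
    # Empirical (type) entropy of a binary sequence, in nats
    p = sum(x) / len(x)
    return 0.0 if p in (0.0, 1.0) else -(p * log(p) + (1 - p) * log(1 - p))

n, alpha = 10, 2   # block length and alphabet size |X| (toy values)
results = []
for x in [(0,) * n, (1, 1, 1) + (0,) * (n - 3), (1, 0) * (n // 2)]:
    H = emp_entropy(x)
    # |B(x^n)|: sequences whose empirical entropy does not exceed H(x^n)
    size = sum(1 for y in product(range(2), repeat=n) if emp_entropy(y) <= H)
    results.append((size, (n + 1) ** alpha * exp(n * H)))

for size, bound in results:
    print(size, "<=", bound)
```

For the all-zero sequence only the two constant sequences qualify, while the bound is $(n+1)^2 = 121$; the bound is loose but, as (A4) asserts, never violated.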

Appendix D. Proof of Lemma 9

In this appendix, we prove Lemma 9. This lemma immediately follows from the following lemma:
Lemma A1.
For any $n, m$ satisfying $(m/n)\log|X| \le R$, we have

$$
\mathbf E\Bigl[D\bigl(p_{\tilde K^m|M_A^{(n)}} \,\big\|\, p_{V^m} \,\big|\, p_{M_A^{(n)}}\bigr)\Bigr]
\le \sum_{(a,k^n) \in \mathcal M_A^{(n)} \times X^n}
p_{M_A^{(n)}K^n}(a,k^n)\,
\log\bigl\{1 + (|X^m| - 1)\, p_{K^n|M_A^{(n)}}(k^n|a)\bigr\}.\tag{A5}
$$

In fact, from $|X^m| \le \mathrm e^{nR}$ and (A5) in Lemma A1, we have the bound (28) in Lemma 9. Thus, we prove Lemma A1 instead of proving Lemma 9.
In the following arguments, we use the following simplified notations:

$$
\begin{aligned}
&k^n, K^n \in X^n \;\to\; k, K \in \mathcal K,\qquad
\tilde k^m, \tilde K^m \in X^m \;\to\; l, L \in \mathcal L,\\
&\varphi^{(n)}: X^n \to X^m \;\to\; \varphi: \mathcal K \to \mathcal L,\qquad
\varphi^{(n)}(k^n) = k^n A + b^m \;\to\; \varphi(k) = kA + b,\\
&V^m \in X^m \;\to\; V \in \mathcal L,\qquad
M_A^{(n)} \in \mathcal M_A^{(n)} \;\to\; M \in \mathcal M.
\end{aligned}
$$

We define

$$\chi_{\varphi(k),l} := \begin{cases} 1, & \text{if } \varphi(k) = l,\\ 0, & \text{if } \varphi(k) \ne l. \end{cases}$$

Then, the conditional distribution of the random variable $L = L_\varphi$ for given $M = a \in \mathcal M$ is

$$p_{L|M}(l|a) = \sum_{k \in \mathcal K} p_{K|M}(k|a)\,\chi_{\varphi(k),l} \quad \text{for } l \in \mathcal L.$$

Define

$$\Upsilon_{\varphi(k),l} := \chi_{\varphi(k),l}\,
\log\Bigl\{|\mathcal L| \sum_{k' \in \mathcal K} p_{K|M}(k'|a)\,\chi_{\varphi(k'),l}\Bigr\}.$$

Then the conditional divergence between $p_{L|M}$ and $p_V$ for given $p_M$ is given by

$$D\bigl(p_{L|M} \,\big\|\, p_V \,\big|\, p_M\bigr)
= \sum_{(a,k) \in \mathcal M \times \mathcal K}\;\sum_{l \in \mathcal L}
p_{MK}(a,k)\,\Upsilon_{\varphi(k),l}.\tag{A6}$$

The quantity $\Upsilon_{\varphi(k),l}$ has the following form:

$$\Upsilon_{\varphi(k),l} = \chi_{\varphi(k),l}\,
\log\Bigl\{|\mathcal L|\Bigl(p_{K|M}(k|a)\,\chi_{\varphi(k),l}
+ \sum_{k' \in \{k\}^{\mathrm c}} p_{K|M}(k'|a)\,\chi_{\varphi(k'),l}\Bigr)\Bigr\}.\tag{A7}$$

The above form is useful for computing $\mathbf E[\Upsilon_{\varphi(k),l}]$.

Proof of Lemma A1:
Taking the expectation of both sides of (A6) with respect to the random choice of the entries of the matrix $A$ and the vector $b$ representing the affine encoder $\varphi$, we have

$$\mathbf E\Bigl[D\bigl(p_{L|M} \,\big\|\, p_V \,\big|\, p_M\bigr)\Bigr]
= \sum_{(a,k) \in \mathcal M \times \mathcal K}\;\sum_{l \in \mathcal L}
p_{MK}(a,k)\,\mathbf E\bigl[\Upsilon_{\varphi(k),l}\bigr].\tag{A8}$$

To compute the expectation $\mathbf E[\Upsilon_{\varphi(k),l}]$, we introduce an expectation operator useful for the computation. Let $\mathbf E_{\varphi(k)=l_k}[\cdot]$ be the expectation operator based on the conditional probability measure $\Pr(\cdot \,|\, \varphi(k) = l_k)$. Using this expectation operator, the quantity $\mathbf E[\Upsilon_{\varphi(k),l}]$ can be written as

$$\mathbf E\bigl[\Upsilon_{\varphi(k),l}\bigr]
= \sum_{l_k \in \mathcal L} \Pr\bigl[\varphi(k) = l_k\bigr]\,
\mathbf E_{\varphi(k)=l_k}\bigl[\Upsilon_{l_k,l}\bigr].\tag{A9}$$

Note that

$$\Upsilon_{l_k,l} = \begin{cases} \Upsilon_{l,l}, & \text{if } l_k = l,\\ 0, & \text{otherwise}. \end{cases}\tag{A10}$$

From (A9) and (A10), we have

$$\mathbf E\bigl[\Upsilon_{\varphi(k),l}\bigr]
= \Pr\bigl[\varphi(k) = l\bigr]\,\mathbf E_{\varphi(k)=l}\bigl[\Upsilon_{l,l}\bigr]
= \frac{1}{|\mathcal L|}\,\mathbf E_{\varphi(k)=l}\bigl[\Upsilon_{l,l}\bigr].\tag{A11}$$

Using (A7), the expectation $\mathbf E_{\varphi(k)=l}[\Upsilon_{l,l}]$ can be written as

$$\mathbf E_{\varphi(k)=l}\bigl[\Upsilon_{l,l}\bigr]
= \mathbf E_{\varphi(k)=l}\Bigl[\log\Bigl\{|\mathcal L|\Bigl(p_{K|M}(k|a)
+ \sum_{k' \in \{k\}^{\mathrm c}} p_{K|M}(k'|a)\,\chi_{\varphi(k'),l}\Bigr)\Bigr\}\Bigr].\tag{A12}$$

Applying Jensen's inequality to the right member of (A12), we obtain the following upper bound on $\mathbf E_{\varphi(k)=l}[\Upsilon_{l,l}]$:

$$
\begin{aligned}
\mathbf E_{\varphi(k)=l}\bigl[\Upsilon_{l,l}\bigr]
&\le \log\Bigl\{|\mathcal L|\Bigl(p_{K|M}(k|a)
+ \sum_{k' \in \{k\}^{\mathrm c}} p_{K|M}(k'|a)\,
\mathbf E_{\varphi(k)=l}\bigl[\chi_{\varphi(k'),l}\bigr]\Bigr)\Bigr\}\\
&\overset{(a)}{=} \log\Bigl\{|\mathcal L|\Bigl(p_{K|M}(k|a)
+ \sum_{k' \in \{k\}^{\mathrm c}} p_{K|M}(k'|a)\,\frac{1}{|\mathcal L|}\Bigr)\Bigr\}
= \log\bigl\{1 + (|\mathcal L| - 1)\,p_{K|M}(k|a)\bigr\}.
\end{aligned}\tag{A13}
$$

Step (a) follows from the fact that, by Lemma 7 parts (b) and (c),

$$\mathbf E_{\varphi(k)=l}\bigl[\chi_{\varphi(k'),l}\bigr]
= \Pr\bigl(\varphi(k') = l \,\big|\, \varphi(k) = l\bigr)
= \frac{1}{|\mathcal L|}.\tag{A14}$$

From (A8), (A11), and (A13), we have the bound (A5) in Lemma A1. □
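On a toy instance, the bound (A5) can be verified exactly by averaging over the whole affine-encoder ensemble. The sketch below is our illustration, specialized to trivial side information $M_A^{(n)}$ (so the conditioning disappears), with arbitrary parameters $q = 2$, $n = 3$, $m = 1$ and a random key distribution:

```python
from itertools import product
from math import log
import random

random.seed(0)
q, n, m = 2, 3, 1                      # arbitrary toy parameters
K = list(product(range(q), repeat=n))  # key alphabet X^n
Lsize = q ** m                         # |L| = |X|^m

w = [random.random() for _ in K]
p_K = [x / sum(w) for x in w]          # arbitrary key distribution

def phi(k, A, b):                      # affine map k*A + b over GF(q)
    return tuple((sum(k[i] * A[i][j] for i in range(n)) + b[j]) % q
                 for j in range(m))

divs = []
for A in product(product(range(q), repeat=m), repeat=n):
    for b in product(range(q), repeat=m):
        p_L = {}                       # push-forward distribution p_L
        for k, pk in zip(K, p_K):
            l = phi(k, A, b)
            p_L[l] = p_L.get(l, 0.0) + pk
        # D(p_L || uniform) in nats
        divs.append(sum(p * log(p * Lsize) for p in p_L.values() if p > 0))

avg = sum(divs) / len(divs)            # exact expectation over the ensemble
bound = sum(pk * log(1 + (Lsize - 1) * pk) for pk in p_K)
print(avg, "<=", bound)
```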

Appendix E. Proof of Lemma 12

To prove Lemma 12, we prepare a lemma. For simplicity of notation, set $|\mathcal M_A^{(n)}| = M_A$. Define

$$B_n := \Bigl\{(a, z^n, k^n) :
\frac{1}{n}\log\frac{p_{M_A^{(n)}Z^nK^n}(a, z^n, k^n)}{\hat q_{M_A^{(n)}Z^nK^n}(a, z^n, k^n)} \ge -\eta\Bigr\}.$$

Furthermore, define

$$
\begin{aligned}
\tilde C_n &:= \Bigl\{z^n : \frac{1}{n}\log\frac{p_{Z^n}(z^n)}{q_{Z^n}(z^n)} \ge -\eta\Bigr\},\qquad
C_n := \tilde C_n \times \mathcal M_A^{(n)} \times X^n,\qquad
C_n^{\mathrm c} := \tilde C_n^{\mathrm c} \times \mathcal M_A^{(n)} \times X^n,\\
\tilde D_n &:= \bigl\{(a, z^n) : a = \varphi_A^{(n)}(z^n),\;
p_{Z^n|M_A^{(n)}}(z^n|a) \le M_A\,\mathrm e^{n\eta}\,p_{Z^n}(z^n)\bigr\},\qquad
D_n := \tilde D_n \times X^n,\qquad
D_n^{\mathrm c} := \tilde D_n^{\mathrm c} \times X^n,\\
E_n &:= \bigl\{(a, z^n, k^n) : a = \varphi_A^{(n)}(z^n),\;
p_{K^n|M_A^{(n)}}(k^n|a) \le \mathrm e^{-n(R+\eta)}\bigr\}.
\end{aligned}
$$
Then we have the following lemma.
Lemma A2.
$$p_{M_A^{(n)}Z^nK^n}\bigl(B_n^{\mathrm c}\bigr) \le \mathrm e^{-n\eta},\qquad
p_{M_A^{(n)}Z^nK^n}\bigl(C_n^{\mathrm c}\bigr) \le \mathrm e^{-n\eta},\qquad
p_{M_A^{(n)}Z^nK^n}\bigl(D_n^{\mathrm c}\bigr) \le \mathrm e^{-n\eta}.$$
Proof. 
We first prove the first inequality:

$$
p_{M_A^{(n)}Z^nK^n}(B_n^{\mathrm c})
= \sum_{(a,z^n,k^n) \in B_n^{\mathrm c}} p_{M_A^{(n)}Z^nK^n}(a, z^n, k^n)
\overset{(a)}{\le} \sum_{(a,z^n,k^n) \in B_n^{\mathrm c}}
\mathrm e^{-n\eta}\,\hat q_{M_A^{(n)}Z^nK^n}(a, z^n, k^n)
= \mathrm e^{-n\eta}\,\hat q_{M_A^{(n)}Z^nK^n}\bigl(B_n^{\mathrm c}\bigr)
\le \mathrm e^{-n\eta}.
$$

Step (a) follows from the definition of $B_n$. For the second inequality we have

$$
p_{M_A^{(n)}Z^nK^n}(C_n^{\mathrm c})
= p_{Z^n}(\tilde C_n^{\mathrm c})
= \sum_{z^n \in \tilde C_n^{\mathrm c}} p_{Z^n}(z^n)
\overset{(a)}{\le} \sum_{z^n \in \tilde C_n^{\mathrm c}} \mathrm e^{-n\eta}\,q_{Z^n}(z^n)
= \mathrm e^{-n\eta}\,q_{Z^n}\bigl(\tilde C_n^{\mathrm c}\bigr)
\le \mathrm e^{-n\eta}.
$$

Step (a) follows from the definition of $C_n$. We finally prove the third inequality:

$$
\begin{aligned}
p_{M_A^{(n)}Z^nK^n}(D_n^{\mathrm c})
= p_{M_A^{(n)}Z^n}(\tilde D_n^{\mathrm c})
&= \sum_{a \in \mathcal M_A^{(n)}}\;
\sum_{\substack{z^n : \varphi_A^{(n)}(z^n) = a\\
p_{Z^n}(z^n) \le (\mathrm e^{-n\eta}/M_A)\,p_{Z^n|M_A^{(n)}}(z^n|a)}}
p_{Z^n}(z^n)\\
&\le \frac{\mathrm e^{-n\eta}}{M_A}
\sum_{a \in \mathcal M_A^{(n)}}\;
\sum_{z^n : \varphi_A^{(n)}(z^n) = a} p_{Z^n|M_A^{(n)}}(z^n|a)
\le \frac{\mathrm e^{-n\eta}}{M_A}\,\bigl|\mathcal M_A^{(n)}\bigr|
= \mathrm e^{-n\eta}.
\end{aligned}
$$

This completes the proof of Lemma A2. □
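All three inequalities of Lemma A2 are instances of one change-of-measure argument: a set on which $p \le \mathrm e^{-n\eta} q$ holds pointwise can carry at most $\mathrm e^{-n\eta}$ of the total mass of $p$. A minimal numerical sketch (ours; the two random distributions and the parameters are arbitrary):

```python
import random
from math import exp

random.seed(2)
n, eta = 5, 0.1          # arbitrary toy parameters

def rand_dist(size):
    w = [random.random() for _ in range(size)]
    s = sum(w)
    return [x / s for x in w]

p, q = rand_dist(50), rand_dist(50)   # two arbitrary distributions

# Mass p assigns to the atypical set {x : (1/n) log(p(x)/q(x)) < -eta},
# i.e. the points where p(x) < e^{-n eta} q(x)
tail = sum(px for px, qx in zip(p, q) if px < exp(-n * eta) * qx)
print(tail, "<=", exp(-n * eta))
```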
Proof of Lemma 12:
By definition, we have

$$
\begin{aligned}
&p_{M_A^{(n)}Z^nK^n}\bigl(B_n \cap C_n \cap D_n \cap E_n\bigr)\\
&\quad= p_{M_A^{(n)}Z^nK^n}\biggl\{
\frac{1}{n}\log\frac{p_{M_A^{(n)}Z^nK^n}(M_A^{(n)}, Z^n, K^n)}
{\hat q_{M_A^{(n)}Z^nK^n}(M_A^{(n)}, Z^n, K^n)} \ge -\eta,\;
0 \ge \frac{1}{n}\log\frac{q_{Z^n}(Z^n)}{p_{Z^n}(Z^n)} - \eta,\\
&\qquad\qquad
\frac{1}{n}\log M_A \ge \frac{1}{n}\log
\frac{p_{Z^n|M_A^{(n)}}(Z^n|M_A^{(n)})}{p_{Z^n}(Z^n)} - \eta,\;
R \le \frac{1}{n}\log\frac{1}{p_{K^n|M_A^{(n)}}(K^n|M_A^{(n)})} - \eta
\biggr\}.
\end{aligned}
$$

Then for any $\varphi_A^{(n)}$ satisfying $(1/n)\log\|\varphi_A^{(n)}\| \le R_A$, we have

$$
\begin{aligned}
&p_{M_A^{(n)}Z^nK^n}\bigl(B_n \cap C_n \cap D_n \cap E_n\bigr)\\
&\quad\le p_{M_A^{(n)}Z^nK^n}\biggl\{
\frac{1}{n}\log\frac{p_{M_A^{(n)}Z^nK^n}(M_A^{(n)}, Z^n, K^n)}
{\hat q_{M_A^{(n)}Z^nK^n}(M_A^{(n)}, Z^n, K^n)} \ge -\eta,\;
0 \ge \frac{1}{n}\log\frac{q_{Z^n}(Z^n)}{p_{Z^n}(Z^n)} - \eta,\\
&\qquad\qquad
R_A \ge \frac{1}{n}\log
\frac{p_{Z^n|M_A^{(n)}}(Z^n|M_A^{(n)})}{p_{Z^n}(Z^n)} - \eta,\;
R \le \frac{1}{n}\log\frac{1}{p_{K^n|M_A^{(n)}}(K^n|M_A^{(n)})} - \eta
\biggr\}.
\end{aligned}
$$

Hence, it suffices to show

$$\wp \le p_{M_A^{(n)}Z^nK^n}\bigl(B_n \cap C_n \cap D_n \cap E_n\bigr) + 3\mathrm e^{-n\eta}$$

to prove Lemma 12. We have the following chain of inequalities:

$$
\begin{aligned}
\wp &\overset{(a)}{=} p_{M_A^{(n)}Z^nK^n}(E_n)
= p_{M_A^{(n)}Z^nK^n}\bigl(B_n \cap C_n \cap D_n \cap E_n\bigr)
+ p_{M_A^{(n)}Z^nK^n}\bigl((B_n \cap C_n \cap D_n)^{\mathrm c} \cap E_n\bigr)\\
&\le p_{M_A^{(n)}Z^nK^n}\bigl(B_n \cap C_n \cap D_n \cap E_n\bigr)
+ p_{M_A^{(n)}Z^nK^n}(B_n^{\mathrm c})
+ p_{M_A^{(n)}Z^nK^n}(C_n^{\mathrm c})
+ p_{M_A^{(n)}Z^nK^n}(D_n^{\mathrm c})\\
&\overset{(b)}{\le} p_{M_A^{(n)}Z^nK^n}\bigl(B_n \cap C_n \cap D_n \cap E_n\bigr)
+ 3\mathrm e^{-n\eta}.
\end{aligned}
$$

Step (a) follows from the definition of $\wp$. Step (b) follows from Lemma A2. □
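The passage from step (a) to step (b) uses only the exact decomposition of $p(E_n)$ and the union bound for the complements. The sketch below (our illustration, with four arbitrary random events on a 100-point probability space) checks both steps:

```python
import random

random.seed(3)
N = 100
w = [random.random() for _ in range(N)]
total = sum(w)
P = [x / total for x in w]            # arbitrary probability measure

def prob(S):
    return sum(P[x] for x in S)

universe = set(range(N))
# Four arbitrary events playing the roles of B_n, C_n, D_n, E_n
B, C, D, E = (set(random.sample(range(N), 80)) for _ in range(4))

core = prob(B & C & D & E)
# Exact decomposition: p(E) = p(B n C n D n E) + p((B n C n D)^c n E)
exact = core + prob(E - (B & C & D))
# Union bound: p((B n C n D)^c n E) <= p(B^c) + p(C^c) + p(D^c)
ub = core + prob(universe - B) + prob(universe - C) + prob(universe - D)
print(prob(E), "=", exact, "<=", ub)
```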

References

  1. Brier, E.; Clavier, C.; Olivier, F. Correlation Power Analysis with a Leakage Model. In International Workshop on Cryptographic Hardware and Embedded Systems; Joye, M., Quisquater, J.J., Eds.; Springer: Berlin/Heidelberg, Germany, 2004; pp. 16–29.
  2. Quisquater, J.J.; Samyde, D. ElectroMagnetic Analysis (EMA): Measures and Counter-Measures for Smart Cards. In International Conference on Research in Smart Cards; Attali, I., Jensen, T., Eds.; Springer: London, UK, 2001; pp. 200–210.
  3. Kocher, P.C. Timing Attacks on Implementations of Diffie-Hellman, RSA, DSS, and Other Systems. In Annual International Cryptology Conference; Springer: Berlin/Heidelberg, Germany, 1996; Volume 1109, pp. 104–113.
  4. Kocher, P.C.; Jaffe, J.; Jun, B. Differential Power Analysis. In Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 1999; Volume 1666, pp. 388–397.
  5. Agrawal, D.; Archambeault, B.; Rao, J.R.; Rohatgi, P. The EM Side—Channel(s). In International Workshop on Cryptographic Hardware and Embedded Systems; Kaliski, B.S., Koç, Ç.K., Paar, C., Eds.; Springer: Berlin/Heidelberg, Germany, 2003; pp. 29–45.
  6. Csiszár, I. Linear Codes for Sources and Source Networks: Error Exponents, Universal Coding. IEEE Trans. Inform. Theory 1982, 28, 585–592.
  7. Ahlswede, R.; Körner, J. Source Coding with Side Information and A Converse for The Degraded Broadcast Channel. IEEE Trans. Inform. Theory 1975, 21, 629–637.
  8. Wyner, A.D. The Common Information of Two Dependent Random Variables. IEEE Trans. Inform. Theory 1975, 21, 163–179.
  9. Oohama, Y. Exponent function for one helper source coding problem at rates outside the rate region. In Proceedings of the 2015 IEEE International Symposium on Information Theory (ISIT), Hong Kong, China, 14–19 June 2015; pp. 1575–1579.
  10. Watanabe, S.; Oohama, Y. Privacy amplification theorem for bounded storage eavesdropper. In Proceedings of the 2012 IEEE Information Theory Workshop (ITW), Bangalore, India, 20–25 October 2012; pp. 177–181.
  11. Coron, J.; Naccache, D.; Kocher, P.C. Statistics and secret leakage. ACM Trans. Embed. Comput. Syst. 2004, 3, 492–508.
  12. Köpf, B.; Basin, D.A. An information-theoretic model for adaptive side-channel attacks. In Proceedings of the 2007 ACM Conference on Computer and Communications Security, CCS 2007, Alexandria, VA, USA, 28–31 October 2007; pp. 286–296.
  13. Backes, M.; Köpf, B. Formally Bounding the Side-Channel Leakage in Unknown-Message Attacks. In European Symposium on Research in Computer Security; Springer: Berlin/Heidelberg, Germany, 2008; Volume 5283, pp. 517–532.
  14. Micali, S.; Reyzin, L. Physically Observable Cryptography (Extended Abstract). In Theory of Cryptography Conference; Springer: Berlin/Heidelberg, Germany, 2004; Volume 2951, pp. 278–296.
  15. Standaert, F.; Malkin, T.; Yung, M. A Unified Framework for the Analysis of Side-Channel Key Recovery Attacks. In Annual International Conference on the Theory and Applications of Cryptographic Techniques; Springer: Berlin/Heidelberg, Germany, 2009; Volume 5479, pp. 443–461.
  16. Wyner, A.D. On Source Coding with Side Information at The Decoder. IEEE Trans. Inform. Theory 1975, 21, 294–300.
  17. Oohama, Y. Strong converse exponent for degraded broadcast channels at rates outside the capacity region. In Proceedings of the 2015 IEEE International Symposium on Information Theory (ISIT), Hong Kong, China, 14–19 June 2015; pp. 939–943.
  18. Oohama, Y. Strong converse theorems for degraded broadcast channels with feedback. In Proceedings of the 2015 IEEE International Symposium on Information Theory (ISIT), Hong Kong, China, 14–19 June 2015; pp. 2510–2514.
  19. Oohama, Y. New Strong Converse for Asymmetric Broadcast Channels. arXiv 2016, arXiv:1604.02901.
  20. Oohama, Y. Exponential Strong Converse for Source Coding with Side Information at the Decoder. Entropy 2018, 20, 352.
  21. Csiszár, I.; Körner, J. Information Theory, Coding Theorems for Discrete Memoryless Systems, 2nd ed.; Cambridge University Press: Cambridge, UK, 2011.
  22. Oohama, Y.; Han, T.S. Universal coding for the Slepian-Wolf data compression system and the strong converse theorem. IEEE Trans. Inform. Theory 1994, 40, 1908–1919.
  23. Hayashi, M. Exponential Decreasing Rate of Leaked Information in Universal Random Privacy Amplification. IEEE Trans. Inform. Theory 2011, 57, 3989–4001.
Figure 1. Illustration of side-channel attacks.
Figure 2. Main problem: side-channel attacks on a Shannon cipher system.
Figure 3. Basic solution framework: post-encryption-compression coding system.
Figure 4. Our proposed solution: affine encoders as privacy amplifiers.
Figure 5. Two split problems: Problem 0 (Reliability) and Problem 1 (Security).
Figure 6. Three related coding problems.
Figure 7. Shape of the region $R(p_K, W)$.
Figure 8. The inner bound $R_{\mathrm{Sys}}^{(\mathrm{in})}(p_X, p_K, W)$ of the reliable and secure rate region $R_{\mathrm{Sys}}(p_X, p_K, W)$.
Table 1. Differences between Problems 1, 2, and 3 in terms of $\{\varphi^{(n)}\}_{n \ge 1}$ and security criteria.

- Problem 1: $\varphi^{(n)}$ are affine encoders; security criterion: $D\bigl(p_{\tilde K^m|M_A^{(n)}} \,\big\|\, p_{V^m} \,\big|\, p_{M_A^{(n)}}\bigr)$.
- Problem 2: $\varphi^{(n)}$ are general encoders; security criterion: $d\bigl(p_{V^m} \times p_{M_A^{(n)}},\, p_{\tilde K^m M_A^{(n)}}\bigr)$.
- Problem 3: $\varphi^{(n)}$ are general encoders; security criterion: $p_{\mathrm c,A}^{(n)}\bigl(\varphi^{(n)}, \varphi_A^{(n)}, \psi_A^{(n)} \,\big|\, p_{K^n}, W^n\bigr)$.
