A Game-Theoretic Approach to Information-Flow Control via Protocol Composition

Mário S. Alvim; Konstantinos Chatzikokolakis; Yusuke Kawamoto; Catuscia Palamidessi

doi:10.3390/e20050382

,

and

¹

Computer Science Department, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte-MG 31270-110, Brazil

²

École Polytechnique, 91128 Palaiseau, France

³

Centre National de la Recherche Scientifique (CNRS), 91190 Gif-sur-Yvette, France

⁴

National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba 305-8560, Japan

Entropy2018, 20(5), 382;https://doi.org/10.3390/e20050382

This article belongs to the Special Issue Information Theory in Game Theory

Version Notes

Order Reprints

Abstract

In the inference attacks studied in Quantitative Information Flow (QIF), the attacker typically tries to interfere with the system in the attempt to increase its leakage of secret information. The defender, on the other hand, typically tries to decrease leakage by introducing some controlled noise. This noise introduction can be modeled as a type of protocol composition, i.e., a probabilistic choice among different protocols, and its effect on the amount of leakage depends heavily on whether or not this choice is visible to the attacker. In this work, we consider operators for modeling visible and hidden choice in protocol composition, and we study their algebraic properties. We then formalize the interplay between defender and attacker in a game-theoretic framework adapted to the specific issues of QIF, where the payoff is information leakage. We consider various kinds of leakage games, depending on whether players act simultaneously or sequentially, and on whether or not the choices of the defender are visible to the attacker. In the case of sequential games, the choice of the second player is generally a function of the choice of the first player, and his/her probabilistic choice can be either over the possible functions (mixed strategy) or it can be on the result of the function (behavioral strategy). We show that when the attacker moves first in a sequential game with a hidden choice, then behavioral strategies are more advantageous for the defender than mixed strategies. This contrasts with the standard game theory, where the two types of strategies are equivalent. Finally, we establish a hierarchy of these games in terms of their information leakage and provide methods for finding optimal strategies (at the points of equilibrium) for both attacker and defender in the various cases.

Keywords:

information leakage; quantitative information flow; game theory; algebraic properties

1. Introduction

A fundamental problem in computer security is the leakage of sensitive information due to the correlation of secret values with observables, i.e., any information accessible to the attacker, such as, for instance, the system’s outputs or execution time. The typical defense consists of reducing this correlation, which can be done in, essentially, two ways. The first, applicable when the correspondence secret-observable is deterministic, consists of coarsening the equivalence classes of secrets that give rise to the same observables. This can be achieved with post-processing, i.e., sequentially composing the original system with a program that removes information from observables. For example, a typical attack on encrypted web traffic consists of the analysis of the packets’ length, and a typical defense consists of padding extra bits so as to diminish the length variety [1].

The second kind of defense, on which we focus in this work, consists of adding controlled noise to the observables produced by the system. This can be usually seen as a composition of different protocols via probabilistic choice.

Example 1 (Differential privacy).

Consider a counting query f, namely a function that, applied to a dataset x, returns the number of individuals in x that satisfies a given property. A way to implement differential privacy [2] is to add geometrical noise to the result of f, so as to obtain a probability distribution P on integers of the form

P (z) = c e^{| z - f (x) |}

, where c is a normalization factor. The resulting mechanism can be interpreted as a probabilistic choice on protocols of the form

f (x), f (x) + 1, f (x) + 2, \dots, f (x) - 1, f (x) - 2, \dots

, where the probability assigned to

f (x) + n

and to

f (x) - n

decreases exponentially with n.

Example 2 (Dining cryptographers).

Consider two agents running the dining cryptographers protocol [3], which consists of tossing a fair binary coin and then declaring the exclusive or ⊕ of their secret value x and the result of the coin. The protocol can be thought of as the fair probabilistic choice of two protocols, one consisting simply of declaring x and the other declaring

x \oplus 1

.

Most of the work in the literature of Quantitative Information Flow (QIF) considers passive attacks, in which the attacker only observes the system. Notable exceptions are the works [4,5,6], which consider attackers who interact with and influence the system, possibly in an adaptive way, with the purpose of maximizing the leakage of information.

Example 3 (CRIME attack).

Compression Ratio Info-leak Made Easy (CRIME) [7] is a security exploit against secret web cookies over connections using the HTTPS and SPDY protocols and data compression. The idea is that the attacker can inject some content a into the communication of the secret x from the target site to the server. The server then compresses and encrypts the data, including both a and x, and sends back the result. By observing the length of the result, the attacker can then infer information about x. To mitigate the leakage, one possible defense would consist of transmitting, along with x, also an encryption method f selected randomly from a set F. Again, the resulting protocol can be seen as a composition, using probabilistic choice, of the protocols in the set F.

Example 4 (Timing side-channels).

Consider a password-checker, or any similar system in which the user authenticates himself/herself by entering a secret that is checked by the system. An adversary does not know the real secret, of course, but a timing side-channel could reveal the part (e.g., which bit) of the secret in which the adversary’s input fails. By repeating the process with different inputs, the adversary might be able to fully retrieve the secret. A possible counter measure is to make the side channel noisy, by randomizing the order in which the secret’s bits are checked against the user input. This example is studied in detail in Section 7.

In all examples above, the main use of the probabilistic choice is to obfuscate the relation between secrets and observables, thus reducing their correlation; and hence, the information leakage. To achieve this goal, it is essential that the attacker never comes to know the result of the choice. In the CRIME example, however, if f and a are chosen independently, then (in general) it is still better to choose f probabilistically, even if the attacker will come to know, afterwards, the choice of f. In fact, this is true also for the attacker: his/her best strategies (in general) are to choose a according to some probability distribution. Indeed, suppose that

F = {f_{1}, f_{2}}

are the defender’s choices and

A = {a_{1}, a_{2}}

are the attacker’s and that

f_{1} (\cdot, a_{1})

leaks more than

f_{1} (\cdot, a_{2})

, while

f_{2} (\cdot, a_{1})

leaks less than

f_{2} (\cdot, a_{2})

. This is a scenario like the matching pennies in game theory: if one player selects an action deterministically, the other player may exploit this choice and get an advantage. For each player, the optimal strategy is to play probabilistically, using a distribution that maximizes his/her own gain for all possible actions of the attacker. In zero-sum games, in which the gain of one player coincides with the loss of the other, the optimal pair of distributions always exists, and it is called the saddle point. It also coincides with the Nash equilibrium, which is defined as the point at which neither of the two players gets any advantage in changing his/her strategy unilaterally.

Motivated by these examples, this paper investigates the two kinds of choice, visible and hidden (to the attacker), in a game-theoretic setting. Looking at them as language operators, we study their algebraic properties, which will help reason about their behavior in games. We consider zero-sum games, in which the gain (for the attacker) is represented by the leakage. While for the visible choice, it is appropriate to use the “classic” game-theoretic framework, for the hidden choice, we need to adopt the more general framework of the information leakage games proposed in [6]. This happens because, in contrast with standard game theory, in games with hidden choice, the payoff of a mixed strategy is a convex function of the distribution on the defender’s pure actions, rather than simply the expected value of their utilities. We will consider both simultaneous games, in which each player chooses independently, and sequential games, in which one player chooses his/her action first. We aim at comparing all these situations and at identifying the precise advantage of the hidden choice over the visible one.

To measure leakage, we use the well-known information-theoretic model. A central notion in this model is that of entropy, but here, we use its converse, vulnerability, which represents the magnitude of the threat. In order to derive results as general as possible, we adopt the very comprehensive notion of vulnerability as any convex and continuous function, as used in [4] and [8]. This notion has been shown [8] to be, in a precise sense, the most general information measure w.r.t. a set of fundamental information-theoretic axioms. Our results, hence, apply to all information measures that respect such fundamental principles, including the widely-adopted measures of Bayes vulnerability (also known as min-vulnerability, also known as (the converse of) Bayes risk) [9,10], Shannon entropy [11], guessing entropy [12] and g-vulnerability [13].

The main contributions of this paper are:

We present a general framework for reasoning about information leakage in a game-theoretic setting, extending the notion of information leakage games proposed in [6] to both simultaneous and sequential games, with either a hidden or visible choice.
We present a rigorous compositional way, using visible and hidden choice operators, for representing attacker’s and defender’s actions in information leakage games. In particular, we study the algebraic properties of visible and hidden choice on channels and compare the two kinds of choice with respect to the capability of reducing leakage, in the presence of an adaptive attacker.
We provide a taxonomy of the various scenarios (simultaneous and sequential) showing when randomization is necessary, for either attacker or defender, to achieve optimality. Although it is well known in information flow that the defender’s best strategy is usually randomized, only recently has it been shown that when defender and attacker act simultaneously, the attacker’s optimal strategy also requires randomization [6].
We compare the vulnerability of the leakage games for these various scenarios and establish a hierarchy of leakage games based on the order between the value of the leakage in the Nash equilibrium. Furthermore, we show that when the attacker moves first in a sequential game with hidden choice, the behavioral strategies (where the defender chooses his/her probabilistic distribution after he/she has seen the choice of the attacker) are more advantageous for the defender than the mixed strategies (where the defender chooses the probabilistic distribution over his/her possible functional dependency on the choice of the attacker). This contrast with the standard game theory, where the two types of strategies are equivalent. Another difference is that in our attacker-first sequential games, there may not exist Nash equilibria with deterministic strategies for the defender (although the defender has full visibility of the attacker’s moves).
We use our framework in a detailed case study of a password-checking protocol. A naive program, which checks the password bit by bit and stops when it finds a mismatch, is clearly very insecure, because it reveals at each attempt (via a timing side-channel) the maximum correct prefix. On the other hand, if we continue checking until the end of the string (time padding), the program becomes very inefficient. We show that, by using probabilistic choice instead, we can obtain a good trade-off between security and efficiency.

Plan of the Paper

The remainder of the paper is organized as follows. In Section 2, we review some basic notions of game theory and quantitative information flow. In Section 3, we introduce our running example. In Section 4, we define the visible and hidden choice operators and demonstrate their algebraic properties. In Section 5, the core of the paper, we examine various scenarios for leakage games. In Section 6, we compare the vulnerability of the various leakage games and establish a hierarchy among those games. In Section 7, we show an application of our framework to a password checker. In Section 8, we discuss related work, and finally, in Section 9, we conclude.

A preliminary version of this paper appeared in [14]. One difference with respect to [14] is that in the present paper, we consider both behavioral and mixed strategies in the sequential games, while in [14], we only considered the latter. We also show that the two kinds of strategies are not equivalent in our context (Example 10: the optimal strategy profile yields a different payoff depending on whether the defender adopts mixed strategies or behavioral ones). In light of this difference, we provide new results that concern behavioral strategies, and in particular:

Theorem 3, which concerns the defender’s behavioral strategies in the defender-first game with visible choice (Game II),
the second half of Theorem 6, which deals with the adversary’s behavioral strategies in the attacker-first game with hidden choice (Game VI).

Furthermore, in this paper, we define formally all concepts and provide all the proofs. In particular, we provide a precise formulation of the comparison among games with visible/hidden choices (Propositions 4 and 5, Corollaries 3–5) in Section 6. Finally, in Section 7, we provide a new result, expressed by Theorem 7, regarding the optimal strategies for the defender in the presence of a uniform prior on passwords.

2. Preliminaries

In this section, we review some basic notions from game theory and quantitative information flow. We use the following notation: Given a set

I

, we denote by

D I

the set of all probability distributions over

I

. Given

μ \in D I

, its support

(μ) \overset{def}{=} {i \in I : μ (i) > 0}

is the set of its elements with positive probability. We use

i \leftarrow μ

to indicate that a value

i \in I

is sampled from a distribution

μ

on

I

. A set

S \subseteq R^{n}

is convex if

t s_{0} + (1 - t) s_{1} \in S

for all

s_{0}, s_{1} \in S

and

t \in [0, 1]

. For such a set, a function

f : S \to R

is convex if

f (t s_{0} + (1 - t) s_{1}) \leq t f (s_{0}) + (1 - t) f (s_{1})

for all

s_{0}, s_{1} \in S, t \in [0, 1]

, and concave if

- f

is convex.

2.1. Basic Concepts from Game Theory

2.1.1. Two-Player Games

Two-player games are a model for reasoning about the behavior of two players. In a game, each player has at its disposal a set of actions that he/she can perform, and he/she obtains some gain or loss depending on the actions chosen by both players. Gains and losses are defined using a real-valued payoff function. Each player is assumed to be rational, i.e., his/her choice is driven by the attempt to maximize his/her own expected payoff. We also assume that the set of possible actions and the payoff functions of both players are common knowledge.

In this paper, we only consider finite games, in which the set of actions available to the players is finite, which are also zero-sum games, so the payoff of one player is the loss of the other. Next, we introduce an important distinction between simultaneous and sequential games. In the following, we will call the two players defender and attacker.

2.1.2. Simultaneous Games

In a simultaneous game, each player chooses his/her action without knowing the action chosen by the other. The term “simultaneous” here does not mean that the players’ actions are chosen at the same time, but only that they are chosen independently. Formally, such a game is defined as a tuple (following the convention of security games, we set the first player to be the defender)

(D, A, u_{d}, u_{a})

, where

D

is a nonempty set of defender’s actions,

A

is a nonempty set of attacker’s actions,

u_{d} : D \times A \to R

is the defender’s payoff function and

u_{a} : D \times A \to R

is the attacker’s payoff function.

Each player may choose an action deterministically or probabilistically. A pure strategy of the defender (respectively attacker) is a deterministic choice of an action, i.e., an element

d \in D

(respectively

a \in A

). A pair

(d, a)

is called pure strategy profile, and

u_{d} (d, a)

,

u_{a} (d, a)

represent the defender’s and the attacker’s payoffs, respectively. A mixed strategy of the defender (respectively attacker) is a probabilistic choice of an action, defined as a probability distribution

δ \in D D

(respectively

α \in D A

). A pair

(δ, α)

is called mixed strategy profile. The defender’s and the attacker’s expected payoff functions for mixed strategies are defined, respectively, as:

\begin{matrix} U_{d} (δ, α) & \overset{def}{=} \underset{\begin{matrix} d \leftarrow δ \\ a \leftarrow α \end{matrix}}{E} u_{d} (d, a) = \sum_{\begin{matrix} d \in D \\ a \in A \end{matrix}} δ (d) α (a) u_{d} (d, a) and : \\ U_{a} (δ, α) & \overset{def}{=} \underset{\begin{matrix} d \leftarrow δ \\ a \leftarrow α \end{matrix}}{E} u_{a} (d, a) = \sum_{\begin{matrix} d \in D \\ a \in A \end{matrix}} δ (d) α (a) u_{a} (d, a) . \end{matrix}

A defender’s mixed strategy

δ \in D D

is the best response to an attacker’s mixed strategy

α \in D A

if

U_{d} (δ, α) = {max}_{δ^{'} \in D D} U_{d} (δ^{'}, α)

. Symmetrically,

α \in D A

is the best response to

δ \in D D

if

U_{a} (δ, α) = {max}_{α^{'} \in D A} U_{d} (δ, α^{'})

. A mixed-strategy Nash equilibrium is a profile

(δ^{*}, α^{*})

such that

δ^{*}

is the best response to

α^{*}

and vice versa. This means that in a Nash equilibrium, no unilateral deviation by any single player provides better payoff to that player. If

δ^{*}

and

α^{*}

are point distributions concentrated on some

d^{*} \in D

and

a^{*} \in A

, respectively, then

(δ^{*}, α^{*})

is a pure-strategy Nash equilibrium and will be denoted by

(d^{*}, a^{*})

. While not all games have a pure strategy Nash equilibrium, every finite game has a mixed strategy Nash equilibrium.

2.1.3. Sequential Games

In a sequential game, players may take turns in choosing their actions. In this paper, we only consider the case in which each player moves only once, in such a way that one of the players (the leader) chooses his/her action first, and commits to it, before the other player (the follower) makes his/her choice. The follower may have total knowledge of the choice made by the leader, or only partial. We refer to the two scenarios by the terms perfect and imperfect information, respectively. Another distinction is the kind of randomization used by the players, namely whether the follower chooses probabilistically his/her action after he/she knows (partially or totally) the move of the leader, or whether he/she chooses at the beginning of the game a probabilistic distribution on (deterministic) strategies that depend on the (partial or total) knowledge of the move of the leader. In the first case, the strategies are called behavioral, in the second case mixed.

We now give the precise definitions assuming that the leader is the defender. The definitions for the case in which the leader is the attacker are analogous.

A defender-first sequential game with perfect information is a tuple

(D, D \to A, u_{d}, u_{a})

where

D

,

A

,

u_{d}

and

u_{a}

are defined as in simultaneous games: The choice of an action

d \in D

represents a pure strategy of the defender. As for the attacker, his/her choice

a \in A

depends functionally on the prior choice d of the defender, and for this reason, the pure strategies of the attacker are functions

s_{a} : D \to A

. As for the probabilistic strategies, those of the defender are defined as in simultaneous games: namely, they are distributions

δ \in D D

. On the other hand, the attacker’s probabilistic strategies can be defined in two different ways: In the behavioral case, an attacker’s probabilistic strategy is a function

ϕ_{a} : D \to D (A)

. Namely, the attacker chooses a distribution on his/her actions after he/she sees the move of the defender. In the mixed case, an attacker’s probabilistic strategy is a probability distribution

σ_{a} \in D (D \to A)

. Namely, the attacker chooses a priori a distribution on pure strategies. The defender’s and the attacker’s expected payoff functions for mixed strategies are defined, respectively, as:

\begin{matrix} Behavioral case : & \{\begin{matrix} U_{d} (δ, ϕ_{a}) & \overset{def}{=} & \underset{d \leftarrow δ}{E} \underset{a \leftarrow ϕ_{a} (d)}{E} u_{d} (d, a) & = & \sum_{d \in D} δ (d) \sum_{a \in A} ϕ_{a} (d) (a) u_{d} (d, a) \\ U_{a} (δ, ϕ_{a}) & \overset{def}{=} & \underset{d \leftarrow δ}{E} \underset{a \leftarrow ϕ_{a} (d)}{E} u_{a} (d, a) & = & \sum_{d \in D} δ (d) \sum_{a \in A} ϕ_{a} (d) (a) u_{a} (d, a) \end{matrix} \\ Mixed case : & \{\begin{matrix} U_{d} (δ, σ_{a}) & \overset{def}{=} & \underset{\begin{matrix} d \leftarrow δ \\ s_{a} \leftarrow σ_{a} \end{matrix}}{E} u_{d} (d, s_{a} (d)) & = & \sum_{\begin{matrix} d \in D \\ s_{a} : D \to A \end{matrix}} δ (d) σ_{a} (s_{a}) u_{d} (d, s_{a} (d)) \\ U_{a} (δ, σ_{a}) & \overset{def}{=} & \underset{\begin{matrix} d \leftarrow δ \\ s_{a} \leftarrow σ_{a} \end{matrix}}{E} u_{a} (d, s_{a} (d)) & = & \sum_{\begin{matrix} d \in D \\ s_{a} : D \to A \end{matrix}} δ (d) σ_{a} (s_{a}) u_{a} (d, s_{a} (d)) \end{matrix} \end{matrix}

The case of imperfect information is typically formalized by assuming an indistinguishability (equivalence) relation over the actions chosen by the leader, representing a scenario in which the follower cannot distinguish between the actions belonging to the same equivalence class. The pure strategies of the followers, therefore, are functions from the set of the equivalence classes on the actions of the leader to his/her own actions. Formally, a defender-first sequential game with imperfect information is a tuple

(D, K_{a} \to A, u_{d}, u_{a})

where

D

,

A

,

u_{d}

and

u_{a}

are defined as in simultaneous games, and

K_{a}

is a partition of

D

. The expected payoff functions are defined as before, except that now the argument of

ϕ_{a}

and

s_{a}

is the equivalence class of d. Note that in the case in which all defender’s actions are indistinguishable from each other in the eyes of the attacker (totally imperfect information), we have

K_{a} = {D}

, and the expected payoff functions coincide with those of the simultaneous games. In contrast, in the games in which all defender’s actions are distinguishable from the viewpoint of the attacker (perfect information), we have

K_{a} = {{d} ∣ d \in D}

.

In the standard game theory, under the assumption of perfect recall (i.e., the players never forget what they have learned), behavioral and mixed strategies are equivalent, in the sense that for any behavioral strategy, there is a mixed strategy that yields the same payoff, and vice versa. This is true for both cases of perfect and imperfect information; see [15], Chapter 11.4. In our leakage games, however, this equivalence does not hold anymore, as will be shown in Section 5 and Section 6.

2.1.4. Zero-Sum Games and the Minimax Theorem

A game

(D, A, u_{d}, u_{a})

is zero-sum if for any

d \in D

and any

a \in A

, the defender’s loss is equivalent to the attacker’s gain, i.e.,

u_{d} (d, a) = - u_{a} (d, a)

. For brevity, in zero-sum games, we denote by u the attacker’s payoff function

u_{a}

and by U the attacker’s expected payoff

U_{a}

(Conventionally in game theory, the payoff u is set to be that of the first player, but we prefer to look at the payoff from the point of view of the attacker to be in line with the definition of payoff as vulnerability.). Consequently, the goal of the defender is to minimize U, and the goal of the attacker is to maximize it.

In simultaneous zero-sum games, the Nash equilibrium corresponds to the solution of the minimax problem (or equivalently, the maximin problem), namely the strategy profile

(δ^{*}, α^{*})

such that

U (δ^{*}, α^{*}) = {min}_{δ} {max}_{α} U (δ, α)

. The von Neumann’s minimax theorem, in fact, ensures that such a solution (which always exists) is stable.

Theorem 1 (von Neumann’s minimax theorem).

Let

X \subset R^{m}

and

Y \subset R^{n}

be compact convex sets, and

U : X \times Y \to R

be a continuous function such that

U (x, y)

is a convex function in

x \in X

and a concave function in

y \in Y

. Then:

min_{x \in X} max_{y \in Y} U (x, y) = max_{y \in Y} min_{x \in X} U (x, y) .

A related property is that, under the conditions of Theorem 1, there exists a saddle point

(x^{*}, y^{*})

s.t., for all

x \in X

and

y \in Y

:

U (x^{*}, y) \leq U (x^{*}, y^{*}) \leq U (x, y^{*})

.

The solution of the minimax problem can be obtained by using convex optimization techniques. In the case

U (x, y)

is affine in x and in y, we can also use linear optimization.

In the case

D

and

A

contain two elements each, there is a closed form for the solution. Let

D = {d_{0}, d_{1}}

and

A = {a_{0}, a_{1}}

, respectively. Let

u_{i j}

be the payoff of the defender on

d_{i}, a_{j}

. Then, the Nash equilibrium

(δ^{*}, α^{*})

is given by:

δ^{*} (d_{0}) = \frac{u_{11} - u_{10}}{u_{00} - u_{01} - u_{10} + u_{11}} α^{*} (a_{0}) = \frac{u_{11} - u_{01}}{u_{00} - u_{01} - u_{10} + u_{11}}

(1)

if these values are in

[0, 1]

. Note that, since there are only two elements, the strategy

δ^{*}

is completely specified by its value in

d_{0}

and analogously for

α^{*}

.

2.2. Quantitative Information Flow

Finally, we briefly review the standard framework of quantitative information flow, which is concerned with measuring the amount of information leakage in a (computational) system.

2.2.1. Secrets and Vulnerability

A secret is some piece of sensitive information the defender wants to protect, such as a user’s password, social security number or current location. The attacker usually only has some partial knowledge about the value of a secret, represented as a probability distribution on secrets called a prior. We denote by

X

the set of possible secrets, and we typically use

π

to denote a prior belonging to the set

D X

of probability distributions over

X

.

The vulnerability of a secret is a measure of the payoff that it represents for the attacker. In this paper, we consider a very general notion of vulnerability, following [8], and we define a vulnerability

V

to be any continuous and convex function of type

D X \to R

. It has been shown in [8] that these functions coincide with the set of g-vulnerabilities, and are, in a precise sense, the most general information measures w.r.t. a set of fundamental information-theoretic axioms (more precisely, if posterior vulnerability is defined as the expectation of the vulnerability of posterior distributions, the measure respects the fundamental information-theoretic properties of data-processing inequality (i.e., that post-processing can never increase information, but only destroy it) and of non-negativity of leakage (i.e., that by observing the output of a channel, an actor cannot, on average, lose information) if, and only if, vulnerability is convex). This notion, hence, subsumes all information measures that respect such fundamental principles, including the widely-adopted measures of Bayes vulnerability (also known as min-vulnerability, also known as (the converse of) Bayes risk) [9,10], Shannon entropy [11], guessing entropy [12] and g-vulnerability [13].

2.2.2. Channels, Posterior Vulnerability and Leakage

Computational systems can be modeled as information theoretic channels. A channel

C : X \times Y \to R

is a function in which

X

is a set of input values,

Y

is a set of output values and

C (x, y)

represents the conditional probability of the channel producing output

y \in Y

when input

x \in X

is provided. Every channel C satisfies

0 \leq C (x, y) \leq 1

for all

x \in X

and

y \in Y

, and

\sum_{y \in Y} C (x, y) = 1

for all

x \in X

.

A distribution

π \in D X

and a channel C with inputs

X

and outputs

Y

induce a joint distribution

p (x, y) = π (x) C (x, y)

on

X \times Y

, producing joint random variables

X, Y

with marginal probabilities

p (x) = \sum_{y} p (x, y)

and

p (y) = \sum_{x} p (x, y)

, and conditional probabilities

p (x ∣ y) = \frac{p (x, y)}{p (y)}

if

p (y) \neq 0

. For a given y (s.t.

p (y) \neq 0

), the conditional probabilities

p (x ∣ y)

for each

x \in X

form the posterior distribution

p_{X ∣ y}

.

A channel C in which

X

is a set of secret values and

Y

is a set of observable values produced by a system can be used to model computations on secrets. Assuming the attacker has prior knowledge

π

about the secret value, knows how a channel C works and can observe the channel’s outputs, the effect of the channel is to update the attacker’s knowledge from

π

to a collection of posteriors

p_{X ∣ y}

, each occurring with probability

p (y)

.

Given a vulnerability

V

, a prior

π

and a channel C, the posterior vulnerability

V [π, C]

is the vulnerability of the secret after the attacker has observed the output of the channel C. Formally:

V [π, C] \overset{def}{=} \sum_{y \in Y} p (y) V [p_{X ∣ y}]

.

Consider, for instance, the example of the password-checker with a timing side-channel from the Introduction (Example 4, also discussed in detail in Section 7). Here, the set of secrets

X

consists of all possible passwords (say, all strings of n bits), and a natural vulnerability function is Bayes-vulnerability, given by

V (π) = {max}_{x \in X} π (x)

. This function expresses the adversary’s probability of guessing correctly the password in one try; assuming that the passwords are chosen uniformly, i.e.,

π

is uniform, any guess would be correct with probability

2^{- n}

, giving

V (π) = 2^{- n}

. Now, imagine that the timing side-channel reveals that the adversary’s input failed on the first bit. The adversary now knows the first bit of the password (say 0); hence, the posterior

p_{X ∣ y}

assigns probability zero to all passwords with first bit one and probability

2^{- (n - 1)}

to all passwords with first bit zero. This happens for all possible posteriors, giving posterior vulnerability

V [π, C] = 2^{- (n - 1)}

(two-times greater than the prior

V

).

It is known from the literature [8] that the posterior vulnerability is a convex function of

π

. Namely, for any channel C, any family of distributions

{π_{i}}

and any set of convex coefficients

{c_{i}}

, we have:

\begin{matrix} V [\sum_{i} c_{i} π_{i}, C] \leq & \sum_{i} c_{i} V [π_{i}, C] \end{matrix}

The (information) leakage of channel C under prior

π

is a comparison between the vulnerability of the secret before the system was run (called prior vulnerability) and the posterior vulnerability of the secret. Leakage reflects how much the observation of the system’s outputs increases the attacker’s information about the secret. It can be defined either additively (

V [π, C] - V [π]

) or multiplicatively (

\frac{V [π, C]}{V [π]}

). In the password-checker example, the additive leakage is

2^{- (n - 1)} - 2^{- n} = 2^{- n}

, and the multiplicative leakage is

\frac{2^{- (n - 1)}}{2^{- n}} = 2

.

3. An Illustrative Example

We introduce an example that will serve as a running example throughout the paper. Although admittedly contrived, this example is simple and yet produces different leakage measures for all different combinations of visible/hidden choice and simultaneous/sequential games, thus providing a way to compare all different scenarios in which we are interested.

Consider that a binary secret must be processed by a program. As usual, a defender wants to protect the secret value, whereas an attacker wants to infer it by observing the system’s output. Assume the defender can choose which among two alternative versions of the program to run. Both programs take the secret value x as high input and a binary low input a whose value is chosen by the attacker. They both return the output in a low variable y (we adopt the usual convention in QIF of referring to secret variables, inputs and outputs in programs as high and to their observable counterparts as low). Program 0 returns the binary product of x and a, whereas Program 1 flips a coin with bias

\frac{a}{3}

(i.e., a coin that returns heads with probability

\frac{a}{3}

) and returns x if the result is heads and the complement

\bar{x}

of x otherwise. The two programs are represented in Figure 1.

Figure 1. Alternative programs for the running example.

The combined choices of the defender’s and of the attacker’s determine how the system behaves. Let

D = {0, 1}

represent the set of the defender’s choices, i.e., the index of the program to use, and

A = {0, 1}

represent the set of the attacker’s choices, i.e., the value of the low input a. We shall refer to the elements of

D

and

A

as actions. For each possible combination of actions

d \in D

and

a \in A

, we can construct a channel

C_{d a}

modeling how the resulting system behaves. Each channel

C_{d a}

is a function of type

X \times Y \to R

, where

X = {0, 1}

is the set of possible high input values for the system and

Y = {0, 1}

is the set of possible output values from the system. Intuitively, each channel provides the probability that the system (which was fixed by the defender) produces output

y \in Y

given that the high input is

x \in X

(and that the low input was fixed by the attacker). The four possible channels are depicted in Table 1.

Table 1. The four channels

C_{d a}

for

d, a \in {0, 1}

for the running example.

Note that channel

C_{00}

does not leak any information about the input x (i.e., it is non-interferent), whereas channels

C_{01}

and

C_{10}

completely reveal x. Channel

C_{11}

is an intermediate case: it leaks some information about x, but not all.

We want to investigate how the defender’s and the attacker’s choices influence the leakage of the system. For that, we can just consider the (simpler) notion of posterior vulnerability, since in order to make the comparison fair, we need to assume that the prior is always the same in the various scenarios, and this implies that the leakage is in a one-to-one correspondence with the posterior vulnerability (this happens for both additive and multiplicative leakage).

For this example, assume we are interested in Bayes vulnerability [9,10], defined as

V (π) = {max}_{x} π (x)

for every

π \in D X

. Assume for simplicity that the prior is the uniform prior

π_{u}

. In this case, we know from [16] that the posterior Bayes vulnerability of a channel is the sum of the greatest elements of each column, divided by the total number of inputs. Table 2 provides the Bayes vulnerability

V_{d a} \overset{def}{=} V [π_{u}, C_{d a}]

of each channel considered above.

Table 2. Bayes vulnerability of each channel

C_{d a}

for the running example.

Naturally, the attacker aims at maximizing the vulnerability of the system, while the defender tries to minimize it. The resulting vulnerability will depend on various factors, in particular on whether the two players make their choice simultaneously (i.e., without knowing the choice of the opponent) or sequentially. Clearly, if the choice of a player who moves first is known by an opponent who moves second, the opponent will be at an advantage. In the above example, for instance, if the defender knows the choice a of the attacker, the most convenient choice for him/her is to set

d = a

, and the vulnerability will be at most

\frac{2}{3}

. The other way around, if the attacker knows the choice d of the defender, the most convenient choice for him/her is to set

a \neq d

. The vulnerability in this case will be one.

Things become more complicated when players make choices simultaneously. None of the pure choices of d and a are the best for the corresponding player, because the vulnerability of the system depends also on the (unknown) choice of the other player. Yet, there is a strategy leading to the best possible situation for both players (the Nash equilibrium), but it is mixed (i.e., probabilistic), in that the players randomize their choices according to some precise distribution.

Another factor that affects vulnerability is whether or not the defender’s choice is known to the attacker at the moment in which he/she observes the output of the channel. Obviously, this corresponds to whether or not the attacker knows what channel he/she is observing. Both cases are plausible: naturally, the defender has all the interest in keeping his/her choice (and hence, the channel used) secret, since then, the attack will be less effective (i.e., leakage will be smaller). On the other hand, the attacker may be able to identify the channel used anyway, for instance because the two programs have different running times. We will call these two cases hidden and visible choice, respectively.

It is possible to model players’ strategies, as well as hidden and visible choices, as operations on channels. This means that we can look at the whole system as if it were a single channel, which will turn out to be useful for some proofs of our technical results. The next section is dedicated to the definition of these operators. We will calculate the exact values for our example in Section 5.

4. Choice Operators for Protocol Composition

In this section, we define the operators of visible and hidden choice for protocol composition. These operators are formally defined on the channel matrices of the protocols, and since channels are a particular kind of matrix, we use these matrix operations to define the operations of visible and hidden choice among channels and to prove important properties of these channel operations.

4.1. Matrices and Their Basic Operators

Given two sets

X

and

Y

, a matrix is a total function of type

X \times Y \to R

. Two matrices

M_{1} : X_{1} \times Y_{1} \to R

and

M_{2} : X_{2} \times Y_{2} \to R

are said to be compatible if

X_{1} = X_{2}

. If it is also the case that

Y_{1} = Y_{2}

, we say that the matrices have the same type. The scalar multiplication

r \cdot M

between a scalar r and a matrix M is defined as usual, and so is the summation

(\sum_{i \in I} M_{i}) (x, y) = M_{i_{1}} (x, y) + \dots + M_{i_{n}} (x, y)

of a family

{M_{i}}_{i \in I}

of matrices all of a same type.

Given a family

{M_{i}}_{i \in I}

of compatible matrices s.t. each

M_{i}

has type

X \times Y_{i} \to R

, their concatenation

⋄_{i \in I}

is the matrix having all columns of every matrix in the family, in such a way that every column is tagged with the matrix from which it came. Formally,

(⋄_{i \in I} M_{i}) (x, (y, j)) = M_{j} (x, y)

, if

y \in Y_{j}

, and the resulting matrix has type

X \times (⨆_{i \in I} Y_{i}) \to R

. (We use

⨆_{i \in I} Y_{i} = Y_{i_{1}} ⊔ Y_{i_{2}} ⊔ \dots ⊔ Y_{i_{n}}

to denote the disjoint union

{(y, i) ∣ y \in Y_{i}, i \in I}

of the sets

Y_{i_{1}}

,

Y_{i_{2}}

, …,

Y_{i_{n}}

.) When the family

{M_{i}}

has only two elements we may use the binary version ◊ of the concatenation operator. The following depicts the concatenation of two matrices

M_{1}

and

M_{2}

in tabular form.

\begin{array}{c} M_{1} & y_{1} & y_{2} \\ x_{1} & 1 & 2 \\ x_{2} & 3 & 4 \end{array} ⋄ \begin{array}{c} M_{2} & y_{1} & y_{2} & y_{3} \\ x_{1} & 5 & 6 & 7 \\ x_{2} & 8 & 9 & 10 \end{array} = \begin{array}{c} M_{1} ⋄ M_{2} & (y_{1}, 1) & (y_{2}, 1) & (y_{1}, 2) & (y_{2}, 2) & (y_{3}, 2) \\ x_{1} & 1 & 2 & 5 & 6 & 7 \\ x_{2} & 3 & 4 & 8 & 9 & 10 \end{array}

4.2. Channels and Their Hidden and Visible Choice Operators

A channel is a stochastic matrix, i.e., all elements are non-negative, and all rows sum up to one. Here, we will define two operators specific for channels. In the following, for any real value

0 \leq p \leq 1

, we denote by

\bar{p}

the value

1 - p

.

4.2.1. Hidden Choice

The first operator models a hidden probabilistic choice among channels. Consider a family

{\{C_{i}\}}_{i \in I}

of channels of the same type. Let

μ \in D I

be a probability distribution on the elements of the index set

I

. Consider an input x is fed to one of the channels in

{\{C_{i}\}}_{i \in I}

, where the channel is randomly picked according to

μ

. More precisely, an index

i \in I

is sampled with probability

μ (i)

, then the input x is fed to channel

C_{i}

, and the output y produced by the channel is then made visible, but not the index i of the channel that was used. Note that we consider hidden choice only among channels of the same type: if the sets of outputs were not identical, the produced output might implicitly reveal which channel was used.

Formally, given a family

{C_{i}}_{i \in I}

of channels s.t. each

C_{i}

has same type

X \times Y \to R

, the hidden choice operator

⨊_{i \leftarrow μ}

is defined as

⨊_{i \leftarrow μ} C_{i} = \sum_{i \in I} μ (i) C_{i}

.

Proposition 1 (Type of hidden choice).

Given a family

{C_{i}}_{i \in I}

of channels of type

X \times Y \to R

, and a distribution

μ

on

I

, the hidden choice

⨊_{i \leftarrow μ} C_{i}

is a channel of type

X \times Y \to R

.

See Appendix A for the proof.

In the particular case in which the family

{C_{i}}

has only two elements

C_{i_{1}}

and

C_{i_{2}}

, the distribution

μ

on indexes is completely determined by a real value

0 \leq p \leq 1

s.t.

μ (i_{1}) = p

and

μ (i_{2}) = \bar{p}

. In this case, we may use the binary version

_{p} \oplus

of the hidden choice operator:

C_{i_{1}}_{p} \oplus C_{i_{2}} = p C_{i_{1}} + \bar{p} C_{i_{2}}

. The example below depicts the hidden choice between channels

C_{1}

and

C_{2}

, with probability

p = \frac{1}{3}

.

\begin{array}{c} C_{1} & y_{1} & y_{2} \\ x_{1} & \frac{1}{2} & \frac{1}{2} \\ x_{2} & \frac{1}{3} & \frac{2}{3} \end{array}_{\frac{1}{3}} \oplus \begin{array}{c} C_{2} & y_{1} & y_{2} \\ x_{1} & \frac{1}{3} & \frac{2}{3} \\ x_{2} & \frac{1}{2} & \frac{1}{2} \end{array} = \begin{array}{c} C_{1}_{\frac{1}{3}} \oplus C_{2} & y_{1} & y_{2} \\ x_{1} & \frac{7}{18} & \frac{11}{18} \\ x_{2} & \frac{4}{9} & \frac{5}{9} \end{array}

4.2.2. Visible Choice

The second operator models a visible probabilistic choice among channels. Consider a family

{\{C_{i}\}}_{i \in I}

of compatible channels. Let

μ \in D I

be a probability distribution on the elements of the index set

I

. Consider an input x is fed to one of the channels in

{\{C_{i}\}}_{i \in I}

, where the channel is randomly picked according to

μ

. More precisely, an index

i \in I

is sampled with probability

μ (i)

, then the input x is fed to channel

C_{i}

, and the output y produced by the channel is then made visible, along with the index i of the channel that was used. Note that visible choice makes sense only between compatible channels, but it is not required that the output set of each channel be the same.

Formally, given

{C_{i}}_{i \in I}

of compatible channels s.t. each

C_{i}

has type

X \times Y_{i} \to R

, and a distribution

μ

on

I

, the visible choice operator

{⌊ \cdot ⌋}_{i \leftarrow μ}

is defined as

{⌊ \cdot ⌋}_{i \leftarrow μ} C_{i} = ⋄_{i \in I} μ (i) C_{i}

.

Proposition 2 (Type of visible choice).

Given a family

{C_{i}}_{i \in I}

of compatible channels s.t. each

C_{i}

has type

X \times Y_{i} \to R

and a distribution

μ

on

I

, the result of the visible choice

{⌊ \cdot ⌋}_{i \leftarrow μ} C_{i}

is a channel of type

X \times (⨆_{i \in I} Y_{i}) \to R

.

See Appendix A for the proof.

In the particular case that the family

{C_{i}}

has only two elements

C_{i_{1}}

and

C_{i_{2}}

, the distribution

μ

on indexes is completely determined by a real value

0 \leq p \leq 1

s.t.

μ (i_{1}) = p

and

μ (i_{2}) = \bar{p}

. In this case, we may use the binary version

_{p} ⌊ \cdot ⌋

of the visible choice operator:

C_{i_{1}}_{p} ⌊ \cdot ⌋ C_{i_{2}} = p C_{i_{1}} ⋄ \bar{p} C_{i_{2}}

. The following depicts the visible choice between channels

C_{1}

and

C_{3}

, with probability

p = \frac{1}{3}

.

\begin{array}{c} C_{1} & y_{1} & y_{2} \\ x_{1} & \frac{1}{2} & \frac{1}{2} \\ x_{2} & \frac{1}{3} & \frac{2}{3} \end{array}_{\frac{1}{3}} ⌊ \cdot ⌋ \begin{array}{c} C_{3} & y_{1} & y_{3} \\ x_{1} & \frac{1}{3} & \frac{2}{3} \\ x_{2} & \frac{1}{2} & \frac{1}{2} \end{array} = \begin{array}{c} C_{1}_{\frac{1}{3}} ⌊ \cdot ⌋ C_{3} & (y_{1}, 1) & (y_{2}, 1) & (y_{1}, 3) & (y_{3}, 3) \\ x_{1} & \frac{1}{6} & \frac{1}{6} & \frac{2}{9} & \frac{4}{9} \\ x_{2} & \frac{1}{9} & \frac{2}{9} & \frac{1}{3} & \frac{1}{3} \end{array}

4.3. Properties of Hidden and Visible Choice Operators

We now prove algebraic properties of channel operators. These properties will be useful when we model a (more complex) protocol as the composition of smaller channels via hidden or visible choice.

Whereas the properties of hidden choice hold generally with equality, those of visible choice are subtler. For instance, visible choice is not idempotent, since in general

C_{p} ⌊ \cdot ⌋ C \neq C

(in fact, if C has type

X \times Y \to R

,

C_{p} ⌊ \cdot ⌋ C

has type

X \times (Y ⊔ Y) \to R

). However, idempotency and other properties involving visible choice hold if we replace the notion of equality with the more relaxed notion of “equivalence” between channels. Intuitively, two channels are equivalent if they have the same input space and yield the same value of vulnerability for every prior and every vulnerability function.

Definition 1 (Equivalence of channels).

Two compatible channels

C_{1}

and

C_{2}

with domain

X

are equivalent, denoted by

C_{1} \approx C_{2}

, if for every prior

π \in D X

and every posterior vulnerability

V

, we have

V [π, C_{1}] = V [π, C_{2}]

.

Two equivalent channels are indistinguishable from the point of view of information leakage, and in most cases, we can just identify them. Indeed, nowadays, there is a tendency to use abstract channels [8,17], which capture exactly the important behavior with respect to any form of leakage. In this paper, however, we cannot use abstract channels because the hidden choice operator needs a concrete representation in order to be defined unambiguously.

The first properties we prove regard idempotency of operators, which can be used do simplify the representation of some protocols.

Proposition 3 (Idempotency).

Given a family

{C_{i}}_{i \in I}

of channels s.t.

C_{i} = C

for all

i \in I

, and a distribution

μ

on

I

, then: (a)

⨊_{i \leftarrow μ} C_{i} = C

; and (b)

{⌊ \cdot ⌋}_{i \leftarrow μ} C_{i} \approx C

.

See Appendix A for the proof.

The following properties regard the reorganization of operators, and they will be essential in some technical results in which we invert the order in which hidden and visible choice are applied in a protocol.

Proposition 4 (“Reorganization of operators”).

Given a family

{C_{i j}}_{i \in I, j \in J}

of channels indexed by sets

I

and

J

, a distribution

μ

on

I

and a distribution

η

on

J

:

(a): $⨊_{i \leftarrow μ} ⨊_{j \leftarrow η} C_{i j} = ⨊_{\begin{matrix} i \leftarrow μ \\ j \leftarrow η \end{matrix}} C_{i j}$ , if all $C_{i}$ ’s have the same type;
(b): ${⌊ \cdot ⌋}_{i \leftarrow μ} {⌊ \cdot ⌋}_{j \leftarrow η} C_{i j} \approx {⌊ \cdot ⌋}_{\begin{matrix} i \leftarrow μ \\ j \leftarrow η \end{matrix}} C_{i j}$ , if all $C_{i}$ ’s are compatible; and
(c): $⨊_{i \leftarrow μ} {⌊ \cdot ⌋}_{j \leftarrow η} C_{i j} \approx {⌊ \cdot ⌋}_{j \leftarrow η} ⨊_{i \leftarrow μ} C_{i j}$ , if, for each i, all $C_{i j}$ ’s have the same type $X \times Y_{j} \to R$ .

See Appendix A for the proof.

Finally, analogous properties of the binary operators are shown in Appendix B.

4.4. Properties of Vulnerability w.r.t. Channel Operators

We now derive some relevant properties of vulnerability w.r.t. our channel operators, which will be later used to obtain the Nash equilibria in information leakage games with different choice operations.

The first result states that posterior vulnerability is convex w.r.t. hidden choice (this result was already presented in [6]) and linear w.r.t. to visible choice.

Theorem 2 (Convexity/linearity of posterior vulnerability w.r.t. choices).

Let

{C_{i}}_{i \in I}

be a family of channels and

μ

be a distribution on

I

. Then, for every distribution

π

on

X

and every vulnerability

V

:

1.: posterior vulnerability is convex w.r.t. to hidden choice: $V [π, ⨊_{i \leftarrow μ} C_{i}] \leq \sum_{i \in I} μ (i) V [π, C_{i}]$ if all $C_{i}$ ’s have the same type.
2.: posterior vulnerability is linear w.r.t. to visible choice: $V [π, {⌊ \cdot ⌋}_{i \leftarrow μ} C_{i}] = \sum_{i \in I} μ (i) V [π, C_{i}]$ if all $C_{i}$ ’s are compatible.

Proof.

Let us call $X \times Y \to R$ the type of each channel $C_{i}$ in the family ${C_{i}}$ . Then:

$\begin{matrix} V [π, \underset{i \leftarrow μ}{⨊} C_{i}] = & V [π, \sum_{i} μ (i) C_{i}] & (by the definition of hidden choice) \\ = & \sum_{y \in Y} p (y) \cdot V [\frac{π (\cdot) \sum_{i} μ (i) C_{i} (\cdot, y)}{p (y)}] & (by the definition of posterior V) \\ = & \sum_{y \in Y} p (y) \cdot V [\sum_{i} μ (i) \frac{π (\cdot) C_{i} (\cdot, y)}{p (y)}] \\ \leq & \sum_{y \in Y} p (y) \cdot \sum_{i} μ (i) V [\frac{π (\cdot) C_{i} (\cdot, y)}{p (y)}] & (by the convexity of V) \\ = & \sum_{i} μ (i) \sum_{y \in Y} p (y) V [\frac{π (\cdot) C_{i} (\cdot, y)}{p (y)}] \\ = & \sum_{i} μ (i) V [π, C_{i}] \end{matrix}$

where $p (y) = \sum_{x \in X} π (x) \sum_{i} μ (i) C_{i} (x, y)$ .
Let us call $X \times Y_{i} \to R$ the type of each channel $C_{i}$ in the family ${C_{i}}$ . Then:

$\begin{matrix} V [π, {⌊ \cdot ⌋}_{i \leftarrow μ} C_{i}] = & V [π, ⋄_{i} μ (i) C_{i}] & (by the definition of visible choice) \\ = & \sum_{y \in Y} p (y) \cdot V [\frac{π (\cdot) ⋄_{i} μ (i) C_{i} (\cdot, y)}{p (y)}] & (by the definition of posterior V) \\ = & \sum_{y \in Y} p (y) \cdot V [⋄_{i} μ (i) \frac{π (\cdot) C_{i} (\cdot, y)}{p (y)}] \\ = & \sum_{y \in Y} p (y) \cdot \sum_{i} μ (i) V [\frac{π (\cdot) C_{i} (\cdot, y)}{p (y)}] & (see (*) below) \\ = & \sum_{i} μ (i) \sum_{y \in Y} p (y) V [\frac{π (\cdot) C_{i} (\cdot, y)}{p (y)}] \\ = & \sum_{i} μ (i) V [π, C_{i}] \end{matrix}$

where $p (y) = \sum_{x \in X} π (x) \sum_{i} μ (i) C_{i} (x, y)$ , and step (*) holds because in the vulnerability of a concatenation of matrices, every column will contribute to the vulnerability in proportion to its weight in the concatenation; hence, it is possible to break the vulnerability of a concatenated matrix as the weighted sum of the vulnerabilities of its sub-matrices.

☐

The next result is concerned with posterior vulnerability under the composition of channels using both operators.

Corollary 1 (Convex-linear payoff function).

Let

{C_{i j}}_{i \in I, j \in J}

be a family of channels, all with domain

X

and with the same type, and let

π \in D X

, and

V

be any vulnerability. Define

U : D I \times D J \to R

as follows:

U (μ, η) \overset{def}{=} V [π, ⨊_{i \leftarrow μ} {⌊ \cdot ⌋}_{j \leftarrow η} C_{i j}]

. Then,

U

is convex on

μ

and linear on

η

.

Proof.

To see that

U (μ, η)

is convex on

μ

, note that:

\begin{matrix} U (μ, η) = & V [π, \underset{i \leftarrow μ}{⨊} \underset{j \leftarrow η}{⌊ \cdot ⌋} C_{i j}] & (by definition) \\ \leq & \sum_{i} μ (i) V [π, \underset{j \leftarrow η}{⌊ \cdot ⌋} C_{i j}] & (by Theorem 2) \end{matrix}

To see that

U (μ, η)

is linear on

η

, note that:

\begin{matrix} U (μ, η) = & V [π, \underset{i \leftarrow μ}{⨊} \underset{j \leftarrow η}{⌊ \cdot ⌋} C_{i j}] & (by definition) \\ = & V [π, \underset{j \leftarrow η}{⌊ \cdot ⌋} \underset{i \leftarrow μ}{⨊} C_{i j}] & (by Proposition 4) \\ = & \sum_{j} η (j) V [π, \underset{i \leftarrow μ}{⨊} C_{i j}] & (by Theorem 2) \end{matrix}

☐

5. Information Leakage Games

In this section, we present our framework for reasoning about information leakage, extending the notion of information leakage games proposed in [6] from only simultaneous games with hidden choice to both simultaneous and sequential games, with either hidden or visible choice.

In an information leakage game, the defender tries to minimize the leakage of information from the system, while the attacker tries to maximize it. In this basic scenario, their goals are just opposite (zero-sum). Both of them can influence the execution and the observable behavior of the system via a specific set of actions. We assume players to be rational (i.e., they are able to figure out what is the best strategy to maximize their expected payoff) and that the set of actions and the payoff function are common knowledge.

Players choose their own strategy, which in general may be probabilistic (i.e., behavioral or mixed) and choose their action by a random draw according to that strategy. After both players have performed their actions, the system runs and produces some output value, which is visible to the attacker and may leak some information about the secret. The amount of leakage constitutes the attacker’s gain and the defender’s loss.

To quantify the leakage, we model the system as an information-theoretic channel (cf. Section 2.2). We recall that leakage is defined as the difference (additive leakage) or the ratio (multiplicative leakage) between posterior and prior vulnerability. Since we are only interested in comparing the leakage of different channels for a given prior, we will define the payoff just as the posterior vulnerability, as the value of prior vulnerability will be the same for every channel.

5.1. Defining Information Leakage Games

A (information) leakage game consists of:

(1): two nonempty sets $D$ , $A$ of defender’s and attacker’s actions, respectively,
(2): a function $C : D \times A \to (X \times Y \to R)$ that associates with each pair of actions $(d, a) \in D \times A$ a channel $C_{d a} : X \times Y \to R$ ,
(3): a prior $π \in D X$ on secrets and
(4): a vulnerability measure $V$ , used to define the payoff function $u : D \times A \to R$ for pure strategies as $u (d, a) \overset{def}{=} V [π, C_{d a}]$ . We have only one payoff function because the game is zero-sum.

Like in traditional game theory, the order of actions and the extent by which a player knows the move performed by the opponent play a critical role in deciding strategies and determining the payoff. In security, however, knowledge of the opponent’s move affects the game in yet another way: the effectiveness of the attack, i.e., the amount of leakage, depends crucially on whether or not the attacker knows what channel is being used. It is therefore convenient to distinguish two phases in the leakage game:

Phase 1: determination of players’ strategies and the subsequent choice of their actions.
Each player determines the most convenient strategy (which in general is probabilistic) for himself/herself, and draws his/her action accordingly. One of the players may commit first to his/her action, and his/her choice may or may not be revealed to the follower. In general, knowledge of the leader’s action may help the follower choose a more advantageous strategy.
Phase 2: observation of the resulting channel’s output and payoff computation.
The attacker observes the output of the selected channel $C_{d a}$ and performs his/her attack on the secret. In case he/she knows the defender’s action, he/she is able to determine the exact channel $C_{d a}$ being used (since, of course, the attacker knows his/her own action), and his/her payoff will be the posterior vulnerability $V [π, C_{d a}]$ . However, if the attacker does not know exactly which channel has been used, then his/her payoff will be smaller.

Note that the issues raised in Phase 2 are typical of leakage games; they do not have a correspondence (to the best of our knowledge) in traditional game theory. Indeed, in traditional game theory, the resulting payoff is a deterministic function of all players’ actions. On the other hand, the extra level of randomization provided by the channel is central to security, as it reflects the principle of preventing the attacker from inferring the secret by obfuscating the link between the secret and observables.

Following the above discussion, we consider various possible scenarios for games, along two lines of classification. The first classification concerns Phase 1 of the game, in which strategies are selected and actions are drawn, and consists of three possible orders for the two players’ actions.

Simultaneous.
The players choose (draw) their actions in parallel, each without knowing the choice of the other.
Sequential, defender-first.
The defender draws an action, and commits to it, before the attacker does.
Sequential, attacker-first.
The attacker draws an action, and commits to it, before the defender does.

Note that these sequential games may present imperfect information (i.e., the follower may not know the leader’s action) and that we have to further specify whether we use behavioral or mixed strategies.

The second classification concerns Phase 2 of the game, in which some leakage occurs as a consequence of the attacker’s observation of the channel’s output and consists of two kinds of knowledge the attacker may have at this point about the channel that was used.

Visible choice.
The attacker knows the defender’s action when he/she observes the output of the channel, and therefore, he/she knows which channel is being used. Visible choice is modeled by the operator $⌊ \cdot ⌋$ .
Hidden choice.
The attacker does not know the defender’s action when he/she observes the output of the channel, and therefore, in general, he/she does not exactly know which channel is used (although in some special cases, he/she may infer it from the output). Hidden choice is modeled by the operator ⨊.

Note that the distinction between sequential and simultaneous games is orthogonal to that between visible and hidden choice. Sequential and simultaneous games model whether or not, respectively, the follower’s choice can be affected by knowledge of the leader’s action. This dichotomy captures how knowledge about the other player’s actions can help a player choose his/her own action, and it concerns how Phase 1 of the game occurs. On the other hand, visible and hidden choice capture whether or not, respectively, the attacker is able to fully determine the channel representing the system, once the defender and attacker’s actions have already been fixed. This dichotomy reflects the different amounts of information leaked by the system as viewed by the attacker, and it concerns how Phase 2 of the game occurs. For instance, in a simultaneous game, neither player can choose his/her action based on the choice of the other. However, depending on whether or not the defender’s choice is visible, the attacker will or will not, respectively, be able to completely recover the channel used, which will affect the amount of leakage.

If we consider also the subdivision of sequential games into perfect and imperfect information, there are 10 possible different combinations. Some, however, make little sense. For instance, the defender-first sequential game with perfect information (by the attacker) does not combine naturally with hidden choice ⨊, since that would mean that the attacker knows the action of the defender and chooses his/her strategy accordingly, but forgets it at the moment of computing the channel and its vulnerability (we assume perfect recall, i.e., the players never forget what they have learned). Yet, other combinations are not interesting, such as the attacker-first sequential game with (totally) imperfect information (by the defender), since it coincides with the simultaneous-game case. Note that the attacker and defender are not symmetric with respect to hiding/revealing their actions a and d, since the knowledge of a affects the game only in the usual sense of game theory (in Phase 1), while the knowledge of d also affects the computation of the payoff (in Phase 2). Note that the attacker and defender are not symmetric with respect to hiding/revealing their actions a and d, since the knowledge of a affects the game only in the usual sense of game theory, while the knowledge of d also affects the computation of the payoff (cf. “Phase 2” above). Other possible combinations would come from the distinction between behavioral and mixed strategies, but, as we will see, they are always equivalent except in one scenario, so for the sake of conciseness, we prefer to treat it as a case apart.

Table 3 lists the meaningful and interesting combinations. In Game V, we assume imperfect information: the attacker does not know the action chosen by the defender. In all the other sequential games, we assume that the follower has perfect information. In the remainder of this section, we discuss each game individually, using the example of Section 3 as a running example.

Table 3. Kinds of games we consider. Sequential games have perfect information, except for Game V.

5.1.1. Game I (Simultaneous with Visible Choice)

This simultaneous game can be represented by a tuple

(D, A, u)

. As in all games with visible choice

⌊ \cdot ⌋

, the expected payoff

U

of a mixed strategy profile

(δ, α)

is defined to be the expected value of u, as in traditional game theory:

\begin{matrix} U (δ, α) \overset{def}{=} \underset{\begin{matrix} d \leftarrow δ \\ a \leftarrow α \end{matrix}}{E} u (d, a) = \sum_{\begin{matrix} d \in D \\ a \in A \end{matrix}} δ (d) α (a) u (d, a), \end{matrix}

where we recall that

u (d, a) = V [π, C_{d a}]

.

From Theorem 2 (2), we derive that

U (δ, α) = V [π, {⌊ \cdot ⌋}_{\begin{matrix} d \leftarrow δ \\ a \leftarrow α \end{matrix}} C_{d a}]

, and hence, the whole system can be equivalently regarded as the channel

{⌊ \cdot ⌋}_{\begin{matrix} d \leftarrow δ \\ a \leftarrow α \end{matrix}} C_{d a}

. Still from Theorem 2 (2), we can derive that

U (δ, α)

is linear in

δ

and

α

. Therefore the Nash equilibrium can be computed using the standard method (cf. Section 2.1).

Example 5.

Consider the example of Section 3 in the setting of Game I, with a uniform prior. The Nash equilibrium

(δ^{*}, α^{*})

can be obtained using the closed formula from Section 2.1, and it is given by

δ^{*} (0) = α^{*} (0) = \frac{(\frac{2}{3} - 1)}{(\frac{1}{2} - 1 - 1 + \frac{2}{3})} = \frac{2}{5} .

The corresponding payoff is

U (δ^{*}, α^{*}) = \frac{2}{5} \frac{2}{5} \frac{1}{2} + \frac{2}{5} \frac{3}{5} + \frac{3}{5} \frac{2}{5} + \frac{3}{5} \frac{3}{5} \frac{2}{3} = \frac{4}{5}

.

5.1.2. Game II (Defender-First with Visible Choice)

This defender-first sequential game can be represented by a tuple

(D, D \to A, u)

. We will first consider mixed strategies for the follower (which in this case is the attacker), namely strategies of type

D (D \to A)

. Hence, a (mixed) strategy profile is of the form

(δ, σ_{a})

, with

δ \in D D

and

σ_{a} \in D (D \to A)

, and the corresponding payoff is:

\begin{matrix} U (δ, σ_{a}) \overset{def}{=} \underset{\begin{matrix} d \leftarrow δ \\ s_{a} \leftarrow σ_{a} \end{matrix}}{E} u (d, s_{a} (d)) = \sum_{\begin{matrix} d \in D \\ s_{a} : D \to A \end{matrix}} δ (d) σ_{a} (s_{a}) u (d, s_{a} (d)), \end{matrix}

where

u (d, s_{a} (d)) = V [π, C_{d s_{a} (d)}]

.

Again, from Theorem 2 (2), we derive:

U (δ, σ_{a}) = V [π, {⌊ \cdot ⌋}_{\begin{matrix} d \leftarrow δ \\ s_{a} \leftarrow σ_{a} \end{matrix}} C_{d s_{a} (d)}]

, and hence, the system can be expressed as a channel

{⌊ \cdot ⌋}_{\begin{matrix} d \leftarrow δ \\ s_{a} \leftarrow σ_{a} \end{matrix}} C_{d s_{a} (d)}

. From the same theorem, we also derive that

U (δ, σ_{a})

is linear in

δ

and

σ_{a}

, so the mutually optimal strategies can be obtained again by solving the minimax problem. In this case, however, the solution is particularly simple, because there are always deterministic optimal strategy profiles. We first consider the case of attacker’s strategies of type

D (D \to A)

.

Theorem 3 (Pure-strategy Nash equilibrium in Game II: strategies of type

D (D \to A)

).

Consider a defender-first sequential game with visible choice and attacker’s strategies of type

D (D \to A)

. Let

d^{*} \overset{def}{=} {argmin}_{d} {max}_{a} u (d, a)

, and let

s_{a}^{*} : D \to A

be defined as

s_{a}^{*} (d) \overset{def}{=} {argmax}_{a} u (d, a)

(if there is more than one a that maximizes

u (d, a)

, we select one of them arbitrarily). Then, for every

δ \in D D

and

σ_{a} \in D (D \to A)

, we have

U (d^{*}, σ_{a}) \leq u (d^{*}, s_{a}^{*} (d^{*})) \leq U (δ, s_{a}^{*})

.

Proof.

Let

δ

and

σ_{a}

be arbitrary elements of

D D

and

D (D \to A)

, respectively. Then:

\begin{matrix} U (d^{*}, σ_{a}) = & \sum_{s_{a} : D \to A} σ_{a} (s_{a}) u (d^{*}, s_{a} (d^{*})) \\ \leq & \sum_{s_{a} : D \to A} σ_{a} (s_{a}) u (d^{*}, s_{a}^{*} (d^{*})) & (by the definition of s_{a}^{*}) \\ = & u (d^{*}, s_{a}^{*} (d^{*})) & (since σ_{a} is a distribution) \\ = & \sum_{d \in D} δ (d) u (d^{*}, s_{a}^{*} (d^{*})) & (since δ is a distribution) \\ \leq & \sum_{d \in D} δ (d) u (d, s_{a}^{*} (d)) & (by the definition of d^{*}) \\ = & U (δ, s_{a}^{*}) \end{matrix}

☐

Hence, to find the optimal strategy, it is sufficient for the defender to find the action

d^{*}

that minimizes

{max}_{a} u (d^{*}, a)

, while the attacker’s optimal choice is the pure strategy

s_{a}^{*}

such that

s_{a}^{*} (d) = {argmax}_{a} u (d, a)

, where d is the (visible) move by the defender.

Example 6.

Consider the example of Section 3 in the setting of Game II, with uniform prior. If the defender chooses zero, then the attacker chooses one. If the defender chooses one, then the attacker chooses zero. In both cases, the payoff is one. The game has therefore two solutions,

(δ_{1}^{*}, α_{1}^{*})

and

(δ_{2}^{*}, α_{2}^{*})

, with

δ_{1}^{*} (0) = 1

,

α_{1}^{*} (0) = 0

and

δ_{2}^{*} (0) = 0

,

α_{2}^{*} (1) = 1

.

Consider now the case of behavioral strategies. Following the same line of reasoning as before, we can see that under the strategy profile

(δ, ϕ_{a})

, the system can be expressed as the channel:

\underset{d \leftarrow δ}{⌊ \cdot ⌋} \underset{a \leftarrow ϕ_{a} (d)}{⌊ \cdot ⌋} C_{d a} .

This is also in this case that there are deterministic optimal strategy profiles. An optimal strategy for the follower (in this case, the attacker) consists of looking at the action d chosen by the leader and then selecting with probability one the action a that maximizes

u (d, a)

.

Theorem 4 (Pure-strategy Nash equilibrium in Game II: strategies of type

D \to D (A)

).

Consider a defender-first sequential game with visible choice and attacker’s strategies of type

D \to D (A)

. Let

d^{*} \overset{def}{=} {argmin}_{d} {max}_{a} u (d, a)

, and let

ϕ_{a}^{*} : D \to D (A)

be defined as

ϕ_{a}^{*} (d) (a) \overset{def}{=} 1

if

a = {argmax}_{a^{'}} u (d, a^{'})

(if there is more than one such a, we select one of them arbitrarily), and

ϕ_{a}^{*} (d) (a) \overset{def}{=} 0

otherwise. Then, for every

δ \in D D

and

ϕ_{a} : D \to D (A)

, we have:

U (d^{*}, ϕ_{a} (d^{*})) \leq U (d^{*}, ϕ_{a}^{*} (d^{*})) \leq U (δ, ϕ_{a}^{*})

.

Proof.

Let

a^{*}

be the action selected by

ϕ_{a}^{*} (d^{*})

, i.e.,

ϕ_{a}^{*} (d^{*}) (a^{*}) \overset{def}{=} 1

. Then,

u (d^{*}, a^{*}) = {max}_{a} u (d^{*}, a)

. Let

δ

and

ϕ_{a}

be arbitrary elements of

D D

and

D \to D (A)

, respectively. Then:

\begin{matrix} U (d^{*}, ϕ_{a} (d^{*})) = & \sum_{a \in A} ϕ_{a} (d^{*}) (a) u (d^{*}, a) \\ \leq & \sum_{a \in A} ϕ_{a} (d^{*}) (a) u (d^{*}, a^{*}) & (since u (d^{*}, a^{*}) = {max}_{a} u (d^{*}, a)) \\ = & u (d^{*}, a^{*}) & (since ϕ_{a} (d^{*}) is a distribution) \\ = & U (d^{*}, ϕ_{a}^{*} (d^{*})) & (by the definition of a^{*}) \\ = & \sum_{d \in D} δ (d) u (d^{*}, ϕ_{a}^{*} (d^{*})) & (since δ is a distribution) \\ \leq & \sum_{d \in D} δ (d) u (d, ϕ_{a}^{*} (d)) & (by the definition of d^{*} and of ϕ_{a}^{*}) \\ = & U (δ, ϕ_{a}^{*}) \end{matrix}

☐

As a consequence of Theorems 3 and 4, we can show that in the games, we consider that the payoff of the optimal mixed and behavioral strategy profiles coincide. Note that this result could also be derived from the result from standard game theory, which states that, in the cases we consider, for any behavioral strategy, there is a mixed strategy that yields the same payoff, and vice versa [15]. However, the proof of [15] relies on Khun’s theorem, which is non-constructive (and rather complicated, because it is for more general cases). In our scenario, the proof is very simple, as we will see in the following corollary. Furthermore, since such a result does not hold for leakage games with hidden choice, we think it will be useful to show the proof formally in order to analyze the difference.

Corollary 2 (Equivalence of optimal strategies of types

D (D \to A)

and

D \to D (A)

in Game II).

Consider a defender-first sequential game with visible choice, and let

d^{*}

,

s_{a}^{*}

and

ϕ_{a}^{*}

be defined as in Theorems 3 and 4, respectively. Then,

u (d^{*}, s_{a}^{*} (d^{*})) = U (d^{*}, ϕ_{a}^{*} (d^{*})) .

Proof.

The result follows immediately by observing that

u (d^{*}, s_{a}^{*} (d^{*})) = {max}_{a} u (d^{*}, a) = u (d^{*}, a^{*}) = U (d^{*}, ϕ_{a}^{*} (d^{*}))

. ☐

5.1.3. Game III (Attacker-First with Visible Choice)

This game is also a sequential game, but with the attacker as the leader. Therefore, it can be represented as a tuple of the form

(A \to D, A, u)

. It is the same as Game II, except that the roles of the attacker and the defender are inverted. In particular, the payoff of a mixed strategy profile

(σ_{d}, α) \in D (A \to D) \times D A

is given by:

\begin{matrix} U (σ_{d}, α) \overset{def}{=} \underset{\begin{matrix} s_{d} \leftarrow σ_{d} \\ a \leftarrow α \end{matrix}}{E} u (s_{d} (a), a) = \sum_{\begin{matrix} s_{d} : A \to D \\ a \in A \end{matrix}} σ_{d} (s_{d}) α (a) u (s_{d} (a), a) \end{matrix}

and by Theorem 2 (2), the whole system can be equivalently regarded as channel

\underset{\begin{matrix} s_{d} \leftarrow σ_{d} \\ a \leftarrow α \end{matrix}}{⌊ \cdot ⌋} C_{s_{d} (a) a}

. For a behavioral strategy

(ϕ_{d}, α) \in (A \to D (D)) \times D A

, the payoff is given by:

\begin{matrix} U (ϕ_{d}, α) \overset{def}{=} \underset{a \leftarrow α}{E} \underset{d \leftarrow ϕ_{d} (a)}{E} u (d, a) = \sum_{a \in A} α (a) \sum_{d \in D} ϕ_{d} (a) (d) u (d, a) \end{matrix}

and by Theorem 2 (2), the whole system can be equivalently regarded as channel

\underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{d \leftarrow ϕ_{d} (a)}{⌊ \cdot ⌋} C_{d a}

.

Obviously, the same results that we have obtained in the previous section for Game II hold also for Game III, with the role of attacker and defender switched. We collect all these results in the following theorem.

Theorem 5 (Pure-strategy Nash equilibria in Game III and equivalence of

D (A \to D)

and

(A \to D (D))

).

Consider a defender-first sequential game with visible choice. Let

a^{*} \overset{def}{=} {argmax}_{a} {min}_{d} u (d, a)

. Let

s_{d}^{*} : A \to D

be defined as

s_{d}^{*} (a) \overset{def}{=} {argmin}_{d} u (d, a)

, and let

ϕ_{d}^{*} : A \to D (D)

be defined as

ϕ_{d}^{*} (a) (d) \overset{def}{=} 1

if

d = {argmin}_{d^{'}} u (d^{'}, a)

. Then:

1.: For every $α \in D A$ and $σ_{d} \in D (A \to D)$ , we have $U (s_{d}^{*}, α) \leq u (s_{d}^{*} (a^{*}), a^{*}) \leq U (σ_{d}, a^{*})$ .
2.: For every $α \in D A$ and $ϕ_{d} : A \to D (D)$ , we have: $U (ϕ_{d}^{*}, α) \leq U (ϕ_{d}^{*} (a^{*}), a^{*}) \leq U (ϕ_{d} (a^{*}), a^{*})$ .
3.: $u (s_{d}^{*} (a^{*}), a^{*}) = U (ϕ_{d}^{*} (a^{*}), a^{*})$ .

Proof.

These results can be proven by following the same lines as the proofs of Theorems 3 and 4 and Corollary 2. ☐

Example 7.

Consider now the example of Section 3 in the setting of Game III, with uniform prior. If the attacker chooses zero, then the defender chooses zero, and the payoff is

\frac{1}{2}

. If the attacker chooses one, then the defender chooses one, and the payoff is

\frac{2}{3}

. The latter case is more convenient for the attacker; hence, the solution of the game is the strategy profile

(δ^{*}, α^{*})

with

δ^{*} (0) = 0

,

α^{*} (0) = 0

.

5.1.4. Game IV (Simultaneous with Hidden Choice)

The simultaneous game with hidden choice is a tuple

(D, A, u)

. However, it is not an ordinary game in the sense that the payoff of a mixed strategy profile cannot be defined by averaging the payoff of the corresponding pure strategies. More precisely, the payoff of a mixed profile is defined by averaging on the strategy of the attacker, but not on that of the defender. In fact, when hidden choice is used, there is an additional level of uncertainty in the relation between the observables and the secret from the point of view of the attacker, since he/she is not sure about which channel is producing those observables. A mixed strategy

δ

for the defender produces a convex combination of channels (the channels associated with the pure strategies) with the same coefficients, and we know from previous sections that the vulnerability is a convex function of the channel and in general is not linear.

In order to define the payoff of a mixed strategy profile

(δ, α)

, we need therefore to consider the channel that the attacker perceives given his/her limited knowledge. Let us assume that the action that the attacker draws from

α

is a. He does not know the action of the defender, but we can assume that he/she knows his/her strategy (each player can derive the optimal strategy of the opponent, under the assumption of common knowledge and rational players).

The channel the attacker will see is

⨊_{d \leftarrow δ} C_{d a}

, obtaining a corresponding payoff of

V [π, ⨊_{d \leftarrow δ} C_{d a}]

. By averaging on the strategy of the attacker, we obtain:

\begin{matrix} U (δ, α) \overset{def}{=} \underset{\begin{matrix} a \leftarrow α \end{matrix}}{E} V [π, \underset{d \leftarrow δ}{⨊} C_{d a}] = \sum_{a \in A} α (a) V [π, \underset{d \leftarrow δ}{⨊} C_{d a}] . \end{matrix}

From Theorem 2 (2), we derive:

U (δ, α) = V [π, {⌊ \cdot ⌋}_{a \leftarrow α} ⨊_{d \leftarrow δ} C_{d a}]

, and hence, the whole system can be equivalently regarded as channel

{⌊ \cdot ⌋}_{a \leftarrow α} ⨊_{d \leftarrow δ} C_{d a}

. Note that, by Proposition 4c, the order of the operators is interchangeable, and the system can be equivalently regarded as

⨊_{d \leftarrow δ} {⌊ \cdot ⌋}_{a \leftarrow α} C_{d a}

. This shows the robustness of this model.

From Corollary 1, we derive that

U (δ, α)

is convex in

δ

and linear in

η

; hence, we can compute the Nash equilibrium by the minimax method.

Example 8.

Consider now the example of Section 3 in the setting of Game IV. For

δ \in D D

and

α \in D A

, let

p = δ (0)

and

q = α (0)

. The system can be represented by the channel

(C_{00}_{p} \oplus C_{10})_{q} ⌊ \cdot ⌋ (C_{01}_{p} \oplus C_{11})

represented below.

\begin{matrix} \begin{array}{c} C_{00}_{p} \oplus C_{10} & y = 0 & y = 1 \\ x = 0 & p & \bar{p} \\ x = 1 & 1 & 0 \end{array}_{q} ⌊ \cdot ⌋ \begin{array}{c} C_{01}_{p} \oplus C_{11} & y = 0 & y = 1 \\ x = 0 & \frac{1}{3} + \frac{2}{3} p & \frac{2}{3} - \frac{2}{3} p \\ x = 1 & \frac{2}{3} - \frac{2}{3} p & \frac{1}{3} + \frac{2}{3} p \end{array} \end{matrix}

For uniform π, we have

V [π, C_{00}_{p} \oplus C_{10}] = 1 - \frac{1}{2} p

, while

V [π, C_{10}_{p} \oplus C_{11}]

is equal to

\frac{2}{3} - \frac{2}{3} p

if

p \leq \frac{1}{4}

and equal to

\frac{1}{3} + \frac{2}{3} p

if

p > \frac{1}{4}

. Hence the payoff, expressed in terms of p and q, is

U (p, q) = q (1 - \frac{1}{2} p) + \bar{q} (\frac{2}{3} - \frac{2}{3} p)

if

p \leq \frac{1}{4}

and

U (p, q) = q (1 - \frac{1}{2} p) + \bar{q} (\frac{1}{3} + \frac{2}{3} p)

if

p > \frac{1}{4}

. The Nash equilibrium can be computed by imposing that the partial derivatives of

U (p, q)

with respect to p and q are both zero, which means that we are in a saddle point. We have:

\frac{\partial U (p, q)}{\partial q} = \{\begin{matrix} \frac{1}{3} + \frac{1}{6} p & , i f p \leq \frac{1}{4} \\ \frac{2}{3} - \frac{7}{6} p & , i f p > \frac{1}{4} \end{matrix} \frac{\partial U (p, q)}{\partial p} = \{\begin{matrix} - \frac{2}{3} + \frac{1}{6} q & , i f p \leq \frac{1}{4} \\ \frac{2}{3} - \frac{7}{6} q & , i f p > \frac{1}{4} \end{matrix}

We can see that the equations

\frac{\partial U (p, q)}{\partial q} = 0

and

\frac{\partial U (p, q)}{\partial p} = 0

do not have solutions in

[0, 1]

for

p \leq \frac{1}{4}

, while for

p > \frac{1}{4}

, they have solution

p^{*} = q^{*} = \frac{4}{7}

. The pair

(p^{*}, q^{*})

thus constitutes the Nash equilibrium, and the corresponding payoff is

U (p^{*}, q^{*}) = \frac{5}{7}

.

5.1.5. Game V (Defender-First with Hidden Choice)

This is a defender-first sequential game with imperfect information; hence, it can be represented as a tuple of the form

(D, K_{a} \to A, u)

, where

K_{a}

is a partition of

D

. Since we are assuming perfect recall, and the attacker does not know anything about the action chosen by the defender in Phase 2, i.e., at the moment of the attack (except the probability distribution determined by his/her strategy), we must assume that the attacker does not know anything in Phase 1 either. Hence, the indistinguishability relation must be total, i.e.,

K_{a} = {D}

. However,

{D} \to A

is equivalent to

A

; hence, this kind of game is equivalent to Game IV. It is also a well-known fact in game theory that when in a sequential game the follower does not know the leader’s move before making his/her choice, the game is equivalent to a simultaneous game. (However, one could argue that, since the defender has already committed, the attacker does not need to perform the action corresponding to the Nash equilibrium, and any payoff-maximizing solution would be equally good for him.)

5.1.6. Game VI (Attacker-First with Hidden Choice)

This game is also a sequential game with the attacker as the leader; hence, it is a tuple of the form

(A \to D, A, u)

. It is similar to Game III, except that the payoff is convex on the strategy of the defender, instead of linear. We will see, however, that this causes quite some deviation from the properties of Game III and from standard game theory.

The payoff of the mixed strategy profile

(σ_{d}, α) \in D (A \to D) \times D A

is:

\begin{matrix} U (σ_{d}, α) \overset{def}{=} \underset{\begin{matrix} a \leftarrow α \end{matrix}}{E} V [π, \underset{s_{d} \leftarrow σ_{d}}{⨊} C_{s_{d} (a) a}] = V [π, \underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{s_{d} \leftarrow σ_{d}}{⨊} C_{s_{d} (a) a}], \end{matrix}

so the whole system can be equivalently regarded as channel

\underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{s_{d} \leftarrow σ_{d}}{⨊} C_{s_{d} (a) a}

.

The first important difference from Game III is that in Game VI, there may not exist optimal strategies, either mixed or behavioral, that are deterministic for the defender. On the other hand, for the attacker, there are always deterministic optimal strategies, and this is true independently of whether the defender uses mixed or behavioral strategies.

To show the existence of deterministic optimal strategies for the attacker, let us first introduce some standard notation for functions: given a variable x and an expression M,

λ x . M

represents the function that on the argument x gives as a result the value of M. Given two sets X and Y where Y is provided with an ordering ≤, the point-wise ordering on

X \to Y

is defined as follows: for

f, g : X \to Y

,

f \leq g

if and only if

\forall x \in X . f (x) \leq g (x)

.

Theorem 6 (Attacker’s pure-strategy Nash equilibrium in Game VI).

Consider an attacker-first sequential game with hidden choice.

1.: Mixed strategies, type $D (A \to D)$ . Let $a^{*} \overset{def}{=} {argmax}_{a} {min}_{σ_{d}} V [π, ⨊_{s_{d} \leftarrow σ_{d}} C_{s_{d} (a) a}]$ , and let $σ_{d}^{*} \overset{def}{=} {argmin}_{σ_{d}} λ a . V [π, ⨊_{s_{d} \leftarrow σ_{d}} C_{s_{d} (a) a}]$ . Then, for every $α \in D A$ and $σ_{d} \in D (A \to D)$ we have:

$U (σ_{d}^{*}, α) \leq U (σ_{d}^{*}, a^{*}) \leq U (σ_{d}, a^{*})$
2.: Behavioral strategies, type $A \to D (D)$ . Let $a^{*} \overset{def}{=} {argmax}_{a} {min}_{δ} V [π, ⨊_{d \leftarrow δ} C_{d a}]$ , and let $ϕ_{d}^{*} \overset{def}{=} {argmin}_{ϕ_{d}} λ a . V [π, ⨊_{d \leftarrow ϕ_{d} (a)} C_{d a}]$ (the minimization is with respect to the point-wise ordering). Then, for every $α \in D A$ and $ϕ_{d} : A \to D (D)$ , we have:

$U (ϕ_{d}^{*}, α) \leq U (ϕ_{d}^{*}, a^{*}) \leq U (ϕ_{d}, a^{*})$

Proof.

Let $α$ and $σ_{d}$ be arbitrary elements of $D A$ and $D (A \to D)$ , respectively. Then:

$\begin{matrix} U (σ_{d}^{*}, α) = & \sum_{a \in A} α (a) V [π, \underset{s_{d} \leftarrow σ_{d}^{*}}{⨊} C_{s_{d} (a) a}] \\ \leq & \sum_{a \in A} α (a) V [π, \underset{s_{d} \leftarrow σ_{d}^{*}}{⨊} C_{s_{d} (a^{*}) a^{*}}] & (by the definition of a^{*} and σ_{d}^{*}) \\ = & V [π, \underset{s_{d} \leftarrow σ_{d}^{*}}{⨊} C_{s_{d} (a^{*}) a^{*}}] (= U (σ_{d}^{*}, a^{*})) & (since α is a distribution) \\ \leq & V [π, \underset{s_{d} \leftarrow σ_{d}}{⨊} C_{s_{d} (a^{*}) a^{*}}] & (by the definition of σ_{d}^{*}) \\ = & U (σ_{d}, a^{*}) \end{matrix}$
Let $α$ and $ϕ_{d}$ be arbitrary elements of $D A$ and $A \to D (D)$ , respectively. Then:

$\begin{matrix} U (ϕ_{d}^{*}, α) = & \sum_{a \in A} α (a) V [π, \underset{d \leftarrow ϕ_{d}^{*} (a)}{⨊} C_{d a}] \\ \leq & \sum_{a \in A} α (a) V [π, \underset{d \leftarrow ϕ_{d}^{*} (a^{*})}{⨊} C_{d a^{*}}] & (by the definition of a^{*} and ϕ_{d}^{*}) \\ = & V [π, \underset{d \leftarrow ϕ_{d}^{*} (a^{*})}{⨊} C_{d a^{*}}] (= U (ϕ_{d}^{*}, a^{*})) & (since α is a distribution) \\ \leq & V [π, \underset{d \leftarrow ϕ_{d} (a^{*})}{⨊} C_{d a^{*}}] & (by the definition of ϕ_{d}^{*}) \\ = & U (ϕ_{d}, a^{*}) \end{matrix}$

☐

We show now, with the following example, that the optimal strategies for the defender are necessarily probabilistic.

Example 9.

Consider the channel matrices

C_{i j}

defined in Section 3 and define the following new channels:

C_{00}^{'} = C_{11}^{'} = C_{01}

and

C_{10}^{'} = C_{01}^{'} = C_{10}

. Define

D_{p}

as the result of the hidden choice, with probability p, between

C_{00}^{'}

and

C_{10}^{'}

, i.e.,

D_{p} \overset{def}{=} C_{00}^{'}_{p} \oplus C_{10}^{'}

, and observe that

D_{p} [0, 0] = D_{p} [1, 1] = p

and

D_{p} [1, 0] = D_{p} [0, 1] = 1 - p

. Furthermore,

D_{p} = C_{01}^{'}_{1 - p} \oplus C_{11}^{'}

. The vulnerability of

D_{p}

, for uniform π, is

V [π, D_{p}] = 1 - \frac{1}{2} p

for

p \leq \frac{1}{2}

and

V [π, D_{p}] = p

for

p > \frac{1}{2}

; hence, independently of the choice of the attacker, the best strategy for the defender is to choose

p = \frac{1}{2}

. Every other value for p gives a strictly higher vulnerability. Therefore, the best mixed strategy for the defender is

σ_{d}^{*}

defined as

σ_{d}^{*} (λ_{a} . 0) = σ_{d}^{*} (λ_{a} . 1) = \frac{1}{2}

. Similarly, the best behavioral strategy for the defender is

ϕ_{d}^{*}

defined as

ϕ_{d}^{*} (0) = ϕ_{d}^{*} (1) = λ_{d} . \frac{1}{2}

.

The second important difference from Game III is that in Game VI, behavioral strategies and mixed strategies are not necessarily equivalent. More precisely, there are cases in which the optimal strategy profile yields a different payoff depending on whether the defender adopts mixed strategies or behavioral ones. The following is an example in which this difference manifests itself.

Example 10.

Consider again the example of Section 3, this time in the setting of Game VI, and still with uniform prior π. Let us analyze first the case in which the defender uses behavioral strategies.

1.: Behavioral strategies, type $A \to D (D)$ . If the attacker chooses zero, which corresponds to committing to the system $C_{00}_{p} \oplus C_{10}$ , then the defender will choose $p = \frac{1}{4}$ , which minimizes its vulnerability. If he/she chooses one, which corresponds to committing to the system $C_{01}_{p} \oplus C_{11}$ , then the defender will choose $p = 1$ , which minimizes the vulnerability. In both cases, the leakage is $p = \frac{1}{2}$ ; hence, both of these strategies are solutions to the minimax. Note that in the first case, the strategy of the defender is probabilistic, while that of the attacker is pure in both cases.
2.: Mixed strategies, type $D (A \to D)$ . Observe that there are only four possible pure strategies for the defender, corresponding to the four functions $f_{i j} : A \to D$ for $i, j \in {0, 1}$ defined as $f_{i j} (a) \overset{def}{=} i$ if $i = j$ and $f_{i j} (a) \overset{def}{=} a \oplus i$ if $i \neq j$ . Consider a distribution $σ_{d} \in D (A \to D)$ , and let $p_{i j} \overset{def}{=} σ_{d} (f_{i j})$ . Then, we have $p_{i j} \geq 0$ and $\sum_{i, j} p_{i j} = 1$ . Observe that the attacker’s choice $a = 0$ determines the matrix $C_{00}_{p} \oplus C_{10}$ , with $p = p_{00} + p_{10}$ , whose vulnerability is $V [π, C_{00}_{p} \oplus C_{10}] = 1 - \frac{1}{2} p$ . On the other hand, the attacker’s choice $a = 1$ determines the matrix $C_{01}_{p^{'}} \oplus C_{11}$ , with $p^{'} = p_{00} + p_{01}$ , whose vulnerability is $V [π, C_{01}_{p^{'}} \oplus C_{11}] = \frac{2}{3} - \frac{2}{3} p$ for $p^{'} \leq \frac{1}{4}$ , and $V [π, C_{01}_{p^{'}} \oplus C_{11}] = \frac{1}{3} + \frac{2}{3} p$ for $p^{'} > \frac{1}{4}$ . By geometrical considerations (cf. the red dashed line in Figure 2), we can see that the optimal solutions for the defender are all those strategies, which give $p = \frac{6}{7}$ and $p^{'} = \frac{1}{7}$ , which yield payoff $\frac{4}{7}$ .

Figure 2. Summary of the results for the running example introduced in Section 3, for a uniform prior. Graph (A) is for the case of visible choice: it represents the Bayes vulnerability $V$ of $C_{00}_{p} ⌊ \cdot ⌋ C_{10}$ and of $C_{01}_{p} ⌊ \cdot ⌋ C_{11}$ (cases $a = 0$ and $a = 1$ , respectively), as a function of p; Graph (B) is for the case of hidden choice, and it represents the vulnerability of $C_{00}_{p} \oplus C_{10}$ and of $C_{01}_{p} \oplus C_{11}$ as a function of p. The table on the right gives the payoff in correspondence of the Nash equilibrium for the various games. VI $_{m}$ and VI $_{b}$ represent the attacker-first sequential games with defender strategy of type $D (A \to D)$ (mixed) and $A \to D (D)$ (behavioral), respectively.

The fact that behavioral and mixed strategies are not equivalent is related to the non-existence of deterministic optimal strategies. In fact, it is easy to see that from a behavioral deterministic strategy, we can construct a (deterministic) mixed strategy, and vice versa.

Figure 2 illustrates the graphs of the vulnerability of the various channel compositions and summarizes the results of this section.

6. Comparing the Leakage Games

In previous section, we have computed the vulnerability for the running example in the various kinds of games introduced in Section 5. The values we have obtained, listed in decreasing order, are as follows:

II : 1; I : \frac{4}{5}; IV : \frac{5}{7}; V : \frac{5}{7}; III : \frac{2}{3}; {VI}_{m} : \frac{4}{7}; {VI}_{b} : \frac{1}{2}

. This order is not accidental: in this section, we will prove that some of these relations between games hold for any vulnerability function, and for any prior. These results will allow us to reason about which kinds of scenarios and compositions are more convenient for the defender or, vice versa, for the attacker.

6.1. Simultaneous Games vs. Sequential Games

The relations between II, I and III and between IV–V and VI

_{m}

are typical in game theory: in any zero-sum sequential game, the leader’s payoff is less than or equal to his/her payoff in the corresponding simultaneous game. In fact, by acting first, the leader commits to an action, and this commitment can be exploited by the attacker to choose the best possible strategy relative to that action. (The fact that the leader has a disadvantage may seem counterintuitive because in many real games, it is the opposite: the player who moves first has an advantage. Such a discrepancy is due to the fact that these games feature preemptive moves, i.e., moves that, when made by one player, make impossible other moves for the other player. The games we are considering in this paper, on the contrary, do not consider preemptive moves.) In the following propositions, we give the precise formulation of these results in our framework, and we show how they can be derived formally.

Proposition 1 (Game II ⩾ Game I).

\begin{matrix} min_{δ} max_{σ_{a}} V [π, \underset{\begin{matrix} d \leftarrow δ \\ s_{a} \leftarrow σ_{a} \end{matrix}}{⌊ \cdot ⌋} C_{d s_{a} (d)}] & \geq min_{δ} max_{α} V [π, \underset{\begin{matrix} d \leftarrow δ \\ a \leftarrow α \end{matrix}}{⌊ \cdot ⌋} C_{d a}] \end{matrix}

Proof.

We prove the first inequality as follows. Independently of

δ

, consider the attacker’s strategy

σ_{a}^{*}

that assigns probability one to the function

s_{a}^{*}

defined as

s_{a}^{*} (d) = {argmax}_{a} V [π, C_{d a}]

for any

d \in D

. Then, we have that:

\begin{matrix} min_{δ} max_{σ_{a}} V [π, \underset{\begin{matrix} d \leftarrow δ \\ s_{a} \leftarrow σ_{a} \end{matrix}}{⌊ \cdot ⌋} C_{d s_{a} (d)}] \geq & min_{δ} V [π, \underset{\begin{matrix} d \leftarrow δ \\ s_{a} \leftarrow σ_{a}^{*} \end{matrix}}{⌊ \cdot ⌋} C_{d s_{a} (d)}] & (by maximization on σ_{a}) \\ = & min_{δ} V [π, \underset{d \leftarrow δ}{⌊ \cdot ⌋} C_{d s_{a}^{*} (d)}] & (by the definition of σ_{a}^{*}) \\ = & min_{δ} \sum_{d} δ (d) V [π, C_{d s_{a}^{*} (d)}] & (by Theorem 2 (2)) \\ = & min_{δ} \sum_{d} δ (d) max_{α} \sum_{a} α (a) V [π, C_{d s_{a}^{*} (d)}] & (\sin ce α is a distribution) \\ \geq & min_{δ} \sum_{d} δ (d) max_{α} \sum_{a} α (a) V [π, C_{d a}] & (by the definition of s_{a}^{*}) \\ \geq & min_{δ} max_{α} \sum_{d} δ (d) \sum_{a} α (a) V [π, C_{d a}] \\ = & min_{δ} max_{α} V [π, \underset{\begin{matrix} d \leftarrow δ \\ a \leftarrow α \end{matrix}}{⌊ \cdot ⌋} C_{d a}] & (by Theorem 2 (2)) \end{matrix}

☐

Note that the strategy

σ_{a}^{*}

is optimal for the attacker, so the first of the above inequalities is actually an equality. It is easy to see that the second inequalities comprise an equality, as well, because of the maximization on

α

. Therefore, the only inequalities that may be strict are comprised by the third one, and the reason why it may be strict is that on the left-hand side,

α

depends on d (and on

δ

), while on the right-hand side,

α

depends on

δ

, but not the actual d (that will be sampled from

δ

). This corresponds to the fact that in the defender-first sequential game, the attacker chooses his/her strategy after he/she knows the action d chosen by the defender, while in the simultaneous game, the attacker knows the strategy of the defender (i.e., the distribution

δ

he/she will use to choose probabilistically his/her actions), but not the actual action d that the defender will choose.

Analogous considerations can be done for the simultaneous versus the attacker-first case, which we will examine next.

Proposition 2 (Game I ≥ Game III).

\begin{matrix} min_{δ} max_{α} V [π, \underset{\begin{matrix} d \leftarrow δ \\ a \leftarrow α \end{matrix}}{⌊ \cdot ⌋} C_{d a}] \geq max_{α} min_{σ_{d}} V [π, \underset{\begin{matrix} s_{d} \leftarrow σ_{d} \\ a \leftarrow α \end{matrix}}{⌊ \cdot ⌋} C_{s_{d} (a) a}] \end{matrix}

Proof.

Independently of

α

, consider the defender’s strategy

σ_{d}^{*}

that assigns probability one to the function

s_{d}^{*}

defined as

s_{d}^{*} (a) = {argmin}_{d} V [π, C_{d a}]

for any

a \in A

. Then, we have that:

\begin{matrix} min_{δ} max_{α} V [π, \underset{\begin{matrix} d \leftarrow δ \\ a \leftarrow α \end{matrix}}{⌊ \cdot ⌋} C_{d a}] = & min_{δ} max_{α} \sum_{d} δ (d) \sum_{a} α (a) V [π, C_{d a}] & (by Theorem 2 (2)) \\ = & max_{α} min_{δ} \sum_{d} δ (d) \sum_{a} α (a) V [π, C_{d a}] & (by Theorem 1) \\ \geq & max_{α} \sum_{a} α (a) min_{d} V [π, C_{d a}] \\ = & max_{α} \sum_{a} α (a) V [π, C_{s_{d}^{*} (a) a}] & (by the definition of s_{d}^{*}) \\ = & max_{α} \sum_{a} α (a) \sum_{s_{d}} σ_{d}^{*} (s_{d}) V [π, C_{s_{d} (a) a}] & (by the definition of σ_{d}^{*}) \\ = & max_{α} V [π, \underset{\begin{matrix} s_{d} \leftarrow σ_{d}^{*} \\ a \leftarrow α \end{matrix}}{⌊ \cdot ⌋} C_{s_{d} (a) a}] & (by Theorem 2 (2)) \\ \geq & max_{α} min_{σ_{d}} V [π, \underset{\begin{matrix} s_{d} \leftarrow σ_{d} \\ a \leftarrow α \end{matrix}}{⌊ \cdot ⌋} C_{s_{d} (a) a}] & (by minimization on σ_{d}) \end{matrix}

☐

Again, the strategy

σ_{d}^{*}

is optimal for the attacker, so the last of the above inequalities is actually an equality. Therefore, the only inequality that may be strict is the first one, and the strictness is due to the fact that on the left-hand side,

δ

does not depend on a, while on the right-hand side, it does. Intuitively, this corresponds to the intuition that if the defender knows the action of the attacker, then it may be able to choose a better strategy to reduce the leakage.

Proposition 3 (Game IV ⩾ Game VI

_{m}

).

\begin{matrix} min_{δ} max_{α} V [π, \underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{d \leftarrow δ}{⨊} C_{d a}] & \geq max_{α} min_{σ_{d}} V [π, \underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{s_{d} \leftarrow σ_{d}}{⨊} C_{s_{d} (a) a}] \end{matrix}

Proof.

Given

α \in D A

, let

δ_{α}^{*} \overset{def}{=} {min}_{δ} \sum_{a} α (a) V [π, ⨊_{d \leftarrow δ} C_{d a}]

. For any

d \in D

, let

f_{d}

be the constant function defined as

f_{d} (a) = d

for any

a \in A

, and define

σ_{d}^{*} \in D (A \to D)

as

σ_{d}^{*} (f_{d}) \overset{def}{=} δ_{α}^{*} (d)

for any

d \in D

. Let

U (δ, α) = V [π, {⌊ \cdot ⌋}_{a \leftarrow α} ⨊_{d \leftarrow δ} C_{d a}]

. Then,

U (δ, α)

is convex in

δ

and linear in

α

. Hence, we have:

\begin{matrix} min_{δ} max_{α} V [π, \underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{d \leftarrow δ}{⨊} C_{d a}] = & max_{α} min_{δ} V [π, \underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{d \leftarrow δ}{⨊} C_{d a}] & (by Theorem 1) \\ = & max_{α} V [π, \underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{d \leftarrow δ_{α}^{*}}{⨊} C_{d a}] & (by the definition of δ_{α}^{*}) \\ = & max_{α} V [π, \underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{f_{d} \leftarrow σ_{d}^{*}}{⨊} C_{f_{d} (a) a}] & (by the definition of σ_{d}^{*}) \\ \geq & max_{α} min_{σ_{d}} V [π, \underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{s_{d} \leftarrow σ_{d}}{⨊} C_{s_{d} (a) a}] & (by minimization on σ_{d}) \end{matrix}

☐

6.2. Visible Choice vs. Hidden Choice

We consider now the case of Games III and IV–V. In the running example, the payoff for III is lower than for IV–V, but it is easy to find other cases in which the situation is reversed. For instance, if in the running example, we set

C_{11}

to be the same as

C_{01}

, the payoff for III will be 1 (corresponding to the choice

a = 1

for the attacker), and that for IV–V will be

\frac{2}{3}

(corresponding to the Nash equilibrium

p^{*} = q^{*} = \frac{2}{3}

. Therefore, we conclude that Games III and IV–V are incomparable: there is no general ordering between them.

The relation between Games I and IV comes from the fact that they are both simultaneous games, and the only difference is the way in which the payoff is defined. The same holds for the case of Games III and VI

_{m}

, which are both attacker-first sequential games. The essence of the proof is expressed by the following proposition.

Proposition 4 (Visible choice ⩾ hidden choice).

For every

a \in A

and every

δ \in D D

, we have:

V [π, {⌊ \cdot ⌋}_{d \leftarrow δ} C_{d a}] \geq V [π, ⨊_{d \leftarrow δ} C_{d a}] .

Proof.

\begin{matrix} V [π, \underset{d \leftarrow δ}{⌊ \cdot ⌋} C_{d a}] = & \sum_{d \in D} δ (d) V [π, C_{d a}] & (by Theorem 2 (2)) \\ \geq & V [π, \underset{d \leftarrow δ}{⨊} C_{d a}] & (by Theorem 2 (1)) \end{matrix}

☐

From the above proposition, we can derive immediately the following corollaries:

Corollary 3 (Game I ⩾ Game IV).

\begin{matrix} min_{δ} max_{α} V [π, \underset{\begin{matrix} d \leftarrow δ \\ a \leftarrow α \end{matrix}}{⌊ \cdot ⌋} C_{d a}] \geq min_{δ} max_{α} V [π, \underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{d \leftarrow δ}{⨊} C_{d a}] \end{matrix}

Corollary 4 (Game III ⩾ Game VI

_{m}

).

\begin{matrix} max_{α} min_{σ_{d}} V [π, \underset{\begin{matrix} s_{d} \leftarrow σ_{d} \\ a \leftarrow α \end{matrix}}{⌊ \cdot ⌋} C_{s_{d} (a) a}] \geq max_{α} min_{σ_{d}} V [π, \underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{s_{d} \leftarrow σ_{d}}{⨊} C_{s_{d} (a) a}] \end{matrix}

Finally, we show that the vulnerability for the optimal solution in Game VI

_{m}

is always greater than or equal to that of Game VI

_{b}

, which means that for the defender, it is always convenient to use behavioral strategies. We can state actually a more general result: for any mixed strategy, there is always a behavioral strategy that gives the same payoff.

Proposition 5.

For any

α \in D A

and any

σ_{d} \in D (A \to D)

, there exists

ϕ_{d} : A \to D (D)

such that:

V [π, \underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{d \leftarrow ϕ_{d} (a)}{⨊} C_{d a}] = V [π, \underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{s_{d} \leftarrow σ_{d}}{⨊} C_{s_{d} (a) a}]

Proof.

For

σ_{d} \in D (A \to D)

, define

ϕ_{d} : A \to D (D)

as follows: for any

a \in A

and

d \in D

,

ϕ_{d} (a) (d) \overset{def}{=} \sum_{s_{d} (a) = d} σ_{d} (s_{d})

and observe that for every

a \in A

, we have

\underset{d \leftarrow ϕ_{d} (a)}{⨊} C_{d a} = \underset{s_{d} \leftarrow σ_{d}}{⨊} C_{s_{d} (a) a}

. ☐

From this proposition, we derive immediately the following corollary:

Corollary 5.

max_{α} min_{σ_{d}} V [π, \underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{s_{d} \leftarrow σ_{d}}{⨊} C_{s_{d} (a) a}] \geq max_{α} min_{ϕ_{d}} V [π, \underset{a \leftarrow α}{⌊ \cdot ⌋} \underset{d \leftarrow ϕ_{d} (a)}{⨊} C_{d a}]

The lattice in Figure 3 illustrates the results of this section about the relations between the various games. These relations can be used by the defender as guidelines to better protect the system, if he/she has some control over the rules of the game. Obviously, for the defender, the games lower in the ordering are to be preferred to compose protocols, since they yield a lower vulnerability for the result.

Figure 3. Order of games w.r.t. the payoff in the Nash equilibrium. Games higher in the lattice have larger payoff.

7. Case Study: A Safer, Faster Password-Checker

In this section, we apply our game-theoretic, compositional approach to show how a defender can mitigate an attacker’s typical timing side-channel attack, while avoiding the usual burden imposed on the password-checker’s efficiency by measures that make time consumption constant.

The following sections are organized as follows: We first provide a formalization of the trade-off between efficiency and security in password checkers using our framework of leakage games. We then illustrate the approach in a simple instance of the program for 3-bit passwords. Finally, we provide general results for the n-bit case regarding the defender’s optimal strategy in equilibrium.

7.1. Modeling the Trade-Off between Efficiency and Security as a Game

Consider the password-checker PWD_1…n of Algorithm 1, which performs a bitwise-check of an n-bit low-input

a = a_{1}, a_{2}, \dots, a_{n}

provided by the attacker against an n-bit secret password

x = x_{1}, x_{2}, \dots, x_{n}

. The bits are compared in increasing order (1, 2, …, n), with the low-input being rejected as soon as it mismatches the secret, and accepted otherwise.

Algorithm 1: Password-checker PWD_1…n.

The attacker can choose low-inputs to try to gain information about the password. Obviously, in case PWD_1…n accepts the low-input, the attacker learns that the password value is

a = x

. Yet, even when the low-input is rejected, there is some leakage of information: from the duration of the execution, the attacker can estimate how many iterations have been performed before the low-input was rejected, thus inferring a prefix of the secret password.

To model this scenario, let

X = {0, 1}^{n}

be the set of all possible n-bit passwords and

Y = {F} \times {1, 2, \dots, n} \cup {T} \times {n} = {(F, 1), (F, 2), (F, 3), \dots, (F, n), (T, n)}

be the set of observables produced by the system. Each observable is an ordered pair whose first element indicates whether or not the password was accepted (T or F, respectively), and the second element indicates the duration of the computation (1, 2, …, or n iterations).

For instance, consider a scenario with 3-bit passwords. Let PWD₁₂₃ be a password checker that performs the bitwise comparison in increasing order (1, 2, 3). Channel

C_{123, 101}

in Table 4 models PWD₁₂₃’s behavior when the attacker provides low-input

a = 101

. Note that this channel represents the fact that PWD₁₂₃ accepts the low-input when the secret is

x = 101

(the channel outputs

(T, 3)

with probability one), and otherwise rejects the low-input in a certain number of steps (e.g., the checker rejects the low-input in two steps when the password is

x = 110

, so in this case, the channel outputs

(F, 2)

with probability one).

Table 4. Channel

C_{123, 101}

modeling the case in which the defender compares bits in order (1, 2, 3) and the attacker picks low-input 101.

To quantify the password checker’s leakage of information, we will adopt Bayes vulnerability, so the prior Bayes vulnerability

V [π]

will correspond to the probability that the attacker guesses correctly the password in one try, whereas the posterior Bayes vulnerability

V [π, C]

will correspond to the probability that the attacker guesses correctly the password in one try, after he/she observes the output of the channel (i.e., after he/she has measured the time needed for the checker to accept or reject the low-input). For instance, in the 3-bit password scenario, if the prior distribution on all possible 3-bit passwords is

\hat{π} = (0.0137, 0.0548, 0.2191, 0.4382, 0.0002, 0.0002, 0.0548, 0.2191)

, the corresponding prior Bayes vulnerability is

V [\hat{π}] = 0.4382

. For prior

\hat{π}

above, the posterior Bayes vulnerability of channel

C_{123, 101}

is

V [\hat{π}, C_{123, 101}] = 0.6577

, which represents an increase in Bayes vulnerability of about

50 %

).

A way to mitigate this timing side-channel is to make the checker’s execution time independent of the secret. This can be done by by eliminating the break command within the loop in PWD_1…n, so no matter when the matching among high and low input happens, the password checker will always need n iterations to complete. For instance, in the context of our 3-bit password example, we can let PWD_cons be a constant-time 3-bit password checker that applies this counter measure. Channel

C_{cons, 101}

from Table 5 models PWD_cons’s behavior when the attacker’s low-input is

a = 101

. Note that this channel reveals only whether or not the low-input matches the secret value, but does not allow the attacker to infer a prefix of the password. Indeed, this channel’s posterior Bayes vulnerability is

V [\hat{π}, C_{123, 101}] = 0.4384

, which brings the multiplicative Bayes leakage down to an increase of only about

0.05 %

.

Table 5. Channel

C_{cons, 101}

modeling the case in which the defender runs a constant-time checker and the attacker picks low-input 101.

However, the original program is substantially more efficient than the modified one. Consider the general case of n-bit passwords and assume that either the password, or the program’s low input, is chosen uniformly at random. Because of this assumption, each bit being checked in the original program has probability

\frac{1}{2}

of being rejected. Hence, the program will finish after one iteration with probability

\frac{1}{2}

, after two iterations with probability

\frac{1}{4}

, and so on, up to the n-th iteration. After that, the program always finishes, so with the remaining probability

2^{- n}

, the program finishes after n iterations, giving a total expected time of:

\sum_{k = 1}^{n} k 2^{- k} + n 2^{- n} = 2 (1 - 2^{- n}) .

The above derivation is based on the series

\sum_{k = 1}^{n} k z^{k} = z \frac{1 - (n + 1) z^{n} + n z^{n + 1}}{{(1 - z)}^{2}}

. Hence, the expected running time of the original program (under the uniform assumption) is constant: always bounded by two, and converging to two as n grows. On the other hand, the running time of the modified constant-time program is n iterations, an

\frac{n}{2}

-fold increase.

Seeking some compromise between security and efficiency, assume the defender can employ different versions of the password-checker, each performing the bitwise comparison among low-input a and secret password x in a different order. More precisely, there is one version of the checker for every possible order in which the index i ranges in the control of the loop in Algorithm 1.

To determine a defender’s best choice of which versions of the checker to run, we model this problem as a game. The attacker’s set of actions

A

consists of all possible

2^{n}

low-inputs to the checker, and the defender’s set of actions

D

consists of all

n!

orders in which the checker can perform the bitwise comparison. There is, then, a channel

C_{a d} : X \times Y \to R

for each possible combination of

d \in D

,

a \in A

. In our framework, the payoff of a mixed strategy profile

(δ, α)

is given by:

U (δ, α) = E_{\begin{matrix} a \leftarrow α \end{matrix}} V [π, ⨊_{d \leftarrow δ} C_{d a}] .

For each pure strategy profile

(d, a)

, the payoff of the game will be the posterior Bayes vulnerability of the resulting channel

C_{d a}

(since, if we are measuring the information leakage, the prior vulnerability is the same for every channel once the prior is fixed).

In the 3-bit password scenario, the attacker’s actions

A = {000, 001, 010, 011, 100, 101, 110, 111}

are all possible 3-bit low-inputs, and the defender’s

D = {123, 132, 213, 231, 312, 321}

are all possible versions of the password checker (each action represents the order in which the 3 bits are checked). Table 6 depicts the corresponding payoffs of all 48 possible resulting channels

C_{a d}

with

d \in D

,

a \in A

, when the prior is still

\hat{π} = (0.0137, 0.0548, 0.2191, 0.4382, 0.0002, 0.0002, 0.0548, 0.2191)

. Note that the attacker’s and defender’s actions substantially affect the effectiveness of the attack: vulnerability ranges between 0.4934 and 0.9311 (and so, multiplicative leakage is in the range between an increase of

12 %

and one of

112 %

). Using techniques from [6], we can compute the best (mixed) strategy for the defender in this game, which turns out to be:

\begin{matrix} δ^{*} = (0.1667, 0.1667, 0.1667, 0.1667, 0.1667, 0.1667) . \end{matrix}

Table 6. Payoff for each pure strategy profile of 3-bit password scenario.

This strategy is part of an equilibrium and guarantees that for any choice of the attacker, the posterior Bayes vulnerability is at most

0.6573

(so the multiplicative leakage is bounded by

50 %

, an intermediate value between the minimum of about

12 %

and the maximum of about

112 %

).

The running time, on the other hand, of this new password-checker is the same as that of the original one. Under the assumption that either the password or the low-input is uniformly distributed, each check fails with probability

\frac{1}{2}

, giving a total expected time of

2 (1 - 2^{- n})

. Hence, this technique substantially decreases the program’s information leakage, without affecting at all its expected running time.

7.2. On Optimal Strategies for the Defender

Interestingly, in the 3-bit password case study from the previous section, the defender’s optimal strategy consists of uniformly sampling among all available versions of the checker. A uniform distribution seems to be an adequate candidate for the defender, but is it always the best choice for any prior and any number of bits?

We first answer this question in the case of a uniform prior

π

for arbitrary n-bit passwords, which already turns out to be challenging. Under this prior, and exploiting a crucial symmetry of the password checker (see the proof of Theorem 7), we can show that all strategies for the adversary are in fact equivalent, namely:

U (α, δ) = U (α^{'}, δ) for all α, α^{'}, δ .

For the defender, on the other hand, the situation is far from trivial: although all pure strategies d are still equivalent,

U (α, δ)

does in general depend on

δ

. By exploiting another symmetry of the password checker together with the symmetry of

V

, we can show that a uniform strategy is indeed optimal for the defender, as stated in the following result.

Theorem 7.

Consider the password checker program of Algorithm 1 for n-bit passwords, where the attacker controls the low input to the checker and the defender controls the order in which the bits are checked. If the prior

π

on possible passwords is uniform and the payoff is given by the posterior Bayes-vulnerability:

U (δ, α) = E_{\begin{matrix} a \leftarrow α \end{matrix}} V [π, ⨊_{d \leftarrow δ} C_{d a}]

, then the strategy

(δ^{*}, α)

where

δ^{*}

is uniform and

α

is arbitrary is an equilibrium strategy.

Perhaps surprisingly, however, Theorem 7 does not generalize to non-uniform priors (or to different vulnerability metrics). More precisely, when the prior on passwords is not uniform, the defender may benefit from assigning different probabilities to different versions of the password checker. This subtlety arises from the fact that the defender’s goal is not to maximize the attacker’s uncertainty about the selected password checker itself (i.e., the defender’s action), but it is rather to maximize the attacker’s uncertainty about the secret value. The following examples illustrate this (perhaps counter-intuitive) fact.

Consider again a 3-bit password scenario, similar to that of the previous section. Assume that the attacker knows only that the first bit of the password is surely zero, so that the prior on secrets is:

π^{(A)} = (0.25, 0.25, 0.25, 0.25, 0, 0, 0, 0) .

The payoff table for this case is presented in Table 7, and a corresponding equilibrium defender’s best strategy, computed using techniques from [6], is:

δ^{* (A)} = (0.25, 0.25, 0, 0.25, 0, 0.25) .

Table 7. Payoff table for each pure strategy profile of the 3-bit password scenario, under prior

π^{(A)}

.

Note that this equilibrium means that the defender never has to check the bits in the order (2, 1, 3) or in the order (3, 1, 2). The game’s payoff (i.e., posterior Bayes vulnerability) in this case is

0.5625

, which is smaller than the payoff of

0.5833

that would ensue in case the defender’s strategy were uniform. This means that, from the point of view of the defender, uniformly randomizing is not optimal.

Now, assume that the attacker knows that some passwords are more likely than others, even if all are possible, as reflected in the prior:

π^{(B)} = (0.25, 0.20, 0.15, 0.10, 0.10, 0.10, 0.05, 0.05) .

The payoff table for this case is presented in Table 8, and a corresponding equilibrium defender’s best strategy can be computed to be:

δ^{* (B)} = (0.1974, 0.1974, 0.1316, 0.1316, 0.1711, 0.1711) .

Table 8. Payoff table for each pure strategy profile of the 3-bit password scenario, under prior

π^{(B)}

.

Note that this equilibrium means that every version of the checker may be selected by the defender, but the probability distribution is not uniform. The game’s payoff (i.e., posterior Bayes vulnerability) in this case is

0.4553

, which is again smaller than the payoff of

0.4666

that would ensue in case the defender’s strategy were uniform.

8. Related Work

Many studies have applied game theory to analyses of security and privacy in networks [18,19,20], cryptography [21], anonymity [22], location privacy [23] and intrusion detection [24], to cite a few. See [25] for a survey.

In the context of quantitative information flow, most works consider only passive attackers. Boreale and Pampaloni [4] considered adaptive attackers, but not adaptive defenders, and show that in this case, the attacker’s optimal strategy can be always deterministic. Mardziel et al. [5] proposed a model for both adaptive attackers and defenders, but in none of their extensive case-studies did the attacker need a probabilistic strategy to maximize leakage. In this paper, we characterize when randomization is necessary, for either attacker or defender, to achieve optimality in our general information leakage games.

Security games have been employed to model and analyze payoffs between interacting agents, especially between a defender and an attacker. Korzhyk et al. [26] theoretically analyzed security games and studied the relationships between Stackelberg and Nash equilibria under various forms of imperfect information. Khouzani and Malacaria [27] studied leakage properties when perfect secrecy was not achievable due to constraints on the allowable size of the conflating sets and provided universally optimal strategies for a wide class of entropy measures and for g-entropies. In particular, they prove that designing a channel with minimum leakage is equivalent to computing Nash equilibria in a corresponding two-player zero-sum game of incomplete information for a range of entropy measures. These works, contrary to ours, do not consider games with hidden choice, in which optimal strategies differ from traditional game-theory.

Several security games have modeled leakage when the sensitive information is the defender’s choices themselves, rather than a system’s high input. For instance, Alon et al. [28] propose zero-sum games in which a defender chooses probabilities of secrets and an attacker chooses and learns some of the defender’s secrets. Then, they present how the leakage of the defender’s secrets has an influence on the defender’s optimal strategy. More recently, Xu et al. [29] showed zero-sum games in which the attacker obtains partial knowledge on the security resources that the defender protects and provided the defender’s optimal strategy under the attacker’s knowledge. Contrary to these studies, in this paper, we assume that a secret value is drawn from some prior distribution and is not the defender’s strategy itself.

Security games have also been used to provide optimal trade-offs between two conflicting desirable properties. Khouzani et al. [30] studied the clash between security and usability in the password selection and presented a game-theoretic framework for determining an optimal trade-off. They analyzed guessing attacks and derived the optimal policies for secret picking as Nash/Stackelberg equilibria. Yang et al. [31] proposed a game-theoretic framework to analyze user behavior in anonymity networks. They considered the cost of anonymity in terms of the loss of utility. They also considered incentives and their impact on users’ cooperation. Shokri et al. [32] presented a game-theoretic model for a designer to find the optimal privacy mechanism by taking the adversary’s knowledge into account. More specifically, they showed a Stackelberg Bayesian game in which a user first chooses a location obfuscation mechanism to maximize his/her location privacy and then an adversary tries to estimate the user’s location to minimize its error. In contrast, our work presents a more general framework that is not limited to a particular domain and focuses on protocol composition as a method to limit the leakage.

Regarding channel operators, the sequential and parallel composition of channels have been studied (e.g., [33]), but we are unaware of any explicit definition and investigation of hidden and visible choice operators. Although Kawamoto et al. [34] implicitly used the hidden choice to model a probabilistic system as the weighted sum of systems, they did not derive the set of algebraic properties we do for this operator, and for its interaction with the visible choice operator.

9. Conclusions and Future Work

In this paper, we used protocol composition to model the introduction of noise performed by the defender to prevent leakage of sensitive information. More precisely, we formalized visible and hidden probabilistic choices of different protocols. We then formalized the interplay between defender and attacker in a game-theoretic framework adapted to the specific issues of QIF, where the payoff is information leakage. We considered various kinds of leakage games, depending on whether players act simultaneously or sequentially, and whether the choices of the defender are visible or not to the attacker. We established a hierarchy of these games and provided methods for finding the optimal strategies (at the points of equilibrium) in the various cases. We also proved that in a sequential game with hidden choice, the behavioral strategies are more advantageous for the defender than the mixed strategies. This contrasts with the standard game theory, where the two types of strategies are equivalent.

As future research, we would like to extend leakage games to the case of repeated observations, i.e., when the attacker can observe the outcomes of the system in successive runs, under the assumption that both attacker and defender may change the channel in each run. We would also like to extend our framework to non zero-sum games, in which the costs of attack and defense are not equivalent, and to analyze differentially-private mechanisms.

Author Contributions

All authors contributed to the technical results and are listed in alphabetical order.

Acknowledgments

The authors are thankful to Arman Khouzani and Pedro O. S. Vaz de Melo for valuable discussions. This work was supported by JSPS and INRIAunder the project LOGIS of the Japan-France AYAMEProgram, by the PEPS 2018 project MAGIC and by the project Epistemic Interactive Concurrency (EPIC) from the STICAmSudProgram. Mário S. Alvim was supported by CNPq, CAPES and FAPEMIG. Yusuke Kawamoto was supported by JSPS KAKENHI Grant Number JP17K12667.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proofs of Technical Results

In this section, we provide the proofs of the technical results, which are not in the main body of the paper.

Appendix A.1. Preliminaries for Proofs

We start by providing some necessary background for the subsequent technical proofs.

Definition 1 states that two compatible channels (i.e., with the same input space) are equivalent if yield the same value of vulnerability for every prior and every vulnerability function. The result below, from the literature, provides necessary and sufficient conditions for two channels being equivalent. The result employs the extension of channels with an all-zero column as follows. For any channel C of type

X \times Y \to R

, its zero-column extension

C^{0}

is the channel of type

X \times (Y \cup {y_{0}}) \to R

, with

y_{0} \notin Y

, s.t.

C^{0} (x, y) = C (x, y)

for all

x \in X

,

y \in Y

, and

C^{0} (x, y_{0}) = 0

for all

x \in X

.

Lemma A1

(Characterization of channel equivalence [13,17]). Two channels

C_{1}

and

C_{2}

are equivalent iff every column of

C_{1}

is a convex combination of columns of

C_{2}^{0}

, and every column of

C_{2}

is a convex combination of columns of

C_{1}^{0}

.

Note that the result above implies that for being equivalent, any two channels must be compatible.

Appendix A.2. Proofs of Section 4

Proposition 1 (Type of hidden choice).

Given a family

{C_{i}}_{i \in I}

of channels of type

X \times Y \to R

, and a distribution

μ

on

I

, the hidden choice

⨊_{i \leftarrow μ} C_{i}

is a channel of type

X \times Y \to R

.

Proof.

Since hidden choice is defined as a summation of matrices, the type of

⨊_{i \leftarrow μ} C_{i}

is the same as the type of every

C_{i}

in the family.

To see that

⨊_{i \leftarrow μ} C_{i}

is a channel (i.e., all of its entries are non-negative, and all of its rows sum up to 1), first note that, since each

C_{i}

in the family is a channel matrix,

C_{i} (x, y)

lies in the interval

[0, 1]

for all

x \in X

,

y \in Y

. Since

μ

is a set of convex coefficients, from the definition of hidden choice it follows that also

\sum_{i \in I} μ (i) C_{i} (x, y)

must lie in the interval

[0, 1]

for every

x, y

.

Second, note that for all

x \in X

:

\begin{matrix} \sum_{y \in Y} (\underset{i \leftarrow μ}{⨊} C_{i}) (x, y) = & \sum_{y \in Y} \sum_{i \in I} μ (i) C_{i} (x, y) & (def . of hidden choice) \\ = & \sum_{i \in I} μ (i) \sum_{y \in Y} C_{i} (x, y) \\ = & \sum_{i \in I} μ (i) \cdot 1 & (C_{i} : X \times Y \to R are channels) \\ = & 1 & (μ is a prob . dist .) \end{matrix}

☐

Proposition 2 (Type of visible choice).

Given a family

{C_{i}}_{i \in I}

of compatible channels s.t. each

C_{i}

has type

X \times Y_{i} \to R

and a distribution

μ

on

I

, the result of the visible choice

{⌊ \cdot ⌋}_{i \leftarrow μ} C_{i}

is a channel of type

X \times (⨆_{i \in I} Y_{i}) \to R

.

Proof.

Visible choice applied to a family

{C_{i}}

of channels scales each matrix

C_{i}

by a factor

μ (i)

, which preserves the type

X \times Y_{i} \to R

of each matrix, and then concatenates all the matrices so produced, yielding a result of type

X \times (⨆_{i \in I} Y_{i}) \to R

.

To see that

{⌊ \cdot ⌋}_{i \leftarrow μ} C_{i}

is a channel (i.e., that all of its entries are non-negative, and that all rows sum-up to 1), note that each element of the visible choice on the family

{C_{i}}

can be denoted by

({⌊ \cdot ⌋}_{i \leftarrow μ} C_{i}) (x, (y, j))

, where

x \in X

,

j \in I

, and

y \in Y_{j}

. Then, note that for all

x \in X

,

j \in I

, and

y \in Y_{j}

:

\begin{matrix} ({⌊ \cdot ⌋}_{i \leftarrow μ} C_{i}) (x, (y, j)) = & (⋄_{i \in I} μ (i) C_{i}) (x, (y, j)) & (def . of visible choice) \\ = & (μ (j) C_{j}) (x, y) & (def . of concatenation) \\ = & μ (j) C_{j} (x, y) & (def . of scalar mult .) \end{matrix}

(A1)

which is a non-negative value, since, both

μ (j)

and

C_{j} (x, y)

are non-negative.

Finally, note that for all

x \in X

:

\begin{matrix} \sum_{\begin{matrix} j \in I \\ y \in Y_{j} \end{matrix}} ({⌊ \cdot ⌋}_{i \leftarrow μ} C_{i}) (x, (y, j)) = & \sum_{\begin{matrix} j \in I \\ y \in Y_{j} \end{matrix}} μ (j) C_{j} (x, y) & (by Equation (A 1)) \\ = & \sum_{j \in I} μ (j) \sum_{y \in Y_{j}} C_{j} (x, y) \\ = & \sum_{j \in I} μ (j) \cdot 1 & (C_{j} : X \times Y_{j} \to R are channels) \\ = & 1 & (μ is a prob . dist .) \end{matrix}

☐

Proposition 3 (Idempotency).

Given a family

{C_{i}}_{i \in I}

of channels s.t.

C_{i} = C

for all

i \in I

, and a distribution

μ

on

I

, then: (a)

⨊_{i \leftarrow μ} C_{i} = C

; and (b)

{⌊ \cdot ⌋}_{i \leftarrow μ} C_{i} \approx C

.

Proof.

(a): Idempotency of hidden choice:

$\begin{matrix} ⨊_{i \leftarrow μ} C_{i} = & \sum_{i} μ (i) C_{i} & (def . of hidden choice) \\ = & \sum_{i} μ (i) C & (\sin ce every C_{i} = C) \\ = & C \sum_{i} μ (i) \\ = & C & (\sin ce μ is a prob . dist .) \end{matrix}$
(b): Idempotency of visible choice:

$\begin{matrix} {⌊ \cdot ⌋}_{i \leftarrow μ} C_{i} = & ⋄_{i} μ (i) C_{i} & (def . of visible choice) \\ = & ⋄_{i} μ (i) C & (\sin ce every C_{i} = C) \\ \approx & C & (by Lemma A 1) \end{matrix}$

In the above derivation, we can apply Lemma A1 because every column of the channel on each side of the equivalence can be written as a convex combination of the zero-column extension of the channel on the other side.

☐

Proposition 4 (“Reorganization of operators”).

Given a family

{C_{i j}}_{i \in I, j \in J}

of channels indexed by sets

I

and

J

, a distribution

μ

on

I

and a distribution

η

on

J

:

(a): $⨊_{i \leftarrow μ} ⨊_{j \leftarrow η} C_{i j} = ⨊_{\begin{matrix} i \leftarrow μ \\ j \leftarrow η \end{matrix}} C_{i j}$ , if all $C_{i}$ ’s have the same type;
(b): ${⌊ \cdot ⌋}_{i \leftarrow μ} {⌊ \cdot ⌋}_{j \leftarrow η} C_{i j} \approx {⌊ \cdot ⌋}_{\begin{matrix} i \leftarrow μ \\ j \leftarrow η \end{matrix}} C_{i j}$ , if all $C_{i}$ ’s are compatible; and
(c): $⨊_{i \leftarrow μ} {⌊ \cdot ⌋}_{j \leftarrow η} C_{i j} \approx {⌊ \cdot ⌋}_{j \leftarrow η} ⨊_{i \leftarrow μ} C_{i j}$ , if, for each i, all $C_{i j}$ ’s have the same type $X \times Y_{j} \to R$ .

Proof.

(a): $\begin{matrix} \underset{i \leftarrow μ}{⨊} \underset{j \leftarrow η}{⨊} C_{i j} = & \underset{i \leftarrow μ}{⨊} (\sum_{j} η (j) C_{i j}) & (def . of hidden choice) \\ = & \sum_{i} μ (i) (\sum_{j} η (j) C_{i j}) & (def . of hidden choice) \\ = & \sum_{i, j} η (i) η (j) C_{i j} & (reorganizing the sums) \\ = & \underset{\begin{matrix} i \leftarrow μ \\ j \leftarrow η \end{matrix}}{⨊} C_{i j} & (def . of hidden choice) \end{matrix}$
(b): $\begin{matrix} \underset{i \leftarrow μ}{⌊ \cdot ⌋} \underset{j \leftarrow η}{⌊ \cdot ⌋} C_{i j} = & \underset{i \leftarrow μ}{⌊ \cdot ⌋} (⋄_{j} C_{i j}) & (def . of visible choice) \\ = & ⋄_{i} μ (i) (⋄_{j} η (j) C_{i j}) & (def . of visible choice) \\ \approx & ⋄_{i j} μ (i) η (j) C_{i j} & (by Lemma A 1) \\ = & \underset{\begin{matrix} i \leftarrow μ \\ j \leftarrow η \end{matrix}}{⌊ \cdot ⌋} C_{i j} & (def . of visible choice) \end{matrix}$

In the above derivation, we can apply Lemma A1 because every column of the channel on each side of the equivalence can be written as a convex combination of the zero-column extension of the channel on the other side.
(c): $\begin{matrix} \underset{i \leftarrow μ}{⨊} \underset{j \leftarrow η}{⌊ \cdot ⌋} C_{i j} = & \underset{i \leftarrow μ}{⨊} (⋄_{j} C_{i j}) & (def . of visible choice) \\ = & \sum_{i} μ (i) (⋄_{j} η (j) C_{i j}) & (def . of hidden choice) \\ \approx & ⋄_{j} η (j) (\sum_{i} η (i) C_{i j}) & (by Lemma A 1) \\ = & \underset{j \leftarrow η}{⌊ \cdot ⌋} \underset{i \leftarrow μ}{⨊} C_{i j} & (def . of operators) \end{matrix}$

In the above derivation, we can apply Lemma A1 because every column of the channel on each side of the equivalence can be written as a convex combination of the zero-column extension of the channel on the other side.

☐

Appendix A.3. Proofs of Section 7

Theorem A1.

Consider the password checker program of Algorithm 1 for n-bit passwords, where the attacker controls the low input to the checker and the defender controls the order in which the bits are checked. If the prior

π

on possible passwords is uniform and the payoff is given by the posterior Bayes-vulnerability:

U (δ, α) = E_{\begin{matrix} a \leftarrow α \end{matrix}} V [π, ⨊_{d \leftarrow δ} C_{d a}]

, then the strategy

(δ^{*}, α)

where

δ^{*}

is uniform and

α

is arbitrary is an equilibrium strategy.

Proof.

We show that

(δ^{*}, α)

where

δ^{*}

is uniform and

α

is arbitrary, is a saddle point of

U (δ, α)

. The proof will rely on two crucial symmetries of the password checker, in combination with the use of a uniform prior. Note that, under uniform

π

, the Bayes-vulnerability of a channel C is proportional to the sum of the column maxima of C, namely

V [π, C] = \frac{1}{| X |} \sum_{y} {max}_{x} C (x, y)

. Given a permutation

σ

of

X

, define

C^{σ}

as C with its rows permuted, namely

C^{σ} (x, y) = C (σ (x), y)

. Note that

C^{σ}

can be written as

M_{σ} C

where

M_{σ}

is a permutation matrix. Permuting the rows does not affect the column maxima, hence

V [π, C] = V [π, C^{σ}]

.

For the attacker things are simple, since we can show that under a uniform

π

all attacker strategies are equivalent, that is,

U (α, δ) = U (α^{'}, δ) for all α, α^{'}, δ .

To show this, we use the first important symmetry of the password checker, which is due to the fact that the output of the algorithm only depends on whether

x_{i} \neq a_{i}

is true or false for each bit. Hence, any modification on a can be matched with a modification on x that preserves the output, namely for all

a, a^{'}

there exists a permutation

σ

of passwords such that

C_{d a} = C_{d a^{'}}^{σ}

for all d. Denote

C_{δ a} = ⨊_{d \leftarrow δ} C_{d a}

, we have that:

C_{δ a} = \underset{d \leftarrow δ}{⨊} C_{d a} = \underset{d \leftarrow δ}{⨊} M_{σ} C_{d a^{'}} = M_{σ} \underset{d \leftarrow δ}{⨊} C_{d a^{'}} = C_{δ a^{'}}^{σ}

From

V [π, C_{δ a}] = V [π, C_{δ a^{'}}^{σ}]

, we have that

U (a, δ) = U (a^{'}, δ)

for all pure strategies

a, a^{'}

, which implies

U (α, δ) = U (α^{'}, δ)

since

U (α, δ)

is linear on

α

. Therefore, in the remainder of the proof, we assume that the attacker plays the fixed pure strategy

a_{0}

, having the zero bitstring

0 \dots 0

as the low input.

For the defender, on the other hand, the situation is far from trivial: although all pure strategies d are still equivalent,

U (α, δ)

is only convex on

δ

and as a consequence the payoff highly depends on

δ

. Our goal is to show that:

U (a_{0}, δ) = \frac{1}{| X |} \sum_{y} max_{x} \sum_{d} δ (d) C_{d a_{0}} (x, y)

is minimized on the uniform

δ^{*}

. To do so, we show something stronger, namely that each addend of the y-sum is simultaneously minimized on the uniform

δ^{*}

. Fix an arbitrary y, and let:

f (δ) = max_{x} δ \cdot ψ_{x}

where · is the dot product and

ψ_{x} \in R^{| D |}

is the vector defined by

ψ_{x} (d) = C_{d a_{0}} (x, y)

. Note that, since

C_{d a_{0}}

is deterministic, all elements of

ψ_{x}

are either zero or one. Intuitively,

ψ_{x} (d) = 1

if x produces the fixed output y when the bit checking order is d.

We need to show that

f (δ)

is minimized on

δ^{*}

. However, f seen as a function on the whole

R^{| D |}

has no global minimum. In our case, though,

δ

is a probability distribution taking values in

D D \subset R^{| D |}

. That is, only

| D | - 1

elements of

δ

are free, so we can reduce the dimension by one as follows: fix some

d_{0} \in D

and let

\tilde{δ}, {\tilde{ψ}}_{x} \in R^{| D | - 1}

be the same as

δ, ψ_{x}

with the element corresponding to

d_{0}

removed. We have that

δ (d_{0}) = 1 - \sum_{d \neq d_{0}} δ (d) = 1 - \tilde{δ} \cdot 1

, where

1

is the “ones” vector; hence, we can rewrite

f (δ)

as a function of

\tilde{δ}

:

\begin{matrix} f (\tilde{δ}) & = max_{x} \tilde{δ} \cdot {\tilde{ψ}}_{x} + δ (d_{0}) ψ_{x} (d_{0}) \\ = max_{x} \tilde{δ} \cdot {\tilde{ψ}}_{x} + (1 - \tilde{δ} \cdot 1) ψ_{x} (d_{0}) \\ = max_{x} \tilde{δ} \cdot ({\tilde{ψ}}_{x} - ψ_{x} (d_{0}) 1) + ψ_{x} (d_{0}) \end{matrix}

Therefore, it is sufficient to show that

f (\tilde{δ})

is minimized on

{\tilde{δ}}^{*}

.

In the following, we use the fundamental concept of subgradients from convex analysis, which generalize gradients for non-differentiable functions. A vector v is a subgradient of a possibly non-differentiable convex function

g : S \to R

at

x_{0} \in S

iff:

g (x) - g (x_{0}) \geq v \cdot (x - x_{0}) for all x \in S .

It is well-known that g has a global minimum on

x_{0}

iff the

0

vector belongs to the set of subgradients of g at

x_{0}

(this generalizes the fact that differentiable convex functions have zero gradient on their global minimum). Recall also that

g (x) = x \cdot v + c

has a single subgradient v, while for

g (x) = {max}_{i} g_{i} (x)

, any subgradient of

g_{i}

for any branch i giving the max, is also a subgradient of g. Finally, the set of subgradients is convex, so any convex combination of them is also a subgradient.

Hence, the subgradients of

f (\tilde{δ})

are

{\tilde{ψ}}_{x} - ψ_{x} (d_{0}) 1

, for any x giving the maximum, as well as their convex combinations. Our goal is to show that on

{\tilde{δ}}^{*}

these include the zero vector.

We finally arrive to the second important symmetry of the password checker. Let

ρ

be a permutation of the set

{1, \dots, n}

of password bits. Such a permutation could be seen both as a permutation on

X

(i.e.,

ρ (x) = x_{ρ (1)} \dots x_{ρ (n)}

; note that

ρ (x)

has the same number of 0s and 1s as x), as well as a permutation on

D

(a bit checking order d is itself a permutation of bits, so we can set

ρ (d) = ρ \circ d

). Since all low bits of

a_{0}

are the same, applying

ρ

to both x and d does not change the outcome of the algorithm, that is

C_{d a_{0}} = C_{ρ (d) a_{0}}^{ρ}

.

Intuitively, if d is selected uniformly, then the attacker cannot distinguish x from

ρ (x)

since they produce the same output with the same probability. More concretely, let

x \sim x^{'}

denote that

x^{'} = ρ (x)

for some bit permutation

ρ

. Due to the aforementioned symmetry we have that

ψ_{x}

and

ψ_{x^{'}}

have the same number of 1s which means that

δ^{*} \cdot ψ_{x} = δ^{*} \cdot ψ_{x^{'}}

. Now, let

x^{*} \in {argmax}_{x} δ^{*} \cdot ψ_{x}

. We have that all

x \sim x^{*}

also give the max; hence, all vectors:

{\tilde{ψ}}_{x} - ψ_{x} (d_{0}) 1 x \sim x^{*}

(A2)

are subgradients of f on

{\tilde{δ}}^{*}

. Finally, for any

d, d^{'}

there is a bit permutation

ρ

such that

d^{'} = ρ (d)

. Since

ψ_{x} (d) = ψ_{ρ (x)} (ρ (d))

, we have that

(\sum_{x \sim x^{*}} ψ_{x}) (d) = k

(for some integer k) independently of d. Hence, letting

c = | {x | x \sim x^{*}} |

and averaging all subgradients of (A2), we get that:

\frac{1}{c} \sum_{x \sim x^{*}} ({\tilde{ψ}}_{x} - ψ_{x} (d_{0}) 1) = \frac{1}{c} (k 1 - k 1) = 0

is also a subgradient, which concludes the proof. ☐

Appendix B. Properties of Binary Versions of Channel Operators

In this section, we derive some relevant properties of the binary versions of the hidden and visible choice operators. We start with results regarding each operator individually.

Proposition A1 (Properties of the binary hidden choice).

For any channels

C_{1}

and

C_{2}

of the same type, and any values

0 \leq p, q \leq 1

, the binary hidden choice operator satisfies the following properties:

(a): Idempotency: $C_{1}_{p} \oplus C_{1} = C_{1}$
(b): Commutativity: $C_{1}_{p} \oplus C_{2} = C_{2}_{\bar{p}} \oplus C_{1}$
(c): Associativity:

$C_{1}_{p} \oplus (C_{2}_{q} \oplus C_{3}) = (\frac{1}{q} \cdot C_{1}_{p} \oplus C_{2})_{q} \oplus \bar{p} \cdot C_{3}$

if $q \neq 0$ .
(d): Absorption:

$(C_{1}_{p} \oplus C_{2})_{q} \oplus (C_{1}_{r} \oplus C_{2}) = C_{1}_{(p q + \bar{q} r)} \oplus C_{2} .$

Proof.

We will prove each property separately.

(a): Idempotency:

$\begin{matrix} C_{1}_{p} \oplus C_{1} = & p \cdot C_{1} + \bar{p} \cdot C_{1} & (def . of hidden choice) \\ = & (p + \bar{p}) \cdot C_{1} \\ = & C_{1} & (p + \bar{p} = 1) \end{matrix}$
(b): Commutativity:

$\begin{matrix} C_{1}_{p} \oplus C_{2} = & p \cdot C_{1} + \bar{p} \cdot C_{2} & (def . of hidden choice) \\ = & \bar{p} \cdot C_{2} + p \cdot C_{1} \\ = & C_{2}_{\bar{p}} \oplus C_{1} & (def . of hidden choice) \end{matrix}$
(c): Associativity:

$\begin{matrix} C_{1}_{p} \oplus (C_{2}_{q} \oplus C_{3}) \\ = & p \cdot C_{1} + \bar{p} (q \cdot C_{2} + \bar{q} \cdot C_{3}) & (def . of hidden choice) \\ = & p \cdot C_{1} + \bar{p} q \cdot C_{2} + \bar{p} \bar{q} \cdot C_{3} \\ = & q (p (\frac{1}{q} \cdot C_{1}) + \bar{p} \cdot C_{2}) + \bar{q} (\bar{p} \cdot C_{3}) \\ = & (\frac{1}{q} \cdot C_{1}_{p} \oplus C_{2})_{q} \oplus \bar{p} \cdot C_{3} & (def . of hidden choice) \end{matrix}$
(d): Absorption: First note that:

$\begin{matrix} (C_{1}_{p} \oplus C_{2})_{q} \oplus (C_{1}_{r} \oplus C_{2}) \\ = & q (p C_{1} + \bar{p} C_{2}) + \bar{q} (r C_{1} + \bar{r} C_{2}) & (def . of hidden choice) \\ = & p q C_{1} + \bar{p} q C_{2} + \bar{q} r C_{1} + \bar{q} \bar{r} C_{2} \\ = & (p q + \bar{q} r) C_{1} + (\bar{p} q + \bar{q} \bar{r}) C_{2} \\ = & C_{1}_{(p q + \bar{q} r)} \oplus C_{2} & (*) \end{matrix}$

To complete the proof, note that in step (*) above, $p q + \bar{q} r$ and $\bar{p} q + \bar{q} \bar{r}$ form a valid binary probability distribution (they are both non-negative, and they add up to one), then apply the definition of hidden choice.

☐

Proposition A2 (Properties of binary visible choice).

For any compatible channels

C_{1}

and

C_{2}

, and any values

0 \leq p, q \leq 1

, the visible choice operator satisfies the following properties:

(a): Idempotency: $C_{1}_{p} ⌊ \cdot ⌋ C_{1} \approx C_{1}$
(b): Commutativity: $C_{1}_{p} ⌊ \cdot ⌋ C_{2} \approx C_{2}_{\bar{p}} ⌊ \cdot ⌋ C_{1}$
(c): Associativity: $C_{1}_{p} ⌊ \cdot ⌋ (C_{2}_{q} ⌊ \cdot ⌋ C_{3}) \approx (\frac{1}{q} \cdot C_{1}_{p} ⌊ \cdot ⌋ C_{2})_{q} ⌊ \cdot ⌋ \bar{p} \cdot C_{3}$ if $q \neq 0$ .

Proof.

We will prove each property separately.

(a): Idempotency: $C_{1}_{p} ⌊ \cdot ⌋ C_{1} \approx C_{1}$ , by immediate application of Lemma A1, since every column of the channel on each side of the equivalence can be written as a convex combination of the zero-column extension of the channel on the other side.
(b): Commutativity:

$\begin{matrix} C_{1}_{p} ⌊ \cdot ⌋ C_{2} = & p \cdot C_{1} ⋄ \bar{p} \cdot C_{2} & (def . of visible choice) \\ \approx & \bar{p} \cdot C_{2} ⋄ p \cdot C_{1} & (by Lemma A 1) \\ = & C_{2}_{\bar{p}} ⌊ \cdot ⌋ C_{1} & (def . of visible choice) . \end{matrix}$

In the above derivation, we can apply Lemma A1 because every column of the channel on each side of the equivalence can be written as a convex combination of the zero-column extension of the channel on the other side.
(c): Associativity:

$\begin{matrix} C_{1}_{p} ⌊ \cdot ⌋ (C_{2}_{q} ⌊ \cdot ⌋ C_{3}) = & p \cdot C_{1} ⋄ \bar{p} (q \cdot C_{2} ⋄ \bar{q} \cdot C_{3}) & (def . of visible choice) \\ \approx & p \cdot C_{1} ⋄ (\bar{p} q \cdot C_{2} ⋄ \bar{p} \bar{q} \cdot C_{3}) & (by Lemma A 1) \\ = & q (p (\frac{1}{q} \cdot C_{1}) ⋄ \bar{p} \cdot C_{2}) ⋄ \bar{q} (\bar{p} \cdot C_{3}) \\ = & (\frac{1}{q} \cdot C_{1}_{p} ⌊ \cdot ⌋ C_{2})_{q} ⌊ \cdot ⌋ \bar{p} \cdot C_{3} & (def . of visible choice) \end{matrix}$

In the above derivation, we can apply Lemma A1 because every column of the channel on each side of the equivalence can be written as a convex combination of the zero-column extension of the channel on the other side.

☐

Now, we turn our attention to the interaction between hidden and visible choice operators.

A first result is that hidden choice does not distribute over visible choice. To see why, note that

C_{1}_{p} \oplus (C_{2}_{q} ⌊ \cdot ⌋ C_{3})

and

(C_{1}_{p} \oplus C_{2})_{q} ⌊ \cdot ⌋ (C_{1}_{p} \oplus C_{3})

cannot be both defined: if the former is defined, then

C_{1}

must have the same type as

C_{2}_{q} ⌊ \cdot ⌋ C_{3}

, whereas if the latter is defined,

C_{1}

must have the same type as

C_{2}

, but

C_{2}_{q} ⌊ \cdot ⌋ C_{3}

and

C_{2}

do not have the same type (they have different output sets).

However, as the next result shows, visible choice distributes over hidden choice.

Proposition A3 (Distributivity of _p

⌊ \cdot ⌋

over _q⊕).

Let

C_{1}

,

C_{2}

and

C_{3}

be compatible channels, and

C_{2}

and

C_{3}

have the same type. Then, for any values

0 \leq p, q \leq 1

:

\begin{matrix} C_{1}_{p} ⌊ \cdot ⌋ (C_{2}_{q} \oplus C_{3}) \approx (C_{1}_{p} ⌊ \cdot ⌋ C_{2})_{q} \oplus (C_{1}_{p} ⌊ \cdot ⌋ C_{3}) . \end{matrix}

Proof.

\begin{matrix} C_{1}_{p} ⌊ \cdot ⌋ (C_{2}_{q} \oplus C_{3}) \\ = & (C_{1}_{q} \oplus C_{1})_{p} ⌊ \cdot ⌋ (C_{2}_{q} \oplus C_{3}) & (idempotency of visible choice) \\ = & p (q \cdot C_{1} + \bar{q} \cdot C_{1}) ⋄ \bar{p} (q \cdot C_{2} + \bar{q} \cdot C_{3}) & (def . of operators) \\ = & (p q \cdot C_{1} + p \bar{q} \cdot C_{1}) ⋄ (\bar{p} q \cdot C_{2} + \bar{p} \bar{q} \cdot C_{3}) \\ \approx & (p q \cdot C_{1} ⋄ \bar{p} q \cdot C_{2}) + (p \bar{q} \cdot C_{1} ⋄ \bar{p} \bar{q} \cdot C_{3}) & (by Lemma A 1) \\ = & q (p \cdot C_{1} ⋄ \bar{p} \cdot C_{2}) + \bar{q} (p \cdot C_{1} ⋄ \bar{p} \cdot C_{3}) \\ = & q (C_{1}_{p} ⌊ \cdot ⌋ C_{2}) + \bar{q} (C_{1}_{p} ⌊ \cdot ⌋ C_{3}) & (def . of visible choice) \\ = & (C_{1}_{p} ⌊ \cdot ⌋ C_{2})_{q} \oplus (C_{1}_{p} ⌊ \cdot ⌋ C_{3}) & (def . of hidden choice) \end{matrix}

In the above derivation, we can apply Lemma A1 because every column of the channel on each side of the equivalence can be written as a convex combination of the zero-column extension of the channel on the other side. ☐

References

Sun, Q.; Simon, D.R.; Wang, Y.M.; Russell, W.; Padmanabhan, V.N.; Qiu, L. Statistical identification of encrypted web browsing traffic. In Proceedings of the 2002 IEEE Symposium on Security and Privacy, Berkeley, CA, USA, 12–15 May 2002; pp. 19–30. [Google Scholar]
Dwork, C.; Mcsherry, F.; Nissim, K.; Smith, A. Calibrating noise to sensitivity in private data analysis. In Proceedings of the Theory of Cryptography Conference, New York, NY, USA, 4–7 March 2006; Lecture Notes in Computer Science. Springer: Berlin, Germany, 2006; Volume 3876, pp. 265–284. [Google Scholar]
Chaum, D. The Dining Cryptographers Problem: Unconditional Sender and Recipient Untraceability. J. Cryptol. 1988, 1, 65–75. [Google Scholar] [CrossRef]
Boreale, M.; Pampaloni, F. Quantitative information flow under generic leakage functions and adaptive adversaries. Log. Methods Comput. Sci. 2015, 11, 166–181. [Google Scholar] [CrossRef]
Mardziel, P.; Alvim, M.S.; Hicks, M.W.; Clarkson, M.R. Quantifying Information Flow for Dynamic Secrets. In Proceedings of the 2014 IEEE Symposium on Security and Privacy (SP), San Jose, CA, USA, 18–21 May 2014; pp. 540–555. [Google Scholar]
Alvim, M.S.; Chatzikokolakis, K.; Kawamoto, Y.; Palamidessi, C. Information Leakage Games. In Proceedings of the International Conference on Decision and Game Theory for Security, Vienna, Austria, 23–25 October 2017; Lecture Notes in Computer Science. Springer: Berlin, Germany, 2017; Volume 10575, pp. 437–457. [Google Scholar]
Rizzo, J.; Duong, T. The CRIME attack. In Proceedings of the 2012 8th EKOparty Security Conference, Buenos Aires, Argentina, 19–21 September 2012. [Google Scholar]
Alvim, M.S.; Chatzikokolakis, K.; McIver, A.; Morgan, C.; Palamidessi, C.; Smith, G. Axioms for Information Leakage. In Proceedings of the 2016 IEEE 29th Computer Security Foundations Symposium (CSF), Lisbon, Portugal, 27 June–1 July 2016; pp. 77–92. [Google Scholar]
Smith, G. On the Foundations of Quantitative Information Flow. In Proceedings of the International Conference on Foundations of Software Science and Computational Structures, York, UK, 22–29 March 2009; Lecture Notes in Computer Science. Springer: Berlin, Germany, 2009; Volume 5504, pp. 288–302. [Google Scholar]
Chatzikokolakis, K.; Palamidessi, C.; Panangaden, P. On the Bayes risk in information-hiding protocols. J. Comput. Secur. 2008, 16, 531–571. [Google Scholar] [CrossRef]
Shannon, C.E. A Mathematical Theory of Communication. Bell Syst. Tech. J. 1948, 27, 379–423, 625–656. [Google Scholar] [CrossRef]
Massey, J.L. Guessing and Entropy. In Proceedings of the IEEE International Symposium on Information Theory, Trondheim, Norway, 27 June–1 July 1994; IEEE: Piscataway, NJ, USA, 1994; p. 204. [Google Scholar]
Alvim, M.S.; Chatzikokolakis, K.; Palamidessi, C.; Smith, G. Measuring Information Leakage Using Generalized Gain Functions. In Proceedings of the 2012 IEEE 25th Computer Security Foundations Symposium (CSF), Cambridge, MA, USA, 25–27 June 2012; pp. 265–279. [Google Scholar]
Alvim, M.S.; Chatzikokolakis, K.; Kawamoto, Y.; Palamidessi, C. Leakage and protocol composition in a game-theoretic perspective. In Proceedings of the International Conference on Principles of Security and Trust, Thessaloniki, Greece, 16–19 April 2018; Lecture Notes in Computer Science. Springer: Berlin, Germany, 2018. [Google Scholar]
Osborne, M.J.; Rubinstein, A. A Course in Game Theory; The MIT Press: Cambridge, MA, USA, 1994. [Google Scholar]
Braun, C.; Chatzikokolakis, K.; Palamidessi, C. Quantitative Notions of Leakage for One-try Attacks. In Proceedings of the Proceedings of the 25th Conference on Mathematical Foundations of Programming Semantics, Oxford, UK, 3–7 April 2009; Electronic Notes in Theoretical Computer Science. Elsevier: New York, NY, USA, 2009; Volume 249, pp. 75–91. [Google Scholar]
McIver, A.; Morgan, C.; Smith, G.; Espinoza, B.; Meinicke, L. Abstract Channels and Their Robust Information-Leakage Ordering. In Proceedings of the International Conference on Principles of Security and Trust, Grenoble, France, 5–13 April 2014; Lecture Notes in Computer Science. Springer: Berlin, Germany, 2014; Volume 8414, pp. 83–102. [Google Scholar]
Basar, T. The Gaussian test channel with an intelligent jammer. IEEE Trans. Inf. Theory 1983, 29, 152–157. [Google Scholar] [CrossRef]
Grossklags, J.; Christin, N.; Chuang, J. Secure or Insure?: A Game-theoretic Analysis of Information Security Games. In Proceedings of the 17th International Conference on World Wide Web, Beijing, China, 21–25 April 2008; pp. 209–218. [Google Scholar]
Alpcan, T.; Buchegger, S. Security Games for Vehicular Networks. IEEE Trans. Mob. Comput. 2011, 10, 280–290. [Google Scholar] [CrossRef]
Katz, J. Bridging Game Theory and Cryptography: Recent Results and Future Directions. In Proceedings of the Theory of Cryptography Conference, Zurich, Switzerland, 9–11 February 2008; pp. 251–272. [Google Scholar]
Acquisti, A.; Dingledine, R.; Syverson, P.F. On the Economics of Anonymity. In Proceedings of the International Conference on Financial Cryptography, Guadeloupe, France, 27–30 January 2003; pp. 84–102. [Google Scholar]
Freudiger, J.; Manshaei, M.H.; Hubaux, J.P.; Parkes, D.C. On Non-cooperative Location Privacy: A Game-theoretic Analysis. In Proceedings of the 16th ACM Conference on Computer and Communications Security, Chicago, IL, USA, 9–13 November 2009; pp. 324–337. [Google Scholar]
Zhu, Q.; Fung, C.J.; Boutaba, R.; Basar, T. A game-theoretical approach to incentive design in collaborative intrusion detection networks. In Proceedings of the GameNets ’09 International Conference on Game Theory for Networks, Istanbul, Turkey, 13–15 May 2009; IEEE: Piscataway, NJ, USA, 2009; pp. 384–392. [Google Scholar]
Manshaei, M.H.; Zhu, Q.; Alpcan, T.; Bacşar, T.; Hubaux, J.P. Game Theory Meets Network Security and Privacy. ACM Comput. Surv. 2013, 45, 25. [Google Scholar] [CrossRef]
Korzhyk, D.; Yin, Z.; Kiekintveld, C.; Conitzer, V.; Tambe, M. Stackelberg vs. Nash in Security Games: An Extended Investigation of Interchangeability, Equivalence, and Uniqueness. J. Artif. Intell. Res. 2011, 41, 297–327. [Google Scholar]
Khouzani, M.H.R.; Malacaria, P. Relative Perfect Secrecy: Universally Optimal Strategies and Channel Design. In Proceedings of the 2016 IEEE 29th Computer Security Foundations Symposium (CSF), Lisbon, Portugal, 27 June–1 July 2016; pp. 61–76. [Google Scholar]
Alon, N.; Emek, Y.; Feldman, M.; Tennenholtz, M. Adversarial Leakage in Games. SIAM J. Discret. Math. 2013, 27, 363–385. [Google Scholar] [CrossRef][Green Version]
Xu, H.; Jiang, A.X.; Sinha, A.; Rabinovich, Z.; Dughmi, S.; Tambe, M. Security Games with Information Leakage: Modeling and Computation. In Proceedings of the 24th International Conference on Artificial Intelligence, Buenos Aires, Argentina, 25–31 July 2015; pp. 674–680. [Google Scholar]
Khouzani, M.H.R.; Mardziel, P.; Cid, C.; Srivatsa, M. Picking vs. Guessing Secrets: A Game-Theoretic Analysis. In Proceedings of the IEEE 28th Computer Security Foundations Symposium, Verona, Italy, 13–17 July 2015; pp. 243–257. [Google Scholar]
Yang, M.; Sassone, V.; Hamadou, S. A Game-Theoretic Analysis of Cooperation in Anonymity Networks. In Proceedings of the International Conference on Principles of Security and Trust, Tallinn, Estonia, 24 March–1 April 2012; pp. 269–289. [Google Scholar]
Shokri, R.; Theodorakopoulos, G.; Troncoso, C. Privacy Games Along Location Traces: A Game-Theoretic Framework for Optimizing Location Privacy. ACM Trans. Priv. Secur. 2017, 19, 11. [Google Scholar] [CrossRef]
Kawamoto, Y.; Chatzikokolakis, K.; Palamidessi, C. On the Compositionality of Quantitative Information Flow. Log. Methods Comput. Sci. 2017, 13, 1–31. [Google Scholar]
Kawamoto, Y.; Biondi, F.; Legay, A. Hybrid Statistical Estimation of Mutual Information for Quantifying Information Flow. In Proceedings of the International Symposium on Formal Methods, Limassol, Cyprus, 9–11 November 2016; pp. 406–425. [Google Scholar]

Figure 1. Alternative programs for the running example.

Figure 2. Summary of the results for the running example introduced in Section 3, for a uniform prior. Graph (A) is for the case of visible choice: it represents the Bayes vulnerability

V

of

C_{00}_{p} ⌊ \cdot ⌋ C_{10}

and of

C_{01}_{p} ⌊ \cdot ⌋ C_{11}

(cases

a = 0

and

a = 1

, respectively), as a function of p; Graph (B) is for the case of hidden choice, and it represents the vulnerability of

C_{00}_{p} \oplus C_{10}

and of

C_{01}_{p} \oplus C_{11}

as a function of p. The table on the right gives the payoff in correspondence of the Nash equilibrium for the various games. VI

_{m}

and VI

_{b}

represent the attacker-first sequential games with defender strategy of type

D (A \to D)

(mixed) and

A \to D (D)

(behavioral), respectively.

Figure 3. Order of games w.r.t. the payoff in the Nash equilibrium. Games higher in the lattice have larger payoff.

Table 1. The four channels

C_{d a}

for

d, a \in {0, 1}

for the running example.

Table 1. The four channels

C_{d a}

for

d, a \in {0, 1}

for the running example.

Table 2. Bayes vulnerability of each channel

C_{d a}

for the running example.

Table 2. Bayes vulnerability of each channel

C_{d a}

for the running example.

$V$	a = 0	a = 1
d = 0	$\frac{1}{2}$	1
d = 1	1	$\frac{2}{3}$

Table 3. Kinds of games we consider. Sequential games have perfect information, except for Game V.

		Order of Action
		Simultaneous	Defender First	Attacker First
Defender’s choice	visible $⌊ \cdot ⌋$	Game I	Game II	Game III
Defender’s choice	hidden ⨊	Game IV	Game V	Game VI

Table 4. Channel

C_{123, 101}

modeling the case in which the defender compares bits in order (1, 2, 3) and the attacker picks low-input 101.

Table 4. Channel

C_{123, 101}

modeling the case in which the defender compares bits in order (1, 2, 3) and the attacker picks low-input 101.

$C_{123, 101}$	y = (F, 1)	y = (F, 2)	y = (F, 3)	y = (T, 3)
x = 000	1	0	0	0
x = 001	1	0	0	0
x = 010	1	0	0	0
x = 011	1	0	0	0
x = 100	0	0	1	0
x = 101	0	0	0	1
x = 110	0	1	0	0
x = 111	0	1	0	0

Table 5. Channel

C_{cons, 101}

modeling the case in which the defender runs a constant-time checker and the attacker picks low-input 101.

Table 5. Channel

C_{cons, 101}

modeling the case in which the defender runs a constant-time checker and the attacker picks low-input 101.

$C_{cons, 101}$	y = (F, 3)	y = (T, 3)
x = 000	1	0
x = 001	1	0
x = 010	1	0
x = 011	1	0
x = 100	1	0
x = 101	0	1
x = 110	1	0
x = 111	1	0

Table 6. Payoff for each pure strategy profile of 3-bit password scenario.

		Attacker’s Action a
	U( $d$ , $a$ )	000	001	010	011	100	101	110	111
Defender’s action d	123	0.7257	0.7257	0.9311	0.9311	0.6577	0.6577	0.7122	0.7122
	132	0.8900	0.9311	0.8900	0.9311	0.7122	0.7122	0.7122	0.7122
	213	0.5068	0.5068	0.9311	0.9311	0.4934	0.4934	0.7668	0.7668
	231	0.5068	0.5068	0.7668	0.9311	0.5068	0.5068	0.7668	0.9311
	312	0.7257	0.9311	0.7257	0.9311	0.7122	0.8766	0.7122	0.8766
	321	0.6712	0.7122	0.7257	0.9311	0.6712	0.7122	0.7257	0.9311

Table 7. Payoff table for each pure strategy profile of the 3-bit password scenario, under prior

π^{(A)}

.

Table 7. Payoff table for each pure strategy profile of the 3-bit password scenario, under prior

π^{(A)}

.

		Attacker’s Action a
	U( $d$ , $a$ )	000	001	010	011	100	101	110	111
Defender’s action d	123	0.75	0.75	0.75	0.75	0.25	0.25	0.25	0.25
	132	0.75	0.75	0.75	0.75	0.25	0.25	0.25	0.25
	213	0.75	0.75	0.75	0.75	0.50	0.50	0.50	0.50
	231	0.75	0.75	0.75	0.75	0.75	0.75	0.75	0.75
	312	0.75	0.75	0.75	0.75	0.50	0.50	0.50	0.50
	321	0.75	0.75	0.75	0.75	0.75	0.75	0.75	0.75

Table 8. Payoff table for each pure strategy profile of the 3-bit password scenario, under prior

π^{(B)}

.

Table 8. Payoff table for each pure strategy profile of the 3-bit password scenario, under prior

π^{(B)}

.

		Attacker’s Action a
	U( $d$ , $a$ )	000	001	010	011	100	101	110	111
Defender’s action d	123	0.70	0.70	0.60	0.60	0.50	0.50	0.45	0.4
	132	0.70	0.65	0.70	0.65	0.50	0.50	0.50	0.5
	213	0.70	0.70	0.55	0.55	0.60	0.60	0.50	0.5
	231	0.70	0.70	0.55	0.55	0.70	0.70	0.55	0.5
	312	0.70	0.65	0.70	0.65	0.60	0.60	0.60	0.6
	321	0.70	0.65	0.65	0.60	0.70	0.65	0.65	0.6

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

A Game-Theoretic Approach to Information-Flow Control via Protocol Composition

Abstract

1. Introduction

Plan of the Paper

2. Preliminaries

2.1. Basic Concepts from Game Theory

2.1.1. Two-Player Games

2.1.2. Simultaneous Games

2.1.3. Sequential Games

2.1.4. Zero-Sum Games and the Minimax Theorem

2.2. Quantitative Information Flow

2.2.1. Secrets and Vulnerability

2.2.2. Channels, Posterior Vulnerability and Leakage

3. An Illustrative Example

4. Choice Operators for Protocol Composition

4.1. Matrices and Their Basic Operators

4.2. Channels and Their Hidden and Visible Choice Operators

4.2.1. Hidden Choice

4.2.2. Visible Choice

4.3. Properties of Hidden and Visible Choice Operators

4.4. Properties of Vulnerability w.r.t. Channel Operators

5. Information Leakage Games

5.1. Defining Information Leakage Games

5.1.1. Game I (Simultaneous with Visible Choice)

5.1.2. Game II (Defender-First with Visible Choice)

5.1.3. Game III (Attacker-First with Visible Choice)

5.1.4. Game IV (Simultaneous with Hidden Choice)

5.1.5. Game V (Defender-First with Hidden Choice)

5.1.6. Game VI (Attacker-First with Hidden Choice)

6. Comparing the Leakage Games

6.1. Simultaneous Games vs. Sequential Games

6.2. Visible Choice vs. Hidden Choice

7. Case Study: A Safer, Faster Password-Checker

7.1. Modeling the Trade-Off between Efficiency and Security as a Game

7.2. On Optimal Strategies for the Defender

8. Related Work

9. Conclusions and Future Work

Author Contributions

Acknowledgments

Conflicts of Interest

Appendix A. Proofs of Technical Results

Appendix A.1. Preliminaries for Proofs

Appendix A.2. Proofs of Section 4

Appendix A.3. Proofs of Section 7

Appendix B. Properties of Binary Versions of Channel Operators

References

Article Metrics

Citations

Article Access Statistics