Abstract
Quantitative Information Flow (QIF) and Differential Privacy (DP) are both concerned with the protection of sensitive information, but they are rather different approaches. In particular, QIF considers the expected probability of a successful attack, while DP (in both its standard and local versions) is a max-case measure, in the sense that it is compromised by the existence of a possible attack, regardless of its probability. Comparing systems is a fundamental task in these areas: one wishes to guarantee that replacing a system A by a system B is a safe operation, that is, that the privacy of B is no worse than that of A. In QIF, a refinement order provides such strong guarantees, while, in DP, mechanisms are typically compared w.r.t. the privacy parameter ε in their definition. In this paper, we explore a variety of refinement orders, inspired by the one of QIF, providing precise guarantees for max-case leakage. We study simple structural ways of characterising them, the relation between them, efficient methods for verifying them and their lattice properties. Moreover, we apply these orders in the task of comparing DP mechanisms, raising the question of whether the order based on ε provides strong privacy guarantees. We show that, while this is often the case for mechanisms of the same “family” (geometric, randomised response, etc.), it rarely holds across different families.
1. Introduction
The enormous growth in the use of internet-connected devices and the big-data revolution have created serious privacy concerns, and motivated an intensive area of research aimed at devising methods to protect the users’ sensitive information. During the last decade, two main frameworks have emerged in this area: Differential Privacy (DP) and Quantitative Information Flow (QIF).
Differential privacy (DP) [1] was originally developed in the area of statistical databases, and it aims at protecting the individuals’ data while allowing the release of aggregate information through queries. This is obtained by obfuscating the result of the query via the addition of controlled noise. Naturally, we need to assume that the curator, namely the entity collecting and storing the data and handling the queries, is honest and capable of protecting the data from security breaches. Since this assumption cannot always be guaranteed, a variant has been proposed: local differential privacy (LDP) [2], where the data are obfuscated individually before they are collected.
Both DP and LDP are subsumed by d-privacy [3], and, in this paper, we will use the latter as a unifying framework. The definition of d-privacy assumes an underlying metric structure d on the data domain X. An obfuscation mechanism K for X is a probabilistic mapping from X to some output domain Y, namely a function from X to probability distributions over Y. We will use the notation K(x)(y) to represent the probability that K on input x gives output y. The mechanism K is εd-private, where ε is a parameter representing the privacy level, if
K(x)(y) ≤ e^{ε·d(x,x′)} · K(x′)(y)   for all x, x′ ∈ X and y ∈ Y.   (1)
Standard DP is obtained from this definition by assuming X to be the set of all datasets and d to be the Hamming distance between two datasets, seen as vectors of records (i.e., the number of positions in which the two datasets differ). Note that the more common definition of differential privacy assumes that x and x′ are adjacent, i.e., that their Hamming distance is 1, and requires K(x)(y) ≤ e^ε · K(x′)(y). It is easy to prove that the two definitions are equivalent. As for LDP, it is obtained by considering the so-called discrete metric, which assigns distance 0 to identical elements, and 1 otherwise.
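For concreteness, the following is a minimal Python sketch (our own, not part of the paper) of condition (1): it checks whether a channel, given as a row-stochastic numpy array, satisfies ε·d-privacy for a given metric d, and recovers local DP by instantiating d with the discrete metric.

```python
import numpy as np

def satisfies_d_privacy(K, d, eps):
    """Check K[x, y] <= exp(eps * d[x, x']) * K[x', y] for all x, x', y.
    K: row-stochastic matrix (secrets x observations); d: matrix of distances."""
    n = K.shape[0]
    for x in range(n):
        for x2 in range(n):
            if np.any(K[x] > np.exp(eps * d[x, x2]) * K[x2] + 1e-12):
                return False
    return True

# Local DP is the special case where d is the discrete metric:
d_discrete = 1 - np.eye(3)
K = np.array([[0.6, 0.2, 0.2],
              [0.2, 0.6, 0.2],
              [0.2, 0.2, 0.6]])
print(satisfies_d_privacy(K, d_discrete, np.log(3)))   # True: all column ratios are at most 3
```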
The other framework for the protection of sensitive information, quantitative information flow (QIF), focuses on the potentialities and the goals of the attacker, and the research in this area has developed rigorous foundations based on information theory [4,5]. The idea is that a system processing some sensitive data from a random variable X and releasing some observable data as a random variable Y can be modelled as an information-theoretic channel with input X and output Y. The leakage is then measured in terms of correlation between X and Y. There are, however, many different ways to define such correlation, depending on the notion of adversary. In order to provide a unifying approach, Ref. [6] has proposed the theory of g-leakage, in which an adversary is characterized by a functional parameter g representing its gain for each possible outcome of the attack.
One issue that arises in both frameworks is how to compare systems from the point of view of their privacy guarantees. It is important to have a rigorous and effective way to establish whether a mechanism is better or worse than another one, in order to guide the design and the implementation of mechanisms for information protection. This is not always an obvious task. To illustrate the point, consider the following examples.
Example 1.
Consider the four programs illustrated in Table 1, where H is a “high” (i.e., secret) input and L is a “low” (i.e., public) output. We assume that H is a uniformly distributed 32-bit integer. All these programs leak information about H via L, in different ways: the first reveals H whenever it is a multiple of 8 (H mod 8 denotes the remainder of the integer division of H by 8), and reveals nothing otherwise. The second does the same thing whenever H is a multiple of 4. The third reveals the last 8 bits of H (by computing the bitwise conjunction between H and a string of 24 bits “0” followed by 8 bits “1”). Analogously, the fourth reveals the last 4 bits of H. Now, it is clear that the second leaks more than the first, and that the third leaks more than the fourth, but how to compare, for instance, the first and the third? It is debatable which one is worse because their behavior is very different: the first reveals nothing in most cases, but when it does reveal something, it reveals everything. The third, on the other hand, always reveals part of the secret. Clearly, we cannot decide which situation is worse unless we have some more information about the goals and the capabilities of the attacker. For instance, if the adversary has only one attempt at his disposal (and no extra information), then the third program is better because even after the output of L there are still 24 bits of H that are unknown. On the other hand, if the adversary can repeat the attack on programs similar to the third one, then eventually it will uncover the secret entirely.
Table 1.
Programs that take as input a secret H and leak information about it via the output L.
Example 2.
Consider the domain of the integer numbers between 0 and 100, a geometric mechanism (cf. Definition 9) on this domain, and a randomized response mechanism (cf. Definition 13) on the same domain. The two mechanisms are illustrated in Figure 1. They both satisfy εd-privacy (for appropriate values of ε), but for different metrics d: in the first case d is the standard distance between numbers, while in the second case it is the discrete metric. Clearly, it does not make sense to compare these mechanisms on the basis of their respective privacy parameters ε because they represent different privacy properties, and it is not obvious how to compare them in general: the geometric mechanism tends to make the true value indistinguishable from its immediate neighbors, but it separates it from the values far away. The randomized response introduces the same level of confusion between the true value and any other value of the domain. Thus, which mechanism is more private depends on the kind of attack we want to mitigate: if the attacker is trying to guess an approximation of the value, then the randomized response is better. If the attacker is only interested in identifying the true value among the immediate neighbors, then the geometric is better. Indeed, note that in the subdomain of the numbers between 40 and 60 the geometric mechanism also satisfies εd-privacy with d being the discrete metric, for a suitable ε, and in any subdomain smaller than that it satisfies discrete-metric privacy with an even smaller parameter.
Figure 1.
Comparison between the geometric (red) and the randomized response (blue) mechanisms. The area between 40 and 60, delimited by the green lines, represents the sub-domain where the geometric mechanism also satisfies εd-privacy with d the discrete metric (for a suitable ε).
In this respect, the QIF approach has led to an elegant theory of a refinement (pre)order (in this paper, we call this and the other refinement relations “orders”, although, strictly speaking, they are preorders), which provides strong guarantees: the fact that A is refined by B means that B is safer than A in all circumstances, in the sense that the expected gain of an attack on B never exceeds that of an attack on A, for whatever kind of gain the attacker may be seeking. This means that we can always substitute the component A by B without compromising the security of the system. An appealing aspect of this particular refinement order is that it is characterized by a precise structural relation between the stochastic channels associated with A and B [6,7], which makes it easy to reason about, and relatively efficient to verify. It is important to remark that this order is based on an average notion of adversarial gain (vulnerability), defined by averaging over all possible observations and their probabilities. We call this perspective average-case.
At the other end of the spectrum, DP, LDP and d-privacy are max-case measures. In fact, by applying the Bayes theorem to (1), we obtain:
p(x | y) / p(x′ | y) ≤ e^{ε·d(x,x′)} · p(x) / p(x′),   (2)
where p(x) and p(x′) are the prior probabilities of x and x′, and p(x | y) and p(x′ | y) are their posterior probabilities given y. We can interpret these ratios as knowledge about x versus x′: they represent how much more likely x is with respect to x′, before (prior) and after (posterior) observing y, respectively. Thus, property (2) expresses a bound on how much the adversary can learn from each individual outcome of the mechanism (property (2) is also the semantic interpretation of the guarantees of the Pufferfish framework (cf. [8], Section 3.1); the ratio p(x)/p(x′) is known as the odds ratio).
In the literature of DP, LDP and d-privacy, mechanisms are usually compared on the basis of their ε-value (in DP and LDP, ε is a parameter that usually appears explicitly in the definition of the mechanism; in d-privacy, it is an implicit scaling factor), which controls a bound on the log-likelihood ratio of an observation y given two “secrets” x and x′: a smaller ε means more privacy. In DP and LDP, the bound is ε itself, while in d-privacy it is ε·d(x,x′). We remark that the relation induced by ε in d-privacy is fragile, in the sense that the definition of d-privacy assumes an underlying metric structure on the data, and whether a mechanism B is “better” than A depends in general on the metric considered.
Average-case and max-case are different principles, suitable for different scenarios: the former represents the point of view of an organization, for instance an insurance company providing coverage for risks related to credit cards, which for the cost–benefit analysis is interested in reasoning in terms of expectation (expected cost of an attack). The max-case represents the point of view of an individual, who is interested in limiting the cost of any attack. As such, the max-case seems particularly suitable for the domain of privacy.
In this paper, we combine the max-case perspective with the robustness of the QIF approach, and we introduce two refinement orders:
- The max-case refinement order, based on the max-case leakage introduced in [9]. This order takes into account all possible privacy breaches caused by any observable (like in the DP world), but it quantifies over all possible quasi-convex vulnerability functions (in the style of the QIF world).
- The privacy-based refinement order, based on d-privacy (like in the DP world), but quantified over all metrics d.
To underline the importance of a robust order, let us consider the case of the oblivious mechanisms for differential privacy: these mechanisms are of the form K = H ∘ f, where f is a query, namely a function from datasets in X to some answer domain Y, and H is a probabilistic mechanism implementing the noise. The idea is that the system first computes the result y = f(x) of the query (the true answer), and then it applies H to y to obtain a reported answer z. In general, if we want K to be ε-DP, we need to tune the mechanism H in order to take into account the sensitivity of f, which is the maximum distance between the results of f on two adjacent databases, and as such it depends on the metric on Y. However, if we know that H ∘ f is ε-DP, and that H is refined, in the privacy-based sense, by some other mechanism H′, then we can safely substitute H by H′ as it is, because one of our results (cf. Theorem A5) guarantees that H′ ∘ f is also ε-DP. In other words, the privacy-based refinement implies that we can substitute H by H′ in an oblivious mechanism for whatever query f and whatever metric on Y, without the need to know the sensitivity of f and without the need to do any tuning of H′. Thanks to Theorems 3 and 4, we know that this is the case also for the average-case and the max-case refinement orders. We illustrate this with the following example.
Example 3.
Consider datasets of records containing the age of people, expressed as natural numbers from 0 to 100, and assume that each dataset in X contains at least 100 records. Consider two queries, f and g, which give the rounded average age and the minimum age of the people in x, respectively. Finally, consider the truncated geometric mechanism TG (cf. Definition 10), and the randomized response mechanism R (cf. Definition 13), both constructed with the same privacy parameter ε. It is easy to see that TG ∘ f is ε-DP, and it is possible to prove that TG is privacy-refined by R (cf. Theorem 14). We can then conclude that R ∘ f is ε-DP as well, and that in general it is safe to replace TG by R for whatever query. On the other hand, the reverse refinement does not hold, so we cannot expect that it is safe to replace R by TG in any context. In fact, R ∘ g is ε-DP, but TG ∘ g is not ε-DP, despite the fact that both mechanisms are constructed using the same privacy parameter ε. Hence, we can conclude that a refinement relation based only on the comparison of the ε parameters would not be robust, at least not for a direct replacement in an arbitrary context. In order to make TG ∘ g ε-DP, we should divide the parameter ε by the sensitivity of g (with respect to the ordinary distance on natural numbers), which is 100. For R, this is not necessary because it is defined using the discrete metric on Y, and the sensitivity of g with respect to this metric is 1.
The robust orders allow us to take into account different kinds of adversaries. The following example shows what the idea is.
Example 4.
Consider the following three LDP mechanisms, represented by their stochastic matrices (where each element is the conditional probability of the outcome of the mechanism, given the secret value). The secrets are three possible economic situations of an individual, p, a and r, standing for “poor”, “average” and “rich”, respectively. The observable outcomes are r and not-r, standing for “rich” and “not rich”.
Let us assume that the prior distribution π on the secrets is uniform. We note that both A and B satisfy ε-LDP, and the ε of B is smaller than that of A. Hence, if we only look at the value of ε, we would think that B is better than A from the privacy point of view. However, there are attackers that gain more from B than from A (which means that, with respect to those attackers, the privacy of B is worse). For instance, this is the case when the attacker is only interested in discovering whether the person is rich or not. In fact, if we consider a gain 1 when the attacker guesses the right class (r versus (either p or a)) and 0 otherwise, we have that the highest expected gain achievable against B is strictly higher than the one achievable against A. This is consistent with our orders: it is possible to show that none of the three orders hold between A and B, and that therefore we should not expect B to be better (for privacy) than A with respect to all possible adversaries.
On the other hand, the mechanism C satisfies LDP with the same ε as B, and in this case the refinement of A by C does hold, implying that we can safely replace A by C. We can also prove that the reverse does not hold, which means that C is strictly better than A.
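The following sketch illustrates the kind of computation behind such a comparison. The matrices below are made up for illustration only (the paper’s actual A, B and C are not reproduced here); the helper name posterior_gain is ours.

```python
import numpy as np

# Illustrative channels (made-up entries, not the ones from the paper):
# secrets are (p, a, r), observations are (r, not-r).
A = np.array([[0.9, 0.1],     # max column ratio 9  -> A is ln(9)-LDP
              [0.1, 0.9],
              [0.1, 0.9]])
B = np.array([[0.2, 0.8],     # max column ratio 4  -> B is ln(4)-LDP (smaller epsilon)
              [0.2, 0.8],
              [0.8, 0.2]])

# Adversary interested only in "rich vs not rich": gain 1 for the right class.
g = np.array([[0.0, 0.0, 1.0],    # action "guess rich"
              [1.0, 1.0, 0.0]])   # action "guess not rich"
pi = np.ones(3) / 3

def posterior_gain(C, pi, g):
    """Expected gain of an optimal guess after observing the output of C."""
    joint = pi[:, None] * C
    return sum(max(g[w] @ joint[:, y] for w in range(len(g)))
               for y in range(C.shape[1]))

print(posterior_gain(A, pi, g))   # ~0.67: no better than guessing blindly
print(posterior_gain(B, pi, g))   # 0.80: this adversary gains more from B, despite its smaller epsilon
```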
A fundamental issue is how to prove that these robust orders hold: since the max-case and the privacy-based orders involve universal quantifications, it is important to devise finitary methods to verify them. To this purpose, we will study their characterizations as structural relations between stochastic matrices (representing the mechanisms to be compared), along the lines of what was done for the average-case order.
We will also study the relation between the three orders (the two above and the average-case one), and their algebraic properties. Finally, we will analyze various mechanisms for DP, LDP, and d-privacy to see in which cases the order induced by ε is consistent with the three orders above.
1.1. Contribution
The main contributions of this paper are the following:
- We introduce two refinement orders for the max case, the max-case refinement order and the privacy-based refinement order, that are robust with respect to a large class of adversaries.
- We give structural characterizations of both orders in terms of relations on the stochastic matrices of the mechanisms under comparison. These relations help the intuition and open the way to verification.
- We study efficient methods to verify the structural relations above. Furthermore, these methods are such that, when the verification fails, they produce counterexamples. In this way, it is possible to pin down what the problem is and try to correct it.
- We show that the average-case refinement order is strictly stronger than the max-case one, which in turn is strictly stronger than the privacy-based one.
- We apply the three orders to the comparison of some well-known families of d-private mechanisms: geometric, exponential and randomised response. We show that, in general, the average-case refinement of A by B (and thus all the refinement orders between A and B) holds within the same family whenever the ε of B is smaller than that of A.
- We show that, if A and B are mechanisms from different families, then, even if the ε of B is smaller than that of A, the average-case and the max-case refinement relations do not hold, and in most cases the privacy-based one does not hold either. We conclude that a comparison based only on the value of the ε’s is not robust across different families, at least not for the purposes illustrated above.
- We study lattice properties of these orders. In contrast to the average-case refinement order, which was shown not to be a lattice, we prove that suprema and infima exist for the max-case and the privacy-based orders, and that therefore these orders form lattices.
1.2. Related Work
We are not aware of many studies on refinement relations for QIF. Yasuoka and Terauchi [10] and Malacaria [11] have explored strong orders on deterministic mechanisms, focusing on the fact that such mechanisms induce partitions on the space of secrets. They showed that the orders produced by min-entropy leakage [5] and Shannon leakage [12,13] are the same and, moreover, they coincide with the partition refinement order in the Lattice of Information [14]. This order was extended to the probabilistic case in [6], resulting in the refinement relation mentioned in Section 2. The same paper [6] proposed the theory of g-leakage and introduced the corresponding average-case leakage order. Furthermore, [6] proved that the structural refinement implies the leakage order and conjectured that also the reverse should hold. This conjecture was then proved valid in [7]. The max-case leakage, on which the max-case leakage order is based, was introduced in [9], but the corresponding refinement order and its properties were not investigated. Finally, the privacy-based refinement order is a novel notion introduced in this paper.
In the field of differential privacy, on the other hand, there have been various works aimed at trying to understand the operational meaning of the privacy parameter ε and at providing guidelines for the choice of its values. We mention, for example, [15,16], which consider the value of ε from an economic point of view, in terms of cost. We are not aware, however, of studies aimed at establishing orders between the level of privacy of different mechanisms, except the one based on the comparison of the ε’s.
The relation between QIF and DP, LDP, and d-privacy is based on the so-called semantic interpretation of these privacy notions, which regards them as expressing a bound on the increase of knowledge (from prior to posterior) due to the answer reported by the mechanism. For d-privacy, the semantic interpretation is expressed by (2). To the best of our knowledge, this interpretation was first pointed out (for the location privacy instance) in [17]. The seminal paper on d-privacy, [3], also proposed a semantic interpretation, with a rather different flavor, although formally equivalent. As for DP, as explained in the Introduction, (2) instantiated to databases and the Hamming distance corresponds to the bound on the odds ratio on which the semantic interpretation provided in [8] is based. Before that, another version of semantic interpretation was presented in [1] and proved equivalent to a form of DP called ε-indistinguishability. Essentially, in this version, an adversary that queries the database, and knows all the database except one record, cannot infer too much about this record from the answer to the query reported by the mechanism. Later on, an analogous version of semantic interpretation was reformulated in [18] and proved equivalent to DP. A different interpretation of DP, called semantic privacy, was proposed by [19]. This interpretation is based on a comparison between two posteriors (rather than between the posterior and the prior), and the authors show that, within certain limits, it is equivalent to DP.
A short version of this paper, containing only some of the proofs, appeared in [20].
1.3. Plan of the Paper
In the next three sections, Section 2, Section 3 and Section 4, we define the average-case, max-case and privacy-based refinement orders, respectively, and we study their properties. In Section 5, we investigate methods to verify them. In Section 6, we consider various mechanisms for DP and its variants, and we investigate the relation between the parameter ε and the orders introduced in this paper. In Section 7, we show that the max-case and privacy-based orders form lattices. Finally, Section 8 provides conclusions.
Note: In order to make the reading of the paper more fluid, we have moved all the proofs to the appendix at the end of the paper.
2. Average-Case Refinement
We recall here some basic concepts from the literature. Table 2 lists the symbols used for the main concepts throughout the paper.
Table 2.
Definitions and symbols used in this paper.
2.1. Vulnerability, Channels, and Leakage
Quantitative Information Flow studies the problem of quantifying the information leakage of a system (e.g., a program, or an anonymity protocol). A common model in this area is to consider that the user has a secret x from a finite set of possible secrets X, about which the adversary has some probabilistic knowledge π ∈ DX (DX denoting the set of probability distributions over X). A function V from DX to the reals is then employed to measure the vulnerability of our system: V(π) quantifies the adversary’s success in achieving some desired goal, when his knowledge about the secret is π.
Various such functions can be defined (e.g., employing well-known notions of entropy), but it quickly becomes apparent that no single vulnerability function is meaningful for all systems. The family of g-vulnerabilities [6] tries to address this issue by parametrizing V in an operational scenario: first, the adversary is assumed to possess a set of actions W; second, a gain function g models the adversary’s gain g(w, x) when choosing action w and the real secret is x. g-vulnerability can then be defined as the expected gain of an optimal guess: V_g(π) = max over w ∈ W of Σ_x π_x · g(w, x). Different adversaries can be modelled by proper choices of W and g. We denote the set of all gain functions by G.
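A minimal sketch of this definition (our own helper code, assuming the gain function is given as a |W| × |X| numpy array):

```python
import numpy as np

def g_vulnerability(pi, g):
    """V_g(pi) = max over actions w of the expected gain sum_x pi_x * g(w, x)."""
    return np.max(g @ pi)

# Bayes vulnerability (the best one-try guess) is the special case g = identity.
pi = np.array([0.5, 0.3, 0.2])
print(g_vulnerability(pi, np.eye(3)))   # 0.5
```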
A system is then modelled as a channel: a probabilistic mapping from the (finite) set of secrets X to a finite set of observations Y, described by a stochastic matrix C, where C_{x,y} is the probability that secret x produces the observation y. When the adversary observes y, he can transform his initial knowledge π into a posterior knowledge δ^y, given by δ^y_x = π_x · C_{x,y} / p(y), where p(y) = Σ_x π_x · C_{x,y}. Since each observation y is produced with some probability p(y), it is sometimes conceptually useful to consider that the result of running a channel C on the initial knowledge π is a “hyper” distribution [π▷C]: a probability distribution on posteriors δ^y, each having probability p(y).
It is then natural to define the (average-case) posterior vulnerability of the system by applying V to each posterior δ^y, then averaging by its probability of being produced: V[π▷C] = Σ_y p(y) · V(δ^y).
When defining vulnerability in this way, it can be shown [9] that V has to be convex on π; otherwise, fundamental properties (such as the data processing inequality) are violated. Any continuous and convex function V can be written as V_g for a properly chosen g, so, when studying average-case leakage, we can safely restrict ourselves to g-vulnerabilities.
Leakage can be finally defined by comparing the prior and posterior vulnerabilities, e.g., additively, as V[π▷C] − V(π). (Comparing vulnerabilities “multiplicatively” is also possible but is orthogonal to the goals of this paper.)
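The following sketch (ours) puts the pieces together: it computes the average-case posterior g-vulnerability of a channel given as a row-stochastic numpy array.

```python
import numpy as np

def posterior_vulnerability(pi, C, g):
    """Average-case posterior g-vulnerability: sum_y p(y) * V_g(posterior given y)."""
    joint = pi[:, None] * C              # joint[x, y] = pi_x * C[x, y]
    p_y = joint.sum(axis=0)              # probability of each observation
    # For each y, p(y) * V_g(delta^y) = max_w sum_x pi_x C[x,y] g(w,x): the p(y) weights
    # cancel against the normalization of the posteriors.
    return sum(np.max(g @ joint[:, y]) for y in range(C.shape[1]) if p_y[y] > 0)

pi = np.array([0.5, 0.5])
C  = np.array([[0.75, 0.25],
               [0.25, 0.75]])
print(posterior_vulnerability(pi, C, np.eye(2)))   # 0.75; additive leakage is 0.75 - 0.5 = 0.25
```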
2.2. Refinement
A fundamental question arises in the study of leakage: can we guarantee that a system B is no less safe than a system A? Having a family of vulnerability functions, we can naturally define a strong order on channels by explicitly requiring that B leaks no more than A for all priors π and all gain functions g, that is, V_g[π▷B] ≤ V_g[π▷A] for all π and g. (Note that comparing the leakage of A and B is equivalent to comparing their posterior vulnerabilities, so we choose the latter for simplicity. Note also that quantifying over all gain functions is equivalent to quantifying over all continuous and convex vulnerabilities.)
Although this leakage order is intuitive and provides clear leakage guarantees, the explicit quantification over vulnerability functions makes it hard to reason about and verify. Thankfully, this order can be characterized in a “structural” way, that is, as a direct property of the channel matrices. We first define the refinement order on channels by requiring that B can be obtained by post-processing A by some other channel R, that is, B = AR for some channel R.
A fundamental result [6,7] states that the leakage order and the refinement order coincide.
We read the refinement relation as “A is refined by B”, or “B is as safe as A”. When it holds, we have a strong privacy guarantee: we can safely replace A by B without decreasing the privacy of the system, independently from the adversary’s goals and his knowledge. However, refinement can also be useful when it does not hold; namely, we can then conclude that some adversary must exist, modelled by a gain function g, and some initial knowledge π, such that the adversary actually prefers to interact with B rather than with A. (Whether this adversary is of practical interest or not is a different issue, but we know that one exists.) Moreover, we can actually construct such a “counter-example” gain function; this is discussed in Section 5.
3. Max-Case Refinement
Although the average-case refinement order provides a strong and precise way of comparing systems, one could argue that average-case vulnerability might underestimate the threat of a system. More precisely, imagine that there is a certain observation y such that the corresponding posterior δ^y is highly vulnerable (e.g., the adversary can completely infer the real secret), but y happens with very small probability p(y). In this case, the average-case posterior vulnerability can be relatively small, although V(δ^y) is large for that particular y.
If such a scenario is considered problematic, we can naturally quantify leakage using a max-case variant of posterior vulnerability (called “worst-case” in some contexts, although the latter term is more ambiguous, since “worst” can refer to a variety of factors), where all observations are treated equally regardless of their probability of being produced: V_max[π▷C] = max over y of V(δ^y).
Under this definition, it can be shown [9] that V has to be quasi-convex on π (instead of convex) in order to satisfy fundamental properties (such as the data processing inequality). Hence, in the max-case, we no longer restrict ourselves to g-vulnerabilities (which are always convex), but we can use any continuous quasi-convex vulnerability function on DX.
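A sketch of the max-case variant (ours), using the same representation as before; note how a rare but fully revealing observation dominates the max-case value while barely affecting the average case:

```python
import numpy as np

def max_case_posterior_vulnerability(pi, C, V):
    """max over observations y of V(delta^y), ignoring how likely y is."""
    joint = pi[:, None] * C
    p_y = joint.sum(axis=0)
    posteriors = [joint[:, y] / p_y[y] for y in range(C.shape[1]) if p_y[y] > 0]
    return max(V(d) for d in posteriors)

pi = np.array([0.5, 0.5])
C  = np.array([[1.0, 0.0],
               [0.9, 0.1]])   # the second output is rare but reveals the secret completely
# Using Bayes vulnerability (max posterior probability) as the quasi-convex V:
print(max_case_posterior_vulnerability(pi, C, np.max))   # 1.0 (the average-case value is only 0.55)
```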
Inspired by the average-case leakage order, we can now define a corresponding max-case leakage order.
Definition 1.
The max-case leakage order is defined as follows: B leaks no more than A in the max-case iff V_max[π▷B] ≤ V_max[π▷A] for all priors π and all continuous quasi-convex vulnerabilities V.
Similarly to its average-case variant, the max-case leakage order provides clear privacy guarantees by explicitly requiring that B leaks no more than A for all adversaries (modelled as a vulnerability V). However, this explicit quantification makes the order hard to reason about and verify. We would thus like to characterize it by a refinement order that depends only on the structure of the two channels.
Given a channel C from X to Y, we denote by Ĉ the channel obtained by normalizing C’s columns (if a column consists of only zeroes, it is simply removed) and then transposing: Ĉ_{y,x} = C_{x,y} / Σ_{x′} C_{x′,y}.
Note that the row y of Ĉ can be seen as the posterior distribution produced by C under the uniform prior. Note also that Ĉ is non-negative and its rows sum up to 1, so it is a valid channel from Y to X. The average-case refinement order required that B can be obtained by post-processing A. We define the max-case refinement order by requiring that B̂ can be obtained by pre-processing Â.
Definition 2.
The max-case refinement order is defined as follows: A is max-case refined by B iff B̂ = R·Â for some channel R.
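Assuming channels are given as row-stochastic numpy arrays, the following sketch (our own code, not the paper’s library) computes Ĉ and checks Definition 2 as a linear feasibility problem, anticipating the verification method of Section 5.2:

```python
import numpy as np
from scipy.optimize import linprog

def hat(C):
    """Drop all-zero columns, normalize the remaining ones, and transpose:
    row y of hat(C) is the posterior produced by output y under the uniform prior."""
    C = C[:, C.sum(axis=0) > 0]
    return (C / C.sum(axis=0)).T

def max_refines(A, B):
    """Check whether hat(B) = R hat(A) for some channel R (max-case refinement)."""
    Ah, Bh = hat(A), hat(B)               # Ah: |Y_A| x |X|,  Bh: |Y_B| x |X|
    m, n = Ah.shape
    k = Bh.shape[0]
    # variables: the k x m matrix R, flattened row-major
    A_eq = np.vstack([np.kron(np.eye(k), Ah.T),               # R Ah = Bh
                      np.kron(np.eye(k), np.ones((1, m)))])   # rows of R sum to 1
    b_eq = np.concatenate([Bh.reshape(-1), np.ones(k)])
    res = linprog(np.zeros(k * m), A_eq=A_eq, b_eq=b_eq,
                  bounds=(0, None), method="highs")
    return res.status == 0   # feasible iff the max-case refinement holds
```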
Our goal now is to show that the max-case leakage order and the max-case refinement order are different characterizations of the same relation. To do so, we start by giving a “semantic” characterization of the latter, that is, expressing it not in terms of the channel matrices A and B, but in terms of the posterior distributions that they produce. Thinking of [π▷C] as a (“hyper”) distribution on the posteriors produced by π and C, its support supp[π▷C] is the set of all posteriors produced with non-zero probability. We also denote by ch S the convex hull of a set S.
Theorem 1.
Let π ∈ DX. If A is max-case refined by B, then the posteriors of B (under π) are convex combinations of those of A, that is,
supp[π▷B] ⊆ ch supp[π▷A].   (A1)
Moreover, if (A1) holds and π is full support, then A is max-case refined by B.
Note that, if (A1) holds for some full-support prior, then it must hold for all priors.
Theorem A1 has a nice geometric intuition (cf. Figure 2) that we are going to illustrate in the following example.
Figure 2.
The simplex and the convex hulls of the posterior distributions in Example 5.
Example 5.
Consider the following systems.
Consider the prior . The set of the posterior distributions generated by A under π are:
while those generated by B are:
These posteriors, and the convex hulls that they generate, are illustrated in Figure 2. The pink area is ch supp[π▷A] and the purple area is ch supp[π▷B]. We can see that supp[π▷B] ⊆ ch supp[π▷A] or, equivalently, ch supp[π▷B] ⊆ ch supp[π▷A].
We are now ready to give the main result of this section.
Theorem 2.
The max-case leakage order and the max-case refinement order coincide.
Similarly to the average case, max-case refinement gives us a strong leakage guarantee: we can safely replace A by B, knowing that, for any adversary, the max-case leakage of B can be no larger than that of A. Moreover, in case the refinement does not hold, we can always find an adversary, modelled by a vulnerability function V, who prefers (wrt the max-case) interacting with B rather than with A. Such a function is discussed in Section 5.
Finally, we resolve the question of how the average-case and the max-case refinement orders are related.
Theorem 3.
The average-case refinement order is strictly stronger than the max-case one.
This result might appear counter-intuitive at first; one might expect the max-case order to imply the average-case one. To understand why it does not, note that the former only requires that, for each output of B, there exists some output of A whose posterior is at least as vulnerable, regardless of how likely these outputs are to happen (this is max-case, after all). We illustrate this with the following example.
Example 6.
Consider the following systems:
Under the uniform prior, the posteriors produced by the two channels are the same. Thus, the knowledge that can be obtained from each output of B can also be obtained from some output of A (albeit with a different probability). Hence, from Theorem A1, we get that A is max-case refined by B. However, we can check (see Section 5.1) that B cannot be obtained by post-processing A, that is, the average-case refinement does not hold.
The other direction might also appear tricky: if B leaks no more than A in the average case, it must also leak no more than A in the max case. The quantification over all gain functions in the average case is powerful enough to “detect” differences in max-case leakage. The above result also means that the average-case refinement could be useful even if we are interested in the max case, since it gives us the max-case refinement for free.
4. Privacy-Based Refinement
Thus far, we have compared systems based on their (average-case or max-case) leakage. In this section, we turn our attention to the model of differential privacy and discuss new ways of ordering mechanisms based on that model.
4.1. Differential Privacy and d-Privacy
Differential privacy relies on the observation that some pairs of secrets need to be indistinguishable from the point of view of the adversary in order to provide some meaningful notion of privacy; for instance, databases differing in a single individual should not be distinguishable, otherwise the privacy of that individual is violated. At the same time, other pairs of secrets can be allowed to be distinguishable in order to provide some utility; for instance, distinguishing databases differing in many individuals allows us to answer a statistical query about those individuals.
This idea can be formalized by a distinguishability metric d on X. (To be precise, d is an extended pseudo-metric, namely one in which distinct secrets can have distance 0, and distance ∞ is allowed.) Intuitively, d(x, x′) models how distinguishable we allow these secrets to be. A value 0 means that we require x and x′ to be completely indistinguishable to the adversary, while ∞ means that she can distinguish them completely.
In this context, a mechanism is simply a channel (the two terms will be used interchangeably), mapping secrets X to some observations Y. Denote by M the set of all such metrics on X. Given d ∈ M, we define d-privacy as follows:
Definition 3.
A channel C satisfies d-privacy iff C_{x,y} ≤ e^{d(x,x′)} · C_{x′,y} for all x, x′ ∈ X and y ∈ Y.
Intuitively, this definition requires that the closer x and x′ are (as measured by d), the more similar (probabilistically) the outputs of the mechanism on these secrets should be.
Remark 1.
Note that the definition of d-privacy given in (1) is slightly different from the above one because of the presence of ε in the exponent. Indeed, it is common to scale d by a privacy parameter ε, in which case d can be thought of as the “kind” and ε as the “amount” of privacy. In other words, the structure determined by d on the data specifies how we want to distinguish each pair of data, and ε specifies (uniformly) the degree of the distinction. Note that εd is itself a metric, so the two definitions are equivalent.
Using a generic metric in this definition allows us to express different scenarios, depending on the domain on which the mechanism is applied and the choice of d. For instance, in the standard model of differential privacy, the mechanism is applied to a database x (i.e., X is the set of all databases), and produces some observation y (e.g., a number). The Hamming metric, defined as the number of individuals in which x and x′ differ, captures standard differential privacy.
4.2. Oblivious Mechanisms
In the case of an oblivious mechanism, a query f from X to Y is first applied to the database x, and a noise mechanism H from Y to Z is applied to y = f(x), producing an observation z. In this case, it is useful to study the privacy of H wrt some metric d_Y on Y. Then, to reason about the d_X-privacy of the whole mechanism H ∘ f, we can first compute the sensitivity of f wrt d_X and d_Y:
Δf = max over x ≠ x′ of d_Y(f(x), f(x′)) / d_X(x, x′),
and then use the following property [3]: if H satisfies d_Y-privacy, then H ∘ f satisfies Δf·d_X-privacy.
For instance, the geometric mechanism satisfies ε·d_2-privacy (where d_2 denotes the Euclidean metric on the integers); hence, it can be applied to any numeric query f: the resulting mechanism satisfies ε·Δf-differential privacy. The sensitivity wrt the Hamming and Euclidean metrics reduces to the maximum of |f(x) − f(x′)| over adjacent databases x, x′ (i.e., databases at Hamming distance 1).
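As an illustration (our own toy code; the query, metrics and datasets below are made-up examples), the sensitivity over an explicit collection of inputs can be computed by maximizing the ratio of output distance to input distance:

```python
def sensitivity(f, inputs, d_in, d_out):
    """Delta_f = max over x != x' of d_out(f(x), f(x')) / d_in(x, x')."""
    best = 0.0
    for i, x in enumerate(inputs):
        for x2 in inputs[i + 1:]:
            if d_in(x, x2) > 0:
                best = max(best, d_out(f(x), f(x2)) / d_in(x, x2))
    return best

# Toy example: datasets are tuples of ages, input metric = Hamming distance,
# output metric = absolute difference, query = rounded average age.
hamming = lambda x, x2: sum(a != b for a, b in zip(x, x2))
euclid  = lambda y, y2: abs(y - y2)
avg_age = lambda x: round(sum(x) / len(x))

datasets = [(30, 40, 50), (30, 40, 90), (30, 70, 50)]
# 13.0 on this small collection (the true sensitivity quantifies over all pairs of datasets):
print(sensitivity(avg_age, datasets, hamming, euclid))
```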
4.3. Applying Noise to the Data of a Single Individual
There are also scenarios in which a mechanism C is applied directly to the data of a single individual (that is, X is the set of possible values). For instance, in the local model of differential privacy [2], the value of each individual is obfuscated before being sent to an untrusted curator. In this case, C should satisfy ε·d_D-privacy, where d_D is the discrete metric, since any change in the individual’s value should have negligible effects.
Moreover, in the context of location-based services, a user might want to obfuscate his location before sending it to the service provider. In this context, it is natural to require that locations that are geographically close are indistinguishable, while far away ones are allowed to be distinguished (in order to provide the service). In other words, we wish to provide ε·d_2-privacy, for the Euclidean metric d_2 on the plane, called geo-indistinguishability in [17].
4.4. Comparing Mechanisms by Their “Smallest ε” (For Fixed d)
Scaling d by a privacy parameter ε allows us to turn d-privacy (for some fixed d) into a quantitative “leakage” measure, by associating each channel with the smallest ε by which we can scale d without violating privacy.
Definition 4.
The privacy-based leakage (wrt d) of a channel C is defined as the infimum of all ε ≥ 0 such that C satisfies εd-privacy.
Note that the privacy-based leakage is ∞ iff there is no such ε; also, it is at most ε iff C satisfies εd-privacy.
It is then natural to compare two mechanisms A and B based on their “smallest ε”.
Definition 5.
Define the ε-based order (wrt d) by: B leaks no more than A iff the privacy-based leakage (wrt d) of B is at most that of A.
For instance, taking d to be the Hamming metric, this means that B satisfies standard differential privacy for an ε at least as small as the one of A.
4.5. Privacy-Based Leakage and Refinement Orders
When discussing the average- and max-case leakage orders, we obtained strong leakage guarantees by quantifying over all vulnerability functions. It is thus natural to investigate a similar quantification in the context of d-privacy. Namely, we define a stronger privacy-based “leakage” order by comparing mechanisms not on a single metric d, but on all metrics simultaneously.
Definition 6.
The privacy-based leakage order is defined as follows: B leaks no more than A iff, for every metric d ∈ M, the privacy-based leakage (wrt d) of B is at most that of A.
Similarly to the other leakage orders, the drawback of this order is that it quantifies over an uncountable family of metrics. As a consequence, our first goal is to characterize it as a property of the channel matrices alone, which would make it much easier to reason about or verify.
To do so, we start by recalling an alternative way of thinking about d-privacy. Consider the multiplicative total variation distance between probability distributions μ, μ′ ∈ DY, defined as:
tv⊗(μ, μ′) = max over y of | ln μ(y) − ln μ′(y) |.
If we think of C as a function from X to DY (mapping every x to the distribution C(x)), C satisfies d-privacy iff tv⊗(C(x), C(x′)) ≤ d(x, x′) for all x, x′, in other words iff C is non-expansive (1-Lipschitz) wrt d and tv⊗.
Then, we introduce the concept of the distinguishability metric induced by the channel C, defined as
d_C(x, x′) = tv⊗(C(x), C(x′)).
Intuitively, d_C(x, x′) expresses exactly how much the channel distinguishes (wrt tv⊗) the secrets x and x′. It is easy to see that d_C is the smallest metric for which C is private; in other words, for any d ∈ M: C satisfies d-privacy iff d_C ≤ d (pointwise).
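A sketch of the induced metric (our own code; entries equal to zero give infinite distances, which numpy handles directly). Comparing induced metrics pointwise anticipates the refinement order defined next and its verification in Section 5.3.

```python
import numpy as np

def induced_metric(C):
    """d_C(x, x') = max over y of |ln C[x, y] - ln C[x', y]| (can be infinite)."""
    with np.errstate(divide="ignore", invalid="ignore"):
        logC = np.log(C)
        n = C.shape[0]
        d = np.zeros((n, n))
        for x1 in range(n):
            for x2 in range(n):
                diff = np.abs(logC[x1] - logC[x2])
                diff[np.isnan(diff)] = 0.0   # outputs impossible for both secrets are ignored
                d[x1, x2] = diff.max()
        return d

def satisfies(C, d):
    """C satisfies d-privacy iff its induced metric is below d pointwise."""
    return bool(np.all(induced_metric(C) <= d + 1e-9))

# B is a privacy-based refinement of A iff:
#     np.all(induced_metric(B) <= induced_metric(A) + 1e-9)
```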
We can now give a refinement order on mechanisms, by comparing their corresponding induced metrics.
Definition 7.
The privacy-based refinement order is defined as follows: A is privacy-refined by B iff d_B ≤ d_A (pointwise), or equivalently iff B satisfies d_A-privacy.
This achieves our goal of characterizing the privacy-based leakage order in a structural way.
Proposition 1.
The privacy-based leakage order and the privacy-based refinement order coincide.
We now turn our attention to the question of how these orders relate to each other.
Theorem 4.
The max-case refinement order is strictly stronger than the privacy-based one, which in turn is strictly stronger than the ε-based order of Definition 5 (for any fixed metric).
The fact that the max-case order is stronger than the privacy-based one is due to the fact that d-privacy can be seen as a max-case information leakage, for a properly constructed vulnerability function. This is discussed in detail in Section 4.7. This implication means that the max-case order can be useful even if we “only” care about d-privacy.
4.6. Application to Oblivious Mechanisms
We conclude the discussion on privacy-based refinement by showing the usefulness of our strong order in the case of oblivious mechanisms.
Theorem 5.
Let f from X to Y be any query and A, B be two mechanisms on Y. If A is privacy-refined by B, then A ∘ f is privacy-refined by B ∘ f.
This means that replacing A by B in the context of an oblivious mechanism is always safe, regardless of the query (and its sensitivity) and regardless of the metric by which the privacy of the composed mechanism is evaluated.
Assume, for instance, that we care about standard differential privacy, and we have properly constructed A such that A ∘ f satisfies ε-differential privacy for some ε. If we know that A is privacy-refined by B (several such cases are discussed in Section 6), we can replace A by B without even knowing what f does. The mechanism B ∘ f is guaranteed to also satisfy ε-differential privacy.
Note also that the above theorem fails for the weaker ε-based order. Establishing that B has smaller privacy-based leakage than A wrt some metric d gives no guarantees wrt some other metric of interest d′. It is possible that replacing A by B in that case is not safe (one would need to re-examine the behavior of B, and possibly reconfigure it to the sensitivity of f).
Table 3 summarizes the relations between the various orderings.
Table 3.
Comparison of leakage and refinement orders. All implications are strict.
4.7. Privacy as Max-Case Capacity
In this section, we show that d-privacy can be expressed as a (max-case) information leakage. Note that this provides an alternative proof that the max-case refinement order is stronger than the privacy-based one. We start by defining a suitable vulnerability function:
Definition 8.
The d-vulnerability function is defined as
Note the difference between the d-vulnerability (a vulnerability function on distributions) and the privacy-based leakage of Definition 4 (a “leakage” measure on channels).
A fundamental notion in QIF is that of capacity: the maximization of leakage over all priors. It turns out that, for the d-vulnerability, the capacity-realizing prior is the uniform one. In the following, the additive max-case leakage of a channel C under a prior π is the difference between the max-case posterior vulnerability and the prior vulnerability, namely V_max[π▷C] − V(π),
and the additive max-case d-capacity is the maximum of this leakage, for the d-vulnerability, over all priors π.
Theorem 6.
The additive max-case d-capacity is always achieved on the uniform prior.
This finally brings us to our goal of expressing d-privacy in terms of information leakage (for a proper vulnerability function).
Theorem 7.
[DP as max-case capacity] C satisfies d-privacy iff its additive max-case d-capacity is 0. In other words, d-privacy holds exactly when, for every prior, the adversary’s max-case d-vulnerability does not increase by observing the output of C.
5. Verifying the Refinement Orders
We now turn our attention to the problem of checking whether the various orders hold, given two explicit representations of channels A and B (in terms of their matrices). We show that, for all orders, this question can be answered in time polynomial in the size of the matrices. Moreover, when one of the orders fails, we discuss how to obtain a counter-example (e.g., a gain function g or a vulnerability function V) demonstrating this fact. All the methods discussed in this section have been implemented in a publicly available library, and have been used in the experimental results of Section 6.
5.1. Average-Case Refinement
Verifying the average-case refinement can be done in polynomial time (in the size of A and B) by solving the system of equations B = AR, with variables R, under the linear constraints that R is a channel matrix (non-negative, and rows summing up to 1). However, if the system has no solution (i.e., the refinement does not hold), this method does not provide us with a counter-example gain function g.
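A minimal sketch of this feasibility check (ours, using scipy’s linear-programming routine; channels are assumed to be row-stochastic numpy arrays):

```python
import numpy as np
from scipy.optimize import linprog

def avg_refines(A, B):
    """Check whether B = A R for some channel R (average-case refinement)."""
    n, m = A.shape            # A : X -> Y_A
    n2, k = B.shape           # B : X -> Y_B
    assert n == n2, "channels must share the same input space"
    # variables: the m x k matrix R, flattened row-major
    A_eq = np.vstack([np.kron(A, np.eye(k)),                 # A R = B
                      np.kron(np.eye(m), np.ones((1, k)))])  # rows of R sum to 1
    b_eq = np.concatenate([B.reshape(-1), np.ones(m)])
    res = linprog(np.zeros(m * k), A_eq=A_eq, b_eq=b_eq,
                  bounds=(0, None), method="highs")
    return res.status == 0    # feasible iff the refinement holds
```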
We now show that there is an alternative efficient method: consider the set of all channels obtainable by post-processing A. The idea is to compute the projection of B on this set. Clearly, the projection is B itself iff the refinement holds; otherwise, the projection can be directly used to construct a counter-example g.
Theorem 8.
Let B′ be the projection of B on the set of channels obtainable by post-processing A.
- 1.
- If B′ = B, then A is refined by B.
- 2.
- Otherwise, the difference between B and B′ can be turned into a gain function g that provides a counter-example to the refinement: under g and the uniform prior, the posterior g-vulnerability of B is strictly greater than that of A.
Since the set of channels obtainable by post-processing A is convex, the projection of B onto it amounts to minimizing the squared Euclidean distance to B over that set. This is a quadratic program with Q being the identity matrix, which is positive definite; hence, it can be solved in polynomial time.
Note that the proof that the leakage order implies the structural refinement (the “coriaceous” theorem of [7]) uses the hyperplane-separation theorem to show the existence of a counter-example g when the refinement fails. The above technique essentially computes such a separating hyperplane.
5.2. Max-Case Refinement
Similarly to the average case, we can verify the max-case refinement directly using its definition, by solving the system B̂ = R·Â under the constraint that R is a channel.
In contrast to the average case, when the max-case refinement does not hold, the proof of Theorem 2 directly gives us a counter-example: a (quasi-convex) vulnerability function measuring whether a posterior lies outside ch supp[π▷A], where π is any full-support prior. For this vulnerability function, the max-case posterior vulnerability of B is strictly greater than that of A.
5.3. Privacy-Based Refinement
The privacy-based refinement can be verified directly from its definition, by checking that d_B(x, x′) ≤ d_A(x, x′) for every pair of secrets. This can be done in time O(|X|²·|Y|), by computing the induced metrics d_A and d_B for each pair of secrets. If the check fails for some pair, then d_A provides an immediate counter-example metric, since A satisfies d_A-privacy, but B does not.
6. Application: Comparing DP Mechanisms
In differential privacy, it is common to compare the privacy guarantees provided by different mechanisms by ‘comparing the epsilons’. However, it is interesting to ask to what extent ε-equivalent mechanisms are comparable wrt the other leakage measures defined here, or we might want to know whether reducing ε in a mechanism also corresponds to a refinement of it. This could be useful if, for example, it is important to understand the privacy properties of a mechanism with respect to any max-case leakage measure, and not just the DP measure given by ε.
Since the ε-based order of Definition 5 is (strictly) the weakest of the orders considered here, it cannot be the case that we always get a refinement (wrt the other orders). However, it may be true that, for particular families of mechanisms, some (or all) of the refinement orders hold.
We investigate three families of mechanisms commonly used in DP or LDP: geometric, exponential and randomized response mechanisms.
6.1. Preliminaries
We define each family of mechanisms in terms of their channel construction. We assume that mechanisms operate on a set of inputs X and produce a set of outputs Y. In this sense, our mechanisms can be seen as oblivious (as in standard DP) or as LDP mechanisms. (We use the term ‘mechanism’ in either sense.) We denote a mechanism parametrized by ε by subscripting it with ε. For the purposes of comparison, we make sure that we use the best possible ε for each mechanism. In order to compare mechanisms, we restrict our input and output domains of interest to sequences of non-negative integers, and we assume that X and Y are finite unless otherwise specified. In addition, as we are operating in the framework of d-privacy, it is necessary to provide an appropriate metric defined over X; here, it makes sense to use the Euclidean distance metric d_2.
Definition 9.
A geometric mechanism is a channel G_ε from X to Y, with Y the set of all integers, parametrized by ε > 0 and constructed as follows:
G_ε[x][y] = c · α^{|x−y|},
where α = e^{−ε} and c = (1 − α)/(1 + α). Such a mechanism satisfies ε·d_2-privacy.
In practice, the truncated geometric mechanism is preferred to the infinite geometric. We define the truncated geometric mechanism as follows.
Definition 10.
A truncated geometric mechanism is a channel TG_ε from X to Y, with X = Y = {0, 1, ..., n}, parametrized by ε > 0 and constructed as follows:
TG_ε[x][y] = c · α^{|x−y|} for 0 < y < n,   TG_ε[x][0] = α^x / (1 + α),   TG_ε[x][n] = α^{n−x} / (1 + α),
where α = e^{−ε} and c = (1 − α)/(1 + α); that is, the probability mass that the geometric mechanism places outside the range is accumulated on the two extreme outputs. Such a mechanism satisfies ε·d_2-privacy.
It is also possible to define the ‘over-truncated’ geometric mechanism whose input space is not entirely included in the output space.
Definition 11.
An over-truncated geometric mechanism is a channel from X to Y, with Y strictly included in X, parametrized by ε > 0 and constructed as follows:
- 1.
- Start with the truncated geometric mechanism .
- 2.
- Sum up the columns at each end until the output domain is reached.
Such a mechanism satisfies ε·d_2-privacy.
For example, the set of inputs to an over-truncated geometric mechanism could be the integers in some range, but the output space may be restricted to a smaller range at one or both ends. In either of these cases, the mechanism has to ‘over-truncate’ the inputs to accommodate the output space.
We remark that we do not consider the over-truncated mechanism a particularly useful mechanism in practice. However, we provide results on this mechanism for completeness since its construction is possible, if unusual.
Definition 12.
An exponential mechanism is a channel E_ε from X to Y, parametrized by ε > 0 and constructed as follows:
E_ε[x][y] = λ_x · e^{−ε·|x−y|/2},
where the λ_x are normalizing constants ensuring that each row sums up to 1. Such a mechanism satisfies ε′·d_2-privacy for some ε′ ≤ ε (which can be calculated exactly from the channel construction).
Note that the construction presented in Definition 12 uses the Euclidean distance metric, since we only consider integer domains. The general construction of the exponential mechanism uses arbitrary domains and arbitrary metrics. Note that its parameter does not correspond to the true (best) ε·d_2-privacy guarantee that it provides. In the following, we will denote the exponential mechanism by its ‘true’ privacy parameter rather than the reported one, as our intention is to capture the privacy guarantee provided by the channel in order to make reasonable comparisons.
Definition 13.
A randomized response mechanism is a channel R_ε from X to Y, parametrized by ε > 0 and constructed as follows:
R_ε[x][y] = e^ε / (e^ε + |Y| − 1) if y = x, and 1 / (e^ε + |Y| − 1) otherwise,
where d_D is the discrete metric (that is, d_D(x, x′) = 0 if x = x′ and 1 otherwise). Such a mechanism satisfies ε·d_D-privacy.
We note that the randomized response mechanism also satisfies ε·d_2-privacy (since, on distinct integers, the discrete metric is dominated by the Euclidean one).
Intuitively, the randomized response mechanism returns the true answer with high probability and all other responses with equal probability. In the case where the input x lies outside Y (that is, in ‘over-truncated’ mechanisms), all of the outputs (corresponding to the outlying inputs) have equal probability.
Example 7.
The following are examples of each of the mechanisms described above, represented as channel matrices. For this example, we fix a value of ε for the geometric and randomized response mechanisms, while, for the exponential mechanism, we use a different (larger) value for its specified parameter.
Note that the exponential mechanism here actually satisfies d_2-privacy for a smaller ε than the parameter by which it is specified.
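For concreteness, the following sketch (our own code) constructs truncated geometric and randomized response channels of this kind; the domain size and the value of ε below are illustrative and not necessarily those used in the example above. The exponential mechanism is analogous, with a per-row normalization.

```python
import numpy as np

def truncated_geometric(n, eps):
    """Truncated geometric mechanism on {0,...,n-1} (satisfying eps*d_2-privacy)."""
    alpha = np.exp(-eps)
    c = (1 - alpha) / (1 + alpha)
    C = np.zeros((n, n))
    for x in range(n):
        row = c * alpha ** np.abs(x - np.arange(n))
        row[0]  = alpha ** x / (1 + alpha)            # mass of all outputs <= 0
        row[-1] = alpha ** (n - 1 - x) / (1 + alpha)  # mass of all outputs >= n-1
        C[x] = row
    return C

def randomized_response(n, eps):
    """Randomized response on {0,...,n-1} (satisfying eps-LDP, i.e., eps*d_D-privacy)."""
    p = np.exp(eps) / (np.exp(eps) + n - 1)
    C = np.full((n, n), (1 - p) / (n - 1))
    np.fill_diagonal(C, p)
    return C

eps = np.log(2)
TG, RR = truncated_geometric(5, eps), randomized_response(5, eps)
print(TG.sum(axis=1), RR.sum(axis=1))   # both are row-stochastic
```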
We now have three families of mechanisms which we can characterize by channels, and which satisfy ε·d-privacy for appropriate metrics d. For the remainder of this section, we will refer only to the parameter ε and take the metric as given, as we wish to understand the effect of changing ε (for a fixed metric) on the various leakage measures.
6.2. Refinement Order within Families of Mechanisms
We first ask which refinement orders hold within a family of mechanisms. That is, when does reducing ε for a particular mechanism produce a refinement? Since the average-case refinement order implies all the others, it is useful to first check whether it holds, as we then get the other refinements ‘for free’.
For the (infinite) geometric mechanism, we have the following result.
Theorem 9.
Let G_ε and G_{ε′} be geometric mechanisms. Then, G_ε is refined by G_{ε′} iff ε′ ≤ ε. That is, decreasing ε produces a refinement of the mechanism.
This means that reducing ε in an infinite geometric mechanism is safe against any adversary that can be modelled using, for example, max-case or average-case vulnerabilities.
For the truncated geometric mechanism, we get the same result.
Theorem 10.
Let TG_ε and TG_{ε′} be truncated geometric mechanisms. Then, TG_ε is refined by TG_{ε′} iff ε′ ≤ ε. That is, decreasing ε produces a refinement of the mechanism.
However, the over-truncated geometric mechanism does not behave so well.
Theorem 11.
Let OTG_ε and OTG_{ε′} be over-truncated geometric mechanisms. Then, OTG_ε is not (average-case) refined by OTG_{ε′} for any ε′ < ε. That is, decreasing ε does not produce a refinement.
We can think of this last class of geometrics as ‘skinny’ mechanisms, that is, corresponding to a channel with a smaller output space than input space.
Intuitively, this theorem means that we can always find some (average-case) adversary who prefers the over-truncated geometric mechanism with the smaller ε.
We remark that the gain function we found can be easily calculated by treating the columns of channel A as vectors, and finding a vector orthogonal to both of these. This follows from the results in Section 5.1. Since the columns of A cannot span the space of all real vectors over the secrets, it is always possible to find such a vector, and, when this vector is not orthogonal to the ‘column space’ of B, it can be used to construct a gain function preferring B to A.
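A sketch of this linear-algebra step (ours, using scipy’s null_space); turning the returned direction into an actual gain function requires the shift-and-scale construction discussed in Section 5.1, which is not repeated here.

```python
import numpy as np
from scipy.linalg import null_space

def counterexample_direction(A, B, tol=1e-9):
    """Find a vector orthogonal to every column of A but not to every column of B.
    Such a vector can be turned into a gain function under which B leaks more than A."""
    for v in null_space(A.T).T:            # basis of the space orthogonal to A's columns
        if np.max(np.abs(B.T @ v)) > tol:  # v "sees" some column of B
            return v
    return None
```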
Even though the average-case refinement does not hold, we can check whether the other refinements are satisfied.
Theorem 12.
Let OTG_ε be an over-truncated geometric mechanism. Then, reducing ε does not produce a max-case refinement; however, it does produce a privacy-based refinement.
This means that, although a smaller ε does not provide safety against all max-case adversaries, it does produce a safer mechanism wrt d-privacy for any choice of metric we like.
Intuitively, the privacy-based order relates mechanisms based on how they distinguish inputs. Specifically, if A is privacy-refined by B, then, for any pair of inputs, the corresponding output distributions are ‘further apart’ in channel A than in channel B, and thus the inputs are more distinguishable using channel A. When the order fails to hold in both directions, it means that there are some inputs which are more distinguishable in A than in B, and vice versa. This means an adversary who is interested in distinguishing some particular pair of inputs would prefer one mechanism to the other.
We now consider the exponential mechanism. In this case, we do not have a theoretical result, but, experimentally, it appears that the exponential mechanism respects refinement, so we present the following conjecture.
Conjecture 1.
Let E_ε be an exponential mechanism. Then, decreasing ε in E_ε produces a refinement. That is, E_ε is refined by E_{ε′} iff ε′ ≤ ε.
Finally, we consider the randomized response mechanism.
Theorem 13.
Let R_ε be a randomized response mechanism. Then, decreasing ε in R_ε produces a refinement. That is, R_ε is refined by R_{ε′} iff ε′ ≤ ε.
In conclusion, we can say that, in general, the usual DP families of mechanisms are ‘well-behaved’ wrt all of the refinement orders. This means that it is safe (wrt any adversary we model here) to replace a mechanism from a particular family with another mechanism from the same family with a lower ε.
6.3. Refinement Order between Families of Mechanisms
Now, we explore whether it is possible to compare mechanisms from different families. We first ask: can we compare mechanisms which have the same ε? We assume that the input and output domains are the same, and the intention is to decide whether to replace one mechanism with another.
Theorem 14.
Let R be a randomized response mechanism, E an exponential mechanism and TG a truncated geometric mechanism, all with the same (true) ε. Then, TG is privacy-refined by R and TG is privacy-refined by E. However, the privacy-based refinement does not hold, in either direction, between E and R.
Proof.
We present a counter-example showing that the privacy-based refinement fails, in both directions, between E and R. The remainder of the proof is in Appendix D.
Consider the following channels:
Channel A represents an exponential mechanism and channel B a randomized response mechanism. Both have the same (true) ε. (Channel A was generated using a larger specified parameter; however, as noted earlier, this corresponds to a lower true ε.) However, the induced metrics d_A and d_B are incomparable: each exceeds the other on some pair of inputs. Thus, A does not satisfy d_B-privacy, nor does B satisfy d_A-privacy. □
Intuitively, the randomized response mechanism maintains the same distinguishability level between every pair of inputs, whereas the exponential mechanism causes some inputs to be less distinguishable than others. This means that, for the same (true) ε, an adversary who is interested in certain inputs could learn more from the randomized response than from the exponential. In the above counter-example, some pairs of points in the exponential mechanism of channel A are less distinguishable than the corresponding pairs in the randomized response mechanism B.
As an example, let’s say the mechanisms are to be used for geo-location privacy and the inputs represent adjacent locations (such as addresses along a street). Then, an adversary (your boss) may be interested in how far you are from work, and therefore wants to be able to distinguish between points distant from your office and points within its vicinity, without requiring your precise location. Your boss chooses channel A as the most informative. However, another adversary (your suspicious partner) is more concerned about where exactly you are, and is particularly interested in distinguishing between your expected position (say, the boulangerie) and your suspected position (say, the brothel). Your partner chooses channel B as the most informative.
Regarding the other refinements, we find (experimentally) that in general they do not hold between families of mechanisms. (Recall that we only need to produce a single counter-example to show that a refinement doesn’t hold, and this can be done using the methods presented in Section 5.)
We next check what happens when we compare mechanisms with different epsilons. We note the following.
Theorem 15.
For any (truncated geometric, randomized response, exponential) mechanisms A_ε and B_{ε′}, if A_ε is refined by B_{ε′} under any of our refinement orders (average-case, max-case or privacy-based), then A_ε is also refined by B_{ε″} under the same order, for every ε″ ≤ ε′.
Proof.
This follows directly from the transitivity of the refinement relations and from our results on refinement within families of mechanisms. (We recall, however, that our result for the exponential mechanism is only a conjecture.) □
This tells us that, once we have a refinement between two mechanisms, it continues to hold when ε is reduced in the refining mechanism.
Corollary 1.
Let G, TG, R and E be the geometric, truncated geometric, randomized response and exponential mechanisms, respectively. Then, for all ε′ ≤ ε, we have that G_ε and TG_ε are privacy-refined by both R_{ε′} and E_{ε′}.
Thus, it is safe to ‘compare epsilons’ wrt the privacy-based order if we want to replace a geometric mechanism with either a randomized response or an exponential mechanism. (As with the previous theorem, note that the results for the exponential mechanism are stated as a conjecture only, and this conjecture is assumed in the statement of this corollary.) What this means is that if, for example, we have a geometric mechanism that operates on databases with distance measured using the Hamming metric and satisfying ε-differential privacy, then any randomized response mechanism R parametrized by an ε′ ≤ ε will also satisfy ε-differential privacy. Moreover, if we decide we’d rather use the Manhattan metric to measure the distance between databases, then we only need to check that the geometric mechanism also satisfies the corresponding privacy property wrt that metric, as this implies that R will too.
The following Table 4, Table 5 and Table 6 summarize the refinement relations with respect to the various families of mechanisms.
Table 4.
The refinements respected by families of mechanisms for decreasing ε. We recall that the results in the grey cells are based on Conjecture 1.
Table 5.
Comparing different families of mechanisms with respect to the different refinements under the same ε.
Table 6.
Comparing different families of mechanisms with differing ε. We recall that the results in the grey cells are based on Conjecture 1.
6.4. Asymptotic Behavior
We now consider the behavior of the relations when ε approaches 0, which represents the absence of leakage. We start with the following result:
Theorem 16.
Every (truncated geometric, randomized response, exponential) mechanism is ‘the safest possible mechanism’ when parametrized by ε = 0. That is, every mechanism (possibly from a different family) is refined by every mechanism parametrized by ε = 0.
While this result may be unsurprising, it means that we know that refinement must eventually occur when we reduce ε. It is interesting then to ask just when this refinement occurs. We examine this question experimentally by considering different mechanisms and investigating for which values of ε average-case refinement holds. For simplicity of presentation, we show results for matrices of a fixed dimension, noting that we observed similar results for experiments across different matrix dimensions, at least w.r.t. the coarse-grained comparison of plots that we do here. The results are plotted in Figure 3.

Figure 3.
Refinement of mechanisms under the average-case order for the channels considered. The x-axis represents the ε on the LHS of the relation, and the y-axis represents the ε on the RHS. The top graph represents refinement of the truncated geometric mechanism, the middle graph refinement of the randomized response mechanism, and the bottom graph refinement of the exponential mechanism.
The plots show the relationship between ε1 (x-axis) and ε2 (y-axis), where ε1 parametrizes the mechanism being refined and ε2 parametrizes the refining mechanism. For example, the blue line on the top graph represents refinement of the truncated geometric mechanism by one of the mechanisms: we fix ε1 and ask for what value of ε2 we get a refinement. Notice that the line corresponding to the same mechanism on both axes is the diagonal ε2 = ε1 (since every mechanism refines itself).
We can see that refining the randomized response mechanism requires much smaller values of epsilon in the other mechanisms. For example, the middle graph shows that the other mechanisms need a considerably smaller ε2 to refine a randomized response mechanism with a given ε1 than the top graph shows they need to refine a truncated geometric mechanism with the same ε1. This means that the randomized response mechanism is very ‘safe’ against average-case adversaries compared with the other mechanisms, as it is much more ‘difficult’ to refine than the other mechanisms.
We also notice that, for ‘large’ values of ε, the exponential and geometric mechanisms refine each other for approximately the same values. This suggests that, for these values, the epsilons are comparable (that is, the mechanisms are equally ‘safe’ for similar values of ε). However, smaller values of ε require a (relatively) large reduction in ε to obtain a refinement.
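The experiments behind these plots amount to repeatedly checking average-case refinement, which can be cast as a linear feasibility problem: A is refined by B iff B = AR for some channel R. The sketch below is our own illustration of that check using scipy; the matrices at the end are placeholders, not the mechanisms of the paper.

```python
import numpy as np
from scipy.optimize import linprog

def avg_refines(A, B):
    """Check average-case refinement: does there exist a channel R with A @ R == B?"""
    nA, mA = A.shape
    nB, mB = B.shape
    assert nA == nB, "A and B must have the same input domain"
    n_vars = mA * mB                       # variables: R flattened, shape (mA, mB)
    A_eq, b_eq = [], []
    # Constraints A @ R == B, entry by entry.
    for x in range(nA):
        for j in range(mB):
            row = np.zeros(n_vars)
            for k in range(mA):
                row[k * mB + j] = A[x, k]
            A_eq.append(row)
            b_eq.append(B[x, j])
    # Each row of R sums to 1.
    for k in range(mA):
        row = np.zeros(n_vars)
        row[k * mB:(k + 1) * mB] = 1.0
        A_eq.append(row)
        b_eq.append(1.0)
    res = linprog(np.zeros(n_vars), A_eq=np.array(A_eq), b_eq=np.array(b_eq),
                  bounds=[(0, None)] * n_vars, method="highs")
    return res.status == 0                 # feasible => refinement holds

# Toy example: B obtained from A by post-processing with a known channel R0.
A = np.array([[0.7, 0.2, 0.1],
              [0.1, 0.6, 0.3],
              [0.2, 0.2, 0.6]])
R0 = np.array([[0.9, 0.1],
               [0.5, 0.5],
               [0.2, 0.8]])
print(avg_refines(A, A @ R0))   # True
print(avg_refines(A @ R0, A))   # False: the post-processed channel cannot refine back to A
```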
6.5. Discussion
At the beginning of this section, we asked whether it is safe to compare differential privacy mechanisms by ‘comparing the epsilons’. We found that it is safe to compare epsilons within families of mechanisms (except in the unusual case of the over-truncated geometric mechanism). However, when comparing different mechanisms, it is not safe to just compare the epsilons, since none of the refinements hold in general. Once a ‘safe’ pair of epsilons has been calculated, then reducing epsilon in the refining mechanism is always safe. However, computing safe epsilons relies on the ability to construct a channel representation, which may not always be feasible.
7. Lattice Properties
The orders we have considered are all reflexive and transitive (i.e., preorders), but not anti-symmetric (i.e., not partial orders). This is due to the fact that there exist channels that have “syntactic” differences but the same semantics; e.g., two channels having their columns swapped. However, if we are only interested in a specific type of leakage, then all channels A, B such that A ⊑ B and B ⊑ A (where ⊑ is any of these orders) have identical leakage, so we can view them as the “same channel” (either by working on the equivalence classes of ⊑ or by writing all channels in some canonical form).
Seeing now ⊑ as a partial order, the natural question is whether it forms a lattice, that is, whether suprema and infima exist. If it exists, the supremum A ∨ B has an interesting property: it is the “least safe” channel that is safer than both A and B (any channel C such that A ⊑ C and B ⊑ C necessarily satisfies A ∨ B ⊑ C). If we wanted a channel that is safer than both A and B, A ∨ B would be a natural choice.
In this section, we briefly discuss this problem and show that, in contrast to the average-case order, both the max-case and the privacy-based orders do have suprema and infima (i.e., they form lattices).
7.1. Average-Case Refinement
In the case of the average-case order, “equivalent” channels are those producing the exact same hypers. However, even if we identify such channels, it is known [7] that two channels do not necessarily have a least upper bound w.r.t. this order; hence, it does not form a lattice.
7.2. Max-Case Refinement
In the case of the max-case order, “equivalent” channels are those producing the same posteriors (or, more generally, the same convex hull of posteriors). However, in contrast to the average-case order, if we identify such channels, that is, if we represent a channel only by the convex hull of its posteriors, then the max-case order becomes a lattice.
First, note that, given a finite set of posteriors δ^1, …, δ^k and coefficients a_1, …, a_k such that the prior π lies in their convex hull, i.e., such that ∑_i a_i δ^i = π, it is easy to construct a channel C producing each posterior δ^i with output probability a_i: it suffices to take C(x, i) = a_i δ^i(x) / π(x).
Thus, the supremum A ∨ B can be simply constructed by taking the intersection of the convex hulls of the posteriors of A and B. This intersection is itself a convex polytope, so it has (finitely many) extreme points, and we can construct A ∨ B as the channel having exactly those as posteriors. The infimum A ∧ B, on the other hand, can be constructed as the channel having as posteriors the union of those of A and B.
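The channel-from-posteriors construction used above can be sketched as follows (our own illustration; names are not from the paper): the entry for secret x and output i is the outer probability of i times the posterior of x given i, divided by the prior probability of x.

```python
import numpy as np

def channel_from_posteriors(posteriors, outer):
    """posteriors: (n_secrets x k) matrix whose columns are posterior distributions;
    outer: length-k vector of output probabilities.
    Returns C with C[x, i] = outer[i] * posteriors[x, i] / pi[x], where pi = posteriors @ outer."""
    pi = posteriors @ outer                 # the prior, reconstructed as the mixture of posteriors
    return posteriors * outer / pi[:, None]

delta = np.array([[0.8, 0.2],
                  [0.2, 0.8]])              # two posteriors on two secrets (as columns)
a = np.array([0.5, 0.5])                    # outer distribution over the two outputs
C = channel_from_posteriors(delta, a)
print(C)                                    # a channel producing exactly these posteriors
print(C.sum(axis=1))                        # rows sum to 1
```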
Note that computing the intersection of polytopes is NP-hard in general [21], so the supremum might be hard to construct. However, efficient special cases do exist [22]; we leave the study of the hardness of computing the supremum as future work.
7.3. Privacy-Based Refinement
In the case of the privacy-based order, “equivalent” channels are those producing the same induced metric. Representing channels only by their induced metric, we can use the fact that the set of metrics on the input domain does form a lattice under ≥. We first show that any metric can be turned into a corresponding channel.
Theorem 17.
For any metric d on the input domain, we can construct a channel whose induced metric coincides with d.
Then, the supremum of A and B will simply be the channel whose metric is the supremum (∨) of the metrics induced by A and B in the lattice of metrics, and similarly for the infimum.
Note that the infimum of two metrics is simply the max of the two (which is always a metric). The supremum, however, is trickier, since the min of two metrics is not always a metric: the triangle inequality might be violated. Thus, we first need to take the min of the two metrics, and then compute its “triangle closure”, by finding the shortest path between all pairs of elements, for instance using the well-known Floyd–Warshall algorithm.
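A minimal sketch of this supremum construction (our own illustration, with metrics given as symmetric distance matrices): take the pointwise min of the two matrices, then restore the triangle inequality via all-pairs shortest paths.

```python
import numpy as np

def metric_supremum(d1, d2):
    """Supremum of two metrics (as symmetric distance matrices):
    pointwise min followed by its 'triangle closure' via Floyd-Warshall."""
    d = np.minimum(d1, d2)
    n = d.shape[0]
    for k in range(n):                       # Floyd-Warshall: shortest paths restore
        for i in range(n):                   # the triangle inequality on the pointwise min
            for j in range(n):
                d[i, j] = min(d[i, j], d[i, k] + d[k, j])
    return d

d1 = np.array([[0.,  1., 10.],
               [1.,  0., 10.],
               [10., 10., 0.]])
d2 = np.array([[0., 10.,  1.],
               [10., 0., 10.],
               [1., 10.,  0.]])
print(metric_supremum(d1, d2))   # the (1,2) distance shrinks from 10 to 2 via the path through 0
```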
8. Conclusions
We have investigated various refinement orders for mechanisms for information protection, combining the max-case perspective typical of DP and its variants with the robustness of the QIF approach. We have provided structural characterizations of these preorders and methods to verify them efficiently. Then, we have considered various DP mechanisms, and investigated the relation between the ε-based measurement of privacy and our orders. We have shown that, while within the same family of mechanisms a smaller ε implies the refinement order, this is almost never the case for mechanisms belonging to different families.
Author Contributions
Conceptualization, K.C., N.F. and C.P.; methodology, K.C., N.F. and C.P.; software, K.C. and N.F.; validation, K.C. and N.F.; formal analysis, K.C. and N.F.; investigation, K.C. and N.F.; resources, C.P.; data curation, K.C. and N.F.; writing—original draft preparation, K.C. and N.F.; writing—review and editing, K.C., N.F. and C.P.; visualization, K.C. and N.F.; supervision, K.C. and C.P.; project administration, K.C. and C.P.; funding acquisition, C.P. All authors have read and agreed to the published version of the manuscript.
Funding
This work has been supported by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme, Grant agreement No. 835294, by the project ANR-16-CE25-0011 REPAS, and by the Equipe Associée LOGIS.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A. Proofs of Results about the Max-Case Refinement
Theorem A1.
Let π be a prior. If the max-case refinement holds between A and B, then the posteriors of B (under π) are convex combinations of those of A, that is,
Moreover, if (A1) holds and π is full support, then the max-case refinement holds between A and B.
Proof.
Note that, seeing π as a row vector, πA and πB are the output distributions of A and B, respectively. Denote the posteriors of A and B by δ and σ, respectively; we have as many posteriors as there are elements in the support of the corresponding output distribution. (A1) can then be written as
The proof consists of two parts: first, we show that (A2) for uniform is equivalent to . Second, we show that (A2) for full-support implies (A2) for any other prior.
For the first part, letting π be uniform, we show that (A2) is equivalent to . This is easy to see since the y-th row of is and the z-th row of is . Hence, we can construct R from the convex coefficients, and vice versa, as .
For the second part, let π be full-support and π′ be arbitrary. Since supp(π′) ⊆ supp(π), we necessarily have supp(π′C) ⊆ supp(πC) for any channel C. Assume that (A1) holds for π and let the corresponding convex coefficients be fixed. Fixing an arbitrary z in supp(π′B), define
We first show that
From this, it follows that .
Finally, denote the posteriors of A and B under π′ by δ′ and σ′, respectively; we show that (A1) holds for π′. Fixing an output z, we have that
□
Theorem A2.
The orders and coincide.
Proof.
Fix some arbitrary and denote by and the posteriors of and , respectively. Assuming , from Theorem A1, we get that each can be written as a convex combination . Hence,
from which follows.
Now, assume that the refinement does not hold; let S be the convex hull of the posteriors of A and define a vulnerability function that maps every prior to its Euclidean distance from S, that is,
Since S is a convex set, it is well known that this distance function is convex on the probability simplex (hence also quasi-convex). Note that it is 0 for all priors in S and strictly positive anywhere else.
By definition of S, the distance is 0 on every posterior of A; as a consequence, the max-case posterior vulnerability of A is 0. On the other hand, since the refinement does not hold, from Theorem A1, we get that there exists some posterior of B lying outside S, on which the distance is strictly positive. As a consequence, the max-case posterior vulnerability of B is strictly positive, which implies that the max-case leakage order does not hold. □
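For concreteness, a small numerical sketch of this “distance from the convex hull of A’s posteriors” vulnerability (ours, not from the paper; the solver-based projection is one possible implementation):

```python
import numpy as np
from scipy.optimize import minimize

def dist_to_hull(z, P):
    """Euclidean distance from distribution z to the convex hull of the columns of P."""
    k = P.shape[1]
    w0 = np.full(k, 1.0 / k)
    res = minimize(lambda w: np.sum((P @ w - z) ** 2), w0,
                   bounds=[(0.0, None)] * k,
                   constraints=({'type': 'eq', 'fun': lambda w: np.sum(w) - 1.0},),
                   method='SLSQP')
    return np.sqrt(res.fun)

# Posteriors of A as columns of P; a posterior of B outside their hull gets positive distance.
P = np.array([[0.9, 0.1],
              [0.1, 0.9]])
print(dist_to_hull(np.array([0.5, 0.5]), P))    # ~0: inside the hull
print(dist_to_hull(np.array([0.99, 0.01]), P))  # > 0: outside the hull
```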
Theorem A3.
The average-case refinement order is strictly stronger than the max-case one.
Proof.
The “stronger” part is essentially the data-processing inequality for max-case vulnerability [9] (Prop. 14). To show it directly, assume that A is refined by B in the average-case sense, that is, B = AR for some channel R, and define a channel S from the outputs of A to the outputs of B as
It is easy to check that S is a valid channel, i.e., that for all y. Moreover, we have that
hence .
The “strictly” part has already been shown in the body of the paper: The two matrices A and B in (14) provide an example in which , while . □
Appendix B. Proofs of the Results about the Privacy-Based Refinement
Proposition A1.
The orders and coincide.
Proof.
Assuming , recall that a channel C satisfies -privacy iff . Note also that . Setting , we get , which implies that B satisfies -privacy, hence .
Assuming , to show that , it is equivalent to show that A satisfies -privacy only if B also satisfies it. Let , if A satisfies -privacy, then ; hence, B also satisfies -privacy, concluding the proof. □
Theorem A4.
The average-case refinement order is strictly stronger than the max-case one, which in turn is strictly stronger than the privacy-based one.
Proof.
The “stronger” part is a direct consequence of the fact that the privacy-based order can be expressed in terms of max-case capacity for a suitable vulnerability measure (more concretely, it is a consequence of Theorems 2, 6 and 7). This is discussed in detail in Section 4.7.
For the “strictly” part, consider the following counter-example:
The only difference between A and B is that two rows have been swapped. Hence, the two channels induce the same metric, which implies that they are equivalent w.r.t. the privacy-based order. However, the posteriors of A and B (for the uniform prior) are (written in columns):
Since neither posterior of B can be written as a convex combination of the posteriors of A, from Theorem A1, we conclude that the max-case refinement does not hold. □
Theorem A5.
Let be any query and be two mechanisms on . If , then .
Proof.
Define , and similarly for . Then, we have:
□
Theorem A6.
is always achieved on a uniform prior . Namely
Proof.
Fix , and let and be the outer and inners of and , respectively. Since and , we have that:
Moreover, it holds that:
Finally, we have that:
which concludes the proof. □
Theorem A7.
[DP as max-case capacity] C satisfies -privacy iff . In other words: .
Proof.
Let denote the inners of . From Theorem 6, we have that
which, from the definition of , holds iff for all . □
Appendix C. Proofs of the Results on the Refinement Verification
Proposition A2
(Projection theorem, [23] (Prop. 1.1.9)). Let C be a nonempty, closed and convex set and let z be a point. There exists a unique vector in C that minimizes the distance to z over C, called the projection of z on C. Moreover, a vector in C is the projection of z on C iff
Theorem A8.
Let B̂ be the projection of B on the set of channels refined from A (i.e., channels of the form AR).
- 1. If B̂ = B, then A is refined by B.
- 2. Otherwise, let G = B - B̂. The gain function G provides a counter-example to the refinement, that is, under the uniform prior the G-leakage of B exceeds that of A.
Proof.
(1) is immediate from the definition of . For (2), we first show that
(in other words that is a hyperplane with normal G, separating B from ). For the left-hand inequality, we have . Moreover, since is closed and convex, from the projection theorem (Proposition A2), we get that for all , from which directly follows.
The proof continues similarly to the one of [7] (Theorem 9). We write posterior vulnerability (for uniform prior) as a maximization over all remapping strategies for respectively, namely
Then, follows from (A13) and the fact that (the identity is a remapping strategy). □
Appendix D. Proofs of Results about Refinement Comparison
We call a truncated geometric mechanism ‘square’ if it has the same input and output space (that is, the channel representation is a square matrix).
We first show that geometric mechanisms and truncated geometric mechanisms are equivalent to square mechanisms under .
Lemma A1.
Let G be a geometric mechanism. Then, the reduced (abstract) channel form of G is the corresponding square channel.
Proof.
First, note that the square channel is obtained from the (infinite) geometric mechanism by summing up all the ‘extra’ columns (i.e., those columns beyond the input range). Now, note that these ‘extra’ columns are scalar multiples of each other, since each column has the form
for increasing values of k. Thus, the ‘summing up’ operation is a valid reduction operation, and so the infinite geometric is reducible to the corresponding square channel. □
Lemma A2.
Let TG be a truncated geometric mechanism. Then, the reduced (abstract) channel form of TG is the corresponding square channel.
Proof.
First, note that the truncated geometric is obtained from the infinite geometric by summing up columns at the ends of the matrix. This is exactly the ‘reduction’ step noted above. We can continue, as above, to sum up ‘extra’ columns until we get a square matrix. □
Corollary A1.
Any refinement that holds for a square geometric mechanism also holds for any truncated geometric mechanism or for the (infinite) geometric mechanism over the same input domain.
Note that we only define truncation as far as the square geometric matrix, since at this point the columns of the matrix are linearly independent and can no longer be truncated via matrix reduction operations. We now show that refinement holds for the square geometric mechanisms.
Lemma A3.
Let G be a square geometric mechanism. Then, decreasing ε produces a refinement of it. That is, G_ε is refined by G_ε′ iff ε′ ≤ ε.
Proof.
The square geometric mechanism has the following form:
where , and similarly for with .
Now, this matrix is invertible and the inverse has the following form:
Recalling that refinement holds iff B = AR for some channel R, we can construct a suitable R using the inverse above, namely R = A⁻¹B (with A the mechanism with the larger ε and B the one with the smaller ε). It suffices to show that R is a valid channel.
It is clear that the rows of R sum to 1, since it is the product of matrices with rows summing to 1.
Multiplying out the matrix R yields:
where the abbreviations are as in the displayed matrices above. The only way that any of these matrix entries can be less than 0 is if ε′ > ε. Thus, R is a valid channel precisely when ε′ ≤ ε, and so the refinement holds as required. □
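This construction can be checked numerically. The sketch below is ours: it assumes a standard parametrization of the truncated geometric channel (entries proportional to e^{−ε|x−y|}, with the boundary columns absorbing the tails), which may differ in presentation from the matrices displayed above.

```python
import numpy as np

def truncated_geometric(eps, n):
    """A standard truncated geometric channel on {0,...,n-1} (assumed parametrization)."""
    a = np.exp(-eps)
    C = np.zeros((n, n))
    for x in range(n):
        for y in range(n):
            c = 1 / (1 + a) if y in (0, n - 1) else (1 - a) / (1 + a)
            C[x, y] = c * a ** abs(x - y)
    return C

def is_channel(M, tol=1e-9):
    """Entries non-negative and rows summing to 1."""
    return bool(np.all(M >= -tol) and np.allclose(M.sum(axis=1), 1.0))

A = truncated_geometric(1.0, 5)              # parameter eps
B = truncated_geometric(0.5, 5)              # parameter eps' <= eps
print(is_channel(np.linalg.solve(A, B)))     # per Lemma A3: True, refinement holds
print(is_channel(np.linalg.solve(A, truncated_geometric(2.0, 5))))  # eps' > eps: False
```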
The following theorems now follow from the previous lemmas.
Theorem A9.
Let G_ε, G_ε′ be geometric mechanisms. Then, G_ε is refined by G_ε′ iff ε′ ≤ ε. That is, decreasing ε produces a refinement of the mechanism.
Proof.
Using Lemma A1, we can express each geometric mechanism as a square channel, and from Lemma A3 it follows that the refinement holds. □
Theorem A10.
Let TG_ε, TG_ε′ be truncated geometric mechanisms. Then, TG_ε is refined by TG_ε′ iff ε′ ≤ ε. That is, decreasing ε produces a refinement of the mechanism.
Proof.
As above, using Lemmas A2 and A3. □
Theorem A11.
Let A, B be over-truncated geometric mechanisms with parameters ε and ε′ < ε, respectively. Then, A is not refined by B, for any such ε′. That is, decreasing ε does not produce a refinement.
Proof.
Consider the following counter-example:
The two channels are over-truncated geometric mechanisms, the second parametrized by a smaller ε than the first. We expect the second to be safer than the first, i.e., a refinement to hold. However, under the uniform prior, the gain function
yields a strictly larger posterior g-vulnerability for the second channel than for the first; thus, the second channel leaks more than the first for this adversary. (In fact, for this gain function, the first channel leaks nothing, so the adversary learns nothing from observing its output.) □
Theorem A12.
Let TG be an over-truncated geometric mechanism. Then, reducing ε does not produce a refinement; however, it does produce a privacy-based refinement.
Proof.
We show the first part using a counter-example. Consider the following channels:
The two channels are over-truncated geometric mechanisms, the second using a smaller ε than the first. We can define the (prior) vulnerability V as the usual (convex) g-vulnerability. Then, under a uniform prior, the gain function given by:
yields and . Thus, .
For the second part, we first note that for any square geometric channel we have exactly when are adjacent rows in the matrix (this can be seen from the construction of the square channel). Now, the over-truncated geometric is obtained by summing columns of the square geometric. By construction, the square geometric A has adjacent elements satisfying when and x is above (or on) the diagonal of the channel matrix; otherwise, . This means that each (over)-truncation step maintains the ratio except when and occur on diagonal elements, in which case their sum is between and . Since this affects only two elements in each row, we still have that (until the final truncation step to produce a single 1 vector). Therefore, since holds for the square matrix, and it holds under truncation, we must have that it holds for over-truncated geometric mechanisms. Thus, reducing corresponds to refinement under as required. □
We show that the randomized response mechanism behaves well with respect to refinement by considering three cases.
Firstly, we consider the case where the channel matrix is square. We use the following lemmas:
Lemma A4.
Let A, B be ‘square’ randomized response mechanisms of the same dimension. Then, AB is a randomized response mechanism whose parameter depends on the parameters of A and B and on their dimension.
Proof.
Observe that can be factorised as where R has the form:
and similarly for . Multiplying out gives the matrix:
(Note that the constant coefficient factored out the front does not affect the calculation for the channel.) This is exactly the randomized response mechanism required. □
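A quick numerical illustration of this composition property (ours; it assumes the standard n-ary randomized response parametrization):

```python
import numpy as np

def randomized_response(eps, n):
    """n-ary randomized response: e^eps/(e^eps+n-1) on the diagonal, 1/(e^eps+n-1) elsewhere."""
    e = np.exp(eps)
    C = np.full((n, n), 1.0 / (e + n - 1))
    np.fill_diagonal(C, e / (e + n - 1))
    return C

n = 4
AB = randomized_response(1.0, n) @ randomized_response(0.7, n)
# The product again has constant diagonal and constant off-diagonal entries, i.e., it is
# itself a randomized response mechanism; its parameter is the log-ratio of the two values.
print(np.round(AB, 4))
print("composed parameter:", np.log(AB[0, 0] / AB[0, 1]))
```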
Lemma A5.
For any , the function
defined for is increasing and has range .
Proof.
We can see that is continuous for the given domain and the derivative is for all .
Additionally, at , the function is defined and equal to 1. In addition,
□
Lemma A6.
Let A, B be randomized response mechanisms represented by square matrices (that is, with equal input and output dimension). Then, A is refined by B iff the parameter of B is at most that of A.
Proof.
Note first that A and B are in reduced (abstract channel) form, and so ⊑ is a partial order on them.
From Lemma A4, we know that the composition of two randomized response mechanisms is another randomized response mechanism. Therefore, for the reverse direction, if the parameter of B is strictly smaller than that of A, then Lemma A5 tells us that we can find a randomized response mechanism R such that AR = B. In the case of equality, we can choose the identity mechanism.
For the forward direction, we show the contrapositive. If , then we know there exists an R such that . However, this means that . Since the matrices are reduced (as channels), ⊑ is a partial order and so this implies . Thus, we must have . □
Interestingly, we can use this result to show the second case where we consider ‘over-truncated’ mechanisms.
Lemma A7.
Let , be randomized response mechanisms with . Then, iff .
Proof.
Notice that this ‘over-truncated’ randomized response mechanism is just a square randomized response mechanism ‘glued’ onto a matrix containing all values .
Denote the corresponding square mechanisms by (note the parameters are the same) and denote by N the matrix containing only (note that this is the same matrix for both and ).
From Lemma A6, we can find a square randomized response mechanism R satisfying iff . Notice that R must be doubly symmetric, since its ith row is the same as its ith column. Thus, the dot product of any column of R with any row vector containing only must yield . In addition, we must have . Now, we have that is just glued onto N, which is the same as glued onto N; thus, R also satisfies . Following the same arguments as for Lemma A6, we have iff . □
Finally, we consider the third case.
Lemma A8.
Let , be randomized response mechanisms with . Then, iff .
Proof.
The channel matrix is equivalent to the square randomized response mechanism of dimension with the bottom rows removed (and similarly for ). This means any solution R for is a solution for . Thus, for the reverse direction, if , we can always find an R such that .
For the forward direction, we prove the contrapositive. Note that in this case we cannot assume a partial order relation for , since there may be columns of which are identical. If , we want to show that . To do this, we need to find a gain function and prior such that . The min-entropy leakage will do: this is simply the sum of the column maxima of the channel matrix. For the randomized response channels, this is given by
for a channel with dimensions . This is an increasing function of (for ), in fact the derivative is always positive for , hence we must have . Thus, . □
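To illustrate the column-maxima computation used here, a small sketch (ours; the non-square randomized response matrix below is a hypothetical instance with fewer inputs than outputs, matching the shape discussed above):

```python
import numpy as np

def randomized_response_rect(eps, n_in, n_out):
    """Hypothetical randomized response with n_in <= n_out: row x puts e^eps/(e^eps+n_out-1)
    on output x and 1/(e^eps+n_out-1) on every other output."""
    e = np.exp(eps)
    C = np.full((n_in, n_out), 1.0 / (e + n_out - 1))
    for x in range(n_in):
        C[x, x] = e / (e + n_out - 1)
    return C

def posterior_bayes_vulnerability(C):
    """Posterior Bayes vulnerability under the uniform prior: (1/n_in) * sum of column maxima.
    Min-entropy leakage is the log of its ratio to the prior vulnerability 1/n_in."""
    return C.max(axis=0).sum() / C.shape[0]

print(posterior_bayes_vulnerability(randomized_response_rect(0.5, 3, 5)))
print(posterior_bayes_vulnerability(randomized_response_rect(1.0, 3, 5)))
# The value increases with eps, matching the monotonicity argument in the proof.
```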
We can now conclude the following theorem from the main body of the paper:
Theorem A13.
Let R be a randomized response mechanism. Then, decreasing ε in R produces a refinement. That is, R_ε is refined by R_ε′ iff ε′ ≤ ε.
Proof.
Follows from Lemmas A6–A8. □
Theorem A14.
Let R be a randomized response mechanism, E an exponential mechanism and TG a truncated geometric mechanism. Then, TG is refined by R and by E w.r.t. the privacy-based order. However, the privacy-based refinement does not hold between R and E.
Proof.
We first show that TG is refined by R, which is equivalent to showing that the metric induced by R is pointwise at most the metric induced by TG. For the geometric mechanism, the induced distance between any two distinct inputs is at least ε. For the randomized response mechanism, the induced distance between two inputs is either ε (if they are distinct) or 0 (if they coincide). Thus, the metric of R is dominated by that of TG, and so TG is refined by R.
We now show that TG is refined by E. Recall that we parametrize the exponential mechanism by the smallest possible ε such that it satisfies ε-privacy. In this case, we find that, for any pair of distinct inputs, the distance induced by E is at most the corresponding distance induced by the geometric mechanism. Therefore, the metric of E is dominated by that of TG, and so TG is refined by E.
The proof that the privacy-based refinement does not hold between R and E was provided in the main body of the paper. □
Theorem A15.
Every (truncated geometric, randomized response, exponential) mechanism is ‘the safest possible mechanism’ when parametrized by ε = 0. That is, every mechanism (possibly from a different family) is refined by every mechanism parametrized by ε = 0.
Proof.
The intuition is that all channels parametrized by ε = 0 are equivalent to the trivial single-column channel (that is, the channel consisting only of 1s). Indeed, the exponential and randomized response mechanisms parametrized by ε = 0 have every element equal to 1/m, where m is the number of columns. These clearly reduce to the trivial channel. The truncated geometric mechanism with ε = 0 contains all 0s except for the first and last column, which contain 1/2. Again, this reduces to the trivial channel. Since the trivial channel refines everything (that is, every channel L is refined by it), the result follows. □
Appendix E. Proofs of the Results about the Lattice Properties
Theorem A16.
For any metric d on the input domain, we can construct a channel whose induced metric coincides with d.
Proof.
We first show that, for any element x̂ of the input domain, we can construct a channel whose induced metric is below d but coincides with it on all distances to x̂, that is:
To construct this channel, we use just two outputs, and we use the fact that d admits a geodesic through x̂, that is, a curve such that
For instance, we can check that is such a geodesic.
We can now use the geodesic to assign a probability distribution to each secret such that the properties (A33) are satisfied. Concretely, define each row x of the channel as:
Finally, the channel C is constructed as the visible choice of all these two-output channels. As a consequence, its induced metric will be the max of the corresponding induced metrics. From this and (A33), we can easily conclude that the induced metric of C coincides with d.
Finally, note that the visible choice adds the columns of all mechanisms, so the constructed channel has columns. However, the equality of distances in (A33) is given by the first column of (this is because of the way is constructed), hence we can merge all second columns together, giving finally a simple construction for with (i.e., having columns)
□
References
- Dwork, C.; Mcsherry, F.; Nissim, K.; Smith, A. Calibrating noise to sensitivity in private data analysis. In Proceedings of the Third Theory of Cryptography Conference (TCC), New York, NY, USA, 4–7 March 2006; Lecture Notes in Computer Science. Halevi, S., Rabin, T., Eds.; Springer: Berlin/Heidelberg, Germany, 2006; Volume 3876, pp. 265–284. [Google Scholar]
- Duchi, J.C.; Jordan, M.I.; Wainwright, M.J. Local Privacy and Statistical Minimax Rates. In Proceedings of the 54th Annual IEEE Symposium on Foundations of Computer Science (FOCS), Berkeley, CA, USA, 26–29 October 2013; pp. 429–438. [Google Scholar] [CrossRef]
- Chatzikokolakis, K.; Andrés, M.E.; Bordenabe, N.E.; Palamidessi, C. Broadening the scope of Differential Privacy using metrics. In Proceedings of the 13th International Symposium on Privacy Enhancing Technologies (PETS 2013), Bloomington, IN, USA, 10–12 July 2013; Lecture Notes in Computer Science. De Cristofaro, E., Wright, M., Eds.; Springer: Berlin/Heidelberg, Germany, 2013; Volume 7981, pp. 82–102. [Google Scholar]
- Köpf, B.; Basin, D.A. An information-theoretic model for adaptive side-channel attacks. In Proceedings of the 2007 ACM Conference on Computer and Communications Security (CCS 2007), Alexandria, VA, USA, 28–31 October 2007; Ning, P., di Vimercati, S.D.C., Syverson, P.F., Eds.; ACM: New York, NY, USA, 2007; pp. 286–296. [Google Scholar] [CrossRef]
- Smith, G. On the Foundations of Quantitative Information Flow. In Proceedings of the 12th International Conference on Foundations of Software Science and Computation Structures (FOSSACS 2009), York, UK, 22–29 March 2009; de Alfaro, L., Ed.; Springer: York, UK, 2009; Volume 5504, pp. 288–302. [Google Scholar]
- Alvim, M.S.; Chatzikokolakis, K.; Palamidessi, C.; Smith, G. Measuring Information Leakage Using Generalized Gain Functions. In Proceedings of the 25th IEEE Computer Security Foundations Symposium (CSF), Cambridge, MA, USA, 25–27 June 2012; pp. 265–279. [Google Scholar] [CrossRef]
- McIver, A.; Morgan, C.; Smith, G.; Espinoza, B.; Meinicke, L. Abstract Channels and Their Robust Information-Leakage Ordering. In Proceedings of the Third International Conference on Principles of Security and Trust (POST), Grenoble, France, 5–13 April 2014; Lecture Notes in Computer Science. Abadi, M., Kremer, S., Eds.; Springer: Berlin/Heidelberg, Germany, 2014; Volume 8414, pp. 83–102. [Google Scholar]
- Kifer, D.; Machanavajjhala, A. Pufferfish: A framework for mathematical privacy definitions. ACM Trans. Database Syst. 2014, 39, 3:1–3:36. [Google Scholar] [CrossRef]
- Alvim, M.S.; Chatzikokolakis, K.; McIver, A.; Morgan, C.; Palamidessi, C.; Smith, G. Axioms for Information Leakage. In Proceedings of the 29th IEEE Computer Security Foundations Symposium (CSF), Lisbon, Portugal, 27 June–1 July 2016; pp. 77–92. [Google Scholar] [CrossRef]
- Yasuoka, H.; Terauchi, T. Quantitative Information Flow—Verification Hardness and Possibilities. In Proceedings of the 23rd IEEE Computer Security Foundations Symposium, Edinburgh, UK, 17–19 July 2010; pp. 15–27. [Google Scholar] [CrossRef]
- Malacaria, P. Algebraic foundations for quantitative information flow. Math. Struct. Comput. Sci. 2015, 25, 404–428. [Google Scholar] [CrossRef]
- Clark, D.; Hunt, S.; Malacaria, P. Quantitative Information Flow, Relations and Polymorphic Types. J. Log. Comput. 2005, 18, 181–199. [Google Scholar] [CrossRef]
- Malacaria, P. Assessing security threats of looping constructs. In Proceedings of the 34th ACM SIGPLAN-SIGACT Symposium on Principles of Programming Languages (POPL 2007), Nice, France, 17–19 January 2007; Hofmann, M., Felleisen, M., Eds.; ACM: New York, NY, USA, 2007; pp. 225–235. [Google Scholar]
- Landauer, J.; Redmond, T. A Lattice of Information. In Proceedings of the Computer Security Foundations Workshop VI, Franconia, NH, USA, 15–17 June 1993; pp. 65–70. [Google Scholar]
- Hsu, J.; Gaboardi, M.; Haeberlen, A.; Khanna, S.; Narayan, A.; Pierce, B.C.; Roth, A. Differential Privacy: An Economic Method for Choosing Epsilon. In Proceedings of the IEEE 27th Computer Security Foundations Symposium, CSF 2014, Vienna, Austria, 19–22 July 2014; pp. 398–410. [Google Scholar] [CrossRef]
- Ghosh, A.; Roth, A. Selling Privacy at Auction. In Proceedings of the 12th ACM Conference on Electronic Commerce, San Jose, CA, USA, 5–9 June 2011; ACM: New York, NY, USA, 2011; pp. 199–208. [Google Scholar] [CrossRef]
- Andrés, M.E.; Bordenabe, N.E.; Chatzikokolakis, K.; Palamidessi, C. Geo-indistinguishability: Differential privacy for location-based systems. In Proceedings of the 20th ACM Conference on Computer and Communications Security (CCS 2013), Berlin, Germany, 4–8 November 2013; ACM: New York, NY, USA, 2013; pp. 901–914. [Google Scholar] [CrossRef]
- Barthe, G.; Köpf, B. Information-theoretic Bounds for Differentially Private Mechanisms. In Proceedings of the 24th IEEE Computer Security Foundations Symposium (CSF), Cernay-la-Ville, France, 27–29 June 2011; pp. 191–204. [Google Scholar]
- Kasiviswanathan, S.P.; Smith, A. On the ‘Semantics’ of Differential Privacy: A Bayesian Formulation. J. Priv. Confidentiality 2008, 6. [Google Scholar] [CrossRef]
- Chatzikokolakis, K.; Fernandes, N.; Palamidessi, C. Comparing systems: Max-case refinement orders and application to differential privacy. In Proceedings of the 32nd IEEE Computer Security Foundations Symposium, Hoboken, NJ, USA, 25–28 June 2019; pp. 442–457. [Google Scholar] [CrossRef]
- Tiwary, H.R. On the Hardness of Computing Intersection, Union and Minkowski Sum of Polytopes. Discret. Comput. Geom. 2008, 40, 469–479. [Google Scholar] [CrossRef]
- Fukuda, K.; Liebling, T.M.; Lütolf, C. Extended Convex Hull. Comput. Geom. 2000, 20, 13–23. [Google Scholar] [CrossRef]
- Bertsekas, D.P. Convex Optimization Theory; Athena Scientific: Belmont, MA, USA, 2009. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).