Simple Majority Consensus in Networks with Unreliable Communication

Tamir, Ran; Livshits, Ariel; Shadmi, Yonatan

doi:10.3390/e24030333


Simple Majority Consensus in Networks with Unreliable Communication^†

by Ran Tamir ¹,*, Ariel Livshits ² and Yonatan Shadmi ²

¹ Signal and Information Processing Laboratory, ETH Zürich, 8092 Zürich, Switzerland

² The Andrew and Erna Viterbi Faculty of Electrical and Computer Engineering, Technion-Israel Institute of Technology, Technion City, Haifa 3200003, Israel

* Author to whom correspondence should be addressed.

† This paper is an extended version of our paper published in the proceedings of the 2021 International Symposium on Distributed Computing.

Entropy 2022, 24(3), 333; https://doi.org/10.3390/e24030333

Submission received: 24 January 2022 / Revised: 16 February 2022 / Accepted: 24 February 2022 / Published: 25 February 2022

(This article belongs to the Section Multidisciplinary Applications)


Abstract:

In this work, we analyze the performance of a simple majority-rule protocol solving a fundamental coordination problem in distributed systems, binary majority consensus, in the presence of probabilistic message loss. Using probabilistic analysis for a large-scale, fully connected network of $2 n$ agents, we prove that the Simple Majority Protocol (SMP) reaches consensus in only three communication rounds, with probability approaching 1 as n grows to infinity. Moreover, if the difference between the numbers of agents that hold different opinions grows at a rate of $\sqrt{n}$, then the SMP with only two communication rounds attains consensus on the majority opinion of the network, and if this difference grows faster than $\sqrt{n}$, then the SMP reaches consensus on the majority opinion of the network in a single round, with probability converging to 1 exponentially fast as $n \to \infty$. We also provide some converse results, showing that these requirements are not only sufficient, but also necessary.

Keywords:

binary majority consensus; fully-connected network; majority dynamics; multiagent systems; noisy network

1. Introduction

The digital age drove forth the need for easy and fast access to information. The world wide web has facilitated the existence of many useful multiagent systems from messaging apps to cryptocurrency [1] and distributed data storage (or cloud services) [2,3]. However, the design of multiagent systems inherently requires agents to communicate and coordinate according to a prescribed shared protocol to achieve a common goal. For example, messaging apps must always show messages in the same order to all participants in a conversation, which is challenging when user clocks are not necessarily synchronized [4,5]. Cryptocurrencies employ decentralized data structures to register currency transactions, which require a vast majority of users to agree upon its current state [6]. Distributed data storage services must show consistent views of stored files in the presence of multiple concurrent reading and writing operations [7,8].

In the pursuit of developing such distributed protocols, much of the literature routinely makes two powerful assumptions. The first is that communication links are reliable [9,10,11], i.e., all messages between agents are eventually delivered. The second is that there exists an upper bound on the transmission delay of messages from one agent to another (usually the maximum propagation time of links) [12]. Nonetheless, communication networks are notoriously unreliable [13,14,15]. In fact, actual communication links may suffer from sudden crashes, causing messages in transit to be lost forever. In an effort to ensure reliability, distributed applications are generally built upon a reliable broadcast layer implemented by the Transmission Control Protocol (TCP) [16], one of the main protocols in the internet protocol suite. However, while TCP guarantees eventual delivery of all sent messages, it does not provide any upper bound on delivery time [17] (p. 9). In practice, these assumptions do not hold simultaneously.

In this work, we assume no such underlying structure exists and analyze the performance of a simple majority-rule protocol solving a fundamental coordination problem in distributed systems, binary majority consensus, in the presence of probabilistic message loss. Using probabilistic analysis for a large-scale, fully connected network of $2 n$ agents, we prove that the Simple Majority Protocol (SMP) converges rapidly to a consensus on the majority opinion of the network with probability approaching 1 as $n \to \infty$, given that the difference between the numbers of agents that hold different opinions grows at least as fast as $\sqrt{n}$. Otherwise, if this difference is relatively close to zero, then the SMP still converges extremely fast to a consensus, but not necessarily on the initial majority opinion of the network.

1.1. Importance of Reliable Communication

Reliability of communication is essential to guarantee coordination in almost all cases. The pitfalls and design challenges of coordination when communication is unreliable are best illustrated by the two generals’ problem, which was popularized by Jim Gray [18].

Consider two generals who must coordinate a joint attack on an enemy. Both generals must attack simultaneously for the attack to succeed. While the two generals agreed that they will attack, they have not agreed upon a time for the attack. To coordinate, they can send messages to one another by running messengers. However, the messengers can be captured by the enemy and their messages will therefore not reach their destination.

Due to the uncertainty of message delivery, there exists no deterministic joint communication protocol that guarantees a coordinated attack. To see this, suppose, for the sake of contradiction, that such a protocol exists. Since a deterministic protocol must solve the problem in a finite number of steps, the protocol prescribes a fixed number of message exchanges between the two generals, after which both must attack together. Some of these messages are successfully delivered and some are lost. Consider the last successfully delivered message in a run of the protocol, after which the recipient is confident enough to attack without the need for any further correspondence. If this message were lost instead, the recipient would hold off and not attack. However, the sender does not know about this last communication failure. By the protocol definition, he must attack anyway, despite his counterpart’s reluctance, contradicting the assumption that the protocol solves the problem.

1.2. Majority Consensus

The impossibility result of the two generals’ problem had far-reaching implications in the field of distributed protocols and databases, including the study of binary consensus [19]. In the binary consensus problem, every agent is initially assigned some binary value, referred to as the agent’s initial opinion. The goal of a protocol that solves consensus is to have every agent eventually decide on the same opinion, thus reaching agreement throughout the system. More formally, given any initial assignment of agent opinions, a run of a protocol which solves consensus must exhibit the following three properties:

Decision: every agent eventually decides on some opinion $v \in {0, 1}$ ;
Agreement: if some agent decided on v, no opinion other than v can be decided on by any other agent;
Nontriviality: if some agent has decided on v, then v was an opinion initially assigned to some agent.

Consensus is a fundamental problem in distributed systems, as many other coordination problems were shown to be directly reducible to and from consensus. The list includes agreeing on what transactions to commit to a database [20], state machine replication [21], atomic snapshots [22], total ordering of concurrent events [23], and the two generals’ problem, implying that no protocol can guarantee all three properties when communication is unreliable [24].

In light of this, it is interesting to consider a variation of the two generals’ problem where the probability of a messenger getting captured is p (independently of other messengers) [25,26]. While coordinated attack is still deterministically impossible, it is straightforward to design a protocol that guarantees success with probability at least q, which can be as close as desired to 1. The first general simply sends

⌈ \log_{p} (1 - q) ⌉

messengers, then attacks at the specified time without waiting for a reply, and the second general attacks if any messenger from the first general arrives.
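The messenger count above can be checked numerically. The following is a minimal sketch, assuming only the formula in the text; the function name `messengers_needed` is hypothetical, and `q` here denotes the target success probability of this paragraph, not the loss parameter introduced later.

```python
import math


def messengers_needed(p: float, q: float) -> int:
    """Return ceil(log_p(1 - q)): the smallest k with 1 - p**k >= q.

    p: probability that a single messenger is captured (0 < p < 1).
    q: desired probability that at least one messenger arrives (0 < q < 1).
    """
    if not (0.0 < p < 1.0 and 0.0 < q < 1.0):
        raise ValueError("p and q must lie strictly between 0 and 1")
    # log_p(1 - q) = ln(1 - q) / ln(p); both logarithms are negative.
    return math.ceil(math.log(1.0 - q) / math.log(p))


if __name__ == "__main__":
    for p, q in [(0.5, 0.99), (0.9, 0.99)]:
        k = messengers_needed(p, q)
        print(f"p={p}, q={q}: send {k} messengers, success prob {1 - p**k:.6f}")
```

All k messengers are captured with probability p^k, so the attack is coordinated with probability 1 - p^k, which is at least q exactly when k is at least log_p(1 - q).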

In this work, we investigate whether leveraging such an assumption helps to solve binary majority consensus, in which the nontriviality clause stipulates that if a majority of agents initially hold the same opinion, then all agents must decide on this opinion. This variant of consensus is utilized when the agreed upon opinion holds importance beyond facilitating agreement. For example, a distributed system of sensors capable of detecting natural gas could use majority consensus to answer the question “Is the amount of gas in the air greater than 10,000 ppm?” to help detect a gas leak in a gas processing center.

We analyze the performance of the SMP in a complete graph of communication, i.e., where each agent has an active communication channel to every other agent in the system. In SMP, agents communicate in equal-length time intervals called rounds. All messages are sent at the beginning of a communication round, and they either arrive by the end of the round or are considered lost. We assume that all message loss events are statistically independent and identically distributed with some constant probability.

The SMP can be briefly described as follows: in each round, every agent sends its current opinion to all other agents. Then, it waits to receive all messages from other agents proposing their own opinions. If a majority of received messages propose the same opinion, then the agent adopts this opinion for the next round. All ties are reconciled by readopting the agent’s own opinion. After a fixed number of rounds r, each agent decides on its currently adopted opinion.

Similarly to the probabilistic protocol for the two generals’ problem discussed above, the SMP does not solve consensus deterministically, but rather provides probabilistic guarantees instead. The Decision and Nontriviality properties of classical consensus are assured, since all agents decide by the end of round r and any opinion that was decided on, was proposed by some agent. However, Agreement is not assured, since there always exists a nonzero probability of a run of the protocol in which message losses cause one agent to see only one opinion and another agent to see only the other, thus making them disagree. Likewise, Nontriviality of majority consensus is not guaranteed, since the majority opinion could be hidden from some agent. We will show in this article that the probability of these runs is negligible as the number of agents, n, tends to infinity, thus demonstrating that unreliable communication is not an insurmountable obstacle for coordination.

Specifically, we prove that the SMP with

r = 3

reaches classical consensus with probability converging to 1 as n tends to infinity. In a system of

2 n

agents, let

δ_{n}

be the number of agents that are initially assigned the majority opinion minus n. For simplicity, assume the majority opinion is always the same for all n. We show that if

δ_{n}

grows at a rate of

\sqrt{n}

, then the SMP with

r = 2

reaches majority consensus with probability approaching 1 as

n \to \infty

. We also show that if

δ_{n}

grows at a rate faster than

\sqrt{n}

, then the SMP with

r = 1

reaches majority consensus with probability that converges to 1 exponentially fast.

We also show that these achievability results are, in fact, tight. We will prove that if $δ_{n} = 0$, then $r = 3$ communication rounds are necessary, since the probability of reaching consensus with only $r = 2$ rounds converges to 0 as $n \to \infty$. Similarly, if $δ_{n}$ grows only at a rate of $\sqrt{n}$, then $r = 2$ rounds are necessary to reach majority consensus.

1.3. Related Work

The problem of binary majority consensus has been extensively researched in many different fields and contexts including autonomous systems [27,28,29,30], distributed systems [31,32,33], and information theory [34,35,36]. Almost always the problem is studied in the context of possible failure of some aspect of the network. In distributed systems, failure most often arises from agents behaving maliciously, failing to follow the protocol, or outright crashing. Consequently, protocols that solve consensus (and majority consensus by extension) are designed to tolerate a certain fraction of the set of agents failing [37,38]. Transmission faults (i.e., message loss, erasure, or addition) can be considered an extension of agent failure, but doing so may lead to false conclusions. For example, in a system of n agents, the entire system may be considered faulty even if only one message from each agent is lost. However, as shown by Santoro and Widmayer [39], the system may tolerate up to $n - 1$ message losses in a round and still reach consensus. Additionally, assuming a probability distribution on message loss is consistent with how network protocols are analyzed. The most notable example is that TCP throughput was shown to be inversely proportional to the square root of the link’s average packet (i.e., message) loss probability [40].

In [27,29,30,34], the authors studied the effects of message loss, random topology, Gaussian noise, and faulty agents, on the SMP’s convergence rate, i.e., the fraction of initial assignments of agent opinions (out of

2^{n}

) resulting in successful agreement. Specifically, in [30] computer simulations showed an improvement in the convergence rate of the SMP as the message loss probability increased up to

0.8

, after which the rate begins to decrease to zero. In contrast, we are interested in the maximal probability of failure over any initial assignment of agent opinions, since we cannot assume any distribution or frequency on the input to the consensus problem.

Mustafa and Pekeč [28] studied the requirements on the connectivity of the network such that, under assumption of reliable communication, SMP achieves consensus on any initial assignment of agent opinions. Their main result is that the SMP computes the majority consensus successfully only in highly-connected networks. This conclusion led us to analyze the SMP under the assumption of a fully-connected network. However, message loss may actually improve the chances of consensus in graphs with lesser degrees of connectivity, as shown in [30]. We leave the proof of this hypothesis to future work. Additionally, the complete graph assumption is a valid approximation for unstructured overlays in peer to peer networks, e.g., Freenet, Gnutella, and Fast Track [41].

Our work closely resembles the work performed in [35,36]. These articles have shown that in a lossless fully-connected network where agents poll a portion of their neighbors uniformly at random, the SMP converges quickly to majority consensus with probability of error (in the sense that agreement was reached, but not on the majority opinion) that decays exponentially with n. While assuming the existence of infinitely many agents in a system may initially seem ludicrous and impractical, our own computer simulations of the SMP showed that these kinds of results hold true even if the number of agents is on the order of $10^{6}$, which is already the case in cryptocurrency protocols. We add another assumption of unreliable communication and show that this, essentially, does not change the outcome.

Yet another line of relatively recent work deserves special attention. In [42], a local polling protocol is proposed, and it is proved that it reaches consensus on the initial global majority in general graphs with certain degree properties. An estimate of the number of required steps to reach consensus is provided. In [43], similar results were given for random regular graphs. In both of these papers, it is assumed that a clear bias exists between the two initial opinions, in contrast to our main assumption in the current work, that the initial condition may be completely unbiased. In [44], the binary consensus problem was tackled from a different angle. For a random graph

G (n, p)

with a connectivity parameter

p \in (0, 1)

and any given

ϵ \in (0, 1)

, this work reveals what the initial difference between the two camps should be, such that the larger camp will eventually win with probability at least as high as

1 - ϵ

. In [45], the binary consensus problem was solved for relatively sparse random graphs but with random initial states, which is slightly different than the assumptions in the current work. A remarkable result was proved in [45], stating that a consensus can be reached in at most four communication rounds.

The remaining part of the paper is organized as follows. In Section 2, we establish notation conventions. In Section 3, we formalize the model, the protocol, and the objectives of this work. In Section 4, we provide and discuss the main results of this work, and in Section 5, we prove them.

2. Notation Conventions

Throughout the paper, random variables will be denoted by capital letters, realizations will be denoted by the corresponding lower case letters, and their alphabets will be denoted by calligraphic letters. Random vectors and their realizations will be denoted, respectively, by boldface capital and lower case letters. Their alphabets will be superscripted by their dimensions. The binary Kullback–Leibler divergence function between two binary probability distributions with parameters

α, β \in [0, 1]

is defined as:

\begin{matrix} D (α ∥ β) = α \log \left(\frac{α}{β}\right) + (1 - α) \log \left(\frac{1 - α}{1 - β}\right), \end{matrix}

(1)

where logarithms, here and throughout the sequel, are understood to be taken to the natural base. The cumulative distribution function of a standard normal random variable is defined by:

\begin{matrix} Φ (t) = \int_{- \infty}^{t} \frac{1}{\sqrt{2 π}} \exp \left\{- \frac{s^{2}}{2}\right\} \, d s . \end{matrix}

(2)
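For numerical experiments, the two functions in (1) and (2) can be evaluated with the standard library alone. This is a minimal sketch; the names `binary_kl` and `std_normal_cdf` are hypothetical, and the convention 0 log 0 = 0 at the boundary is an assumption that the text does not state explicitly.

```python
import math


def binary_kl(alpha: float, beta: float) -> float:
    """Binary Kullback-Leibler divergence D(alpha || beta) of (1), in nats."""

    def term(a: float, b: float) -> float:
        if a == 0.0:
            return 0.0  # assumed convention: 0 * log(0 / b) = 0
        if b == 0.0:
            return math.inf  # positive mass where the reference has none
        return a * math.log(a / b)

    return term(alpha, beta) + term(1.0 - alpha, 1.0 - beta)


def std_normal_cdf(t: float) -> float:
    """Standard normal CDF of (2), via Phi(t) = (1 + erf(t / sqrt(2))) / 2."""
    return 0.5 * (1.0 + math.erf(t / math.sqrt(2.0)))


if __name__ == "__main__":
    print(binary_kl(0.5, 0.25))
    print(std_normal_cdf(0.0))
```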

The probability of an event

E

will be denoted by

P {E}

, and the expectation operator with respect to a probability distribution Q will be denoted by

E_{Q} [\cdot]

, where the subscript will often be omitted. The variance of a random variable X is denoted by

Var [X]

. The indicator function of an event

A

will be denoted by

𝟙 {A}

. The set

{1, 2, \dots, n}

will often be denoted by

[1 : n]

. For

x = (x_{1}, x_{2}, \dots, x_{n}) \in X^{n}

and for any

a \in X

, let us denote:

\begin{matrix} N (x; a) = \sum_{i = 1}^{n} 𝟙 {x_{i} = a} . \end{matrix}

(3)

For two non-negative sequences

a_{n}

and

b_{n}

, the sequence

A_{n} = n + a_{n}

is called asymmetric of exact order of

b_{n}

if there exists some

α > 0

, such that

{lim}_{n \to \infty} \frac{a_{n}}{b_{n}} = α

. Moreover, the sequence

A_{n} = n + a_{n}

is called asymmetric of order larger than

b_{n}

if

{lim}_{n \to \infty} \frac{a_{n}}{b_{n}} = \infty

.

3. Model, Protocol, and Objectives

Assume a set of

2 n

agents, and denote their assignment of initial opinions by

x_{0, n} \in {0, 1}^{2 n}

. The vector

x_{0, n}

is called the initial state. Denote the numbers of zeros and ones in

x_{0, n}

by

I_{0}

and

I_{1}

, respectively. At each round, each agent transmits its current state to all other agents. If a message sent between any pair of agents arrives, then it is assumed to be delivered correctly. Otherwise, if

x \in {0, 1}

is transmitted between a pair of agents but is lost, then the designated receiver receives the default symbol e. This assumption is made only to simplify the definitions that follow. For a sent message

x \in {0, 1}

and a received message

Y \in {0, e, 1}

, we assume that all message losses are statistically independent and identically distributed according to

P (Y = 0 | x = 0) = P (Y = 1 | x = 1) = 1 - q

and

P (Y = e | x = 0) = P (Y = e | x = 1) = q

, where

q \in [0, 1]

is the loss parameter of the network. The binary erasure channel is characterized by a similar conditional distribution, but note that the actual faults in our model are message losses, not to be confused with erasures, which are different kinds of faults. The two extreme cases of a reliable network (i.e., with

q = 0

) and a completely unreliable network (i.e., with

q = 1

) are of less interest, for obvious reasons; hence, we assume throughout that

q \in (0, 1)

.

At round

ℓ \geq 1

, the agent

i \in [1 : 2 n]

receives the (random) vector:

\begin{matrix} y_{ℓ}^{i} = (y_{ℓ}^{i} (1), y_{ℓ}^{i} (2), \dots, y_{ℓ}^{i} (i - 1), y_{ℓ}^{i} (i + 1), \dots, y_{ℓ}^{i} (2 n)) \in {0, e, 1}^{2 n - 1}, \end{matrix}

(4)

and for $a \in {0, 1}$, it calculates the enumerators:

\begin{matrix} N_{ℓ, i} (a) = 𝟙 {x_{ℓ - 1} (i) = a} + \sum_{j \neq i} 𝟙 {y_{ℓ}^{i} (j) = a} . \end{matrix}

(5)

In the SMP, each agent updates (note that we use this terminology even if the value of an agent does not change between two consecutive rounds) its value according to the more common value at hand, i.e., agent i chooses:

\begin{matrix} x_{ℓ} (i) = \begin{cases} 0 & \text{if } N_{ℓ, i} (0) > N_{ℓ, i} (1) \\ 1 & \text{if } N_{ℓ, i} (0) < N_{ℓ, i} (1) \\ x_{ℓ - 1} (i) & \text{if } N_{ℓ, i} (0) = N_{ℓ, i} (1) \end{cases} . \end{matrix}

(6)

The vector

x_{ℓ} \in {0, 1}^{2 n}

is called the state at the end of round ℓ.
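A single round of the protocol, i.e., the enumerators in (5) followed by the update rule in (6), can be sketched as follows. This is an illustrative simulation under the i.i.d. loss model above, not the authors' simulation code; the function name `smp_round` and the representation of the state as a list of 0/1 integers are assumptions.

```python
import random
from typing import List


def smp_round(state: List[int], q: float, rng: random.Random) -> List[int]:
    """Simulate one SMP round and return the state at the end of the round.

    state: current opinions (0 or 1) of all agents.
    q:     loss parameter; each message is lost independently with probability q.
    rng:   source of randomness, passed in so that runs are reproducible.
    """
    m = len(state)
    new_state = []
    for i in range(m):
        counts = [0, 0]
        counts[state[i]] += 1  # the agent's own opinion, first term of (5)
        for j in range(m):
            # The message from agent j to agent i arrives with probability 1 - q.
            if j != i and rng.random() >= q:
                counts[state[j]] += 1
        if counts[0] > counts[1]:
            new_state.append(0)
        elif counts[0] < counts[1]:
            new_state.append(1)
        else:
            new_state.append(state[i])  # tie: keep the current opinion
    return new_state


if __name__ == "__main__":
    rng = random.Random(0)
    initial = [0] * 50 + [1] * 50  # symmetric initial state, n = 50
    print(sum(smp_round(initial, 0.3, rng)), "agents hold opinion 1 after one round")
```

With q = 0 and a symmetric state, every agent counts n zeros and n ones (its own opinion included), so every agent sees a tie and the state does not change.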

A specific SMP defines a priori the number of rounds until termination. Let us denote by SMP

(r)

the SMP with r rounds of communication until termination. We say that the SMP

(r)

attains consensus if:

\begin{matrix} x_{r} (1) = x_{r} (2) = \dots = x_{r} (2 n), \end{matrix}

(7)

and denote this event by

C_{n}

. Similarly, we say that the SMP

(r)

attains majority consensus if the following holds:

\begin{matrix} I_{0} > I_{1} & \to x_{r} (1) = x_{r} (2) = \dots = x_{r} (2 n) = 0, \end{matrix}

(8)

\begin{matrix} I_{0} < I_{1} & \to x_{r} (1) = x_{r} (2) = \dots = x_{r} (2 n) = 1, \end{matrix}

(9)

\begin{matrix} I_{0} = I_{1} & \to x_{r} (1) = x_{r} (2) = \dots = x_{r} (2 n), \end{matrix}

(10)

and denote this event by

C_{n}^{m}

.

For a specific initial state

x_{0, n}

, the probability of error in achieving consensus is defined as

P_{e} (x_{0, n}) = P [C_{n}^{c}]

. The maximal error probability with respect to the initial state is defined by:

\begin{matrix} P_{e, m a x} = max_{x_{0, n} \in {0, 1}^{2 n}} P_{e} (x_{0, n}) . \end{matrix}

(11)

The error probability in achieving majority consensus is defined similarly and denoted

P_{e}^{m} (x_{0, n})

.

Now, the first objective of this work is to prove that the SMP requires only very few rounds of communication to attain consensus, with a maximal error probability that converges to 0 when

n \to \infty

. The second objective is to determine for which initial states it is possible to also achieve majority consensus with a small probability of error.
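The two success events defined in (7) and in (8)-(10) translate directly into predicates on the initial and final states. The sketch below is illustrative only; the function names are hypothetical, and `empirical_error` simply estimates the error probability in achieving consensus from a list of simulated final states supplied by the caller.

```python
from typing import List, Sequence


def attains_consensus(final_state: Sequence[int]) -> bool:
    """Event C_n of (7): all agents hold the same opinion after round r."""
    return len(set(final_state)) == 1


def attains_majority_consensus(
    initial_state: Sequence[int], final_state: Sequence[int]
) -> bool:
    """Event C_n^m of (8)-(10): consensus on the initial majority opinion.

    If the initial state is exactly symmetric, any consensus counts, as in (10).
    """
    if not attains_consensus(final_state):
        return False
    zeros = sum(1 for x in initial_state if x == 0)
    ones = len(initial_state) - zeros
    if zeros > ones:
        return final_state[0] == 0
    if zeros < ones:
        return final_state[0] == 1
    return True


def empirical_error(final_states: List[Sequence[int]]) -> float:
    """Fraction of simulated runs that miss consensus (estimates P[C_n^c])."""
    misses = sum(1 for s in final_states if not attains_consensus(s))
    return misses / len(final_states)
```

Note that the maximum in (11) ranges over all initial states, so a simulation over a handful of initial states can only give a lower estimate of the maximal error probability.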

4. Main Results

Our first main result is the following, which is proved in Section 5.1.

Theorem 1.

Let ${x_{0, n}}_{n \geq 1}$ be a sequence of initial states over $2 n$ agents. Assume that the $2 n$ agents communicate over a network with a loss parameter $q \in (0, 1)$. Then:

If ${x_{0, n}}_{n \geq 1}$ is asymmetric of order larger than $\sqrt{n}$ , the SMP $(1)$ attains $P [C_{n}^{m}] \overset{n \to \infty}{\to} 1$ .
If ${x_{0, n}}_{n \geq 1}$ is asymmetric of exact order of $\sqrt{n}$ , the SMP $(2)$ attains $P [C_{n}^{m}] \overset{n \to \infty}{\to} 1$ .
For any ${x_{0, n}}_{n \geq 1}$ , the SMP $(3)$ attains $P [C_{n}] \overset{n \to \infty}{\to} 1$ .

We now provide a short discussion on the results of Theorem 1.

Theorem 1 shows that the SMP requires at most three rounds of communication to attain consensus, in the limit of an infinite number of agents. Consensus on the majority cannot be ensured for all possible initial states, but only for those initial states that have a significant majority on one of the sides. To understand this fact better, consider the following special case. Assume a network with $2 n$ agents, such that $I_{0} = n + \log (n)$ and $I_{1} = n - \log (n)$. Since this majority in favor of the zeros is so weak, it is most likely that the random losses in the network will completely hide it; we expect that about half of the agents will have $N_{1, i} (0) > N_{1, i} (1)$, thus updating their current opinion to ‘0’, while the other half will update their current opinion to ‘1’. We conclude that the state at the end of round 1 is probabilistically equivalent to a sequence of $2 n$ fair coin tosses, and hence, with a probability of about one half, the majority at the end of round 1 will be different from the initial majority.

More quantitatively, let $I_{0} = n + a_{n}$ and $I_{1} = n - a_{n}$, where ${a_{n}}_{n \geq 1}$ is a non-negative, nondecreasing sequence. Moreover, for an agent with an initial opinion ‘0’, let $p_{n}$ denote the probability that such an agent updates its opinion to ‘0’ in the first round. Then, the following trichotomy emerges from the proof of Theorem 1.

Lemma 1.

The following trichotomy holds:

If ${lim}_{n \to \infty} \frac{a_{n}}{\sqrt{n}} = 0$ , then $p_{n} \overset{n \to \infty}{\to} \frac{1}{2}$ .
If ${lim}_{n \to \infty} \frac{a_{n}}{\sqrt{n}} = α \in (0, \infty)$ then $p_{n} \overset{n \to \infty}{\to} β (α, q) \in (\frac{1}{2}, 1)$ .
If ${lim}_{n \to \infty} \frac{a_{n}}{\sqrt{n}} = \infty$ , then $p_{n} \overset{n \to \infty}{\to} 1$ .
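A heuristic normal approximation, given here only as a sketch and not as the proof in the appendices, indicates where the three regimes of Lemma 1 come from. The auxiliary variables $Z_0$ and $Z_1$ (the numbers of zeros and ones that an agent with initial opinion ‘0’ receives in round 1) are introduced for illustration. The limiting argument coincides with the constant $t_0$ of Proposition 2 with $α$ as above, so identifying $β (α, q)$ with $Φ (t_0)$ is an assumption of this sketch rather than a statement taken from the proof.

```latex
% Agent i holds opinion 0; I_0 = n + a_n, I_1 = n - a_n, loss parameter q.
% Z_0 ~ Bin(n + a_n - 1, 1 - q) and Z_1 ~ Bin(n - a_n, 1 - q) are independent.
\begin{align*}
N_{1,i}(0) - N_{1,i}(1) &= 1 + Z_0 - Z_1, \\
\mathbb{E}\left[ 1 + Z_0 - Z_1 \right] &= 2 a_n (1 - q) + q, \\
\mathrm{Var}\left[ 1 + Z_0 - Z_1 \right] &= (2 n - 1) \, q (1 - q), \\
p_n &\approx \Phi\left( \frac{2 a_n (1 - q) + q}{\sqrt{(2 n - 1) \, q (1 - q)}} \right)
\xrightarrow{\; a_n / \sqrt{n} \,\to\, \alpha \;}
\Phi\left( \alpha \sqrt{\frac{2 (1 - q)}{q}} \right).
\end{align*}
```

The limit equals 1/2 for α = 0, lies strictly between 1/2 and 1 for finite α > 0, and tends to 1 as α grows without bound, in agreement with the three cases of the lemma.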

One of the most surprising facts, at least to the authors of this work, is the following. For highly symmetric initial states, although

p_{n} \overset{n \to \infty}{\to} \frac{1}{2}

(which is proved in Appendix C), it turns out (see Proposition 3 in Section 5.1) that after a single round of communication, the initial symmetry breaks equiprobably into one of the sides. Moreover, for the symmetric case of

I_{0} = I_{1} = n

, we prove in Propositions 3 and 4 that with a probability converging to 1, the state at the end of round 1 will be asymmetric of exact order of

\sqrt{n}

. Then, according to the second point in Lemma 1, the state at the end of round two is going to have a significant majority to one of the sides, and thus, according to the third point in Lemma 1, only one more round of communication is required to achieve consensus. If the initial state is already asymmetric of exact order of

\sqrt{n}

, then only two rounds of communication are needed for attaining consensus, and in this case, it is guaranteed (with high probability) that all agents agree on the initial majority opinion.

The phenomenon that the initial symmetry breaks into a sufficient majority after the first round is of key importance, since it is what makes the convergence of the SMP so rapid. In fact, we also conclude that the faulty communication between the agents even helps in attaining consensus, by breaking the symmetry in some extreme cases. For example, consider the case of $I_{0} = I_{1} = n$ and a reliable network (i.e., the case of $q = 0$). Then, every agent observes a tie in every round and keeps its own opinion, so the state at the end of any round remains symmetric ad infinitum. In contrast, when losses occur according to some $q \in (0, 1)$, this will not be the case, even if the loss probability is extremely small (but fixed for all n).

A significant difference exists between the first point of Theorem 1 and its last two points, which is the following. The first point of Theorem 1 is based on Proposition 1 in Section 5.1, which is mainly proved by using the Chernoff bound. Since the Chernoff bound is a nonasymptotic tool, we acquire a large-deviations result, i.e., for a given sequence

{a_{n}}_{n \geq 1}

(with the condition

{lim}_{n \to \infty} \frac{a_{n}}{\sqrt{n}} = \infty

), we propose a tight upper bound on

P_{e}^{m} (x_{0, n})

, which holds for any finite n (this tightness follows from the fact that a lower bound with a matching exponent can be derived as well). This result is obviously stronger than just

P [C_{n}^{m}] \overset{n \to \infty}{\to} 1

. On the other hand, the second and the third points of Theorem 1 are based on Propositions 2 and 3 in Section 5.1, respectively. Since the proofs of these propositions involve central limit theorems, we merely arrive at asymptotic results. As a consequence, we do not know at what rates the probabilities in the second and the third points of Theorem 1 converge to one.

Since the results of the second and the third points of Theorem 1 are merely asymptotic, a few words on finite-n effects are in order. We base the following observations on computer simulations of the SMP. Convergence to consensus in more than three rounds is definitely possible, but only when the initial state is symmetric or almost symmetric. The reason is the fact mentioned above, according to which the state at round 1 is probabilistically equivalent to a sequence of $2 n$ fair coin tosses, and hence, the probability that the state at round 1 is again symmetric behaves asymptotically as $1 / \sqrt{n}$ (upper and lower bounds can be derived using Stirling’s bounds on $n!$), which is not negligible at all, even for a relatively large number of agents. For relatively small values of n, we observed several realizations with more than one return to a fully symmetric state. Although quite rare, these events should be taken into consideration in practical implementations.
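The order of magnitude of this return probability is easy to check under the coin-toss idealization: the probability of exactly n heads in 2n fair tosses is C(2n, n) / 4^n, which Stirling's formula approximates by 1 / sqrt(πn). The sketch below only evaluates this idealized quantity; it does not simulate the SMP, and the function name is hypothetical.

```python
import math


def prob_exactly_symmetric(n: int) -> float:
    """P{exactly n heads in 2n fair coin tosses} = C(2n, n) / 4**n."""
    return math.comb(2 * n, n) / 4 ** n


if __name__ == "__main__":
    for n in (10, 100, 1000, 10000):
        exact = prob_exactly_symmetric(n)
        stirling = 1.0 / math.sqrt(math.pi * n)
        print(f"n={n}: exact={exact:.6f}, 1/sqrt(pi*n)={stirling:.6f}")
```

Even for n = 10000 (20,000 agents), the value is still on the order of half a percent.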

All the results provided in Theorem 1 are, in fact, achievability results, i.e., they only state the conditions under which consensus can be attained. Hence, it is worth investigating whether consensus may be attained by the SMP with even fewer communication rounds than required in Theorem 1. In the following result, which is the second main result of this work and is proved in Section 5.2, we show that for highly symmetric initial states, three rounds of communication are not only sufficient, but also necessary.

Theorem 2.

Let ${x_{0, n}}_{n \geq 1}$ be a sequence of symmetric initial states over $2 n$ agents, i.e., $N (x_{0, n}; 0) = N (x_{0, n}; 1) = n$ for all n. Assume that the $2 n$ agents communicate over a network with a loss parameter $q \in (0, 1)$. Then, the SMP $(2)$ attains $P [C_{n}] \overset{n \to \infty}{\to} 0$.

While Theorem 2 provides a converse result with regard to the third point of Theorem 1, a similar converse result can also be established with regard to the second point of Theorem 1. If the initial state is asymmetric of exact order of

\sqrt{n}

, then the SMP will likely not attain consensus after only a single round of communication, and furthermore, the probability of reaching consensus will tend to 0 as

n \to \infty

. We omit the proof of this negative result.

5. Proofs

5.1. Proof of Theorem 1

The first point of Theorem 1 is proved via the following result, which is proved in Appendix A.

Proposition 1.

Let

{A_{n}}_{n = 1}^{\infty}

be a sequence such that

{lim}_{n \to \infty} \frac{A_{n}}{\sqrt{n}} = \infty

. For an initial state

x_{0, n} \in {0, 1}^{2 n}

with at least

n + A_{n}

zeros or at least

n + A_{n}

ones and a channel parameter

q \in [0, 1)

, the SMP

(1)

attains

P [C_{n}^{m}] \overset{n \to \infty}{\to} 1

. Specifically, if

{lim}_{n \to \infty} \frac{A_{n}}{n} < 1

, then:

\begin{matrix} P_{e}^{m} (x_{0, n}) \leq 2 n \sqrt{\frac{n + A_{n}}{n - A_{n}}} \cdot exp \{- (1 - q) \cdot \frac{A_{n}^{2}}{n}\} . \end{matrix}

(12)
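To get a feel for the bound in (12), its right-hand side can be evaluated for concrete parameters. The following sketch does nothing more than that; the function name `majority_error_bound` and the sample values of n, A_n, and q are chosen for illustration.

```python
import math


def majority_error_bound(n: int, a_n: float, q: float) -> float:
    """Right-hand side of (12): 2n * sqrt((n + A_n)/(n - A_n)) * exp(-(1 - q) A_n^2 / n)."""
    if not 0 < a_n < n:
        raise ValueError("A_n must satisfy 0 < A_n < n")
    if not 0.0 <= q < 1.0:
        raise ValueError("q must lie in [0, 1)")
    prefactor = 2 * n * math.sqrt((n + a_n) / (n - a_n))
    return prefactor * math.exp(-(1.0 - q) * a_n ** 2 / n)


if __name__ == "__main__":
    n, q = 10_000, 0.5
    for a_n in (n ** 0.5, n ** 0.75):
        print(f"A_n={a_n:.0f}: bound={majority_error_bound(n, a_n, q):.3e}")
```

For these sample values, the bound is far below 1 when A_n = n^{3/4} but exceeds 1, and is therefore uninformative, when A_n = sqrt(n); the case of exact order sqrt(n) is the one treated separately in the second point of Theorem 1.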

To prove the second point of Theorem 1, we rely on the following result, which is proved in Appendix B.

Proposition 2.

Let $q \in [0, 1)$ be a channel parameter. Let $α > 0$ be fixed and let $0 < ϵ < Φ (t_{0}) - \frac{1}{2}$, where $t_{0} = \sqrt{2 α^{2} (1 - q) / q}$. Then, the SMP $(1)$ attains the following.

If $x_{0, n} \in {0, 1}^{2 n}$ has at least $n + α \sqrt{n}$ zeros, then:

$\begin{matrix} P \{N (X_{1}; 0) \geq 2 n (Φ (t_{0}) - ϵ)\} \overset{n \to \infty}{\to} 1 . \end{matrix}$

(13)
If $x_{0, n} \in {0, 1}^{2 n}$ has at least $n + α \sqrt{n}$ ones, then

$\begin{matrix} P \{N (X_{1}; 1) \geq 2 n (Φ (t_{0}) - ϵ)\} \overset{n \to \infty}{\to} 1 . \end{matrix}$

(14)

Then, combining the results of Propositions 1 and 2 using the law of total probability, the second point of Theorem 1 follows immediately.
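As a numerical illustration of Proposition 2, the sketch below evaluates the limiting fraction Φ(t_0). It assumes that Φ denotes the standard normal cumulative distribution function and that q > 0, so that t_0 is finite; the function names are illustrative.

```python
import math

def std_normal_cdf(t):
    """Standard normal CDF, expressed through the error function."""
    return 0.5 * (1.0 + math.erf(t / math.sqrt(2.0)))

def prop2_fraction(alpha, q):
    """Phi(t_0) with t_0 = sqrt(2 alpha^2 (1 - q) / q), for q in (0, 1)."""
    t0 = math.sqrt(2.0 * alpha ** 2 * (1.0 - q) / q)
    return std_normal_cdf(t0)

fraction = prop2_fraction(alpha=1.0, q=0.5)   # t_0 = sqrt(2)
```

For α = 1 and q = 1/2 this gives roughly 0.92, i.e., under these assumptions Proposition 2 says that, with probability tending to 1, at least a fraction Φ(t_0) - ϵ of the 2n agents hold the majority opinion after one round.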

To prove the third point of Theorem 1, we provide one more result. The following proposition shows that if the initial state is symmetric, then the state at round one will be asymmetric of order at least

\sqrt{n}

. This result is proved in Appendix C.

Proposition 3.

Let $x_{0, n} \in \{0, 1\}^{2 n}$ be an initial state with n zeros and n ones and let $q \in (0, 1)$ be a channel parameter. Let $ϵ > 0$ be given. Then, there exist $δ = δ (ϵ)$ with $δ (ϵ) \overset{ϵ \to 0}{\to} 0$ and $M (ϵ)$, such that for all $n \geq M (ϵ)$,

\begin{matrix} P \{{N (X_{1}; 0) \leq n - δ \sqrt{n}} \cup {N (X_{1}; 0) \geq n + δ \sqrt{n}}\} \geq 1 - ϵ . \end{matrix}

(15)

We are now able to prove the third point of Theorem 1. Let

ϵ_{1}, ϵ_{3} > 0

be given, and let

δ

be as in Proposition 3 corresponding to

ϵ_{3}

. Also, let

t_{0} = \sqrt{2 δ^{2} (1 - q) / q}

, choose

ϵ_{2} > 0

such that

Φ (t_{0}) - ϵ_{2} > 1 / 2

, and denote

β = 2 (Φ (t_{0}) - ϵ_{2}) - 1

. Define the following events:

\begin{matrix} A_{n} = \{N (X_{1}; 0) \leq n - δ \sqrt{n} or N (X_{1}; 0) \geq n + δ \sqrt{n}\}, \end{matrix}

(16)

and

\begin{matrix} B_{n} = \{N (X_{2}; 0) \leq (1 - β) n or N (X_{2}; 0) \geq (1 + β) n\} . \end{matrix}

(17)

Then, consider the following:

\begin{matrix} P {C_{n}} & = P {N (X_{3}; 0) = 0 or N (X_{3}; 0) = 2 n} \\ = P {N (X_{3}; 0) = 0 or N (X_{3}; 0) = 2 n | B_{n}} \cdot P {B_{n}} \end{matrix}

(18)

\begin{matrix} + P {N (X_{3}; 0) = 0 or N (X_{3}; 0) = 2 n | B_{n}^{c}} \cdot P {B_{n}^{c}} \end{matrix}

(19)

\begin{matrix} \geq P {N (X_{3}; 0) = 0 or N (X_{3}; 0) = 2 n | B_{n}} \cdot P {B_{n}} \end{matrix}

(20)

\begin{matrix} \geq (1 - ϵ_{1}) \cdot P {B_{n}}, \end{matrix}

(21)

where (19) follows from the law of total probability and (21) holds for all large enough n, due to Proposition 1. Furthermore,

\begin{matrix} P {B_{n}} & = P {B_{n} | A_{n}} \cdot P {A_{n}} + P {B_{n} | A_{n}^{c}} \cdot P {A_{n}^{c}} \end{matrix}

(22)

\begin{matrix} \geq P {B_{n} | A_{n}} \cdot P {A_{n}} \end{matrix}

(23)

\begin{matrix} \geq (1 - ϵ_{2}) \cdot P {A_{n}} \end{matrix}

(24)

\begin{matrix} \geq (1 - ϵ_{2}) \cdot (1 - ϵ_{3}), \end{matrix}

(25)

where (22) is again due to the law of total probability, (24) follows from Proposition 2 for all n sufficiently large, and (25) follows from Proposition 3, also for all n sufficiently large. Substituting (25) back into (21), we conclude that

P {C_{n}}

can be made arbitrarily close to 1, which implies the result in the third point in Theorem 1.
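For concreteness, the final substitution can be written out as follows. The last step uses the elementary inequality $(1 - a)(1 - b)(1 - c) \geq 1 - a - b - c$ for $a, b, c \in [0, 1]$, which is a standard fact and not a statement taken from the paper.

```latex
\begin{align*}
P\{C_{n}\} &\geq (1 - \epsilon_{1}) \cdot P\{B_{n}\} \\
           &\geq (1 - \epsilon_{1}) (1 - \epsilon_{2}) (1 - \epsilon_{3}) \\
           &\geq 1 - \epsilon_{1} - \epsilon_{2} - \epsilon_{3},
\end{align*}
```

for all sufficiently large n.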

5.2. Proof of Theorem 2

The following proposition, which is proved in Appendix D, shows that if the initial state is symmetric, then the state at round one cannot be asymmetric of order larger than

\sqrt{n}

.

Proposition 4.

Let $\{B_{n}\}_{n = 1}^{\infty}$ be a sequence such that $\lim_{n \to \infty} \frac{B_{n}}{\sqrt{n}} = \infty$. For an initial state $x_{0, n} \in \{0, 1\}^{2 n}$ with n zeros and n ones and a channel parameter $q \in (0, 1)$, the following holds:

\begin{matrix} P \{{N (X_{1}; 0) \leq n - B_{n}} \cup {N (X_{1}; 0) \geq n + B_{n}}\} \leq 2 exp \{- \frac{B_{n}^{2}}{n}\} . \end{matrix}

(26)

We also have the following result, which is proved in Appendix E.

Proposition 5.

Let $\{C_{n}\}_{n = 1}^{\infty}$ be a sequence such that $\lim_{n \to \infty} \frac{C_{n}}{n} = 0$. Let $x_{0, n} \in \{0, 1\}^{2 n}$ be an initial state with $n + C_{n}$ zeros or $n + C_{n}$ ones. Let $q \in (0, 1)$ be a channel parameter and denote the constant $f_{q} = 32 / \min \{q, 1 - q\}$. Then, the SMP $(1)$ is characterized by:

\begin{matrix} P {C_{n}} \leq exp \{- C_{n}^{2} \cdot exp \{- f_{q} \cdot \frac{C_{n}^{2}}{n - C_{n}}\}\} . \end{matrix}

(27)

We are now in a good position to prove Theorem 2. Let

C (q) = \frac{1}{2} f_{q}^{- 1}

, choose the sequence:

\begin{matrix} Θ_{n} = \sqrt{C (q) n log (n)}, \end{matrix}

(28)

and define the sequence of events:

\begin{matrix} F_{n} = {N (X_{1}; 0) \leq n - Θ_{n}} \cup {N (X_{1}; 0) \geq n + Θ_{n}} . \end{matrix}

(29)

According to Proposition 4, we have that:

\begin{matrix} P \{F_{n}\} & \leq 2 exp \{- \frac{C (q) n log (n)}{n}\} \end{matrix}

(30)

\begin{matrix} = 2 exp \{- C (q) log (n)\} \end{matrix}

(31)

\begin{matrix} = \frac{2}{n^{C (q)}}, \end{matrix}

(32)

which converges to zero as

n \to \infty

. In addition, it follows from Proposition 5 that:

\begin{matrix} P {C_{n} | F_{n}^{c}} & \leq exp \{- C (q) n log (n) \cdot exp \{- f_{q} \cdot \frac{C (q) n log (n)}{n - \sqrt{C (q) n log (n)}}\}\} \end{matrix}

(33)

\begin{matrix} \leq exp \{- C (q) n log (n) \cdot exp \{- \frac{\frac{1}{2} n log (n)}{n - \frac{1}{2} n}\}\} \end{matrix}

(34)

\begin{matrix} = exp \{- C (q) n log (n) \cdot exp \{- log (n)\}\} \end{matrix}

(35)

\begin{matrix} = exp \{- C (q) n log (n) n^{- 1}\} \end{matrix}

(36)

\begin{matrix} = exp \{- C (q) log (n)\} \end{matrix}

(37)

\begin{matrix} = \frac{1}{n^{C (q)}}, \end{matrix}

(38)

where (34) holds for all large enough n. Then, consider the following:

\begin{matrix} P {C_{n}} & = P {C_{n} | F_{n}} \cdot P {F_{n}} + P {C_{n} | F_{n}^{c}} \cdot P {F_{n}^{c}} \end{matrix}

(39)

\begin{matrix} \leq P {F_{n}} + P {C_{n} | F_{n}^{c}} \end{matrix}

(40)

\begin{matrix} \leq \frac{2}{n^{C (q)}} + \frac{1}{n^{C (q)}} \end{matrix}

(41)

\begin{matrix} = \frac{3}{n^{C (q)}} \overset{n \to \infty}{\to} 0, \end{matrix}

(42)

where (39) is due to the law of total probability and (41) follows from (32) and (38). The proof of Theorem 2 is complete.
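The final bound in (42) can be evaluated numerically to see how slowly it decays. The sketch below simply plugs in the constants defined above; the function name `theorem2_bound` and the sample values are illustrative.

```python
import math

def theorem2_bound(n, q):
    """Evaluate 3 / n**C(q), with C(q) = 1 / (2 f_q) and f_q = 32 / min(q, 1 - q)."""
    f_q = 32.0 / min(q, 1.0 - q)
    c_q = 0.5 / f_q
    return 3.0 / n ** c_q

moderate_n = theorem2_bound(1e6, 0.5)    # still above 1, hence vacuous
huge_n = theorem2_bound(1e70, 0.5)       # finally below 1
```

For q = 1/2 we have C(q) = 1/128, so under this evaluation the bound drops below 1 only for n larger than roughly 10^{61}. The result should therefore be read as an asymptotic statement; it says little about the consensus probability at practical network sizes.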

Author Contributions

All authors contributed equally to this research work. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Proposition 1

Due to symmetry, we only analyze the case

I_{0} > I_{1}

. It follows from the union bound that:

\begin{matrix} P_{e}^{m} (x_{0, n}) & = P \{⋃_{i = 1}^{2 n} {X_{1} (i) = 1}\} \end{matrix}

(A1)

\begin{matrix} \leq \sum_{i = 1}^{2 n} P \{X_{1} (i) = 1\} . \end{matrix}

(A2)

In the following, let us denote by

Ber (p)

a Bernoulli random variable with a success probability p and by

Bin (n, p)

a binomial random variable with n independent experiments, each one with a success probability p. We adopt the following convention: if an event contains at least two binomial random variables, then we assume that they are statistically independent.

Let us denote

q^{'} = 1 - q

. If an agent starts with a ‘0’, then the probability to decide in favor of ‘1’ is upper-bounded by:

\begin{matrix} P \{Bin (n - A_{n}, q^{'}) \geq Bin (n + A_{n} - 1, q^{'}) + 1 + 1\} \end{matrix}

(A3)

\begin{matrix} \leq P \{Bin (n - A_{n}, q^{'}) \geq Bin (n + A_{n} - 1, q^{'}) + Bin (1, q^{'})\} \end{matrix}

(A4)

\begin{matrix} = P \{Bin (n - A_{n}, q^{'}) \geq Bin (n + A_{n}, q^{'})\}, \end{matrix}

(A5)

where the addition of the second 1 in (A3) follows from the need to strictly break the tie to adopt ‘1’ and (A4) is due to the fact that

Bin (1, q^{'}) \leq 1 < 2

with probability one.

If an agent starts with a ‘1’, then the probability to decide ‘1’ is upper-bounded by:

\begin{matrix} P \{Bin (n - A_{n} - 1, q^{'}) + 1 \geq Bin (n + A_{n}, q^{'})\} \\ \leq P \{Bin (n - A_{n}, q^{'}) + 1 \geq Bin (n + A_{n}, q^{'})\} . \end{matrix}

(A6)

Since (A6) cannot be smaller than (A5), we continue with (A6). From now on, we prove that the probability in (A6), to be denoted by

P_{n}

, converges to zero as

n \to \infty

. Let:

\begin{matrix} X_{n} = \sum_{ℓ = 1}^{n - A_{n}} I_{ℓ}, Y_{n} = \sum_{k = 1}^{n + A_{n}} J_{k}, \end{matrix}

(A7)

where

I_{ℓ} \sim Ber (q^{'})

, for all

ℓ \in {1, 2, \dots, n - A_{n}}

,

J_{k} \sim Ber (q^{'})

, for all

k \in {1, 2, \dots, n + A_{n}}

, and all of these binary random variables are independent. Now:

\begin{matrix} P_{n} & = P {X_{n} + 1 \geq Y_{n}} \end{matrix}

(A8)

\begin{matrix} = P \{e^{λ (X_{n} - Y_{n} + 1)} \geq 1\} \end{matrix}

(A9)

\begin{matrix} \leq E [e^{λ (X_{n} - Y_{n} + 1)}], \end{matrix}

(A10)

where (A10) is due to Markov’s inequality. Since (A10) holds for every

λ \geq 0

, it follows that:

\begin{matrix} P_{n} \leq inf_{λ \geq 0} E [e^{λ (X_{n} - Y_{n} + 1)}] . \end{matrix}

(A11)

We obtain that:

\begin{matrix} E [e^{λ (X_{n} - Y_{n} + 1)}] & = e^{λ} \cdot E [exp \{λ (\sum_{ℓ = 1}^{n - A_{n}} I_{ℓ} - \sum_{k = 1}^{n + A_{n}} J_{k})\}] \end{matrix}

(A12)

\begin{matrix} = e^{λ} \cdot E [\prod_{ℓ = 1}^{n - A_{n}} e^{λ I_{ℓ}} \cdot \prod_{k = 1}^{n + A_{n}} e^{- λ J_{k}}] \end{matrix}

(A13)

\begin{matrix} = e^{λ} \cdot \prod_{ℓ = 1}^{n - A_{n}} E [e^{λ I_{ℓ}}] \cdot \prod_{k = 1}^{n + A_{n}} E [e^{- λ J_{k}}] \end{matrix}

(A14)

\begin{matrix} = e^{λ} \cdot {[1 + q^{'} (e^{λ} - 1)]}^{n - A_{n}} \cdot {[1 + q^{'} (e^{- λ} - 1)]}^{n + A_{n}} \end{matrix}

(A15)

\begin{matrix} \leq e^{λ} \cdot {[exp {q^{'} (e^{λ} - 1)}]}^{n - A_{n}} \cdot {[exp {q^{'} (e^{- λ} - 1)}]}^{n + A_{n}} \end{matrix}

(A16)

\begin{matrix} = exp \{λ + q^{'} (e^{λ} - 1) (n - A_{n}) + q^{'} (e^{- λ} - 1) (n + A_{n})\}, \end{matrix}

(A17)

where (A14) is due to the independence of all binary random variables and (A16) follows from the inequality

1 + x \leq e^{x}

. Upon defining:

\begin{matrix} f (λ) = λ + q^{'} (e^{λ} - 1) (n - A_{n}) + q^{'} (e^{- λ} - 1) (n + A_{n}), \end{matrix}

(A18)

we find that

\begin{matrix} f^{'} (λ) = 1 + q^{'} e^{λ} (n - A_{n}) - q^{'} e^{- λ} (n + A_{n}) . \end{matrix}

(A19)

To facilitate expressions, we solve for

f^{'} (λ) = 1

and find that:

\begin{matrix} e^{λ^{*}} = \sqrt{\frac{n + A_{n}}{n - A_{n}}} . \end{matrix}

(A20)

Substituting it back into (A17) yields that:

\begin{matrix} P_{n} & \leq exp \{λ^{*} + q^{'} (e^{λ^{*}} - 1) (n - A_{n}) + q^{'} (e^{- λ^{*}} - 1) (n + A_{n})\} \end{matrix}

(A21)

\begin{matrix} = \sqrt{\frac{n + A_{n}}{n - A_{n}}} \cdot exp \{q^{'} (\sqrt{\frac{n + A_{n}}{n - A_{n}}} - 1) (n - A_{n}) + q^{'} (\sqrt{\frac{n - A_{n}}{n + A_{n}}} - 1) (n + A_{n})\} \end{matrix}

(A22)

\begin{matrix} = \sqrt{\frac{n + A_{n}}{n - A_{n}}} \cdot exp \{q^{'} [\sqrt{(n + A_{n}) (n - A_{n})} - n + A_{n}]\} \\ \times exp \{q^{'} [\sqrt{(n - A_{n}) (n + A_{n})} - n - A_{n}]\} \end{matrix}

(A23)

\begin{matrix} = \sqrt{\frac{n + A_{n}}{n - A_{n}}} \cdot exp \{2 q^{'} (\sqrt{n^{2} - A_{n}^{2}} - n)\} . \end{matrix}

(A24)

Consider the following:

\begin{matrix} \sqrt{n^{2} - A_{n}^{2}} - n & = \sqrt{n^{2} (1 - \frac{A_{n}^{2}}{n^{2}})} - n \end{matrix}

(A25)

\begin{matrix} = n \sqrt{1 - \frac{A_{n}^{2}}{n^{2}}} - n \end{matrix}

(A26)

\begin{matrix} \leq n (1 - \frac{A_{n}^{2}}{2 n^{2}}) - n \end{matrix}

(A27)

\begin{matrix} = - \frac{A_{n}^{2}}{2 n}, \end{matrix}

(A28)

where (A27) follows from the inequality

\sqrt{1 - t} \leq 1 - t / 2

. Continuing from (A2), we arrive at

\begin{matrix} P_{e}^{m} (x_{0, n}) & \leq 2 n \sqrt{\frac{n + A_{n}}{n - A_{n}}} \cdot exp \{- (1 - q) \cdot \frac{A_{n}^{2}}{n}\}, \end{matrix}

(A29)

which converges to zero when

n \to \infty

, as long as

{lim}_{n \to \infty} \frac{A_{n}}{\sqrt{n}} = \infty

and

{lim}_{n \to \infty} \frac{A_{n}}{n} < 1

.
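The algebra leading from (A18) to (A28) can be sanity-checked numerically. The sketch below evaluates f at the choice (A20) and compares it with the closed form in (A24) and with the relaxed exponent obtained via (A28); the sample values of n, A_n and q are arbitrary illustrative choices.

```python
import math

def chernoff_exponent(lam, n, a_n, q):
    """f(lambda) from (A18), with q' = 1 - q."""
    qp = 1.0 - q
    return (lam
            + qp * (math.exp(lam) - 1) * (n - a_n)
            + qp * (math.exp(-lam) - 1) * (n + a_n))

def lambda_star(n, a_n):
    """The choice (A20): exp(lambda*) = sqrt((n + A_n) / (n - A_n))."""
    return 0.5 * math.log((n + a_n) / (n - a_n))

n, a_n, q = 10_000, 500, 0.3
lam = lambda_star(n, a_n)
exact = chernoff_exponent(lam, n, a_n, q)
# Logarithm of the right-hand side of (A24).
closed_form = lam + 2 * (1 - q) * (math.sqrt(n ** 2 - a_n ** 2) - n)
# Logarithm of the per-agent bound after applying (A28).
relaxed = lam - (1 - q) * a_n ** 2 / n
```

Here `exact` and `closed_form` agree up to rounding, and `closed_form` does not exceed `relaxed`, as the inequality in (A27) requires.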

For the case of

{lim}_{n \to \infty} \frac{A_{n}}{n} = 1

, consider the following. Let

{A_{n}}_{n = 1}^{\infty}

be any sequence with

{lim}_{n \to \infty} \frac{A_{n}}{n} = 1

and let

{A_{n}^{'}}_{n = 1}^{\infty}

be a sequence with

{lim}_{n \to \infty} \frac{A_{n}^{'}}{n} = α

, for

α \in (0, 1)

. Then, for sufficiently large n,

A_{n} \geq A_{n}^{'}

, and thus, it follows that:

\begin{matrix} P \{Bin (n - A_{n}, q^{'}) + 1 \geq Bin (n + A_{n}, q^{'})\} \\ \leq P \{Bin (n - A_{n}^{'}, q^{'}) + 1 \geq Bin (n + A_{n}^{'}, q^{'})\} \overset{n \to \infty}{\to} 0, \end{matrix}

(A30)

which completes the proof of Proposition 1.

Appendix B. Proof of Proposition 2

Step 1: The Limit of the Probability to Decide ‘1’

If an agent starts with a ‘1’, then the probability to decide in favor of ‘1’ is given by:

\begin{matrix} P \{Bin (n - α \sqrt{n} - 1, q^{'}) + 1 \geq Bin (n + α \sqrt{n}, q^{'})\}, \end{matrix}

(A31)

and if an agent starts with a ‘0’, then the probability to decide in favor of ‘1’ is given by

\begin{matrix} P \{Bin (n - α \sqrt{n}, q^{'}) \geq Bin (n + α \sqrt{n} - 1, q^{'}) + 2\} \\ = P \{Bin (n - α \sqrt{n}, q^{'}) \geq Bin (n, q^{'}) + Bin (α \sqrt{n} - 1, q^{'}) + 2\} . \end{matrix}

(A32)

From now on, we prove that the probability in (A32), to be denoted by $P_{n}$, converges to a limit that is strictly smaller than $\frac{1}{2}$, so that $P_{n} < \frac{1}{2}$ for all sufficiently large n. An identical result also holds for the probability in (A31), the proof of which is very similar and hence omitted.

Let

I_{ℓ} \sim Ber (q^{'})

, for all

ℓ \in {1, 2, \dots, n - α \sqrt{n}}

,

J_{s} \sim Ber (q^{'})

, for all

s \in {1, 2, \dots, n}

, as well as

K_{m} \sim Ber (q^{'})

, for all

m \in {1, 2, \dots, α \sqrt{n} - 1}

, and all of these binary random variables are independent. Consider the following:

\begin{matrix} P_{n} & = P \{\sum_{ℓ = 1}^{n - α \sqrt{n}} I_{ℓ} \geq \sum_{s = 1}^{n} J_{s} + \sum_{m = 1}^{α \sqrt{n} - 1} K_{m} + 2\} \end{matrix}

(A33)

\begin{matrix} = P \{\sum_{ℓ = 1}^{n - α \sqrt{n}} I_{ℓ} - (n - α \sqrt{n} + α \sqrt{n}) q^{'} \geq \sum_{s = 1}^{n} J_{s} - n q^{'} + \sum_{m = 1}^{α \sqrt{n} - 1} K_{m} + 2\} \end{matrix}

(A34)

\begin{matrix} = P \{\sum_{ℓ = 1}^{n - α \sqrt{n}} (I_{ℓ} - q^{'}) - α \sqrt{n} q^{'} \geq \sum_{s = 1}^{n} (J_{s} - q^{'}) + \sum_{m = 1}^{α \sqrt{n} - 1} K_{m} + 2\} \end{matrix}

(A35)

\begin{matrix} = P \{\frac{1}{\sqrt{n}} \sum_{ℓ = 1}^{n - α \sqrt{n}} (I_{ℓ} - q^{'}) \geq \frac{1}{\sqrt{n}} \sum_{s = 1}^{n} (J_{s} - q^{'}) + \frac{1}{\sqrt{n}} (\sum_{m = 1}^{α \sqrt{n} - 1} K_{m} + 2) + α q^{'}\} . \end{matrix}

(A36)

Let us denote:

\begin{matrix} X_{n} = \frac{1}{\sqrt{n}} \sum_{ℓ = 1}^{n - α \sqrt{n}} (I_{ℓ} - q^{'}), Y_{n} = \frac{1}{\sqrt{n}} \sum_{s = 1}^{n} (J_{s} - q^{'}), Z_{n} = \frac{1}{\sqrt{n}} (\sum_{m = 1}^{α \sqrt{n} - 1} K_{m} + 2) . \end{matrix}

(A37)

It follows directly from the central limit theorem [46] (p. 112, Theorem 2.4.1.) that

Y_{n}

converges in distribution to

Y \sim N (0, σ^{2})

, where

σ^{2} = q^{'} (1 - q^{'})

. Concerning the sequence

X_{n}

, we first write it as follows:

\begin{matrix} X_{n} & = \frac{1}{\sqrt{n}} \sum_{ℓ = 1}^{n - α \sqrt{n}} (I_{ℓ} - q^{'}) \end{matrix}

(A38)

\begin{matrix} = \frac{\sqrt{n - α \sqrt{n}}}{\sqrt{n}} \cdot \frac{1}{\sqrt{n - α \sqrt{n}}} \sum_{ℓ = 1}^{n - α \sqrt{n}} (I_{ℓ} - q^{'}) \end{matrix}

(A39)

\begin{matrix} \overset{▵}{=} \frac{\sqrt{n - α \sqrt{n}}}{\sqrt{n}} {\tilde{X}}_{n}, \end{matrix}

(A40)

where

{\tilde{X}}_{n}

converges in distribution to

X \sim N (0, σ^{2})

, again, from the central limit theorem. To conclude that

X_{n}

itself converges in distribution to

X \sim N (0, σ^{2})

, we only need to prove that

| {\tilde{X}}_{n} - X_{n} |

converges in distribution to 0. We have that:

\begin{matrix} lim_{n \to \infty} E {[{\tilde{X}}_{n} - X_{n}]}^{2} & = lim_{n \to \infty} E {[(1 - \frac{\sqrt{n - α \sqrt{n}}}{\sqrt{n}}) \cdot \frac{1}{\sqrt{n - α \sqrt{n}}} \sum_{ℓ = 1}^{n - α \sqrt{n}} (I_{ℓ} - q^{'})]}^{2} \end{matrix}

(A41)

\begin{matrix} = lim_{n \to \infty} {(1 - \frac{\sqrt{n - α \sqrt{n}}}{\sqrt{n}})}^{2} \cdot \frac{1}{n - α \sqrt{n}} \cdot E {[\sum_{ℓ = 1}^{n - α \sqrt{n}} (I_{ℓ} - q^{'})]}^{2} \end{matrix}

(A42)

\begin{matrix} = lim_{n \to \infty} {(1 - \frac{\sqrt{n - α \sqrt{n}}}{\sqrt{n}})}^{2} \cdot \frac{1}{n - α \sqrt{n}} \cdot \sum_{ℓ = 1}^{n - α \sqrt{n}} E [{(I_{ℓ} - q^{'})}^{2}] \end{matrix}

(A43)

\begin{matrix} = lim_{n \to \infty} {(1 - \frac{\sqrt{n - α \sqrt{n}}}{\sqrt{n}})}^{2} \cdot E [{(I_{1} - q^{'})}^{2}] \end{matrix}

(A44)

\begin{matrix} = 0, \end{matrix}

(A45)

which proves that

| {\tilde{X}}_{n} - X_{n} |

converges in

L^{2}

to 0, thus also in distribution. It then follows from [47] (Theorem 3.1) that

X_{n}

converges in distribution to

X \sim N (0, σ^{2})

.

Concerning the sequence

Z_{n}

, consider the following:

\begin{matrix} lim_{n \to \infty} E [Z_{n}] & = lim_{n \to \infty} E [\frac{1}{\sqrt{n}} (\sum_{m = 1}^{α \sqrt{n} - 1} K_{m} + 2)] \end{matrix}

(A46)

\begin{matrix} = lim_{n \to \infty} \frac{1}{\sqrt{n}} (\sum_{m = 1}^{α \sqrt{n} - 1} q^{'} + 2) \end{matrix}

(A47)

\begin{matrix} = α q^{'}, \end{matrix}

(A48)

and furthermore

\begin{matrix} lim_{n \to \infty} Var [Z_{n}] & = lim_{n \to \infty} Var [\frac{1}{\sqrt{n}} (\sum_{m = 1}^{α \sqrt{n} - 1} K_{m} + 2)] \end{matrix}

(A49)

\begin{matrix} = lim_{n \to \infty} \frac{1}{n} \sum_{m = 1}^{α \sqrt{n} - 1} q^{'} (1 - q^{'}) \end{matrix}

(A50)

\begin{matrix} = 0 . \end{matrix}

(A51)

It follows that

Z_{n}

converges in

L^{2}

to

Z = α q^{'}

, i.e., a deterministic random variable. Hence,

Z_{n}

also converges to

Z = α q^{'}

in probability [46] (Lemma 1.3.5). Now, for

ϵ > 0

arbitrarily small, consider the following:

\begin{matrix} P_{n} & = P \{X_{n} \geq Y_{n} + Z_{n} + α q^{'}\} \\ = P \{X_{n} \geq Y_{n} + Z_{n} + α q^{'} | Z_{n} \geq α q^{'} - ϵ\} P \{Z_{n} \geq α q^{'} - ϵ\} \end{matrix}

(A52)

\begin{matrix} + P \{X_{n} \geq Y_{n} + Z_{n} + α q^{'} | Z_{n} < α q^{'} - ϵ\} P \{Z_{n} < α q^{'} - ϵ\} \end{matrix}

(A53)

\begin{matrix} \leq P \{X_{n} \geq Y_{n} + α q^{'} - ϵ + α q^{'} | Z_{n} \geq α q^{'} - ϵ\} P \{Z_{n} \geq α q^{'} - ϵ\} + P \{Z_{n} < α q^{'} - ϵ\} \end{matrix}

(A54)

\begin{matrix} = P \{X_{n} \geq Y_{n} + 2 α q^{'} - ϵ\} P \{Z_{n} \geq α q^{'} - ϵ\} + P \{Z_{n} < α q^{'} - ϵ\}, \end{matrix}

(A55)

where (A53) is due to the law of total probability and (A55) follows from the fact that

(X_{n}, Y_{n})

are independent of

Z_{n}

. Since

{I_{ℓ}}

and

{J_{s}}

are all independent, the joint law of the pair

(X_{n}, Y_{n})

converges to the joint law of

(X, Y)

and

X, Y

are independent. Hence, by Portmanteau’s theorem [47] (p. 16, Theorem 2.1), and the fact that

Z_{n}

converges to

Z = α q^{'}

in probability:

\begin{matrix} \underset{n \to \infty}{lim sup} P_{n} & \leq P \{X - Y \geq 2 α q^{'} - ϵ\} \end{matrix}

(A56)

\begin{matrix} = P \{N (0, 2 q (1 - q)) \geq 2 α q^{'} - ϵ\} \end{matrix}

(A57)

\begin{matrix} = Q (t_{0}^{-} (ϵ)), \end{matrix}

(A58)

where

\begin{matrix} t_{0}^{-} (ϵ) = \frac{2 α q^{'} - ϵ}{\sqrt{2 q (1 - q)}}, \end{matrix}

(A59)

and

\begin{matrix} Q (t) \overset{▵}{=} \int_{t}^{\infty} \frac{1}{\sqrt{2 π}} exp \{- \frac{s^{2}}{2}\} d s . \end{matrix}

(A60)

In a similar fashion:

\begin{matrix} P_{n} & = P \{X_{n} \geq Y_{n} + Z_{n} + α q^{'}\} \\ = P \{X_{n} \geq Y_{n} + Z_{n} + α q^{'} | Z_{n} \leq α q^{'} + ϵ\} P \{Z_{n} \leq α q^{'} + ϵ\} \end{matrix}

(A61)

\begin{matrix} + P \{X_{n} \geq Y_{n} + Z_{n} + α q^{'} | Z_{n} > α q^{'} + ϵ\} P \{Z_{n} > α q^{'} + ϵ\} \end{matrix}

(A62)

\begin{matrix} \geq P \{X_{n} \geq Y_{n} + α q^{'} + ϵ + α q^{'} | Z_{n} \leq α q^{'} + ϵ\} P \{Z_{n} \leq α q^{'} + ϵ\} \end{matrix}

(A63)

\begin{matrix} = P \{X_{n} \geq Y_{n} + 2 α q^{'} + ϵ\} P \{Z_{n} \leq α q^{'} + ϵ\}, \end{matrix}

(A64)

and thus

\begin{matrix} \underset{n \to \infty}{lim inf} P_{n} & \geq P \{X - Y \geq 2 α q^{'} + ϵ\} \end{matrix}

(A65)

\begin{matrix} = P \{N (0, 2 q (1 - q)) \geq 2 α q^{'} + ϵ\} \end{matrix}

(A66)

\begin{matrix} = Q (t_{0}^{+} (ϵ)), \end{matrix}

(A67)

where

\begin{matrix} t_{0}^{+} (ϵ) = \frac{2 α q^{'} + ϵ}{\sqrt{2 q (1 - q)}} . \end{matrix}

(A68)

From the continuity of the Q-function and the fact that

ϵ > 0

is arbitrarily small, we conclude that:

\begin{matrix} \underset{n \to \infty}{lim sup} P_{n} \leq Q (t_{0}) \leq \underset{n \to \infty}{lim inf} P_{n}, \end{matrix}

(A69)

where

\begin{matrix} t_{0} = \sqrt{\frac{2 α^{2} (1 - q)}{q}}, \end{matrix}

(A70)

and hence

\begin{matrix} lim_{n \to \infty} P_{n} = Q (t_{0}) . \end{matrix}

(A71)

Now, for any

α > 0

and

q \in (0, 1)

, the expression in (A70) is strictly positive, and thus

{lim}_{n \to \infty} P_{n} = Q (t_{0}) < \frac{1}{2}

. We conclude that for all

0 < δ < \frac{1}{2} - Q (t_{0})

,

P_{n} \leq Q (t_{0}) + δ < \frac{1}{2}

holds for all sufficiently large n.
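The limit in (A71) can be compared with an exact finite-n evaluation of the probability in (A32). The sketch below is only illustrative: it rounds α sqrt(n) to the nearest integer (the text treats it as an integer implicitly), assumes q ∈ (0, 1), and uses helper names that are not from the paper.

```python
import math

def binom_pmf(n, p):
    """PMF of Bin(n, p) for 0 < p < 1, computed in log-space for stability."""
    log_p, log_1p = math.log(p), math.log1p(-p)
    return [
        math.exp(math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
                 + k * log_p + (n - k) * log_1p)
        for k in range(n + 1)
    ]

def switch_probability(n, alpha, q):
    """P{Bin(n - a, q') >= Bin(n + a - 1, q') + 2} with a = round(alpha * sqrt(n))."""
    a = round(alpha * math.sqrt(n))
    qp = 1.0 - q
    pmf_x = binom_pmf(n - a, qp)
    pmf_y = binom_pmf(n + a - 1, qp)
    # tail_x[k] = P{X >= k}; the extra last entry is zero.
    tail_x = [0.0] * (len(pmf_x) + 1)
    for k in range(len(pmf_x) - 1, -1, -1):
        tail_x[k] = tail_x[k + 1] + pmf_x[k]
    total = 0.0
    for y, p_y in enumerate(pmf_y):
        if y + 2 < len(tail_x):
            total += p_y * tail_x[y + 2]
    return total

def q_function(t):
    """Gaussian tail function Q(t) from (A60)."""
    return 0.5 * math.erfc(t / math.sqrt(2.0))

alpha, q = 1.0, 0.5
t0 = math.sqrt(2.0 * alpha ** 2 * (1.0 - q) / q)
finite_n = switch_probability(10_000, alpha, q)
limit = q_function(t0)
```

For these sample values the finite-n probability is already close to Q(t_0) and well below 1/2, in line with the conclusion of Step 1.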

Step 2: Many Zeros with High Probability

Let

0 < δ < \frac{1}{2} - Q (t_{0})

be given. Let

Q_{n}^{0}, Q_{n}^{1}

denote the probabilities of deciding ‘0’, for the two possible initial states. Since

P_{n} \leq Q (t_{0}) + δ < \frac{1}{2}

for all sufficiently large n, it follows that

min {Q_{n}^{0}, Q_{n}^{1}} \geq Φ (t_{0}) - δ > \frac{1}{2}

for all sufficiently large n, where

Φ (t)

is defined in (2).

Let

ϵ > δ > 0

such that

Φ (t_{0}) - δ > Φ (t_{0}) - ϵ > \frac{1}{2}

. We now prove that the probability of drawing a relatively small number of zeros tends to 0 as

n \to \infty

. Denote

N_{0} = N (X_{1}; 0)

and consider the following for

s \geq 0

:

\begin{matrix} P \{N_{0} \leq 2 n (Φ (t_{0}) - ϵ)\} & = P \{e^{- s N_{0}} \geq e^{- 2 n s (Φ (t_{0}) - ϵ)}\} \end{matrix}

(A72)

\begin{matrix} \leq \frac{E [e^{- s N_{0}}]}{e^{- 2 n s (Φ (t_{0}) - ϵ)}}, \end{matrix}

(A73)

where (A73) is due to Markov’s inequality. Since (A73) holds for every

s \geq 0

, it follows that:

\begin{matrix} P \{N_{0} \leq 2 n (Φ (t_{0}) - ϵ)\} \leq inf_{s \geq 0} \frac{E [e^{- s N_{0}}]}{e^{- 2 n s (Φ (t_{0}) - ϵ)}} . \end{matrix}

(A74)

Note that:

\begin{matrix} N_{0} = \sum_{ℓ = 1}^{n + α \sqrt{n}} I_{ℓ} + \sum_{k = 1}^{n - α \sqrt{n}} J_{k}, \end{matrix}

(A75)

where

I_{ℓ} \sim Ber (Q_{n}^{0})

, for all

ℓ \in {1, 2, \dots, n + α \sqrt{n}}

,

J_{k} \sim Ber (Q_{n}^{1})

, for all

k \in {1, 2, \dots, n - α \sqrt{n}}

, and all of these binary random variables are independent. We obtain that:

\begin{matrix} E [e^{- s N_{0}}] & = E [exp \{- s (\sum_{ℓ = 1}^{n + α \sqrt{n}} I_{ℓ} + \sum_{k = 1}^{n - α \sqrt{n}} J_{k})\}] \end{matrix}

(A76)

\begin{matrix} = E [\prod_{ℓ = 1}^{n + α \sqrt{n}} e^{- s I_{ℓ}} \cdot \prod_{k = 1}^{n - α \sqrt{n}} e^{- s J_{k}}] \end{matrix}

(A77)

\begin{matrix} = \prod_{ℓ = 1}^{n + α \sqrt{n}} E [e^{- s I_{ℓ}}] \cdot \prod_{k = 1}^{n - α \sqrt{n}} E [e^{- s J_{k}}] \end{matrix}

(A78)

\begin{matrix} = {[1 + Q_{n}^{0} (e^{- s} - 1)]}^{n + α \sqrt{n}} \cdot {[1 + Q_{n}^{1} (e^{- s} - 1)]}^{n - α \sqrt{n}} \end{matrix}

(A79)

\begin{matrix} \leq {[1 + (Φ (t_{0}) - δ) (e^{- s} - 1)]}^{n + α \sqrt{n}} \cdot {[1 + (Φ (t_{0}) - δ) (e^{- s} - 1)]}^{n - α \sqrt{n}} \end{matrix}

(A80)

\begin{matrix} = {[1 + (Φ (t_{0}) - δ) (e^{- s} - 1)]}^{2 n}, \end{matrix}

(A81)

where (A78) is due to the independence of all binary random variables and (A80) is true since

min {Q_{n}^{0}, Q_{n}^{1}} \geq Φ (t_{0}) - δ

for all sufficiently large n and

e^{- s} - 1 \leq 0

. Substituting (A81) back into (A74) yields that:

\begin{matrix} P \{N_{0} \leq 2 n (Φ (t_{0}) - ϵ)\} \end{matrix}

\begin{matrix} \leq inf_{s \geq 0} exp \{2 n log [1 + (Φ (t_{0}) - δ) (e^{- s} - 1)] + 2 n s (Φ (t_{0}) - ϵ)\} \end{matrix}

(A82)

\begin{matrix} = exp \{2 n \cdot inf_{s \geq 0} {log [1 + (Φ (t_{0}) - δ) (e^{- s} - 1)] + s (Φ (t_{0}) - ϵ)}\} . \end{matrix}

(A83)

Upon defining:

\begin{matrix} g (s) = log [1 + (Φ (t_{0}) - δ) (e^{- s} - 1)] + s (Φ (t_{0}) - ϵ), \end{matrix}

(A84)

we find that the solution to

g^{'} (s) = 0

is given by

\begin{matrix} s^{*} = log (\frac{(Φ (t_{0}) - δ) [1 - (Φ (t_{0}) - ϵ)]}{[1 - (Φ (t_{0}) - δ)] (Φ (t_{0}) - ϵ)}) . \end{matrix}

(A85)

Substituting it back into (A84) yields that:

\begin{matrix} g (s^{*}) & = log (1 + (Φ (t_{0}) - δ) [\frac{[1 - (Φ (t_{0}) - δ)] (Φ (t_{0}) - ϵ)}{(Φ (t_{0}) - δ) [1 - (Φ (t_{0}) - ϵ)]} - 1]) \\ + (Φ (t_{0}) - ϵ) log (\frac{(Φ (t_{0}) - δ) [1 - (Φ (t_{0}) - ϵ)]}{[1 - (Φ (t_{0}) - δ)] (Φ (t_{0}) - ϵ)}) \end{matrix}

(A86)

\begin{matrix} = log (\frac{1 - (Φ (t_{0}) - δ)}{1 - (Φ (t_{0}) - ϵ)}) + (Φ (t_{0}) - ϵ) log (\frac{Φ (t_{0}) - δ}{Φ (t_{0}) - ϵ}) \\ + (Φ (t_{0}) - ϵ) log (\frac{1 - (Φ (t_{0}) - ϵ)}{1 - (Φ (t_{0}) - δ)}) \end{matrix}

(A87)

\begin{matrix} = - (Φ (t_{0}) - ϵ) log (\frac{Φ (t_{0}) - ϵ}{Φ (t_{0}) - δ}) - (1 - (Φ (t_{0}) - ϵ)) log (\frac{1 - (Φ (t_{0}) - ϵ)}{1 - (Φ (t_{0}) - δ)}) \end{matrix}

(A88)

\begin{matrix} = - D (Φ (t_{0}) - ϵ ∥ Φ (t_{0}) - δ) . \end{matrix}

(A89)

We upper-bound the expression in (A89) using Pinsker’s inequality [48,49]. Recall that the total variation distance between two probability distributions P and Q is defined by:

\begin{matrix} | P - Q | = \frac{1}{2} \sum_{x \in X} | P (x) - Q (x) |, \end{matrix}

(A90)

and the Kullback–Leibler divergence is defined by

\begin{matrix} D (P ∥ Q) = \sum_{x \in X} P (x) log \frac{P (x)}{Q (x)} . \end{matrix}

(A91)

Then, Pinsker’s inequality asserts that:

\begin{matrix} D (P ∥ Q) \geq 2 {| P - Q |}^{2} . \end{matrix}

(A92)

Thus, we arrive at:

\begin{matrix} P \{N_{0} \leq 2 n (Φ (t_{0}) - ϵ)\} & \leq exp \{- 2 n D (Φ (t_{0}) - ϵ ∥ Φ (t_{0}) - δ)\} \end{matrix}

(A93)

\begin{matrix} \leq exp \{- 4 n {(ϵ - δ)}^{2}\} . \end{matrix}

(A94)

Hence, we conclude that for all n sufficiently large:

\begin{matrix} P \{N_{0} \geq 2 n (Φ (t_{0}) - ϵ)\} \geq 1 - exp \{- 4 n {(ϵ - δ)}^{2}\}, \end{matrix}

(A95)

which converges to 1 as

n \to \infty

. Proposition 2 is now proved.
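The optimization in (A84)-(A89) and the Pinsker step can be verified numerically for concrete values. In the sketch below, `phi_delta` and `phi_eps` stand for Φ(t_0) - δ and Φ(t_0) - ϵ; the numbers 0.90 and 0.80 are arbitrary illustrative choices, and logarithms are natural.

```python
import math

def kl_bernoulli(a, b):
    """Binary KL divergence D(a || b) in nats, for a, b in (0, 1)."""
    return a * math.log(a / b) + (1 - a) * math.log((1 - a) / (1 - b))

def g(s, phi_delta, phi_eps):
    """g(s) from (A84)."""
    return math.log(1 + phi_delta * (math.exp(-s) - 1)) + s * phi_eps

def s_star(phi_delta, phi_eps):
    """Stationary point of g, as in (A85)."""
    return math.log(phi_delta * (1 - phi_eps) / ((1 - phi_delta) * phi_eps))

phi_delta, phi_eps = 0.90, 0.80   # requires phi_delta > phi_eps > 1/2
s = s_star(phi_delta, phi_eps)
divergence = kl_bernoulli(phi_eps, phi_delta)
pinsker_lower = 2 * (phi_delta - phi_eps) ** 2
```

Here `g(s, phi_delta, phi_eps)` equals `-divergence` up to rounding, matching (A89), and `divergence` is at least `pinsker_lower`, which is the step from (A93) to (A94).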

Appendix C. Proof of Proposition 3

Denote

N_{0} = N (X_{1}; 0)

. Let

{p_{n}}

denote the sequence of probabilities of the events that an agent with an initial value ‘0’ updates its value to ‘0’ after a single round of communication.

Step 1: An Upper Bound on the PMF of the Binomial Distribution

We start by upper-bounding the probability mass function (PMF) of the binomial random variable

X = Bin (n, p)

, which is given by:

\begin{matrix} P_{X} (k) = (\binom{n}{k}) p^{k} {(1 - p)}^{n - k}, k \in [0 : n] . \end{matrix}

(A96)

To upper-bound the binomial coefficient in (A96), we invoke the following Stirling’s bounds:

\begin{matrix} \sqrt{2 π n} \cdot n^{n} \cdot e^{- n} \leq n! \leq e \sqrt{n} \cdot n^{n} \cdot e^{- n}, \end{matrix}

(A97)

and obtain the following

\begin{matrix} (\binom{n}{k}) & = \frac{n!}{k! \cdot (n - k)!} \end{matrix}

(A98)

\begin{matrix} \leq \frac{e \sqrt{n} \cdot n^{n} \cdot e^{- n}}{\sqrt{2 π k} \cdot k^{k} \cdot e^{- k} \cdot \sqrt{2 π (n - k)} \cdot {(n - k)}^{n - k} \cdot e^{- (n - k)}} \end{matrix}

(A99)

\begin{matrix} = \frac{e \sqrt{n} \cdot n^{n}}{\sqrt{2 π k} \cdot k^{k} \cdot \sqrt{2 π (n - k)} \cdot {(n - k)}^{n - k}} \end{matrix}

(A100)

\begin{matrix} = \frac{e}{2 π} \sqrt{\frac{n}{k (n - k)}} \frac{n^{k} \cdot n^{n - k}}{k^{k} \cdot {(n - k)}^{n - k}} \end{matrix}

(A101)

\begin{matrix} = \frac{e}{2 π} \sqrt{\frac{n}{k (n - k)}} exp \{- k log (\frac{k}{n}) - (n - k) log (\frac{n - k}{n})\} \end{matrix}

(A102)

\begin{matrix} = \frac{e}{2 π} \sqrt{\frac{n}{k (n - k)}} exp \{- n [\frac{k}{n} log (\frac{k}{n}) + (1 - \frac{k}{n}) log (1 - \frac{k}{n})]\} . \end{matrix}

(A103)

Substituting (A103) back into (A96) yields:

\begin{matrix} P_{X} (k) \end{matrix}

\begin{matrix} \leq \frac{e}{2 π} \sqrt{\frac{n}{k (n - k)}} exp \{- n [\frac{k}{n} log (\frac{k}{n}) + (1 - \frac{k}{n}) log (1 - \frac{k}{n})]\} \cdot p^{k} {(1 - p)}^{n - k} \end{matrix}

(A104)

\begin{matrix} = \frac{e}{2 π} \sqrt{\frac{n}{k (n - k)}} exp \{- n [\frac{k}{n} log (\frac{k}{n}) + (1 - \frac{k}{n}) log (1 - \frac{k}{n})]\} \end{matrix}

\begin{matrix} \times exp \{- n [\frac{k}{n} log (\frac{1}{p}) + (1 - \frac{k}{n}) log (\frac{1}{1 - p})]\} \end{matrix}

(A105)

\begin{matrix} = \frac{e}{2 π} \sqrt{\frac{n}{k (n - k)}} exp \{- n [\frac{k}{n} log (\frac{k / n}{p}) + (1 - \frac{k}{n}) log (\frac{1 - k / n}{1 - p})]\} \end{matrix}

(A106)

\begin{matrix} = \frac{e}{2 π} \sqrt{\frac{n}{k (n - k)}} exp \{- n D (\frac{k}{n} ∥ p)\}, \end{matrix}

(A107)

where

D (α ∥ β)

, for

α, β \in [0, 1]

, is defined in (1).
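The bound (A107) is easy to check against the exact binomial PMF. The sketch below assumes that D(α ∥ β) is the binary Kullback–Leibler divergence with natural logarithms and restricts attention to 1 ≤ k ≤ n - 1, where the prefactor is finite; the parameter values are illustrative.

```python
import math

def kl_bernoulli(a, b):
    """Binary KL divergence D(a || b) in nats, for a, b in (0, 1)."""
    return a * math.log(a / b) + (1 - a) * math.log((1 - a) / (1 - b))

def pmf_upper_bound(n, k, p):
    """Right-hand side of (A107), for 1 <= k <= n - 1 and 0 < p < 1."""
    prefactor = math.e / (2 * math.pi) * math.sqrt(n / (k * (n - k)))
    return prefactor * math.exp(-n * kl_bernoulli(k / n, p))

def binom_pmf_exact(n, k, p):
    """Exact PMF of Bin(n, p) at k, as in (A96)."""
    return math.comb(n, k) * p ** k * (1 - p) ** (n - k)

n, p = 50, 0.3
bound_holds = all(
    binom_pmf_exact(n, k, p) <= pmf_upper_bound(n, k, p) for k in range(1, n)
)
```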

Step 2: The Limit of ${p_{n}}$ is $\frac{1}{2}$

First, we show that

{p_{n}}

is lower-bounded by

\frac{1}{2}

. For

q^{'} = 1 - q

, denote:

\begin{matrix} Z \sim Bin (n - 1, q^{'}), X, Y \sim Bin (n, q^{'}) . \end{matrix}

(A108)

We have that:

\begin{matrix} p_{n} & = P \{Z + 1 \geq Y\} \end{matrix}

(A109)

\begin{matrix} \geq P \{Bin (n - 1, q^{'}) + Bin (1, q^{'}) \geq Y\} \end{matrix}

(A110)

\begin{matrix} = P \{X \geq Y\}, \end{matrix}

(A111)

where (A110) is true since

Bin (1, q^{'}) \leq 1

with probability one. It follows by symmetry that

\begin{matrix} 1 & = P {X > Y} + P {X < Y} + P {X = Y} \end{matrix}

(A112)

\begin{matrix} = 2 P {X > Y} + P {X = Y}, \end{matrix}

(A113)

or,

\begin{matrix} P {X > Y} = \frac{1}{2} - \frac{1}{2} \cdot P {X = Y}, \end{matrix}

(A114)

which implies that

\begin{matrix} p_{n} & \geq P {X \geq Y} \end{matrix}

(A115)

\begin{matrix} = P {X > Y} + P {X = Y} \end{matrix}

(A116)

\begin{matrix} = \frac{1}{2} + \frac{1}{2} \cdot P {X = Y} \end{matrix}

(A117)

\begin{matrix} \geq \frac{1}{2} . \end{matrix}

(A118)

Next, we upper-bound the sequence

{p_{n}}

. Note that:

\begin{matrix} p_{n} & = P \{Bin (n - 1, q^{'}) + 1 \geq Bin (n, q^{'})\} \end{matrix}

(A119)

\begin{matrix} \leq P \{Bin (n, q^{'}) + 1 \geq Bin (n, q^{'})\} \end{matrix}

(A120)

\begin{matrix} = P \{X + 1 \geq Y\} \end{matrix}

(A121)

\begin{matrix} = P \{X \geq Y\} + P \{X + 1 = Y\} \end{matrix}

(A122)

\begin{matrix} = \frac{1}{2} + \frac{1}{2} \cdot P {X = Y} + P \{X + 1 = Y\} . \end{matrix}

(A123)

As for the last term in (A123), we have that:

\begin{matrix} P \{X + 1 = Y\} & = \sum_{ℓ = 0}^{n - 1} P {X = ℓ} \cdot P {Y = ℓ + 1} \end{matrix}

(A124)

\begin{matrix} \leq \sqrt{\sum_{ℓ = 0}^{n - 1} {(P {X = ℓ})}^{2}} \sqrt{\sum_{ℓ = 0}^{n - 1} {(P {Y = ℓ + 1})}^{2}} \end{matrix}

(A125)

\begin{matrix} = \sqrt{\sum_{ℓ = 0}^{n - 1} {(P {X = ℓ})}^{2}} \sqrt{\sum_{ℓ = 1}^{n} {(P {Y = ℓ})}^{2}} \end{matrix}

(A126)

\begin{matrix} \leq \sqrt{\sum_{ℓ = 0}^{n} {(P {X = ℓ})}^{2}} \sqrt{\sum_{ℓ = 0}^{n} {(P {Y = ℓ})}^{2}} \end{matrix}

(A127)

\begin{matrix} = \sum_{ℓ = 0}^{n} {(P {X = ℓ})}^{2} \end{matrix}

(A128)

\begin{matrix} = \sum_{ℓ = 0}^{n} P {X = ℓ} \cdot P {Y = ℓ} \end{matrix}

(A129)

\begin{matrix} = P {X = Y}, \end{matrix}

(A130)

where (A125) follows from the Cauchy–Schwarz inequality. Substituting (A130) back into (A123) yields that:

\begin{matrix} p_{n} & \leq \frac{1}{2} + \frac{3}{2} \cdot P {X = Y} . \end{matrix}

(A131)
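The sandwich formed by (A118) and (A131) can be checked exactly for moderate n. The sketch below computes p_n = P{Z + 1 ≥ Y} and P{X = Y} directly from the binomial PMFs; the function names and the sample values n = 40, q = 0.3 are illustrative.

```python
import math

def binom_pmf(n, p):
    """Exact PMF of Bin(n, p) as a list indexed by k."""
    return [math.comb(n, k) * p ** k * (1 - p) ** (n - k) for k in range(n + 1)]

def stay_probability(n, q):
    """Return (p_n, P{X = Y}) for X, Y ~ Bin(n, 1 - q) and Z ~ Bin(n - 1, 1 - q)."""
    qp = 1.0 - q
    pmf_n = binom_pmf(n, qp)
    pmf_z = binom_pmf(n - 1, qp)
    p_equal = sum(v * v for v in pmf_n)
    # p_n = P{Z + 1 >= Y} = sum over y of P{Y = y} * P{Z >= y - 1}.
    p_n = 0.0
    for y, p_y in enumerate(pmf_n):
        p_n += p_y * sum(pmf_z[max(y - 1, 0):])
    return p_n, p_equal

p_n, p_equal = stay_probability(40, 0.3)
```

For these values p_n lies between 1/2 and 1/2 + (3/2) P{X = Y}, as the two bounds require.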

Now, consider the following:

\begin{matrix} P {X = Y} & = \sum_{ℓ = 0}^{n} {(P {X = ℓ})}^{2} \end{matrix}

(A132)

\begin{matrix} = \sum_{ℓ = 0}^{n} {[(\binom{n}{ℓ}) {(1 - q)}^{ℓ} q^{n - ℓ}]}^{2} \end{matrix}

(A133)

\begin{matrix} = {[(\binom{n}{0}) {(1 - q)}^{0} q^{n}]}^{2} + \sum_{ℓ = 1}^{n - 1} {[(\binom{n}{ℓ}) {(1 - q)}^{ℓ} q^{n - ℓ}]}^{2} + {[(\binom{n}{n}) {(1 - q)}^{n} q^{0}]}^{2} \end{matrix}

(A134)

\begin{matrix} = q^{2 n} + \sum_{ℓ = 1}^{n - 1} {[(\binom{n}{ℓ}) {(1 - q)}^{ℓ} q^{n - ℓ}]}^{2} + {(1 - q)}^{2 n} . \end{matrix}

(A135)

As for the middle term in (A135), it follows from (A107) that:

\begin{matrix} \sum_{ℓ = 1}^{n - 1} {[(\binom{n}{ℓ}) {(1 - q)}^{ℓ} q^{n - ℓ}]}^{2} & \leq \sum_{ℓ = 1}^{n - 1} {(\frac{e}{2 π})}^{2} \frac{n}{ℓ (n - ℓ)} exp \{- 2 n D (\frac{ℓ}{n} ∥ 1 - q)\} . \end{matrix}

(A136)

To upper-bound (A136) let

ϵ_{n} = 1 / \sqrt[4]{n}

, for

n = 1, 2, \dots

and define the set of numbers:

\begin{matrix} N_{n} = {n (1 - q - ϵ_{n}), n (1 - q - ϵ_{n}) + 1, \dots, n (1 - q), \dots, n (1 - q + ϵ_{n})}, \end{matrix}

(A137)

whose cardinality is given by

\begin{matrix} | N_{n} | = 2 n ϵ_{n} + 1 . \end{matrix}

(A138)

Denote

M_{n} = \{1, 2, \dots, n - 1\} \cap N_{n}^{c}

. For any

ℓ \in M_{n}

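, it follows from the monotonicity of $D (α ∥ 1 - q)$ in $| α - (1 - q) |$ and from Pinsker’s inequality that (shown for $ℓ / n > 1 - q + ϵ_{n}$; the case $ℓ / n < 1 - q - ϵ_{n}$ is analogous):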

\begin{matrix} D (\frac{ℓ}{n} ∥ 1 - q) & \geq D (1 - q + ϵ_{n} ∥ 1 - q) \end{matrix}

(A139)

\begin{matrix} \geq 2 ϵ_{n}^{2} . \end{matrix}

(A140)

We now continue from (A136) and arrive at:

\begin{matrix} \sum_{ℓ = 1}^{n - 1} {(\frac{e}{2 π})}^{2} \frac{n}{ℓ (n - ℓ)} exp \{- 2 n D (\frac{ℓ}{n} ∥ 1 - q)\} \end{matrix}

\begin{matrix} \leq \sum_{ℓ \in M_{n}} {(\frac{e}{2 π})}^{2} \frac{n}{ℓ (n - ℓ)} exp \{- 4 n ϵ_{n}^{2}\} + \sum_{ℓ \in N_{n}} {(\frac{e}{2 π})}^{2} \frac{n}{ℓ (n - ℓ)} \end{matrix}

(A141)

\begin{matrix} \leq \sum_{ℓ \in M_{n}} {(\frac{e}{2 π})}^{2} \frac{n}{(n - 1)} exp \{- 4 n ϵ_{n}^{2}\} + \sum_{ℓ \in N_{n}} {(\frac{e}{2 π})}^{2} \frac{n}{n (1 - q - ϵ_{n}) [n - n (1 - q - ϵ_{n})]} \end{matrix}

(A142)

\begin{matrix} \leq {(\frac{e}{2 π})}^{2} n exp \{- 4 n ϵ_{n}^{2}\} + {(\frac{e}{2 π})}^{2} \frac{2 n ϵ_{n} + 1}{n (q + ϵ_{n}) (1 - q - ϵ_{n})} \end{matrix}

(A143)

\begin{matrix} = {(\frac{e}{2 π})}^{2} n exp \{- 4 n^{1 / 2}\} + {(\frac{e}{2 π})}^{2} \frac{2 n^{3 / 4} + 1}{n (q + n^{- 1 / 4}) (1 - q - n^{- 1 / 4})}, \end{matrix}

(A144)

where (A141) follows from (A140) and the fact that

D (α ∥ β) \geq 0

in general. The inequality in (A142) holds for two reasons. First, the minimizers of

ℓ (n - ℓ)

in

M_{n}

are 1 or

n - 1

. Second, the minimizer of

ℓ (n - ℓ)

in

N_{n}

is the endpoint of

N_{n}

which is the most distant from

1 / 2

. For simplicity, we assumed without loss of generality that

q \in (1 / 2, 1)

. The passage to (A143) is due to the fact that

| M_{n} | \leq n - 1

together with (A138), and in (A144) we substituted

ϵ_{n} = 1 / \sqrt[4]{n}

. Denote the expression in (A144) by

G_{n}

and notice that this expression converges to zero as

n \to \infty

. We substitute

G_{n}

back into (A135) and then into (A131). Since

{p_{n}}

is lower-bounded by

\frac{1}{2}

, we conclude that:

\begin{matrix} \frac{1}{2} \leq p_{n} \leq \frac{1}{2} + \frac{3}{2} \cdot [q^{2 n} + G_{n} + {(1 - q)}^{2 n}] . \end{matrix}

(A145)

Thus,

{p_{n}}

converges to

\frac{1}{2}

as long as

q \neq 0, 1

.
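The convergence claimed in (A145) can be illustrated numerically. The sketch below compares the exact collision probability (A132) with the bracketed bound of (A145). It assumes q ∈ (1/2, 1) and an n large enough that 1 − q − n^{−1/4} > 0, which the bound (A142) implicitly needs; all function names are hypothetical.

```python
from math import e, exp, lgamma, log, pi

def log_binomial_pmf(n, l, success):
    """log P{Bin(n, success) = l}, computed with log-gamma to avoid overflow."""
    log_comb = lgamma(n + 1) - lgamma(l + 1) - lgamma(n - l + 1)
    return log_comb + l * log(success) + (n - l) * log(1 - success)

def collision_probability(n, q):
    """Exact P{X = Y} for i.i.d. X, Y ~ Bin(n, 1 - q), as in (A132)."""
    return sum(exp(2 * log_binomial_pmf(n, l, 1 - q)) for l in range(n + 1))

def g_n(n, q):
    """The expression G_n of (A144); needs 1 - q - n**(-1/4) > 0."""
    eps = n ** -0.25
    if 1 - q - eps <= 0:
        raise ValueError("n is too small for this q")
    c = (e / (2 * pi)) ** 2
    far = c * n * exp(-4 * n ** 0.5)                                 # sum over M_n
    near = c * (2 * n ** 0.75 + 1) / (n * (q + eps) * (1 - q - eps))  # sum over N_n
    return far + near

def collision_upper_bound(n, q):
    """Bracketed term of (A145): q^(2n) + G_n + (1 - q)^(2n)."""
    return q ** (2 * n) + g_n(n, q) + (1 - q) ** (2 * n)

for n in (100, 1_000, 10_000, 100_000):
    print(n, collision_probability(n, 0.6), collision_upper_bound(n, 0.6))
```

In this sketch the exact value decays on the order of n^{−1/2}, whereas G_n decays on the order of n^{−1/4}, so the bound is loose but still sufficient for the limit.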

Step 3: Asymptotic Behavior of the Number of Zeros

We would like to prove that the random variable

| N_{0} - n | / \sqrt{n}

is bounded away from zero with an overwhelmingly high probability at large n. Note that:

\begin{matrix} N_{0} = \sum_{ℓ = 1}^{n} I_{n, ℓ} + \sum_{ℓ = 1}^{n} J_{n, ℓ}, \end{matrix}

(A146)

where

I_{n, ℓ} \sim Ber (p_{n})

and

J_{n, ℓ} \sim Ber (1 - p_{n})

, for all

ℓ \in {1, 2, \dots, n}

, and all of these binary random variables are independent. Let

ϵ > 0

and

δ (ϵ) > 0

, to be specified later, with the property that

δ (ϵ) \overset{ϵ \to 0}{\to} 0

. Consider the following:

\begin{matrix} P \{|\frac{N_{0} - n}{\sqrt{n}}| \geq δ (ϵ)\} = P \{|\frac{1}{\sqrt{n}} \sum_{ℓ = 1}^{n} (I_{n, ℓ} - p_{n}) + \frac{1}{\sqrt{n}} \sum_{ℓ = 1}^{n} (J_{n, ℓ} - (1 - p_{n}))| \geq δ (ϵ)\} . \end{matrix}

(A147)

To conclude that the two normalized sums inside the probability in (A147) converge in distribution to normal random variables, we invoke the Lindeberg–Feller central limit theorem [46] (p. 116, Theorem 2.4.5). We first recall the notion of a triangular array of random variables, which is a collection of the form

{X_{n, i}}

,

n \geq 1

,

1 \leq i \leq n

, where for every n, the random variables

X_{n, 1}, X_{n, 2}, \dots, X_{n, n}

are independent, have zero mean, and have finite variance. Then, one has the following result.

Theorem A1.

(Lindeberg–Feller CLT) Suppose

{X_{n, i}}

is a triangular array such that:

\begin{matrix} Z_{n} & = \frac{1}{n} \sum_{i = 1}^{n} X_{n, i}, \end{matrix}

(A148)

\begin{matrix} s_{n}^{2} & = \frac{1}{n} \sum_{i = 1}^{n} Var [X_{n, i}], \end{matrix}

(A149)

and

s_{n}^{2} \to s^{2} \neq 0

. If the Lindeberg condition holds: for every

ϵ > 0

,

\begin{matrix} \frac{1}{n} \sum_{i = 1}^{n} E [X_{n, i}^{2} 𝟙 {| X_{n, i} | \geq ϵ \sqrt{n}}] \to 0, \end{matrix}

(A150)

then

\sqrt{n} Z_{n} \overset{d}{\to} N (0, s^{2})

.

Now, concerning the first normalized sum inside the probability in (A147), notice that:

\begin{matrix} s_{n}^{2} & = \frac{1}{n} \sum_{ℓ = 1}^{n} Var [I_{n, ℓ} - p_{n}] \end{matrix}

(A151)

\begin{matrix} = \frac{1}{n} \sum_{ℓ = 1}^{n} p_{n} (1 - p_{n}) \end{matrix}

(A152)

\begin{matrix} = p_{n} (1 - p_{n}), \end{matrix}

(A153)

which converges to

s^{2} = \frac{1}{4}

as

n \to \infty

. In addition, Lindeberg’s condition in (A150) is satisfied since $| I_{n, ℓ} - p_{n} | \leq 1$, so the indicator in (A150) vanishes as soon as $ϵ \sqrt{n} > 1$. Thus, it follows from the Lindeberg–Feller CLT that:

\begin{matrix} \frac{1}{\sqrt{n}} \sum_{ℓ = 1}^{n} (I_{n, ℓ} - p_{n}) \overset{d}{\to} X \sim N (0, \frac{1}{4}) . \end{matrix}

(A154)
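To make the Lindeberg condition concrete for bounded summands, the following sketch evaluates the left-hand side of (A150) exactly for centered Bernoulli variables. The helper names are illustrative, and the summands are taken to be identically distributed with a fixed parameter p, which is a simplification of the setting above (where p depends on n).

```python
from math import sqrt

def lindeberg_sum(p, n, eps):
    """Left-hand side of (A150) for X_{n,i} = I_{n,i} - p with I_{n,i} ~ Ber(p)."""
    threshold = eps * sqrt(n)
    total = 0.0
    # The centered variable equals 1 - p with probability p and -p with probability 1 - p.
    for value, prob in ((1 - p, p), (-p, 1 - p)):
        if abs(value) >= threshold:
            total += prob * value ** 2
    # All n summands are identical, so (1/n) times their sum equals a single term.
    return total

def variance_term(p):
    """s_n^2 of (A153) for centered Ber(p) entries."""
    return p * (1 - p)

for n in (4, 25, 100, 10_000):
    print(n, lindeberg_sum(0.5, n, eps=0.1), variance_term(0.5))
```

Because the centered variable is bounded by 1 in absolute value, the truncated second moment is exactly zero once ϵ√n exceeds 1, while the variance term stays at p(1 − p).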

From exactly the same considerations:

\begin{matrix} \frac{1}{\sqrt{n}} \sum_{ℓ = 1}^{n} (J_{n, ℓ} - (1 - p_{n})) \overset{d}{\to} Y \sim N (0, \frac{1}{4}), \end{matrix}

(A155)

and

X, Y

are independent since

{I_{n, ℓ}}

and

{J_{n, ℓ}}

are all independent. We continue from (A147) and arrive at:

\begin{matrix} lim_{n \to \infty} P \{|\frac{N_{0} - n}{\sqrt{n}}| \geq δ (ϵ)\} & = P \{|X + Y| \geq δ (ϵ)\} \end{matrix}

(A156)

\begin{matrix} = P \{|N (0, \frac{1}{2})| \geq δ (ϵ)\} \end{matrix}

(A157)

\begin{matrix} = 1 - \frac{ϵ}{2}, \end{matrix}

(A158)

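where the last equality holds for a suitable choice of

δ (ϵ)

, which exists and vanishes as ϵ → 0 because the Gaussian distribution function is continuous and strictly increasing. We conclude that for any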

ϵ > 0

, there exists some

M (ϵ)

, such that for all

n \geq M (ϵ)

:

\begin{matrix} P \{|\frac{N_{0} - n}{\sqrt{n}}| \geq δ (ϵ)\} \geq 1 - ϵ, \end{matrix}

(A159)

which completes the proof of Proposition 3.
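The choice of δ(ϵ) in (A158) can be made explicit. A minimal sketch, assuming only the limiting law N(0, 1/2) from (A157): solving 2(1 − Φ(δ/σ)) = 1 − ϵ/2 with σ² = 1/2 gives δ(ϵ) = σ Φ^{−1}(1/2 + ϵ/4). The names `delta` and `two_sided_tail` are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

# Limiting distribution of X + Y in (A157): Gaussian with mean 0 and variance 1/2.
LIMIT = NormalDist(mu=0.0, sigma=sqrt(0.5))

def delta(eps):
    """Solve P{|N(0, 1/2)| >= delta} = 1 - eps/2 for delta, with 0 < eps < 2."""
    if not 0 < eps < 2:
        raise ValueError("eps must lie in (0, 2)")
    return LIMIT.inv_cdf(0.5 + eps / 4)

def two_sided_tail(d):
    """P{|N(0, 1/2)| >= d} for d >= 0."""
    return 2 * (1 - LIMIT.cdf(d))

for eps in (0.5, 0.1, 0.01):
    d = delta(eps)
    print(eps, d, two_sided_tail(d))
```

The printed values of δ(ϵ) shrink with ϵ, in line with the requirement that δ(ϵ) → 0 as ϵ → 0.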

Appendix D. Proof of Proposition 4

Let us denote

N = N (X_{1}; 0)

. For any

μ \geq 0

, it follows from Markov’s inequality that:

\begin{matrix} P \{N \geq n + B_{n}\} & = P \{e^{μ N} \geq e^{μ (n + B_{n})}\} \end{matrix}

(A160)

\begin{matrix} \leq \frac{E [e^{μ N}]}{e^{μ (n + B_{n})}}, \end{matrix}

(A161)

and thus, since (A161) holds for every

μ \geq 0

, it follows that

\begin{matrix} P \{N \geq n + B_{n}\} & \leq inf_{μ \geq 0} \frac{E [e^{μ N}]}{e^{μ (n + B_{n})}} . \end{matrix}

(A162)

Note that:

\begin{matrix} N = \sum_{m = 1}^{n} I_{m} + \sum_{m = 1}^{n} J_{m}, \end{matrix}

(A163)

where

I_{m} \sim Ber (p_{n})

and

J_{m} \sim Ber (1 - p_{n})

, for all

m \in {1, 2, \dots, n}

, and all of these binary random variables are independent. We obtain that:

\begin{matrix} E [e^{μ N}] & = E [exp \{μ (\sum_{m = 1}^{n} I_{m} + \sum_{m = 1}^{n} J_{m})\}] \end{matrix}

(A164)

\begin{matrix} = E [\prod_{m = 1}^{n} e^{μ I_{m}} \cdot \prod_{m = 1}^{n} e^{μ J_{m}}] \end{matrix}

(A165)

\begin{matrix} = \prod_{m = 1}^{n} E [e^{μ I_{m}}] \cdot \prod_{m = 1}^{n} E [e^{μ J_{m}}] \end{matrix}

(A166)

\begin{matrix} = {(1 - p_{n} + p_{n} e^{μ})}^{n} \cdot {(p_{n} + (1 - p_{n}) e^{μ})}^{n} \end{matrix}

(A167)

\begin{matrix} = {\{[1 + p_{n} (e^{μ} - 1)] \cdot [1 + (1 - p_{n}) (e^{μ} - 1)]\}}^{n} \end{matrix}

(A168)

\begin{matrix} \leq {\{[1 + \frac{1}{2} (e^{μ} - 1)] \cdot [1 + \frac{1}{2} (e^{μ} - 1)]\}}^{n} \end{matrix}

(A169)

\begin{matrix} = {[1 + \frac{1}{2} (e^{μ} - 1)]}^{2 n}, \end{matrix}

(A170)

where (A166) is due to the independence of all binary random variables and (A169) follows from the fact that the expression in (A168) is maximized for

p_{n} = \frac{1}{2}

.

Substituting (A170) back into (A162) yields that:

\begin{matrix} P \{N \geq n + B_{n}\} & \leq inf_{μ \geq 0} \frac{{[1 + \frac{1}{2} (e^{μ} - 1)]}^{2 n}}{exp {μ (n + B_{n})}} \end{matrix}

(A171)

\begin{matrix} = inf_{μ \geq 0} exp \{2 n log [1 + \frac{1}{2} (e^{μ} - 1)] - μ (n + B_{n})\} \end{matrix}

(A172)

\begin{matrix} = exp \{inf_{μ \geq 0} {2 n log [1 + \frac{1}{2} (e^{μ} - 1)] - μ (n + B_{n})}\} . \end{matrix}

(A173)

Upon defining:

\begin{matrix} f (μ) = 2 n log [1 + \frac{1}{2} (e^{μ} - 1)] - μ (n + B_{n}), \end{matrix}

(A174)

we find that the solution to

f^{'} (μ) = 0

is given by

\begin{matrix} μ^{*} = log (\frac{n + B_{n}}{n - B_{n}}) . \end{matrix}

(A175)
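For completeness, the stationarity condition behind (A175) can be worked out as follows. This is a routine calculation added for readability; it uses only f from (A174) and assumes 0 ≤ B_n < n.

```latex
\begin{align*}
f'(\mu) &= 2n \cdot \frac{\tfrac{1}{2} e^{\mu}}{1 + \tfrac{1}{2}(e^{\mu} - 1)} - (n + B_n)
         = \frac{2n\, e^{\mu}}{1 + e^{\mu}} - (n + B_n), \\
f'(\mu) = 0 &\iff 2n\, e^{\mu} = (n + B_n)(1 + e^{\mu})
         \iff e^{\mu}(n - B_n) = n + B_n
         \iff \mu^{*} = \log\frac{n + B_n}{n - B_n}.
\end{align*}
```

Since f''(μ) = 2n e^{μ} / (1 + e^{μ})² > 0, the function f is convex, so μ* is its global minimizer, and μ* ≥ 0 whenever 0 ≤ B_n < n.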

Substituting it back into (A173) provides that:

\begin{matrix} P \{N \geq n + B_{n}\} \end{matrix}

\begin{matrix} \leq exp \{2 n log [1 + \frac{1}{2} (e^{μ^{*}} - 1)] - μ^{*} (n + B_{n})\} \end{matrix}

(A176)

\begin{matrix} = exp \{2 n log [1 + \frac{1}{2} (\frac{n + B_{n}}{n - B_{n}} - 1)] - (n + B_{n}) log (\frac{n + B_{n}}{n - B_{n}})\} \end{matrix}

(A177)

\begin{matrix} = exp \{2 n log (1 + \frac{B_{n}}{n - B_{n}}) - (n + B_{n}) log (\frac{n + B_{n}}{n - B_{n}})\} \end{matrix}

(A178)

\begin{matrix} = exp \{2 n log (\frac{n}{n - B_{n}}) - (n + B_{n}) log (\frac{n + B_{n}}{n - B_{n}})\} \end{matrix}

(A179)

\begin{matrix} = exp \{(n - B_{n}) log (\frac{n}{n - B_{n}}) + (n + B_{n}) log (\frac{n}{n + B_{n}})\} \end{matrix}

(A180)

\begin{matrix} = exp \{- n \cdot [(1 - \frac{B_{n}}{n}) log (1 - \frac{B_{n}}{n}) + (1 + \frac{B_{n}}{n}) log (1 + \frac{B_{n}}{n})]\} . \end{matrix}

(A181)

Consider the function:

\begin{matrix} g (t) = (1 - t) log (1 - t) + (1 + t) log (1 + t), \end{matrix}

(A182)

which is symmetric around

t = 0

. Its first- and second-order derivatives are given by

\begin{matrix} g' (t) = \log (\frac{1 + t}{1 - t}), \end{matrix}

(A183)

and

\begin{matrix} g'' (t) = \frac{2}{(1 + t) (1 - t)} . \end{matrix}

(A184)

Since $g (0) = g' (0) = 0$ and $g'' (t) \geq 2$ for all $t \in (- 1, 1)$, integrating twice shows that

g (t) \geq t^{2}

, and thus:

\begin{matrix} P \{N \geq n + B_{n}\} \leq exp \{- n \cdot {(\frac{B_{n}}{n})}^{2}\} = exp \{- \frac{B_{n}^{2}}{n}\}, \end{matrix}

(A185)

which completes the proof of Proposition 4.
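The bound (A185) can be compared with the exact tail for moderate n. The sketch below builds the distribution of N from the decomposition (A163) by direct convolution; the parameter values and helper names are illustrative assumptions, and B_n is taken to be an integer.

```python
from math import comb, exp

def binomial_pmf(n, success):
    """Exact PMF of Bin(n, success), as a list indexed by the outcome."""
    return [comb(n, k) * success**k * (1 - success)**(n - k) for k in range(n + 1)]

def zero_count_pmf(n, p):
    """PMF of N = Bin(n, p) + Bin(n, 1 - p), following (A163)."""
    a, b = binomial_pmf(n, p), binomial_pmf(n, 1 - p)
    pmf = [0.0] * (2 * n + 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            pmf[i + j] += ai * bj
    return pmf

def exact_tail(n, p, b):
    """P{N >= n + b} for an integer deviation b with 0 <= b <= n."""
    return sum(zero_count_pmf(n, p)[n + b:])

def chernoff_bound(n, b):
    """Right-hand side of (A185)."""
    return exp(-b * b / n)

for p in (0.5, 0.6):
    for b in (5, 10, 20, 40):
        print(p, b, exact_tail(100, p, b), chernoff_bound(100, b))
```

The quadratic-time convolution is adequate for n in the hundreds; for much larger n a log-domain or FFT-based computation would be more appropriate.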

Appendix E. Proof of Proposition 5

Step 1: A Simplification for the Consensus Probability

Due to symmetry, we only analyze the case

I_{0} > I_{1}

. It follows that:

\begin{matrix} P {C_{n}} & = P {N (X_{1}; 0) = 2 n} \end{matrix}

(A186)

\begin{matrix} = P \{⋂_{i = 1}^{2 n} {X_{1} (i) = 0}\} \end{matrix}

(A187)

\begin{matrix} = \prod_{i = 1}^{2 n} P \{X_{1} (i) = 0\} \end{matrix}

(A188)

\begin{matrix} = \prod_{i = 1}^{2 n} (1 - P \{X_{1} (i) = 1\}) . \end{matrix}

(A189)

Step 2: A Lower Bound on $P \{X_{1} (i) = 1\}$

If an agent starts with a ‘0’, then the probability of deciding in favor of ‘1’ is lower-bounded by:

\begin{matrix} P \{Bin (n - C_{n}, q^{'}) \geq Bin (n + C_{n} - 1, q^{'}) + 2\} \\ \geq P \{Bin (n - C_{n}, q^{'}) \geq Bin (n + C_{n}, q^{'}) + 2\} . \end{matrix}

(A190)

If an agent starts with a ‘1’, then the probability of deciding in favor of ‘1’ is lower-bounded by:

\begin{matrix} P \{Bin (n - C_{n} - 1, q^{'}) + 1 \geq Bin (n + C_{n}, q^{'})\} \end{matrix}

\begin{matrix} \geq P \{Bin (n - C_{n} - 1, q^{'}) + Bin (1, q^{'}) \geq Bin (n + C_{n}, q^{'})\} \end{matrix}

(A191)

\begin{matrix} = P \{Bin (n - C_{n}, q^{'}) \geq Bin (n + C_{n}, q^{'})\} . \end{matrix}

(A192)

Since (A190) cannot be larger than (A192), we continue with (A190). From now on, we lower-bound the probability in (A190), to be denoted by

Q_{n}

. The probability in (A190) can be written explicitly as:

\begin{matrix} Q_{n} = \sum_{ℓ = 0}^{n - C_{n}} \sum_{k = 0}^{n + C_{n}} \binom{n - C_{n}}{ℓ} {(1 - q)}^{ℓ} q^{n - C_{n} - ℓ} \binom{n + C_{n}}{k} {(1 - q)}^{k} q^{n + C_{n} - k} 𝟙 \{ℓ \geq k + 2\} . \end{matrix}

(A193)

We continue by lower-bounding the PMF of the binomial random variable

X = Bin (n, p)

, which is given by:

\begin{matrix} P_{X} (k) = \binom{n}{k} p^{k} {(1 - p)}^{n - k}, \quad k \in [0 : n] . \end{matrix}

(A194)

To lower-bound the binomial coefficient in (A194), we use Stirling’s bounds in (A97) and obtain that:

\begin{matrix} \binom{n}{k} & = \frac{n!}{k! \cdot (n - k)!} \end{matrix}

(A195)

\begin{matrix} \geq \frac{\sqrt{2 π}}{e^{2}} \sqrt{\frac{n}{k (n - k)}} exp \{- n [\frac{k}{n} log (\frac{k}{n}) + (1 - \frac{k}{n}) log (1 - \frac{k}{n})]\} . \end{matrix}

(A196)

Substituting (A196) back into (A194) yields:

\begin{matrix} P_{X} (k) & \geq \frac{\sqrt{2 π}}{e^{2}} \sqrt{\frac{n}{k (n - k)}} exp \{- n D (\frac{k}{n} ∥ p)\}, \end{matrix}

(A197)

where

D (α ∥ β)

, for

α, β \in [0, 1]

, is defined in (1). Substituting this lower bound twice into (A193), we arrive at:

\begin{matrix} Q_{n} & \geq \frac{2 π}{e^{4}} \sum_{ℓ = 0}^{n - C_{n}} \sum_{k = 0}^{ℓ - 2} \sqrt{\frac{n - C_{n}}{ℓ (n - C_{n} - ℓ)}} exp \{- (n - C_{n}) D (\frac{ℓ}{n - C_{n}} ∥ 1 - q)\} \\ \times \sqrt{\frac{n + C_{n}}{k (n + C_{n} - k)}} exp \{- (n + C_{n}) D (\frac{k}{n + C_{n}} ∥ 1 - q)\} . \end{matrix}

(A198)
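The lower bound (A197) on the binomial PMF, which underlies (A198), can be checked numerically for interior values of k. The sketch below is illustrative only: the helper names are hypothetical, natural logarithms are assumed, and the bound is evaluated for 1 ≤ k ≤ n − 1, where the square-root factor is well defined.

```python
from math import comb, e, exp, log, pi, sqrt

def kl_bernoulli(a, b):
    """Binary divergence D(a || b) in nats, with the convention 0 log 0 = 0."""
    total = 0.0
    if a > 0:
        total += a * log(a / b)
    if a < 1:
        total += (1 - a) * log((1 - a) / (1 - b))
    return total

def pmf_exact(n, k, p):
    """P{Bin(n, p) = k} as in (A194)."""
    return comb(n, k) * p**k * (1 - p)**(n - k)

def pmf_lower_bound(n, k, p):
    """Right-hand side of (A197); meaningful for 1 <= k <= n - 1."""
    prefactor = sqrt(2 * pi) / e**2 * sqrt(n / (k * (n - k)))
    return prefactor * exp(-n * kl_bernoulli(k / n, p))

n, p = 200, 0.4
worst = min(pmf_exact(n, k, p) / pmf_lower_bound(n, k, p) for k in range(1, n))
print("smallest ratio exact / bound:", worst)
```

A ratio of at least 1 for every interior k is what (A197) predicts; the ratio is independent of p, because the factor p^k (1 − p)^{n − k} cancels against the divergence term.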

As for the square-root factors in (A198), we have the following:

\begin{matrix} \sqrt{\frac{n - C_{n}}{ℓ (n - C_{n} - ℓ)}} \cdot \sqrt{\frac{n + C_{n}}{k (n + C_{n} - k)}} \end{matrix}

\begin{matrix} \geq \sqrt{\frac{n - C_{n}}{\frac{1}{2} (n - C_{n}) (n - C_{n} - \frac{1}{2} (n - C_{n}))}} \cdot \sqrt{\frac{n + C_{n}}{\frac{1}{2} (n + C_{n}) (n + C_{n} - \frac{1}{2} (n + C_{n}))}} \end{matrix}

(A199)

\begin{matrix} = \sqrt{\frac{4 (n - C_{n})}{{(n - C_{n})}^{2}}} \cdot \sqrt{\frac{4 (n + C_{n})}{{(n + C_{n})}^{2}}} \end{matrix}

(A200)

\begin{matrix} = 4 \sqrt{\frac{1}{(n - C_{n}) (n + C_{n})}} \end{matrix}

(A201)

\begin{matrix} = 4 \sqrt{\frac{1}{n^{2} - C_{n}^{2}}} \end{matrix}

(A202)

\begin{matrix} \geq \frac{4}{n}, \end{matrix}

(A203)

where (A199) is due to the fact that a square has the maximal area among all rectangles with a fixed perimeter. Lower-bounding (A198) using (A203) yields:

\begin{matrix} Q_{n} & \geq \frac{8 π}{e^{4} n} \sum_{ℓ = 0}^{n - C_{n}} \sum_{k = 0}^{ℓ - 2} exp \{- (n - C_{n}) D (\frac{ℓ}{n - C_{n}} ∥ 1 - q)\} \\ \times exp \{- (n + C_{n}) D (\frac{k}{n + C_{n}} ∥ 1 - q)\} \end{matrix}

(A204)

\begin{matrix} \geq \frac{8 π}{e^{4} n} \sum_{ℓ = (n - C_{n}) (1 - q)}^{(n + C_{n}) (1 - q)} \sum_{k = 0}^{ℓ - 2} exp \{- (n - C_{n}) D (\frac{ℓ}{n - C_{n}} ∥ 1 - q)\} \\ \times exp \{- (n + C_{n}) D (\frac{k}{n + C_{n}} ∥ 1 - q)\} \end{matrix}

(A205)

\begin{matrix} \geq \frac{8 π}{e^{4} n} \sum_{ℓ = (n - C_{n}) (1 - q)}^{(n + C_{n}) (1 - q)} \sum_{k = ℓ - C_{n} - 2}^{ℓ - 2} exp \{- (n - C_{n}) D (\frac{ℓ}{n - C_{n}} ∥ 1 - q)\} \\ \times exp \{- (n + C_{n}) D (\frac{k}{n + C_{n}} ∥ 1 - q)\} \end{matrix}

(A206)

\begin{matrix} = \frac{8 π}{e^{4} n} \sum_{ℓ = (n - C_{n}) (1 - q)}^{(n + C_{n}) (1 - q)} \sum_{j = 0}^{C_{n}} exp \{- (n - C_{n}) D (\frac{ℓ}{n - C_{n}} ∥ 1 - q)\} \\ \times exp \{- (n + C_{n}) D (\frac{ℓ - 2 - j}{n + C_{n}} ∥ 1 - q)\} \end{matrix}

(A207)

\begin{matrix} = \frac{8 π}{e^{4} n} \sum_{m = 0}^{2 C_{n}} \sum_{j = 0}^{C_{n}} exp \{- (n - C_{n}) D (\frac{(n - C_{n} + m) (1 - q)}{n - C_{n}} ∥ 1 - q)\} \\ \times exp \{- (n + C_{n}) D (\frac{(n - C_{n} + m) (1 - q) - 2 - j}{n + C_{n}} ∥ 1 - q)\}, \end{matrix}

(A208)

where (A205) follows from the condition

\lim_{n \to \infty} C_{n} / n = 0

, which implies that for all large enough n, both

(n - C_{n}) (1 - q) \geq 0

and

(n + C_{n}) (1 - q) \leq n - C_{n}

hold. The inequality in (A206) also follows from the condition

\lim_{n \to \infty} C_{n} / n = 0

, since for all

(n - C_{n}) (1 - q) \leq ℓ \leq (n + C_{n}) (1 - q)

, it holds that

ℓ - C_{n} - 2 \geq 0

, for all sufficiently large n. In (A207) we changed the summation index from k to j according to

k = ℓ - j - 2

, with

j \in {0, 1, \dots, C_{n}}

, and in (A208) we changed the summation index from ℓ to m according to

ℓ = (n - C_{n} + m) (1 - q)

, with

m \in {0, 1, \dots, 2 C_{n}}

. To upper-bound the divergence terms in (A208), we invoke the following reverse Pinsker inequality [50] (p. 5974, Eq. (23)):

\begin{matrix} D (P ∥ Q) \leq (\frac{2}{Q_{\min}}) \cdot {| P - Q |}^{2}, \end{matrix}

(A209)

where

\begin{matrix} Q_{\min} = \min_{x \in X} Q (x) . \end{matrix}

(A210)
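For the binary case used below, (A209) can be verified on a grid. The sketch reads |P − Q| as the difference between the two Bernoulli parameters, which is how the bound is applied in (A211); this reading, the grid resolution, and the function names are assumptions made for illustration.

```python
from math import log

def kl_bernoulli(a, b):
    """Binary divergence D(a || b) in nats, with the convention 0 log 0 = 0."""
    total = 0.0
    if a > 0:
        total += a * log(a / b)
    if a < 1:
        total += (1 - a) * log((1 - a) / (1 - b))
    return total

def reverse_pinsker_bound(a, b):
    """Right-hand side of (A209) for Bernoulli(a) versus Bernoulli(b), 0 < b < 1."""
    q_min = min(b, 1 - b)
    return 2.0 / q_min * (a - b) ** 2

grid = [i / 100 for i in range(101)]
violations = [
    (a, b)
    for a in grid
    for b in grid[1:-1]  # exclude b = 0 and b = 1, where Q_min = 0
    if kl_bernoulli(a, b) > reverse_pinsker_bound(a, b) + 1e-12
]
print("violations found:", len(violations))
```

In the binary case the inequality also follows from the chi-square bound D(a ∥ b) ≤ (a − b)² / (b(1 − b)) together with max{b, 1 − b} ≥ 1/2, so no violations are expected.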

Let us define

Δ_{q} = min {q, 1 - q}

. Then, after some algebraic work, we arrive at:

\begin{matrix} Q_{n} & \geq \frac{8 π}{e^{4} n} \sum_{m = 0}^{2 C_{n}} \sum_{j = 0}^{C_{n}} exp \{- (n - C_{n}) \cdot \frac{2}{Δ_{q}} \frac{{(1 - q)}^{2} m^{2}}{{(n - C_{n})}^{2}}\} \end{matrix}

\begin{matrix} \times exp \{- (n + C_{n}) \cdot \frac{2}{Δ_{q}} \frac{{[(1 - q) (2 C_{n} - m) + 2 + j]}^{2}}{{(n + C_{n})}^{2}}\} \end{matrix}

(A211)

\begin{matrix} = \frac{8 π}{e^{4} n} \sum_{m = 0}^{2 C_{n}} \sum_{j = 0}^{C_{n}} exp \{- \frac{2}{Δ_{q}} \frac{{(1 - q)}^{2} m^{2}}{n - C_{n}}\} \cdot exp \{- \frac{2}{Δ_{q}} \frac{{[(1 - q) (2 C_{n} - m) + 2 + j]}^{2}}{n + C_{n}}\} \end{matrix}

(A212)

\begin{matrix} \geq \frac{8 π}{e^{4} n} \sum_{m = 0}^{2 C_{n}} \sum_{j = 0}^{C_{n}} exp \{- \frac{2}{Δ_{q}} \frac{m^{2}}{n - C_{n}}\} \cdot exp \{- \frac{2}{Δ_{q}} \frac{{[(2 C_{n} - m) + 2 C_{n}]}^{2}}{n - C_{n}}\} \end{matrix}

(A213)

\begin{matrix} \geq \frac{8 π C_{n}}{e^{4} n} \sum_{m = 0}^{2 C_{n}} exp \{- \frac{2}{Δ_{q}} \frac{m^{2}}{n - C_{n}}\} \cdot exp \{- \frac{2}{Δ_{q}} \frac{{(4 C_{n} - m)}^{2}}{n - C_{n}}\} \end{matrix}

(A214)

\begin{matrix} = \frac{8 π C_{n}}{e^{4} n} \sum_{m = 0}^{2 C_{n}} exp \{- \frac{2}{Δ_{q}} \cdot \frac{m^{2} + {(4 C_{n} - m)}^{2}}{n - C_{n}}\} \end{matrix}

(A215)

\begin{matrix} = \frac{8 π C_{n}}{e^{4} n} \sum_{m = 0}^{2 C_{n}} exp \{- \frac{4}{Δ_{q}} \cdot \frac{{(2 C_{n} - m)}^{2} + 4 C_{n}^{2}}{n - C_{n}}\}, \end{matrix}

(A216)

where (A213) is true since

1 \geq 1 - q

,

n - C_{n} \leq n + C_{n}

, and due to the fact that

2 + j

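is upper-bounded by

2 C_{n}

whenever $C_{n} \geq 2$. Now, the quantity ${(2 C_{n} - m)}^{2} + 4 C_{n}^{2}$ in the exponent of (A216) is maximized at

m = 0

, so each summand is lower-bounded by its value there, and thus: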

\begin{matrix} Q_{n} & \geq \frac{8 π C_{n}}{e^{4} n} \sum_{m = 0}^{2 C_{n}} exp \{- \frac{4}{Δ_{q}} \cdot \frac{8 C_{n}^{2}}{n - C_{n}}\} \end{matrix}

(A217)

\begin{matrix} \geq \frac{16 π}{e^{4}} \frac{C_{n}^{2}}{n} exp \{- \frac{32}{Δ_{q}} \cdot \frac{C_{n}^{2}}{n - C_{n}}\} . \end{matrix}

(A218)
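To get a feeling for how conservative (A218) is, the following sketch computes the probability in (A190) exactly and compares it with the right-hand side of (A218). It assumes that the binomial success probability equals 1 − q, as in (A193); the values of n, q, and C_n are arbitrary, and since (A218) is derived for sufficiently large n, the comparison at a fixed n is only illustrative.

```python
from math import comb, e, exp, pi

def binomial_pmf(n, success):
    """Exact PMF of Bin(n, success), as a list indexed by the outcome."""
    return [comb(n, k) * success**k * (1 - success)**(n - k) for k in range(n + 1)]

def q_n_exact(n, c, success):
    """P{Bin(n - c, success) >= Bin(n + c, success) + 2}, the probability in (A190)."""
    minority = binomial_pmf(n - c, success)
    majority = binomial_pmf(n + c, success)
    cumulative, running = [], 0.0  # cumulative[k] = P{Bin(n + c, success) <= k}
    for v in majority:
        running += v
        cumulative.append(running)
    return sum(minority[l] * cumulative[l - 2] for l in range(2, n - c + 1))

def q_n_lower_bound(n, c, q):
    """Right-hand side of (A218), with Delta_q = min(q, 1 - q)."""
    delta_q = min(q, 1 - q)
    return 16 * pi / e**4 * c * c / n * exp(-32 / delta_q * c * c / (n - c))

n, q = 200, 0.6
for c in (2, 5, 10, 20):
    print(c, q_n_exact(n, c, 1 - q), q_n_lower_bound(n, c, q))
```

In these illustrative runs the exact probability exceeds the bound by many orders of magnitude, which is consistent with (A218) being used only for its asymptotic scaling.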

Step 3: Wrapping Up

We denote the constant

f_{q} = 32 / Δ_{q}

. Continuing from (A189), we finally arrive at:

\begin{matrix} P {C_{n}} & \leq \prod_{i = 1}^{2 n} (1 - \frac{16 π}{e^{4}} \frac{C_{n}^{2}}{n} \cdot exp \{- f_{q} \cdot \frac{C_{n}^{2}}{n - C_{n}}\}) \end{matrix}

(A219)

\begin{matrix} = {(1 - \frac{16 π}{e^{4}} \frac{C_{n}^{2}}{n} \cdot exp \{- f_{q} \cdot \frac{C_{n}^{2}}{n - C_{n}}\})}^{2 n} \end{matrix}

(A220)

\begin{matrix} = exp \{2 n \cdot log (1 - \frac{16 π}{e^{4}} \frac{C_{n}^{2}}{n} \cdot exp \{- f_{q} \cdot \frac{C_{n}^{2}}{n - C_{n}}\})\} \end{matrix}

(A221)

\begin{matrix} \leq exp \{- \frac{32 π}{e^{4}} C_{n}^{2} \cdot exp \{- f_{q} \cdot \frac{C_{n}^{2}}{n - C_{n}}\}\} \end{matrix}

(A222)

\begin{matrix} \leq exp \{- C_{n}^{2} \cdot exp \{- f_{q} \cdot \frac{C_{n}^{2}}{n - C_{n}}\}\}, \end{matrix}

(A223)

where (A222) follows from the inequality

log (1 - y) \leq - y

, and (A223) holds because $32 π / e^{4} > 1$. This completes the proof of Proposition 5.

References

1. Vujičić, D.; Jagodić, D.; Ranić, S. Blockchain technology, bitcoin, and ethereum: A brief overview. In Proceedings of the 2018 17th International Symposium Infoteh-Jahorina (Infoteh), IEEE, East Sarajevo, Bosnia, 21–23 March 2018; pp. 1–6.
2. Yang, C.-T.; Shih, W.-C.; Huang, C.-L.; Jiang, F.-C.; Chu, W.C.-C. On construction of a distributed data storage system in cloud. Computing 2016, 98, 93–118.
3. Dingledine, R.; Freedman, M.J.; Molnar, D. The free haven project: Distributed anonymous storage service. In Designing Privacy Enhancing Technologies; Springer: Berlin/Heidelberg, Germany, 2001; pp. 67–95.
4. Fidge, C.J. Timestamps in message-passing systems that preserve the partial ordering. Proc. Aust. Comput. Sci. Conf. 1987, 10, 56–66.
5. Mattern, F. Virtual Time and Global States of Distributed Systems; Department of Computer Science, University of Kaiserslautern: Kaiserslautern, Germany, 1989.
6. Waldo, J. A hitchhiker’s guide to the blockchain universe. In Communications of the ACM; ACM: New York, NY, USA, 2019; Volume 62, pp. 38–42.
7. Liu, Q.; Wang, G.; Wu, J. Consistency as a service: Auditing cloud consistency. In Proceedings of the IEEE Transactions on Network and Service Management; IEEE: New York, NY, USA, 2014; Volume 11, pp. 25–35.
8. Kraska, T.; Hentschel, M.; Alonso, G.; Kossmann, D. Consistency rationing in the cloud: Pay only when it matters. In Proceedings of the VLDB Endowment, Lyon, France, 24–28 August 2009; Volume 2, pp. 253–264.
9. Chandra, T.D.; Toueg, S. Unreliable failure detectors for reliable distributed systems. J. ACM (JACM) 1996, 43, 225–267.
10. Hurfin, M.; Raynal, M. A simple and fast asynchronous consensus protocol based on a weak failure detector. Distrib. Comput. 1999, 12, 209–223.
11. Schiper, A. Early consensus in an asynchronous system with a weak failure detector. Distrib. Comput. 1997, 10, 149–157.
12. Aguilera, M.K. Stumbling over consensus research: Misunderstandings and issues. In Replication; Springer: Berlin/Heidelberg, Germany, 2010; pp. 59–72.
13. Borran, F.; Prakash, R.; Schiper, A. Consensus Problem in Wireless Ad Hoc Networks: Addressing the Right Issues. Technical Reports. 2007. Available online: https://infoscience.epfl.ch/record/114619 (accessed on 14 January 2022).
14. Zieliński, P. Indirect Channels: A Bandwidth-Saving Technique for Fault-Tolerant Protocols; Tech. Rep.; University of Cambridge, Computer Laboratory: Cambridge, UK, 2007.
15. Guerraoui, R.; Hurfin, M.; Mostéfaoui, A.; Oliveira, R.; Raynal, M.; Schiper, A. Consensus in asynchronous distributed systems: A concise guided tour. In Advances in Distributed Systems; Springer: Berlin/Heidelberg, Germany, 2000; pp. 33–47.
16. Tanenbaum, A.S.; Wetherall, D. Computer Networks, 5th ed.; Prentice Hall: Hoboken, NJ, USA, 2011.
17. Freiling, F.C.; Guerraoui, R.; Kuznetsov, P. The failure detector abstraction. ACM Comput. Surv. (CSUR) 2011, 43, 1–40.
18. Gray, J.N. Notes on data base operating systems. In Operating Systems; Springer: Berlin/Heidelberg, Germany, 1978; pp. 393–481.
19. Fischer, M.J. The consensus problem in unreliable distributed systems (a brief survey). In International Conference on Fundamentals of Computation Theory; Springer: Berlin/Heidelberg, Germany, 1983; pp. 127–140.
20. Gray, J.; Lamport, L. Consensus on transaction commit. ACM Trans. Database Syst. (TODS) 2006, 31, 133–160.
21. Antoniadis, K.; Guerraoui, R.; Malkhi, D.; Seredinschi, D.-A. State Machine Replication Is More Expensive than Consensus; Technical Reports. 2018. Available online: https://infoscience.epfl.ch/record/256238 (accessed on 14 January 2022).
22. Attiya, H.; Rachman, O. Atomic snapshots in O(n log n) operations. SIAM J. Comput. 1998, 27, 319–340.
23. Lamport, L. Time, clocks, and the ordering of events in a distributed system. In Concurrency: The Works of Leslie Lamport; ACM Books: New York, NY, USA, 2019; pp. 179–196.
24. Lynch, N.A. Distributed Algorithms; Elsevier: Amsterdam, The Netherlands, 1996.
25. Halpern, J.Y.; Tuttle, M.R. Knowledge, probability, and adversaries. J. ACM (JACM) 1993, 40, 917–960.
26. Rubinstein, A. The electronic mail game: Strategic behavior under almost common knowledge. Am. Econ. Rev. 1989, 79, 385–391.
27. Gács, P.; Kurdyumov, G.L.; Levin, L.A. One-dimensional uniform arrays that wash out finite islands. Probl. Peredachi Inform. 1978, 14, 92–96.
28. Mustafa, N.H.; Pekeč, A. Majority consensus and the local majority rule. In International Colloquium on Automata, Languages, and Programming; Springer: Berlin/Heidelberg, Germany, 2001; pp. 530–542.
29. Moreira, A.A.; Mathur, A.; Diermeier, D.; Amaral, L.A. Efficient system-wide coordination in noisy environments. Proc. Natl. Acad. Sci. USA 2004, 101, 12085–12090.
30. Gogolev, A.; Marchenko, N.; Marcenaro, L.; Bettstetter, C. Distributed binary consensus in networks with disturbances. ACM Trans. Auton. Adapt. Syst. (TAAS) 2015, 10, 1–17.
31. Thomas, R.H. A majority consensus approach to concurrency control for multiple copy databases. ACM Trans. Database Syst. (TODS) 1979, 4, 180–209.
32. Breitwieser, H.; Leszak, M. A distributed transaction processing protocol based on majority consensus. In Proceedings of the First ACM SIGACT-SIGOPS Symposium on Principles of Distributed Computing, Ottawa, ON, Canada, 18–20 August 1982; pp. 224–237.
33. Kanrar, S.; Chattopadhyay, S.; Chaki, N. A new hybrid mutual exclusion algorithm in the absence of majority consensus. In Advanced Computing and Systems for Security; Springer: Berlin/Heidelberg, Germany, 2016; pp. 201–214.
34. Mostofi, Y. Binary consensus with gaussian communication noise: A probabilistic approach. In Proceedings of the 2007 46th IEEE Conference on Decision and Control, IEEE, New Orleans, LA, USA, 12–14 December 2007; pp. 2528–2533.
35. Perron, E.; Vasudevan, D.; Vojnovic, M. Using three states for binary consensus on complete graphs. In Proceedings of the IEEE INFOCOM 2009, IEEE, Rio de Janeiro, Brazil, 19–25 April 2009; pp. 2527–2535.
36. Cruise, J.; Ganesh, A. Probabilistic consensus via polling and majority rules. Queueing Syst. 2014, 78, 99–120.
37. Wensley, J.H.; Lamport, L.; Goldberg, J.; Green, M.W.; Levitt, K.N.; Melliar-Smith, P.M.; Shostak, R.E.; Weinstock, C.B. Sift: Design and analysis of a fault-tolerant computer for aircraft control. Proc. IEEE 1978, 66, 1240–1255.
38. Pease, M.; Shostak, R.; Lamport, L. Reaching agreement in the presence of faults. J. ACM (JACM) 1980, 27, 228–234.
39. Santoro, N.; Widmayer, P. Time is not a healer. In Annual Symposium on Theoretical Aspects of Computer Science; Springer: Berlin/Heidelberg, Germany, 1989; pp. 304–313.
40. Padhye, J.; Firoiu, V.; Towsley, D.; Kurose, J. Modeling tcp throughput: A simple model and its empirical validation. In Proceedings of the ACM SIGCOMM’98 conference on Applications, Technologies, Architectures, and Protocols for Computer Communication, Vancouver, BC, Canada, 31 August–4 September 1998; pp. 303–314.
41. Lua, E.K.; Crowcroft, J.; Pias, M.; Sharma, R.; Lim, S. A survey and comparison of peer-to-peer overlay network schemes. IEEE Commun. Surv. Tutor. 2005, 7, 72–93.
42. Abdullah, M.A.; Draief, M. Global majority consensus by local majority polling on graphs of a given degree sequence. Discret. Appl. Math. 2015, 180, 1–10.
43. Gärtner, B.; Zehmakan, A.N. Majority model on random regular graphs. In Proceedings of the Latin American Symposium on Theoretical Informatics, Buenos Aires, Argentina, 16–19 April 2018; pp. 572–583.
44. Tran, L.; Vu, V. Reaching a consensus on random networks: The power of few. Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2020). Leibniz Int. Proc. Inform. 2020, 176, 20:1–20:15.
45. Fountoulakis, N.; Kang, M.; Makai, T. Resolution of a conjecture on majority dynamics: Rapid stabilization in dense random graphs. Random Struct. Algorithms 2020, 57, 1134–1156.
46. Durrett, R. Probability: Theory and Examples, 2nd ed.; Cambridge University Press: Cambridge, UK, 1996.
47. Billingsley, P. Convergence of Probability Measures, 2nd ed.; John Wiley & Sons Inc.: New York, NY, USA, 1999.
48. Csiszár, I. Information-type measures of difference of probability distributions and indirect observations. Stud. Sci. Math. Hung. 1967, 2, 299–318.
49. Kullback, S. A lower bound for discrimination information in terms of variation. IEEE Trans. Inf. Theory 1967, 13, 126–127.
50. Sason, I.; Verdú, S. f-divergence inequalities. IEEE Trans. Inf. Theory 2016, 62, 5973–6006.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

