Profitable Double-Spending Attacks

Jehyuk Jang; Heung-No Lee

doi:10.3390/app10238477

School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology (GIST), 123 Cheomdangwagi-ro, Buk-gu, Gwangju 61005, Korea

* Author to whom correspondence should be addressed.

Appl. Sci. 2020, 10(23), 8477; https://doi.org/10.3390/app10238477

This article belongs to the Special Issue New Trends in Blockchain Technology

Abstract

Our aim in this paper is to investigate the profitability of double-spending (DS) attacks that manipulate an a priori mined transaction in a blockchain. It is well understood that a successful DS attack is established when the proportion of computing power an attacker possesses is higher than that of the honest network. What is not yet well understood is how threatening a DS attack using less than 50% of the computing power can be. Namely, DS attacks at any proportion can be a threat as long as the chance to make a good profit exists. Profit is obtained when the revenue from making a successful DS attack is greater than the cost of carrying one out. We have developed a novel probability theory for calculating a finite-time attack probability. This can be used to size up the attack resources needed to obtain the profit. The results enable us to derive a necessary and sufficient condition on the value of a transaction targeted by a DS attack. Our result is quite surprising: we theoretically show how a DS attack at any proportion of computing power can be made profitable. Given one’s transaction value, the results can also be used to assess the risk of a DS attack. An example of a profitable DS attack against BitcoinCash is provided.

Keywords: blockchain; double-spending attack; fraud risk analysis; profitability; time-finite analysis; probability distribution; combinatorics

1. Introduction

A blockchain is a distributed ledger that originated from the desire to find a novel alternative to centralized ledgers, such as transactions through third parties [1]. Besides serving as a ledger, blockchains have been applied to many areas, e.g., managing the access authority to shared data in a cloud network [2] and averting collusion in e-Auction [3]. In a blockchain network based on the proof-of-work (PoW) mechanism, each miner verifies transactions, tries to put them into a block, and attaches the block to an existing chain by solving a cryptographic puzzle. This series of processes is called mining. However, the success of mining a block is granted only to the single miner who solves the cryptographic puzzle first. The reward of minting a certain amount of coins to the winner motivates more miners to join and remain in the network. As a result, blockchains have been designed so that the validity of transactions is confirmed by many decentralized miners in the network.

A consensus mechanism is programmed for decentralized peers in a network to share a common chain. If a full node succeeds in generating a new block, it has the latest version of the chain. All of the nodes in the network continuously communicate with each other to share the latest chain. A node may encounter more than one mutually different chain. In such a case, it uses a consensus rule to select a single chain. Satoshi Nakamoto suggested the longest chain consensus for the Bitcoin protocol, in which the node selects the longest chain among all competing chains [1]. There are also other consensus rules [4,5], but a common goal of consensus rules is to select the single chain on which the most computational resources have been consumed, based on the belief that it has likely been verified by the largest number of miners.

A double-spending (DS) attack aims to double-spend cryptocurrency for which a corresponding delivery of goods or services has already been completed. The records of payment are written in transactions and shared in a network via the status-quo chain. Thus, to double-spend, attackers need to replace the status-quo chain in the network with their new one after taking the goods or services. For example, under the longest chain consensus, this attack is possible if an attacker builds a chain longer than the status-quo chain. Nakamoto [1] and Rosenfeld [6] have shown that the more computing power is employed, the higher the probability of a successful DS attack. In addition, if an attacker invests more computing power than that invested by the network, the success of a DS attack is guaranteed. Such an attack is called the 51% attack.

In the last few years, unfortunately, blockchain networks have been recentralized [7,8], which makes them vulnerable to DS attacks. To increase the chance of mining blocks, some nodes may form a pool of computing chips. The problem arises when a limited number of pools occupy a major proportion of the computing power in the network. For example, the pie chart (accessed from BTC.com on November 24, 2020) shown in Figure 1 illustrates the proportion of computing power in the Bitcoin network as of January 2020. In the chart, five pools such as F2Pool, BTC.com, Poolin, and Huobi.pool occupy more than 50% of the total computing power of Bitcoin. In a recentralized network, since most computing resources are concentrated in a small number of pools, it may not be difficult for them to conspire to alter the block content for their own benefit if they aim to double-spend. Indeed, there have been a number of reports in 2018 and 2019 in which cryptocurrencies such as Verge, BitcoinGold, Ethereum Classic, Feathercoin, and Vertcoin suffered from DS attacks and millions of US dollars were lost [9].

Figure 1. Computation power distribution among the largest mining pools.

In addition to the recentralization, the advent of rental services that lend computing resources is a concern as well [10]. Rental services such as nicehash.com, which provide a brokerage service between suppliers and consumers, have indeed become available. Such a rental service can be misused to make DS attacks easier. The presence of such computing resource rental services significantly reduces the cost of making a profit from double-spending. This is because renting the required computing power for a few hours is much cheaper than building such a computing network. Indeed, nicehash.com attracts DS attackers to use their service by posting one-hour fees for renting 51% of the total computing power against dozens of blockchain networks on their website Crypto51.App (accessed on 26 November 2020).

Succeeding in a DS attack is possible but is believed to be difficult against a public blockchain supported by a large mining network. Based on the results in [1,6], the 51% attack has been considered the requirement for a successful DS attack [11]. This conclusion, however, should be reconsidered given our result in the sequel that there are significant chances of making a good profit from DS attacks regardless of the proportion of computing power. The problem to consider, therefore, is to analyze the profitability of such attacks.

The analysis of attack profitability requires the ability to predict the time an attack will take, since the profit is a function of time. Studies in [12,13,14,15,16,17,18,19,20] provided DS attack profitability analyses, but their time predictions were not accurate. Specifically, to make the time prediction easier, they either added impractical assumptions to the DS attack model defined by Nakamoto [1] and Rosenfeld [6] or oversimplified the time prediction formula (see Section 6 for details). In contrast, we follow the definition of the DS attack in [1,6], and therefore we need to develop a new set of mathematical tools for the precise analysis of attack profitability, which we report in this paper.

1.1. Contributions

We study the profitability of DS attacks. The concept of cut-time is introduced. Cut-time is defined to be the duration of time from the start time to the end time of an attack. For each DS attempt, the attacker needs to pay the cost of running his mining rig. A rational attacker would not, therefore, continue an attack indefinitely, especially when operating within the regime of less than 50% computing power. To reduce the cost, the attacker needs to figure out how his attack success probability evolves as time progresses. We define a DS attack to be profitable if and only if the expected profit, the difference between revenue and cost (see Equation (33)), is positive. Our contributions are twofold:

First, we theoretically show that DS attacks can be profitable not only in the regime of the 51% attack but also in the sub-50% regime, where the computing power invested by the attacker is smaller than that invested by the target network. Specifically, a necessary and sufficient condition on the minimum value of the target transaction is derived for profitable DS attacks. In the sub-50% regime, we also show that profitable DS attacks necessitate setting a finite cut-time.

Second, we derive novel mathematical results that are useful for an analysis of the attack success time. Specifically, the probability distribution function and the first moment (expectation) of the attack success time are derived. They enable us to estimate the expected profit of a DS attack for a given cut-time. All mathematical results are numerically calculable. All numerical examples of the theoretical results given in this paper are reproducible on our website (https://codeocean.com/capsule/2308305/tree).

1.2. Organization of the Paper

In Section 2, we define the DS attack scenario and the necessary and sufficient conditions required for successful DS attacks. We also define random variables that are useful in analyzing the attack profits. Section 3 comprises the analytic results on the stochastics of the time-finite attack success. In Section 4, we define the profit function of DS attacks, followed by new theoretical results about the conditions for making them profitable. In Section 5, an example analysis of DS attack profitability in the sub-50% regime against the BitcoinCash network is given. Section 6 compares our results with related works. Finally, Section 7 concludes the paper with a summary.

2. The Attack Model

We define the DS attack that we consider throughout this paper. We also define the DS attack achieving (DSA) time, which is the least time spent for an occurrence of double-spending. The DSA time is a random variable derived from a random walk of Poisson counting processes (PCP).

2.1. Attack Scenario

We extend a DS attack scenario which has been considered by Nakamoto [1] and Rosenfeld [6]. Specifically, we add a time-finite attack scenario. There are two groups of miners, the normal group of honest miners and a single attacker. The normal group tends the honest chain.

When the attacker decides to launch a DS attack, he/she makes a target transaction for the payment of goods or services. The target transaction records the transfer of cryptocurrency ownership from the attacker to a victim. We denote by t = 0 the time at which the last block of the honest chain has been generated. At time t = 0, the attacker announces the target transaction to the normal group so that the normal group starts to put it into the honest chain. At the same time t = 0, the attacker makes a fork of the honest chain, which stems from the last block, and builds it in secret. We refer to this secret fork as the fraudulent chain. The fraudulent chain contains a fraudulent transaction, which alters the target transaction in a way that deceives the victim and benefits the attacker.

Before shipping goods or providing services to the attacker, the victim will choose to wait for a few more blocks on the honest chain in addition to the block on which his/her transaction has been entered, i.e., the so-called block confirmation. Karame et al. [21] showed the importance of block confirmation: attackers are able to double-spend against zero block-confirmation even without mining a single block on the fraudulent chain at all. The number of blocks the victim chooses to wait for is referred to as the block confirmation number $N_{BC} \in \mathbb{N}$, which includes the block on which the target transaction is entered.

The attacker chooses to make the fraudulent chain public if his/her attack was successful. An attack is successful if the fraudulent chain is longer than the honest chain after the moment the block confirmation is satisfied. We define two necessary conditions, $G^{(1)}$ and $G^{(2)}$, for the success of a DS attack:

Definition 1.

A DS attack succeeds only if there exists a DS attack achieving (DSA) time $T_{DSA} \in (0, \infty)$ such that

$G^{(1)}$ : (block confirmation) the length of the honest chain for the duration of time $T_{DSA}$ has grown greater than or equal to $N_{BC}$, and
$G^{(2)}$ : (success in PoW competition) the length of the fraudulent chain for the duration of time $T_{DSA}$ has grown longer than that of the honest chain.

A rational attacker will not wait for success indefinitely, since growing the attacker’s chain incurs an expense per unit time for operating the computing power. The attacker thus sets a limit on the end time to cut losses. We refer to this end time as the cut-time $t_{cut} \in \mathbb{R}^{+}$. A sufficient condition for the success of a DS attack can be defined by applying the cut-time $t_{cut}$:

Definition 2.

For a given cut-time $t_{cut} \in \mathbb{R}^{+}$, the success of a DS attack is declared if, and only if, there exists a DSA time $T_{DSA} \in (0, t_{cut})$ at which $G^{(1)}$ and $G^{(2)}$ in Definition 1 have been achieved.

2.2. Stochastic Model

We model the conditions in Definition 2 with a stochastic model. We fit the block generation process using the PCP [22] with a given block generation rate $\lambda$ (blocks per second). As in Nakamoto [1] and Rosenfeld [6], it is conventional to analyze the block generation process of a blockchain using a PCP. A rationale for modeling the block generation process as a PCP is given in Bowden et al. [23], where experiments show the fit of the PCP model to real data samples from a live network.

We denote the lengths of the honest chain and the fraudulent chain over time $t \in (0, \infty]$ by two independent PCPs, $H(t) \in \mathbb{N}_{0}$ with the block generation rate $\lambda_{H}$ (blocks per second) and $A(t) \in \mathbb{N}_{0}$ with the block generation rate $\lambda_{A}$, respectively. Both processes start at the time origin $t = 0$ (at which the DS attack is launched), where both chains are in the zero state, i.e., $H(0) = A(0) = 0$. Each chain independently increases by at most 1 at a time point. An increment of 1 in the counting process occurs when the pertinent network adds a new block to its chain.

We represent the difference between $A(t)$ and $H(t)$ in a discrete-time domain as a random walk $S_{i} \in \mathbb{Z}$ for $i \in \mathbb{N}$. For this purpose, we first define two continuous-time stochastic processes $M(t)$ and $S(t)$, which are respectively defined as

$$M(t) := H(t) + A(t), \tag{1}$$

and

$$S(t) := H(t) - A(t). \tag{2}$$

The first process $M(t)$ is also a PCP [22] with the rate

$$\lambda_{T} := \lambda_{A} + \lambda_{H}. \tag{3}$$

The second process $S(t)$ is the continuous-time analog of the random walk $S_{i} \in \mathbb{Z}$ for $i \in \mathbb{N}$ such that

$$S_{i} := S(T_{i}), \tag{4}$$

where $T_{i}$ is the state progression time defined by

$$T_{i} := \inf \{t \in \mathbb{R}^{+} : M(t) = i\}, \tag{5}$$

which increases as $i$ increases. The random walk $S_{i}$ is a stationary Markov chain starting from $S_{0} = 0$. The state transition probabilities [22] are given by

$$p_{A} := \Pr(S_{i} = n - 1 \mid S_{i-1} = n) = \frac{\lambda_{A}}{\lambda_{T}}, \tag{6}$$

and

$$p_{H} := \Pr(S_{i} = n + 1 \mid S_{i-1} = n) = \frac{\lambda_{H}}{\lambda_{T}}, \tag{7}$$

for all $i \in \mathbb{N}$ and $n \in \mathbb{Z}$. The state transition probabilities $p_{H}$ and $p_{A}$ are the proportions of computing power occupied by the normal miners and by the attacker, respectively.

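We define independent and identically distributed (i.i.d.) state transition random variables $\Delta_{i} \in \{\pm 1\} \sim \mathrm{Bernoulli}(p_{H})$, i.e., $\Delta_{i} = +1$ with probability $p_{H}$ and $\Delta_{i} = -1$ with probability $p_{A}$, as

$$\Delta_{i} := S_{i} - S_{i-1}, \tag{8}$$

for $i \in \mathbb{N}$. Note that, since $S_{0} = 0$, $S_{i} = \sum_{k=1}^{i} \Delta_{k}$.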
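The model above can be illustrated with a short simulation. The following Python sketch draws the first few pairs $(T_{i}, \Delta_{i})$ by relying on two standard properties of Poisson processes: the merged process $M(t)$ has exponential inter-arrival times with rate $\lambda_{T}$, and each arrival independently belongs to the attacker with probability $p_{A}$. The function names `sample_ds` and `random_walk`, the seed, and the numerical values are illustrative assumptions and are not taken from the original analysis.

```python
import random

def sample_ds(p_a, lam_t, n_states, seed=0):
    """Draw (T_i, Delta_i) for i = 1..n_states under the merged-PCP model."""
    rng = random.Random(seed)
    t = 0.0
    pairs = []
    for _ in range(n_states):
        t += rng.expovariate(lam_t)                 # inter-arrival time of M(t)
        delta = -1 if rng.random() < p_a else +1    # attacker block lowers S by 1
        pairs.append((t, delta))
    return pairs

def random_walk(pairs):
    """Partial sums S_i = Delta_1 + ... + Delta_i (S_0 = 0 is omitted)."""
    s, walk = 0, []
    for _, delta in pairs:
        s += delta
        walk.append(s)
    return walk

# Illustrative values only: p_A = 0.35 and lambda_H = 1/600 blocks per second.
p_a, lam_h = 0.35, 1.0 / 600.0
lam_t = lam_h / (1.0 - p_a)      # follows from p_H = lambda_H / lambda_T
pairs = sample_ds(p_a, lam_t, n_states=10)
walk = random_walk(pairs)
```

Each run with a different seed yields a different finite prefix of the state sequence; the walk decreases whenever the attacker mines a block and increases otherwise.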

Definition 3.

A DS attack $\mathrm{DS}(p_{A}, t_{cut}; N_{BC})$ is a random experiment that picks a sample $\omega \in \Omega$. Each element $\omega$ is an infinite-length sequence of pairs of $T_{i}$ and $\Delta_{i}$ in Equations (5) and (8) for all $i \in \mathbb{N}$, i.e.,

$$\omega := ((T_{1}, \Delta_{1}), (T_{2}, \Delta_{2}), \dots, (T_{\infty}, \Delta_{\infty})). \tag{9}$$

The set $\Omega$ is the universal set of all possible $\omega$, i.e.,

$$\Omega := \{\omega \in \{\mathbb{R}^{+} \times \{\pm 1\}\}^{\infty}\}. \tag{10}$$

For a given DS sample $\omega \in \Omega$ and a state index $i \in \mathbb{N}$, we denote projections

$$\pi_{T_{i}}(\omega) := T_{i} \tag{11}$$

and

$$\pi_{\Delta_{i}}(\omega) := \Delta_{i} \tag{12}$$

that retrieve the progression time $T_{i}$ and the transition $\Delta_{i}$ of the $i$-th state, respectively.

2.3. DS Attack Achieving Time

Definition 4.

For a DS sample $\omega$ of $\mathrm{DS}(p_{A}, t_{cut}; N_{BC})$, we define the DSA time $T_{DSA}$ as the least among the state progression times $\pi_{T_{i}}(\omega)$ of the state indices $i$ at which $\omega$ achieves the necessary conditions $G^{(1)}$ and $G^{(2)}$ in Definition 1.

To express $T_{DSA}$ as a random variable, we construct event sets $D_{j}^{(1)} \subset \Omega$ and $D_{i,j}^{(2)} \subset \Omega$. The sets $D_{j}^{(1)}$ for $j \in \{N_{BC}, N_{BC}+1, \dots, \infty\}$ consist of DS samples $\omega$ which achieve the block confirmation $G^{(1)}$ at state $j$ for the first time. The sets $D_{i,j}^{(2)}$ for $i \in \{j, j+1, \dots, \infty\}$ and $j \in \{N_{BC}, N_{BC}+1, \dots, \infty\}$ consist of $\omega$ which achieve the success in the PoW competition $G^{(2)}$ at state $i$ for the first time, given that $G^{(1)}$ has already been achieved at state $j$. Consequently, the samples $\omega \in D_{j}^{(1)} \cap D_{i,j}^{(2)}$ achieve the two conditions in Definition 1 at a state pair $(i, j)$ for the first time.

Formally, we first construct a set $D_{j}^{(1)}$ focusing only on the first $j$ transitions $\Delta_{k}$ for $k = 1, \dots, j$ of DS samples $\omega \in \Omega$ with two requirements: one is that they must have $N_{BC}$ transitions equal to $+1$ and $j - N_{BC}$ transitions equal to $-1$; and the other is that the $j$-th transition $\Delta_{j}$ must be $+1$, to guarantee that $G^{(1)}$ has not been achieved in any state prior to the state $j$. The former requirement implies that all $\omega \in D_{j}^{(1)}$ satisfy $S_{j} = \sum_{k=1}^{j} \pi_{\Delta_{k}}(\omega) = 2N_{BC} - j$. For example, when $N_{BC} = 2$ and $j = 5$, a sequence $(+1, -1, -1, -1, +1, \dots)$ of state transitions satisfies the first requirement, and also satisfies $S_{j} = 2N_{BC} - j$.

We next construct a set $D_{i,j}^{(2)} \subset \Omega$ which does not care about the first $j$ transitions $\Delta_{k}$ for $k = 1, \dots, j$, but only focuses on the interim transitions $\Delta_{m}$ for $m = j+1, \dots, i$. By definition, all sequences $\omega \in D_{i,j}^{(2)}$ must achieve $G^{(1)}$ at the $j$-th state, which implies that they must satisfy $S_{j} = 2N_{BC} - j$. The remaining requirement for each $\omega \in D_{i,j}^{(2)}$ is that the state changes from the starting state $S_{j} = 2N_{BC} - j$ to the state $S_{i} = -1$, while all interim states $S_{k}$ remain non-negative; i.e., $S_{k} \geq 0$ for each $k = j+1, \dots, i-1$.

The sets $D_{j}^{(1)}$ for all $j$ are mutually exclusive, as each of them represents the first satisfaction of the block confirmation condition exactly at the $j$-th state. For example, if $\omega \in D_{5}^{(1)}$ then $\omega \notin D_{6}^{(1)}$, since $\omega$ has already achieved the block confirmation at the 5th state for the first time before reaching the 6th state. The sets $D_{i,j}^{(2)}$ for all $(i, j)$ are also mutually exclusive for the same reason. Thus, their intersections $D_{j}^{(1)} \cap D_{i,j}^{(2)}$ for all $(i, j)$ are also mutually exclusive.

By Definition 4, the attack achieving time $T_{DSA}$ can be measured if there exists an index pair $(i, j)$ such that $\omega \in D_{j}^{(1)} \cap D_{i,j}^{(2)}$. By the mutual exclusivity of $D_{j}^{(1)} \cap D_{i,j}^{(2)}$, if such a pair $(i, j)$ exists, it must be unique. In addition, if $\omega \in D_{j}^{(1)} \cap D_{i,j}^{(2)}$, $T_{DSA}$ equals $\pi_{T_{i}}(\omega)$, since the state progression time $T_{k}$ is non-decreasing as $k$ increases. As a result, $T_{DSA}$ can be rewritten as follows,

$$T_{DSA} = \begin{cases} \pi_{T_{i}}(\omega), & \text{if } \exists (i, j) \in \mathbb{N}^{2} : \omega \in D_{j}^{(1)} \cap D_{i,j}^{(2)}, \\ \infty, & \text{otherwise}. \end{cases} \tag{13}$$
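As a small illustration of Equation (13), the following Python sketch scans a finite prefix of a DS sample and returns the progression time of the first state at which the honest chain holds at least $N_{BC}$ blocks and the fraudulent chain is strictly longer. The function name `dsa_time` and the example time stamps are hypothetical; the sketch also assumes that returning `math.inf` for a finite prefix simply means that the conditions were not met within that prefix.

```python
import math

def dsa_time(omega, n_bc):
    """Return T_DSA for a finite prefix of a DS sample, or math.inf if not reached.

    omega: list of (T_i, Delta_i); Delta_i = +1 is an honest block,
    Delta_i = -1 is an attacker block.
    """
    honest = attacker = 0
    for t_i, delta in omega:
        if delta == +1:
            honest += 1
        else:
            attacker += 1
        # G1: block confirmation reached; G2: fraudulent chain strictly longer.
        if honest >= n_bc and attacker > honest:
            return t_i
    return math.inf

# Transition pattern from the text (N_BC = 2, j = 5) with made-up time stamps.
example = [(1.0, +1), (2.0, -1), (3.0, -1), (4.0, -1), (5.0, +1)]
t_example = dsa_time(example, n_bc=2)
```

In this example the attacker is already ahead when the second honest block arrives, so both conditions are met at the fifth state.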

3. The Attack Probabilities

We aim to calculate the probability distribution function (PDF) of the DSA time $T_{DSA}$. Using this, the success probability of a DS attack with a given cut-time $t_{cut}$ can be obtained as the probability that $T_{DSA} < t_{cut}$. Also, the expectation of the attack success time can be calculated. The expected attack success time will be used in Section 4 to estimate the attack profits.

From Equation (13), the PDF of $T_{DSA}$ requires the probabilities of two random events: one is the state progression time $T_{i}$ in Equation (5); and the other is the event that a given state index $i$ satisfies $\omega \in D_{j}^{(1)} \cap D_{i,j}^{(2)}$. It is well known that $T_{i}$ follows the Erlang distribution [22] given as

$$f_{T_{i}}(t) = \frac{\lambda_{T} (\lambda_{T} t)^{i-1} e^{-\lambda_{T} t}}{(i-1)!}. \tag{14}$$

We provide the probability for the latter event, i.e., $p_{DSA,i} = \Pr(\omega \in D_{j}^{(1)} \cap D_{i,j}^{(2)})$, in the following Lemma 1:

Lemma 1.

For a sample $\omega$ of random experiment $\mathrm{DS}(p_{A}, t_{cut}; N_{BC})$, the probability $p_{DSA,i} = \Pr(\omega \in D_{j}^{(1)} \cap D_{i,j}^{(2)})$ can be computed as

$$p_{DSA,i} = \sum_{j=N_{BC}}^{2N_{BC}} \binom{j-1}{N_{BC}-1} C_{\frac{i-1}{2} - N_{BC},\, 2N_{BC}-j}\; p_{A}^{\frac{i+1}{2}} p_{H}^{\frac{i-1}{2}} + \binom{i-1}{N_{BC}-1} p_{H}^{N_{BC}} p_{A}^{i-N_{BC}} \tag{15}$$

for odd $i > 2N_{BC}$, where $C_{n,m}$ is the ballot number [24] given by

$$C_{n,m} := \begin{cases} \frac{m+1}{n+m+1} \binom{2n+m}{n}, & n, m \in \mathbb{Z}^{+} \cup \{0\}, \\ 0, & \text{otherwise}, \end{cases} \tag{16}$$

and for $i \leq 2N_{BC}$ and for all even-numbered $i$, $p_{DSA,i} = 0$.

Proof.

See Appendix A. □
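To make the role of the ballot number in Equation (16) concrete, the following Python sketch evaluates $C_{n,m}$ and compares it with a brute-force count of $\pm 1$ paths that start at height $m$, end at height $0$ after $2n + m$ steps, and never drop below $0$. Interpreting $C_{n,m}$ as this path count (with one final $-1$ step then taking the walk from $0$ to $-1$) is our reading of how it enters Equation (15); the function names are hypothetical.

```python
from math import comb

def ballot(n, m):
    """Ballot number C_{n,m} of Equation (16); zero for negative arguments."""
    if n < 0 or m < 0:
        return 0
    # The division is exact because the ballot number is an integer.
    return (m + 1) * comb(2 * n + m, n) // (n + m + 1)

def count_paths(n, m):
    """Count +/-1 paths of 2n+m steps from height m to 0 that stay at or above 0."""
    ways = {m: 1}                          # height -> number of admissible paths
    for _ in range(2 * n + m):
        nxt = {}
        for height, w in ways.items():
            for step in (+1, -1):
                if height + step >= 0:
                    nxt[height + step] = nxt.get(height + step, 0) + w
        ways = nxt
    return ways.get(0, 0)

# Compare the closed form with the enumeration on a small grid.
agree = all(ballot(n, m) == count_paths(n, m) for n in range(6) for m in range(6))
```

For $m = 0$ the ballot numbers reduce to the Catalan numbers, e.g., `ballot(3, 0)` evaluates to 5.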

By taking the infinite summation of $p_{DSA,i}$ in Lemma 1 over all indices $i \in \mathbb{N}$, we can compute the probability $\mathbb{P}_{DSA}$ that a DS attack will ever achieve the necessary conditions in Definition 1.

Corollary 1.

For a sample $\omega$ of random experiment $\mathrm{DS}(p_{A}, t_{cut}; N_{BC})$ with $t_{cut} = \infty$, the probability $\mathbb{P}_{DSA}$ has an algebraic expression

$$\mathbb{P}_{DSA} = \begin{cases} 1, & p_{H} \leq p_{A}, \\ 1 - p_{A}^{N_{BC}+1} p_{H}^{N_{BC}} \sum_{j=N_{BC}}^{2N_{BC}} \binom{j-1}{N_{BC}-1} A_{j}, & p_{H} > p_{A}, \end{cases} \tag{17}$$

where

$$A_{j} := p_{A}^{j-2N_{BC}-1} - p_{H}^{j-2N_{BC}-1}. \tag{18}$$

Proof.

See Appendix B. □
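The relation between Lemma 1 and Corollary 1 can be checked numerically. The Python sketch below evaluates the right-hand side of Equation (15) and sums it over $i$, then compares the result with Equation (17). Two choices in the sketch are assumptions rather than statements from the text: the series is truncated at $i = 600$, and the second term of Equation (15) is kept for every $i > 2N_{BC}$, even $i$ included, while the ballot term is used only for odd $i$. The latter reading was adopted because it is the one under which the truncated sum reproduces Equation (17). All function names are hypothetical.

```python
from math import comb

def ballot(n, m):
    """Ballot number C_{n,m} of Equation (16)."""
    if n < 0 or m < 0:
        return 0
    return (m + 1) * comb(2 * n + m, n) // (n + m + 1)

def p_dsa_i(i, p_a, n_bc):
    """Right-hand side of Equation (15); see the stated assumption for even i."""
    if i <= 2 * n_bc:
        return 0.0
    p_h = 1.0 - p_a
    total = comb(i - 1, n_bc - 1) * p_h ** n_bc * p_a ** (i - n_bc)
    if i % 2 == 1:                       # ballot index (i-1)/2 - N_BC is an integer
        n = (i - 1) // 2 - n_bc
        weight = sum(comb(j - 1, n_bc - 1) * ballot(n, 2 * n_bc - j)
                     for j in range(n_bc, 2 * n_bc + 1))
        total += weight * p_a ** ((i + 1) // 2) * p_h ** ((i - 1) // 2)
    return total

def p_dsa_closed(p_a, n_bc):
    """Closed form of Equations (17) and (18)."""
    p_h = 1.0 - p_a
    if p_h <= p_a:
        return 1.0
    s = sum(comb(j - 1, n_bc - 1)
            * (p_a ** (j - 2 * n_bc - 1) - p_h ** (j - 2 * n_bc - 1))
            for j in range(n_bc, 2 * n_bc + 1))
    return 1.0 - p_a ** (n_bc + 1) * p_h ** n_bc * s

# Illustrative check for p_A = 0.3 and N_BC = 2.
series = sum(p_dsa_i(i, 0.3, 2) for i in range(1, 601))
closed = p_dsa_closed(0.3, 2)
```

For values of $p_{A}$ close to 0.5 the series converges more slowly, so a larger truncation index would be needed for the same accuracy.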

From Equation (13), the PDF of $T_{DSA}$ follows the PDF of $T_{i}$ at a given state index $i$ if $\omega \in D_{j}^{(1)} \cap D_{i,j}^{(2)}$ holds at that index, which occurs with probability $p_{DSA,i}$. If no such index $i$ exists, which occurs with probability $1 - \mathbb{P}_{DSA}$, then $T_{DSA} = \infty$. Thus, we can write the PDF $f_{T_{DSA}}$ of $T_{DSA}$ as follows,

$$f_{T_{DSA}}(t) = \sum_{i=2N_{BC}+1}^{\infty} p_{DSA,i}\, f_{T_{i}}(t) + (1 - \mathbb{P}_{DSA})\, \delta(t - \infty), \tag{19}$$

where $\delta(t)$ is the Dirac delta function.

Proposition 1.

The PDF $f_{T_{DSA}}$ has an analytic expression:

$$\begin{aligned} f_{T_{DSA}}(t) ={}& \frac{p_{A} \lambda_{T} e^{-\lambda_{T} t} \left(p_{A} p_{H} (\lambda_{T} t)^{2}\right)^{N_{BC}}}{(2N_{BC})!} \cdot \sum_{j=N_{BC}}^{2N_{BC}} \binom{j-1}{N_{BC}-1}\, {}_{2}F_{3}\!\left(a; b; p_{A} p_{H} (\lambda_{T} t)^{2}\right) \\ &+ \frac{e^{-\lambda_{T} t}}{t} \frac{(p_{H} \lambda_{T} t)^{N_{BC}}}{(N_{BC}-1)!} \left(e^{p_{A} \lambda_{T} t} - \sum_{i=0}^{N_{BC}} \frac{(p_{A} \lambda_{T} t)^{i}}{i!}\right) + (1 - \mathbb{P}_{DSA})\, \delta(t - \infty), \end{aligned} \tag{20}$$

where ${}_{p}F_{q}(a; b; x)$ is the generalized hypergeometric function (see Appendix E for the definition) with the parameter vectors

$$a = \begin{bmatrix} N_{BC} + 1 - j/2 \\ N_{BC} + 1/2 - j/2 \end{bmatrix} \tag{21}$$

and

$$b = \begin{bmatrix} 2N_{BC} + 2 - j \\ N_{BC} + 1 \\ N_{BC} + 1/2 \end{bmatrix}. \tag{22}$$

Proof.

See Appendix C. □

By Definition 2, the probability $\mathbb{P}_{AS}$ that a DS attack $\mathrm{DS}(p_{A}, t_{cut}; N_{BC})$ succeeds equals

$$\mathbb{P}_{AS}(t_{cut}) = \Pr(T_{DSA} < t_{cut}). \tag{23}$$

Note that for the special case of $t_{cut} = \infty$, $\mathbb{P}_{AS}(t_{cut}) = \mathbb{P}_{DSA}$, which coincides with the result in Rosenfeld [6].

It is convenient to define the attack success time $T_{AS}$ of a DS attack as

$$T_{AS} := \begin{cases} T_{DSA}, & \text{if } T_{DSA} < t_{cut}, \\ \text{not defined}, & \text{otherwise}. \end{cases} \tag{24}$$

A random variable for $T_{DSA} \geq t_{cut}$ does not need to be defined since it is not useful. The PDF $f_{T_{AS}}$ of $T_{AS}$ is just a scaled version of $f_{T_{DSA}}(t)$ for $0 < t < t_{cut}$, which is given in Equation (20), with a scaling factor of $\mathbb{P}_{AS}^{-1}$. Formally, the PDF $f_{T_{AS}}(t)$ equals

$$f_{T_{AS}}(t) = \begin{cases} \frac{f_{T_{DSA}}(t)}{\mathbb{P}_{AS}}, & \text{for } 0 \leq t < t_{cut}, \\ 0, & \text{for } t \geq t_{cut}. \end{cases} \tag{25}$$

The expectation of the attack success time is computed as

$$\mathbb{E}_{T_{AS}}(t_{cut}) = \frac{\int_{0}^{t_{cut}} t\, f_{T_{DSA}}(t)\, dt}{\mathbb{P}_{AS}(t_{cut})}. \tag{26}$$
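Equations (23) and (26) can be evaluated numerically without the hypergeometric form of Equation (20). The Python sketch below is one possible route, not the authors' implementation: it obtains the first-success probabilities per state index by dynamic programming over the block counts (instead of Equation (15)), and then uses the identities $\Pr(T_{i} < t) = \Pr(M(t) \geq i)$ and $\int_{0}^{t} s f_{T_{i}}(s)\, ds = (i / \lambda_{T}) \Pr(M(t) \geq i + 1)$, where $M(t)$ is Poisson with mean $\lambda_{T} t$. The function names, the truncation rule for the Poisson sum, and the parameter values are assumptions made for illustration.

```python
import math

def first_success_probs(p_a, n_bc, i_max):
    """probs[i] = Pr(G1 and G2 are first met at state i) for i = 0..i_max."""
    p_h = 1.0 - p_a
    alive = {0: 1.0}                  # honest block count -> probability (no success yet)
    probs = [0.0] * (i_max + 1)
    for i in range(1, i_max + 1):
        nxt = {}
        for h, pr in alive.items():
            a = (i - 1) - h           # attacker block count before this transition
            for nh, na, w in ((h + 1, a, p_h), (h, a + 1, p_a)):
                if nh >= n_bc and na > nh:
                    probs[i] += pr * w
                else:
                    nxt[nh] = nxt.get(nh, 0.0) + pr * w
        alive = nxt
    return probs

def attack_stats(p_a, n_bc, lam_t, t_cut):
    """Return (P_AS(t_cut), E_{T_AS}(t_cut)) as in Equations (23) and (26)."""
    x = lam_t * t_cut
    k_max = int(x + 10.0 * math.sqrt(x) + 50.0)     # assumed truncation point
    probs = first_success_probs(p_a, n_bc, k_max)
    pmf = [math.exp(k * math.log(x) - x - math.lgamma(k + 1))
           for k in range(k_max + 2)]
    tail = [0.0] * (k_max + 3)                      # tail[k] = Pr(M(t_cut) >= k)
    for k in range(k_max + 1, -1, -1):
        tail[k] = tail[k + 1] + pmf[k]
    p_as = sum(probs[i] * tail[i] for i in range(1, k_max + 1))
    num = sum(probs[i] * (i / lam_t) * tail[i + 1] for i in range(1, k_max + 1))
    return p_as, num / p_as

# Illustrative values: p_A = 0.3, N_BC = 1, lambda_H = 1/600 blocks per second.
lam_t = (1.0 / 600.0) / 0.7
p_short, e_short = attack_stats(0.3, 1, lam_t, t_cut=2400.0)
p_long, e_long = attack_stats(0.3, 1, lam_t, t_cut=400.0 / lam_t)
```

With a very long cut-time, `p_long` approaches the value of $\mathbb{P}_{DSA}$ given by Equation (17), while the shorter cut-time yields a smaller success probability.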

The following Proposition 2 gives an explicit formula of $\mathbb{E}_{T_{AS}}$ for the special case when $t_{cut} = \infty$.

Proposition 2.

Let $p_{M} := \max(p_{A}, p_{H})$ and $p_{m} := \min(p_{A}, p_{H})$. If $t_{cut} = \infty$, the expectation $\mathbb{E}_{T_{AS}}(t_{cut})$ has a closed-form expression:

$$\lim_{t_{cut} \to \infty} \mathbb{E}_{T_{AS}}(t_{cut}) = \frac{\lambda_{T}^{-1} \left( \sum_{j=N_{BC}}^{2N_{BC}} \binom{j-1}{N_{BC}-1} Z_{j} + \frac{N_{BC}}{p_{H}} \right)}{\mathbb{P}_{DSA}}, \tag{27}$$

where

$$Z_{j} := p_{A}\, p_{m}^{N_{BC}}\, p_{M}^{-(N_{BC}-j+1)} \left( \frac{2N_{BC} - 2 j p_{m} + 1}{p_{M} - p_{m}} \right) - j\, p_{A}^{-(N_{BC}-j)}\, p_{H}^{N_{BC}}. \tag{28}$$

Proof.

See Appendix B. □

4. Profitable DS Attacks

The previous probabilistic analyses in [1,6] have shown that the success of DS attacks is not guaranteed when $p_{A} < 0.5$. However, DS attacks with $p_{A} < 0.5$ can be vigorously pursued as long as they bring profit.

We analyze the profitability of DS attacks. To this end, we define a profit function $P$ of a DS attack $\mathrm{DS}(C, p_{A}, t_{cut}; N_{BC})$, where $C$ is the value of a fraudulent transaction, in terms of the revenue and the operating expense (OPEX) of the computing power.

The OPEX $X$ (e.g., the rental fee for the computing power) and the block mining reward $R$ tend to increase with respect to $\lambda_{A}$ and the time $t$ consumed during the attack. Thus, $X$ and $R$ are expressed as functions of $\lambda_{A}$ and $t$, and they can be any increasing functions, e.g., linear, exponential, or logarithmic. We define $X$ and $R$, respectively, as follows:

$$X(\lambda_{A}, t) := \gamma \lambda_{A} t\, (\log_{x_{1}} x_{2})^{\lambda_{A}} (\log_{x_{3}} x_{4})^{t} \tag{29}$$

for real constants $\gamma > 0$, $x_{1}, x_{2} > 1$, and $x_{3}, x_{4} > 1$, and

$$R(\lambda_{A}, t) := \beta \lambda_{A} t\, (\log_{r_{1}} r_{2})^{\lambda_{A}} (\log_{r_{3}} r_{4})^{t} \tag{30}$$

for real constants $\beta > 0$, $r_{1}, r_{2} > 1$, and $r_{3}, r_{4} > 1$. We denote the ratio of $\beta$ to $\gamma$ by

$$\mu := \beta \gamma^{-1}. \tag{31}$$
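The cost and reward models of Equations (29) and (30) share one functional form, which the following Python sketch evaluates. It is only an illustration: the helper name `scaled_cost`, the default bases, and the numerical values of $\gamma$, $\beta$, $\lambda_{A}$, and $t$ are assumptions. With equal bases the logarithmic factors equal 1, so both functions reduce to the linear case $\gamma \lambda_{A} t$ and $\beta \lambda_{A} t$.

```python
import math

def scaled_cost(scale, lam_a, t, b1, b2, b3, b4):
    """scale * lam_a * t * (log_{b1} b2)^lam_a * (log_{b3} b4)^t, cf. Eqs. (29)-(30)."""
    return scale * lam_a * t * math.log(b2, b1) ** lam_a * math.log(b4, b3) ** t

def opex(lam_a, t, gamma, x=(2.0, 2.0, 2.0, 2.0)):
    """X(lambda_A, t) of Equation (29); x = (x1, x2, x3, x4)."""
    return scaled_cost(gamma, lam_a, t, *x)

def reward(lam_a, t, beta, r=(2.0, 2.0, 2.0, 2.0)):
    """R(lambda_A, t) of Equation (30); r = (r1, r2, r3, r4)."""
    return scaled_cost(beta, lam_a, t, *r)

# Hypothetical values: lambda_A = 0.001 blocks/s, t = 600 s, gamma = 5, beta = 6.
x_lin = opex(0.001, 600.0, gamma=5.0)
r_lin = reward(0.001, 600.0, beta=6.0)
mu = r_lin / x_lin          # equals beta / gamma in the linear case, Equation (31)
```

Choosing $x_{2} > x_{1}$ makes the first logarithmic factor exceed 1, so the OPEX then grows faster than linearly in $\lambda_{A}$.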

With regard to $P$, if an attack succeeds, the revenue comes from $C$, as it is double-spent, added to the reward $R$ for the blocks mined during the time duration $T_{AS}$, i.e., $R(\lambda_{A}, T_{AS})$. In this case, the cost is the OPEX for the time duration $T_{AS}$, i.e., $X(\lambda_{A}, T_{AS})$. If the attack fails, the cost is the OPEX $X(\lambda_{A}, t_{cut})$ for the time duration $t_{cut}$, and there is no revenue. Hence, for a DS attack $\mathrm{DS}(C, p_{A}, t_{cut}; N_{BC})$, we define $P$ as follows,

$$P := \begin{cases} C + R(\lambda_{A}, T_{AS}) - X(\lambda_{A}, T_{AS}), & \text{if } T_{DSA} < t_{cut}, \\ -X(\lambda_{A}, t_{cut}), & \text{otherwise}. \end{cases} \tag{32}$$

Subsequently, the expected profit function is

$$\begin{aligned} \mathbb{E}_{P} &= \mathbb{P}_{AS}(t_{cut}) \cdot \left(C + \mathbb{E}[R(\lambda_{A}, T_{AS})] - \mathbb{E}[X(\lambda_{A}, T_{AS})]\right) - (1 - \mathbb{P}_{AS}(t_{cut}))\, X(\lambda_{A}, t_{cut}) \\ &= \mathbb{P}_{AS}(t_{cut}) \cdot \left(C + \mathbb{E}[R(\lambda_{A}, T_{AS})]\right) - \mathbb{E}_{X}, \end{aligned} \tag{33}$$

where $\mathbb{E}_{X}$ is the expected OPEX defined as

$$\mathbb{E}_{X} := \mathbb{P}_{AS}(t_{cut})\, \mathbb{E}[X(\lambda_{A}, T_{AS})] + (1 - \mathbb{P}_{AS}(t_{cut}))\, X(\lambda_{A}, t_{cut}). \tag{34}$$

Definition 5.

A DS attack $\mathrm{DS}(C, p_{A}, t_{cut}; N_{BC})$ is said to be profitable if and only if the expected profit $\mathbb{E}_{P} > 0$, where $\mathbb{E}_{P}$ is defined in Equation (33).

The key factor in determining the profitability of DS attacks is the value $C$ of the fraudulent transaction. Thus, attackers would be interested in the minimum value required for profitable DS attacks [25]. Definition 5 implies that a DS attack $\mathrm{DS}(C, p_{A}, t_{cut}; N_{BC})$ is profitable if and only if $C > C_{Req.}$, where the required value of the target transaction $C_{Req.}$ is

$$C_{Req.} = \frac{\mathbb{E}_{X}}{\mathbb{P}_{AS}} - \mathbb{E}[R(\lambda_{A}, T_{AS})]. \tag{35}$$

The following results in Theorem 1 and Theorem 2 focus on the case where both $X(\lambda_{A}, t)$ and $R(\lambda_{A}, t)$ are linearly increasing functions of $\lambda_{A}$ and $t$.

Theorem 1.

Suppose $x_{1} = x_{2}$ and $x_{3} = x_{4}$ in Equation (29), and $r_{1} = r_{2}$ and $r_{3} = r_{4}$ in Equation (30). Then, a DS attack $\mathrm{DS}(C, p_{A}, t_{cut}; N_{BC})$ for any $p_{A} \in (0, 1)$ and for any $t_{cut} \in (0, \infty]$ is profitable if and only if $C > C_{Req.}$, where

$$C_{Req.} = \frac{1 - \mathbb{P}_{AS}(t_{cut})}{\mathbb{P}_{AS}(t_{cut})}\, \gamma \lambda_{A} t_{cut} - (\mu - 1)\, \gamma \lambda_{A}\, \mathbb{E}_{T_{AS}}(t_{cut}). \tag{36}$$

Proof.

Substituting $x_{1} = x_{2}$, $x_{3} = x_{4}$, $r_{1} = r_{2}$, and $r_{3} = r_{4}$ into Equation (35) results in Equation (36). □
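The substitution in the proof can be checked numerically. The following Python sketch implements Equation (36) and the linear-cost version of Equations (33) and (34), and confirms that the expected profit vanishes at $C = C_{Req.}$ and is positive above it. The function names are hypothetical, and the values of $\mathbb{P}_{AS}$, $\mathbb{E}_{T_{AS}}$, $\lambda_{A}$, $\gamma$, and $\mu$ used below are illustrative placeholders, not results from this paper.

```python
def required_value(p_as, e_tas, t_cut, lam_a, gamma, mu):
    """C_Req of Equation (36) for linear X and R."""
    return ((1.0 - p_as) / p_as * gamma * lam_a * t_cut
            - (mu - 1.0) * gamma * lam_a * e_tas)

def expected_profit(c, p_as, e_tas, t_cut, lam_a, gamma, mu):
    """E_P of Equation (33) with X = gamma*lam_a*t and R = beta*lam_a*t."""
    beta = mu * gamma                                    # Equation (31)
    exp_opex = (p_as * gamma * lam_a * e_tas
                + (1.0 - p_as) * gamma * lam_a * t_cut)  # Equation (34)
    return p_as * (c + beta * lam_a * e_tas) - exp_opex

# Placeholder inputs (assumed, for illustration only).
params = dict(p_as=0.2, e_tas=3000.0, t_cut=7200.0, lam_a=0.001, gamma=1.0, mu=1.5)
c_req = required_value(**params)
profit_at_threshold = expected_profit(c_req, **params)
profit_above = expected_profit(c_req + 1.0, **params)
```

With these placeholder inputs, `c_req` evaluates to 27.3 in the units of $\gamma$, and any transaction value above it yields a positive expected profit.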

Theorem 1 shows that not only superior attackers with $p_{A} \in (0.5, 1)$ but also inferior attackers with $p_{A} \in (0, 0.5)$ can expect profitable DS attacks once the value $C$ of the target transaction is set higher than $C_{Req.}$. The threshold $C_{Req.}$ in Equation (36) can be pre-computed before carrying out an attack, as it stochastically estimates the future expected cost for a given position $p_{A} \in (0, 1)$ and a cut-time $t_{cut}$ of an attacker, and a given set of network environment parameters $\gamma$ and $\beta$.

Table 1 and Table 2 list the resources, including $C_{Req.}$, $\mathbb{E}_{X}$, and $\mathbb{E}_{T_{AS}}$, required for profitable DS attacks using $p_{A} = 0.35$ and $p_{A} = 0.4$, respectively, when $t_{cut} = c N_{BC} \lambda_{H}^{-1}$ with $c = 4$. Note that the expectation of the time spent for the block confirmation equals $N_{BC} \lambda_{H}^{-1}$, and we set $t_{cut}$ proportional to it. In other words, as normal traders wait for $N_{BC} \lambda_{H}^{-1}$ seconds on average, attackers should be equally patient and wait for a time duration of the same scale. Note that the $\mathbb{P}_{AS}$ for $N_{BC} = 1$ is smaller than that for $N_{BC} = 3$ because $t_{cut}$ is not long enough. We scaled the results by the parameters $\lambda_{H}$ and $\gamma$, which we will explain how to obtain from the internet in the next subsection.

Table 1. Numerical computations of required resources for profitable double-spending (DS) attacks with $p_{A} = 0.35$ when $t_{cut} = c N_{BC} \lambda_{H}^{-1}$ with $c = 4$.

Table 2. Numerical computations of required resources for profitable DS attacks with $p_{A} = 0.4$ when $t_{cut} = c N_{BC} \lambda_{H}^{-1}$ with $c = 4$.

The following Theorem 2 is for the inferior attackers with

p_{A} \in (0, 0.5)

and shows the importance of setting a finite

t_{c u t}

.

Theorem 2.

Suppose

x_{1} = x_{2}

and

x_{3} = x_{4}

in Equation (29), and

r_{1} = r_{2}

and

r_{3} = r_{4}

in Equation (30). Then, a DS attack

DS (C, p_{A}, t_{c u t}; N_{B C})

with

p_{A} \in (0, 0.5)

is profitable only if

t_{c u t} < \infty

.

Proof.

For any

p_{A} \in (0, 0.5)

, it always holds that

ℙ_{A S} < 1

. In this case, if

t_{c u t} \to \infty

then

C_{Req .} \to \infty

from Equation (36); i.e., infinite value

C

of fraudulent transaction is required for a DS attack

DS (C, p_{A}, t_{c u t}; N_{B C})

to be profitable. Thus, for a DS attack with

p_{A} \in (0, 0.5)

to be profitable, a finite cut-time

t_{c u t} < \infty

must be set. □
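The role of the cut-time in Theorem 2 can be illustrated numerically. The sketch below is a Monte Carlo stand-in, not the analytic method of this paper. It assumes that blocks arrive as a Poisson process with total rate λ_H / (1 - p_A), that each block belongs to the attacker with probability p_A, and that an attack succeeds at the first block time at which the honest chain has at least N_BC blocks and the fraudulent chain is strictly longer. All function names are hypothetical; the values 0.422 and 1.04 for γ and μ are borrowed from Section 5.

```python
import random


def simulate_attack(p_a, n_bc, lambda_h, t_cut, rng):
    """Return the attack success time [s], or None if t_cut expires first."""
    lambda_t = lambda_h / (1.0 - p_a)  # total block rate (assumed model)
    t, honest, attacker = 0.0, 0, 0
    while True:
        t += rng.expovariate(lambda_t)  # waiting time to the next block
        if t > t_cut:
            return None
        if rng.random() < p_a:
            attacker += 1
        else:
            honest += 1
        # Block confirmation achieved and fraudulent chain strictly longer.
        if honest >= n_bc and attacker > honest:
            return t


def estimate(p_a, n_bc, lambda_h, t_cut, trials=4000, seed=1):
    """Monte Carlo estimates of the success probability and mean success time."""
    rng = random.Random(seed)
    times = []
    for _ in range(trials):
        t = simulate_attack(p_a, n_bc, lambda_h, t_cut, rng)
        if t is not None:
            times.append(t)
    p_as = len(times) / trials
    e_tas = sum(times) / len(times) if times else float("nan")
    return p_as, e_tas


def required_value(p_as, e_tas, gamma, mu, lambda_a, t_cut):
    """C_Req of Equation (36)."""
    return ((1.0 - p_as) / p_as * gamma * lambda_a * t_cut
            - (mu - 1.0) * gamma * lambda_a * e_tas)


if __name__ == "__main__":
    p_a, n_bc, lambda_h = 0.35, 5, 1.0 / 600.0
    lambda_a = lambda_h * p_a / (1.0 - p_a)
    for t_cut in (12000.0, 120000.0):
        p_as, e_tas = estimate(p_a, n_bc, lambda_h, t_cut)
        c_req = required_value(p_as, e_tas, 0.422, 1.04, lambda_a, t_cut)
        print(t_cut, round(p_as, 3), round(e_tas), round(c_req, 2))
```

Because the estimated success probability of an inferior attacker saturates below one, extending the cut-time mostly adds cost for failed attempts, and the estimated required value grows roughly linearly with the cut-time.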

Theorem 2 shows that for

p_{A} \in (0, 0.5)

, setting

t_{c u t} = \infty

is expected to incur an infinite deficit. In contrast, for

p_{A} \in (0.5, 1)

, we have verified numerically (details are omitted due to space limitations) that

𝔼_{P}

is an increasing function of

t_{c u t}

; i.e., setting

t_{c u t} = \infty

is the optimal choice in the superior attack regime. Applying

p_{A} \in (0.5, 1)

and

t_{c u t} = \infty

into Equation (36) leads to

ℙ_{A S} = 1

, and thus

C_{Req .}

turns into

C_{Req .} = - (μ - 1) γ λ_{A} 𝔼_{T_{A S}},

(37)

where a closed-form expression of

𝔼_{T_{A S}}

is given in Proposition 2. In this case, if

β > γ

; i.e.,

μ > 1

, DS attacks are always profitable regardless of

C

. According to nicehash.com, most networks maintain

β > γ

by economic equilibrium. As a result, in addition to the results in [1] and [6] that DS attacks with

p_{A} \in (0.5, 1)

guarantee probabilistic success, we show that such attacks guarantee economic gain as well.

5. Practical Example of Profitable DS Attacks against BitcoinCash

We analyze resources required for profitable DS attacks against BitcoinCash network. The resources include the computing power proportion

p_{A}

, expected OPEX

𝔼_{X}

, expected attack success time

𝔼_{T_{A S}}

, and the required value of fraudulent transaction

C_{Req .}

.

To this end, we first recall the parameters involved in block mining reward

R

and the OPEX

X

. The parameters used in Equation (29) and Equation (30) are assumed to satisfy

x_{1} = x_{2}

,

x_{3} = x_{4}

,

r_{1} = r_{2}

, and

r_{3} = r_{4}

which lead to linear functions

X (λ_{A}, t)

and

R (λ_{A}, t)

with respect to

λ_{A}

and

t

. There are three more parameters:

γ

,

β

, and

λ_{H}^{- 1}

. From Equation (29) and Equation (30), the parameter

γ

is the expected cost of generating a block, and the parameter

β

is the reward for generating a block. The parameter

λ_{H}^{- 1}

is the average block generation time of the honest chain. All the parameters are different for each blockchain network.

In BitcoinCash, the reward

β

per block mining was 12.5 BCH (without transaction fees), which is around

β = 0.44

BTC per block mining (as of 26 February 2020). The average block generation time was fixed at

λ_{H}^{- 1} = 600

seconds.

The parameter

γ

is obtainable from nicehash.com. BitcoinCash uses the SHA-256 cryptographic puzzle, for which the unit of computation is a hash. As of 26 February 2020, the rental fee for a computing speed of 1 peta (P) hashes per second for one day was around 0.017 BTC, which was around

1.97 \times 10^{- 7}

BTC per second. In other words, the rental fee was approximately

1.97 \times 10^{- 22}

BTC per hash computed. According to BTC.com, the network’s computing speed was 3.57 exa (E) hashes per second; i.e.,

3.57 E \cdot 600 = 2142 E

hashes are needed to generate one block on average. As a result, the parameter

γ

is obtained as

\begin{matrix} γ = 1.97 \times 10^{- 22} [BTC / hash] \times 2142 E [hashes / block mining] \\ \approx 0.422 [BTC / block mining] . \end{matrix}

(38)
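The arithmetic leading to Equation (38) can be retraced with a few lines of Python. This is only a sketch of the unit conversions described above; the function name and argument names are hypothetical, and the input figures are the ones quoted in the text for 26 February 2020.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def estimate_gamma(fee_btc_per_day, rented_hashrate, network_hashrate, block_time):
    """Expected cost [BTC] of generating one block with rented hash power.

    fee_btc_per_day  : rental fee for `rented_hashrate` during one day [BTC]
    rented_hashrate  : rented computing speed [hashes/s]
    network_hashrate : computing speed of the whole network [hashes/s]
    block_time       : average block generation time [s]
    """
    fee_per_second = fee_btc_per_day / SECONDS_PER_DAY   # BTC per second of rental
    fee_per_hash = fee_per_second / rented_hashrate      # BTC per hash
    hashes_per_block = network_hashrate * block_time     # hashes per block on average
    return fee_per_hash * hashes_per_block


if __name__ == "__main__":
    gamma = estimate_gamma(
        fee_btc_per_day=0.017,     # 1 PH/s for one day
        rented_hashrate=1e15,      # 1 peta hashes per second
        network_hashrate=3.57e18,  # 3.57 exa hashes per second
        block_time=600.0,
    )
    beta = 0.44                    # block reward in BTC, as quoted in the text
    print(gamma, beta / gamma)
```

Without intermediate rounding, the result is slightly below 0.422 BTC; the value in Equation (38) follows when the per-hash fee is first rounded to 1.97 × 10^{-22} BTC.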

Note that

β > γ

. From Equation (37), this relationship makes DS attack

DS (C, p_{A}, t_{c u t}; N_{B C})

with

p_{A} > 0.5

and

t_{c u t} = \infty

always profitable regardless of the value

C

of target transaction.

In case of DS attacks with

p_{A} < 0.5

, the cut-time

t_{c u t}

must be set to a finite value for DS attacks to be profitable, by Theorem 2. We set

t_{c u t} = c N_{B C} λ_{H}^{- 1} = 12000

seconds with

c = 4

and

p_{A} = 0.35

. We compute the resources required for profitable DS attacks against BitcoinCash when

N_{B C} = 5

. Results are obtainable from the values in Table 1 and Table 2 by multiplying by the scaling parameters

γ = 0.422

and

λ_{H}^{- 1} = 600

and by substituting

μ = β γ^{- 1} = 1.04

and

c = 4

.

As a result, we obtain

ℙ_{A S} \approx 0.218

,

𝔼_{T_{A S}} \approx 5200

seconds,

𝔼_{X} \approx 3.98

BTC, and

C_{Req .} \approx 16.22

BTC. One can compute the expected running time, i.e., the expected time spent on a single DS attack attempt, as

ℙ_{A S} 𝔼_{T_{A S}} + (1 - ℙ_{A S}) t_{c u t}

, which is around 2 h and 55 min. That is to say, attackers can repeatedly perform

n

attacks, one every 2 h and 55 min on average. With the value

C

of the target transaction, by the strong law of large numbers, the repeated attack attempts will return a net profit of

n ℙ_{A S} (t_{c u t}) \cdot (C - C_{Req .})

as

n \to \infty

with probability 1.
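The figures of this example can be cross-checked with a short Python sketch. It takes the success probability (0.218) and the expected success time (5200 s) as given from the text rather than recomputing them. It further assumes that the attacker's block rate follows from p_A = λ_A / (λ_A + λ_H) and that the expected OPEX equals γ λ_A times the expected running time; both assumptions are consistent with the reported numbers. The function name and the dictionary keys are hypothetical.

```python
def bitcoincash_example():
    """Reproduce the Section 5 numbers from Eq. (36) under the stated assumptions."""
    p_a, n_bc, c = 0.35, 5, 4
    block_time = 600.0           # 1 / lambda_H [s]
    gamma, beta = 0.422, 0.44    # cost and reward per block [BTC]
    p_as, e_tas = 0.218, 5200.0  # taken from the text, not recomputed here

    lambda_h = 1.0 / block_time
    lambda_a = lambda_h * p_a / (1.0 - p_a)  # assumes p_A = lambda_A / (lambda_A + lambda_H)
    t_cut = c * n_bc * block_time
    mu = beta / gamma

    # Expected time spent on one attempt: success time or full cut-time.
    running_time = p_as * e_tas + (1.0 - p_as) * t_cut
    # Expected OPEX, assuming the cost is linear in rate and time.
    expected_opex = gamma * lambda_a * running_time
    # Required transaction value, Equation (36).
    c_req = ((1.0 - p_as) / p_as * gamma * lambda_a * t_cut
             - (mu - 1.0) * gamma * lambda_a * e_tas)
    return {"t_cut": t_cut, "running_time": running_time,
            "expected_opex": expected_opex, "c_req": c_req, "p_as": p_as}


def expected_profit_per_attempt(c_value, result):
    """Average net profit per attempt, P_AS * (C - C_Req), as in the text."""
    return result["p_as"] * (c_value - result["c_req"])


if __name__ == "__main__":
    r = bitcoincash_example()
    print(r)
    print(expected_profit_per_attempt(20.0, r))  # C = 20 BTC is a hypothetical value
```

Under these assumptions the sketch returns a running time of about 10,518 s (roughly 2 h and 55 min), an expected OPEX of about 3.98 BTC, and a required value of about 16.22 BTC, in line with the values above.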

6. Related Works

Nakamoto [1] and Rosenfeld [6] studied the probability that a DS attack will ever succeed when there is no time limit, i.e., when the cut-time is set to

t_{c u t} = \infty

. Both of them applied PCPs to model the growth of chains

H (t)

and

A (t)

. On the one hand, the main difference between them was in the probability calculations of the block confirmation process in Definition 1. Rosenfeld applied the PCPs to both

H (t)

and

A (t)

, whereas Nakamoto assumed the time spent for

H (t) \geq N_{B C}

to be deterministic to simplify the calculation. On the other hand, they both used the gambler’s ruin approach to obtain the asymptotic behavior of

S_{i}

as

i \to \infty

by manipulating the recurrence relationship between two adjacent states. Namely, their results are based on an assumption that an indefinite number of attack chances are given [12].

On the contrary, we introduce the cut-time

t_{c u t}

, which generalizes the analytical framework to the more interesting regime of finite attack time and inferior attackers. By setting

t_{c u t}

to infinity, we obtain the same result

ℙ_{D S A}

as in [6]. By setting a finite

t_{c u t}

, our results are useful when attack chances are limited by the available resources such as time and cost. In addition, we show in Theorem 2 that DS attacks with

p_{A} < 0.5

must set a finite

t_{c u t}

in order to expect a non-negative profit. Note that no previous work has provided an intermediate result like

p_{D S A, i}

in Lemma 1. We use Lemma 1 to derive the novel results.

Rosenfeld [6] and Bissias et al. [13] have analyzed the profitability of DS attacks. However, they put additional assumptions on the attack scenario to simplify the calculation of the attack time. Specifically, Rosenfeld assumed the attack time to be a constant. Bissias et al. assumed that the attack stops if either the normal peers or the attacker achieves the block confirmation first. On the contrary, in our model, an attack can be continued for a random attack time as long as it brings profit, even if the normal peers achieve the block confirmation before the attacker does.

In Zaghloul et al. [14], the profit of DS attack has been analyzed. Interestingly, they have discussed the need of cut-time for DS attacks with

p_{A} < 0.5

, which is theoretically proven in this paper in Theorem 2. They also calculated the profit of DS attacks with a finite time-limit (see Section IV-C in [14]), but their calculation was not as precise as ours in three respects:

First, the probability of attack success within a finite time-limit, i.e.,

ℙ_{A S} (t_{c u t})

in Equation (23) was never considered, which requires the distribution of the DS achieving time, i.e., T_{D S A} given in Proposition 1. Instead, their calculation used

ℙ_{D S A}

referring to the result in Rosenfeld [6]. This contradicts their time-limited attack scenario, since

ℙ_{D S A}

in [6] resulted from the assumption of an infinite time-limit.

Second, they approximated costs and revenues of DS attack spent within a time-limit. Estimation of the costs and revenues requires estimations of the numbers of blocks respectively mined by honest nodes and attackers within a time-limit, but those were assumed to be constant. This was due to the absence of the time analysis we provide in Proposition 1.

Third, they assumed the average block generation rates

λ_{H}

,

λ_{A}

respectively by honest miners and by attackers are always the same. Since the proportions

p_{H}

,

p_{A}

of computing power occupied by the two groups can be quite different in general, such a result is not very useful. We agree with their assumption that most blockchains control the difficulty of the block mining puzzle to keep the average speed of block generation constant, and thus

λ_{H}

can be considered as a constant. However,

λ_{A}

should be left as a quantity that varies with

p_{A}

. The fact is that the computing power invested by the attacker cannot be monitored by the honest network and thus it cannot be reflected in the difficulty control routine.

Budish [15] conducted simulations on the profitability of DS attacks only in the cases of

p_{A} > 0.5

. For these cases, a condition on the value of the target transaction that makes DS attacks unprofitable was given based on the simulations. We give theoretical and numerically calculable results for any

p_{A} \in (0, 1)

, which do not require massive simulations.

Gervais et al. [16] and Sompolinsky et al. [12] have used a Markov decision process (MDP) to analyze profits from DS attacks. These works differ from our contributions in the following regards:

First, they did not follow the DS attack scenario considered by Nakamoto [1] and Rosenfeld [6]. Instead, the scenario in [12] was a special case of the pre-mining strategy which was introduced in [17,18]. We show that the success of a DS attack under this scenario is even less likely than the success of a DS attack under the scenario of Nakamoto and Rosenfeld (see Appendix D for details). Also, the attack scenario in [16] went even further by modifying the condition for block confirmation in Definition 1. Specifically, under our definition, it is required for the honest chain to have added

N_{B C}

blocks, while under their condition it was the fraudulent chain that was required to do so (see Section 3 of [16]). Thus, it was not ensured that the potential victim had shipped the goods or service, and an attack success did not guarantee that the attacker obtained the benefit of attacking.

Second, one important new advance in this paper is the derivation of the time analysis

f_{T_{A S}}

given in Proposition 1. When one uses the MDP framework, one can obtain similar information such as the estimations for the attack success time

𝔼_{T_{A S}}

, the future profit P that an attacker will earn in the end, and the minimum value of target transaction

C_{Req .}

. However, using MDP to make such estimations would have required extensive Monte Carlo simulations. Using our mathematical results, such estimations can be obtained without Monte Carlo simulations.

In addition, we believe that our mathematical results can be utilized in the MDP frameworks to improve the reliability of analyses. Conventionally, a rational user of an MDP will make a decision at every state whether to stop or to continue the process by comparing the rewards that will be incurred in the future by his/her decision. The rewards for stop actions are clear because such actions are either an attack success or a give-up. The reward for the continue action is complex because it needs to consider all the actions in all future possible states as well. In [12,16], the rewards for the continue action were over-simplified as they were evaluated only for the very next state and did not include the estimation of the profits in further future actions. To improve the reliability, the PDF

f_{T_{A S}}

in Proposition 1 can be used at any intermediate Markov state to estimate the future profits. Specifically, the conditional expectation of the time left for an attack success

T_{A S}

given

T_{A S} > τ

can be calculated using

f_{T_{A S}}

, where

τ

is the observable time elapsed for reaching the current state. Once the time left is estimated, the estimation of future profits can be updated by substituting it into Equation (33). That is to say, at each state, the estimation of profits can be updated and used as the rewards resulting from the continue action.

Goffard [19] and Karame et al. [20] have derived the PDFs of attack success time, but none of their DS attack scenarios matched with ours in Definition 1. In [19], Goffard derived the PDF of catch-up time spent for the fraudulent chain to catch up with the honest chain given that the length of honest chain is initially ahead by several blocks. The author used counting processes such as order statistic point process and renewal process which are more general than PCP, but there was no analytic result similar to what is given in Proposition 1. In [20], Karame et al. derived the PDF of the first attack success time under a fast-payment model which fixed

N_{B C} = 0

. To sum up, the attack success time in neither analysis included the time spent for achieving the first condition: the block confirmation should be realized.

7. Discussion and Conclusions

We showed that DS attacks using 50% or a lower proportion of computing power can be profitable and thus quite threatening. We quantified the resources required to make a profitable DS attack. We derived the PDF of the attack success time, which enables us to figure out the operating time and the expense of mining rigs. We provided MATLAB codes on the website (https://codeocean.com/capsule/2308305/tree) for numerical evaluation of the expected profit function in Equation (33). We also listed an example of the minimum resources required for a profitable DS attack, which is applicable to any blockchain network by substituting the network parameters

γ

,

β

, and

λ_{H}

. We also showed a more specific example of the required resources against BitcoinCash network.

Our results quantitatively guide how to set a block confirmation number for a safe transaction. The lower the block confirmation number, the lower the minimum resources required for a profitable attack. Network developers can use this result to discourage such attacks. On the one hand, given a block confirmation number, the value of any transaction can be kept below the value required for a profitable attack in a given network. On the other hand, given the value of a transaction, the network can provide a service that informs the payee of the lowest block confirmation number that leads to a negative DS attack profit.

Author Contributions

Conceptualization, J.J. and H.-N.L.; methodology, J.J.; software, J.J.; validation, J.J.; formal analysis, J.J.; investigation, J.J.; resources, J.J.; data curation, J.J.; writing—original draft preparation, J.J.; writing—review and editing, J.J. and H.-N.L.; visualization, J.J. and H.-N.L.; supervision, H.-N.L.; project administration, H.-N.L.; funding acquisition, H.-N.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially funded by Institute of Information & Communications Technology Planning & Evaluation, grant number 2020-0-00958. This work was partially supported by a National Research Foundation of Korea (NRF) grant funded by the Korean government (MSIP) (NRF-2018R1A2A1A19018665).

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Appendix A

Proof of Lemma 1.

For a given sample

ω

and a given index

i

, the event

ω \in D_{j}^{(1)} \cap D_{i, j}^{(2)}

is equivalent to the event that there exists an intermediate state index

j

such that

ω \in D_{j}^{(1)} \cap D_{i, j}^{(2)}

. By the mutual exclusiveness of

D_{j}^{(1)} \cap D_{i, j}^{(2)}

for integers

j

, such a state

j

is unique if it exists. Thus, we can write the probability

p_{D S A, i}

as follows,

\begin{array}{l} p_{D S A, i} & = \Pr (\exists j \in ℕ : ω \in D_{j}^{(1)} \cap D_{i, j}^{(2)}) \\ = \sum_{j = N_{B C}}^{\infty} \Pr (ω \in D_{j}^{(1)} \cap D_{i, j}^{(2)}) . \end{array}

(A1)

Note that

D_{j}^{(1)} \cap (D_{i, j}^{(2)}) = ϕ

for

i \leq 2 N_{B C}

, since the minimum number of states for an attack success is

2 N_{B C} + 1

:

N_{B C}

number of

+ 1

’s state transitions for the block confirmation; and

N_{B C} + 1

number of

- 1

’s state transitions for the success of PoW competition. Thus,

p_{D S A, i} = 0

for

i \leq 2 N_{B C}

. □

We further explore

D_{j}^{(1)}

and

D_{i, j}^{(2)}

. We divide the domain of state index

j

in Equation (A1) into two exclusive domains; one is

j \leq 2 N_{B C}

; and the other is

j > 2 N_{B C}

. First, for

j \leq 2 N_{B C}

, two sets

D_{j}^{(1)}

and

D_{i, j}^{(2)}

are independent, since their requirements on the state transitions are focusing on disjoint indices of state by their definitions. Formally,

\Pr (ω \in D_{j}^{(1)} \cap D_{i, j}^{(2)}) =

\Pr (ω \in D_{j}^{(1)}) \Pr (ω \in D_{i, j}^{(2)})

. Second, we explore the domain

j > 2 N_{B C}

. By the definition of

D_{j}^{(1)}

, all

ω \in D_{j}^{(1)}

satisfy

S_{j} =

\sum_{k = 1}^{j} π_{Δ_{k}} (ω) =

2 N_{B C} - j

. Thus, for every

j > 2 N_{B C}

,

S_{j}

is already negative, which implies all

ω \in D_{j}^{(1)}

satisfy both the block confirmation and the success of the PoW competition at state

j

. The set

D_{i, j}^{(2)} = ϕ

for

j > 2 N_{B C}

and

j < i

, since the state

S_{j} = 2 N_{B C} - j

contradicts one requirement of

D_{i, j}^{(2)}

: the interim states between the state indices

j

and

i

should be non-negative. For

j > 2 N_{B C}

and

j = i

, let us set

D_{i, j}^{(2)} = Ω,

since there is no interim state to apply the requirement to. To sum up,

D_{j}^{(1)} \cap D_{i, j}^{(2)} = D_{i}^{(1)}

for

j > 2 N_{B C}

and

i = j

, and

D_{j}^{(1)} \cap (D_{i, j}^{(2)}) = ϕ

for

j > 2 N_{B C}

and

i > j

. Subsequently, Equation (A1) is computed as

p_{D S A, i} = \sum_{j = N_{B C}}^{2 N_{B C}} \Pr (ω \in D_{j}^{(1)}) \Pr (ω \in D_{i, j}^{(2)}) + \Pr (ω \in D_{i}^{(1)}) .

(A2)

We now compute the ingredient probabilities

\Pr (ω \in D_{j}^{(1)})

and

\Pr (ω \in D_{i, j}^{(2)})

in Equation (A2). First, by the definition, all samples in

D_{j}^{(1)}

must have

N_{B C} - 1

number of

+ 1

’s state transitions among the first

j - 1

transitions, and the rest of these

j - 1

transitions must be valued by

- 1

. In addition, the

j

-th transition must be valued by

+ 1

so that the block confirmation is achieved exactly at the

j

-th state index. As a result, the probability

\Pr (ω \in D_{j}^{(1)})

equals the probability mass function of a negative binomial distribution:

\Pr (ω \in D_{j}^{(1)}) = (\begin{matrix} j - 1 \\ N_{B C} - 1 \end{matrix}) p_{H}^{N_{B C}} p_{A}^{j - N_{B C}} .

(A3)

Second, computing the probability

\Pr (ω \in D_{i, j}^{(2)})

starts from counting the number of combinations of state transitions satisfying the requirements of set

D_{i, j}^{(2)}

. Recall the requirements on every element of

D_{i, j}^{(2)}

, for

j = N_{B C}, \dots, 2 N_{B C}

, are that the state starts from the state

S_{j} = 2 N_{B C} - j

and ends at the state

S_{i} = - 1

while all the

i - j - 1

number of interim states remain nonnegative. The

i

-th transition should be

Δ_{i} = - 1

so that the success of PoW competition is achieved exactly at the state index i. The number of combinations of such state transitions can be counted using the ballot number

C_{n, m}

[24], which is the number of random walks that consist of

2 n + m

steps and never become negative, starting from the origin and ending at the point

m

. In our problem, the number of random walk steps is

2 n + m = i - j - 1

with

m = 2 N_{B C} - j

. As a result, by multiplying the probabilities

p_{A}

and

p_{H}

for state transitions, the probability

\Pr (ω \in D_{i, j}^{(2)})

is computed as

\Pr (ω \in D_{i, j}^{(2)}) = C_{n, m} p_{A}^{(n + m + 1)} p_{H}^{n},

(A4)

where

2 n + m = i - j - 1

and

m = 2 N_{B C} - j

.

Finally, substituting Equations (A3) and (A4) into Equation (A2) results in Equation (15).
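The counting argument above can be sketched in Python. The code evaluates Equations (A2) to (A4) directly, using the standard closed form (m + 1) / (n + m + 1) · binom(2n + m, n) for the ballot number. The function names are hypothetical, and the infinite sum over the state index is truncated, which is adequate only when p_A is not too close to 0.5.

```python
from math import comb


def ballot(n, m):
    """Ballot number C_{n,m}: non-negative walks of 2n + m steps between 0 and m."""
    return (m + 1) * comb(2 * n + m, n) // (n + m + 1)


def p_block_confirmation(j, n_bc, p_a):
    """Eq. (A3): block confirmation achieved exactly at state index j."""
    p_h = 1.0 - p_a
    return comb(j - 1, n_bc - 1) * p_h**n_bc * p_a**(j - n_bc)


def p_pow_success(i, j, n_bc, p_a):
    """Eq. (A4): PoW competition won exactly at index i, starting from index j."""
    p_h = 1.0 - p_a
    m = 2 * n_bc - j               # state S_j at block confirmation
    two_n = i - j - 1 - m          # from 2n + m = i - j - 1
    if two_n < 0 or two_n % 2:     # no admissible walk (parity or length)
        return 0.0
    n = two_n // 2
    return ballot(n, m) * p_a**(n + m + 1) * p_h**n


def p_dsa(i, n_bc, p_a):
    """Eq. (A2): probability that the DS attack is achieved exactly at index i."""
    if i <= 2 * n_bc:
        return 0.0
    total = sum(p_block_confirmation(j, n_bc, p_a) * p_pow_success(i, j, n_bc, p_a)
                for j in range(n_bc, 2 * n_bc + 1))
    return total + p_block_confirmation(i, n_bc, p_a)


if __name__ == "__main__":
    n_bc, p_a = 1, 0.3
    print(p_dsa(3, n_bc, p_a))  # the three length-3 paths, 0.063 each
    print(sum(p_dsa(i, n_bc, p_a) for i in range(2 * n_bc + 1, 2 * n_bc + 601)))
```

For N_BC = 1 and p_A = 0.3, the three admissible paths of length three each have probability 0.063, and the truncated sum approaches 0.18 + 0.09 / 0.7 ≈ 0.3086, which can be verified by hand by conditioning on the number of attacker blocks mined before the first honest block.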

Appendix B

Proof of Corollary 1.

Taking the infinite summation of

p_{D S A, i}

for all indices

i

results in

ℙ_{D S A}

:

ℙ_{D S A} = \sum_{i = 2 N_{B C} + 1}^{\infty} p_{D S A, i}

(A5)

By substituting

p_{D S A, i}

in Lemma 1 into Equation (A5), the probability

ℙ_{D S A}

becomes

ℙ_{D S A} = \sum_{j = N_{B C}}^{2 N_{B C}} (\begin{matrix} j - 1 \\ N_{B C} - 1 \end{matrix}) p_{A} \sum_{i = 2 N_{B C} + 1}^{\infty} C_{\frac{i - 1}{2} - N_{B C}, 2 N_{B C} - j} {(p_{A} p_{H})}^{\frac{i - 1}{2}} + {(\frac{p_{H}}{p_{A}})}^{N_{B C}} \sum_{i = 2 N_{B C} + 1}^{\infty} (\begin{matrix} i - 1 \\ N_{B C} - 1 \end{matrix}) p_{A}^{i} .

(A6)

By rearranging the indices

i

in the summations, we can obtain

\begin{array}{l} ℙ_{D S A} & = \sum_{j = N_{B C}}^{2 N_{B C}} (\begin{matrix} j - 1 \\ N_{B C} - 1 \end{matrix}) p_{A} \sum_{i = 0}^{\infty} C_{i, 2 N_{B C} - j} {(p_{A} p_{H})}^{i + N_{B C}} \\ + {(\frac{p_{H}}{p_{A}})}^{N_{B C}} (\sum_{i = N_{B C}}^{\infty} (\begin{matrix} i - 1 \\ N_{B C} - 1 \end{matrix}) p_{A}^{i} - \sum_{i = N_{B C}}^{2 N_{B C}} (\begin{matrix} i - 1 \\ N_{B C} - 1 \end{matrix}) p_{A}^{i}) . \end{array}

(A7)

We define two generating functions as

M_{k} (x) : = \sum_{i = 0}^{\infty} C_{i, k} x^{i},

(A8)

and

G_{k} (x) : = \sum_{i = k}^{\infty} (\begin{matrix} i \\ k \end{matrix}) x^{i} .

(A9)

By substituting

M_{k}

and

G_{k}

into Equation (A7), we can write

\begin{array}{l} ℙ_{D S A} & = \sum_{j = N_{B C}}^{2 N_{B C}} (\begin{matrix} j - 1 \\ N_{B C} - 1 \end{matrix}) p_{A} {(p_{A} p_{H})}^{N_{B C}} M_{2 N_{B C} - j} (p_{A} p_{H}) \\ + {(\frac{p_{H}}{p_{A}})}^{N_{B C}} (p_{A} G_{N_{B C} - 1} (p_{A}) - \sum_{i = N_{B C}}^{2 N_{B C}} (\begin{matrix} i - 1 \\ N_{B C} - 1 \end{matrix}) p_{A}^{i}) \end{array}

(A10)

The function

M_{k} (x)

is a generating function of the ballot numbers

C_{i, k}

, for which the algebraic expression given in [26] is

M_{k} (x) = {(\frac{2}{1 + \sqrt{1 - 4 x}})}^{k + 1} .

(A11)

Putting

x = p_{A} p_{H}

into

M_{k} (x)

results in

\begin{array}{l} M_{k} (p_{A} p_{H}) & = {(\frac{2}{1 + \sqrt{1 - 4 p_{A} p_{H}}})}^{k + 1} \\ = \{\begin{matrix} {(\frac{2}{1 + \sqrt{1 - 4 p_{A} (1 - p_{A})}})}^{k + 1}, & i f p_{A} < p_{H}, \\ {(\frac{2}{1 + \sqrt{1 - 4 (1 - p_{H}) p_{H}}})}^{k + 1}, & i f p_{A} \geq p_{H} \end{matrix} \\ = {(\frac{1}{p_{M}})}^{k + 1}, \end{array}

(A12)

where

p_{M} : = \max (p_{H}, p_{A})

. The function

G_{k} (x)

is a generating function of binomial coefficients, and the algebraic expression for it is given in [27]:

G_{k} (x) = \frac{x^{k}}{{(1 - x)}^{k + 1}} .

(A13)

Putting

x = p_{A}

into

G_{k} (x)

results in

G_{k} (p_{A}) = p_{H}^{- 1} {(\frac{p_{A}}{p_{H}})}^{k} .

(A14)
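The closed forms in Equations (A11) to (A14) can be checked numerically against truncated versions of the series in Equations (A8) and (A9). The following Python sketch does so for one illustrative value of p_A; the function names are hypothetical, and the truncation is adequate only away from p_A = 0.5, where the ballot-number series converges slowly.

```python
from math import comb, sqrt


def ballot(n, m):
    """Ballot number C_{n,m} = (m + 1) / (n + m + 1) * binom(2n + m, n)."""
    return (m + 1) * comb(2 * n + m, n) // (n + m + 1)


def m_series(k, x, terms=300):
    """Truncated generating function of Eq. (A8)."""
    return sum(ballot(i, k) * x**i for i in range(terms))


def m_closed(k, x):
    """Closed form of Eq. (A11)."""
    return (2.0 / (1.0 + sqrt(1.0 - 4.0 * x))) ** (k + 1)


def g_series(k, x, terms=300):
    """Truncated generating function of Eq. (A9)."""
    return sum(comb(i, k) * x**i for i in range(k, k + terms))


def g_closed(k, x):
    """Closed form of Eq. (A13)."""
    return x**k / (1.0 - x) ** (k + 1)


if __name__ == "__main__":
    p_a, k = 0.3, 2
    p_h = 1.0 - p_a
    p_max = max(p_a, p_h)
    # Eq. (A12): M_k(p_A p_H) equals (1 / p_M)^(k + 1).
    print(m_series(k, p_a * p_h), m_closed(k, p_a * p_h), (1.0 / p_max) ** (k + 1))
    # Eq. (A14): G_k(p_A) equals (1 / p_H) (p_A / p_H)^k.
    print(g_series(k, p_a), g_closed(k, p_a), (p_a / p_h) ** k / p_h)
```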

Substituting Equation (A12) and Equation (A14) into Equation (A10) provides

ℙ_{D S A} = \sum_{j = N_{B C}}^{2 N_{B C}} (\begin{matrix} j - 1 \\ N_{B C} - 1 \end{matrix}) p_{A} {(p_{A} p_{H})}^{N_{B C}} p_{M}^{- (2 N_{B C} - j + 1)} + 1 - {(\frac{p_{H}}{p_{A}})}^{N_{B C}} \sum_{i = N_{B C}}^{2 N_{B C}} (\begin{matrix} i - 1 \\ N_{B C} - 1 \end{matrix}) p_{A}^{i} .

(A15)

We define

p_{m} : = \min (p_{A}, p_{H}),

then the relationship

p_{A} p_{H} = p_{m} p_{M}

holds. By rearranging the order of operands, we can obtain

ℙ_{D S A} = 1 - \sum_{j = N_{B C}}^{2 N_{B C}} (\begin{matrix} j - 1 \\ N_{B C} - 1 \end{matrix}) ({(\frac{p_{H}}{p_{A}})}^{N_{B C}} p_{A}^{j} - \frac{p_{A}}{p_{M}} {(\frac{p_{m}}{p_{M}})}^{N_{B C}} p_{M}^{j}),

(A16)

which is equal to Equation (17). □
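As a sanity check of Equation (A16), the Python sketch below compares it with an independent calculation in the spirit of the gambler's ruin argument: it conditions on the state index at which the block confirmation is achieved and multiplies by the classical catch-up probability min(1, p_A / p_H) raised to the number of blocks still to be gained. This cross-check is an illustration under those assumptions, not the derivation used in this paper, and the function names are hypothetical.

```python
from math import comb


def p_dsa_closed_form(n_bc, p_a):
    """DS attack achieving probability from Eq. (A16)."""
    p_h = 1.0 - p_a
    p_max, p_min = max(p_a, p_h), min(p_a, p_h)
    total = 0.0
    for j in range(n_bc, 2 * n_bc + 1):
        first = (p_h / p_a) ** n_bc * p_a**j
        second = (p_a / p_max) * (p_min / p_max) ** n_bc * p_max**j
        total += comb(j - 1, n_bc - 1) * (first - second)
    return 1.0 - total


def p_dsa_gamblers_ruin(n_bc, p_a):
    """Cross-check: condition on the block-confirmation index, then catch up."""
    p_h = 1.0 - p_a
    catch_up = min(1.0, p_a / p_h)  # probability of ever gaining one block
    prob = 1.0
    for j in range(n_bc, 2 * n_bc + 1):
        confirm = comb(j - 1, n_bc - 1) * p_h**n_bc * p_a ** (j - n_bc)  # Eq. (A3)
        lead = 2 * n_bc - j          # honest lead S_j at block confirmation
        # Failure from this state: the attacker never gains lead + 1 blocks.
        prob -= confirm * (1.0 - catch_up ** (lead + 1))
    return prob


if __name__ == "__main__":
    for n_bc, p_a in [(1, 0.3), (5, 0.35), (3, 0.6)]:
        print(n_bc, p_a, p_dsa_closed_form(n_bc, p_a), p_dsa_gamblers_ruin(n_bc, p_a))
```

Both functions agree up to floating-point error, and both return one for p_A above 0.5, consistent with the superior attacker regime.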

Proof of Proposition 2.

From Equations (19) and (26), when

t_{c u t} = \infty

, we obtain

\begin{array}{l} 𝔼_{T_{A S}} & = \frac{\lim_{t_{c u t} \to \infty^{-}} \int_{0}^{t_{c u t}} t f_{T_{D S A}} (t) d t}{ℙ_{A S} (t_{c u t})} = \frac{\sum_{i = 2 N_{B C} + 1}^{\infty} 𝔼 [T_{i}] p_{D S A, i}}{ℙ_{D S A}} \\ = \frac{\sum_{i = 2 N_{B C} + 1}^{\infty} \frac{i}{λ_{T}} p_{D S A, i}}{ℙ_{D S A}}, \end{array}

(A17)

where 𝔼 [T_{i}] = i λ_{T}^{- 1} [22]. By substituting p_{D S A, i} in Equation (15) into Equation (A17) and rearranging the order of operands, we obtain

\begin{array}{l} λ_{T} ℙ_{D S A} 𝔼_{T_{A S}} & = \sum_{j = N_{B C}}^{2 N_{B C}} (\begin{matrix} j - 1 \\ N_{B C} - 1 \end{matrix}) \sum_{i = 2 N_{B C}}^{\infty} (i + 1) C_{\frac{i}{2} - N_{B C}, 2 N_{B C} - j} p_{A}^{\frac{i + 2}{2}} p_{H}^{\frac{i}{2}} \\ + \sum_{i = N_{B C} - 1}^{\infty} (i + 1) (\begin{matrix} i \\ N_{B C} - 1 \end{matrix}) p_{A}^{i + 1 - N_{B C}} p_{H}^{N_{B C}} - \sum_{i = N_{B C} - 1}^{2 N_{B C} - 1} (i + 1) (\begin{matrix} i \\ N_{B C} - 1 \end{matrix}) p_{A}^{i + 1 - N_{B C}} p_{H}^{N_{B C}} . \end{array}

(A18)

By rearranging the indices of summations, we arrive at

\begin{array}{l} λ_{T} ℙ_{D S A} 𝔼_{T_{A S}} & = \sum_{j = N_{B C}}^{2 N_{B C}} (\begin{matrix} j - 1 \\ N_{B C} - 1 \end{matrix}) p_{A}^{N_{B C} + 1} p_{H}^{N_{B C}} \cdot \sum_{i = 0}^{\infty} (2 i + 2 N_{B C} + 1) C_{i, 2 N_{B C} - j} {(p_{A} p_{H})}^{i} \\ + p_{A} {(\frac{p_{H}}{p_{A}})}^{N_{B C}} \sum_{i = N_{B C} - 1}^{\infty} (i + 1) (\begin{matrix} i \\ N_{B C} - 1 \end{matrix}) p_{A}^{i} - \sum_{i = N_{B C}}^{2 N_{B C}} i (\begin{matrix} i - 1 \\ N_{B C} - 1 \end{matrix}) p_{A}^{i - N_{B C}} p_{H}^{N_{B C}} . \end{array}

(A19)

By substituting the generating functions

M_{k} (x)

and

G_{k} (x)

defined respectively in Equation (A8) and Equation (A9), Equation (A19) becomes

\begin{array}{l} λ_{T} ℙ_{D S A} 𝔼_{T_{A S}} & = \sum_{j = N_{B C}}^{2 N_{B C}} (\begin{matrix} j - 1 \\ N_{B C} - 1 \end{matrix}) {p_{A}}^{N_{B C} + 1} {p_{H}}^{N_{B C}} \cdot (2 \sum_{i = 0}^{\infty} i C_{i, 2 N_{B C} - j} {(p_{A} p_{H})}^{i} + (2 N_{B C} + 1) M_{2 N_{B C} - j} (p_{A} p_{H})) \\ + p_{A} {(\frac{p_{H}}{p_{A}})}^{N_{B C}} (\sum_{i = N_{B C} - 1}^{\infty} i (\begin{matrix} i \\ N_{B C} - 1 \end{matrix}) {p_{A}}^{i} + G_{N_{B C} - 1} (p_{A})) - \sum_{i = N_{B C}}^{2 N_{B C}} i (\begin{matrix} i - 1 \\ N_{B C} - 1 \end{matrix}) {p_{A}}^{i - N_{B C}} {p_{H}}^{N_{B C}} . \end{array}

(A20)

We use the following relationships,

\sum_{i = 0}^{\infty} i C_{i, k} x^{i} = x {M^{'}}_{k} (x)

(A21)

and

\sum_{i = k}^{\infty} i (\begin{matrix} i \\ k \end{matrix}) x^{i} = x {G^{'}}_{k} (x),

(A22)

and their derivatives are given by

\begin{matrix} {M^{'}}_{k} (x) : = \frac{d}{d x} M_{k} (x) = \sum_{i = 0}^{\infty} i C_{i, k} x^{i - 1} \\ = \frac{(k + 1)}{\sqrt{1 - 4 x}} {(\frac{2}{1 + \sqrt{1 - 4 x}})}^{k + 2} \end{matrix}

(A23)

and

\begin{array}{l} {G^{'}}_{k} (x) : & = \frac{d}{d x} G_{k} (x) \\ = \sum_{i = k}^{\infty} i (\begin{matrix} i \\ k \end{matrix}) x^{i - 1} \\ = \frac{(k x^{k - 1} + x^{k})}{{(1 - x)}^{k + 2}} . \end{array}

(A24)

By substituting Equation (A21) and Equation (A22) into Equation (A20), we obtain

\begin{matrix} λ_{T} ℙ_{D S A} 𝔼_{T_{A S}} = \sum_{j = N_{B C}}^{2 N_{B C}} (\begin{matrix} j - 1 \\ N_{B C} - 1 \end{matrix}) p_{A}^{N_{B C} + 1} p_{H}^{N_{B C}} \cdot (2 p_{A} p_{H} {M^{'}}_{2 N_{B C} - j} (p_{A} p_{H}) + (2 N_{B C} + 1) M_{2 N_{B C} - j} (p_{A} p_{H})) \\ + p_{A} {(\frac{p_{H}}{p_{A}})}^{N_{B C}} (p_{A} {G^{'}}_{N_{B C} - 1} (p_{A}) + G_{N_{B C} - 1} (p_{A})) - \sum_{i = N_{B C}}^{2 N_{B C}} i (\begin{matrix} i - 1 \\ N_{B C} - 1 \end{matrix}) p_{A}^{i - N_{B C}} p_{H}^{N_{B C}} \end{matrix}

(A25)

Putting

x = p_{A} p_{H}

into

{M^{'}}_{k} (x)

in Equation (A23) results in

{M^{'}}_{k} (p_{A} p_{H}) = {M^{'}}_{k} (p_{m} p_{M}) = \frac{(k + 1)}{1 - 2 p_{m}} {(\frac{1}{p_{M}})}^{k + 2} .

(A26)

Putting

x = p_{A}

into

{G^{'}}_{k} (x)

in Equation (A24) gives

{G^{'}}_{k} (p_{A}) = \frac{(k p_{A}^{k - 1} + p_{A}^{k})}{p_{H}^{k + 2}} .

(A27)

By substituting Equation (A12), Equation (A14), Equation (A26), and Equation (A27) into Equation (A25), we finally obtain Equation (27). □

Appendix C

Proof of Proposition 1.

We use a generating function and generalized hypergeometric functions to compute the infinite summations in Equation (19).

By substituting p_{D S A, i} in Equation (15) and

f_{T_{i}} (t)

in Equation (14) into Equation (19), we arrive at

\begin{array}{l} f_{T_{D S A}} (t) - (1 - ℙ_{D S A}) δ (t - \infty) & = \sum_{j = N_{B C}}^{j = 2 N_{B C}} (\begin{matrix} j - 1 \\ N_{B C} - 1 \end{matrix}) \sum_{i = 2 N_{B C} + 1}^{\infty} C_{\frac{i - 1}{2} - N_{B C}, 2 N_{B C} - j} p_{A}^{\frac{i + 1}{2}} p_{H}^{\frac{i - 1}{2}} \frac{λ_{T}^{i} t^{i - 1} e^{- λ_{T} t}}{(i - 1)!} \\ + \sum_{i = 2 N_{B C} + 1}^{\infty} (\begin{matrix} i - 1 \\ N_{B C} - 1 \end{matrix}) p_{H}^{N_{B C}} p_{A}^{i - N_{B C}} \frac{λ_{T}^{i} t^{i - 1} e^{- λ_{T} t}}{(i - 1)!} . \end{array}

(A28)

By rearranging the indices of summations and the order of operands, we obtain

\begin{array}{l} f_{T_{D S A}} (t) - (1 - ℙ_{D S A}) δ (t - \infty) & = \sum_{j = N_{B C}}^{j = 2 N_{B C}} (\begin{matrix} j - 1 \\ N_{B C} - 1 \end{matrix}) \sum_{i = 0}^{\infty} (C_{i, 2 N_{B C} - j} {p_{A}}^{N_{B C} + i + 1} {p_{H}}^{N_{B C} + i} \cdot \frac{{λ_{T}}^{2 N_{B C} + 2 i + 1} t^{2 N_{B C} + 2 i} e^{- λ_{T} t}}{(2 N_{B C} + 2 i)!}) \\ + {(\frac{p_{H}}{p_{A}})}^{N_{B C}} e^{- λ_{T} t} (\sum_{i = N_{B C}}^{\infty} (\begin{matrix} i - 1 \\ N_{B C} - 1 \end{matrix}) {p_{A}}^{i} \frac{{λ_{T}}^{i} t^{i - 1}}{(i - 1)!} - \sum_{i = N_{B C}}^{2 N_{B C}} (\begin{matrix} i - 1 \\ N_{B C} - 1 \end{matrix}) {p_{A}}^{i} \frac{{λ_{T}}^{i} t^{i - 1}}{(i - 1)!}) . \end{array}

(A29)

We can define two generating functions as

B (x) : = \sum_{i = 0}^{\infty} C_{i, 2 N_{B C} - j} \frac{x^{i}}{(2 N_{B C} + 2 i)!} = (2 N_{B C} - j + 1) \sum_{i = 0}^{\infty} \frac{(2 i + 2 N_{B C} - j)!}{i! (i + 2 N_{B C} - j + 1)!} \frac{x^{i}}{(2 N_{B C} + 2 i)!},

(A30)

and

H (x) : = \sum_{i = N_{B C}}^{\infty} (\begin{matrix} i - 1 \\ N_{B C} - 1 \end{matrix}) \frac{x^{i - 1}}{(i - 1)!} = \sum_{i = N_{B C} - 1}^{\infty} (\begin{matrix} i \\ N_{B C} - 1 \end{matrix}) \frac{x^{i}}{i!} .

(A31)

By substituting

B (x)

and

H (x)

into Equation (A29), we obtain

\begin{array}{l} f_{T_{D S A}} (t) - (1 - ℙ_{D S A}) δ (t - \infty) & = \sum_{j = N_{B C}}^{j = 2 N_{B C}} (\begin{matrix} j - 1 \\ N_{B C} - 1 \end{matrix}) p_{A} λ_{T} e^{- λ_{T} t} {(p_{A} p_{H} {(λ_{T} t)}^{2})}^{N_{B C}} B (p_{A} p_{H} {(λ_{T} t)}^{2}) \\ + {(\frac{p_{H}}{p_{A}})}^{N_{B C}} e^{- λ_{T} t} (p_{A} λ_{T} H (p_{A} λ_{T} t) - \sum_{i = N_{B C}}^{2 N_{B C}} (\begin{matrix} i - 1 \\ N_{B C} - 1 \end{matrix}) p_{A}^{i} \frac{λ_{T}^{i} t^{i - 1}}{(i - 1)!}) . \end{array}

(A32)

We replace function

B (x)

in Equation (A30) with the generalized hypergeometric functions (See Appendix E for definition). For this purpose, we first denote the sequences in

B (x)

by

β_{i} : = \frac{(2 i + 2 N_{B C} - j)!}{i! (i + 2 N_{B C} - j + 1)!} \frac{1}{(2 N_{B C} + 2 i)!},

(A33)

and

β_{0} : = \frac{1}{(2 N_{B C} - j + 1) (2 N_{B C})!} .

(A34)

Next, the function

B (x)

can be rewritten as

B (x) = (2 N_{B C} - j + 1) \sum_{i = 0}^{\infty} β_{i} x^{i} = (2 N_{B C} - j + 1) β_{0} (x^{0} + \frac{β_{1}}{β_{0}} x^{1} + \frac{β_{2}}{β_{1}} \frac{β_{1}}{β_{0}} x^{2} + \dots) .

(A35)

The reformulated sequence in Equation (A35) is computed by

\frac{β_{i + 1}}{β_{i}} = \frac{(i + 1 + N_{B C} - j / 2) (i + 1 / 2 + N_{B C} - j / 2)}{(i + 2 + 2 N_{B C} - j) (i + 1 + N_{B C}) (i + 1 / 2 + N_{B C}) (i + 1)},

(A36)

which has two linear factors in

i

in the numerator and three linear factors in

i

, other than

(i + 1)

, in the denominator.

B (x)

can be expressed in terms of a generalized hypergeometric function

{}_{2} F_{3}

[28] as follows,

\begin{array}{l} B (x) & = (2 N_{B C} - j + 1) β_{0} \, {}_{2} F_{3} (a_{j}; b_{j}; x) \\ = \frac{1}{(2 N_{B C})!} \, {}_{2} F_{3} (a_{j}; b_{j}; x), \end{array}

(A37)

where vectors

a_{j}

and

b_{j}

defined in Equations (21) and (22), respectively, collect the constants of the linear factors in

i

in the numerator and denominator of Equation (A36).
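The step from Equation (A33) to Equation (A37) can be checked with exact rational arithmetic. The following is a minimal sketch, not the authors' code: the function names are hypothetical, and the hypergeometric parameters are read directly off the factors in Equation (A36), on the assumption that they coincide with the vectors defined in Equations (21) and (22).

```python
from fractions import Fraction
from math import factorial


def beta(i, n_bc, j):
    """Coefficient beta_i of Equation (A33), as an exact fraction."""
    m = 2 * n_bc - j
    return Fraction(
        factorial(2 * i + m),
        factorial(i) * factorial(i + m + 1) * factorial(2 * n_bc + 2 * i),
    )


def ratio_a36(i, n_bc, j):
    """Right-hand side of Equation (A36)."""
    half = Fraction(1, 2)
    h = n_bc - Fraction(j, 2)
    numerator = (i + 1 + h) * (i + half + h)
    denominator = (i + 2 + 2 * n_bc - j) * (i + 1 + n_bc) * (i + half + n_bc) * (i + 1)
    return numerator / denominator


def b_direct(x, n_bc, j, terms=40):
    """Partial sum of B(x) taken directly from Equation (A30)."""
    m = 2 * n_bc - j
    return (m + 1) * sum(beta(i, n_bc, j) * x**i for i in range(terms))


def b_hypergeometric(x, n_bc, j, terms=40):
    """Partial sum of (1 / (2 N_BC)!) * 2F3(a_j; b_j; x), built term by term from (A36)."""
    term, total = Fraction(1), Fraction(0)
    for i in range(terms):
        total += term
        term *= ratio_a36(i, n_bc, j) * x
    return total / factorial(2 * n_bc)


if __name__ == "__main__":
    x = Fraction(3, 10)  # arbitrary test point
    for j in range(3, 7):  # j = N_BC, ..., 2 N_BC for N_BC = 3
        print(j, b_direct(x, 3, j) == b_hypergeometric(x, 3, j))
```

Because both partial sums use the same number of terms and exact fractions, they agree exactly, not just to rounding error.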

We use a closed-form expression of generating function

H (x)

in Equation (A31) given by

\begin{array}{l} H (x) & = \sum_{i = N_{B C} - 1}^{\infty} (\begin{matrix} i \\ N_{B C} - 1 \end{matrix}) \frac{x^{i}}{i!} = \frac{1}{(N_{B C} - 1)!} \sum_{i = N_{B C} - 1}^{\infty} \frac{x^{i}}{(i - N_{B C} + 1)!} \\ = \frac{x^{N_{B C} - 1}}{(N_{B C} - 1)!} e^{x}, \end{array}

(A38)

where the following relationship is used [29]:

\sum_{i = 0}^{\infty} \frac{x^{i}}{i!} = e^{x} .

(A39)
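The closed form in Equation (A38) can be sanity-checked numerically. The sketch below compares a truncated version of the series in Equation (A31) with the closed form; the function names and the truncation length are illustrative assumptions.

```python
from math import comb, exp, factorial, isclose


def h_series(x, n_bc, terms=80):
    """Truncated series for H(x) from Equation (A31)."""
    start = n_bc - 1
    return sum(comb(i, start) * x**i / factorial(i) for i in range(start, start + terms))


def h_closed(x, n_bc):
    """Closed form of H(x) from Equation (A38)."""
    return x ** (n_bc - 1) / factorial(n_bc - 1) * exp(x)


if __name__ == "__main__":
    for n_bc in (1, 3, 5):
        print(n_bc, isclose(h_series(2.5, n_bc), h_closed(2.5, n_bc), rel_tol=1e-9))
```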

By substituting Equation (A37) and Equation (A38) into Equation (A32), we arrive at

\begin{matrix} f_{T_{D S A}} (t) - (1 - ℙ_{D S A}) δ (t - \infty) \\ = \frac{p_{A} λ_{T} e^{- λ_{T} t} {(p_{A} p_{H} {(λ_{T} t)}^{2})}^{N_{B C}}}{(2 N_{B C})!} \cdot \sum_{j = N_{B C}}^{j = 2 N_{B C}} (\begin{matrix} j - 1 \\ N_{B C} - 1 \end{matrix}) \, {}_{2} F_{3} (a_{j}; b_{j}; p_{A} p_{H} {(λ_{T} t)}^{2}) \\ + {(\frac{p_{H}}{p_{A}})}^{N_{B C}} e^{- λ_{T} t} (p_{A} λ_{T} \frac{{(p_{A} λ_{T} t)}^{N_{B C} - 1}}{(N_{B C} - 1)!} e^{p_{A} λ_{T} t} - \sum_{i = N_{B C}}^{2 N_{B C}} (\begin{matrix} i - 1 \\ N_{B C} - 1 \end{matrix}) {p_{A}}^{i} \frac{{λ_{T}}^{i} t^{i - 1}}{(i - 1)!}) \\ = \frac{p_{A} λ_{T} e^{- λ_{T} t} {(p_{A} p_{H} {(λ_{T} t)}^{2})}^{N_{B C}}}{(2 N_{B C})!} \cdot \sum_{j = N_{B C}}^{j = 2 N_{B C}} (\begin{matrix} j - 1 \\ N_{B C} - 1 \end{matrix}) \, {}_{2} F_{3} (a_{j}; b_{j}; p_{A} p_{H} {(λ_{T} t)}^{2}) \\ + {(\frac{p_{H}}{p_{A}})}^{N_{B C}} e^{- λ_{T} t} (p_{A} λ_{T} \frac{{(p_{A} λ_{T} t)}^{N_{B C} - 1}}{(N_{B C} - 1)!} e^{p_{A} λ_{T} t} - \frac{1}{(N_{B C} - 1)!} \sum_{i = N_{B C}}^{2 N_{B C}} {p_{A}}^{i} \frac{{λ_{T}}^{i} t^{i - 1}}{(i - N_{B C})!}) . \end{matrix}

(A40)

We obtain Equation (20) by rearranging the indices of the summations and the order of operands. □

Appendix D

Comparison of Attack Success Probabilities of DS Attack and Pre-Mining Attack

In [12], a special case of the pre-mining strategy was considered, in which the condition for DS attack success differs from Definition 1. Specifically, the only condition is that the fraudulent chain grows longer than the honest chain by more than

N_{B C}

, i.e.,

A (t) > H (t) + N_{B C}

(see Section 7 of [12]). We refer to

ℙ_{pre - mine}

as the probability of satisfying this condition. The literature has shown that this condition is sufficient for a successful DS attack [12]. What has not been shown, however, is that the condition is not necessary. We therefore show that it is indeed not necessary, by proving that

ℙ_{D S A} > ℙ_{pre - mine}

for all

p_{A} \in (0, 0.5)

. First, it has been given that

ℙ_{pre - mine} = {(p_{A} / p_{H})}^{N_{B C} + 1}

. Under the condition of [12], the fraudulent chain is required to catch up with the honest chain by an additional N_{B C} blocks. The catch-up probability was derived by Nakamoto in [1], using the gambler’s ruin approach, as

{(p_{A} / p_{H})}^{k}

, where

k

is the number of blocks by which the honest chain initially leads. Next, we refer to an intermediate step in the derivation of

ℙ_{D S A}

by Rosenfeld [6]:

ℙ_{D S A} = \sum_{k = 0}^{N_{B C} + 1} (\begin{matrix} N_{B C} + k - 1 \\ k \end{matrix}) p_{H}^{N_{B C}} p_{A}^{k} {(\frac{p_{A}}{p_{H}})}^{N_{B C} - k + 1} + \sum_{k = N_{B C} + 2}^{\infty} (\begin{matrix} N_{B C} + k - 1 \\ k \end{matrix}) p_{H}^{N_{B C}} p_{A}^{k} .

(A41)

Finally, the following chain of inequalities shows that

ℙ_{D S A} > ℙ_{pre - mine}

:

\begin{array}{l} ℙ_{D S A} & > \sum_{k = 0}^{N_{B C} + 1} (\begin{matrix} N_{B C} + k - 1 \\ k \end{matrix}) p_{H}^{N_{B C}} p_{A}^{k} {(\frac{p_{A}}{p_{H}})}^{N_{B C} - k + 1} + \sum_{k = N_{B C} + 2}^{\infty} (\begin{matrix} N_{B C} + k - 1 \\ k \end{matrix}) p_{H}^{N_{B C}} p_{A}^{k} {(\frac{p_{A}}{p_{H}})}^{N_{B C} + 1} \\ > {(\frac{p_{A}}{p_{H}})}^{N_{B C} + 1} \sum_{k = 0}^{\infty} (\begin{matrix} N_{B C} + k - 1 \\ k \end{matrix}) p_{H}^{N_{B C}} p_{A}^{k} \\ = {(\frac{p_{A}}{p_{H}})}^{N_{B C} + 1} = ℙ_{pre - mine} . \end{array}

(A42)

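The first inequality holds because p_{A} / p_{H} < 1 for p_{A} < 0.5, so multiplying the second sum by (p_{A} / p_{H})^{N_{B C} + 1} strictly decreases it. The second holds because every exponent N_{B C} - k + 1 in the first sum is at most N_{B C} + 1, with strict inequality for k ≥ 1. The final equality holds because the negative binomial weights sum to one over k ≥ 0.

As a numerical example, when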

p_{A} = 0.35

and

N_{B C} = 5

the probabilities can be computed as

ℙ_{D S A} = 0.2287

and

ℙ_{pre - mine} = 0.0244

. This large gap confirms that the DS attack success condition defined in [12] is only sufficient, and stricter than necessary.
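The two probabilities in this example can be reproduced with a few lines of code. The sketch below evaluates Equation (A41) directly; the infinite tail sum is computed as one minus the finite head, which assumes only that the negative binomial weights sum to one. The function names are illustrative.

```python
from math import comb


def p_premine(p_a, n_bc):
    """Pre-mining success probability (p_A / p_H)^(N_BC + 1)."""
    p_h = 1.0 - p_a
    return (p_a / p_h) ** (n_bc + 1)


def p_dsa(p_a, n_bc):
    """DS attack success probability from Equation (A41), for p_A < 0.5."""
    p_h = 1.0 - p_a
    r = p_a / p_h
    first_sum = 0.0    # terms k = 0, ..., N_BC + 1, weighted by the catch-up factor
    head_weight = 0.0  # total negative binomial weight of those same terms
    for k in range(n_bc + 2):
        w = comb(n_bc + k - 1, k) * p_h**n_bc * p_a**k
        first_sum += w * r ** (n_bc - k + 1)
        head_weight += w
    # The second sum in (A41) is the remaining weight, 1 - head_weight.
    return first_sum + (1.0 - head_weight)


if __name__ == "__main__":
    # Rounds to 0.2287 and 0.0244, the values quoted in the text.
    print(round(p_dsa(0.35, 5), 4), round(p_premine(0.35, 5), 4))
```

Sweeping p_a over (0, 0.5) with the same functions gives a quick empirical check of the inequality in Equation (A42).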

Appendix E

Generalized Hypergeometric Function

We define the generalized hypergeometric series and the generalized hypergeometric function, following [28].

For a variable

z

and a given sequence of coefficients

β_{0}, β_{1}, \dots

, if the ratio of consecutive coefficients

β_{n + 1} / β_{n}

can be expressed in terms of two polynomials

A (n)

and

B (n)

in

n

as follows,

\frac{β_{n + 1}}{β_{n}} = \frac{A (n)}{B (n) (n + 1)}

(A43)

for all integers

n \geq 0

, the power series

\sum_{n \geq 0} β_{n} z^{n}

is a generalized hypergeometric series, where the polynomials take the forms

A (n) = c (a_{1} + n) \dots (a_{p} + n)

(A44)

and

B (n) = d (b_{1} + n) \dots (b_{q} + n),

(A45)

for real numbers

c

and

d

and complex numbers

a_{1}, \dots, a_{p}

and

b_{1}, \dots, b_{q}

. The generalized hypergeometric series is denoted by

{}_{p} F_{q} (a; b; z) : = \sum_{n \geq 0} β_{n} z^{n},

(A46)

where

a

and

b

are the vectors of

a_{1}, \dots, a_{p}

and

b_{1}, \dots, b_{q}

, respectively.

A generalized hypergeometric series defines a generalized hypergeometric function wherever it converges. If

p < q + 1

, the ratio in Equation (A43) goes to zero as

n \to \infty

. By the ratio test, the series in Equation (A46) therefore converges for any finite value

z

and thus can be defined as a function.
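The ratio in Equation (A43) also gives a direct way to evaluate such a series numerically, since each term follows from the previous one. The sketch below assumes the common normalization c = d = 1 and a leading coefficient of 1, which the definition above leaves general; the function name and the stopping rule are illustrative choices, and the function is not a substitute for a library routine.

```python
from math import cosh, exp, isclose


def hyp_pfq(a, b, z, tol=1e-16, max_terms=10_000):
    """Partial sum of pFq(a; b; z), with leading coefficient 1 and c = d = 1.

    Each term is obtained from the previous one via the ratio
    (a_1 + n)...(a_p + n) / ((b_1 + n)...(b_q + n) (n + 1)).
    """
    term, total = 1.0, 0.0
    for n in range(max_terms):
        total += term
        ratio = 1.0
        for a_i in a:
            ratio *= a_i + n
        for b_i in b:
            ratio /= b_i + n
        term *= ratio * z / (n + 1)
        # Stop once the next term is negligible relative to the running sum.
        if abs(term) <= tol * abs(total):
            break
    return total


if __name__ == "__main__":
    # 0F0(;;z) is the exponential series of Equation (A39).
    print(isclose(hyp_pfq([], [], 1.5), exp(1.5), rel_tol=1e-12))
    # 0F1(; 1/2; z^2 / 4) equals cosh(z).
    print(isclose(hyp_pfq([], [0.5], 2.0**2 / 4), cosh(2.0), rel_tol=1e-12))
```

Both checks use p < q + 1, so the term ratio tends to zero and the loop terminates after a modest number of terms.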

References

1. Nakamoto, S. Bitcoin: A Peer-to-Peer Electronic Cash System. 2008. Available online: https://bitcoin.org/bitcoin.pdf (accessed on 26 November 2020).
2. Ritzdorf, H.; Soriente, C.; Karame, G.O.; Marinovic, S.; Gruber, D.; Capkun, S. Toward Shared Ownership in the Cloud. IEEE Trans. Inf. Forensics Secur. 2018, 13, 3019–3034.
3. Wu, S.; Chen, Y.; Wang, Q.; Li, M.; Wang, C.; Luo, X. CReam: A Smart Contract Enabled Collusion-Resistant e-Auction. IEEE Trans. Inf. Forensics Secur. 2018, 14, 1687–1701.
4. Nguyen, G.-T.; Kim, K. A Survey about Consensus Algorithms Used in Blockchain. J. Inf. Process. Syst. 2018, 14, 101–128.
5. Sompolinsky, Y.; Zohar, A. Secure High-Rate Transaction Processing in Bitcoin; Böhme, R., Okamoto, T., Eds.; Springer: Berlin/Heidelberg, Germany, 2015; pp. 507–527.
6. Rosenfeld, M. Analysis of Hashrate-Based Double Spending. arXiv 2014, arXiv:1402.2009 [cs].
7. Beikverdi, A.; Song, J. Trend of centralization in Bitcoin’s distributed network. In Proceedings of the 2015 IEEE/ACIS 16th International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), Takamatsu, Japan, 1–3 June 2015; pp. 1–6.
8. Gervais, A.; Karame, G.O.; Capkun, V.; Capkun, S. Is Bitcoin a Decentralized Currency? IEEE Secur. Priv. 2014, 12, 54–60.
9. Attah, E. Five most prolific 51% attacks in crypto: Verge, Ethereum Classic, Bitcoin Gold, Feathercoin, Vertcoin. CryptoSlate. Available online: https://cryptoslate.com/prolific-51-attacks-crypto-verge-ethereum-classic-bitcoin-gold-feathercoin-vertcoin/ (accessed on 26 November 2020).
10. Bonneau, J. Why Buy When You Can Rent? Bribery Attacks on Bitcoin Consensus; Springer: Berlin, Germany, 2016.
11. Sayeed, S.; Marco-Gisbert, H. Assessing Blockchain Consensus and Security Mechanisms against the 51% Attack. Appl. Sci. 2019, 9, 1788.
12. Sompolinsky, Y.; Zohar, A. Bitcoin’s Security Model Revisited. arXiv 2016, arXiv:1605.09193 [cs].
13. Bissias, G.; Levine, B.N.; Ozisik, A.P.; Andresen, G. An Analysis of Attacks on Blockchain Consensus. arXiv 2016, arXiv:1610.07985 [cs].
14. Zaghloul, E.; Li, T.; Mutka, M.W.; Ren, J. Bitcoin and Blockchain: Security and Privacy. IEEE Internet Things J. 2020, 7, 10288–10313.
15. Budish, E.B. The Economic Limits of Bitcoin and the Blockchain. SSRN J. 2018.
16. Gervais, A.; Karame, G.O.; Wüst, K.; Glykantzis, V.; Ritzdorf, H.; Capkun, S. On the Security and Performance of Proof of Work Blockchains. In Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security (CCS’16), Vienna, Austria, 24–28 October 2016; pp. 3–16.
17. Ramezan, G.; Leung, C.; Jane Wang, Z. A Strong Adaptive, Strategic Double-Spending Attack on Blockchains. In Proceedings of the 2018 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData), Halifax, NS, Canada, 30 July–3 August 2018; pp. 1219–1227.
18. Pinzón, C.; Rocha, C. Double-spend Attack Models with Time Advantange for Bitcoin. Electron. Notes Theor. Comput. Sci. 2016, 329, 79–103.
19. Goffard, P.-O. Fraud risk assessment within blockchain transactions. Adv. Appl. Probab. 2019, 51, 443–467.
20. Karame, G.O.; Androulaki, E.; Roeschlin, M.; Gervais, A.; Čapkun, S. Misbehavior in Bitcoin: A Study of Double-Spending and Accountability. ACM Trans. Inf. Syst. Secur. 2015, 18, 2:1–2:32.
21. Karame, G.O.; Androulaki, E.; Capkun, S. Double-spending fast payments in bitcoin. In Proceedings of the 2012 ACM Conference on Computer and Communications Security (CCS ’12), Raleigh, NC, USA, 16–18 October 2012; p. 906.
22. Papoulis, A.; Pillai, S.U. Random walks and other applications. In Probability, Random Variables and Stochastic Processes; McGraw-Hill Europe: Boston, MA, USA, 2002; ISBN 978-0-07-122661-5.
23. Bowden, R.; Keeler, H.P.; Krzesinski, A.E.; Taylor, P.G. Block arrivals in the Bitcoin blockchain. arXiv 2018, arXiv:1801.07447 [cs].
24. Flajolet, P.; Sedgewick, R. Combinatorial structures and ordinary generating functions. In Analytic Combinatorics; Cambridge University Press: Cambridge, UK, 2009; ISBN 978-1-139-47716-1.
25. Conti, M.; Sandeep Kumar, E.; Lal, C.; Ruj, S. A Survey on Security and Privacy Issues of Bitcoin. IEEE Commun. Surv. Tutor. 2018, 20, 3416–3452.
26. Wilf, H.S. Analytic and asymptotic methods. In Generatingfunctionology, 3rd ed.; A K Peters/CRC Press: Wellesley, MA, USA, 2005; ISBN 978-1-56881-279-3.
27. Wilf, H.S. Introductory ideas and examples. In Generatingfunctionology, 3rd ed.; A K Peters/CRC Press: Wellesley, MA, USA, 2005; ISBN 978-1-56881-279-3.
28. Gasper, G.; Rahman, M. Basic Hypergeometric series. In Basic Hypergeometric Series; Encyclopedia of Mathematics and Its Applications; Cambridge University Press: Cambridge, UK, 2004; Volume 96, ISBN 978-0-521-83357-8.
29. Flajolet, P.; Sedgewick, R. Labelled structures and exponential generating functions. In Analytic Combinatorics; Cambridge University Press: Cambridge, UK, 2009; ISBN 978-1-139-47716-1.

Figure 1. Computation power distribution among the largest mining pools.

Table 1. Numerical computations of required resources for profitable double-spending (DS) attacks with

p_{A} = 0.35

when

t_{c u t} = c N_{B C} λ_{H}^{- 1}

with

c = 4

.

Block confirmation number (N_{B C})	1	3	5	7	9
Attack success probability (ℙ_{A S})	0.315	0.279	0.218	0.170	0.132
Expected attack success time (𝔼_{T_{A S}}), scaled by λ_{H}^{- 1}	2.004	5.518	8.681	11.694	14.607
Expected OPEX (𝔼_{X}), scaled by γ	1.815	5.487	9.440	13.588	17.859
Required value of target transaction (C_{Suf.}), scaled by γ	$1.079 \cdot (1 - μ) + 4.680$	$2.971 \cdot (1 - μ) + 16.68$	$4.675 \cdot (1 - μ) + 38.62$	$6.297 \cdot (1 - μ) + 73.84$	$7.866 \cdot (1 - μ) + 127.00$

Table 2. Numerical computations of required resources for profitable DS attacks with

p_{A} = 0.4

when

t_{c u t} = c N_{B C} λ_{H}^{- 1}

with

c = 4

.

Block confirmation number (N_{B C})	1	3	5	7	9
Attack success probability (ℙ_{A S})	0.411	0.419	0.376	0.334	0.297
Expected attack success time (𝔼_{T_{A S}}), scaled by λ_{H}^{- 1}	1.953	5.338	8.434	11.418	14.325
Expected OPEX (𝔼_{X}), scaled by γ	2.106	6.139	10.436	14.977	19.716
Required value of target transaction (C_{Suf.}), scaled by γ	$1.302 \cdot (1 - μ) + 3.819$	$3.559 \cdot (1 - μ) + 11.10$	$5.622 \cdot (1 - μ) + 22.15$	$7.612 \cdot (1 - μ) + 37.25$	$9.550 \cdot (1 - μ) + 56.96$

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
