Provably Secure Covert Communication on Blockchain

Partala, Juha

doi:10.3390/cryptography2030018

Open AccessArticle

Provably Secure Covert Communication on Blockchain

by

Juha Partala

Physiological Signal Analysis Team, Center for Machine Vision and Signal Analysis, University of Oulu, P.O. Box 5000, FI-90014 Oulu, Finland

Cryptography 2018, 2(3), 18; https://doi.org/10.3390/cryptography2030018

Submission received: 29 June 2018 / Revised: 9 August 2018 / Accepted: 13 August 2018 / Published: 20 August 2018

(This article belongs to the Special Issue Advances of Blockchain Technology and Its Applications)

Download

Browse Figure

Versions Notes

Abstract

:

Blockchain is a public open ledger that provides data integrity in a distributed manner. It is the underlying technology of cryptocurrencies and an increasing number of related applications, such as smart contracts. The open nature of blockchain together with strong integrity guarantees on the stored data makes it a compelling platform for covert communication. In this paper, we suggest a method of securely embedding covert messages into a blockchain. We formulate a simplified ideal blockchain model based on existing implementations and devise a protocol that enables two parties to covertly communicate through the blockchain following that model. We also formulate a rigorous definition for the security and covertness of such a protocol based on computational indistinguishability. Finally, we show that our method satisfies this definition in the random oracle model for the underlying cryptographic hash function.

Keywords:

cryptography; steganography; privacy; random oracle model

1. Introduction

Covert communication channels are designed to protect the relationship between the transmitter and the receiver by hiding the fact that secret communication is taking place in the first place. Such channels can be used, for example, in military communication or in the presence of an authoritarian government. Secure covert communication can be implemented as a combination of cryptography and steganography. Cryptography ensures that the communicated message stays private while steganography is used to hide the fact that there is encrypted communication. However, steganography requires a medium. It requires that there is a channel where innocuous communication is taking place and both the transmitter and the receiver have access to that channel. In addition, the channel needs to be reliable so that the receiver gets the transmitted message with high probability and it has not been manipulated with.

The blockchain technology was first introduced as the underlying mechanism for cryptocurrency in Bitcoin [1] as an open decentralized method of providing trust. Blockchain is a public distributed ledger implemented as a continuously growing chain of blocks that is designed to provide authenticity of the data without any centralized parties. Its integrity is ensured collaboratively by the majority of the nodes participating in the blockchain network. Therefore, data recorded into a blockchain is inherently resilient to manipulation, since manipulation would require the adversary to control a large fraction of the nodes. Since its introduction, blockchain technology has attracted a lot of interest in various applications, such as smart contracts enabled by Ethereum [2], Internet of Things [3], healthcare [4] and medical data management [5]. Typically, blockchain networks, such as those underlying cryptocurrencies, are free for anyone to join. This openness together with strong integrity guarantees on the stored data make blockchains an interesting platform for implementing covert communication channels.

There are three main advantages of blockchain compared to other cover mediums: (1) it is anonymous and free to join, meaning that the communication parties have free access, (2) submitted data cannot be altered and, in particular, the integrity guarantees are not provided by any centralized party, but the consensus of the entire network, and (3) published data cannot be removed, meaning that no authority can apply censorship to already published data. Since the blockchain is immutable, alteration of the covert messages is virtually impossible and the embedding of covert information is free to be fragile. This is not the case, for example, for images on an image board.

In this paper, we suggest a method of submitting covert messages through a blockchain considered as a payment platform. Due to the immutable nature of the blockchain, the sender’s ability to embed into it is limited. However, since everybody is free to submit payments into the chain, we apply those payments to convey encrypted messages to a receiver. We start by providing a model for the blockchain, called a simplified ideal blockchain, where irrelevant technical details have been abstracted away. We then devise

BLOCCE

, a method of embedding and extracting reliably into a blockchain following this model by submitting payments and show that the method is reliable and runs in expected polynomial time excluding the time spent waiting for new blocks to appear into the chain. Based on the provable security of stegosystems, we then formulate a definition of secure covert communication on a blockchain based on the hardness of distinguishing the payload containing payments from random payments. Finally, we prove that our method satisfies this definition. The embedding rate of our method is rather inefficient and it is considered in a simplified model. Therefore, our proposal can be seen more as a proof-of-concept scheme than a practical scheme that is ready to be implemented. However, our proposal is the first to achieve a covert channel over a public blockchain with provable security.

The paper is organized as follows. Section 2 explains the preliminaries for the rest of the paper such as the utilized cryptographic primitives and steganography. In Section 3, we describe our model of a simplified ideal blockchain. The suggested method is explained in detail in Section 4 and its security is studied and proved in Section 5. Finally, Section 6 and Section 7 provide the discussion on future work and the conclusion, respectively.

Related Work

To the best of our knowledge, there are no suggestions implementing provably secure covert channels on a blockchain. However, there are methods and services that systematically insert arbitrary data into a blockchain [6]. The insertion can be based on either the input scripts, that unlock funds for the transaction, or the output scripts that specify the receiver of the unlocked funds. Depending on the transaction type (coinbase transactions that generate new currency or normal ones that make payments) deviations from the standard payment templates enable users to insert a message into the blockchain. This can be achieved, for example, by using dead conditional branches in an input script or replacing receiver public keys or script hashes with arbitrary data. Especially, methods manipulating the receiver address bear similarities to the one suggested in this paper. However, none of the existing methods address the covertness and privacy of the inserted data, which is the main motivation of our proposal. See [6] for a recent survey on existing blockchain data insertion methods and [7] for the methods that are used to detect and thwart them.

Our work is also related to the wider topic of privacy of blockchain based applications. Privacy was not an inherent property of the original Bitcoin design [1]. In fact, due to its open and public nature, there are several threats to the privacy of blockchain users [8]. However, there are several suggestions in the literature that attempt to address these issues by improving the anonymity, for example, by unlinking payments from their origin [9], using zero-knowledge proofs [10] or blind signatures [11]. There are also suggestions that use blockchain as a medium for secure communication. Typically, these suggestion apply encryption to protect the stored application data. For example, atrusted blind escrow service with encrypted data using a blockchain was suggested in Ref. [12]. In Ref. [13], an energy trading platform with bi-directional encrypted message streams was suggested. A decentralized data storage based on blockchain with end-to-end encryption for the internet of things was suggested in [14]. In order to provide a tamper-proof media transaction framework, blockchain technology was combined with digital watermarking of the media in [15].

2. Preliminaries

2.1. Notation

Let

D

be a probability distribution. A random variable V that is distributed according to

D

is denoted by

V \sim D

. If an element u is sampled from a probability distribution

D

, we denote

u \leftarrow D

. The uniform probability distribution on a set X is denoted by

U (X)

. For probabilistic algorithms, we apply the standard notation [16]. When an algorithm

A

is run with an input x and it outputs y, we denote

y \leftarrow A (x)

. Algorithms can be given access to oracles. Oracles are considered as “black boxes” that can compute, for example, functions or other algorithms. An algorithm

A

with oracle access to an oracle

O

is denoted by

A^{O}

. We assume that the time complexity of calling an oracle and receiving its answer is constant time

O (1)

.

The length of a string

σ = σ_{1} σ_{2} \dots σ_{n} \in {0, 1}^{n}

is denoted by

| σ | = n

. The least significant bit (LSB) of

σ = σ_{1} σ_{2} \dots σ_{n}

is

σ_{n}

. The binary string of length s consisting solely of ones is denoted by

1^{s}

and is often used as a security parameter for cryptographic primitives. A function

ϵ : N \to R

is negligible if for every

k \in N

there

n^{'} \in N

such that

| ϵ (n) | \leq n^{- k}

for every

n \geq n^{'}

.

2.2. Cryptographics Primitives

Our method is based on the perceived randomness of the outputs of a hash function. Let

{0, 1}^{*}

denote the set of all finite-length strings over

{0, 1}

together with the empty string (Kleene closure). We follow the random oracle model where a cryptographic hash function

H : {0, 1}^{*} \to {0, 1}^{n}

is modeled as a random oracle. For each input string

m \in {0, 1}^{*}

, the output

H (m)

is independently and uniformly randomly selected (and fixed for subsequent queries with the same input) from

{0, 1}^{n}

.

A symmetric encryption scheme

SE = (Gen, Enc, Dec)

, where

Gen

is a key generation algorithm that on input a security parameter

1^{s}

outputs a secret key

k \leftarrow Gen (1^{s})

,

Enc

is an encryption algorithm that on input a key k and a plaintext message m outputs a ciphertext

c \leftarrow Enc (k, m)

and

Dec

is a decryption algorithm that on input a key k and a ciphertext message c outputs a plaintext message m such that

m \leftarrow Dec (k, c) = Dec (k, Enc (k, m))

.

Algorithm 1 Pseudorandom Ciphertext Experiment
1:	procedure $PRC_{EXP}_{A}^{SE}$ ( $1^{s}$ )
2:	$k \leftarrow Gen (1^{s})$
3:	$(m, S) \leftarrow A_{1}^{{Enc}_{k}} (1^{s})$	▹ S is internal state information of $A$
4:	$b \leftarrow U ({0, 1})$
5:	if $b = 1$ then
6:	$c \leftarrow Enc (k, m)$	▹ Use the actual encryption
7:	else
8:	$c \leftarrow U ({0, 1}^{\| Enc (k, m) \|})$	▹ Use a random string
9:	end if
10:	$b^{'} \leftarrow A_{2}^{{Enc}_{k}} (c, S)$
11:	if $b = b^{'}$ then
12:	return 1	▹ $A$ guessed correctly
13:	else
14:	return 0	▹ $A$ did not guess correctly
15:	end if
16:	end procedure

A symmetric encryption scheme

SE

has pseudorandom ciphertexts under a chosen plaintext attack if a probabilistic polynomial time adversary is unable to distinguish the ciphertext message

c \in {0, 1}^{n}

from a uniformly random string

σ \in {0, 1}^{n}

even if he was able to choose the plaintext message. Formally, a polynomial time adversary

A = (A_{1}, A_{2})

is run in two stages

A_{1}

and

A_{2}

and is given oracle access to

{Enc}_{k}

, the encryption algorithm under a random secret key k. First,

A_{1}

, is run with oracle access to

{Enc}_{k}

and it outputs a cleartext message m together with its internal state information S that it may need in the second phase. Then, based on a coin toss, the second phase

A_{2}

is run with either the correct encryption of the chosen plaintext message or a completely random string from the ciphertext space (together with the state information). This pseudorandom ciphertext experiment is defined in Algorithm 1.

The encryption scheme

SE

has pseudorandom ciphertexts under a chosen plaintext attack if the success advantage,

{Adv}_{A, SE}^{PRC} (s) = |\frac{1}{2} - Pr [PRC_{EXP}_{A}^{SE} (1^{s}) = 1]|,

is negligible for every probabilistic polynomial time adversary

A

. For an example of a cryptosystem with pseudorandom ciphertexts see for example [17].

A digital signature scheme is a public key authentication mechanism that enables a user to sign messages so that anybody can later verify the authenticity of that signature. A digital signature scheme

Σ = (Gen, Sign, Verify)

consists of three algorithms such that

Gen

on input the security parameter

1^{s}

outputs a private and public key pair

(s_{k}, p_{k}) \leftarrow Gen (1^{s})

,

Sign

outputs a digital signature

σ \leftarrow Sign (s_{k}, m)

of a message m computed using the private key

s_{k}

and

Verify

verifies the signature

σ

by outputting

1 \leftarrow Verify (p_{k}, m, σ)

if and only if

σ

is a valid signature for m. A digital signature scheme should be existentially unforgeable meaning that no adversary should be able to generate valid signatures for any message m without the private key

s_{k}

. For details, see for example [18].

2.3. Steganography

We follow the terminology of Hopper et al. [19], where Alice tries to convey a hidden message to Bob. The communication channel from Alice to Bob is modeled as a probability distribution on bit sequences. Each of the communication bits is augmented with a monotonically increasing timestamp. Communication on the channel is viewed as sampling from this distribution, possibly adaptively bit-by-bit. That is, a channel distribution

C

produces elements of the form

(b_{1}, t_{1}), (b_{2}, t_{2}), \dots, (b_{n}, t_{n}),

where

b_{i} \in {0, 1}

and

t_{i} \leq t_{i + 1}

for every

i \in {1, 2, \dots, n}

. Let

H

denote a channel history of bits drawn from the channel. We denote by

C_{H}

the channel distribution conditioned on the history

H

of already sampled bits together with their timestamps.

Definition 1.

A stegosystem

Π = (Embed, Extract)

with security parameter s on a channel

C

consists of two probabilistic algorithms such that

1.: The embedding algorithm $Embed$ takes as input a secret key $k \in {0, 1}^{n_{k}}$ , a hiddentext message $m \in {0, 1}^{*}$ and a channel history $H \in {0, 1}^{*} \times R$ and returns a stegotext message $c \in {0, 1}^{*}$ .
2.: The extraction algorithm $Extract$ takes as input a secret key $k \in {0, 1}^{n_{k}}$ and a stegotext message $c \in {0, 1}^{*}$ and outputs a hiddentext message $m \in {0, 1}^{*}$ .

In our investigation, the channel history

H

that is given as input to

Embed

and the stegotext c given as input to

Extract

will be replaced by access to the blockchain. That is, in our investigation, the blockchain represents the channel history.

Definition 2.

The reliability of a stegosystem

Π = (Embed, Extract)

with messages of length n is the probability

min_{m \in {0, 1}^{n}, H} Pr [Extract (k, Embed (k, m, H))] .

To be useful, we require that the stegosystem has high reliability for hiddentext messages of some length. That is, there is

n \in N

such that for hiddentext messages of length n, the reliability is reasonably high (for example 2/3) meaning that messages can be reliably embedded and extracted.

We also need to stegosystem to be secure. The security is modeled using a chosen hiddentext attack by giving an adversary

A

access to the channel

C

through a sampling oracle M and to an additional oracle which is either

Π_{k}

that outputs stegotexts embedded by the stegosystem

Π

under a random key k or an oracle O that outputs random elements of

C_{H}

for the current history

H

each with probability 1/2 [20]. The adversary is free to choose the hidden message m and needs to distinguish which of these sets of two oracles it was given. If the advantage of distinguishing between these cases,

|Pr [A^{M, Π_{k}} (1^{s}) = 1] - Pr [A^{M, O} (1^{s}) = 1]|,

where s is the security parameter, is negligible for any adversary, then the stegosystem is secure. For details, see [20].

3. A Simplified Blockchain Model

We shall start by describing our model for the blockchain. In order to clarify our disposition, we apply a simplified model that abstracts away technical details that are irrelevant for our investigation. For example, in practice, consensus can be reached with several different mechanisms, such as proof-of-work or proof-of-stake. However, for our method, we do not need to know the working details of the implementation as long as immutability is guaranteed. Our model is similar to the idealized public ledger model from [21]. The simplified ideal blockchain

B = (C, Σ, H, Read, Submit)

consists of a chain of blocks C, an existentially unforgeable [18] digital signature scheme

Σ

, a cryptographic hash function H that is modeled as a random oracle and two oracles

Read

and

Submit

that can be used to read and write data to the blockchain. We shall describe these components in detail below.

For privacy reasons, payment addresses should be used only once for each payment. While it is technically possible, address re-use is generally considered bad practice since it may lead to identity exposure (For Bitcoin, see for example (https://en.bitcoin.it/wiki/Address_reuse)). Typically, the pseudonyms (that is, the addresses) in a blockchain are made to look like random. For example, in Bitcoin and Ethereum the address is computed by hashing the public key. In the random oracle model, the hash function outputs random strings. That is, for any public key

p_{k}^{(i)}

, the corresponding address

a^{(i)} = H (p_{k}^{(i)})

is a random string.

The simplified ideal blockchain contains

Payee identities. In order to join the systems, an individual i generates a private and public key pair $(s_{k}^{(i)}, p_{k}^{(i)})$ using a digital signature scheme $Σ$ . When paying, individuals are represented by their public keys $p_{k}^{(i)}$ .
Recipient addresses. Payments to an individual i are sent to a one-time address $a^{(i)} \in {0, 1}^{n}$ that the recipient can publish after hashing her public key: $a^{(i)} = H (p_{k}^{(i)})$ . Each payment address appears only once in a payment and a new private and public key pair $(s_{k}^{(i)}, p_{k}^{(i)})$ is generated to receive additional payments.
Money. The blockchain records the amount of money $μ^{(i)}$ that is associated to an address $a^{(i)}$ and, consequently, to a public key $p_{k}^{(i)}$ .
Initial status of the blockchain. The initial total amount of money is distributed among a finite set of L individuals represented by their addresses $a^{(1)}, a^{(2)}, \dots, a^{(L)}$ . An amount of money $μ^{(i)} \geq 0$ is associated to each of these addresses and the initial status of the blockchain

$C_{0} = ((a^{(1)}, μ^{(1)}), (a^{(2)}, μ^{(2)}), \dots, (a^{(L)}, μ^{(L)}))$

is public knowledge and the first block of the chain.
Payments. Let $p_{k}^{(i)}$ and $p_{k}^{(j)}$ be two distinct public keys. A valid payment P of amount μ from a user i to a user j is a tuple

$P = (p_{k}^{(i)}, a^{(j)}, μ, t, σ),$

where $a^{(j)} = H (p_{k}^{(j)})$ is the address of user j, $μ$ is the amount of payment such that $μ \leq μ^{(i)}$ , t is a unique identifier or a timestamp for the payment and $σ$ is a digital signature,

$σ = Sign (s_{k}^{(i)}, (p_{k}^{(i)}, a^{(j)}, μ, t)) .$

Once the payment is published in the blockchain, $μ^{(i)}$ and $μ^{(j)}$ are (implicitly) updated accordingly.
The ideal chain of blocks. For the simplified ideal blockchain, all published payments are valid and are published in a tamper-proof chain C of payment blocks

$C = (C_{0}, C_{1}, C_{2}, \dots)$

where $C_{0}$ is the initial status of the blockchain and the block $C_{i + 1}$ consists of all of the valid payments submitted to the blockchain after the publication of block $C_{i}$ . In the ideal system, a new block appears after a finite and fixed amount of time.

Note that users are free to join the system by generating their own private and public key pairs

(s_{k}, p_{k})

. Any address generated from a public key appearing as the recipient of a payment is valid even if it has never appeared in the blockchain before. In practice, addresses are substrings of the digest

H (p_{k}^{(i)})

in order to make then of a manageable length. However, for simplicity, we use the whole digest. For our proofs to work, it is imperative that a unique address is generated for every payment.

To enable access to the blockchain, in our model the users are given oracle access to oracles

Read

and

Submit

which can be used to read the contents of the blockchain block by block and to submit new payments. These are defined as

$Read (i)$ on input the block number $i \in N_{\geq 0}$ outputs the block $C_{i}$ from the chain $(C_{0}, C_{1}, C_{2}, \dots, C_{i - 1}, C_{i}, C_{i + 1}, \dots)$ if it has appeared already. If the last block that has appeared is $C_{i^{'}}$ where $i^{'} < i$ , $Read$ outputs ⊥ indicating error.
$Submit (P)$ on input a payment $P = (p_{k}^{(i)}, a^{(j)}, μ, t, σ)$ verifies that the payment is valid and saves it to be published in the next block to appear. If the payment is invalid, it is discarded.

4. The Suggested $BLOCCE$ Scheme

In this section, we give a detailed description of our method called

BLOCCE = ({Gen}_{BLOCCE}, Embed, Extract)

(Blockchain Covert Channel). We first give a general overview of the scheme. Then we describe the embedding and extraction procedures in detail. Finally, we show that embedding runs in expected polynomial time (exluding the time spent waiting for new blocks to appear on the chain) and that the method is reliable.

4.1. General Overview of $BLOCCE$

In our scenario, Alice attempts to convey a hidden message to Bob through the blockchain, while the adversary attempts to detect any such communication. Contrary to traditional steganography, Alice has no ability to alter the “covertexts” (that is, the blocks) stored in the blockchain due to the consensus mechanism. However, Alice has complete control on the payments she submits to the chain. In addition, the consensus mechanism will protect these payments from the possible modification attempts of the adversary. We will apply these payments and, in particular, the payment addresses to convey a hidden message to Bob. For simplicity, we shall send one bit for each block appearing in the chain. The general overview of the scheme is the following:

Alice generates a number of private and public key pairs

$(s_{k}^{(1)}, p_{k}^{(1)}), (s_{k}^{(1)}, p_{k}^{(1)}), \dots, (s_{k}^{(n)}, p_{k}^{(n)})$

and generates the payment addresses $a^{(1)}, a^{(2)}, \dots, a^{(n)}$ corresponding to these keys.
Alice generates payments (of small amounts) from her own account to these addresses and, depending on the hiddentext message m, orders them such that the least significant bits (LSBs) of the payment addresses form m.
Alice submits the payments in the correct order to the blockchain.
Bob reads the blockchain for payments made by Alice and reads the hiddentext message from the LSBs of the payment addresses.

Note that Alice does not lose any money by running the scheme excluding the possible transaction costs, since she controls the generated key pairs

(s_{k}^{(i)}, p_{k}^{(i)})

. While the transaction costs for certain blockchain implementations using the proof-of-work paradigm may be significant, there are also blockchains with other consensus mechanisms that do not require transaction costs. The scheme has been depicted in a simplified form in Figure 1.

4.2. Embedding Into the Blockchain

In this section, we give a detailed description of the embedding algorithm. For simplicity, we assume that Alice sends exactly one payment to the blockchain for each published block. This means that we are able to embed at most one bit for each published block. It would be possible to embed multiple bits, but it would complicate the formulation of the method. The embedding of a single bit simplifies our disposition and makes it more clear for the reader. For the same reason, we assume that the length of the hidden text is fixed and known to both Alice and Bob. We would not need to make such an assumption, but it greatly simplifies the description of the algorithms, as well as makes the discussion more clear.

Let

B = (C, Σ, H, Read, Submit)

be a simplified ideal blockchain, where

Σ = ({Gen}_{Σ}, Sign, Verify)

and let the security parameter employed by the blockchain implementation be

1^{s}

. Let

H : {0, 1}^{*} \to {0, 1}^{n}

. Let also

SE = ({Gen}_{SE}, Enc, Dec)

be a symmetric encryption scheme that has pseudorandom ciphertexts under the chosen plaintext attack and suppose that the security parameter

1^{s}

is used for key generation. We note that it is crucial that an encryption scheme with pseudorandom ciphertexts is used. The argument of the security later in Section 5 is based the pseudorandomness of the ciphertexts. It should be noted that the utility of such encryption schemes in the construction of steganographic algorithms has been already observed in the literature [17,22].

Algorithm 2 Embedding Algorithm
1:	procedure $Embed$ ( $(k, λ), m, B$ )
2:	$c \leftarrow Enc (k, m)$
3:	Concatenate $c^{'} = λ \| \| c$
4:	Set $N = \| c^{'} \|$
5:	Interpret $c^{'}$ as a bit representation $c_{1}^{'} c_{2}^{'} \dots c_{N}^{'} \in {0, 1}^{N}$
6:	$i = 1$
7:	while $i \leq N$ do
8:	Generate unseen $(s_{k}, p_{k}) \leftarrow {Gen}_{Σ} (1^{s})$
9:	$a \leftarrow H (p_{k}^{(i)})$
10:	Interpret a as a bit representation $a_{1} a_{2} \dots a_{n} \in {0, 1}^{n}$
11:	if $a_{n} = c_{i}^{'}$ then
12:	$μ \leftarrow M_{H}$
13:	Generate a unique identifier t for the payment
14:	$σ \leftarrow Sign (s_{k}^{(A)}, (p_{k}^{(A)}, a, μ, t))$
15:	$Submit (p_{k}^{(A)}, a, μ, t, σ)$
16:	Wait for the blockchain to publish a new block
17:	Update $H$
18:	$i \leftarrow i + 1$
19:	end if
20:	end while
21:	end procedure

In the following, let the private and public key pair of Alice be

(s_{k}^{(A)}, p_{k}^{(A)})

. Let

H

denote the history of payments Alice has made through the blockchain. We denote by

M_{H}

the probability distribution on the amount of money in a payment of Alice conditioned on the history

H

. It is important that the payments are made according to this distribution to prevent the adversary from detecting the communication. For any consequent payments, we assume that

H

is updated with the most recent payment and

M_{H}

is the resulting probability distribution on the amount of money. For clarity, we also assume that Alice does not run out of money.

On input a security parameter

1^{s}

, the key generation algorithm

{Gen}_{BLOCCE}

outputs a secret key of the form

(λ, k)

, where

λ

is a uniformly random message start indicator such that

λ \in {0, 1}^{n_{λ}}, n_{λ} \in N,

that will enable Bob to detect the start of the hidden message and k is an encryption key

k \leftarrow {Gen}_{SE} (1^{s})

. The length

n_{λ}

of the message start indicator will play a role in the consideration of the reliability of the extraction. We assume that the concatenated length of the message start indicator

λ

together with the ciphertext

c \leftarrow Enc (k, m)

are known to both Alice and Bob and let us denote this length by

N = n_{λ} + | c |

, where

| c |

is the length of the ciphertext c. Embedding is described in Algorithm 2.

Note that since the algorithm waits for a new block to be published after the submission of a payment, only a single payment from

p_{k}^{(A)}

is ever included into a single block. This will help Bob to extract the message.

4.3. Extraction

Extraction is straightforward. Bob will read the blocks from the chain and scans for any transactions made by Alice. Once Bob detects the secret message start indicator

λ

, he can read the encrypted hidden message. Since there is a single payment from

p_{k}^{(A)}

for each block, the message can be gathered in the correct order. Extraction is described in Algorithm 3.

Algorithm 3 Extraction Algorithm
1:	procedure $Extract$ ( $(λ, k), B$ )
2:	$i = 1$
3:	$j = 1$
4:	while have not found $λ$ yet do ▹ Scan for $λ$
5:	$C = Read (j)$
6:	if $C = ⊥$ then
7:	Wait until a block appears and read it: $C = Read (j)$
8:	end if
9:	for any payment $P \in C$ do
10:	if P is from $p_{k}^{(A)}$ then
11:	Extract address a from P and get the LSB $a_{n}$
12:	Scan if we have found the entire $λ \in {0, 1}^{n_{λ}}$
13:	end if
14:	end for
15:	$j \leftarrow j + 1$
16:	end while
17:	$i = 1$ ▹ Now reading the encrypted hidden message
18:	while $i \leq N - n_{λ}$ do
19:	$C = Read (j)$
20:	if $C = ⊥$ then
21:	Wait until a block appears and read it: $C = Read (j)$
22:	end if
23:	for any payment $P \in C$ do
24:	if P is from $p_{k}^{(A)}$ then
25:	Extract address a from P and get the LSB $a_{n}$
26:	$c_{i} \leftarrow a_{n}$
27:	$i \leftarrow i + 1$
28:	end if
29:	end for
30:	$j \leftarrow j + 1$
31:	end while
32:	Compile $c = c_{1} c_{2} \dots c_{N - n_{λ}}$
33:	$m \leftarrow Dec (k, c)$
34:	output m
35:	end procedure

The variables used in

Embed

,

Extract

and the following proofs have been collected into Table 1 for easy reference.

4.4. Computational Complexity and Correctness

We shall now show that the embedding can be done in expected polynomial time excluding the time spent waiting for new blocks to appear. That is, we consider here the computational complexity theoretic notion of “time”, which is the number of computational steps needed for the algorithm to finish its task. Then, we show that the method is correct. That is, for a fixed

N \in N

and for any payload

(λ, c) \in {0, 1}^{N}

, Alice can embed it into the blockchain and Bob is able to extract it with high reliability.

Let us first establish the computational complexity of the embedding algorithm.

Proposition 1.

Suppose that

Submit

and

μ \leftarrow M_{H}

run in

O (1)

for any payment history

H

. Let

c_{E}

be the computational complexity of the encryption algorithm

Enc

,

c_{t}

be the complexity of generating a unique identifier for a payment,

c_{{Gen}_{Σ}}

be the complexity of the digital signature key generation

{Gen}_{Σ}

,

c_{H}

be the complexity of computing the hash of a public key and

c_{σ}

be the complexity of generating a signature for a payment.

Embed

runs in expected time of

O (c_{E} (N - n_{λ}) + N \cdot (2 c_{{Gen}_{Σ}} + 2 c_{H} + c_{t} + c_{σ})),

where N is the number of embedded bits and

n_{λ}

is the length of the message start indicator.

Proof.

Let

m \in {0, 1}^{N - n_{λ}}

be arbitrary. First, Alice runs a single encryption of m which takes

c_{E} (N - n_{λ})

steps. The while-loop is repeated for until i reaches the total number of embedded bits N. Now, i is updated whenever the LSB

a_{n}

of

a = H (p_{k}^{(i)})

is equal to

c_{i}^{'}

. Suppose that the while-loop ends in a total of M iterations. Let

A_{v}

denote the random variable on

{0, 1}

such that

A_{v} = 1

whenever

a_{n}

obtained from the call to H in the v-th while-loop iteration equals

c_{i}^{'}

and

A_{v} = 0

otherwise. Since H is a random oracle,

A_{1}, A_{2}, \dots

are independent and distributed according to the Bernoulli distribution with probability 1/2.

Let now I denote the random variable corresponding to i. We have

I = \sum_{v = 1}^{M} A_{v}

, where M is the number of calls we had to make to H. That is, I is distributed according to the binomial distribution

Bin (M, 1 / 2)

that has the expected value of

M / 2

. Since we need to match N values, we expect the while-loop to run

M = 2 N

times. In each of the loop iterations, we run

{Gen}_{Σ}

. Finally, lines 12–18 are repeated exactly N times. ☐

Note that

Embed

has to wait for a new block to be generated by the blockchain for each of the bits. Therefore, the actual time spent embedding depends also on the blockchain implementation. Even though the computational complexity of embedding is low (virtually close to the complexity of the applied encryption algorithm), for certain blockchains with inefficient block generation the actual time spent embedding can be long. For example, if the block generation takes time T which is significantly greater than the time needed to encrypt m, the time to embed N bits takes time

N \cdot T

since the majority of the time is spent waiting for new blocks. The same is naturally true for extraction.

Next, we prove that

BLOCCE

is reliable provided that the length of the message start indicator

λ

is large enough.

Proposition 2.

BLOCCE

on a simplified ideal blockchain

B

with a suitably long message start indicator λ is correct and reliable. That is, for messages of any length, Bob receives Alice’s message with the reliability of at least

1 - (L_{A} - n_{λ} + 1) \cdot 2^{- n_{λ}}

where

L_{A}

is the total number of payments Alice has submitted into the blockchain and

n_{λ}

is the length of the message start indicator λ.

Proof.

By our assumptions, the simplified ideal blockchain is tamper-proof meaning that the adversary is unable to prevent Alice’s submissions from appearing on the chain. Therefore, we may assume that Bob receives every block submitted by Alice. For simplicity, let us assume that all of the message has already appeared on the chain if it has been sent.

Suppose that Alice transmitted m. Let us first show that the scheme is correct, whenever Bob has detected the correct message start indicator

λ

. We shall later show that this happens with high probability. By the description of

Embed

, Alice submits the ciphertext c directly after

λ

by submitting a single bit for each new block. Since the blockchain is tamper-proof, Bob receives all of these blocks and, by the description of

Extract

, extracts the correct c. Finally, Bob decrypts c to obtain the correct hidden message m.

Let us now show that the method is reliable. That is, let us show that Bob is able to detect the correct message start indicator

λ

with high probability. By the description of

Extract

, Bob first scans the blockchain for all of the payments from

p_{k}^{(A)}

, extracts the LSB

a_{n}

of each address a and scans for the appearance of

λ \in {0, 1}^{n_{λ}}

. There are two ways that the extraction can fail:

The LSBs of the addresses of the payments Alice has made before transmitting $(λ, c)$ accidentally form $λ$ that Bob misinterprets as the start of the hidden message.
Alice has not submitted any message, but the LSBs of the addresses of her payments form $λ$ .

These two cases are similar, but provided that Alice has submitted the same number of payments in both cases, in the latter the reliability is lower. To see this, we observe that in the first case, we are trying to find a false match for

λ

in the subchain of blocks that appeared before the true match for

λ

, while in the latter case, we have the whole blockchain to search for the false match. Therefore, we can restrict ourselves to the second case. By our assumptions, Alice submits exactly one payment for each published block. We now have to derive a upper bound on the probability of these payments forming

λ

. Now, H is a random oracle and both

λ

and the addresses are sampled from the uniform distribution.

Let

α = α_{1} α_{2} \dots α_{L_{A}} \in {0, 1}^{L_{A}}

be the string of LSBs of the addresses extacted (in order) from all of Alice’s payments recorded into the blockchain (thus far). Let

A_{1}, A_{2}, \dots, A_{L_{A}}

denote the random variables corresponding to the choise of

α_{1}, α_{2}, \dots, α_{L_{A}}

each chosen independently and uniformly at random from

{0, 1}

. We derive an upper bound for the probability of hitting

λ

,

Pr [λ substring of α] .

We observe that

λ

can appear in any starting position

i \in {0, 1, \dots, L_{A} - n_{λ}}

in which case we have

λ = α_{i + 1} α_{i + 2} \dots α_{i + n_{λ}}

. Therefore, we estimate

Pr [λ substring of α]

with sum of the probabilities

Pr [A_{i + 1} A_{i + 2} \dots A_{i + n_{λ}} = λ]

for

i \in {0, 1, \dots, L_{A} - n_{λ}}

. This sum over-counts, since for large

L_{A}

,

λ

can appear in multiple positions. However, we are only interested in deriving an upper bound. We have,

\begin{matrix} Pr [λ substring of α] & \leq & \sum_{i = 0}^{L_{A} - n_{λ}} Pr [A_{i + 1} A_{i + 2} \dots A_{i + n_{λ}} = λ] \\ = & \sum_{i = 0}^{L_{A} - n_{λ}} \prod_{j = 1}^{w} Pr [A_{i + j} = λ_{j}] \\ = & \sum_{i = 0}^{L_{A} - n_{λ}} 2^{- n_{λ}} = (L_{A} - n_{λ} + 1) \cdot 2^{- n_{λ}} . \end{matrix}

Therefore, the reliability of

BLOCCE

is at least

1 - Pr [λ substring of α] \geq 1 - (L_{A} - n_{λ} + 1) \cdot 2^{- n_{λ}} .

☐

Interestingly, the reliability depends essentially only on the length of the message start indicator. Note that we have assumed that Alice has submitted a single bit for each block. If multiple bits are embedded into a single block, Proposition 2 needs to be updated accordingly. Finally, the result also assumes that addresses are not reused.

5. Security

In this section, we consider the security of

BLOCCE

. Following the notions of security for a stegosystem [20], we derive a security definition for the blockchain based covert channel by modeling a chosen hiddentext attack of a probabilistic polynomial time adversary on

BLOCCE

. We start by listing our assumptions. We then proceed to the formulation of a security definition called payment indistinguishability that requires the adversary to distinguish the payload containing payments from random. Finally, we show that

BLOCCE

satisfies this definition.

5.1. Assumptions

Our security proofs are based on a simplified ideal blockchain

B

. In particular, we assume that digital signatures are existentially unforgeable and the applied cryptographic hash function is modeled as a random oracle. There are three participants in our scenario:

Alice represents the transmitter of our scheme and is known through her (payee identity) public key $p_{k}^{(A)}$ . She has agreed with the recipient Bob beforehand on a secret key $(λ, k) \in {0, 1}^{n_{λ}} \times {0, 1}^{n_{k}}$ that is not known to anyone else. She attempts to send a confidential message m to Bob through the simplified ideal blockchain $B$ . Both Alice and Bob know the total amount of embedded bits N. Finally, Alice is aware of her “normal” distribution of payment amounts $M_{H}$ given her history of payments $H$ and is able to sample from it.
Bob represents the recipient of our scheme. He expects a confidential message from Alice through the blockchain, knows the public key $p_{k}^{(A)}$ of Alice, as well as the secret key $(λ, k)$ and the total number of embedded bits N.
The adverary attempts to detect the presence of covert communication on the blockchain. We assume that the adversary knows Alice and her public key $p_{k}^{(A)}$ . The warden also has complete access to the blockchain $B$ through the oracles $Read$ and $Submit$ . The job of the adversary is to distinguish the secret communication payments from regular payments.

5.2. Payment Indistinguishability

We apply a computational indistinguishability based approach for the security definition. In particular, we define a scenario, where the adversary has to distinguish the payments containing the hidden message from a set of random payments. To formalize this into a rigorous security definition, we can apply a chosen hiddentext attack described in Section 2.3. We model the situation by defining the following payment distinguishing experiment against a stegosystem

Π

in which the adversary

A

attempts to distinguish between the scenarios where it is either given a set of randomly generated addresses or a set of addresses containing the conceiled message each with probability 1/2.

We give the adversary full control to choose the hiddentext message m and to observe the blockchain. However, we do not give the adversary the power to block Alice from sending payments to the blockchain or to prevent Bob from observing the blockchain. We also do not give him access to the private keys generated by Alice or to the secret key

(λ, k)

shared between Alice and Bob. Note that since the digital signature scheme is unforgeable, this means that the adversary cannot masquerade as Alice and forge messages into the blockchain.

The probabilistic polynomial time adversary

A = (A_{1}, A_{2})

is modeled in two stages

A_{1}

and

A_{2}

. In the first stage, it is given access to the full block history of the blockchain through the oracle

Read

, the ability to submit payments through the oracle

Submit

, as well as the public key

p_{k}^{(A)}

of Alice. In the first stage, the job of the adversary is to output a hiddentext message m that Alice is required to send to Bob. In the experiment, a coin is then tossed and, based on the outcome, either the stegosystem

Π

is applied to send m into the blockchain or a set of random valid payments from Alice is generated. In the second stage,

A_{2}

is invoked with oracle access to the blockchain and it eventually outputs a bit trying to distinguish whether the blockchain contained the hidden message or just a set of random valid payments. As with the pseudorandom ciphetext experiment, a string storing the internal state information S of the adversary is also passed from

A_{1}

to

A_{2}

.

The rigorous definition of the payment distinguishing experiment is the following.

Algorithm 4 Payment Distinguishing Experiment
1:	procedure $PAY_DIST_{EXP}_{A}^{Π, B}$ ( $1^{s}$ )
2:	$(s_{k}, p_{k}) \leftarrow {Gen}_{Σ} (1^{s})$
3:	$(m, S) \leftarrow A_{1}^{Read, Submit} (p_{k})$ ▹ S is internal state information of the adversary that can be passed to the second stage
4:	$(λ, k) \leftarrow {Gen}_{Π} (1^{s})$
5:	$b \leftarrow U ({0, 1})$
6:	if $b = 1$ then ▹ Actual message is sent to the blockchain
7:	$Embed ((λ, k), m, B)$
8:	else ▹ Random payments are sent to the blockchain
9:	$n_{λ} \leftarrow \| λ \|$
10:	$N \leftarrow \| Enc (k, m) \| + n_{λ}$ ▹ $Enc$ is the encryption scheme used by $Π$
11:	Generate N random addresses $a_{i}$ for $i \in {1, 2, \dots, N}$
12:	Simulate $Embed$ to generate payments to $a_{i}$
13:	Submit payments to blockchain one-by-one as $Embed$ does
14:	end if
15:	$b^{'} \leftarrow A_{2}^{Read, Submit} (p_{k}, S)$
16:	if $b = b^{'}$ then
17:	return 1 ▹ $A$ guessed correctly
18:	else
19:	return 0 ▹ $A$ did not guess correctly
20:	end if
21:	end procedure

Definition 3.

Let

Π = (B, {Gen}_{Π}, Embed, Extract)

be a blockchain stegosystem based on a simplified ideal blockchain

B = (C, Σ, H, Read, Submit)

and let

A = (A_{1}, A_{2})

be a two stage probabilistic polynomial time adversary. The payment distinguishing experiment is defined by Algorithm 4.

We also define the advantage of an adversary in detecting the hidden message based on the payment distinguishing experiment.

Definition 4.

The payment detection advantage of an adversary

A

on a blockchain stegosystem Π on a simplified ideal blockchain

B

is

{Adv}_{A, Π, B}^{PAY_DETECT} (s) = |\frac{1}{2} - Pr [PAY_DIST_{EXP}_{A}^{Π, B} (1^{s}) = 1]| .

If the payment detection advantage of the adversary is significantly greater that 0, then, in practice, the adverasry is able to detect the conceiled message. For a secure system, we want this advantage to be negligible.

Definition 5.

The blockchain stegosystem Π securely embeds into the blockchain

B

if for every probabilistic polynomial time adversary

A

, there is a negligible function ϵ such that

{Adv}_{A, Π, B}^{PAY_DETECT} (s) \leq ϵ (s)

for every

s \geq 1

.

5.3. Security Proof of $BLOCCE$

We shall now show that

BLOCCE

securely embeds into the blockchain. In particular, we derive an algorithm that reduces the problem of distinguishing the ciphertexts of the encryption scheme

SE

used by

BLOCCE

to the problem of distinguishing the payments made with

BLOCCE

. Since

SE

is assumed to have pseudorandom ciphertexts, this shows that there is no adversary that succeeds in the payment distinguishing experiment with non-negligible advantage.

Proposition 3.

BLOCCE

securely embeds into a simplified ideal blockchain

B

. For every probabilistic polynomial time adversary

A

there is a probabilistic polynomial time adversary

A^{'}

such that

{Adv}_{A^{'}, SE}^{PRC} (s) = {Adv}_{A, BLOCCE, B}^{PAY_DETECT} (s) \leq ϵ (s),

where

SE

is the encryption scheme used in

BLOCCE

and ϵ is a negligible function.

Proof.

Let

A = (A_{1}, A_{2})

be any two-stage probabilistic polynomial time algorithm considered as an adversary against

BLOCCE

. We need to show that there is a negligible function

ϵ

such that

{Adv}_{A, BLOCCE, B}^{PAY_DETECT} (s) \leq ϵ (s) .

Suppose that there was an adversary

A

that succeeds with a non-negligible advantage. Based on such an adversary, we shall construct a probabilistic polynomial time algorithm

A^{'}

that applies

A

to achieve a high ciphertext distinguishing advantage for the symmetric encryption scheme

SE

. In particular, we show that the ciphertext distinguishing advantage is at least the advantage of

A

in the payment distinguishing experiment. Since, by the assumptions,

SE

has pseudorandom ciphertexts (see Section 4.2), and thus the ciphertext distinguishing advantage is negligible for every adversary, we get the claim.

For this, let

A^{'} = (A_{1}^{'}, A_{2}^{'})

be a two-stage adversary, that applies

A = (A_{1}, A_{2})

, given below. In the description, we need to save the status information S that

A_{1}

outputs in order to invoke

A_{2}

in the later state. In addition, since we are emulating a payment distinguishing experiment, we also need to initialize a blockchain and to maintain its state. Therefore, we store the state and internal information of the blockchain into an information string

S^{'}

for a coherent second stage

A_{2}^{'}

.

The adversary

A^{'} = (A_{1}^{'}, A_{2}^{'})

is described in Algorithms 5 and 6.

Algorithm 5 First Stage of the Adversary $A^{'}$
1:	procedure ${A'}_{1}^{{Enc}_{k}}$ ( $1^{s}$ )
2:	Initialize a blockchain $B$
3:	$(s_{k}, p_{k}) \leftarrow {Gen}_{Σ} (1^{s})$
4:	$(m, S) \leftarrow A_{1}^{Read, Submit} (p_{k})$ ▹ Answers the queries according to the specification of $B$
5:	$S^{'} \leftarrow$ state and internal information of $B$
6:	output $(m, (S, S^{'}, p_{k}, s_{k}))$
7:	end procedure

Algorithm 6 Second Stage of the Adversary $A^{'}$
1:	procedure ${A'}_{2}^{{Enc}_{k}}$ ( $c, (S, S^{'}, p_{k}, s_{k})$ )
2:	Initialize a blockchain $B$ according to the state $S^{'}$
3:	$(λ, k) \leftarrow {Gen}_{BLOCCE} (1^{s})$
4:	Embed $λ \| \| c$ into $B$ by simulating $Embed$
5:	$b^{'} \leftarrow A_{2}^{Read, Submit} (p_{k}, S)$
6:	output $b^{'}$
7:	end procedure

If

A

is probabilistic polynomial time, so is

A^{'}

. Suppose that

A^{'}

was run in an

PRC_EXP

experiment. Let D denote the random variable corresponding to the experiment coin toss (b of

PRC_EXP

line 4) such that

D = 1

if

A^{'}

was given the correct

c \leftarrow Enc (k, m)

under a random key k and

D = 0

if it was given a random

c \leftarrow U ({0, 1}^{N - n_{λ}})

. Depending on D, we have the following two cases:

Suppose first that $D = 1$ and $A^{'}$ was given the correct $c \leftarrow Enc (k, m)$ . Then $A^{'}$ embeds $λ | | c$ into the blockchain which follows the payment distinguishing experiment for $A$ for the case $b = 1$ . Since $A_{2}^{'}$ outputs the same bit $b^{'}$ as $A_{2}$ we have

$Pr [A^{'} succeeds in PRC_EXP | D = 1] = Pr [A succeeds | D = 1] .$
Suppose now that $D = 0$ and c is a uniformly random string. By the description of ${Gen}_{BLOCCE}$ , $λ$ is also uniformly random, meaning that a uniformly random string $λ | | c$ gets embedded into the blockchain. By the description of the payment distinguishing experiment, this is equal to the case $b = 0$ and

$Pr [A^{'} succeeds in PRC_EXP | D = 0] = Pr [A succeeds | D = 0] .$

We have established that

A^{'}

, which we constructed to be an adversary to distinguish the ciphertexts of

SE

from random, succeeds in its experiment if and only if

A

succeeds in its own payment distinguishing experiment. Therefore,

Pr [PRC_{EXP}_{A^{'}}^{SE} (1^{s}) = 1] = Pr [PAY_DIST_{EXP}_{A}^{BLOCCE, B} (1^{s}) = 1] .

By the definition of advantage,

{Adv}_{A^{'}, SE}^{PRC} (s) = {Adv}_{A, BLOCCE, B}^{PAY_DETECT} (s) .

By the definition of

BLOCCE

(see Section 4.2), the applied symmetric encryption scheme

SE

has pseudorandom ciphertexts under a chosen plaintext attack, and there is a negligible function

ϵ

such that

{Adv}_{A^{'}, SE}^{PRC} (s) \leq ϵ (s)

for every

s \geq 1

(see Section 2.2). Since

A

was any two-stage probabilistic polynomial time adversary and

{Adv}_{A, BLOCCE, B}^{PAY_DETECT} (s) = {Adv}_{A^{'}, SE}^{PRC} (s) \leq ϵ (s),

we have the claim and

BLOCCE

securely embeds into a simplified ideal blockchain. ☐

6. Discussion and Future Work

To simplify our investigations, we restricted ourselves to the embedding of a single bit for each block. For many applications, such as Bitcoin, the time to publish a new block is counted in minutes instead of seconds. Therefore, the throughput of our scheme is low. However, it is easy to increase the number of embedded bits per block. One way to do it would be to match multiple LSBs of the address. However, in such a case, the expected computation time for embedding grows exponentially in the number of bits, since the bits are drawn from the random oracle. Proposition 2 would also need to be updated accordingly. It is more efficient to submit several payments into the same block. In such a case, the ordering of the message bits have to be ensured, for example, by using the unique payment identifier t or the public payer keys.

Our method requires the user to distribute the message bits over several payments. For many contemporary blockchains that apply the proof-of-work paradign for it consensus mechanism the transaction costs may be significant for larger hidden messages. However, there are newer consensus mechanisms that aim to address the energy usage issue of the proof-of-work paradigm to improve block generation efficiency and to ultimately possibly remove transaction costs in certain applications. Such mechanisms include proof-of-stake, distributed proof-of-stake and byzantine fault tolerance based mechanisms. For example, at the moment, Ethereum is planning on moving to the prove-of-stake paradigm. While there will be high transaction costs in certain use cases of blockchain, we believe that, in many applications, transaction costs will not be an issue and our method will prove to be useful.

Performance of our method can be also increased by pre-computing a list of L addresses, where L is significantly greater than N, the total length of the embedded message. Since the LSBs of these generated addresses are random, approximately one half will be ones and the rest zeros. Once Alice is ready to embed the payload

(λ, c)

, she can pick the addresses in order from the pre-generated list. This approach would also mitigate the computational complexity of the embedding of multiple bits into a single payment. Furthermore, addresses to embed

λ

can be pre-computed immediately after the key

(λ, k)

has been agreed on. However, we leave these considerations for future work.

It should be noted that it is important that the hidden message is first encrypted using an encryption scheme

SE

that has pseudorandom ciphertexts. If it is not the case, the adversary is able to detect the non-random nature of the LSBs of the addresses in Alice’s payments. For the same reason, the underlying blockchain should not allow the reuse of addresses. In addition to being a privacy risk [23,24,25], it would render the payment indistinguishability approach to the security inapplicable. For such systems, we would need to formulate a security definition that is based on the probability distribution of “normal” payments of Alice and the payload containing payments should be indinstinguishable from this distribution. We leave these considerations also for future work.

In this paper, we have not implemented BLOCCE in practice. Instead, we have considered it in the simplified ideal blockchain model that abstracts away details that are not relevant for the theoretical investigations. For example, the network is completely abstracted away in the simplified model. However, the details, such as the network, are relevant when considering a practical implementation of the scheme. We leave it as future work to investigate the secure implementation of BLOCCE using an existing blockchain such as Ethereum.

7. Conclusions

We suggest the first provably secure method called

BLOCCE

of implementing a covert communication channel over a blockchain. Our proofs are shown in the random oracle model, where the cryptographic hash function used by the blockchain is modeled as a random oracle. We formulate a simplified ideal blockchain that models the blockchain implementations underlying existing cryptocurrencies. Based on this model, we suggest a method of embedding a single bit for each block using payments submitted to the blockchain. We show that the method is reliable and runs in expected polynomial time. The method can be generalized to embed multiple bits to increase the throughput. To model the security of covert channels on a blockchain, we formulate the notion of payment indistinguishability, where the transmitted hidden message should be computationally indistinguishable from random payments. Finally, we show that

BLOCCE

satisfies this definition.

Funding

This research was supported by Tekes, the Finnish Funding Agency for Technology and Innovation in VitalSens project.

Conflicts of Interest

The authors declare no conflict of interest.

References

Nakamoto, S. Bitcoin: A Peer-to-Peer Electronic Cash System. 2008. Available online: https://bitcoin.org/bitcoin.pdf (accessed on 12 February 2018).
Wood, G. Ethereum: A Secure Decentralized Transaction Ledger. 2014. Available online: http://gavwood.com/paper.pdf (accessed on 13 February 2018).
Christidis, K.; Devetsikiotis, M. Blockchains and Smart Contracts for the Internet of Things. IEEE Access 2016, 4, 2292–2303. [Google Scholar] [CrossRef]
Mettler, M. Blockchain technology in healthcare: The revolution starts here. In Proceedings of the 2016 IEEE 18th International Conference on e-Health Networking, Applications and Services (Healthcom), Munich, Germany, 14–16 September 2016; pp. 1–3. [Google Scholar]
Azaria, A.; Ekblaw, A.; Vieira, T.; Lippman, A. MedRec: Using Blockchain for Medical Data Access and Permission Management. In Proceedings of the 2016 2nd International Conference on Open and Big Data (OBD), Vienna, Austria, 22–24 August 2016; pp. 25–30. [Google Scholar]
Matzutt, R.; Hiller, J.; Henze, M.; Ziegeldorf, J.H.; Müllmann, D.; Hohlfeld, O.; Wehrle, K. A Quantitative Analysis of the Impact of Arbitrary Blockchain Content on Bitcoin. In Proceedings of the 22nd International Conference on Financial Cryptography and Data Security (FC), Santa Barbara, 26 February–2 March 2018; Springer: Berlin, Germany, 2018. [Google Scholar]
Matzutt, R.; Henze, M.; Ziegeldorf, J.H.; Hiller, J.; Wehrle, K. Thwarting Unwanted Blockchain Content Insertion. In Proceedings of the 2018 IEEE International Conference on Cloud Engineering (IC2E), Orlando, FL, USA, 17–20 April 2018; pp. 364–370. [Google Scholar]
Barber, S.; Boyen, X.; Shi, E.; Uzun, E. Bitter to Better—How to Make Bitcoin a Better Currency. In Financial Cryptography and Data Security; Keromytis, A.D., Ed.; Springer: Berlin, Germany, 2012; pp. 399–414. [Google Scholar]
Miers, I.; Garman, C.; Green, M.; Rubin, A.D. Zerocoin: Anonymous Distributed E-Cash from Bitcoin. In Proceedings of the 2013 IEEE Symposium on Security and Privacy, Berkeley, CA, USA, 19–22 May 2013; pp. 397–411. [Google Scholar]
Sasson, E.B.; Chiesa, A.; Garman, C.; Green, M.; Miers, I.; Tromer, E.; Virza, M. Zerocash: Decentralized Anonymous Payments from Bitcoin. In Proceedings of the 2014 IEEE Symposium on Security and Privacy, Berkeley, CA, USA, 18–21 May 2014; pp. 459–474. [Google Scholar]
Heilman, E.; Baldimtsi, F.; Goldberg, S. Blindly Signed Contracts: Anonymous On-Blockchain and Off-Blockchain Bitcoin Transactions. In Financial Cryptography and Data Security; Clark, J., Meiklejohn, S., Ryan, P.Y., Wallach, D., Brenner, M., Rohloff, K., Eds.; Springer: Berlin, Germany, 2016; pp. 43–60. [Google Scholar]
Zyskind, G.; Nathan, O.; Pentland, A. Decentralizing Privacy: Using Blockchain to Protect Personal Data. In Proceedings of the 2015 IEEE Security and Privacy Workshops, San Jose, CA, USA, 21–22 May 2015; pp. 180–184. [Google Scholar]
Aitzhan, N.Z.; Svetinovic, D. Security and Privacy in Decentralized Energy Trading through Multi-signatures, Blockchain and Anonymous Messaging Streams. IEEE Trans. Dependable Secur. Comput. 2016. [Google Scholar] [CrossRef]
Shafagh, H.; Burkhalter, L.; Hithnawi, A.; Duquennoy, S. Towards Blockchain-based Auditable Storage and Sharing of IoT Data. In Proceedings of the 2017 on Cloud Computing Security Workshop, CCSW ’17, Dallas, TX, USA, 3 November 2017; ACM: New York, NY, USA, 2017; pp. 45–50. [Google Scholar] [Green Version]
Bhowmik, D.; Feng, T. The multimedia blockchain: A distributed and tamper-proof media transaction framework. In Proceedings of the 2017 22nd International Conference on Digital Signal Processing (DSP), London, UK, 23–25 August 2017; pp. 1–5. [Google Scholar]
Goldwasser, S.; Micali, S.; Rivest, R.L. A digital signature scheme secure against adaptive chosen-message attacks. SIAM J. Comput. 1988, 17, 281–308. [Google Scholar] [CrossRef]
Möller, B. A Public-Key Encryption Scheme with Pseudo-random Ciphertexts. In Computer Security–ESORICS 2004; Samarati, P., Ryan, P., Gollmann, D., Molva, R., Eds.; Springer: Berlin, Germany, 2004; pp. 335–351. [Google Scholar]
Katz, J.; Lindell, Y. Introduction to Modern Cryptography; Chapman & Hall/CRC: Boca Raton, FL, USA, 2007. [Google Scholar]
Hopper, N.J.; Langford, J.; von Ahn, L. Provably Secure Steganography. In Advances in Cryptology—CRYPTO 2002; Yung, M., Ed.; Springer: Berlin, Germany, 2002; pp. 77–92. [Google Scholar]
Dedić, N.; Itkis, G.; Reyzin, L.; Russell, S. Upper and Lower Bounds on Black-Box Steganography. J. Cryptol. 2009, 22, 365–394. [Google Scholar] [CrossRef]
Micali, S. ALGORAND: The Efficient and Democratic Ledger. ArXiv, 2016; arXiv:1607.01341. [Google Scholar]
Backes, M.; Cachin, C. Public-Key Steganography with Active Attacks. In Theory of Cryptography; Kilian, J., Ed.; Springer: Berlin, Germany, 2005; pp. 210–226. [Google Scholar]
Ron, D.; Shamir, A. Quantitative Analysis of the Full Bitcoin Transaction Graph. In Financial Cryptography and Data Security; Sadeghi, A.R., Ed.; Springer: Berlin, Germany, 2013; pp. 6–24. [Google Scholar]
Bos, J.W.; Halderman, J.A.; Heninger, N.; Moore, J.; Naehrig, M.; Wustrow, E. Elliptic Curve Cryptography in Practice. In Financial Cryptography and Data Security; Christin, N., Safavi-Naini, R., Eds.; Springer: Berlin, Germany, 2014; pp. 157–175. [Google Scholar]
Fleder, M.; Kester, M.S.; Pillai, S. Bitcoin Transaction Graph Analysis. ArXiv, 2015; arXiv:1502.01657. [Google Scholar]

Figure 1. A simplified overview of the suggested method. Alice submits payments to the blockchain such that the LSBs of the addresses form the message m. Bob can read the message bits from those addresses and form m. In the actual method, encryption is also used and the start of the message is indicated.

Table 1. Used variables and notations.

Variable	Explanation
H	hash function
$(s_{k}, p_{k})$	private and public key pair for the digital signature scheme
$(s_{k}^{(A)}, p_{k}^{(A)})$	Alice’s keypair
k	secret key for symmetric encryption
$λ$	message start indicator
$n_{λ}$	length of $λ$
m	hiddentext message
c	$c \leftarrow Enc (k, m)$
$c^{'}$	concatenation $c^{'} = λ \| \| c$
N	total number of embedded bits $N = \| c^{'} \|$
$a, a^{(i)}$	address computed by hashing a public key
$H$	the history of payments Alice has made through the blockchain
$M_{H}$	distribution on the amount of money in Alice’s payment conditioned on the history of payments $H$
$μ$	payment amount
t	unique payment identifier
$σ$	digital signature
$C, C_{i}$	block in the blockchain
P	payment
$L_{A}$	total number of payments made by Alice

© 2018 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Partala, J. Provably Secure Covert Communication on Blockchain. Cryptography 2018, 2, 18. https://doi.org/10.3390/cryptography2030018

AMA Style

Partala J. Provably Secure Covert Communication on Blockchain. Cryptography. 2018; 2(3):18. https://doi.org/10.3390/cryptography2030018

Chicago/Turabian Style

Partala, Juha. 2018. "Provably Secure Covert Communication on Blockchain" Cryptography 2, no. 3: 18. https://doi.org/10.3390/cryptography2030018

APA Style

Partala, J. (2018). Provably Secure Covert Communication on Blockchain. Cryptography, 2(3), 18. https://doi.org/10.3390/cryptography2030018

Article Menu

Provably Secure Covert Communication on Blockchain

Abstract

1. Introduction

Related Work

2. Preliminaries

2.1. Notation

2.2. Cryptographics Primitives

2.3. Steganography

3. A Simplified Blockchain Model

4. The Suggested $BLOCCE$ Scheme

4.1. General Overview of $BLOCCE$

4.2. Embedding Into the Blockchain

4.3. Extraction

4.4. Computational Complexity and Correctness

5. Security

5.1. Assumptions

5.2. Payment Indistinguishability

5.3. Security Proof of $BLOCCE$

6. Discussion and Future Work

7. Conclusions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Provably Secure Covert Communication on Blockchain

Abstract

1. Introduction

Related Work

2. Preliminaries

2.1. Notation

2.2. Cryptographics Primitives

2.3. Steganography

3. A Simplified Blockchain Model

4. The Suggested BLOCCE Scheme

4.1. General Overview of BLOCCE

4.2. Embedding Into the Blockchain

4.3. Extraction

4.4. Computational Complexity and Correctness

5. Security

5.1. Assumptions

5.2. Payment Indistinguishability

5.3. Security Proof of BLOCCE

6. Discussion and Future Work

7. Conclusions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4. The Suggested $BLOCCE$ Scheme

4.1. General Overview of $BLOCCE$

5.3. Security Proof of $BLOCCE$