From Bribery–Stubborn Mining to Leading Hidden Triple-Fork Strategies for Incentive Optimization in PoW Blockchains

Li, Weijie; Jiang, Shan; Ni, Bina; Liang, Weipeng; Wang, Yu

doi:10.3390/sym17101618

Open AccessArticle

From Bribery–Stubborn Mining to Leading Hidden Triple-Fork Strategies for Incentive Optimization in PoW Blockchains

by

Weijie Li

^1,2

,

Shan Jiang

³,

Bina Ni

^1,2,

Weipeng Liang

⁴ and

Yu Wang

^1,2,*

¹

School of Artificial Intelligence, Guangzhou University, Guangzhou 510006, China

²

Guangdong Key Laboratory of Blockchain Security, Guangzhou University, Guangzhou 510006, China

³

School of Science, Computing and Engineering Technologies, Swinburne University of Technology, Melbourne, VIC 3122, Australia

⁴

China Telecom Corporation Limited Jiangmen Branch, Jiangmen 529000, China

^*

Author to whom correspondence should be addressed.

Symmetry 2025, 17(10), 1618; https://doi.org/10.3390/sym17101618

Submission received: 25 August 2025 / Revised: 12 September 2025 / Accepted: 15 September 2025 / Published: 30 September 2025

(This article belongs to the Special Issue Applications Based on Symmetry in Adversarial Machine Learning)

Download

Browse Figures

Versions Notes

Abstract

Proof-of-Work (PoW) blockchains with symmetric consensus face threats such as selfish mining, bribery mining, block withholding, and replay attacks. This paper introduces a hybrid attack, Bribery–Stubborn Mining (BSbM), which integrates stubborn mining’s delayed chain publication with bribery incentives to recruit miners during forks. Simulation experiments confirm that BSbM yields additional revenue. To obtain even higher revenue, we propose Leading Hidden Bribery–Stubborn Mining (LHBSbM) based on BSbM. By concealing and delaying broadcasts, LHBSbM constructs a triple fork, maintaining three chains at the same height. Upon revealing the private chain, two public blocks can be isolated, breaking the single-block limit of double-fork attacks. Theoretical analysis shows that LHBSbM raises the attacker’s maximum effective block rate from

α / (1 - α)

to

α / (1 - α - β)

. Experimental results indicate that, under ideal conditions (

r = 0

), BSbM becomes profitable once the attacker’s hash rate (

α

) exceeds approximately

34 %

and further confirm that, under certain conditions, LHBSbM nearly doubles isolated blocks compared to BSbM, yielding greater profits. Finally, potential defenses against such hybrid attacks are discussed, offering new insights for blockchain security.

Keywords:

blockchain; proof-of-work; mining attack; stubborn mining; bribery attack; triple fork

1. Introduction

Blockchain technology first appeared in a paper [1] by Satoshi Nakamoto, published in November 2008. This paper describes blockchain technology as a decentralized distributed ledger technology that enables data to be recorded and stored securely, traceably, and transparently while remaining tamper-proof. The paper also introduced the world’s first decentralized digital cryptocurrency—Bitcoin. Although initially developed as a transaction platform for cryptocurrency, the impact of blockchain technology extends far beyond this. Its applications are no longer confined to finance, with fields such as the Internet of Things, healthcare, education, and the arts inevitably being transformed by this technology. As blockchain technology advances rapidly and gains widespread adoption, concerns about its security are also receiving increasing attention [2,3,4,5,6,7]. Among these, mining attacks represent one of the most severe threats to the security of blockchain systems [8]. By exploiting vulnerabilities in blockchain consensus mechanisms, attackers can launch mining attacks that disrupt the system’s incentive balance and potentially cause the entire blockchain system to collapse.

Many mainstream cryptocurrencies, such as Bitcoin, Dogecoin, and Litecoin [9], rely on the PoW consensus mechanism [10,11]. In PoW systems, miners compete to solve computationally intensive puzzles in order to validate transactions and secure the network. As a reward, they receive block subsidies and transaction fees. Although PoW-based blockchains integrate cryptographic techniques and decentralized architectures to enhance security, they are not immune to all forms of attack, particularly strategic attacks targeting the consensus mechanism and incentive model. The “permissionless” nature of blockchain provides a relatively low barrier to entry for malicious actors.

PoW consensus relies on the fundamental principle of symmetry, whereby all participants compete on a level playing field. However, this principle is vulnerable to symmetry-breaking attacks [12] that exploit private chain withholding to create information asymmetry and gain unfair advantages. Building on this concept, our work introduces hybrid strategies that induce profound behavioral and structural asymmetries through Bribery–Stubborn Mining and multi-fork orchestration. These tactics pose more complex threats to the network.

The earliest view held that miners could maximize their mining rewards by strictly following the blockchain’s consensus rules, which was believed to be the optimal strategy [13]. However, this conventional belief was challenged in 2013 with the emergence of the selfish mining attack [14], demonstrating that miners could gain additional revenue by deliberately deviating from the protocol. This opened the door to a series of increasingly sophisticated strategies, including the more aggressive stubborn mining (SbM) [15], block withholding attacks (BWH) [16], and various hybrid approaches.

Among these, hybrid attacks have proven particularly potent. While strategies like bribery–selfish mining (BSM) [17] demonstrated the effectiveness of combining economic incentives with protocol exploits, the potential of pairing bribery with the more aggressive stubborn mining strategy [15] remains a critical and unexplored area of research.

This paper argues that this specific combination has been underexplored and poses a unique threat. Unlike selfish mining, stubborn mining’s tactic of maintaining a private chain even when falling behind prolongs fork competitions, which amplifies the decisive role of bribes in branch selection. Therefore, we first propose the Bribery–Stubborn Mining (BSM) strategy, where attackers selectively bribe target miners to collaborate during these prolonged forks. This hybrid attack is highly effective and yields additional revenue, but it is constrained by a revenue ceiling. To obtain higher revenue, we introduce the Leading Hidden Bribery–Stubborn Mining (LHBSbM) strategy. By delaying the release of key blocks to construct a triple fork, thereby breaking the traditional double-fork framework, LHBSbM can simultaneously orphan two public blocks, securing additional lead time for the attacker’s private chain and increasing the overall expected profit. This paper is an extended version of our previous conference paper [18].

The key contributions of this paper are as follows:

We propose a hybrid strategy for Bribery–Stubborn Mining that integrates economic bribery with the nonconceding rules of stubborn mining. Through simulation analysis, we derive the criteria under which it is optimal for a rational miner to accept a bribe.
We propose an attack strategy termed Leading Hidden Bribery–Stubborn Mining (LHBSbM), which creates triple-fork scenarios. By constructing a triple fork, the attacker can orphan blocks mined by target miners, increase the effective block share, extend the private chain lead, and consequently increase the expected revenue share.
We develop a Markov chain-based [19] analytical model to evaluate the expected rewards and incentive compatibility for both attackers and target miners.
We conduct extensive simulations to validate the effectiveness of the proposed strategies compared with traditional mining approaches.

The remainder of this paper is organized as follows. Section 2 reviews several classes of attacks that exploit weaknesses of Proof-of-Work consensus. Section 3 introduces the proposed Bribery–Stubborn Mining (BSbM) and Leading Hidden Bribery–Stubborn Mining (LHBSbM) strategies. We present their attack models, state transition rules, and formalize the theoretical analysis using a Markov chain-based framework. Section 4 details the simulation setup and evaluates the performance of the proposed strategies under various conditions. Comparisons are made against honest mining and traditional stubborn mining in terms of reward efficiency and chain control. Section 5 primarily reviews our work, discusses potential defenses against such attacks, and outlines directions for future research. Section 6 concludes our work.

2. Related Work

In each round of new block creation within a blockchain system, every node has the opportunity to package transactions and create a new block. However, if multiple nodes generate different new blocks simultaneously, it leads to a blockchain fork, which prevents consensus from being achieved. Therefore, one node must be selected to lead the consensus process; the new block constructed by this node will be recognized by the blockchain network. To reduce the likelihood of malicious nodes leading the consensus process, leadership can be assigned at random. However, simple random assignment is insecure; for example, attackers could launch a Sybil attack [20] by impersonating multiple nodes to increase their chances of gaining consensus leadership. This challenge led to the development of the Proof-of-Work (PoW) consensus mechanism. In PoW, nodes compete by expending computational resources to compute a hash value. The node that successfully calculates the hash value gains leadership rights. This mechanism effectively transforms simple random assignment into a proportional representation of a node’s computational power. In blockchain systems based on the PoW consensus mechanism, the scale of the computational power possessed by miners is the key determinant of mining success. In other words, the greater a miner’s computational power, the greater their probability of mining a new block and receiving the corresponding reward. However, in such decentralized, reward-driven blockchain systems lacking effective oversight, new mining attack strategies continue to be developed.

Based on different mining attack methods, mining attacks against Proof-of-Work (PoW) blockchains can be grouped into four categories: (i) 51% attack, (ii) selfish mining attack, (iii) block withholding attack, and (iv) denial-of-service (DoS) attack.

2.1. Selfish Mining Attack

Selfish mining, proposed by Eyal et al. [14] in 2013, is an attack strategy that manipulates block publication so that the attacker’s profitability threshold falls below that of a 51% attack. Upon finding a block, the attacker withholds it instead of broadcasting immediately, keeps it on a private chain, and releases it at strategically advantageous moments to override blocks mined by honest participants. Building on this idea, Nayak et al. [15] introduced a more aggressive family termed stubborn mining, whose core principle is to avoid abandoning the private chain. They describe three cases: lead stubborn mining (if an honest block appears while the private chain is already two blocks ahead, the attacker continues mining privately rather than publishing to override), Competitive Stubborn Mining (when the private and public chains are tied and a fork exists, the attacker keeps mining privately instead of overriding), and trail stubborn mining (the attacker persists on the private chain even when lagging) [15].

In terms of optimization, Sapirshtein et al. [21] proposed an optimal selfish mining policy using reinforcement learning, modeling the miner as an intelligent agent that learns the best action for each state via iterative updates; in theory, the resulting policy approaches the upper bound of selfish mining profitability. Zhang et al. [22] extended this reinforcement learning approach to stubborn mining, designing adaptive policies that respond to fork conditions. Beyond single-strategy optimization, hybrid attacks have been explored. Gao et al. proposed a bribery–selfish mining strategy that augments blocks with additional fee incentives to recruit target miners [17]. Yang et al. [23] modeled bribery-based selfish mining as a Markov decision process, analyzing dynamic rewards and learning an optimal bribery–selfish policy. Multi-attacker behavior has also been studied: Azimy et al. [24] evaluated attack efficiency with a blockchain multi-attacker simulator, and Bai et al. [25] examined the broader impact of multiple selfish miners on system stability. Additional variants include Jeyasheela et al.’s Q-learning-based selfish mining strategy [26] and an undetectable selfish mining design by Bahrani et al. [27] that aims to suppress orphan-block anomalies and evade detection.

To counter selfish mining, a range of defenses has been proposed. Billah [28] introduced Freshness Preferred, which penalizes withheld blocks by monitoring timestamps and reducing rewards for delayed publication, thereby raising the selfish mining threshold from about 25% to 32% under their model. On the detection side, Saad et al. [29] analyze transaction volume, serial numbers, and mining costs to flag anomalous behavior, and Wang et al. [30] showed that uncle block mechanisms can reduce the profitability of selfish and stubborn mining. Other mitigation strategies include Bicer et al.’s timestamp-based defenses [31] and Lee et al.’s “detective” strategy that checks whether a pool’s claimed previous-block hash has already appeared on-chain [32]. Additional proposals include Habib et al.’s virtual block mechanism that constrains release timing [33], Wang et al.’s SM-NEEDLE deep learning detector [34], and Nikhalat-Jahromi et al.’s AI-based defense that assigns time-dependent weights to downrank likely withheld blocks [35].

2.2. The 51% Attack

The 51% attack, first described by King et al., is among the earliest mining attacks and features both the highest entry threshold and the greatest destructive power. Once an adversary controls more than 50% of the network’s total hash power, it can dictate block production and chain selection, effectively seizing control of the blockchain and undermining decentralization. The attacker can initiate reorganizations at any height, create forks, and roll back transactions, thereby enabling double spending [36] (i.e., invalidating previously confirmed transactions). Under idealized assumptions, the attacker’s profit is theoretically unbounded.

Since the advent of blockchain technology, the 51% attack has attracted sustained attention as a potential security threat. Miller et al. [37] report a centralizing trend in Bitcoin’s hash power, suggesting a rising likelihood of 51% attacks. Karame et al. [36] demonstrate the practical feasibility of double spending in Bitcoin’s fast-payment scenarios. To counter 51% attacks, several defenses have been proposed. Yang [38] designs a weighted-difficulty PoW protocol in which, even if an attacker’s branch advances faster than the main chain, its historical weighted difficulty remains lower than that of the main chain, preventing the malicious branch from overtaking. Sayeed [39] proposes a penalty mechanism that increases an attacker’s cost by comparing the height of incoming blocks with the current chain length. Bae [40] introduces random miner grouping based on hash functions and wallet addresses; after a block is mined, the group responsible for the next block is selected via the current network hash value, thereby constraining mining within specific groups and effectively reducing the risk of 51% attacks.

2.3. Block Withholding Attack

Block withholding attacks (BWH), proposed by Rosenfeld [41] in 2011, were initially regarded as a “sabotage-without-profit” strategy. In Rosenfeld’s formulation, an attacker joins a target pool but discards any full blocks it finds there. Bag et al. later introduced a new BWH model that manipulates the allocation of mining power so that the attacker’s profitability threshold falls below that of a 51% attack. The idea is to depress the overall effective hash rate and thereby raise the attacker’s relative share to extract extra revenue. Concretely, the attacker splits its hash power into two parts: one mines honestly, while the other infiltrates the victim pool to share its public payouts but discards blocks found within that pool, reducing the pool’s income.

Laszka et al. [42] developed a game-theoretic model to explore strategic interactions among pools, identifying conditions under which mutual non-aggression is sustainable and when attacked pools become marginalized. Kwon et al. [43] introduced Fork-After-Withholding (FAW), which combines BWH with selfish mining: instead of discarding the withheld block, the attacker strategically releases it to create an intentional fork. By leveraging the advantages of both BWH and selfish mining, FAW can yield up to 2.5× the extra revenue of pure BWH. Gao et al. proposed the Power Adjusting (PAW) attack, which improves on FAW by dynamically reallocating hash power according to pool conditions, thereby lowering the hash-rate threshold required for a successful attack. Dong et al. [44] presented a hybrid self-sustaining attack, and Wang et al. [45] proposed a hybrid BWH strategy modeled via reinforcement learning to determine optimal actions across states.

To defend against BWH, multiple strategies have been proposed. Schrijvers, Bag, and Chen et al. [46,47,48] advocate modifying pool reward mechanisms, while Zhou et al. [49] suggest dynamically partitioning a pool into multiple sub-pools and placing suspicious miners into smaller groups, thereby breaking the conditions necessary for excess attacker profit.

2.4. Denial-of-Service Attack

Denial-of-service (DoS) attacks [50] aim to exhaust a target’s resources so that it cannot provide normal service. In blockchain networks, a typical DoS variant is the eclipse attack [51], in which an adversary controls all of a victim node’s external connections and isolates it from the rest of the network. Nayak et al. [15] proposed a hybrid strategy that combines selfish mining with eclipse attacks, using network isolation to increase the extra revenue of selfish mining. Mirkin et al. [52] introduced the Blockchain Denial-of-Service (BDoS) attack, which, unlike traditional network-layer DoS, exploits protocol-level incentives: by convincing others that the next block has already been found by the attacker, miners whose expected reward falls below their power cost voluntarily stop mining. Wang et al. [53,54] proposed SDoS, a combination of selfish mining and DoS, which depresses miners’ willingness to mine, increases the attacker’s relative hash share, and thus raises the fraction of blocks mined by the attacker.

To defend against DoS, several mechanisms have been proposed. Ilyas et al. [55] use deep neural networks to detect DDoS attacks, and Sousa et al. [56] apply machine learning-based detection for DoS. Raikwar et al. [57] propose a mitigation technique based on verifiable delay functions (VDFs).

3. Attack Strategy and Theoretical Modeling

In this section, we introduce two attack methods: Bribery–Stubborn Mining (BSbM) and its advanced variant, the Triple-fork-based Leading Hidden Bribery–Stubborn Mining (LHBSbM). We first elaborate on the attack framework and theoretical foundations of the BSbM strategy. Then, we present the LHBSbM strategy, which further amplifies the attacker’s profit advantage by carefully constructing triple-fork scenarios. The theoretical models presented in this paper provide a rigorous basis for understanding the mechanisms and potential profitability of these attacks.

In this study, we selected Markov chains as our primary analytical model due to their exceptional suitability for modeling the randomness and state dependency inherent in blockchain mining processes. The entire system can be precisely described as a set of discrete states corresponding to the attacker’s lead in public chain blocks, with transitions between states being probabilistic and dependent on the distribution of miners’ computational power. The primary objective of our theoretical analysis is to determine the long-term steady-state payoffs for each participant. For this task, the Markov chain framework serves as the standard and most effective tool.

3.1. Bribery–Stubborn Mining

3.1.1. Attack Strategy

The core principle of BSbM is to first induce a blockchain fork through strategic attacks and then attach appropriate bribery fees at the fork to incentivize target miners to assist the attacker in extending the attacker’s fork. This process aims to override blocks mined by honest miners and thereby increase the attacker’s mining rewards. In the BSbM attack model, miners in the system are classified into three categories: attackers, honest miners, and target miners. Attackers are malicious mining nodes that launch the attack by employing various strategies and adjusting bribery fees to create forks and attract target miners to persistently mine on the attacker’s private chain, thus securing mining rewards disproportionate to their computational power share. Honest miners are nodes that do not accept bribes from the attacker and follow the blockchain protocol, mining on the public chain according to the first valid block they receive based on network communication. Target miners are rational nodes in the system who, during a blockchain fork, can choose whether to accept the attacker’s bribe and mine on the attacker’s chain or reject it and continue mining on the honest public chain.

The BSbM strategy is characterized by several key parameters and state variables:

L: Represents the lead in block count of the attacker’s private chain over the current longest public chain.
F: Denotes the composition of the current blockchain fork. $F = 0$ indicates no fork; $F = 1$ indicates a fork between the honest miners and the attacker; $F = 2$ indicates a fork on the public chain where the honest public chain branch contains blocks mined by target miners.
R: Indicates the honest miner following rate, which represents the proportion of honest miners who choose to mine on the attacker’s chain after a fork occurs in the public chain.

The BSbM attack, for the purpose of this description, is primarily discussed in the context of competitive stubborn mining. The strategy can be adapted for lead stubborn and trail stubborn scenarios, with corresponding modifications to the attacker’s actions. At the beginning of the attack, the attacker accepts the honest public chain as both the private chain and the attacker’s chain. At this point, there is no fork in the public chain, and the block lead count is zero. Since the attacker will immediately release their block upon the honest miner publishing a new block, honest miners in the network receive both competing forks at approximately the same time. Therefore, the follow ratio R is set to 0.5, meaning that half of the honest miners mine on the honest public chain, while the other half mine on the attacker’s chain. The specific attacker strategy is illustrated in Table 1, where

c a s e

denotes the three types of miners finding a new block, and

s t a t e

represents the current state of the blockchain. B indicates that the target miner accepts the bribe and helps the attacker extend their private chain, while

! B

denotes the opposite.

3.1.2. State Transitions and Event Modeling

Figure 1 illustrates the first two states involved in the BSbM attack, showing how the attack begins and evolves as the attacker starts to build a private chain.

State 0

At this stage, no fork exists in the public blockchain, and all miners are mining on the public chain. The green, blue, and red arrows represent the mining locations of honest miners, target miners, and the attacker, respectively. The square blocks denote the public blockchain without any forks, as shown in Figure 1a. The events occurring under State 0 include the following:

Event 0-1: An honest miner finds and publishes a new block on the public chain, remaining in State 0.
Event 0-2: The attacker finds a new block on the public chain and adds it to the private chain, transitioning to State 1.
Event 0-3: The target miner finds and publishes a new block on the public chain, remaining in State 0.

State 1

At this stage, no fork exists on the public chain, and the attacker’s private chain leads by one block (

L = 1

). The dashed circle represents the unpublished block on the private chain. The attacker mines on the private chain, while honest and target miners continue mining on the public chain, as illustrated in Figure 1b. The events occurring under State 1 include the following:

Event 1-1: An honest miner finds and publishes a new block on the public chain; the attacker then publishes their private chain to create a fork, transitioning to State $0_{o}^{'}$ .
Event 1-2: The attacker mines a new block on the private chain, extending it, and transitions to State 2.
Event 1-3: The target miner finds and publishes a new block on the public chain; the attacker publishes their private chain to form a fork, transitioning to State $0_{b}^{'}$ .

State $0_{o}^{'}$

In this state, a fork exists between the honest public chain and the attacker’s chain, where the honest chain consists solely of blocks mined by honest miners. The two forked chains have equal length. The diamond and solid-circle lines represent the honest chain and attacker chain, respectively. The hollow blue arrows indicate the two scenarios where the target miner either rejects or accepts the bribe. The branch length n is a positive integer. The attacker mines on the attacker’s chain, while an r proportion of honest miners mine on the attacker’s chain, and the remaining

1 - r

proportion mine on the honest chain. The target miner chooses to mine on either chain based on the trade-off between accepting the bribe and mining profitability, as shown in Figure 2. The events occurring under State

0_{o}^{'}

include the following:

Event $0_{o}^{'}$ -1: An honest miner finds and publishes a block on the honest chain, transitioning to State 0.
Event $0_{o}^{'}$ -2: An honest miner finds and publishes a block on the attacker’s chain, transitioning to State 0.
Event $0_{o}^{'}$ -3: The attacker finds a block on the attacker’s chain, extends the private chain, and transitions to State $1_{o}^{'}$ .
Event $0_{o}^{'}$ -4: The target miner finds and publishes a block on either the attacker’s chain or the honest chain, transitioning to State 0.

State $0_{b}^{'}$

Here, the blockchain splits into two equal-length forks: the honest chain and the attacker’s chain. The honest chain contains blocks mined by the target miner, marked as triangles in the figure (these may represent any of the n blocks on the fork, with the first shown as an example). The attacker continues mining privately; some honest miners (r proportion) mine on the attacker’s chain, while others mine on the honest chain. The target miner mines on the honest chain, as depicted in Figure 3. In this state, the following events may occur:

An honest miner finds and publishes a block on the honest chain, returning the system to State 0.
An honest miner finds and publishes a block on the attacker’s chain, also leading back to State 0.
The attacker mines a new block on their private chain, advancing to State $1_{b}^{'}$ .
The target miner mines and publishes a block on the honest chain, returning to State 0.

State $1_{o}^{'}$

In this state, the public chain forks into an honest chain and an attacker’s chain, where the honest chain consists solely of blocks mined by honest miners, and the attacker’s private chain leads by one block. The attacker mines on the private chain, while an r proportion of honest miners mine on the attacker’s chain, and the remaining

1 - r

proportion mine on the honest chain. The target miner chooses to mine on either chain based on the trade-off between accepting the bribe and mining profitability, as illustrated in Figure 4. The events occurring under State

1_{o}^{'}

include the following:

Event $1_{o}^{'}$ -1: An honest miner finds and publishes a block on the honest chain; the attacker then publishes their hidden private blocks, transitioning to State $0_{o}^{'}$ .
Event $1_{o}^{'}$ -2: An honest miner finds and publishes a block on the attacker’s chain; the attacker publishes their hidden private blocks, transitioning to State $0_{o}^{'}$ .
Event $1_{o}^{'}$ -3: The attacker mines a new block on the private chain, extending it, and transitions to State 2.
Event $1_{o}^{'}$ -4: The target miner finds and publishes a block on either the attacker’s chain or the honest chain; the attacker publishes their hidden private blocks, transitioning to State $0_{b}^{'}$ .

State $1_{b}^{'}$

The blockchain currently forks into two branches: the honest chain, which includes blocks mined by the target miner, and the attacker’s private chain, which holds a one-block lead. Mining activities proceed with the attacker working on their private chain, while a fraction r of honest miners support the attacker’s branch; the rest mine on the honest chain. The target miner mines exclusively on the honest chain, as depicted in Figure 5.

Possible events in this state are as follows:

Event $1_{b}^{'}$ -1: A block is mined and published by an honest miner on the honest chain, prompting the attacker to reveal their private blocks and transition to State $0_{b}^{'}$ .
Event $1_{b}^{'}$ -2: A block is mined and published by an honest miner on the attacker’s chain, causing the attacker to disclose their private chain and move to State $0_{o}^{'}$ .
Event $1_{b}^{'}$ -3: The attacker mines a new block on their private chain, thereby advancing to State 2.
Event $1_{b}^{'}$ -4: The target miner finds and publishes a block on the honest chain, which leads the attacker to reveal their private blocks and return to State $0_{b}^{'}$ .

State 2

The attacker’s private chain leads the public chain by two blocks, with the fork length m taking the values 0 or n. Mining continues with the attacker working on their private chain, while both honest and target miners mine on the public chain. When the honest chain contains blocks mined by the target miner, the target miner mines specifically on the honest chain; otherwise, they mine on the public chain. These situations are depicted in Figure 6.

Several events can occur under State 2:

Event 2-1: A block is mined and published by an honest miner on the public chain, prompting the attacker to reveal two private hidden blocks and revert to State 0.
Event 2-2: The attacker mines a new block on their private chain, extending it and moving the system into State 3.
Event 2-3: The target miner mines and publishes a block on the public chain, leading the attacker to reveal two private blocks and return to State 0.

State L ( $L \geq 3$ )

At this time, the attacker leads by L blocks. The attacker mines on the private chain, while the target miner and honest miners mine on the public chain. If a fork exists and the honest chain contains blocks mined by the target miner, the target miner mines on the attacker’s chain. If a fork exists but the honest chain does not contain blocks mined by the target miner, the target miner mines on the public chain. These are shown in Figure 7.

The events occurring under State L include the following:

Event L-1: An honest miner finds and publishes a block on the public chain, and the attacker reveals one hidden block from the private chain and transitions to State $L - 1$ .
Event L-2: The attacker mines a new block on the private chain, adds it to the private chain, and transitions to State $L + 1$ .
Event L-3: The target miner finds and publishes a block on the public chain, and the attacker reveals one hidden block from the private chain and transitions to State $L - 1$ .

3.1.3. Theoretical Analysis

We first analyze the mining rewards of miners within the BSbM attack framework and then proceed to model and analyze the state transitions of the blockchain system using a Markov chain.

Attacker’s Reward

The attacker obtains a block reward when event 1-2,

1_{o}^{'}

-3,

1_{b}^{'}

-3, or L-2 (

L \geq 2

) occurs. For events 0-2 and

0_{o}^{'}

-3, if the target miner accepts the bribe, the attacker has a certain probability of obtaining a block reward as follows:

P_{a}^{e} = α + \frac{(r o + β) o + (o α) (α + r o + β)}{1 - α (1 - r) o} + \frac{β r o + β α (α + r o)}{1 - α [(1 - r) o + β]}

(1)

Here, the parameters

α

,

β

, and o represent the mining power of the attacker, the target miner, and the honest miners, respectively. The variable r denotes the proportion of honest miners who follow the attacker’s chain. If the target miner rejects the bribe, the probability that the attacker obtains a block reward is given by

P_{b}^{e} = α + \frac{(o + β) r o + (o + β) α (α + r o)}{1 - α [(1 - r) o + β]}

(2)

When event

0_{b}^{'}

-3 occurs, the attacker has a probability of

P_{b}^{e}

of obtaining a block reward.

Target Miner’s Reward

The target miner obtains a block reward when event 0-3,

0_{o}^{'}

-4, or

0_{b}^{'}

-4 occurs. For events 1-3,

1_{o}^{'}

-4, and

1_{b}^{'}

-4, the target miner has the following probability of obtaining a block reward:

P_{c}^{e} = \frac{(1 - r o) + β}{1 - α [(1 - r) o + β]}

(3)

Honest Miners’ Reward

Honest miners obtain a block reward when event 0-1,

0_{o}^{'}

-1,

0_{o}^{'}

-2,

0_{b}^{'}

-1, or

0_{b}^{'}

-2 occurs. For events 1-1,

1_{o}^{'}

-1,

1_{o}^{'}

-2, and

1_{b}^{'}

-2, if the target miner accepts the bribe, the honest miners have the following probability of obtaining a block reward:

P_{d}^{e} = \frac{1 - r}{1 - α (1 - r) o}

(4)

If the target miner rejects the bribe, the probability that honest miners obtain a block reward is given by

P_{e}^{e} = \frac{(1 - r) o + β}{1 - α [(1 - r) o + β]}

(5)

When event

1_{b}^{'}

-1 occurs, honest miners have a probability of

P_{e}^{e}

of obtaining a block reward.

The Markov chain model representing the state transitions of the blockchain system under the BSbM attack is illustrated in Figure 8, where each state corresponds to the blockchain system states described in the previous section.

In the competitive BSbM mining model,

p_{L}^{E}

(

0 \leq L \leq + \infty

, where L is an integer) denotes the probability of states 0 to L, and

p_{L_{m}^{'}}^{E}

(with

L = 0

or 1, and

m = o

or b) represents the probability of states

0_{o}^{'}

,

0_{b}^{'}

,

1_{o}^{'}

, and

1_{b}^{'}

. The state probabilities of the Markov model depicted in Figure 8 are given as follows.

\{\begin{matrix} p_{0}^{E} = (1 - α) p_{0}^{E} + (1 - α) p_{0_{o}}^{' E} + (1 - α) p_{0_{b}}^{' E} + (1 - α) p_{2}^{E} \\ p_{0_{o}}^{' E} = (1 - α - β) p_{1}^{E} + (1 - α - β) p_{1_{o}}^{' E} + r (1 - α - β) p_{1_{b}}^{' E} \\ p_{0_{b}}^{' E} = β p_{1}^{E} + β p_{1_{o}}^{' E} + (β + (1 - r) (1 - α - β)) p_{1_{b}}^{' E} \\ p_{1}^{E} = α p_{0}^{E} \\ p_{1_{o}}^{' E} = α p_{0_{o}}^{' E} \\ p_{1_{b}}^{' E} = α p_{0_{b}}^{' E} \\ p_{2}^{E} = α p_{1}^{E} + α p_{1_{o}}^{' E} + α p_{1_{b}}^{' E} + (1 - α) p_{3}^{E} \\ p_{L}^{E} = α p_{L - 1}^{E} + (1 - α) p_{L + 1}^{E}, L \geq 3 \\ 1 = \sum_{k = 0}^{+ \infty} p_{k}^{E} + p_{0_{o}}^{' E} + p_{0_{b}}^{' E} + p_{1_{o}}^{' E} + p_{1_{b}}^{' E} \end{matrix}

(6)

Further calculations yield the explicit formulas for the state probabilities as follows.

\{\begin{matrix} p_{2}^{E} = \frac{1}{\frac{1 - α}{α} + \frac{1 - α}{α^{2}} + \frac{1 - α}{1 - 2 α}} \\ p_{0}^{E} = \frac{1 - 2 α + 2 α^{2} - α^{3}}{α^{2}} p_{2}^{E} \\ p_{1}^{E} = \frac{1 - 2 α + 2 α^{2} - α^{3}}{α} p_{2}^{E} \\ p_{0_{o}}^{E} = \frac{(r (1 - α - β) (1 - α^{2}) + (1 - α - β)) \frac{1 - 2 α + 2 α^{2} - α^{3}}{α}}{r (1 - α - β) α + 1 - α + α^{2} + α β} p_{2}^{E} \\ p_{1_{o}}^{E} = \frac{(r (1 - α - β) (1 - α^{2}) + (1 - α - β)) \frac{1 - 2 α + 2 α^{2} - α^{3}}{α}}{r (1 - α - β) + (1 - α + α^{2} + α β) \frac{1}{α}} p_{2}^{E} \\ p_{0_{b}}^{E} = (\frac{{(1 - α)}^{2}}{α} - \frac{(r (1 - α - β) (1 - α^{2}) + (1 - α - β)) \frac{1 - 2 α + 2 α^{2} - α^{3}}{α}}{r (1 - α - β) α + 1 - α + α^{2} + α β}) p_{2}^{E} \\ p_{1_{b}}^{E} = ({(1 - α)}^{2} - \frac{(r (1 - α - β) (1 - α^{2}) + (1 - α - β)) \frac{1 - 2 α + 2 α^{2} - α^{3}}{α}}{r (1 - α - β) + (1 - α + α^{2} + α β) \frac{1}{α}}) p_{2}^{E} \\ p_{L}^{E} = {(\frac{α}{1 - α})}^{k - 2} p_{2}^{E}, when L \geq 3 \end{matrix}

(7)

Based on the aforementioned model, the miners’ rewards can be analyzed to determine the target miner’s optimal mining strategy and the attacker’s range of possible rewards. The following conclusions can be drawn:

Conclusion 1. When the attacker launches the BSbM attack, the target miner achieves higher mining rewards by accepting the attacker’s bribe and mining on the attacker’s chain during a blockchain fork, compared to rejecting the bribe and mining on the honest public chain. In other words, choosing to accept the bribe and assist the attacker in mining during a fork is the optimal mining strategy for the target miner.

Proof.

If the target miner chooses to accept the attacker’s bribe and assist in mining, the target miner’s reward

R_{b}^{E B}

is given by

R_{b}^{E B} = p_{0}^{E} β + p_{0_{o}^{'}}^{E} β + p_{0_{b}^{'}}^{E} β + (p_{1}^{E} + p_{1_{o}^{'}}^{E} + p_{1_{b}^{'}}^{E}) β P_{c}^{e} + ε R_{a}^{E}

(8)

If the target miner chooses to reject the attacker’s bribe, the reward

R_{b}^{E B^{'}}

is

R_{b}^{E B^{'}} = p_{0}^{E} β + p_{0_{o}^{'}}^{E} β + p_{0_{b}^{'}}^{E} β + (p_{1}^{E} + p_{1_{o}^{'}}^{E} + p_{1_{b}^{'}}^{E}) β P_{c}^{e}

(9)

Combining the above equations, we have

R_{b}^{E B} - R_{b}^{E B^{'}} = ε R_{a}^{E}

(10)

Since

ε R_{a}^{E} \geq 0

, where

ε R_{a}^{E}

is the bribery payment from the attacker, it follows that

R_{b}^{E B} \geq R_{b}^{E B^{'}}

. □

Conclusion 2. In the BSbM attack, there exists an appropriate bribery factor

ε

such that the attacker achieves higher mining rewards when the target miner accepts the bribe compared to rejecting it. Moreover, the attacker’s maximum reward is attained when

ε = 0

.

Proof.

If the target miner chooses to accept the attacker’s bribe and assist in mining, the attacker’s reward

R_{a}^{E}

is given by

R_{a}^{E} = p_{1}^{E} α + p_{1_{o}^{'}}^{E} α + p_{1_{b}^{'}}^{E} α + \sum_{L = 2}^{+ \infty} p_{L}^{E} α + (p_{0}^{E} α + p_{0_{o}^{'}}^{E} α) P_{a}^{e} + p_{0_{b}^{'}}^{E} α P_{b}^{e}

(11)

Considering the bribery payment

ε R_{a}^{E}

the attacker must pay, the attacker’s total reward

R_{a}^{E B}

is

R_{a}^{E B} = (1 - ε) R_{a}^{E}

(12)

If the target miner chooses to reject the attacker’s bribe, the attacker’s reward

R_{a}^{E B^{'}}

is

R_{a}^{E B^{'}} = p_{1}^{E} α + p_{1_{o}^{'}}^{E} α + p_{1_{b}^{'}}^{E} α + \sum_{L = 2}^{+ \infty} p_{L}^{E} α + (p_{0}^{E} α + p_{0_{o}^{'}}^{E} α) P_{b}^{e} + p_{0_{b}^{'}}^{E} α P_{b}^{e}

(13)

Combining the above equations, we have

R_{a}^{E B} - R_{a}^{E B^{'}} = (p_{0}^{E} α + p_{0_{o}^{'}}^{E} α) (P_{a}^{e} - P_{b}^{e}) - ε R_{a}^{E}

(14)

Since

P_{a}^{e} > P_{b}^{e}

, it follows that

(p_{0}^{E} α + p_{0_{o}^{'}}^{E} α) (P_{a}^{e} - P_{b}^{e}) > 0,

which leads to the critical condition:

0 \leq ε \leq \frac{(p_{0}^{E} + p_{0_{o}^{'}}^{E}) α (P_{a}^{e} - P_{b}^{e})}{R_{a}^{E}} \Rightarrow R_{a}^{E B} \geq R_{a}^{E B^{'}}

(15)

The maximum reward when

ε = 0

is

(p_{0}^{E} + p_{0_{o}}^{E}) α (P_{a}^{e} - P_{b}^{e})

. □

3.2. Triple-Fork-Based Leading Hidden BSbM

In Proof-of-Work-based blockchain systems, miners’ rewards are determined by their effective block occupancy rate. Attacks based on selfish mining principles, including BSbM, increase the attacker’s effective block occupancy by first creating a fork and then discarding blocks on the public chain. However, this attack approach has limitations when considering fork structures: under the same block height, it can only discard one block, resulting in a maximum effective block occupancy of

α / (1 - α)

for the attacker. In actual mining, the blockchain may experience three forks, which potentially allow the discarding of more blocks at the same block height, thus increasing the attacker’s effective block occupancy. Therefore, this paper further proposes a Triple-fork-based Leading Hidden BSbM (LHBSbM) attack, which creates a triple fork to extend the attacker’s private chain lead time, thereby increasing the attacker’s block occupancy and enhancing mining rewards.

3.2.1. Attack Strategy

When the blockchain system is in State L (

L \geq 3

), if an honest miner successfully mines a new block, the attacker immediately publishes one block from their private chain. This attack strategy causes the attacker’s chain and the honest public chain to form a fork of equal length, prompting miners to extend one of the two public chain branches. In the LHBSbM attack strategy, this state is referred to as the leading state, where the attacker does not reveal private chain blocks upon honest miners’ successful mining, but instead enters a hidden state. In the hidden state, if the next new block is mined by the target miner on the attacker’s chain, the blockchain system forms a triple fork. Under this triple-fork state, the attacker’s private chain can discard two blocks at the same block height, thereby increasing the attacker’s effective block occupancy.

To further improve the attacker’s effective block occupancy, this paper proposes the LHBSbM (Leading Hidden BSbM) attack strategy based on a triple fork. When in the initial state or L < 2, the LHBSbM strategy follows the same logic as BSbM. The complete strategy is detailed in Table 2.

3.2.2. State Transitions and Event Modeling

The blockchain states and events under the LHBSbM attack are described as follows.

State L ( $L \geq 3, L \in Z^{+}$ )

As shown in Figure 9, when the mining time is N, the blockchain is in a leading state. The attacker’s private chain leads both the honest chain and the attacker’s public fork by L blocks. The attacker mines on the private chain, the target miner mines on the attacker’s fork, and honest miners split their mining power: a fraction r mines on the attacker’s fork, and the remaining

1 - r

mines on the honest chain.

If Event A occurs, at time

N + 1

, the blockchain enters a hidden state. In this state, the attacker’s private chain leads the honest chain and the attacker’s fork by

L - 1

and L blocks, respectively. The attacker continues to mine on the private chain. The target miner chooses to mine on either the attacker’s fork or the honest chain based on whether the bribe is accepted and the comparative expected reward. Honest miners mine on the honest chain.

If Event B occurs, at time

N + 2

, the blockchain enters a triple-fork state. Here, the attacker’s private chain leads both the honest chain and the attacker’s fork by

L - 1

blocks. The attacker continues to mine on the private chain, while the target miner mines on the attacker’s fork and honest miners mine on the honest chain.

Event A: An honest miner finds and publishes a block on the honest chain. The attacker then hides the private chain, transitioning the system to the hidden state.
Event B: The target miner accepts the bribe from the attacker and mines a block on the attacker’s fork, resulting in a triple-fork state.

In the triple-fork state, the worst-case scenario is that the attacker immediately publishes the private chain to eliminate two blocks (eliminating at least one more block than BSbM). The ideal scenario is shown in Figure 10, where Event A and Event B occur alternately. The private chain leads the honest public chain and the attacker chain by one and two blocks, respectively. When the attacker publishes the private chain, the number of blocks that can be eliminated is approximately 2L (doubling the number of eliminated blocks compared to BSbM).

3.2.3. Theoretical Analysis

In the Bitcoin system, a new block is generated approximately every 10 min on average. Under the assumption of constant mining power, the mining time is proportional to the probability of successfully mining a block. Therefore, the following conclusion can be drawn regarding the private chain lead time in the LHBSbM attack:

Conclusion 1. Compared to the classical double-fork attack, the LHBSbM attack achieves a longer private chain lead time, meaning the attacker has more time to mine on the private chain. This implies that under equal mining power, the attacker has a greater chance to extend the private chain and enlarge the lead, thereby increasing mining revenue.

Proof.

In the LHBSbM attack, when the system state is greater than 2 and a triple fork is successfully created, the attacker can override up to two public chain blocks with each private chain block released—one mined by honest miners and the other by target miners. When publishing N private chain blocks, the attacker can maintain a lead time equivalent to

2 N - 3

block generation intervals. If the triple-fork construction fails, the LHBSbM attack degenerates to the ordinary double-fork scenario, where the minimum lead time for N private blocks is

N - 1

block intervals (the same as the classical double-fork attack). In other states, the attacker follows the same strategy as in the double-fork attack, maintaining the same lead time. Therefore, the total private chain lead time generated in LHBSbM is longer. □

Conclusion 2. Compared with the maximum effective block occupancy rate

\frac{α}{1 - α}

under the double-fork scenario, the LHBSbM attack achieves a higher maximum effective block occupancy rate. If the attacker’s mining power

α

exceeds one-third of the total system power, the maximum effective block occupancy rate reaches 1. If the attacker’s mining power

α

is less than one-third, then when the target miner’s power

β

exceeds that of the honest miners, the maximum effective block occupancy rate is

\frac{α}{β}

; otherwise, it is

\frac{α}{1 - α - β}

.

Proof.

Let

A^{T}

,

B^{T}

, and

H^{T}

denote the numbers of blocks mined by the attacker, target miners, and honest miners, respectively, during time T. The

A^{T}

blocks mined by the attacker can cover two public chains of lengths

A^{T} - 1

and

A^{T} - 2

at most. Let

B^{T^{'}}

and

H^{T^{'}}

be the numbers of target miner and honest miner blocks not covered by the attacker, respectively. Then we have

B^{T^{'}} = B^{T} - A^{T} - 1

(16)

H^{T^{'}} = H^{T} - A^{T} - 2

(17)

When the uncovered blocks mined by target miners and honest miners overlap, if

B^{T^{'}} < H^{T^{'}}

, the minimal effective block count after coverage is

H^{T^{'}}

; if

B^{T^{'}} > H^{T^{'}}

, it is

B^{T^{'}}

. Thus, the attacker’s effective block rate

v r

is

v r = \{\begin{matrix} \frac{A^{T}}{A^{T} + B^{T^{'}}}, & when B^{T^{'}} > H^{T^{'}} \\ \frac{A^{T}}{A^{T} + H^{T^{'}}}, & when B^{T^{'}} < H^{T^{'}} \end{matrix}

(18)

According to the law of large numbers, we have

lim_{T \to \infty} A^{T} = α, lim_{T \to \infty} B^{T} = β, lim_{T \to \infty} H^{T} = 1 - α - β

(19)

If the attacker’s mining power

α

exceeds one-third of the total system power, and both target and honest miners have less than one-third each, then

B^{T^{'}} < 0

and

H^{T^{'}} < 0

. Setting

B^{T^{'}} = 0

and

H^{T^{'}} = 0

yields

v r = lim_{T \to \infty} \{\begin{matrix} \frac{A^{T}}{A^{T} + B^{T^{'}}} = 1, & when B^{T^{'}} > H^{T^{'}} \\ \frac{A^{T}}{A^{T} + H^{T^{'}}} = 1, & when B^{T^{'}} < H^{T^{'}} \end{matrix}

(20)

When the attacker’s mining power is less than one-third, combining the above gives

v r = lim_{T \to \infty} \{\begin{matrix} \frac{A^{T}}{A^{T} + B^{T^{'}}} = \frac{α}{β}, & when B^{T^{'}} > H^{T^{'}} \\ \frac{A^{T}}{A^{T} + H^{T^{'}}} = \frac{α}{1 - α - β}, & when B^{T^{'}} < H^{T^{'}} \end{matrix}

(21)

Considering that the target miner’s mining power is usually less than that of honest miners in practice and that, when

α > \frac{1}{3}

, 1 as the limit does not effectively reflect the positive correlation between mining power and reward, we therefore use

\frac{α}{1 - α - β}

to represent the maximum effective block occupancy rate of the attacker. □

4. Simulation and Evaluation

To evaluate the effectiveness of the proposed BSbM and LHBSbM attack strategies, we conducted simulation experiments that emulated the block generation and fork resolution process in a Proof-of-Work blockchain. The simulation considered three types of miners: the attacker (hash power

α

), the target miner (hash power

β

), and honest miners (hash power

o = 1 - α - β

). All experiments were conducted on a physical machine running Ubuntu 23.10. Using Python 3.10, we simulated a blockchain with 1,000,000 blocks on a PC equipped with an Intel Xeon W-2275 @ 3.30 GHz CPU and 125 GB RAM for experimentation and analysis.

4.1. Evaluation of BSbM Attack

This subsection presents the simulation analysis of the Bribery–Stubborn Mining (BSbM) attack. In these experiments, the target miner’s hash power

β

is set to 0.1, and the bribery factor

ϵ

is 0.02. Mining revenues for all miners are normalized for comparison.

4.1.1. Target Miner’s Revenue

We first analyze the optimal mining strategy for the target miner when a fork occurs in the BSbM attack.

Figure 11 shows the percentage of additional mining revenue that the target miner gains by accepting the attacker’s bribe compared to refusing it. The simulation results align with the theoretical analysis, confirming that accepting the bribe and mining on the attacker’s chain always provides higher revenue for the target miner. This additional revenue increases as the attacker’s hash power

α

grows due to the larger rewards and bribes that the attacker can afford.

Figure 12 further details the target miner’s extra revenue under different honest miner follow rates (

r \in {0, 0.5, 1}

), comparing bribe acceptance (solid lines) and refusal (dashed lines). The results consistently demonstrate that accepting the bribe is more profitable. A higher r leads to reduced gains for the target miner. Notably, there exists a critical point below which accepting the bribe may still result in a net loss, but less than the loss from refusing it, thus confirming that accepting the bribe remains the optimal strategy.

4.1.2. Attacker’s Revenue

Next, we evaluate the attacker’s revenue under the BSbM strategy. Figure 13 illustrates the attacker’s additional revenue compared to honest mining when the target miner refuses the bribe. Even without cooperation, an attacker with sufficient hash power

α

and a favorable follow ratio r can still outperform honest mining.

Conversely, Figure 14 shows the attacker’s extra revenue when the target miner accepts the bribe. In this case, the required threshold for

α

to make the attack profitable is lower. For instance, with

r = 0

, profitability begins when

α \approx 0.34

.

Figure 15 compares attack revenue with and without bribery for

ϵ = 0.02

under different r values. The attacker consistently benefits from offering bribes (solid lines above dashed). However, as

α

increases, the marginal benefit of bribery decreases. When the attacker’s hash power is large, the bribe may become unnecessary.

4.2. Evaluation of LHBSbM Optimization

This subsection evaluates the Triple-fork-based Leading Hidden BSbM (LHBSbM) attack, which is an optimization of BSbM. LHBSbM aims to further increase the attacker’s effective block share and private chain mining lead time by creating a blockchain triple fork.

4.2.1. Comparison of Discarded Blocks (Extra Mining Time)

The first experiment compares the average number of discarded blocks (which translates to extra mining time for the attacker) generated by LHBSbM versus the competitive BSbM strategy. For this simulation, the attacker’s hash power

α

is set to 0.4, the target miner’s hash power

β

to 0.1, and the bribery factor

ϵ

to 0.02. The analysis focuses on scenarios where the initial state is greater than or equal to 2, as these are conditions under which LHBSbM can initiate a triple fork. Each data point is an average over 1,000,000 mining cycles, measuring the extra mining time gained from the initial state until the system returns to State 0.

Figure 16 presents the results, where the red line denotes LHBSbM and the blue line denotes competitive BSbM. Across all simulated initial states (from 2 to 48), LHBSbM consistently results in a higher average number of discarded blocks than BSbM. Furthermore, the difference in the average number of discarded blocks between the two strategies widens as the initial state (attacker’s lead) increases, indicating that LHBSbM’s advantage in gaining extra mining time becomes more pronounced with a larger initial lead.

4.2.2. Optimized Attack Revenue

The second experiment is designed to compare the attack revenue of LHBSbM against BSbM more directly. In this setup, the attacker employs the LHBSbM strategy when their private chain lead is greater than two blocks and reverts to the BSbM strategy in other states. It is assumed that rational target miners will only assist the attacker if the attacker’s hash power

α

exceeds 0.35; otherwise, they behave as honest miners.

Figure 17 shows the attack revenue, where E-LHBSbM (solid lines) represents the revenue from BSbM optimized with the LHBSbM strategy, and E-BSbM (dashed lines) represents the revenue from the original BSbM attack, for r values of 0, 0.5, and 1. The results clearly indicate that the BSbM attack, when optimized with the LHBSbM triple-fork strategy, yields higher revenue for the attacker compared to the original BSbM attack across the tested r values.

4.3. Summary

The simulation results validate the theoretical analyses of both the BSbM and LHBSbM attack strategies. For the BSbM attack, it is demonstrated that accepting the attacker’s bribe is the optimal strategy for a rational target miner, providing them with higher potential revenue compared to refusing the bribe or honest mining. The attacker, in turn, can achieve greater revenue than honest mining by employing BSbM, with the success threshold being lower when the target miner cooperates.

The LHBSbM optimization further enhances the attacker’s capabilities. By creating a triple fork, LHBSbM allows the attacker to secure more extra mining time (by orphaning more blocks) and achieve higher overall attack revenue compared to the standard BSbM approach. These findings underscore the increased risks posed by such advanced selfish mining variants to Proof-of-Work blockchains.

5. Discussion

This part reviews our work, discusses potential defenses against such attacks, and outlines directions for future research.

5.1. Review of Our Work

This study investigates symmetry-breaking attacks in Proof-of-Work (PoW) blockchains perpetrated by rational miners and proposes two new and more sophisticated mining attack strategies. First, we introduce Bribery–Stubborn Mining (BSbM), which innovatively combines economic bribery with the nonconceding tactics of stubborn mining to incentivize targeted miners to collaborate during forks. Compared with bribery–selfish mining, BSbM leverages stubborn mining’s nonconceding behavior to prolong fork competition, thereby making targeted bribes more effective.

Building on this, and considering the threat posed by more complex fork scenarios, we design Leading Hidden Bribery–Stubborn Mining (LHBSbM). LHBSbM uses concealed leading actions to construct a triple fork, more efficiently orphaning blocks mined by honest miners and bribed miners. Relative to BSbM, LHBSbM increases the attacker’s payoff ceiling by raising the honest block orphaning rate and extending the lead of the attacker’s private chain.

5.2. Defensive Measures

The purpose of studying these attacks is to anticipate possible adversarial behaviors and develop stronger defenses to protect the network. For hybrid attacks like BSbM and LHBSbM that combine economic incentives with protocol manipulation, potential countermeasures can be explored at the protocol level, the network level, and within economic and game-theoretic models.

Protocol-Level Modifications

Incentive restructuring: Introduce penalty mechanisms that detect anomalous timestamps and penalize delayed block publication, increasing the cost of stubborn withholding.
Weighting optimization: Develop time-based weighting frameworks so the protocol deprioritizes blocks that appear to have been strategically withheld.

Network-Level Monitoring

Anomaly detection: Deploy monitoring that flags abnormal activity by analyzing indicators such as transaction volume, serial numbers, and mining cost.
Pool behavior identification: Detect dishonest pools by checking whether the claimed previous-block hash has already been published on-chain.
Countermeasures for stealthy variants: For attacks that suppress orphan blocks to evade detection, incorporate deeper behavioral pattern analysis.

Economic and Game Theory Countermeasures

Anti-bribery via smart contracts:Use contract-based mechanisms to counter bribery that induces rational miners to deviate from the honest protocol.

5.3. Potential Directions for Future Research

Drawing on prior work and current AI techniques, we outline several promising directions for advancing blockchain security:

AI for strategy discovery and dynamic defense: Use generative AI (e.g., ChatGPT [58], DeepSeek [59], Google Gemini [60]) to propose priors for composite attack and defense strategies; build AI-based dynamic defense systems that analyze miner behavior and fork signals in real time and recommend parameter adjustments such as confirmation depth, tie-breaking rules, and timestamp penalties.
Multi-attacker modeling: Extend the model from a single attacker to environments with multiple competing or collaborating attackers; use evolutionary and Markov games to characterize bribery target selection, bidding, and payoff allocation equilibria and analyze their effects on stability and thresholds; and integrate contract-based anti-bribery and study incentive compatibility and anti-collusion conditions.
Model refinement with decision processes: Employ Markov decision processes to model bribery-based selfish mining, analyze dynamic rewards, and optimize attacker behavior.
Extended multi-attacker equilibria:Further generalize to competitive or cooperative multi-attacker settings using evolutionary, stochastic, and Markov games to characterize equilibria for bribery target selection and payoff allocation and to analyze thresholds, stability, and convergence.

6. Conclusions

This paper investigated strategic attacks on PoW blockchains and introduced two hybrid mining strategies: Bribery–Stubborn Mining (BSbM) and its advanced variant, Leading Hidden Bribery–Stubborn Mining (LHBSbM).

BSbM combines economic bribery with stubborn mining behavior to create an incentive-compatible setting in which a rational target miner optimally accepts a bribe and assists the attacker during forks, increasing the likelihood of disproportionate rewards relative to the attacker’s hash share. Building on this, LHBSbM orchestrates carefully timed delayed publication to construct triple-fork states; our analysis and simulations show that it can simultaneously orphan multiple honest blocks, significantly extend the lead of the attacker’s private chain, and yield higher overall revenue than traditional double-fork attacks.

These results highlight the growing sophistication of mining attacks and underscore that PoW consensus remains vulnerable when economic incentives and protocol-level tactics interact. These results underscore the need for defenses at the protocol, network, and pool levels and lay the groundwork for strengthening PoW systems against hybrid bribery and stubborn attacks.

Author Contributions

Conceptualization, W.L. (Weijie Li) and Y.W.; methodology, S.J. and W.L. (Weijie Li); software, B.N. and W.L. (Weipeng Liang); validation, W.L. (Weijie Li), S.J. and Y.W.; formal analysis, S.J.; investigation, B.N.; resources, W.L. (Weipeng Liang); data curation, B.N.; writing—original draft preparation, W.L. (Weijie Li); writing—review and editing, S.J. and Y.W.; visualization, W.L. (Weijie Li); supervision, Y.W.; project administration, Y.W.; funding acquisition, Y.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (NSFC), grant no. U21A20463; and by the Science and Technology Program of Guangzhou, grant no. 2024A03J0403.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

Author Weipeng Liang was employed by the company China Telecom Corporation Limited Jiangmen Branch. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflicts of interest.

References

Nakamoto, S. Bitcoin: A Peer-to-Peer Electronic Cash System. 2008. Available online: https://ssrn.com/abstract=3440802 (accessed on 16 September 2025).
Zhang, R.; Xue, R.; Liu, L. Security and privacy on blockchain. ACM Comput. Surv. (CSUR) 2019, 52, 51. [Google Scholar] [CrossRef]
Stephen, R.; Alex, A. A review on blockchain security. IOP Conf. Ser. Mater. Sci. Eng. 2018, 396, 012030. [Google Scholar] [CrossRef]
Guo, H.; Yu, X. A survey on blockchain technology and its security. Blockchain Res. Appl. 2022, 3, 100067. [Google Scholar] [CrossRef]
Li, X.; Jiang, P.; Chen, T.; Luo, X.; Wen, Q. A survey on the security of blockchain systems. Future Gener. Comput. Syst. 2020, 107, 841–853. [Google Scholar] [CrossRef]
Wu, H.; Yao, Q.; Liu, Z.; Huang, B.; Zhuang, Y.; Tang, H.; Liu, E. Blockchain for finance: A survey. IET Blockchain 2024, 4, 101–123. [Google Scholar] [CrossRef]
Zhou, W.; Lyu, D.; Li, X. Blockchain security based on cryptography: A review. arXiv 2025, arXiv:2508.01280. [Google Scholar] [CrossRef]
Singh, S.; Hosen, A.S.; Yoon, B. Blockchain security attacks, challenges, and solutions for the future distributed iot network. IEEE Access 2021, 9, 13938–13959. [Google Scholar] [CrossRef]
Zuniga, E.; Gonzalez, M. The Environmental Cost of Bitcoin and PoW Cryptocurrencies: Analyzing Their Energy Consumption and Carbon Emissions. 2024. Available online: https://www.researchgate.net/profile/Elizabeth-Zuniga-3/publication/377416277_The_Environmental_Cost_of_Bitcoin_and_PoW_Cryptocurrencies_Analyzing_their_Energy_Consumption_and_Carbon_Emissions/links/65a6522e5582153a6828a1be/The-Environmental-Cost-of-Bitcoin-and-PoW-Cryptocurrencies-Analyzing-their-Energy-Consumption-and-Carbon-Emissions.pdf (accessed on 16 September 2025).
Cao, B.; Zhang, Z.; Feng, D.; Zhang, S.; Zhang, L.; Peng, M.; Li, Y. Performance analysis and comparison of PoW, PoS and DAG based blockchains. Digit. Commun. Netw. 2020, 6, 480–485. [Google Scholar] [CrossRef]
Alharby, M.; Alssaiari, A.; Alateef, S.; Thomas, N.; Moorsel, A.v. A quantitative analysis of the security of PoW-based blockchains. Clust. Comput. 2024, 27, 14113–14130. [Google Scholar] [CrossRef]
Aggarwal, S.; Kumar, N. Attacks on blockchain. In Advances in Computers; Elsevier: Amsterdam, The Netherlands, 2021; Volume 121, pp. 399–410. [Google Scholar]
Meybodi, M.A.; Goharshady, A.K.; Hooshmandasl, M.R.; Shakiba, A. Optimal mining: Maximizing bitcoin miners’ revenues from transaction fees. In Proceedings of the 2022 IEEE International Conference on Blockchain (Blockchain), Espoo, Finland, 22–25 August 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 266–273. [Google Scholar]
Eyal, I.; Sirer, E.G. Majority is not enough: Bitcoin mining is vulnerable. Commun. ACM 2018, 61, 95–102. [Google Scholar] [CrossRef]
Nayak, K.; Kumar, S.; Miller, A.; Shi, E. Stubborn mining: Generalizing selfish mining and combining with an eclipse attack. In Proceedings of the 2016 IEEE European Symposium on Security and Privacy (EuroS&P), Saarbruecken, Germany, 21–24 March 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 305–320. [Google Scholar]
Bag, S.; Ruj, S.; Sakurai, K. Bitcoin block withholding attack: Analysis and mitigation. IEEE Trans. Inf. Forensics Secur. 2016, 12, 1967–1978. [Google Scholar] [CrossRef]
Gao, S.; Li, Z.; Peng, Z.; Xiao, B. Power adjusting and bribery racing: Novel mining attacks in the bitcoin system. In Proceedings of the 2019 ACM SIGSAC Conference on Computer and Communications Security, London, UK, 11–15 November 2019; pp. 833–850. [Google Scholar]
Liang, W.; Jiang, S.; Li, W.; Wang, Y. Leading Hide Bribery Stubborn Mining Attack. In Proceedings of the 6th ACM International Symposium on Blockchain and Secure Critical Infrastructure, Singapore, 1–5 July 2024; pp. 1–6. [Google Scholar]
Norris, J.R. Markov Chains; Cambridge University Press: Cambridge, UK, 1998; Number 2. [Google Scholar]
Cheng, Y.; Deng, X.; Li, Y.; Yan, X. Tight incentive analysis of Sybil attacks against the market equilibrium of resource exchange over general networks. Games Econ. Behav. 2024, 148, 566–610. [Google Scholar] [CrossRef]
Sapirshtein, A.; Sompolinsky, Y.; Zohar, A. Optimal selfish mining strategies in bitcoin. In Proceedings of the International Conference on Financial Cryptography and Data Security, Christ Church, Barbados, 22–26 February 2016; Springer: Berlin/Heidelberg, Germany, 2016; pp. 515–532. [Google Scholar]
Zhang, Y.; Liu, M.; Guo, J.; Wang, Z.; Wang, Y.; Liang, T.; Singh, S.K. Optimal revenue analysis of the stubborn mining based on Markov decision process. In Proceedings of the International Conference on Machine Learning for Cyber Security, Guangzhou, China, 2–4 December 2022; Springer: Berlin/Heidelberg, Germany, 2022; pp. 299–308. [Google Scholar]
Yang, G.; Wang, Y.; Wang, Z.; Tian, Y.; Yu, X.; Li, S. IPBSM: An optimal bribery selfish mining in the presence of intelligent and pure attackers. Int. J. Intell. Syst. 2020, 35, 1735–1748. [Google Scholar] [CrossRef]
Azimy, H.; Ghorbani, A. Competitive selfish mining. In Proceedings of the 2019 17th International Conference on Privacy, Security and Trust (PST), Fredericton, NB, Canada, 26–28 August 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1–8. [Google Scholar]
Bai, Q.; Xu, Y.; Liu, N.; Wang, X. Blockchain mining with multiple selfish miners. IEEE Trans. Inf. Forensics Secur. 2023, 18, 3116–3131. [Google Scholar] [CrossRef]
Jeyasheela Rakkini, M.; Geetha, K. Q-Learning model for selfish miners with optional stopping theorem for honest miners. Int. Trans. Oper. Res. 2024, 31, 3975–3998. [Google Scholar] [CrossRef]
Bahrani, M.; Weinberg, S.M. Undetectable selfish mining. In Proceedings of the 25th ACM Conference on Economics and Computation, New Haven, CT, USA, 8–11 July 2024; pp. 1017–1044. [Google Scholar]
Billah, S. One Weird Trick to Stop Selfish Miners: Fresh Bitcoins, a Solution for the Honest Miner. 2015. Available online: https://www.researchgate.net/profile/Saki-Billah/publication/290955755_One_Weird_Trick_to_Stop_Selfish_Miners_Fresh_Bitcoins_A_Solution_for_the_Honest_Miner/links/569cd9b708ae2e9667eb2555/One-Weird-Trick-to-Stop-Selfish-Miners-Fresh-Bitcoins-A-Solution-for-the-Honest-Miner.pdf (accessed on 16 September 2025).
Saad, M.; Njilla, L.; Kamhoua, C.; Mohaisen, A. Countering selfish mining in blockchains. In Proceedings of the 2019 International Conference on Computing, Networking and Communications (ICNC), Honolulu, HI, USA, 18–21 February 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 360–364. [Google Scholar]
Wang, Z.; Liu, J.; Wu, Q.; Zhang, Y.; Yu, H.; Zhou, Z. An analytic evaluation for the impact of uncle blocks by selfish and stubborn mining in an imperfect Ethereum network. Comput. Secur. 2019, 87, 101581. [Google Scholar] [CrossRef]
Biçer, O.; Küpçü, A. FORTIS: Selfish Mining Mitigation by (FOR) geable (TI) me (S) tamps. Distrib. Ledger Technol. Res. Pract. 2023, 2, 28. [Google Scholar] [CrossRef]
Lee, S.; Kim, S. Rethinking selfish mining under pooled mining. ICT Express 2023, 9, 356–361. [Google Scholar] [CrossRef]
Habib, M.A.; Manik, M.M.H. A technique to avoid blockchain denial of service (bdos) and selfish mining attack. In Proceedings of the 2023 Fifth International Conference on Blockchain Computing and Applications (BCCA), Kuwait City, Kuwait, 24–26 October 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 585–590. [Google Scholar]
Wang, Y.; Li, C.; Zhang, Y.; Li, T.; Ning, J.; Gai, K.; Choo, K.K.R. A detection method against selfish mining-like attacks based on ensemble deep learning in IoT. IEEE Internet Things J. 2024, 11, 19564–19574. [Google Scholar] [CrossRef]
Nikhalat-Jahromi, A.; Saghiri, A.M.; Meybodi, M.R. Nik defense: An artificial intelligence based defense mechanism against selfish mining in bitcoin. arXiv 2023, arXiv:2301.11463. [Google Scholar] [CrossRef]
Karame, G.O.; Androulaki, E.; Capkun, S. Double-spending fast payments in bitcoin. In Proceedings of the 2012 ACM Conference on Computer and Communications Security, Raleigh, NC, USA, 16–18 October 2012; pp. 906–917. [Google Scholar]
Miller, A.; Kosba, A.; Katz, J.; Shi, E. Nonoutsourceable scratch-off puzzles to discourage bitcoin mining coalitions. In Proceedings of the 22nd ACM Sigsac Conference on Computer and Communications Security, Denver, CO, USA, 12–16 October 2015; pp. 680–691. [Google Scholar]
Yang, X.; Chen, Y.; Chen, X. Effective scheme against 51% attack on proof-of-work blockchain with history weighted information. In Proceedings of the 2019 IEEE International Conference on Blockchain (Blockchain), Atlanta, GA, USA, 14–17 July 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 261–265. [Google Scholar]
Sayeed, S.; Marco-Gisbert, H. Assessing blockchain consensus and security mechanisms against the 51% attack. Appl. Sci. 2019, 9, 1788. [Google Scholar] [CrossRef]
Bae, J.; Lim, H. Random mining group selection to prevent 51% attacks on bitcoin. In Proceedings of the 2018 48th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W), Luxembourg, 25–28 June 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 81–82. [Google Scholar]
Rosenfeld, M. Analysis of bitcoin pooled mining reward systems. arXiv 2011, arXiv:1112.4980. [Google Scholar] [CrossRef]
Laszka, A.; Johnson, B.; Grossklags, J. When Bitcoin mining pools run dry: A game-theoretic analysis of the long-term impact of attacks between mining pools. In Proceedings of the International Conference on Financial Cryptography and Data Security, San Juan, Puerto Rico, 26–30 January 2015; Springer: Berlin/Heidelberg, Germany, 2015; pp. 63–77. [Google Scholar]
Kwon, Y.; Kim, D.; Son, Y.; Vasserman, E.; Kim, Y. Be selfish and avoid dilemmas: Fork after withholding (faw) attacks on bitcoin. In Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, Dallas, TX, USA, 30 October–3 November 2017; pp. 195–209. [Google Scholar]
Dong, X.; Wu, F.; Faree, A.; Guo, D.; Shen, Y.; Ma, J. Selfholding: A combined attack model using selfish mining with block withholding attack. Comput. Secur. 2019, 87, 101584. [Google Scholar] [CrossRef]
Wang, Y.; Yang, G.; Li, T.; Zhang, L.; Wang, Y.; Ke, L.; Dou, Y.; Li, S.; Yu, X. Optimal mixed block withholding attacks based on reinforcement learning. Int. J. Intell. Syst. 2020, 35, 2032–2048. [Google Scholar] [CrossRef]
Schrijvers, O.; Bonneau, J.; Boneh, D.; Roughgarden, T. Incentive compatibility of bitcoin mining pool reward functions. In Proceedings of the International Conference on Financial Cryptography and Data Security, Christ Church, Barbados, 22–26 February 2016; Springer: Berlin/Heidelberg, Germany, 2016; pp. 477–498. [Google Scholar]
Bag, S.; Sakurai, K. Yet another note on block withholding attack on bitcoin mining pools. In Proceedings of the International Conference on Information Security, Honolulu, HI, USA, 3–6 September 2016; Springer: Berlin/Heidelberg, Germany, 2016; pp. 167–180. [Google Scholar]
Chen, H.; Chen, Y.; Xiong, Z.; Han, M.; He, Z.; Liu, B.; Wang, Z.; Ma, Z. Prevention method of block withholding attack based on miners’ mining behavior in blockchain. Appl. Intell. 2023, 53, 9878–9896. [Google Scholar] [CrossRef]
Zhou, Z.; Chen, W.; Li, L.; Zhang, Y. simuBits: Pool Security Verification of Novel Mining Attacks. In Proceedings of the International Conference on Provable Security, Wuhan, China, 20–22 October 2023; Springer: Berlin/Heidelberg, Germany, 2023; pp. 429–447. [Google Scholar]
Gasti, P.; Tsudik, G.; Uzun, E.; Zhang, L. DoS and DDoS in named data networking. In Proceedings of the 2013 22nd International Conference on Computer Communication and Networks (ICCCN), Nassau, Bahamas, 30 July–2 August 2013; IEEE: Piscataway, NJ, USA, 2013; pp. 1–7. [Google Scholar]
Heilman, E.; Kendler, A.; Zohar, A.; Goldberg, S. Eclipse attacks on Bitcoin’s peer-to-peer network. In Proceedings of the 24th USENIX Security Symposium (USENIX Security 15), Washington, DC, USA, 12–14 August 2015; pp. 129–144. [Google Scholar]
Mirkin, M.; Ji, Y.; Pang, J.; Klages-Mundt, A.; Eyal, I.; Juels, A. Bdos: Blockchain denial-of-service. In Proceedings of the 2020 ACM SIGSAC Conference on Computer and Communications Security, Virtual Event, 9–13 November 2020; pp. 601–619. [Google Scholar]
Wang, Q.; Xia, T.; Wang, D.; Ren, Y.; Miao, G.; Choo, K.K.R. SDoS: Selfish mining-based denial-of-service attack. IEEE Trans. Inf. Forensics Secur. 2022, 17, 3335–3349. [Google Scholar] [CrossRef]
Wang, Q.; Li, C.; Xia, T.; Ren, Y.; Wang, D.; Zhang, G.; Choo, K.K.R. Optimal selfish mining-based denial-of-service attack. IEEE Trans. Inf. Forensics Secur. 2023, 19, 835–850. [Google Scholar] [CrossRef]
Ilyas, B.; Kumar, A.; Setitra, M.A.; Bensalem, Z.A.; Lei, H. Prevention of DDoS attacks using an optimized deep learning approach in blockchain technology. Trans. Emerg. Telecommun. Technol. 2023, 34, e4729. [Google Scholar] [CrossRef]
Sousa, J.E.A.; Oliveira, V.C.; Valadares, J.A.; Vieira, A.B.; Bernardino, H.S.; Villela, S.M.; Goncalves, G.D. Fighting under-price DoS attack in ethereum with machine learning techniques. ACM SIGMETRICS Perform. Eval. Rev. 2021, 48, 24–27. [Google Scholar] [CrossRef]
Raikwar, M.; Gligoroski, D. Dos attacks on blockchain ecosystem. In Proceedings of the European Conference on Parallel Processing, Lisbon, Portugal, 30 August–3 September 2021; Springer: Berlin/Heidelberg, Germany, 2021; pp. 230–242. [Google Scholar]
Nazir, A.; Wang, Z. A comprehensive survey of ChatGPT: Advancements, applications, prospects, and challenges. Meta-Radiology 2023, 1, 100022. [Google Scholar] [CrossRef]
Deng, Z.; Ma, W.; Han, Q.L.; Zhou, W.; Zhu, X.; Wen, S.; Xiang, Y. Exploring DeepSeek: A Survey on Advances, Applications, Challenges and Future Directions. IEEE/CAA J. Autom. Sin. 2025, 12, 872–893. [Google Scholar] [CrossRef]
McIntosh, T.R.; Susnjak, T.; Liu, T.; Watters, P.; Xu, D.; Liu, D.; Halgamuge, M.N. From google gemini to openai Q*(Q-Star): A survey on reshaping the generative artificial intelligence (AI) research landscape. Technologies 2025, 13, 51. [Google Scholar] [CrossRef]

Figure 1. State 0 and State 1 in the BSbM attack. (a) State 0: No fork; all miners mine on the public chain. (b) State 1: Attacker has a 1-block lead on the private chain. The meanings of the notations in the figure are as follows, Square: block before the public chain fork; Diamond: block mined by an honest miner; Circular solid line: open block mined by an attacker; Circular dashed line: hidden block mined by an attacker; triangle: block mined by a targeted miner; Green arrow: honest miner’s mining power; Blue arrow: target miner’s mining power, one of which will be chosen in different cases of ACCEPT and RETURN; Red arrow: attacker’s mining power.

Figure 2. State

0_{o}^{'}

under the BSbM attack.

Figure 2. State

0_{o}^{'}

under the BSbM attack.

Figure 3. State

0_{b}^{'}

under the BSbM attack.

Figure 3. State

0_{b}^{'}

under the BSbM attack.

Figure 4. State

1_{o}^{'}

under the BSbM attack.

Figure 4. State

1_{o}^{'}

under the BSbM attack.

Figure 5. State

1_{b}^{'}

under the BSbM attack.

Figure 5. State

1_{b}^{'}

under the BSbM attack.

Figure 6. State 2 under the BSbM attack. (a) Target miner’s block exists on the honest chain. (b) No target miner block on the honest chain.

Figure 7. State L under the BSbM attack. (a) The target miner’s block exists on the honest chain. (b) No target miner block exists on the honest chain.

Figure 8. Markov chain model of the BSbM attack.

Figure 9. State L: The construction process of the triple fork in LHBSbM.

Figure 10. LHBSbM triple-fork ideal state.

Figure 11. Target miner’s additional revenue by accepting vs. refusing the bribe under different

α

values.

Figure 11. Target miner’s additional revenue by accepting vs. refusing the bribe under different

α

values.

Figure 12. Comparison of the target miner’s revenue for accepting and refusing the bribe under different honest miner follow ratios r.

Figure 13. The attacker’s additional revenue when the target miner refuses the bribe.

Figure 14. The attacker’s additional revenue when the target miner accepts the bribe.

Figure 15. Comparison of the attacker’s revenue when the target miner accepts vs. refuses the bribe.

Figure 16. Average number of abandoned blocks for LHBSbM attack and BSbM attack in different initial states.

Figure 17. Comparison of attack revenue between LHBSbM and BSbM for different r values.

Table 1. Attack strategy under competitive BSbM (H: height of honest chain; A: height of attacker’s private chain; S: current chain reference; r: random number; B: bribery flag for target miner).

Initialization		$S, A \leftarrow H; L, F = 0; R = 0.5; r \leftarrow random (0, 1)$
	Case	Honest Miner Finds Block	Target Miner Finds Block	Attacker Finds Block
State		Honest Miner Finds Block	Target Miner Finds Block	Attacker Finds Block
$L = 0, F = 0$		$H \leftarrow H + 1; S, A \leftarrow H$	$H \leftarrow H + 1; S, A \leftarrow H$	$S \leftarrow S + 1;$ $L = L + 1$
$L = 0, F = 1$		$r \leftarrow r a n d o m (0, 1)$ $r < R$ : $A \leftarrow A + 1; S, H \leftarrow A; F = 0$	$! B$ : $H \leftarrow H + 1; S, A \leftarrow H; F = 0$ B: $A \leftarrow A + 1; S, H \leftarrow A; F = 0$
$L = 0, F = 2$		$r \geq R$ : $H \leftarrow H + 1; S, A \leftarrow H; F = 0$	$H \leftarrow H + 1; S, A \leftarrow H; F = 0$
$L = 1, F = 0$		$H \leftarrow H + 1; A \leftarrow A + 1; L = 0; F = 1$	$H \leftarrow H + 1; A \leftarrow A + 1; L = 0; F = 2$
$L = 1, F = 1$		$r \leftarrow r a n d o m (0, 1)$ $r < R$ : $A \leftarrow A + 1; H \leftarrow A; A \leftarrow S; L = 0$	$! B$ : $H \leftarrow H + 1; A \leftarrow S; L = 0; F = 2$ B: $A \leftarrow A + 1; H \leftarrow A; A \leftarrow S; L = 0; F = 2$
$L = 1, F = 2$		$r \geq R$ : $H \leftarrow H + 1; A \leftarrow S; L = 0$	$H \leftarrow H + 1; A \leftarrow S; L = 0$
$L = 2$		$H, A \leftarrow S; L = 0; F = 0$	$H, A \leftarrow S; L = 0; F = 0$
$L \geq 3$		$H \leftarrow H + 1; A \leftarrow A + 1; L = L - 1$	$H \leftarrow H + 1; A \leftarrow A + 1; L = L - 1$

Table 2. The attack strategy under the LHBSbM attack. (

L^{'}

denotes auxiliary lead blocks for the attacker’s private chain in triple-fork scenarios;

S^{'}

indicates fork state flags;

H S

is the hidden mining state flag).

Table 2. The attack strategy under the LHBSbM attack. (

L^{'}

denotes auxiliary lead blocks for the attacker’s private chain in triple-fork scenarios;

S^{'}

indicates fork state flags;

H S

is the hidden mining state flag).

Initialization		$same; S, A \leftarrow H; L = 0; L^{'} = 1; HS = 0; S^{'} = 0; random (); r$
	Case	Honest Miner Finds Block	Target Miner Finds Block	Attacker Finds Block
State		Honest Miner Finds Block	Target Miner Finds Block	Attacker Finds Block
$L < 2$		same	same
$L = 2$		$H, A \leftarrow S; L = 0; S^{'} = 0; L^{'} = 1$	$S^{'} = 0 : H, A \leftarrow S; L = 0; S^{'} = 0; L^{'} = 1$ $S^{'} = 1 : A \leftarrow A + 1; L^{'} = L; S^{'} = 2$ $S^{'} = 2, L^{'} = L : H, A \leftarrow S; L = 0; S^{'} = 0; L^{'} = 1$ $S^{'} = 2, L^{'} > L : A \leftarrow A + 1; L^{'} = L$
$L \geq 3, S^{'} = 0$		$H S = 0 : H \leftarrow H + 1; L = L - 1; S^{'} = 1$ $H S = 1, r a n d o m () < r : A \leftarrow A + 1;$ $H \leftarrow A; A = n e w (A); L = L - 1; H S = 0$ $H S = 1, r a n d o m () > r : H \leftarrow H + 1;$ $L = L - 1; A \leftarrow A + 1$	$H S = 0 : A \leftarrow A + 1; H \leftarrow A;$ $A = n e w (A); L = L - 1; H S = 1$ $H S = 1 : H \leftarrow H + 1; A \leftarrow A + 1; L = L - 1$	$S \leftarrow S + 1;$ $L = L + 1;$ $L^{'} = L + 1$
$L \geq 3, S^{'} = 1$		$H \leftarrow H + 1; L = L - 1; A = A + 1$	$A \leftarrow A + 1; L^{'} = L; S^{'} = 2$
$L \geq 3, S^{'} = 2$		$L^{'} > L : H \leftarrow H + 1; L = L + 1$ $S^{'} = 1; A = n e w (A)$ $L^{'} = L : H \leftarrow H + 1; L = L - 1$	$L^{'} = L : A \leftarrow A + 1; H \leftarrow A$ $A = n e w (A); L = L - 1; S^{'} = 0; H S = 1$ $L^{'} > L : A \leftarrow A + 1; L^{'} = L$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Li, W.; Jiang, S.; Ni, B.; Liang, W.; Wang, Y. From Bribery–Stubborn Mining to Leading Hidden Triple-Fork Strategies for Incentive Optimization in PoW Blockchains. Symmetry 2025, 17, 1618. https://doi.org/10.3390/sym17101618

AMA Style

Li W, Jiang S, Ni B, Liang W, Wang Y. From Bribery–Stubborn Mining to Leading Hidden Triple-Fork Strategies for Incentive Optimization in PoW Blockchains. Symmetry. 2025; 17(10):1618. https://doi.org/10.3390/sym17101618

Chicago/Turabian Style

Li, Weijie, Shan Jiang, Bina Ni, Weipeng Liang, and Yu Wang. 2025. "From Bribery–Stubborn Mining to Leading Hidden Triple-Fork Strategies for Incentive Optimization in PoW Blockchains" Symmetry 17, no. 10: 1618. https://doi.org/10.3390/sym17101618

APA Style

Li, W., Jiang, S., Ni, B., Liang, W., & Wang, Y. (2025). From Bribery–Stubborn Mining to Leading Hidden Triple-Fork Strategies for Incentive Optimization in PoW Blockchains. Symmetry, 17(10), 1618. https://doi.org/10.3390/sym17101618

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

From Bribery–Stubborn Mining to Leading Hidden Triple-Fork Strategies for Incentive Optimization in PoW Blockchains

Abstract

1. Introduction

2. Related Work

2.1. Selfish Mining Attack

2.2. The 51% Attack

2.3. Block Withholding Attack

2.4. Denial-of-Service Attack

3. Attack Strategy and Theoretical Modeling

3.1. Bribery–Stubborn Mining

3.1.1. Attack Strategy

3.1.2. State Transitions and Event Modeling

State 0

State 1

State 0 o ′

State 0 b ′

State 1 o ′

State 1 b ′

State 2

State L ( L ≥ 3 )

3.1.3. Theoretical Analysis

Attacker’s Reward

Target Miner’s Reward

Honest Miners’ Reward

3.2. Triple-Fork-Based Leading Hidden BSbM

3.2.1. Attack Strategy

3.2.2. State Transitions and Event Modeling

State L ( L ≥ 3 , L ∈ Z + )

3.2.3. Theoretical Analysis

4. Simulation and Evaluation

4.1. Evaluation of BSbM Attack

4.1.1. Target Miner’s Revenue

4.1.2. Attacker’s Revenue

4.2. Evaluation of LHBSbM Optimization

4.2.1. Comparison of Discarded Blocks (Extra Mining Time)

4.2.2. Optimized Attack Revenue

4.3. Summary

5. Discussion

5.1. Review of Our Work

5.2. Defensive Measures

Protocol-Level Modifications

Network-Level Monitoring

Economic and Game Theory Countermeasures

5.3. Potential Directions for Future Research

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

State $0_{o}^{'}$

State $0_{b}^{'}$

State $1_{o}^{'}$

State $1_{b}^{'}$

State L ( $L \geq 3$ )

State L ( $L \geq 3, L \in Z^{+}$ )