Task Offloading of Parked Vehicles Edge Computing Based on Differential Privacy Hotstuff

Liang, Guoling; Su, Zhaoyu; Li, Chunhai; Chen, Mingfeng; Zhao, Feng

doi:10.3390/info17040339

Open AccessArticle

Task Offloading of Parked Vehicles Edge Computing Based on Differential Privacy Hotstuff

by

Guoling Liang

^1,2

,

Zhaoyu Su

¹

,

Chunhai Li

¹

,

Mingfeng Chen

¹

and

Feng Zhao

^1,*

¹

Guangxi Engineering Research Center of Industrial Internet Security and Blockchain, Guilin University of Electronic Technology, Guilin 541004, China

²

School of Physics and Telecommunication Engineering, Yulin Normal University, Yulin 537000, China

^*

Author to whom correspondence should be addressed.

Information 2026, 17(4), 339; https://doi.org/10.3390/info17040339

Submission received: 25 February 2026 / Revised: 20 March 2026 / Accepted: 23 March 2026 / Published: 1 April 2026

(This article belongs to the Section Information and Communications Technology)

Download

Browse Figures

Versions Notes

Abstract

The integration of blockchain into parked vehicle edge computing (PVEC) has emerged as a promising approach to mitigate the inherent trust challenges in distributed and untrusted computing environments. However, during task offloading and consensus, vehicles are vulnerable to location information disclosure, leading to privacy leakage. To address this problem, we propose a location differential privacy-enabled blockchain PVEC (DBPVEC) framework to protect location information during offloading and consensus. Specifically, we design a location differential privacy mechanism based on the Laplace mechanism and theoretically prove that it satisfies ε-differential privacy. This mechanism perturbs vehicles’ locations, and a privacy-preserving offloading strategy is designed to enhance the Hotstuff consensus and protect location privacy in edge computing. Subsequently, we formulate a joint optimization problem, considering system energy consumption, latency, and privacy strength. To solve it, we design a two-layer deep reinforcement learning (DRL) algorithm, with a Deep Q-Network (DQN) as the upper layer and a Deep Deterministic Policy Gradient (DDPG) as the lower layer, to determine the optimal offloading strategy. The experimental results demonstrate that our scheme achieves significant reductions compared to the two baseline methods: the total cost decreases by 68.31% and 63.25%, energy consumption by 9.96% and 16.27%, and delay by 31.46% and 18.07%, respectively. Moreover, it effectively preserves vehicle location privacy during task offloading and consensus while maintaining favorable performance in energy consumption and latency.

Keywords:

task offloading; differential privacy; blockchain; PVEC; Hotstuff

1. Introduction

Parked vehicles (PVs) provide abundant communication and computing resources for vehicle edge computing (VEC), significantly alleviating the shortages of computing, communication, and storage resources that are inherent in traditional VEC, thereby forming the paradigm of parked vehicle edge computing [1]. Task offloading, a key technology in PVEC, leverages the abundant idle resources of PVs by transferring computational tasks from terminal devices to them. This approach enables rapid task execution and conserves device battery power while enhancing the overall system performance [2], thereby providing users with low-latency and high-efficiency computing services.

However, because task offloading and transactions in VEC are conducted in open wireless environments, ensuring transparency, immutability, and overall system security for multi-party resource transactions becomes challenging in complex scenarios [3]. This inherent vulnerability gives rise to significant trust and security challenges in task offloading for PVEC. To address these issues, researchers have introduced blockchain into task offloading to establish a trusted offloading environment. Blockchain and PVEC share a common distributed network architecture and possess comparable storage and computing capabilities [4], which naturally facilitate the integration of blockchain into PVEC task offloading. Through its transparent and tamper-proof consensus mechanism, blockchain ensures data integrity and immutability, thereby enhancing trust among participants [5,6].

Despite the trust guarantees provided by blockchain, the privacy of participating vehicles remains vulnerable in wireless environments. In particular, location information can be leaked during both the task offloading phase and the consensus phase. For example, an adversary could deploy rogue roadside units to monitor wireless signals and infer a vehicle’s location by analyzing the timing and power of the transmitted messages [7]. Alternatively, by observing which PVs consistently participate in consensus rounds in a given geographic area, an attacker can track their movement patterns, even if the transaction content is encrypted. Such location leakage can lead to serious consequences like physical tracking or targeted attacks.

Several existing studies have attempted to integrate privacy-preserving mechanisms with blockchain in vehicular networks. However, these works either focus on identity privacy or data confidentiality and do not specifically address location privacy during task offloading [8,9]. More importantly, they treat privacy protection as an independent module, neglecting its impact on system performance metrics such as latency and energy consumption. The additional noise injected for privacy inevitably degrades the communication quality (e.g., Signal-to-Interference-plus-Noise Ratio, SINR) among consensus nodes, which in turn affects the consensus efficiency. To the best of our knowledge, no prior work has jointly optimized the privacy strength and system performance in a blockchain-based task offloading framework for PVEC.

Beyond privacy concerns, the architectural design of existing blockchain-based task offloading frameworks also poses scalability challenges. Most of them adopt a single-chain architecture deployed on Roadside Units (RSUs), which handle all consensus tasks. As the number of vehicles grows, the consensus overhead on RSUs increases dramatically, leading to higher latency and communication bottlenecks. Moreover, to optimize offloading decisions under dynamic environments, recent studies have employed DRL algorithms. However, these algorithms typically use a single-layer network that struggles with the inherent mixed discrete-continuous action space. The single network must simultaneously handle both types of actions, resulting in high computational complexity, slow convergence, and poor sample efficiency—issues that are particularly critical in resource-constrained vehicular environments. Therefore, designing a scalable and computationally efficient framework that jointly addresses privacy, security, and performance remains an open challenge.

To overcome the aforementioned issues, a DBPVEC offloading framework is proposed. Compared with traditional blockchain-based PVEC, this framework incorporates parked vehicles as consensus nodes and forms a dual-chain structure with RSUs, thereby enhancing the scalability. In addition, location differential privacy is employed during the user offloading process to protect users’ location privacy. Moreover, during the Hotstuff consensus process, the locations of PVs are perturbed by adopting the location differential privacy mechanism to ensure that their location information is not divulged. Then, a two-layer DRL algorithm is developed to perform joint optimization of the system energy consumption, latency, and privacy strength, striking a balance between the system performance and security. The primary contributions of this study are listed as follows.

We propose a novel BPVEC task offloading framework with location differential privacy. In this framework, parked vehicles are introduced as consensus nodes and, together with RSUs, form a dual-blockchain architecture. This architecture establishes a trusted task offloading environment while improving scalability. In addition, it incorporates the location differential privacy to protect the location privacy of both users and consensus nodes, thereby effectively ensuring the security of task offloading.
We design a location differential privacy mechanism based on the Laplace mechanism. Considering the impact of location perturbation on the wireless communication quality, we leverage the low communication complexity of the Hotstuff consensus. We then integrate the location differential privacy mechanism with the Hotstuff consensus to construct a novel location privacy-enhanced Hotstuff consensus algorithm. Specifically, before the parked vehicle consensus nodes execute the Hotstuff consensus, their locations are perturbed with a consideration of the consensus energy consumption and latency to mitigate the impact on system performance.
We build a joint optimization problem based on the system energy consumption, latency and privacy strength. By adopting DQN as the upper layer and DDPG as the lower layer, a two-layer DRL model is established to determine the optimal offloading to roadside unit ratio, privacy budget, number of consensus nodes and block size.
We conduct simulation experiments to evaluate the performance of our proposed scheme. The results demonstrate that our approach not only preserves the vehicle’s location privacy during task offloading and consensus, but also achieves favorable performance in terms of energy consumption and latency.

The rest of this paper is organized as follows. Section 2 reviews the recent related work on blockchain-based task offloading and privacy-preserving task offloading. Section 3 introduces the principle of differential privacy protection. The system model of DBPVEC is designed in Section 4. Section 5 discusses the joint optimization problem of system energy consumption, latency, and privacy strength. Section 6 designs the location differential privacy Hotstuff consensus algorithm and establishes a two-layer DRL model. In Section 7, we evaluate the performance of the proposed scheme, based on the experimental results. Finally, Section 8 concludes this paper.

2. Related Work

2.1. Blockchain for Task Offloading

Traditional PVEC adopts edge servers or cloud servers to centrally manage vehicle information and transactions during task offloading, making the system prone to single-point failures and vulnerable to data tampering. To address this, researchers have integrated blockchain into edge computing. For instance, the literature [10] combined blockchain with edge computing to tackle the user trust management issue, protecting the information privacy of user vehicles in the offloading process and resolving the bottleneck problem caused by a single edge computing node. Similarly, the works in [11,12] deployed blockchain on edge servers to achieve cross-regional identity recognition and trust management. While these works effectively utilized blockchain for identity authentication, they stopped short of leveraging it to build a fully trusted network for secure task offloading and transactions. To build a trusted network, Ren et al. [13] integrated Software-Defined Networking (SDN) into a traditional network architecture, where a consortium blockchain served as the infrastructure to share network topology information across SDN controllers, establishing a trusted offloading environment. The study in [14] proposed a blockchain-based vehicle edge computing (BEVEC) framework to construct a trusted offloading network; this framework relied on DPoS to ensure data accuracy and integrity, and designed a system utility function to measure the performance of BEVEC. Despite establishing trust, these frameworks often overlooked the challenge of user selfishness. To incentivize resource sharing while ensuring user information security, reference [15] adopted multi-step smart contracts for secure resource sharing and devised a BFT-based PoS consensus. However, this approach lacked joint optimization of system performance. To further enhance the system performance and blockchain efficiency, Liang et al. [16] developed a blockchain-based decentralized task offloading framework that adopts a multi-factor score to dynamically allocate tasks among computing nodes. The PoS consensus and smart contract-based dynamic pricing were combined to incentivize participation and balance workloads, leading to reduced latency and improved system efficiency. This work represented a step towards joint optimization, but its focus remained primarily on offloading efficiency. A more comprehensive integration was presented in [17] with the FBMTO framework, which explicitly balanced the costs and benefits of offloading. The consensus mechanism is a key factor affecting the blockchain performance. Therefore, improving the efficiency of the consensus mechanism is crucial for enhancing the system performance. The literature [4,18] improved PBFT and conducted joint optimization to enhance system performance. Considering the impact of parking, Huang et al. [19] utilized blockchain technology to implement decentralized PVFC, modeling and solving the optimal smart contract design problem to minimize the total payment amount for users, with similar studies in [20,21]. However, the aforementioned studies lack consideration of the information leakage of vehicles during the offloading and consensus processes, which greatly threatens vehicle security.

2.2. Differential Privacy for Task Offloading

To protect the privacy of user data during task offloading, the works in [22,23] focused on the data privacy of offloading tasks, applying differential privacy to protect offloaded data and realizing task offloading while safeguarding user data privacy. In light of the features of user offloaded data, Tchaye-Kondi et al. [24] utilized differential privacy to let edge devices perturb sensitive data features with random noise before transmission to untrusted central servers, which effectively preserved the privacy of sensitive information. While effective for the data content, these methods did not protect the contextual privacy of the task offloading. In [25], the authors leveraged deep learning to optimize task offloading while incorporating differential privacy to safeguard parameter privacy during deep learning execution. The literature [26] employed a histogram-driven local differential privacy algorithm and proposed the k-neighbor joint optimization algorithm (K-NJTA) for task offloading and resource allocation to optimize the global latency of task execution. Although these studies integrated privacy, they treated it as an external constraint, rather than an integral part of the optimization objective. Aimed at the security, latency, and overhead concerns in mobile edge computing, Zhang et al. [27] adopted differential privacy to perturb users’ location information before task offloading. This mechanism prevented malicious edge servers from accessing user privacy, while the performance of the edge computing system was also taken into consideration. Similar efforts have been made in references [28,29]. However, the above studies focused on location privacy protection in single-edge server scenarios. In multi-server environments, the study in [30] introduced an innovative location privacy-preserving offloading framework, where each user perturbs their actual location in a reasonable perturbation region and was provided with differential privacy guarantees. To realize the dynamic adjustment of location perturbation, the works in [31,32] introduced reinforcement learning on the basis of the aforementioned work, designing a DQN-DP task scheduling strategy to dynamically select offloading strategies and location perturbation strategies, effectively reducing the selection of high-risk state-action pairs. Nonetheless, the aforementioned studies applied differential privacy to locations with the distance between the server and the user as a metric, instead of directly describing the user’s location.

As shown in Table 1, existing works lack a BPVEC task offloading scheme that protects the location privacy of users and parked consensus nodes while ensuring offloading performance. To address this gap, this paper designs a DBPVEC offloading framework, which achieves secure offloading and transactions while protecting the location privacy of both users and parked consensus nodes. In addition, we jointly optimize differential privacy and system performance.

3. Differential Privacy Mechanism

Differential privacy is a method for protecting data privacy. It perturbs the original data by adding noise, ensuring that the impact of any single record on the output is always below a certain threshold, thereby preventing attackers from inferring sensitive information through differential analysis. Based on different parameter configurations, differential privacy is classified into ε-differential privacy and (ε, δ)-differential privacy. This paper employs ε-differential privacy, so the principle of ε-differential privacy is introduced below.

Definition 1

(ε-differential privacy definition [33]). Let D₁ and D₂ be adjacent datasets, and let M denote the randomized query mechanism. Then, the algorithm and all possible output sets

Y \subseteq R a n g (M)

satisfy

\Pr [M (D_{1}) \in Y] \leq e^{ε} \times \Pr [M (D_{2}) \in Y]

(1)

Where ε denotes the privacy budget. Rang(M) denotes the set of all possible outputs of the randomized mechanism M, and Pr[∙] represents the probability of an event.

Definition 2

(Global Sensitivity [33]). For any query function f:

D \to R^{d}

, the global sensitivity of f is defined as

Δ f = \max_{D_{1}, D_{2} \in D} ‖f (D_{1}) - f (D_{2})‖

(2)

Definition 3

(Laplace Mechanism [34]). For an arbitrary function f:

D \to R^{d}

, let

M

be the query mechanism. Given a privacy budget ε > 0, the Laplace mechanism is defined

M (D) = f (D) + (Y_{1}, Y_{2} \dots, Y_{k})

.

Where

Y_{i} \overset{i . i . d .}{\sim} L a p (0, Δ f / ε)

. Let β = Δf/ε. The Laplace distribution Lap(0,β) with β > 0 has the probability density function (pdf) [30]:

f_{L a p} (x; 0, β) = \frac{1}{2 β} e^{- \frac{|x|}{β}}, x \in R

(3)

Theorem 1

([34]). The Laplace mechanism preserves ε-differential privacy.

On the basis of the Laplace mechanism, an ε-differential privacy mechanism dedicated to location privacy protection is developed in this paper.

4. System Model

4.1. DBPVEC Offloading System Framework

To ensure task offloading security and to protect users’ and PVs’ location privacy in the task offloading of PVEC, the DBPVEC task offloading framework is proposed. The system block diagram is shown in Figure 1. The proposed framework involves multiple entities, namely user vehicles, PVs, RSUs, as well as VEC servers connected to RSUs. Among these entities, the RSUs and their connected servers form the main chain, while the PV network constitutes the sub-chain, together forming a dual-blockchain architecture. User vehicles can offload computing tasks either to RSUs or to PVs, thereby enabling distributed computation offloading and transaction processing.

When a user demands computing services, it sends a request message to an RSU or a PV. After the request legitimacy is verified by the RSU or PV, the computing task is offloaded by the user. To safeguard the user’s location privacy, location differential privacy is adopted to perturb the user’s location throughout the entire offloading process. After receiving the computing task, the RSU and PV conduct service transactions. They then form blocks containing task offloading and transaction records through a consensus mechanism and upload them to the blockchain. To prevent storage desynchronization in the sub-chain caused by the departure of PVs, the sub-chain uploads its blocks to RSUs after each consensus execution. To protect the location privacy of PVs during the consensus process, location differential privacy is used to perturb the locations of PVs, and the Hotstuff consensus mechanism is combined to complete the operation of the blockchain. To balance the system performance, consensus performance, and privacy security, a two-layer DRL structure is designed for optimization. Accordingly, this paper mainly focuses on integrating location differential privacy with the consensus mechanism to implement distributed task offloading with privacy protection, while maintaining a satisfactory system performance. For ease of reference, the key notations are summarized in Table 2.

4.2. Network Model

We divide time, T, into sufficiently small time slots, t. For each slot t, the communication topology between vehicles has no variation. Assume that there are U users in the region, and let U = {1, 2, …, U} denote the set of users with task offloading requests, S RSUs and P PVs,

𝒮

= {1, 2, …, S} and

𝒫

= {1, 2, …, P} denote their set, respectively. Then, according to references [35,36], the Signal-to-Interference-plus-Noise Ratio (SINR) of RSU or PV received from other vehicles is expressed as follows:

S I N R_{i} (t) = \frac{P_{t} η {(d_{0} / d_{i}^{s} (t))}^{α}}{N_{0} + \sum_{j = 1, j \neq i}^{N} [P_{t} η {(d_{0} / d_{i, j}^{s} (t))}^{α}]}

(4)

wherein P_t is the transmission power; η is the transceiver determination coefficient; d₀ and

d_{i}^{s} (t)

represent the far-field reference distance and the distance from the RSU or PV to the receiving vehicle, respectively;

d_{i, j}^{s} (t)

is the distance from the RSU or PV to the interfering vehicle; N₀ is the noise power; and α is the path loss coefficient.

4.3. Location Differential Privacy Model

Based on the Laplacian differential privacy mechanism, this section designs the location differential privacy model. Specifically, based on the two-dimensional Laplace mechanism, a location Laplace mechanism is defined, which serves as an unclipped location perturbation mechanism. Since the range of the two-dimensional Laplace mechanism is (−∞, +∞), a clipping function is introduced to confine the location range within [0, lmax].

Definition 4.

Let two adjacent locations be denoted as p₁ = (u₁, v₁) and p₂ = (u₂, v₂), respectively. Then, the global sensitivity of the location data is defined as the maximum value of the L₁ distance between adjacent locations:

Δ f_{L} = \max_{p_{1}, p_{2}} (|u_{1} - u_{2}| + |v_{1} - v_{2}|)

(5)

Definition 5.

Unclipped location perturbation mechanism

ℳ

′:

D \to R^{2}

is defined as

M^{'} (p) = p + η

(6)

In Equation (6), η = (η_x, η_y),

η_{x}, η_{y} \overset{i . i . d .}{\sim} L a p (0, β)

,

β = Δ f_{L} / ε

, the pdf of mechanism

ℳ

′ is

f_{M^{'}} (q| p) = \frac{1}{4 β} \exp (- \frac{|x - u| + |y - v|}{β}), q = (x, y) \in R^{2}

(7)

Definition 6.

Location differential privacy mechanism

ℳ

_L:

D \to D

is defined as the composite function of the unclipped mechanism and the clipping function,

M_{L} = c l i p \circ M^{'}

, i.e., for the input location p = (u,v) ∈ D:

M_{L} (p) = c l i p (M^{'} (p)) = (c l i p (u + η_{x}), c l i p (v + η_{y}))

(8)

Lemma 1 ([34]).

If mechanism

ℳ

satisfies ε-differential privacy, then the mechanism obtained by performing any deterministic mapping on its output also satisfies the ε-differential privacy.

Theorem 2.

The location differential privacy mechanism

ℳ

_L satisfies the ε-differential privacy.

Proof.

Let p₁ = (u₁, v₁) and p₂ = (u₂, v₂) be any two adjacent locations. For any q = (x,y) ∈ R², we have the following:

\frac{f_{M^{'}} (q| p_{1})}{f_{M^{'}} (q| p_{2})} = \exp (\frac{|x - u_{2}| + |y - v_{2}| - |x - u_{1}| - |y - v_{1}|}{β})

(9)

Based on the triangle inequality, we have

\frac{f_{M^{'}} (q| p_{1})}{f_{M^{'}} (q| p_{2})} \leq \exp (\frac{|u_{1} - u_{2}| + |v_{1} - v_{2}|}{β}) \leq \exp (\frac{Δ f_{L}}{β}) = \exp (ε)

(10)

Therefore,

ℳ

′ satisfies the ε-differential privacy. According to Definition 6, the mechanism

ℳ

_L is the composition of

ℳ

′ and a deterministic clipping function. On the basis of Lemma 1,

ℳ

_L also satisfies the ε-differential privacy. □

In addition, according to [37], we use the KL divergence as a measure of the privacy strength of differential privacy, see Formula (11).

D_{K L} = \int_{0}^{l_{m a x}} \int_{0}^{l_{m a x}} f (x, y |u_{1}, v_{1}) \ln \frac{f (x, y |u_{1}, v_{1})}{f (x, y |u_{2}, v_{2})} dxdy

(11)

4.4. Task Offloading Model

To protect the vehicle’s location privacy during task offloading, before the user performs offloading, the user’s location is perturbed, using a location differential privacy mechanism to obtain the communication transmission rate after location perturbation:

R_{i, j} (t) = W_{B} \cdot \log_{2} (1 + \frac{P_{t} η {(d_{0} / d_{i, j}^{d} (t))}^{α}}{N_{0} + \sum_{k = 1, k \neq i, k \neq j}^{N} [P_{t} η {(d_{0} / d_{i, k}^{d} (t))}^{α}]})

(12)

where W_B is the wireless communication bandwidth and

d_{i, j}^{d} (t)

and

d_{i, k}^{d} (t)

denote the distance from the perturbed PV to the receiving vehicle and the distance from the perturbed PV to the interfering vehicle, respectively.

Throughout the user request procedure, the computing task of the user can be implemented either on the PV or on the edge server connected to the RSU. Let φ_RSUi(t) be the proportion of task offloaded by the user to the RSU. The delays generated by offloading to the RSU and the PV for the computing service are described in Formulas (13) and (14), respectively.

T_{pai} (t) = \max_{k} (\frac{(1 - φ_{RSUi} (t)) D_{qi} C_{pk}^{exe}}{f_{pk} (t)} + \frac{(1 - φ_{RSUi} (t)) D_{qi}}{R_{i, k} (t)})

(13)

T_{RSUi} (t) = \max_{j} (\frac{φ_{RSUi} (t) D_{qi} C_{rj}^{exe}}{f_{rj} (t)} + \frac{φ_{RSUi} (t) D_{qi}}{R_{i, j} (t)})

(14)

wherein I ∈ U, j ∈ R, k ∈ P. f_rj(t) and f_pk(t) represent the computing capabilities of RSUj and PVk, respectively. D_qi is the total task size of useri.

C_{p k}^{e x e}, C_{r j}^{e x e}

represent the CPU cycles consumed by the PV and RSU for processing a unit task size, respectively. Both R_i,k(t) and R_i,j(t) are the transmission rates after location perturbation.

Therefore, the energy consumption generated by useri offloading tasks to the RSU and PV for processing are as follows, respectively:

E_{pai} (t) = \sum_{k = 1}^{P} (κ_{v} f_{pk} {(t)}^{2} (1 - φ_{RSUi} (t)) D_{qi} C_{pk}^{exe} + P_{t} \frac{(1 - φ_{RSUi} (t)) D_{qi}}{R_{i, k} (t)})

(15)

E_{RSUi} (t) = \sum_{j = 1}^{R} (κ_{v} f_{rj} {(t)}^{2} φ_{RSUi} (t) D_{qi} C_{rj}^{exe} + P_{t} \frac{φ_{RSUi} (t) D_{qi}}{R_{i, j} (t)})

(16)

κ_v and κ_r are the capacitance switch coefficients of PV and RSU, respectively, and P_t is the transmission power.

4.5. Blockchain Model

During the consensus process, the PV consensus nodes communicate with each other using the transmission rate after location perturbation. Assume that PVs require σ and θ CPU cycles to generate or verify a signature and MAC, with B PV nodes participating in the consensus. The primary node is Bl, and replica nodes are Bm, where Bl, Bm ∈ B = {1, 2, 3, …, B}, and there are F Byzantine nodes. Meanwhile, let D_Bv(t) and ϖ denote the block size and average transaction size, respectively. The perturbed transmission rate is used as the communication rate for the Hotstuff consensus, which is divided into five phases. Therefore, the time delay required for each phase during the Hotstuff consensus is as follows.

New-view phase: In this phase, the replica nodes generate a QC message and a signature, which are sent to the primary node for verification. Meanwhile, the primary node is required to verify the transactions and signatures from the users. Therefore, the number of CPU cycles for primary is as follows:

O_{Bl}^{new} (t) = \frac{D_{Bv} (t) (σ + θ)}{ϖ} + σ + (2 F + 1) θ

. The number of CPU cycles consumed by the replica nodes to generate messages and signatures is

O_{Bm}^{new} (t) = σ + θ

. The sizes of the QC and signatures from the replica nodes are much smaller than that of the block, and thus the transmission delay can be ignored. The time delay generated in this phase is as follows:

T_{new}^{c} (t) = \frac{O_{Bl}^{new} (t)}{f_{pBl} (t)} + \max_{Bm \in B \ Bl} \{\frac{O_{Bm}^{new} (t)}{f_{pBm} (t)}\}

(17)

Prepare phase: After verifying the messages in the new-view phase, the primary node generates a new block on the secure branch of HighQC, signs it, and distributes the block to the replica nodes via a proposal message for verification. The replica nodes need to verify the user transactions and the new block, vote and sign the proposal message, and send them to the primary node. Therefore, the number of CPU cycles required in this phase is

O_{B l}^{p r e} (t) = σ + θ

and

O_{Bm}^{pre} (t) = 2 σ + 2 θ + \frac{D_{Bv} (t)}{ϖ} (σ + θ)

. The computing delay can be expressed as follows:

T_{pre}^{c} (t) = \frac{O_{Bl}^{pre} (t)}{f_{pBl} (t)} + \max_{Bm \in B \ Bl} \{\frac{O_{Bm}^{pre} (t)}{f_{pBm} (t)}\}

(18)

Pre-commit phase: During this phase, the primary node verifies the voting signatures from the replica nodes, aggregates them into a new signature, generates and stores a prepare-QC locally, and then broadcasts it to the replica nodes in the form of a pre-commit message. Therefore, the primary node CPU cycles require

O_{Bl}^{pre_com} (t) = 2 σ + (2 F + 1) θ

, and the replica nodes need to verify, vote and sign the pre-commit message, so

O_{B m}^{p r e_c o m} (t) = 2 σ + 2 θ

. The computing delay is

T_{pre_com}^{c} (t) = \frac{O_{Bl}^{pre_com} (t)}{f_{pBl} (t)} + \max_{Bm \in B \ Bl} \{\frac{O_{Bm}^{pre_com} (t)}{f_{pBm} (t)}\}

(19)

Commit phase: Analogous to the pre-commit phase, the primary node verifies the votes and broadcasts the pre-commit QC to the replica nodes in the form of a commit message. Therefore, the number of CPU cycles of the primary node is

O_{B l}^{c o m} (t) = 2 σ + (2 F + 1) θ

, and the replica nodes’ is

O_{B m}^{c o m} (t) = 2 σ + 2 θ

. The computing delay is given by

T_{com}^{c} (t) = \frac{O_{Bl}^{com} (t)}{f_{pBl} (t)} + \max_{Bm \in B \ Bl} \{\frac{O_{Bm}^{com} (t)}{f_{pBm} (t)}\}

(20)

Decide phase: The primary node combines the commit-vote votes received from the replica nodes into a commitQC, places it on the decide message, and broadcasts it. After obtaining the commitQC from the deciding message, the replica nodes regard the current proposal as confirmed and start a new phase. Thus, the CPU cycles for the primary node are

O_{B l}^{d e c} (t) = 2 σ + (2 F + 1) θ

, while those for the replica nodes can be ignored. The computing delay is as follows.

T_{d e c}^{c} (t) = \frac{O_{B l}^{d e c} (t)}{f_{p B l} (t)}

(21)

In summary, the transmission delay of the consensus process mainly comes from the last four phases:

T_{totalpa}^{tr} (t) = 4 \times (\max_{Bm \in B \ Bl} \{\frac{D_{Bv} (t)}{R_{Bl, Bm} (t)}\} + \max_{Bm \in B \ Bl} \{\frac{D_{Bv} (t)}{R_{Bm, Bl} (t)}\})

(22)

The total computational delay in the consensus process is

T_{t o t a l p a}^{c} (t) = T_{n e w}^{c} (t) + T_{p r e}^{c} (t) + T_{p r e_c o m}^{c} (t) + T_{c o m}^{c} (t) + T_{d e c}^{c} (t)

(23)

Therefore, the total delay of the consensus process is

T_{t o t a l p a}^{B} (t) = \frac{U}{D_{B v} (t) / ϖ} (T_{t o t a l p a}^{c} (t) + T_{t o t a l p a}^{t r} (t))

(24)

In addition, the energy consumption for transmission and computation can be expressed as

E_{totalpa}^{tr} (t) = 4 \times (\sum_{\{Bm \in B \ Bl\}} \frac{D_{Bv} (t)}{R_{Bm, Bl} (t)} + \sum_{\{Bm \in B \ Bl\}} \frac{D_{Bv} (t)}{R_{Bl, Bm} (t)}) ∙ P_{t}

(25)

\begin{array}{l} E_{totalpa}^{c} (t) = κ_{v} f_{pBl} {(t)}^{2} (O_{Bl}^{new} (t) + O_{Bl}^{pre} (t) + O_{Bl}^{pre_com} (t) + O_{Bl}^{com} (t) + O_{Bl}^{dec} (t)) \\ \begin{matrix}  \end{matrix} + \sum_{\{Bm \in B \ Bl\}} κ_{v} f_{pBm} {(t)}^{2} (O_{Bm}^{new} (t) + O_{Bm}^{pre} (t) + O_{Bm}^{pre_com} (t) + O_{Bm}^{com} (t)) \end{array}

(26)

For the transaction of task offloading of U users, the total consensus energy consumption is

E_{totalpa}^{B} (t) = \frac{U}{D_{Bv} (t) / ϖ} (E_{totalpa}^{tr} (t) + E_{totalpa}^{c} (t))

(27)

Formulas (24) and (27) are the energy consumption and delay generated by PVs executing the Hotstuff consensus, and RSUs use the same form. Therefore, the energy consumption and delay for the consensus can be expressed as follows.

E_{total}^{B} (t) = (E_{totalpa}^{B} (t) + E_{totalRSU}^{B} (t))

(28)

T_{total}^{B} (t) = (T_{totalpa}^{B} (t) + T_{totalRSU}^{B} (t))

(29)

5. Optimization Problem Form

As can be seen from the above, the delay and energy consumption factors of offloading and blockchain need to be considered, and the location differential privacy strength also needs to be guaranteed. Therefore, this chapter constructs an optimization problem aimed at minimizing system delay and energy consumption while ensuring location differential privacy strength. In the system, the privacy budget controls the strength of location perturbation noise, which affects the quality of wireless communication, and subsequently influences both task offloading and blockchain consensus. Meanwhile, the block size and the number of consensus nodes directly impact the performance of the consensus mechanism, but both are constrained by the offloading ratio, which determines the distribution of computational tasks between RSUs and parked vehicles. A larger number of tasks leads to more transactions, thereby affecting the consensus overhead. However, the optimal offloading ratio depends on the channel quality and the strength of the privacy noise. Therefore, the offloading decisions, block size, number of consensus nodes, and privacy budget are mutually coupled. Optimizing any single variable in isolation cannot achieve the global optimum. It is necessary to jointly optimize the offloading decisions, block size, number of consensus nodes, and privacy budget to improve the system performance. Using the model presented above, the total system energy consumption is given by the following:

E_{total} (t) = \sum_{i = 1}^{U} (E_{pai} (t) + E_{RSUi} (t)) + E_{total}^{B} (t)

(30)

Similarly, the total delay of the system is as follows:

T_{total} (t) = \max \{\max_{i} T_{pai} (t), \max_{i} T_{RSUi} (t)\} + T_{total}^{B} (t)

(31)

The system needs to simultaneously optimize energy consumption, latency, and privacy strength, although the three objectives are often conflicting. To eliminate differences in dimensions and magnitudes among these metrics, normalization is required to ensure that the weighted sum is physically comparable. Therefore, aiming to balance energy consumption, latency, and privacy strength, the optimization problem is formulated as follows:

\min_{B (t), D_{Bv} (t), Φ_{RSU} (t), ε (t)} \frac{1}{U} (w_{1} E_{total} (t) + w_{2} T_{total} (t)) + w_{3} D_{K L} (t)

(32)

s . t . T_{p a i} (t), T_{R S U i} (t) \leq T_{m a x i}

(33)

E_{p a i} (t), E_{R S U i} (t) \leq E_{m a x i}

(34)

0 \leq ε (t) \leq 1

(35)

0 \leq φ_{R S U i} (t) \leq 1

(36)

wherein w₁, w₂ and w₃ are weight coefficients representing the weight of system delay, energy consumption and privacy strength in the optimization objective, respectively, and satisfy w₁ + w₂ +w₃ = 1. Furthermore, w₁, w₂ and w₃ allow for flexible prioritization of energy consumption, latency, and privacy strength. Intuitively, a larger w₃ imposes a higher penalty on the privacy loss D_KL(t), steering the optimizer toward a smaller ε(t) and stronger location perturbation. Conversely, larger w₁ or w₂ incentivize the optimizer to sacrifice privacy for better performance. The above optimization problem is a nonlinear and nonconvex optimization problem. To solve this problem while adapting to the dynamic IoV environment, we construct a two-layer DRL scheme to dynamically make decisions on offloading ratios, privacy budgets, and other parameters.

Φ_{RSU} (t) = \{φ_{RSUi} (t), \forall i \in U\}

is the proportion of the task size of each user offloaded to the RSU. Constraint conditions (33) and (34) limit the maximum values of delay and energy consumption, condition (35) ensures the range of the privacy budget, and condition (36) ensures that the offloading proportion of each user is less than one.

6. DBPVEC Task Offloading Strategy

6.1. Location Differential Privacy Hotstuff Algorithm

To protect the location privacy of PV consensus nodes during consensus, we combine the location differential privacy with the Hotstuff consensus algorithm, using the mechanism designed in Section 3 to perturb the locations of PVs and executing the consensus based on the perturbed locations, as shown in Algorithm 1.

Algorithm 1. Location differential privacy Hotstuff algorithm
Input: Block size D_Bv; Privacy budget ε; Sample number N; PV true position (x_pa,y_pa); PV number P.
Output: PV consensus energy consumption $E_{t o t a l p a}^{B};$ PV consensus delay $T_{t o t a l p a}^{B} .$
1:	Initialize the energy consumption weight a₁ and delay weight a₂.
2:	for i = 1:N do
3:	for j = 1:P do
4:	Obtain perturbed position (x_pert(j), y_pert(j)) by Formula (8).
5:	end for
6:	Compute SINR of PV by perturbed position (x_pert, y_pert), using Formula (4)
7:	Select consensus nodes by SINR, parking time and computing capability.
8:	Compute $E_{t o t a l p a}^{B} (i)$ and $T_{t o t a l p a}^{B} (i)$ based on Hotstuff model.
9:	Calculate $u (i) = a_{1} E_{t o t a l p a}^{B} (i) + a_{2} T_{t o t a l p a}^{B} (i)$
10:	end for
11:	Select the index of min(u)
12:	Return $E_{t o t a l p a}^{B} = E_{t o t a l p a}^{B} (i n d e x),$ $T_{t o t a l p a}^{B} = T_{t o t a l p a}^{B} (i n d e x) .$

In Algorithm 1, the locations of all PVs are perturbed by adding noise according to Formula (8); then, the SINR of each node is calculated according to Formula (4). When selecting consensus nodes, the node’s SINR, parking duration and computing capability are comprehensively considered for selection. After calculating the consensus energy consumption and consensus delay, the minimum value in the samples is chosen as the perturbed location. As can be seen from Algorithm 1, the location differential privacy mechanism merely adds noise to the location of each consensus node, and therefore has no impact on the correctness or security of the Hotstuff consensus. If B consensus nodes participate in the consensus, the complexity of the Hotstuff is O(B), while the complexity of Algorithm 1 is O(N × B).

6.2. Two-Tier DRL Task Offloading Algorithm

To achieve privacy security with low energy consumption and delay, the optimization problem is constructed as a Markov Decision Process (MDP), with its state space, action space, and reward function clearly defined. Considering the coexistence of discrete variables (block size, number of consensus nodes) and continuous variables (offloading ratio, privacy budget), a two-layer DRL model is developed, where DQN optimizes the discrete parameters and DDPG determines the continuous ones.

6.2.1. State Space

Each time slot t, the system state includes the average energy consumption and delay generated by consensus, privacy strength D_KL, and the system’s average energy consumption and delay. Among them, the average energy consumption of consensus

E_{a v g}^{B} (t) = \frac{1}{U} E_{t o t a l}^{B} (t)

and the average delay

E_{a v g}^{B} (t) = \frac{1}{U} E_{t o t a l}^{B} (t)

are used as the state input of the upper-layer DQN, while the system’s average energy consumption

E_{a v g} (t) = \frac{1}{U} E_{t o t a l} (t)

and average delay

T_{a v g} (t) = \frac{1}{U} T_{t o t a l} (t)

serve as the state input of the lower-layer DDPG. D_KL is the common input of the two layers. Therefore, the state space of the system is defined as

s_{t} = {s_{t}^{B}, s_{t}^{L}}

, and the state space is divided into

s_{t}^{B} = {E_{avg}^{B} (t), T_{avg}^{B} (t), D_{K L} (t)}

and

s_{t}^{L} = {E_{avg} (t), T_{avg} (t), D_{K L} (t)}

, which represent the state inputs of the upper and lower layers, respectively.

6.2.2. Action Space

Similarly, the action space of the system is divided into the upper-layer DQN action space and the lower-layer DDPG action space, corresponding to discrete variables and continuous variables, respectively. Therefore, the action space of the system is

a_{t} = {a_{t}^{B}, a_{t}^{L}}

, where

a_{t}^{B} = {B (t), D_{Bv} (t)}

is the discrete action space for handling blockchain parameter decisions, and

a_{t}^{L} = {Φ_{RSU} (t), ε (t)}

is the continuous action space for determining user offloading decisions and privacy budgets.

6.2.3. Reward Function

In DRL, the agent obtains a reward after taking an action, according to the environment, where the reward value is determined by the reward function. The larger the reward, the better the action. However, the optimization objective is to minimize the average energy consumption, average delay and privacy strength. Therefore, it is necessary to design the reward function as the negative value of the optimization objective. In addition, because the orders of magnitude of the average energy consumption, average delay and privacy strength values are different, normalization is required when designing the reward function. Hence, we use the function

F (x) = 1 - \log (1 + x) / \log (2 + x)

to realize the normalization of the reward function.

DQN reward function: As the upper-layer agent, DQN handles the decisions of the consensus process, and the reward function is composed of the average energy consumption, the average delay generated by the consensus process and the privacy strength. Therefore, the reward function can be designed as follows:

r_{t}^{B} = a_{1} F (E_{avg}^{B} (t)) + a_{2} F (T_{avg}^{B} (t)) + a_{3} (1 - D_{K L} (t))

(37)

DDPG reward function: As the lower-layer agent, this determines the user task offloading proportion and differential privacy budget, and needs to balance the system’s average energy consumption, average delay and differential privacy strength. Hence, the reward function is expressed as follows:

r_{t}^{L} = w_{1} F (E_{avg} (t)) + w_{2} F (T_{avg} (t)) + w_{3} (1 - D_{K L} (t))

(38)

6.2.4. Algorithm Design

The number of consensus nodes and the block size are discrete variables. Therefore, DQN is employed to optimize these parameters of the consensus mechanism. The structure of DQN consists of an evaluation network and a target network, as shown in Figure 2. The evaluation network gives the value of the action

a_{t}^{B}

under the state

s_{t}^{B}

:

a_{t}^{B} = \underset{a_{t}^{B}}{argmax} Q (s_{t}^{B}, a_{t}^{B}; χ)

(39)

χ represents the weight of the evaluation network. After the agent executes the action, a reward value

r_{t}^{B}

is returned, and the system transitions to the state in the next slot. In each slot, <

s_{t}^{B}

,

a_{t}^{B}

,

r_{t}^{B}

,

s_{t + 1}^{B}

> stores in the experience replay buffer, then elements in <

s_{t}^{B}

,

a_{t}^{B}

,

r_{t}^{B}

,

s_{t + 1}^{B}

> randomly samples from the experience replay buffer, and the Q value generates through the target network, that is as follows:

y_{t} = r_{t}^{B} + \max_{a_{t + 1}^{B}} Q (s_{t + 1}^{B}, a_{t + 1}^{B}; χ^{-})

(40)

where χ⁻ belongs to the weight of the target network. In the training phase, the update of the evaluation network is achieved by minimizing the loss function, which is formulated as below:

L (χ) = {(y_{t} - Q (s_{t}^{B}, a_{t}^{B}; χ))}^{2}

(41)

Unlike DQN, DDPG is an actor–critic DRL algorithm designed for continuous action decision problems, e.g., offloading ratio and privacy budget. It adopts two evaluation networks and two target networks, with each pair composed of an actor network and a critic network. Let μ, μ⁻ be the weights of the evaluation actor network and target actor network, and κ, κ⁻ be those of the evaluation critic network and target critic network, respectively. Then, the evaluation actor network generates corresponding actions by changing relevant parameters, which are as follows:

a_{t}^{L} = π (s_{t}^{L}; μ)

. Subsequently, the evaluation critic network evaluates the generated actions. The critic network approximates the Q-value function by minimizing the loss function, which can be formulated as follows:

L (κ) = \frac{1}{T} \sum_{t = 1}^{T} {(y_{t} - Q (s_{t}^{L}, a_{t}^{L}; κ))}^{2}

(42)

In Equation (42), y_t is the approximate value function of the target network, given by (43):

y_{t} = r_{t}^{L} + γ Q (s_{t + 1}^{L}, π (s_{t + 1}^{L}; μ^{-}); κ^{-})

(43)

In the above formula, γ represents the discount factor, and

π (s_{t + 1}^{L}; μ^{-})

is the action selected by the target actor network under the state

s_{t + 1}^{L}

. The actor network uses the critic network to generate a deterministic policy. Therefore, to minimize the value function, the weight μ of the evaluation actor network is updated by using the gradient descent method, that is as follows:

\nabla_{μ} J \approx \frac{1}{T} \sum_{t = 0}^{T} \nabla_{a_{t}^{L}} Q (s_{t}^{L}, a_{t}^{L}; κ) \nabla_{μ} π (s_{t}^{L}; μ)

(44)

Under the structure of the two-layer DRL, the agents of DQN and DDPG cooperate with each other to complete the system’s task offloading, and the algorithm is shown in Algorithm 2. Since there are U user vehicles and the sizes of the hidden layers in both the action network and the critic network are fixed, the complexity of action generation can be expressed as O(U). During the training process, the complexity per step is also O(U). If the number of training steps within a time interval is N_t, then the time complexity of the algorithm within one time interval is O(N_t × U).

Algorithm 2. Two-layer DRL-based DBPVEC offloading strategy algorithm
Input: Upper-layer state $s_{t}^{B};$ Lower-layer state $s_{t}^{L};$ Episode count N_k; Step count N_t.
Output: Optimized policy for task offloading φ_RSUi; Privacy budget ε; Block size D_Bv; Number of PV consensus nodes B.
1:	Initialize DQN and DDPG networks parameters
2:	for k = 1:N_k do
3:	Reset environment and obtain initial states $s_{0}^{B},$ $s_{0}^{L} .$
4:	for t = 1:N_t do
5:	Select a random number ρ
6:	If ρ < ζ
7:	Choose $a_{t}^{B}$ by Formula (35)
8:	else
9:	Choose a random action
10:	endif
11:	Take action in the environment, observe the next state st and the reward $r_{t}^{B},$ $r_{t}^{L}$
12:	Put the experience <s_t, a_t, $r_{t}^{B},$ $r_{t}^{L},$ s_t+₁> into replay buffer
13:	Select a batch sample from upper-layer replay buffer randomly
14:	Calculate target Q-values by Formula (36)
15:	Update χ by minimizing loss L(χ)
16:	χ⁻ ← τχ + (1−τ)χ⁻
17:	Sample random batch from lower-layer replay buffer
18:	Compute target actions and target Q-values
19:	Update κ by minimizing loss function L(κ)
20:	Compute policy gradient by Formula (40)
21:	Update μ using gradient ascent
22:	μ⁻ ← τμ + (1−τ)μ⁻
23:	κ⁻ ← τκ + (1−τ)κ⁻
24:	end for
25:	end for

7. Experimental Results and Analysis

To evaluate the performance of the proposed scheme, we conduct simulations on a desktop equipped with an NVIDIA GeForce RTX 5050 GPU, 32 GB of RAM, and an Intel i9-14900 CPU. MATLAB R2024b is used for simulation experiments. In the simulation, the PV parking duration data are from ACT Government Open Data Portal DataACT [38]. The simulation parameters are shown in Table 3.

In the simulation, for the upper-layer DQN, we set two fully connected hidden layers with 128 and 64 neurons, respectively. The learning rate is set to 10⁻⁴, the discount factor to 0.99, the mini-batch size to 128, and the replay buffer size to 10⁶. For the lower-layer DDPG, both the actor network and the critic network contain two fully connected hidden layers, with 128 and 200 neurons, respectively. The learning rates are both set to 10⁻⁴, the discount factor to 0.99, the mini-batch size to 128, and the replay buffer size to 10⁵. Additionally, we set 8000 training episodes, each consisting of 100 steps. The total training time is 52 h and 29 min (with 100 user vehicles). The resulting trained model size is less than 1 MB, making it suitable for deployment in resource-constrained devices. Moreover, through testing, the average inference time per decision is 82.43 ms.

We studied the impact of different offloading proportions on system performance, as shown in Figure 3. As shown in the figure, the system comprehensive cost, delay, and energy consumption vary with the number of users, respectively. Among them, the comprehensive cost is calculated by the delay, energy consumption and D_KL. On one hand, with growth in the number of users, the volume of computation tasks increases, and thus the system comprehensive cost, delay, and energy consumption rise accordingly. On the other hand, a higher task offloading proportion to RSUs results in a decreasing trend in the comprehensive cost, delay, and energy consumption. This is attributed to the wireless communication provided by the PV network during task offloading, which raises the communication cost and accordingly increases the delay and energy consumption. Since the number of RSUs is limited, the computing and communication resources of RSUs become inadequate when the task load reaches a certain threshold, leading to a slight increase in the comprehensive cost, delay, and energy consumption.

Subsequently, the variation in system performance with the privacy budget under different offloading ratios is discussed in Figure 4. As mentioned above, when the offloading proportion increases, the comprehensive cost, delay and energy consumption all decrease. However, when the privacy budget continues to increase, the values change little, indicating that introducing location differential privacy to protect location information has almost no impact on the system performance.

Then, to further discuss the impact of differential privacy on the system, we compare our scheme with those without a differential privacy mechanism (non-DP), as illustrated in Figure 5. As shown in the figure, in contrast to schemes without differential privacy, our approach achieves the lowest levels in terms of cost, energy consumption, and delay, indicating that a well-designed differential privacy mechanism can mitigate its impact on system performance.

Next, the execution of the consensus mechanism generates additional energy consumption and delay, thereby affecting the system performance. Figure 6 shows the impact of different consensus mechanisms on the system’s comprehensive cost. The figure shows that the Hotstuff consensus mechanism used in this paper has the smallest impact on the system’s comprehensive cost compared with BLS_PBFT and PBFT. The reason is that the Hotstuff consensus mechanism utilizes one-to-many or many-to-one transmission, instead of many-to-many transmission like BLS_PBFT and PBFT during the consensus process. Therefore, wireless communication interference is greatly reduced, which in turn decreases the communication delay and energy consumption.

Next, we analyze how the number of PV consensus nodes affects the delay and energy consumption, as presented in Figure 7. As the number of PV consensus nodes increases, both the communication cost in consensus and the number of signatures and MACs to be verified increase, which raises the delay and energy consumption of the consensus mechanism and thus increases the overall system delay and energy consumption.

Furthermore, the proposed scheme is compared with the existing works, as shown in Figure 8. In the figure, Teng et al. [20] adopts a PVEC architecture but employs a single-chain PBFT consensus algorithm deployed on RSUs. Shi et al. [4] uses a VEC structure with a single-chain PBFT to construct the blockchain. It can be observed that, in contrast to these two schemes, our approach achieves the lowest consumption. Specifically, the average costs of the three schemes are 15.27, 13.17, and 4.84, respectively; the average energy consumption values are 6.21 J, 6.67 J, and 5.59 J; and the average delays are 24.07 ms, 20.13 ms, and 16.50 ms. Accordingly, our scheme achieves reductions of 68.31% and 63.25% in cost, 9.96% and 16.27% in energy consumption, and 31.46% and 18.07% in delay, respectively, demonstrating that the proposed architecture effectively enhances system performance.

Finally, the proposed two-layer DRL is compared with other optimization approaches, as shown in Figure 9. Figure 9a compares the proposed two-layer DRL with a single-layer baseline. The two-layer DRL achieves a significantly lower system comprehensive cost. This improvement is attributed to its architecture, which processes discrete and continuous actions through separate layers, thereby enhancing the reward optimization process. In Figure 9b, different traditional optimization methods, i.e., Simulated Annealing (SA), Genetic Algorithm (GA), and Particle Swarm Optimization (PSO), are compared with our scheme. As illustrated in the figure, the proposed two-layer DRL achieves the lowest system comprehensive cost among all the compared methods. The results demonstrate the superiority of the proposed approach over traditional optimization methods in handling the complex trade-offs among energy consumption, delay, and privacy preservation.

8. Conclusions

We propose a DBPVEC offloading framework to address the problem that location information is prone to leakage during the consensus and user offloading processes in BPVEC. To achieve this, we design a location differential privacy mechanism based on the Laplace differential privacy mechanism and prove that this mechanism satisfies the ε-differential privacy. Meanwhile, the designed location differential privacy mechanism is used to perturb the locations of users and PVs, which is combined with the Hotstuff consensus algorithm. On this basis, an optimization problem is established with the system’s energy consumption, delay and privacy strength as the optimization objectives, and a two-layer DRL algorithm is designed to solve the optimization problem. The experimental results show that the DBPVEC task offloading scheme ensures that the location privacy of users and PV consensus nodes is not leaked while guaranteeing energy consumption and delay.

Author Contributions

Conceptualization, G.L. and F.Z.; methodology, G.L. and C.L.; software, G.L. and Z.S.; validation, M.C., C.L. and F.Z.; data curation, Z.S. and M.C.; supervision, F.Z. and C.L.; writing—original draft preparation, G.L. and Z.S.; writing—review and editing, M.C., C.L. and F.Z.; All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 62362013, and was funded by the Guangxi Science and Technology Plan Project, grant number Guike AB25069120.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Huang, X.; Yu, R.; Liu, J.; Shu, L. Parked Vehicle Edge Computing: Exploiting Opportunistic Resources for Distributed Mobile Applications. IEEE Access 2018, 6, 66649–66663. [Google Scholar] [CrossRef]
Zhang, J.; Huang, X.; Yu, R. Optimal Task Assignment with Delay Constraint for Parked Vehicle Assisted Edge Computing: A Stackelberg Game Approach. IEEE Commun. Lett. 2020, 24, 598–602. [Google Scholar] [CrossRef]
Zhai, X.; Pang, S.; Wang, N.; Gui, H.; He, X. Optimal Strategy for Task Offloading and Resource Pricing Behavior Based on Blockchain. IEEE Trans. Consum. Electron. 2025, 71, 7290–7303. [Google Scholar] [CrossRef]
Shi, J.; Du, J.; Shen, Y.; Wang, J.; Yuan, J.; Han, Z. DRL-Based V2V Computation Offloading for Blockchain-Enabled Vehicular Networks. IEEE Trans. Mob. Comput. 2023, 22, 3882–3897. [Google Scholar] [CrossRef]
Devarajan, G.G.; S, T.; Alenazi, M.J.F.; U, K.; Chandran, G.; Bashir, A.K. Federated Learning and Blockchain-Enabled Framework for Traffic Rerouting and Task Offloading in the Internet of Vehicles (IoV). IEEE Trans. Consum. Electron. 2025, 71, 3817–3825. [Google Scholar] [CrossRef]
Huang, X.; Huang, T.; Zhao, S.; Xiang, W.; Zhang, W.; Zhang, G. Joint Optimization of Task Partial Offloading and Resource Allocation in a Dual-Blockchain-Enabled MEC System with Parallelism Constraints. IEEE Trans. Commun. 2025, 73, 13725–13740. [Google Scholar] [CrossRef]
Luo, B.; Li, X.; Weng, J.; Guo, J.; Ma, J. Blockchain Enabled Trust-Based Location Privacy Protection Scheme in VANET. IEEE Trans. Veh. Technol. 2020, 69, 2034–2048. [Google Scholar] [CrossRef]
Xu, C.; Wu, H.; Liu, H.; Gu, W.; Li, Y.; Cao, D. Blockchain-Oriented Privacy Protection of Sensitive Data in the Internet of Vehicles. IEEE Trans. Intell. Veh. 2023, 8, 1057–1067. [Google Scholar] [CrossRef]
Lv, P.; Xie, L.; Xu, J.; Wu, X.; Li, T. Misbehavior Detection in Vehicular Ad Hoc Networks Based on Privacy-Preserving Federated Learning and Blockchain. IEEE Trans. Netw. Serv. Manag. 2022, 19, 3936–3948. [Google Scholar] [CrossRef]
Liu, H.; Zhang, P.; Pu, G.; Yang, T.; Maharjan, S.; Zhang, Y. Blockchain Empowered Cooperative Authentication With Data Traceability in Vehicular Edge Computing. IEEE Trans. Veh. Technol. 2020, 69, 4221–4232. [Google Scholar] [CrossRef]
Wang, S.; Yang, D.; Sheng, H.; Shen, J.; Zhang, Y.; Ke, W. A Blockchain-Enabled Distributed System for Trustworthy and Collaborative Intelligent Vehicle Re-Identification. IEEE Trans. Intell. Veh. 2024, 9, 3271–3282. [Google Scholar] [CrossRef]
Meng, X.; Liu, B.; Meng, X.; Liang, Y.; Deng, H. A Lightweight Group Authentication Protocol for Blockchain-Based Vehicular Edge Computing Networks. IEEE Trans. Intell. Transp. Syst. 2024, 25, 8556–8567. [Google Scholar] [CrossRef]
Ren, Y.; Chen, X.; Guo, S.; Guo, S.; Xiong, A. Blockchain-Based VEC Network Trust Management: A DRL Algorithm for Vehicular Service Offloading and Migration. IEEE Trans. Veh. Technol. 2021, 70, 8148–8160. [Google Scholar] [CrossRef]
Fardad, M.; Muntean, G.; Tal, I. A Blockchain-Enabled Vehicular Edge Computing Framework for Secure Performance-Oriented V2X Service Delivery. IEEE Trans. Veh. Technol. 2024, 73, 13853–13867. [Google Scholar] [CrossRef]
Wang, S.; Ye, D.; Huang, X.; Yu, R.; Wang, Y.; Zhang, Y. Consortium Blockchain for Secure Resource Sharing in Vehicular Edge Computing: A Contract-Based Approach. IEEE Trans. Netw. Sci. Eng. 2021, 8, 1189–1201. [Google Scholar] [CrossRef]
Liang, F. Decentralized and Network-Aware Task Offloading for Smart Transportation via Blockchain. Sensors 2025, 25, 5555. [Google Scholar] [CrossRef] [PubMed]
Li, J.; Liu, R.; Liu, W. FBMTO: Fusion of Blockchain and Multi-Agent Reinforcement Learning for Task Offloading in edge computing. Inf. Fusion 2025, 124, 103344. [Google Scholar] [CrossRef]
Xu, C.; Zhang, P.; Xia, X.; Kong, L.; Zeng, P.; Yu, H. Digital-Twin-Assisted Intelligent Secure Task Offloading and Caching in Blockchain-Based Vehicular Edge Computing Networks. IEEE Internet Things J. 2025, 12, 4128–4143. [Google Scholar] [CrossRef]
Huang, X.; Ye, D.; Yu, R.; Shu, L. Securing parked vehicle assisted fog computing with blockchain and optimal smart contract design. IEEE/CAA J. Autom. Sin. 2020, 7, 426–441. [Google Scholar] [CrossRef]
Teng, Y.; Cao, Y.; Liu, M.; Yu, F.R.; Leung, V.C.M. Efficient Blockchain-Enabled Large Scale Parked Vehicular Computing With Green Energy Supply. IEEE Trans. Veh. Technol. 2021, 70, 9423–9436. [Google Scholar] [CrossRef]
Kong, M.; Zhao, J.; Sun, X.; Nie, Y. Secure and efficient computing resource management in blockchain-based vehicular fog computing. China Commun. 2021, 18, 115–125. [Google Scholar] [CrossRef]
Qashlan, A.; Nanda, P.; He, X.; Mohanty, M. Privacy-Preserving Mechanism in Smart Home Using Blockchain. IEEE Access 2021, 9, 103651–103669. [Google Scholar] [CrossRef]
Mu, Y.; Hao, B.; He, M. Privacy-preserving cooperative hierarchical caching approach based on federated deep reinforcement learning for vehicular edge computing. J. Supercomput. 2025, 81, 532. [Google Scholar] [CrossRef]
Tchaye-Kondi, J.; Zhai, Y.; Shen, J.; Zhu, L. Privacy-Preserving Offloading in Edge Intelligence Systems With Inductive Learning and Local Differential Privacy. IEEE Trans. Netw. Serv. Manag. 2023, 20, 5026–5037. [Google Scholar] [CrossRef]
Wei, D.; Xi, N.; Ma, X.; Shojafar, M.; Kumari, S.; Ma, J. Personalized Privacy-Aware Task Offloading for Edge-Cloud-Assisted Industrial Internet of Things in Automated Manufacturing. IEEE Trans. Ind. Inform. 2022, 18, 7935–7945. [Google Scholar] [CrossRef]
Wang, S.; Li, J.; Wu, G.; Chen, H.; Sun, S. Joint Optimization of Task Offloading and Resource Allocation Based on Differential Privacy in Vehicular Edge Computing. IEEE Trans. Comput. Soc. Syst. 2022, 9, 109–119. [Google Scholar] [CrossRef]
Zhang, P.; Gan, P.; Chang, L.; Wen, W.; Selvi, M.; Kibalya, G. DPRL: Task Offloading Strategy Based on Differential Privacy and Reinforcement Learning in Edge Computing. IEEE Access 2022, 10, 54002–54011. [Google Scholar] [CrossRef]
Zhang, G.; Zhang, S.; Man, Z.; Cui, C.; Hu, W. Location Privacy Protection in Edge Computing: Co-Design of Differential Privacy and Offloading Mode. Electronics 2024, 13, 2668. [Google Scholar] [CrossRef]
Liu, Z.; Wang, J.; Gao, Z.; Wei, J. Privacy-preserving edge computing offloading scheme based on whale optimization algorithm. J. Supercomput. 2023, 79, 3005–3023. [Google Scholar] [CrossRef]
Wang, Z.; Sun, Y.; Liu, D.; Hu, J.; Pang, X.; Hu, Y.; Ren, K. Location Privacy-Aware Task Offloading in Mobile Edge Computing. IEEE. Trans. Mob. Comput. 2024, 23, 2269–2283. [Google Scholar] [CrossRef]
Zhang, G.; Du, J.; Yuan, X.; Zhang, K. Differential Privacy-Based Location Privacy Protection for Edge Computing Networks. Electronics 2024, 13, 3510. [Google Scholar] [CrossRef]
Min, M.; Liu, Z.; Duan, J.; Zhang, P.; Li, S. Safe-Learning-Based Location-Privacy-Preserved Task Offloading in Mobile Edge Computing. Electronics 2024, 13, 89. [Google Scholar] [CrossRef]
Dwork, C. Differential Privacy. In Proceedings of the 33rd International Colloquium on Automata, Languages and Programming, Venice, Italy, 10–14 July 2006; Bugliesi, M., Prennel, B., Sassone, V., Wegener, I., Eds.; Springer: Berlin/Heidelberg, Germany, 2006; Volume 4052, pp. 1–12. [Google Scholar]
Dwork, C.; Roth, A. The Algorithmic Foundations of Differential Privacy. Found. Trends^® Theor. Comput. Sci. 2014, 9, 211–487. [Google Scholar] [CrossRef]
Ma, X.; Zhao, J.; Wang, Y.; Zhang, T.; Li, Z. A New Approach to SINR-Based Reliability Analysis of IEEE 802.11 Broadcast Ad Hoc Networks. IEEE Commun. Lett. 2021, 25, 651–655. [Google Scholar] [CrossRef]
Ma, X.; Trivedi, K.S. SINR-Based Analysis of IEEE 802.11p/bd Broadcast VANETs for Safety Services. IEEE Trans. Netw. Serv. Manag. 2021, 18, 2672–2686. [Google Scholar] [CrossRef]
van Erven, T.; Harremoes, P. Rényi Divergence and Kullback-Leibler Divergence. IEEE Trans. Inf. Theory 2014, 60, 3797–3820. [Google Scholar] [CrossRef]
Australian Government. Smart Parking Stays Dataset. Available online: https://data.gov.au/data/dataset/3vsj-zpk7 (accessed on 1 January 2023).
Lang, P.; Tian, D.; Duan, X.; Zhou, J.; Sheng, Z.; Leung, V.C.M. Blockchain-Based Cooperative Computation Offloading and Secure Handover in Vehicular Edge Computing Networks. IEEE Trans. Intell. Veh. 2023, 8, 3839–3853. [Google Scholar] [CrossRef]
Feng, J.; Yu, F.R.; Pei, Q.; Du, J.; Zhu, L. Joint Optimization of Radio and Computational Resources Allocation in Blockchain-Enabled Mobile Edge Computing Systems. IEEE Trans. Wirel. Commun. 2020, 19, 4321–4334. [Google Scholar] [CrossRef]

Figure 1. The DBPVEC task offloading framework.

Figure 2. Two-layer DRL model.

Figure 3. The influence of different offloading ratios on system performance: (a) costs vary with the number of users and offloading ratios; (b) delays vary with the number of users and offloading ratios; and (c) energy varies with the number of users and offloading ratios.

Figure 4. The variation in system performance with privacy budget under different offloading ratios: (a) cost variation with the privacy budget and offloading ratios; (b) delay variation with the privacy budget and offloading ratios; and (c) energy variation with the privacy budget and offloading ratios.

Figure 5. Compared with non-DP: (a) cost variation with users; (b) delay variation with users; and (c) energy variation with users.

Figure 6. Compare with different consensus schemes.

Figure 7. The influence of different numbers of PVs consensus nodes on latency and energy consumption: (a) the impact on system latency and (b) the impact on system energy.

Figure 8. Compared with existing work: Shi et al. [4] and Teng et al. [20]: (a) cost variation with users; (b) delay variation with users; and (c) energy variation with users.

Figure 9. The influence of optimization approaches on the comprehensive cost of the system: (a) compared with single-layer DRL methods and (b) compared with traditional optimization methods.

Table 1. Comparative summary.

Study	Blockchain	Privacy Mechanism	Optimization Technique	Limitations
[10,11,12]	Consortium Blockchain	None	None	Only focuses on identity authentication
[13]	Consortium Blockchain	None	None	Complex network architecture
[14]	DPoS	None	DRL	Lacks incentive mechanism
[15]	BFT-based PoS consensus, Smart Contract	None	Contract-Based Approach	Lacks joint optimization of system performance
[16,17]	PoS consensus, DPoS	None	DQN	Ignores the impact of consensus
[4,18]	Improved PBFT	None	DRL	Ignores information leakage during consensus
[19,20,21]	Smart Contract, PBFT, PoW	None	Game Theory, DRL	Ignores sensitive information privacy leakage
[22,23,24]	None	Differential Privacy	None	Only protects data content
[25]	None	Differential Privacy (Deep Learning Parameters)	DRL	Treats privacy as an external constraint
[26]	None	Local Differential Privacy (Histogram-Driven)	K-NJTA	Treats privacy as an external constraint
[27,28,29]	None	Differential Privacy (Geographic Location Perturbation)	WOPP, RL	Only focuses on location privacy in single-edge server scenarios
[30,31,32]	None	Differential Privacy	DRL	Does not directly describe user location

Table 2. Explanation of notations.

Notations	Explanation
$C_{p k}^{e x e}, C_{r j}^{e x e}$	CPU cycles per unit task size required for RSU and PV computing
d₀	Wireless far-field reference distance
$d_{i, j}^{d}, d_{i, k}^{d}$	Distance from perturbed PV to the receiving vehicle and the interfering vehicle
D_qi	Task size for useri
D_Bv	Block size
D_KL	Kullback–Leibler (KL) divergence of differential privacy
$E_{t o t a l}^{B}$	Consensus energy consumption
E_pai, E_RSUi	RSU and PV energy consumption for computational tasks
E_total	Total energy consumption of the system
f_pk, f_rj	Computational capacity of PVk and RSUj
N₀	Noise power of the wireless channel
P_t	Wireless transmission power
R_i,j	Communication transmission rate after location perturbation
T_total	Total delay of the system
T_pai, T_RSUi	PV and RSU delay for processing computational tasks
W_b	Wireless communication bandwidth
α	Path loss factor
σ	CPU cycles required for generating or verifying signatures
φ_RSUi	Proportion of task offloading to RSU by useri
η	Transceiver decision factor
θ	CPU cycles required for generating or verifying the Message Authentication Code (MAC)
κ_v, κ_r	Capacitive switching coefficients for vehicles and RSUs
ϖ	Average transaction size of blockchain

Table 3. Main simulation parameters.

Parameters	Value	Parameters	Value
$C_{p k}^{e x e}, C_{r j}^{e x e}$	24 cycles/bit [39]	d₀	100 m [35]
ϖ	1 KB	W_b	15 MB
D_qi	10–30 MB	θ	10 × 10⁶ cycles [39]
D_Bv	4 MB	σ	1 × 10⁶ cycles [39]
f_pk	1–2.5 GHz	α	2 [35]
f_rj	4–6 GHz	η	1.63726 × 10⁻⁹ [35]
N₀	1.2589 × 10 W [35]	κ_v	10⁻²⁷ [40]
P_t	0.28183815 W [35]	κ_r	10⁻²⁸

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liang, G.; Su, Z.; Li, C.; Chen, M.; Zhao, F. Task Offloading of Parked Vehicles Edge Computing Based on Differential Privacy Hotstuff. Information 2026, 17, 339. https://doi.org/10.3390/info17040339

AMA Style

Liang G, Su Z, Li C, Chen M, Zhao F. Task Offloading of Parked Vehicles Edge Computing Based on Differential Privacy Hotstuff. Information. 2026; 17(4):339. https://doi.org/10.3390/info17040339

Chicago/Turabian Style

Liang, Guoling, Zhaoyu Su, Chunhai Li, Mingfeng Chen, and Feng Zhao. 2026. "Task Offloading of Parked Vehicles Edge Computing Based on Differential Privacy Hotstuff" Information 17, no. 4: 339. https://doi.org/10.3390/info17040339

APA Style

Liang, G., Su, Z., Li, C., Chen, M., & Zhao, F. (2026). Task Offloading of Parked Vehicles Edge Computing Based on Differential Privacy Hotstuff. Information, 17(4), 339. https://doi.org/10.3390/info17040339

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Task Offloading of Parked Vehicles Edge Computing Based on Differential Privacy Hotstuff

Abstract

1. Introduction

2. Related Work

2.1. Blockchain for Task Offloading

2.2. Differential Privacy for Task Offloading

3. Differential Privacy Mechanism

4. System Model

4.1. DBPVEC Offloading System Framework

4.2. Network Model

4.3. Location Differential Privacy Model

4.4. Task Offloading Model

4.5. Blockchain Model

5. Optimization Problem Form

6. DBPVEC Task Offloading Strategy

6.1. Location Differential Privacy Hotstuff Algorithm

6.2. Two-Tier DRL Task Offloading Algorithm

6.2.1. State Space

6.2.2. Action Space

6.2.3. Reward Function

6.2.4. Algorithm Design

7. Experimental Results and Analysis

8. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI