Intelligent Optimization Methods for Cloud–Edge Collaborative Vehicular Networks via the Integration of Bayesian Decision-Making and Reinforcement Learning
Abstract
1. Introduction
- 1.
- A three-layer cloud–edge–end collaborative architecture is established for intelligent transportation scenarios, together with the corresponding communication model, edge-caching model, delay model, and energy-consumption model. These models provide the system foundation for privacy-aware and service-oriented task-offloading optimization.
- 2.
- A fragment-level privacy-aware offloading framework is proposed by combining Bayesian privacy-level classification, differential-privacy-based perturbation for highly sensitive fragments, and a privacy-entropy metric for characterizing the dispersion of private fragments across collaborative nodes. On this basis, a lightweight multi-agent deep reinforcement learning strategy, namely CTMA-AC, is developed to optimize privacy-entropy-aware offloading decisions and reduce the risk of privacy leakage.
- 3.
- The effectiveness of the proposed scheme is validated through simulation experiments by comparing it with SAC, DQN, full-local, and random offloading baselines. The experimental results show that the proposed method achieves a better tradeoff among latency, energy consumption, and privacy-related performance in the considered cloud–edge–end collaborative intelligent transportation scenario.
2. Related Work
2.1. Existing Work on Privacy-Aware Task Offloading
2.2. Discussion of the Differences from Existing Studies
3. System Modeling
3.1. Three-Tier Communication Architecture for Cloud-Edge-End Collaboration
- 1.
- Vehicle-to-Vehicle communication (V2V): communication between intelligent connected vehicles.
- 2.
- Vehicle-to-Infrastructure communication (V2I): communication between vehicles and roadside infrastructure such as radar units and edge servers.
- 3.
- Wired communication: communication between edge servers.
- 4.
- Wireless communication: communication between vehicles and the central cloud server.
- 1.
- The central cloud server is treated as a trusted coordination entity responsible for global scheduling and resource management, and it is not considered an adversarial party in this work.
- 2.
- Edge servers are assumed to correctly provide computation and caching services, but they may attempt to infer user privacy from the data fragments they receive and store. Therefore, edge servers are modeled as honest-but-curious entities.
- 3.
- Multiple edge servers may jointly analyze the private fragments stored at different nodes, together with user offloading preferences and routing behavior, in order to reconstruct sensitive user information. This collusion threat is one of the main motivations for introducing the privacy entropy metric.
- 4.
- In the considered threat model, the attacker may observe stored data fragments, fragment distribution across edge servers, and offloading-related traffic patterns. However, model-parameter leakage and direct compromise of the cloud server are not considered as primary attack surfaces in this paper.
- 5.
- Vehicular terminals are regarded as normal service participants rather than malicious adversaries. Their role in this work is to generate tasks and offload data fragments according to the privacy-aware decision policy.
- 6.
- The privacy entropy defined in this paper is used to characterize the dispersion of privacy-sensitive fragments across collaborative nodes. A larger privacy entropy indicates that sensitive fragments are distributed more evenly, which reduces the risk that a single edge server or a small colluding set of edge servers can reconstruct complete private information. However, privacy entropy is used here as a heuristic indicator of resistance to aggregation-based inference attacks, rather than a strict closed-form success probability of a specific reconstruction attack.
3.2. Privacy-Level Classification of Data Fragments
3.3. Communication Model
3.4. Edge Caching Model
3.5. Delay and Energy Mode
3.6. Privacy Data Fragment Protection Model
3.7. Multi-Objective Optimization Problem Mode
4. Deep Reinforcement Learning Algorithm Design
4.1. CTMA-AC Algorithmic
| Algorithm 1 CTMA-AC algorithmic |
|
4.2. Algorithm Complexity Analysis
5. Simulation Experiment and Analysis
- 1.
- Soft actor–critic strategy (SAC) [29]: The core idea is to improve the learning efficiency and stability of the strategy and maximize the entropy of the strategy while optimizing the cumulative rewards to maintain exploratory behavior during the training process. The SAC strategy in intelligent transportation scenarios allows tasks to be offloaded to the central cloud servers, edge servers, and other service entities.
- 2.
- Full-Local offloading strategy (Full-Local) [30]: The execution of tasks is completely local, the computation process does not depend on cloud servers or edge servers, and good security performance is achieved because of the fast mobility of vehicles.
- 3.
- Deep Q-network offloading strategy (DQN) [31]: Combining Q-learning and deep neural networks can lead to efficient learning in high-dimensional state space, and the DQN offloading strategy in intelligent transportation scenarios can offload tasks to service entities such as central cloud servers and edge servers.
- 4.
- Random offloading policy (Random offloading) [30]: Offloading tasks to other service units, such as onboard, central cloud servers, and near-edge servers, via random selection.
5.1. Experimental Setup
5.2. Analysis of Experimental Results
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Zeng, J.; Gou, F.; Wu, J. Task offloading scheme combining deep reinforcement learning and convolutional neural networks for vehicle trajectory prediction in intelligent cities. Comput. Commun. 2023, 208, 29–43. [Google Scholar] [CrossRef]
- Llorens-Carrodeguas, A.; Cervelló-Pastor, C.; Valera, F. DQN-based intelligent controller for multiple edge domains. J. Netw. Comput. Appl. 2023, 218, 103705. [Google Scholar] [CrossRef]
- Njoku, J.N.; Nwakanma, C.I.; Amaizu, G.C.; Kim, D.S. Prospects and challenges of Metaverse application in data-driven intelligent transportation systems. IET Intell. Transp. Syst. 2023, 17, 1–21. [Google Scholar] [CrossRef]
- Antevski, K.; Bernardos, C.J. Applying Blockchain consensus mechanisms to Network Service Federation: Analysis and performance evaluation. Comput. Netw. 2023, 234, 109913. [Google Scholar] [CrossRef]
- Errounda, F.Z.; Liu, Y. Adaptive differential privacy in vertical federated learning for mobility forecasting. Future Gener. Comput. Syst. 2023, 149, 531–546. [Google Scholar] [CrossRef]
- Shen, X.; Luo, X.; Wang, B.; Chen, Y.; Tang, D.; Gao, L. Privacy-preserving multiparty deep learning based on homomorphic proxy re-encryption. J. Syst. Archit. 2023, 144, 102983. [Google Scholar] [CrossRef]
- Dai, X. Task Offloading for Cloud-Assisted Fog Computing With Dynamic Service Caching in Enterprise Management Systems. IEEE Trans. Ind. Inform. 2023, 19, 662–672. [Google Scholar] [CrossRef]
- Zhu, S.; Song, Z.; Huang, C.; Qiao, R.; Zhu, H. Cloud-edge-end collaborative caching and UAV-assisted offloading decision based on the fusion of deep reinforcement learning algorithms. Artif. Intell. Rev. 2025, 58, 408. [Google Scholar] [CrossRef]
- Heo, G.; Doh, I. Blockchain and differential privacy-based data processing system for data security and privacy in urban computing. Comput. Commun. 2024, 222, 161–176. [Google Scholar] [CrossRef]
- Zhang, H.; Cao, L.; Kumar, N.; Zhang, J.; Zhang, P.; Wang, J. An improved DDPG-based privacy sensitive level protection computation offloading method in mobile edge computing. Future Gener. Comput. Syst. 2024, 159, 522–532. [Google Scholar] [CrossRef]
- Wang, S.; Li, J.; Wu, G.; Chen, H.; Sun, S. Joint Optimization of Task Offloading and Resource Allocation Based on Differential Privacy in Vehicular Edge Computing. IEEE Trans. Comput. Soc. Syst. 2022, 9, 109–119. [Google Scholar] [CrossRef]
- Mahmood, A.; Hong, Y.; Ehsan, M.K.; Mumtaz, S. Optimal Resource Allocation and Task Segmentation in IoT Enabled Mobile Edge Cloud. IEEE Trans. Veh. Technol. 2021, 70, 13294–13303. [Google Scholar] [CrossRef]
- Jebreel, N.M.; Domingo-Ferrer, J.; Blanco-Justicia, A.; Sánchez, D. Enhanced Security and Privacy via Fragmented Federated Learning. IEEE Trans. Neural Netw. Learn. Syst. 2024, 35, 6703–6717. [Google Scholar] [CrossRef]
- Samy, A.; Elgendy, I.A.; Yu, H. Secure Task Offloading in Blockchain-Enabled Mobile Edge Computing With Deep Reinforcement Learning. IEEE Trans. Netw. Serv. Manag. 2022, 19, 4872–4887. [Google Scholar] [CrossRef]
- Bai, F.; Shen, T.; Yu, Z. Trustworthy Blockchain-Empowered Collaborative Edge Computing-as-a-Service Scheduling and Data Sharing in the IIoE. IEEE Internet Things J. 2022, 9, 14752–14766. [Google Scholar] [CrossRef]
- Wu, G.; Chen, X.; Gao, Z.; Zhang, H.; Yu, S.; Shen, S. Privacy-preserving offloading scheme in multiaccess mobile edge computing based on MADRL. J. Parallel Distrib. Comput. 2024, 183, 104775. [Google Scholar] [CrossRef]
- Yang, M.; Tjuawinata, I.; Lam, K.Y. K-Means Clustering With Local Privacy for Privacy-Preserving Data Analysis. IEEE Trans. Inf. Forensics Secur. 2022, 17, 2524–2537. [Google Scholar] [CrossRef]
- Ye, D.; Shen, S.; Zhu, T. One Parameter Defense-Defending Against Data Inference Attacks via Differential Privacy. IEEE Trans. Inf. Forensics Secur. 2022, 17, 1466–1480. [Google Scholar] [CrossRef]
- Chen, X.; Hu, X.; Li, Y.; Tang, Q. Optimization of Privacy Budget Allocation In Differential Privacy-Based Public Transit Trajectory Data Publishing for Intelligent Mobility Applications. IEEE Trans. Intell. Transp. Syst. 2023, 24, 15158–15168. [Google Scholar] [CrossRef]
- Hu, J. Shield Against Gradient Leakage Attacks: Adaptive Privacy-Preserving Federated Learning. IEEE/ACM Trans. Netw. 2024, 32, 1407–1422. [Google Scholar] [CrossRef]
- Zhang, G.; Liu, B.; Zhu, T.; Ding, M.; Zhou, W. PPFed: A Privacy-Preserving and Personalized Federated Learning Framework. IEEE Internet Things J. 2024, 11, 19380–19393. [Google Scholar] [CrossRef]
- Zhou, W.; Zhu, T.; Ye, D.; Ren, W.; Choo, K.K. A Concurrent Federated Reinforcement Learning for IoT Resources Allocation With Local Differential Privacy. IEEE Internet Things J. 2024, 11, 6537–6550. [Google Scholar] [CrossRef]
- Rezaeibagha, F.; Mu, Y.; Huang, K. Authenticable Additive Homomorphic Scheme and its Application for MEC-Based IoT. IEEE Trans. Serv. Comput. 2023, 16, 1664–1672. [Google Scholar] [CrossRef]
- Gao, W.; Yu, W.; Liang, F.; Hatcher, W.G.; Lu, C. Privacy-Preserving Auction for Big Data Trading Using Homomorphic Encryption. IEEE Trans. Netw. Sci. Eng. 2020, 7, 776–791. [Google Scholar] [CrossRef]
- Zhang, P. Privacy-Preserving and Outsourced Multi-Party K-Means Clustering Based on Multi-Key Fully Homomorphic Encryption. IEEE Trans. Dependable Secur. Comput. 2023, 20, 2348–2359. [Google Scholar] [CrossRef]
- Liu, M.; Song, X.; Li, Y.; Li, W. Correlated differential privacy based logistic regression for supplier data protection. Comput. Secur. 2024, 136, 03542. [Google Scholar] [CrossRef]
- Li, J.; Yang, Y.; He, Z.; Wu, H.; Shi, H.; Chen, W. Cournot policy model: Rethinking centralized training in multiagent reinforcement learning. Inf. Sci. 2024, 677, 120983. [Google Scholar] [CrossRef]
- Zhang, Y.; Zhou, Y.; Lu, H.; Fujita, H. Cooperative multiagent actor-critic control of traffic network flow based on edge computing. Future Gener. Comput. Syst. 2021, 123, 128–141. [Google Scholar] [CrossRef]
- Zhu, S.; Song, Z.; Zhu, H.; Qiao, R. Efficient slicing scheme and cache optimization strategy for structured dependent tasks in intelligent transportation scenarios. Ad Hoc Netw. 2025, 168, 103699. [Google Scholar]
- Hersi, A.H.; Udayan, J.D. Efficient and Robust Multirobot Navigation and Task Allocation Using Soft Actor Critic. Procedia Comput. Sci. 2024, 235, 484–495. [Google Scholar] [CrossRef]
- Zhu, S.; Tian, X.; Zhang, Z.; Qiao, R.; Zhu, H. Content Placement and Edge Collaborative Caching Scheme Based on Deep Reinforcement Learning for Internet of Vehicles. IEEE Trans. Intell. Transp. Syst. 2025, 26, 8050–8064. [Google Scholar] [CrossRef]
- Zhu, S.; Liu, C.; Zhu, H.; Chen, H.; Qiao, R.; Wu, X.Y. DRL-based structured task offloading decision in intelligent transportation scenarios. Appl. Soft Comput. 2025, 171, 112770. [Google Scholar] [CrossRef]











| Math Abstraction | Representation of Parameters |
|---|---|
| Feature vector of fragment i in task d | |
| True privacy-level label of fragment i in task d | |
| Predicted privacy level of fragment i in task d | |
| Prior probability of privacy class k | |
| Mean vector of privacy class k | |
| Covariance matrix of privacy class k |
| Parameters | Symbolic | Numerical Value |
|---|---|---|
| Computing capability of cloud servers | 4000 MIPS | |
| Computing power of cloud servers | 600 W | |
| Computing capability of vehicle terminals | 80∼220 MIPS | |
| Computing power of vehicle terminals | 60∼120 W | |
| Transmission power of vehicle terminals | 100∼160 W | |
| Computing capability of edge servers | 350∼750 MIPS | |
| Cache capacity of edge servers | 3000 MB | |
| Required computing capability of | 60∼200 MIPS | |
| Communication bandwidth between users and edge servers | 80 MHz | |
| Delay weight parameter | 0.3 | |
| Energy-consumption weight parameter | 0.3 | |
| Privacy-entropy weight parameter | 0.4 |
| Latency (ms) | Energy Consumption (J) | Privacy Entropy (K = 1) | |||
|---|---|---|---|---|---|
| 0.5 | 0.5 | 0 | 268.47 | 5042.81 | 1.74 |
| 0.3 | 0.3 | 0.4 | 289.36 | 5196.42 | 1.91 |
| 0.2 | 0.2 | 0.6 | 304.18 | 5328.57 | 1.93 |
| 0.1 | 0.1 | 0.8 | 326.94 | 5481.63 | 1.97 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Yu, Y.; Song, Z.; Zhu, S.; Zhang, Q. Intelligent Optimization Methods for Cloud–Edge Collaborative Vehicular Networks via the Integration of Bayesian Decision-Making and Reinforcement Learning. Future Internet 2026, 18, 215. https://doi.org/10.3390/fi18040215
Yu Y, Song Z, Zhu S, Zhang Q. Intelligent Optimization Methods for Cloud–Edge Collaborative Vehicular Networks via the Integration of Bayesian Decision-Making and Reinforcement Learning. Future Internet. 2026; 18(4):215. https://doi.org/10.3390/fi18040215
Chicago/Turabian StyleYu, Youjian, Zhaowei Song, Sifeng Zhu, and Qinghua Zhang. 2026. "Intelligent Optimization Methods for Cloud–Edge Collaborative Vehicular Networks via the Integration of Bayesian Decision-Making and Reinforcement Learning" Future Internet 18, no. 4: 215. https://doi.org/10.3390/fi18040215
APA StyleYu, Y., Song, Z., Zhu, S., & Zhang, Q. (2026). Intelligent Optimization Methods for Cloud–Edge Collaborative Vehicular Networks via the Integration of Bayesian Decision-Making and Reinforcement Learning. Future Internet, 18(4), 215. https://doi.org/10.3390/fi18040215

