You are currently viewing a new version of our website. To view the old version click .

40 Results Found

  • Article
  • Open Access
17 Citations
5,892 Views
19 Pages

11 March 2023

The demand for autonomous UAV swarm operations has been on the rise following the success of UAVs in various challenging tasks. Yet conventional swarm control approaches are inadequate for coping with swarm scalability, computational requirements, an...

  • Article
  • Open Access
3 Citations
4,179 Views
21 Pages

Hybrid Centralized Training and Decentralized Execution Reinforcement Learning in Multi-Agent Path-Finding Simulations

  • Hua-Ching Chen,
  • Shih-An Li,
  • Tsung-Han Chang,
  • Hsuan-Ming Feng and
  • Yun-Chien Chen

7 May 2024

In this paper, we propose a hybrid centralized training and decentralized execution neural network architecture with deep reinforcement learning (DRL) to complete the multi-agent path-finding simulation. In the training of physical robots, collisions...

  • Article
  • Open Access
3 Citations
1,353 Views
14 Pages

In addressing the complex challenges of path planning in multi-robot systems, this paper proposes a novel Hybrid Decentralized and Centralized Training and Execution (DCTE) Strategy, aimed at optimizing computational efficiency and system performance...

  • Article
  • Open Access
1 Citations
1,252 Views
20 Pages

27 June 2025

In a multi-cell network, interference management between adjacent cells is a key factor that determines the performance of the entire cellular network. In particular, in order to control inter-cell interference while providing a high data rate to use...

  • Article
  • Open Access
1 Citations
4,457 Views
18 Pages

30 March 2025

In multi-agent reinforcement learning, the fully centralized approach suffers from issues such as explosion of the joint state and action spaces, leading to performance degradation. On the other hand, the fully decentralized approach relies on agents...

  • Article
  • Open Access
2 Citations
2,391 Views
9 Pages

25 August 2021

Using reinforcement learning technologies to learn offloading strategies for multi-access edge computing systems has been developed by researchers. However, large-scale systems are unsuitable for reinforcement learning, due to their huge state spaces...

  • Article
  • Open Access
26 Citations
8,689 Views
28 Pages

13 January 2023

In this paper, a real-time scheduling problem of a dual-resource flexible job shop with robots is studied. Multiple independent robots and their supervised machine sets form their own work cells. First, a mixed integer programming model is establishe...

  • Article
  • Open Access
1,339 Views
16 Pages

Adaptive Control of VSG Inertia Damping Based on MADDPG

  • Demu Zhang,
  • Jing Zhang,
  • Yu He,
  • Tao Shen and
  • Xingyan Liu

20 December 2024

As renewable energy sources become more integrated into the power grid, traditional virtual synchronous generator (VSG) control strategies have become inadequate for the current low-damping, low-inertia power systems. Therefore, this paper proposes a...

  • Article
  • Open Access
15 Citations
4,525 Views
19 Pages

24 March 2021

In this paper, we present a novel deep reinforcement learning (DRL) based method that is used to perform multi-robot task allocation (MRTA) and navigation in an end-to-end fashion. The policy operates in a decentralized manner mapping raw sensor meas...

  • Article
  • Open Access
8 Citations
3,029 Views
18 Pages

A Multi-Agent Reinforcement Learning Method for Cooperative Secondary Voltage Control of Microgrids

  • Tianhao Wang,
  • Shiqian Ma,
  • Zhuo Tang,
  • Tianchun Xiang,
  • Chaoxu Mu and
  • Yao Jin

27 July 2023

This paper proposes a novel cooperative voltage control strategy for an isolated microgrid based on the multi-agent advantage actor-critic (MA2C) algorithm. The proposed method facilitates the collaborative operation of a distributed energy system (D...

  • Article
  • Open Access
1 Citations
3,427 Views
17 Pages

1 November 2023

This paper proposes an energy-efficient scheduling scheme for multi-path TCP (MPTCP) in heterogeneous wireless networks, aiming to minimize energy consumption while ensuring low latency and high throughput. Each MPTCP sub-flow is controlled by an age...

  • Article
  • Open Access
14 Citations
5,258 Views
16 Pages

14 October 2021

In many existing multi-agent reinforcement learning tasks, each agent observes all the other agents from its own perspective. In addition, the training process is centralized, namely the critic of each agent can access the policies of all the agents....

  • Article
  • Open Access
4 Citations
2,281 Views
12 Pages

Secondary Voltage Collaborative Control of Distributed Energy System via Multi-Agent Reinforcement Learning

  • Tianhao Wang,
  • Shiqian Ma,
  • Na Xu,
  • Tianchun Xiang,
  • Xiaoyun Han,
  • Chaoxu Mu and
  • Yao Jin

25 September 2022

In this paper, a new voltage cooperative control strategy for a distributed power generation system is proposed based on the multi-agent advantage actor-critic (MA2C) algorithm, which realizes flexible management and effective control of distributed...

  • Article
  • Open Access
5 Citations
3,942 Views
21 Pages

Federated learning is a privacy-preserving machine learning framework where multiple data owners collaborate to train a global model under the orchestra of a central server. The local training results from trainers should be submitted to the central...

  • Article
  • Open Access
5 Citations
1,696 Views
23 Pages

24 March 2025

Aiming to address the issue of multi-user dynamic spectrum access in an opportunistic mode in cognitive radio networks leading to low sum throughput, we propose a multi-user opportunistic spectrum access method based on multi-head self-attention and...

  • Article
  • Open Access
1,232 Views
39 Pages

Generative AI and Blockchain-Integrated Multi-Agent Framework for Resilient and Sustainable Fruit Cold-Chain Logistics

  • Abhirup Khanna,
  • Sapna Jain,
  • Anushree Sah,
  • Sarishma Dangi,
  • Abhishek Sharma,
  • Sew Sun Tiang,
  • Chin Hong Wong and
  • Wei Hong Lim

27 August 2025

The cold-chain supply of perishable fruits continues to face challenges such as fuel wastage, fragmented stakeholder coordination, and limited real-time adaptability. Traditional solutions, based on static routing and centralized control, fall short...

  • Article
  • Open Access
1,617 Views
18 Pages

25 August 2025

The Centralized Teacher with Decentralized Student (CTDS) framework is a multi-agent reinforcement learning (MARL) approach that utilizes knowledge distillation within the Centralized Training with Decentralized Execution (CTDE) paradigm. In this fra...

  • Article
  • Open Access
4 Citations
3,281 Views
27 Pages

Multi-agent reinforcement learning (MARL) is characterized by its simple structure and strong adaptability, which has led to its widespread application in the field of path planning. To address the challenge of optimal path planning for mobile agent...

  • Article
  • Open Access
10 Citations
2,337 Views
17 Pages

Federated System for Transport Mode Detection

  • Iago C. Cavalcante,
  • Rodolfo I. Meneguette,
  • Renato H. Torres,
  • Leandro Y. Mano,
  • Vinícius P. Gonçalves,
  • Jó Ueyama,
  • Gustavo Pessin,
  • Georges D. Amvame Nze and
  • Geraldo P. Rocha Filho

6 December 2022

Data on transport usage is important in a wide range of areas. These data are often obtained manually through costly and inaccurate interviews. In the last decade, several researchers explored the use of smartphone sensors for the automatic detection...

  • Article
  • Open Access
18 Citations
5,657 Views
26 Pages

Federated learning (FL) is a transformative approach to Machine Learning that enables the training of a shared model without transferring private data to a central location. This decentralized training paradigm has found particular applicability in e...

  • Article
  • Open Access
3 Citations
2,254 Views
20 Pages

1 August 2024

Unmanned aerial vehicle (UAV) confrontation scenarios play a crucial role in the study of agent behavior selection and decision planning. Multi-agent reinforcement learning (MARL) algorithms serve as a universally effective method guiding agents towa...

  • Article
  • Open Access
6 Citations
4,056 Views
21 Pages

13 December 2023

Navigating multiple drones autonomously in complex and unpredictable environments, such as forests, poses a significant challenge typically addressed by wireless communication for coordination. However, this approach falls short in situations with li...

  • Article
  • Open Access
13 Citations
4,722 Views
22 Pages

Multi-UAV Cooperative Air Combat Decision-Making Based on Multi-Agent Double-Soft Actor-Critic

  • Shaowei Li,
  • Yongchao Wang,
  • Yaoming Zhou,
  • Yuhong Jia,
  • Hanyue Shi,
  • Fan Yang and
  • Chaoyue Zhang

Multiple unmanned aerial vehicle (multi-UAV) cooperative air combat, which is an important form of future air combat, has high requirements for the autonomy and cooperation of unmanned aerial vehicles. Therefore, it is of great significance to study...

  • Article
  • Open Access
4 Citations
3,825 Views
16 Pages

With the development of autonomous driving technology, truck platooning control has become a reality. Truck platooning can improve road capacity by maintaining a minor headway. Platooning systems can significantly reduce fuel consumption and emission...

  • Article
  • Open Access
2 Citations
2,170 Views
20 Pages

8 July 2022

In multi-agent domains, dealing with non-stationary opponents that change behaviors (policies) consistently over time is still a challenging problem, where an agent usually requires the ability to detect the opponent’s policy accurately and ado...

  • Article
  • Open Access
436 Views
14 Pages

31 October 2025

Crowdsourced delivery plays a key role in fresh food retailing, where tight time limits and perishability require fast, reliable fulfillment. However, real-time order–courier assignment is challenging because orders arrive in bursts, couriers&r...

  • Article
  • Open Access
43 Citations
6,340 Views
16 Pages

23 December 2022

Unmanned aerial vehicles (UAVs) are important in reconnaissance missions because of their flexibility and convenience. Vitally, UAVs are capable of autonomous navigation, which means they can be used to plan safe paths to target positions in dangerou...

  • Article
  • Open Access
660 Views
16 Pages

For multiple unmanned underwater vehicles (UUVs) systems, obstacle avoidance during cooperative operation in complex marine environments remains a challenging issue. Recent studies demonstrate the effectiveness of deep reinforcement learning (DRL) fo...

  • Article
  • Open Access
3 Citations
3,884 Views
23 Pages

Federated XAI IDS: An Explainable and Safeguarding Privacy Approach to Detect Intrusion Combining Federated Learning and SHAP

  • Kazi Fatema,
  • Samrat Kumar Dey,
  • Mehrin Anannya,
  • Risala Tasin Khan,
  • Mohammad Mamunur Rashid,
  • Chunhua Su and
  • Rashed Mazumder

An intrusion detection system (IDS) is a crucial element in cyber security concerns. IDS is a safeguarding module that is designed to identify unauthorized activities in network environments. The importance of constructing IDSs has never been this si...

  • Article
  • Open Access
782 Views
19 Pages

Physics-Informed Multi-Agent DRL-Based Active Distribution Network Zonal Balancing Control Strategy for Security and Supply Preservation

  • Bingxu Zhai,
  • Yuanzhuo Li,
  • Wei Qiu,
  • Rui Zhang,
  • Zhilin Jiang,
  • Wei Wang,
  • Tao Qian and
  • Qinran Hu

4 June 2025

When large-scale and clustered distributed photovoltaic devices are connected to an active distribution network, the safe and stable operation of the distribution network is seriously threatened, and it is difficult to satisfy the demand for preserva...

  • Article
  • Open Access
1,629 Views
19 Pages

18 March 2025

In modern society, the autonomous exploration of unknown environments has attracted extensive attention due to its broad applications, such as in search and rescue operations, planetary exploration, and environmental monitoring. This paper proposes a...

  • Article
  • Open Access
1,216 Views
31 Pages

24 June 2025

Intelligent Connected Vehicles (ICVs) impose stringent requirements on real-time computational services. However, limited onboard resources and the high latency of remote cloud servers restrict traditional solutions. Unmanned Aerial Vehicle (UAV)-ass...

  • Article
  • Open Access
9 Citations
3,234 Views
17 Pages

An Optimization Method for Collaborative Radar Antijamming Based on Multi-Agent Reinforcement Learning

  • Cheng Feng,
  • Xiongjun Fu,
  • Ziyi Wang,
  • Jian Dong,
  • Zhichun Zhao and
  • Teng Pan

1 June 2023

Attacking a naval vessel with multiple missiles is an important way to improve the hit rate of missiles. Missile-borne radars need to complete detection and antijamming tasks to guide missiles, but communication between these radars is often difficul...

  • Article
  • Open Access
9 Citations
2,533 Views
19 Pages

26 April 2024

Delay-sensitive task offloading in a device-to-device assisted mobile edge computing (D2D-MEC) system with energy harvesting devices is a critical challenge due to the dynamic load level at edge nodes and the variability in harvested energy. In this...

  • Article
  • Open Access
1 Citations
1,639 Views
14 Pages

A wireless communication intelligent anti-jamming decision algorithm based on Deep Reinforcement Learning (DRL) can gradually optimize communication anti-jamming strategies without prior knowledge by continuously interacting with the jamming environm...

  • Article
  • Open Access
6 Citations
3,988 Views
19 Pages

8 August 2024

Device-to-device (D2D) is a pivotal technology in the next generation of communication, allowing for direct task offloading between mobile devices (MDs) to improve the efficient utilization of idle resources. This paper proposes a novel algorithm for...

  • Article
  • Open Access
2 Citations
2,879 Views
18 Pages

25 May 2024

The space–air–ground integrated network can provide services to ground users in remote areas by utilizing high-altitude platform (HAP) drones to support stable user access and using low earth orbit (LEO) satellites to provide large-scale...

  • Article
  • Open Access
387 Views
27 Pages

21 October 2025

The rapid growth of Distributed Energy Resources (DERs) exerts significant pressure on distribution network margins, requiring predictive and safe coordination. This paper presents a closed-loop framework combining a topology-aware Spatio-Temporal Tr...

  • Article
  • Open Access
7 Citations
2,686 Views
17 Pages

5 March 2024

Effective traffic signal control (TSC) plays an important role in reducing vehicle emissions and improving the sustainability of the transportation system. Recently, the feasibility of using multi-agent reinforcement learning technology for TSC has b...

  • Article
  • Open Access
1 Citations
840 Views
21 Pages

28 July 2025

This paper investigates the problem of computation offloading and resource allocation in an integrated space–air–sea network based on unmanned aerial vehicle (UAV) and low Earth orbit (LEO) satellites supporting Maritime Internet of Thing...