You are currently viewing a new version of our website. To view the old version click .

291 Results Found

  • Article
  • Open Access
9 Citations
5,265 Views
24 Pages

6 April 2022

In this paper, a novel deep reinforcement learning algorithm based on Proximal Policy Optimization (PPO) is proposed to achieve the fixed point flight control of a quadrotor. The attitude and position information of the quadrotor is directly mapped t...

  • Article
  • Open Access
1,665 Views
29 Pages

As cryptocurrency transactions continue to grow, detecting scams within transaction records remains a critical challenge. These transactions can be represented as dynamic graphs, where Neural Network Convolution (NNConv) models are widely used for de...

  • Article
  • Open Access
2 Citations
1,445 Views
22 Pages

11 December 2024

The rapid integration of distributed energy resources (DERs) such as photovoltaics (PV), wind turbines, and energy storage systems has transformed modern power systems, with hosting capacity optimization emerging as a critical challenge. This paper p...

  • Article
  • Open Access
2,280 Views
18 Pages

26 September 2025

This article presents a deep reinforcement learning (DRL) approach for adaptive robotic grasping in dynamic environments. We developed UR5GraspingEnv, a PyBullet-based simulation environment integrated with OpenAI Gym, to train a UR5 robotic arm with...

  • Article
  • Open Access
8 Citations
2,005 Views
22 Pages

8 June 2024

As the integration of renewable energy expands, effective energy system management becomes increasingly crucial. Distributed renewable generation microgrids offer green energy and resilience. Combining them with energy storage and a suitable energy m...

  • Article
  • Open Access
9 Citations
4,758 Views
19 Pages

Autonomous drones offer immense potential in dynamic environments, but their navigation systems often struggle with moving obstacles. This paper presents a novel approach for drone trajectory planning in such scenarios, combining the Interactive Mult...

  • Article
  • Open Access
1,630 Views
27 Pages

12 October 2025

The increasing complexity of urban traffic networks has highlighted the potential of Multi-Agent Reinforcement Learning (MARL) for Traffic Signal Control (TSC). However, most existing MARL methods assume homogeneous observation and action spaces amon...

  • Article
  • Open Access
18 Citations
5,971 Views
17 Pages

2 January 2024

In the advanced 5G and beyond networks, multi-access edge computing (MEC) is increasingly recognized as a promising technology, offering the dual advantages of reducing energy utilization in cloud data centers while catering to the demands for reliab...

  • Article
  • Open Access
1,495 Views
17 Pages

5 September 2025

Autonomous docking is a critical capability for enabling fully automated operations in industrial and logistics environments using Autonomous Mobile Robots (AMRs). Traditional rule-based docking approaches often struggle with generalization and robus...

  • Article
  • Open Access
656 Views
16 Pages

AI-Based Intelligent System for Personalized Examination Scheduling

  • Marco Barone,
  • Muddasar Naeem,
  • Matteo Ciaschi,
  • Giancarlo Tretola and
  • Antonio Coronato

Artificial Intelligence (AI) has brought a revolution in many areas, including the education sector. It has the potential to improve learning practices, innovate teaching, and accelerate the path towards personalized learning. This work introduces Re...

  • Article
  • Open Access
1 Citations
1,258 Views
23 Pages

26 February 2025

Three-dimensional (3D) dynamic trajectory planning for Autonomous Underwater Vehicles (AUVs) is associated with significant challenges such as balancing the trajectory quality, computational efficiency, and environmental adaptability within complex d...

  • Article
  • Open Access
3 Citations
1,956 Views
20 Pages

Guidance commands of flight vehicles can be regarded as a series of data sets having fixed time intervals; thus, guidance design constitutes a typical sequential decision problem and satisfies the basic conditions for using the deep reinforcement lea...

  • Article
  • Open Access
11 Citations
2,663 Views
18 Pages

13 September 2024

The stratospheric airship, as a near-space vehicle, is increasingly utilized in scientific exploration and Earth observation due to its long endurance and regional observation capabilities. However, due to the complex characteristics of the stratosph...

  • Article
  • Open Access
3 Citations
1,970 Views
23 Pages

To address the challenges of manual buoy inspection, this study enhances a previously proposed Unmanned Surface Vehicle (USV) inspection system by improving its navigation and obstacle avoidance capabilities using Proximal Policy Optimization (PPO)....

  • Article
  • Open Access
958 Views
35 Pages

This paper proposes a dynamic portfolio allocation framework that integrates deep reinforcement learning (DRL) with classical portfolio optimization to enhance rebalancing strategies and risk–return management. Within a unified reinforcement-le...

  • Article
  • Open Access
43 Citations
3,833 Views
19 Pages

9 November 2022

The integration of artificial intelligence (AI) technology into the Internet of Vehicles (IoV) has provided smart services for intelligent connected vehicles (ICVs). However, due to gradually upgrading to ICVs, an increasing number of external commun...

  • Article
  • Open Access
979 Views
24 Pages

19 August 2025

In complex urban wireless environments, unmanned aerial vehicle–mobile edge computing (UAV-MEC) systems face challenges like link blockage and single-antenna eavesdropping threats. The traditional single reconfigurable intelligent surface (RIS)...

  • Article
  • Open Access
8 Citations
2,090 Views
19 Pages

6 October 2022

As the Internet of Things (IoT) continues to grow, a vast amount of data is generated. The IoT environment is quite sensitive to security challenges because personal information may be leaked or sensor data may be manipulated, which could cause accid...

  • Article
  • Open Access
3 Citations
3,154 Views
26 Pages

15 October 2024

Earthwork operations are critical to construction projects, with their safety and efficiency influenced by factors such as operator skill and working hours. Pre-construction simulation of these operations is essential for optimizing outcomes, providi...

  • Article
  • Open Access
1 Citations
687 Views
21 Pages

The experimentation of agricultural robots has been increasing in recent years, both in greenhouses and open fields. While agricultural robots are inherently useful for automating various farming tasks, their presence can also be leveraged to collect...

  • Article
  • Open Access
39 Citations
4,255 Views
15 Pages

14 January 2022

The complexity of network intrusion detection systems (IDSs) is increasing due to the continuous increases in network traffic, various attacks and the ever-changing network environment. In addition, network traffic is asymmetric with few attack data,...

  • Article
  • Open Access
3 Citations
2,188 Views
35 Pages

Based on the quasi-six-degree-of-freedom flight dynamic equations, considering the changes in the elevation angle caused by an increase in the rolling angle during maneuvering turns, which leads to a rise in the radar cross-section. A computational m...

  • Article
  • Open Access
1,856 Views
35 Pages

DeepSIGNAL-ITS—Deep Learning Signal Intelligence for Adaptive Traffic Signal Control in Intelligent Transportation Systems

  • Mirabela Melinda Medvei,
  • Alin-Viorel Bordei,
  • Ștefania Loredana Niță and
  • Nicolae Țăpuș

27 August 2025

Urban traffic congestion remains a major contributor to vehicle emissions and travel inefficiency, prompting the need for adaptive and intelligent traffic management systems. In response, we introduce DeepSIGNAL-ITS (Deep Learning Signal Intelligence...

  • Article
  • Open Access
9 Citations
2,851 Views
16 Pages

Efficient Asynchronous Federated Learning for AUV Swarm

  • Zezhao Meng,
  • Zhi Li,
  • Xiangwang Hou,
  • Jun Du,
  • Jianrui Chen and
  • Wei Wei

11 November 2022

The development of automatic underwater vehicles (AUVs) has brought about unprecedented profits and opportunities. In order to discover the hidden valuable data detected by an AUV swarm, it is necessary to aggregate the data detected by AUV swarm to...

  • Article
  • Open Access
1 Citations
1,091 Views
26 Pages

Industrial Control Systems (ICS) are increasingly targeted by sophisticated and evolving cyberattacks, while conventional static defense mechanisms and isolated intrusion detection models often lack the robustness required to cope with such dynamic t...

  • Article
  • Open Access
101 Views
21 Pages

Cloud-based platforms form the backbone of smart city ecosystems, powering essential services such as transportation, energy management, and public safety. However, their operational complexity generates vast volumes of system logs, making manual ano...

  • Article
  • Open Access
1 Citations
2,374 Views
19 Pages

The use of wearable assistive devices is growing in both industrial and medical fields. Combining human expertise and artificial intelligence (AI), e.g., in human-in-the-loop-optimization, is gaining popularity for adapting assistance to individuals....

  • Article
  • Open Access
38 Citations
5,951 Views
16 Pages

Research on the Multiagent Joint Proximal Policy Optimization Algorithm Controlling Cooperative Fixed-Wing UAV Obstacle Avoidance

  • Weiwei Zhao,
  • Hairong Chu,
  • Xikui Miao,
  • Lihong Guo,
  • Honghai Shen,
  • Chenhao Zhu,
  • Feng Zhang and
  • Dongxin Liang

13 August 2020

Multiple unmanned aerial vehicle (UAV) collaboration has great potential. To increase the intelligence and environmental adaptability of multi-UAV control, we study the application of deep reinforcement learning algorithms in the field of multi-UAV c...

  • Article
  • Open Access
23 Citations
5,059 Views
17 Pages

Proactive Handover Decision for UAVs with Deep Reinforcement Learning

  • Younghoon Jang,
  • Syed M. Raza,
  • Moonseong Kim and
  • Hyunseung Choo

5 February 2022

The applications of Unmanned Aerial Vehicles (UAVs) are rapidly growing in domains such as surveillance, logistics, and entertainment and require continuous connectivity with cellular networks to ensure their seamless operations. However, handover po...

  • Article
  • Open Access
4 Citations
4,751 Views
19 Pages

Financial Vision-Based Reinforcement Learning Trading Strategy

  • Yun-Cheng Tsai,
  • Fu-Min Szu,
  • Jun-Hao Chen and
  • Samuel Yen-Chi Chen

9 August 2022

Recent advances in artificial intelligence (AI) for quantitative trading have led to its general superhuman performance among notable trading performance results. However, if we use AI without proper supervision, it can lead to wrong choices and huge...

  • Article
  • Open Access
7 Citations
3,901 Views
13 Pages

11 December 2023

The advent of autonomous vehicles has opened new horizons for transportation efficiency and safety. Platooning, a strategy where vehicles travel closely together in a synchronized manner, holds promise for reducing traffic congestion, lowering fuel c...

  • Feature Paper
  • Article
  • Open Access
23 Citations
5,213 Views
19 Pages

19 November 2020

One popular method for optimizing systems, referred to as ANN-PSO, uses an artificial neural network (ANN) to approximate the system and an optimization method like particle swarm optimization (PSO) to select inputs. However, with reinforcement learn...

  • Article
  • Open Access
4 Citations
2,880 Views
12 Pages

Vehicle-Following Control Based on Deep Reinforcement Learning

  • Yong Huang,
  • Xin Xu,
  • Yong Li,
  • Xinglong Zhang,
  • Yao Liu and
  • Xiaochuan Zhang

21 October 2022

Intelligent vehicle-following control presents a great challenge in autonomous driving. In vehicle-intensive roads of city environments, frequent starting and stopping of vehicles is one of the important cause of front-end collision accidents. Theref...

  • Article
  • Open Access
8 Citations
4,303 Views
20 Pages

30 May 2024

Landing a multi-rotor uncrewed aerial vehicle (UAV) on a moving target in the presence of partial observability, due to factors such as sensor failure or noise, represents an outstanding challenge that requires integrative techniques in robotics and...

  • Article
  • Open Access
4 Citations
2,339 Views
22 Pages

12 August 2024

An efficient energy management system (EMS) enhances microgrid performance in terms of stability, safety, and economy. Traditional centralized or decentralized energy management systems are unable to meet the increasing demands for autonomous decisio...

  • Article
  • Open Access
5 Citations
2,755 Views
18 Pages

24 November 2023

In the field of intelligent anti-jamming, deep reinforcement learning algorithms are regarded as key technical means. However, the learning process of deep reinforcement learning algorithms requires a stable learning environment to ensure its effecti...

  • Article
  • Open Access
15 Citations
8,508 Views
18 Pages

Prediction Horizon-Varying Model Predictive Control (MPC) for Autonomous Vehicle Control

  • Zhenbin Chen,
  • Jiaqin Lai,
  • Peixin Li,
  • Omar I. Awad and
  • Yubing Zhu

The prediction horizon is a key parameter in model predictive control (MPC), which is related to the effectiveness and stability of model predictive control. In vehicle control, the selection of a prediction horizon is influenced by factors such as s...

  • Article
  • Open Access
2 Citations
1,741 Views
17 Pages

Optimization of Predefined-Time Agent-Scheduling Strategy Based on PPO

  • Dingding Qi,
  • Yingjun Zhao,
  • Longyue Li and
  • Zhanxiao Jia

31 July 2024

In this paper, we introduce an agent rescue scheduling approach grounded in proximal policy optimization, coupled with a singularity-free predefined-time control strategy. The primary objective of this methodology is to bolster the efficiency and pre...

  • Article
  • Open Access
18 Citations
4,991 Views
17 Pages

17 December 2019

Location technology is playing an increasingly important role in urban life. Various active and passive wireless positioning technologies for mobile terminals have attracted research attention. However, positioning signals experience serious interfer...

  • Article
  • Open Access
16 Citations
4,444 Views
19 Pages

19 August 2022

Autonomous maneuver decision by an unmanned combat air vehicle (UCAV) is a critical part of air combat that requires both flight safety and tactical maneuvering. In this paper, an unmanned combat air vehicle air combat maneuver decision method based...

  • Article
  • Open Access
1,168 Views
18 Pages

26 September 2025

Reinforcement learning (RL), and in particular Proximal Policy Optimization (PPO), has shown promise in high-precision quadrotor unmanned aerial vehicle (QUAV) control. However, the performance of PPO is highly sensitive to the choice of the clipping...

  • Article
  • Open Access
2 Citations
2,070 Views
13 Pages

This paper presents a cooperative highway platooning strategy that integrates Multi-Agent Reinforcement Learning (MARL) with Proximal Policy Optimization (PPO) to effectively manage the complex task of merging. In modern transportation systems, plato...

  • Article
  • Open Access
1 Citations
2,445 Views
23 Pages

Deep Reinforcement Learning for Sim-to-Real Robot Navigation with a Minimal Sensor Suite for Beach-Cleaning Applications

  • Guillermo Cid Ampuero,
  • Gabriel Hermosilla,
  • Germán Varas and
  • Matías Toribio Clark

5 October 2025

Autonomous beach-cleaning robots require reliable, low-cost navigation on sand. We study Sim-to-Real transfer of deep reinforcement learning (DRL) policies using a minimal sensor suite—wheel-encoder odometry and a single 2-D LiDAR—on a 30...

  • Article
  • Open Access
54 Citations
9,152 Views
13 Pages

Multiple-UAV Reinforcement Learning Algorithm Based on Improved PPO in Ray Framework

  • Guang Zhan,
  • Xinmiao Zhang,
  • Zhongchao Li,
  • Lin Xu,
  • Deyun Zhou and
  • Zhen Yang

4 July 2022

Distributed multi-agent collaborative decision-making technology is the key to general artificial intelligence. This paper takes the self-developed Unity3D collaborative combat environment as the test scenario, setting a task that requires heterogene...

  • Article
  • Open Access
6 Citations
1,836 Views
16 Pages

29 March 2024

Unsignalized roundabouts have a significant impact on traffic flow and vehicle safety. To address the challenge of autonomous vehicles passing through roundabouts with low penetration, improve their efficiency, and ensure safety and stability, we pro...

  • Proceeding Paper
  • Open Access
1,400 Views
8 Pages

One of the crucial tasks for autonomous robots is learning to safely navigate through obstacles in real-world environments. An intelligent robot must not only perform the assigned task but also adapt to changes in its environment as quickly as possib...

  • Article
  • Open Access
3 Citations
5,012 Views
17 Pages

26 October 2022

This paper modifies the single rigid body (SRB) model, and considers the swinging leg as the disturbances to the centroid acceleration and rotational acceleration of the SRB model. This paper proposes deep reinforcement learning (DRL)-based model pre...

  • Proceeding Paper
  • Open Access
3 Citations
4,702 Views
25 Pages

A Reinforcement Learning-Based Proximal Policy Optimization Approach to Solve the Economic Dispatch Problem

  • Adil Rizki,
  • Achraf Touil,
  • Abdelwahed Echchatbi,
  • Rachid Oucheikh and
  • Mustapha Ahlaqqach

This paper presents a novel approach to economic dispatch (ED) optimization in power systems through the application of Proximal Policy Optimization (PPO), an advanced reinforcement learning algorithm. The economic dispatch problem, a fundamental cha...

  • Article
  • Open Access
853 Views
20 Pages

4 December 2025

Addressing two types of supply chain disruptions—frequent short-duration disruptions (e.g., minor natural disasters) and infrequent long-duration disruptions (e.g., geopolitical conflicts, public health crises)—while considering their imp...

  • Article
  • Open Access
7 Citations
5,036 Views
17 Pages

20 August 2024

This study proposes a method named Hybrid Heuristic Proximal Policy Optimization (HHPPO) to implement online 3D bin-packing tasks. Some heuristic algorithms for bin-packing and the Proximal Policy Optimization (PPO) algorithm of deep reinforcement le...

of 6