535 Results Found

  • Article
  • Open Access
3 Citations
1,813 Views
13 Pages

Proximal Policy Optimization-Based Power Grid Structure Optimization for Reliable Splitting

  • Xinwei Sun,
  • Shuangteng Han,
  • Yuhong Wang,
  • Yunxiang Shi,
  • Jianquan Liao,
  • Zongsheng Zheng,
  • Xi Wang and
  • Peng Shi

9 February 2024

When systems experience a severe fault, splitting, as the final line of defense to ensure the stability of the power system, holds immense significance. The precise selection of splitting sections has become the current focal point of research. Addre...

  • Article
  • Open Access
12 Citations
5,447 Views
24 Pages

6 April 2022

In this paper, a novel deep reinforcement learning algorithm based on Proximal Policy Optimization (PPO) is proposed to achieve the fixed point flight control of a quadrotor. The attitude and position information of the quadrotor is directly mapped t...

  • Article
  • Open Access
9 Citations
2,958 Views
13 Pages

23 October 2020

Accurate temperature prediction plays an important role in the thermal protection of permanent magnet synchronous motors. A temperature prediction method of permanent magnet synchronous machines (PMSMs) based on proximal policy optimization is propos...

  • Article
  • Open Access
6 Citations
2,569 Views
17 Pages

Optimal Control Algorithm for Subway Train Operation by Proximal Policy Optimization

  • Bin Chen,
  • Chunhai Gao,
  • Lei Zhang,
  • Junjie Chen,
  • Jun Chen and
  • Yuyi Li

23 June 2023

With the increasing scale of the urban subway, the total energy consumption of the subway has increased dramatically and poses a great challenge to the comfort of passengers and the punctuality of train operation. In order to ensure on-time train ope...

  • Article
  • Open Access
6 Citations
2,533 Views
22 Pages

12 August 2024

An efficient energy management system (EMS) enhances microgrid performance in terms of stability, safety, and economy. Traditional centralized or decentralized energy management systems are unable to meet the increasing demands for autonomous decisio...

  • Article
  • Open Access
12 Citations
4,729 Views
14 Pages

22 March 2022

In the field of reinforcement learning, we propose a Correct Proximal Policy Optimization (CPPO) algorithm based on the modified penalty factor β and relative entropy in order to solve the robustness and stationarity of traditional algorithms. F...

  • Article
  • Open Access
3 Citations
2,819 Views
15 Pages

9 June 2025

Efficient task allocation remains a fundamental challenge in multi-agent systems, particularly under resource constraints and large-scale deployments. Classical methods, including market-based mechanisms, centralized optimization techniques, and game...

  • Article
  • Open Access
10 Citations
4,563 Views
20 Pages

11 July 2022

For most machine learning and deep learning models, the selection of hyperparameters has a significant impact on the performance of the model. Therefore, deep learning and data analysis experts have to spend a lot of time on hyperparameter tuning whe...

  • Article
  • Open Access
11 Citations
4,090 Views
25 Pages

20 March 2025

Autonomous vehicles must make quick and accurate decisions to operate efficiently in complex and dynamic urban traffic environments, necessitating a reliable and stable learning mechanism. The proximal policy optimization (PPO) algorithm stands out a...

  • Article
  • Open Access
1 Citation
1,862 Views
29 Pages

As cryptocurrency transactions continue to grow, detecting scams within transaction records remains a critical challenge. These transactions can be represented as dynamic graphs, where Neural Network Convolution (NNConv) models are widely used for de...

  • Proceeding Paper
  • Open Access
3 Citations
5,361 Views
25 Pages

A Reinforcement Learning-Based Proximal Policy Optimization Approach to Solve the Economic Dispatch Problem

  • Adil Rizki,
  • Achraf Touil,
  • Abdelwahed Echchatbi,
  • Rachid Oucheikh and
  • Mustapha Ahlaqqach

This paper presents a novel approach to economic dispatch (ED) optimization in power systems through the application of Proximal Policy Optimization (PPO), an advanced reinforcement learning algorithm. The economic dispatch problem, a fundamental cha...

  • Article
  • Open Access
33 Citations
4,163 Views
17 Pages

16 October 2021

The life cycle of wind turbines depends on the operation and maintenance policies adopted. With the critical components of wind turbines being equipped with condition monitoring and Prognostics and Health Management (PHM) capabilities, it is feasible...

  • Article
  • Open Access
3 Citations
1,633 Views
22 Pages

11 December 2024

The rapid integration of distributed energy resources (DERs) such as photovoltaics (PV), wind turbines, and energy storage systems has transformed modern power systems, with hosting capacity optimization emerging as a critical challenge. This paper p...

  • Article
  • Open Access
437 Views
20 Pages

Container multimodal transport faces many uncertainties in practice. To improve operational efficiency and reduce carbon emissions in freight transport, this study develops a multi-objective optimization model for container multimodal routes that inc...

  • Article
  • Open Access
5 Citations
2,326 Views
20 Pages

14 February 2025

To address the resource allocation problem in dynamic environments where multiple unmanned aerial vehicle base stations (UAV-BSs) provide efficient downlink services to ground users, this paper proposes a novel hierarchical decision-making mechanism...

  • Article
  • Open Access
4 Citations
2,817 Views
16 Pages

18 August 2023

Microstrip filters are widely used in high-frequency circuit design for signal frequency selection. However, designing these filters often requires extensive trial and error to achieve the desired performance metrics, leading to significant time cost...

  • Article
  • Open Access
50 Citations
11,942 Views
19 Pages

18 August 2020

Advanced deep reinforcement learning shows promise as an approach to addressing continuous control tasks, especially in mixed-autonomy traffic. In this study, we present a deep reinforcement-learning-based model that considers the effectiveness of le...

  • Article
  • Open Access
2 Citations
2,450 Views
20 Pages

4 October 2024

Single-Pilot Operations (SPO) mode is set to reshape the decision-making process between human-machine and air-ground operations. However, the limited on-board computing resources impose greater demands on the organization of performance parameters a...

  • Article
  • Open Access
2 Citations
86,509 Views
18 Pages

26 August 2024

The more path conflicts between multiple robots, the more time it takes to avoid each other, and the more navigation time it takes for the robots to complete all tasks. This study designs a multi-robot navigation system based on deep reinforcement le...

  • Article
  • Open Access
757 Views
14 Pages

In response to the issues of severe pitch oscillation and unstable roll attitude present in existing reinforcement learning-based aircraft cruise control methods during dynamic maneuvers, this paper proposes a precise control method for aircraft crui...

  • Article
  • Open Access

The increasing densification of cell-free massive multiple-input multiple-output (MIMO) networks makes access point switch on/off (ASO) a key mechanism for improving energy efficiency in future wireless systems. While reinforcement learning (RL) has...

  • Article
  • Open Access
2 Citations
899 Views
17 Pages

23 July 2025

The electric–hydrogen coupled integrated energy system (EHCS) is a critical pathway for the low-carbon transition of energy systems. However, the inherent uncertainties of renewable energy sources present significant challenges to optimal energ...

  • Article
  • Open Access
40 Citations
4,529 Views
15 Pages

14 January 2022

The complexity of network intrusion detection systems (IDSs) is increasing due to the continuous increases in network traffic, various attacks and the ever-changing network environment. In addition, network traffic is asymmetric with few attack data,...

  • Article
  • Open Access
19 Citations
6,330 Views
17 Pages

2 January 2024

In the advanced 5G and beyond networks, multi-access edge computing (MEC) is increasingly recognized as a promising technology, offering the dual advantages of reducing energy utilization in cloud data centers while catering to the demands for reliab...

  • Article
  • Open Access
2 Citations
1,890 Views
22 Pages

In the context of intelligent manufacturing, the integrated scheduling problem of dual rail-guided vehicles (RGVs) and multiple parallel processing equipment in flexible manufacturing systems has gained increasing importance. This problem exhibits sp...

  • Article
  • Open Access
277 Views
18 Pages

13 February 2026

High penetration of distributed photovoltaic (PV) generation has transformed active distribution networks into inverter-dominated systems, where maintaining voltage stability, minimizing power losses, and maximizing renewable utilization under uncert...

  • Article
  • Open Access
537 Views
15 Pages

19 September 2025

This article proposes a wideband microstrip-to-microstrip vertical transition with multi-layer pixel structures, alongside a multi-branch knowledge-assisted proximal policy optimization (MB-KPPO) method for its automatic design. The proposed transiti...

  • Article
  • Open Access
12 Citations
8,358 Views
20 Pages

20 November 2023

In an era characterised by rapid technological advancement, the application of algorithmic approaches to address complex problems has become crucial across various disciplines. Within the realm of education, there is growing recognition of the pivota...

  • Article
  • Open Access
703 Views
28 Pages

14 August 2025

In the field of mobile crowdsensing (MCS), a large number of tasks rely on the participation of ordinary mobile device users for data collection and processing. This model has shown great potential for applications in environmental monitoring, traffi...

  • Article
  • Open Access
2 Citations
1,830 Views
27 Pages

27 January 2025

Improving decision-making in the autonomous maneuvering of unmanned aerial vehicles (UAVs) is of great significance to improving flight safety, the mission execution rate, and environmental adaptability. The method of deep reinforcement learning make...

  • Article
  • Open Access
18 Citations
3,563 Views
23 Pages

The performance of the energy management strategy (EMS) is essential for a plug-in hybrid electric bus (PHEB) to operate efficiently. The proximal policy optimization (PPO) based multi-objective EMS considering the battery thermal characteri...

  • Article
  • Open Access
1 Citation
840 Views
21 Pages

The experimentation of agricultural robots has been increasing in recent years, both in greenhouses and open fields. While agricultural robots are inherently useful for automating various farming tasks, their presence can also be leveraged to collect...

  • Article
  • Open Access
38 Citations
6,152 Views
16 Pages

Research on the Multiagent Joint Proximal Policy Optimization Algorithm Controlling Cooperative Fixed-Wing UAV Obstacle Avoidance

  • Weiwei Zhao,
  • Hairong Chu,
  • Xikui Miao,
  • Lihong Guo,
  • Honghai Shen,
  • Chenhao Zhu,
  • Feng Zhang and
  • Dongxin Liang

13 August 2020

Multiple unmanned aerial vehicle (UAV) collaboration has great potential. To increase the intelligence and environmental adaptability of multi-UAV control, we study the application of deep reinforcement learning algorithms in the field of multi-UAV c...

  • Article
  • Open Access
5 Citations
3,384 Views
26 Pages

8 January 2025

In this paper, we address the issues of the explainability of reinforcement learning-based machine learning agents trained with Proximal Policy Optimization (PPO) that utilizes visual sensor data. We propose an algorithm that allows an effective and...

  • Article
  • Open Access
14 Citations
6,859 Views
24 Pages

With the increasing global demand for renewable energy and heightened environmental awareness, electric vehicles (EVs) are rapidly becoming a popular clean and efficient mode of transportation. However, the widespread adoption of EVs has presented se...

  • Article
  • Open Access
3 Citations
2,032 Views
20 Pages

Guidance commands of flight vehicles can be regarded as a series of data sets having fixed time intervals; thus, guidance design constitutes a typical sequential decision problem and satisfies the basic conditions for using the deep reinforcement lea...

  • Article
  • Open Access
18 Citations
4,613 Views
19 Pages

19 August 2022

Autonomous maneuver decision by an unmanned combat air vehicle (UCAV) is a critical part of air combat that requires both flight safety and tactical maneuvering. In this paper, an unmanned combat air vehicle air combat maneuver decision method based...

  • Article
  • Open Access
18 Citations
5,075 Views
17 Pages

17 December 2019

Location technology is playing an increasingly important role in urban life. Various active and passive wireless positioning technologies for mobile terminals have attracted research attention. However, positioning signals experience serious interfer...

  • Article
  • Open Access
2 Citations
1,544 Views
17 Pages

25 April 2025

The rapid development of mobile Internet technology has made users’ requirements for quality of service (QoS) continuously improve. The task unloading process of mobile edge computing has the problem that it is impossible to balance delay and e...

  • Article
  • Open Access
3 Citations
3,784 Views
21 Pages

6 September 2024

Serverless computing is a new cloud computing model suitable for providing services in both large cloud and edge clusters. In edge clusters, the autoscaling functions play a key role on serverless platforms as the dynamic scaling of function instance...

  • Article
  • Open Access
9 Citations
3,359 Views
17 Pages

Autonomous Driving Decision Control Based on Improved Proximal Policy Optimization Algorithm

  • Qingpeng Song,
  • Yuansheng Liu,
  • Ming Lu,
  • Jun Zhang,
  • Han Qi,
  • Ziyu Wang and
  • Zijian Liu

24 May 2023

The decision-making control of autonomous driving in complex urban road environments is a difficult problem in the research of autonomous driving. In order to solve the problem of high dimensional state space and sparse reward in autonomous driving d...

  • Article
  • Open Access
17 Citations
3,435 Views
20 Pages

Proximal Policy Optimization for Energy Management of Electric Vehicles and PV Storage Units

  • Monica Alonso,
  • Hortensia Amaris,
  • David Martin and
  • Arturo de la Escalera

29 July 2023

Connected autonomous electric vehicles (CAEVs) are essential actors in the decarbonization process of the transport sector and a key aspect of home energy management systems (HEMSs) along with PV units, CAEVs and battery energy storage systems. Howev...

  • Article
  • Open Access
1 Citation
665 Views
21 Pages

Cloud-based platforms form the backbone of smart city ecosystems, powering essential services such as transportation, energy management, and public safety. However, their operational complexity generates vast volumes of system logs, making manual ano...

  • Article
  • Open Access
6 Citations
1,980 Views
16 Pages

29 March 2024

Unsignalized roundabouts have a significant impact on traffic flow and vehicle safety. To address the challenge of autonomous vehicles passing through roundabouts with low penetration, improve their efficiency, and ensure safety and stability, we pro...

  • Article
  • Open Access
382 Views
27 Pages

Joint Optimization of Microservice and Database Orchestration in Edge Clouds via Multi-Stage Proximal Policy

  • Xingfeng He,
  • Mingwei Luo,
  • Dengmu Liu,
  • Zhenhua Wang,
  • Yingdong Liu,
  • Chen Zhang,
  • Jiandong Wang,
  • Jiaxiang Xu and
  • Tianping Deng

9 January 2026

Microservices as an emerging architectural approach have been widely applied in the development of online applications. However, in large-scale service systems, frequent data communications, complex invocation dependencies, and strict latency require...

  • Article
  • Open Access
3 Citations
1,450 Views
17 Pages

Surface Defect Detection for Small Samples of Particleboard Based on Improved Proximal Policy Optimization

  • Haifei Xia,
  • Haiyan Zhou,
  • Mingao Zhang,
  • Qingyi Zhang,
  • Chenlong Fan,
  • Yutu Yang,
  • Shuang Xi and
  • Ying Liu

17 April 2025

Particleboard is an important forest product that can be reprocessed using wood processing by-products. This approach has the potential to achieve significant conservation of forest resources and contribute to the protection of forest ecology. Most c...

  • Article
  • Open Access
1 Citation
1,596 Views
15 Pages

25 September 2024

Future 6G networks will inherit and develop Network Function Virtualization (NFV) architecture. With the NFV-enabled network architecture, it becomes possible to establish different virtual networks within the same infrastructure, create different Vi...

  • Article
  • Open Access
4 Citations
2,613 Views
22 Pages

5 November 2024

As the decarbonization strategies of automated container terminals (ACTs) continue to advance, electrically powered Battery-Automated Guided Vehicles (B-AGVs) are being widely adopted in ACTs. The U-shaped ACT, as a novel layout, faces higher AGV ene...

  • Article
  • Open Access
4 Citations
2,431 Views
17 Pages

29 November 2024

Existing multi-agent deep reinforcement learning (MADRL) methods for multi-UAV navigation face challenges in generalization, particularly when applied to unseen complex environments. To address these limitations, we propose a Dual-Transformer Encoder...

  • Article
  • Open Access
11 Citations
3,084 Views
18 Pages

31 October 2023

To solve the problems of path planning and dynamic obstacle avoidance for an unmanned surface vehicle (USV) in a locally observable non-dynamic ocean environment, a visual perception and decision-making method based on deep reinforcement learning is...
