Skip to Content

875 Results Found

  • Article
  • Open Access
8 Citations
2,389 Views
18 Pages

An Improved Q-Learning Algorithm for Optimizing Sustainable Remanufacturing Systems

  • Shujin Qin,
  • Xiaofei Zhang,
  • Jiacun Wang,
  • Xiwang Guo,
  • Liang Qi,
  • Jinrui Cao and
  • Yizhi Liu

16 May 2024

In our modern society, there has been a noticeable increase in pollution due to the trend of post-use handling of items. This necessitates the adoption of recycling and remanufacturing processes, advocating for sustainable resource management. This p...

  • Article
  • Open Access
4 Citations
3,918 Views
25 Pages

15 February 2024

This paper presents a solution to address the challenges of unexpected events in the operation of metro trains, which can lead to increased delays and safety risks. An improved Q-learning algorithm is proposed to reschedule train timetables via incor...

  • Article
  • Open Access
261 Views
40 Pages

To address the rapid population diversity loss and premature convergence of the Artificial Lemming Algorithm (ALA) in complex optimization problems, this paper proposes an Improved Artificial Lemming Algorithm (IALA) with multi-strategy enhancements...

  • Article
  • Open Access
3 Citations
3,578 Views
13 Pages

22 March 2020

In this paper, the Q-learning method for quadratic optimal control problem of discrete-time linear systems is reconsidered. The theoretical results prove that the quadratic optimal controller cannot be solved directly due to the linear correlation of...

  • Article
  • Open Access
9 Citations
3,442 Views
22 Pages

28 March 2024

In contemporary warfare, radar countermeasures have become multifunctional and intelligent, rendering the conventional jamming method and platform unsuitable for the modern radar countermeasures battlefield due to their limited efficiency. Reinforcem...

  • Article
  • Open Access
5 Citations
1,891 Views
17 Pages

This paper proposes a central anti-jamming algorithm (CAJA) based on improved Q-learning to further solve the communication challenges faced by multi-user wireless communication networks in terms of external complex malicious interference. This will...

  • Article
  • Open Access
7 Citations
3,044 Views
26 Pages

11 June 2021

An Unmanned Aerial Vehicle (UAV) can greatly reduce manpower in the agricultural plant protection such as watering, sowing, and pesticide spraying. It is essential to develop a Decision-making Support System (DSS) for UAVs to help them choose the cor...

  • Article
  • Open Access
7 Citations
1,695 Views
27 Pages

An Improved Deep Q-Learning Approach for Navigation of an Autonomous UAV Agent in 3D Obstacle-Cluttered Environment

  • Ghulam Farid,
  • Muhammad Bilal,
  • Lanyong Zhang,
  • Ayman Alharbi,
  • Ishaq Ahmed and
  • Muhammad Azhar

23 July 2025

The performance of the UAVs while executing various mission profiles greatly depends on the selection of planning algorithms. Reinforcement learning (RL) algorithms can effectively be utilized for robot path planning. Due to random action selection i...

  • Article
  • Open Access
8 Citations
2,718 Views
13 Pages

27 August 2023

With the rapid advancement of unmanned aerial vehicle (UAV) technology, the widespread utilization of UAVs poses significant challenges to urban low-altitude safety and airspace management. In the coming future, the quantity of drones is expected to...

  • Article
  • Open Access
16 Citations
4,262 Views
21 Pages

CLSQL: Improved Q-Learning Algorithm Based on Continuous Local Search Policy for Mobile Robot Path Planning

  • Tian Ma,
  • Jiahao Lyu,
  • Jiayi Yang,
  • Runtao Xi,
  • Yuancheng Li,
  • Jinpeng An and
  • Chao Li

8 August 2022

How to generate the path planning of mobile robots quickly is a problem in the field of robotics. The Q-learning(QL) algorithm has recently become increasingly used in the field of mobile robot path planning. However, its selection policy is blind in...

  • Article
  • Open Access
3 Citations
2,005 Views
11 Pages

Automatic Verification Flow Shop Scheduling of Electric Energy Meters Based on an Improved Q-Learning Algorithm

  • Long Peng,
  • Jiajie Li,
  • Jingming Zhao,
  • Sanlei Dang,
  • Zhengmin Kong and
  • Li Ding

22 February 2022

Considering the engineering problem of electric energy meter automatic verification and scheduling, this paper proposes a novel scheduling scheme based on an improved Q-learning algorithm. First, by introducing the state variables and behavior variab...

  • Article
  • Open Access
1 Citations
340 Views
33 Pages

9 January 2026

Most multi-objective studies on distributed hybrid flow shops that include tardiness-related objectives focus solely on optimizing makespan alongside a single tardiness objective. However, in real-world scenarios with strict contractual deadlines or...

  • Article
  • Open Access
18 Citations
6,507 Views
17 Pages

19 November 2022

Dyna-Q is a reinforcement learning method widely used in AGV path planning. However, in large complex dynamic environments, due to the sparse reward function of Dyna-Q and the large searching space, this method has the problems of low search efficien...

  • Article
  • Open Access
30 Citations
7,316 Views
16 Pages

Deep Q-Learning in Robotics: Improvement of Accuracy and Repeatability

  • Marius Sumanas,
  • Algirdas Petronis,
  • Vytautas Bucinskas,
  • Andrius Dzedzickis,
  • Darius Virzonis and
  • Inga Morkvenaite-Vilkonciene

21 May 2022

Recent industrial robotics covers a broad part of the manufacturing spectrum and other human everyday life applications; the performance of these devices has become increasingly important. Positioning accuracy and repeatability, as well as operating...

  • Article
  • Open Access
1 Citations
1,231 Views
22 Pages

1 September 2025

With the growing adoption of circular economy principles in manufacturing, efficient disassembly and reassembly of end-of-life (EOL) products has become a key challenge in smart factories. This paper addresses the Disassembly and Assembly Line Balanc...

  • Article
  • Open Access
10 Citations
4,010 Views
22 Pages

10 July 2023

Nowadays, academic research, disaster mitigation, industry, and transportation apply the cooperative multi-agent concept. A cooperative multi-agent system is a multi-agent system that works together to solve problems or maximise utility. The essentia...

  • Article
  • Open Access
775 Views
26 Pages

Application of an Improved Double Q-Learning Algorithm in Ground Mobile Robots

  • Jinchao Zhao,
  • Ya Zhang,
  • Nan Wu,
  • Xinye Han,
  • Luoyin Ning,
  • Xiaowei Ren,
  • Lingling Fang,
  • Jiaxuan Wang,
  • Xu Ren and
  • Jinghao Feng
  • + 1 author

12 September 2025

Since efficient path planning technology is the key to the safe and autonomous navigation of autonomous ground robots, and in the complex and asymmetrically distributed land environment, the existing path planning and obstacle avoidance technologies...

  • Article
  • Open Access
13 Citations
3,569 Views
17 Pages

Robust Attitude Control of an Agile Aircraft Using Improved Q-Learning

  • Mohsen Zahmatkesh,
  • Seyyed Ali Emami,
  • Afshin Banazadeh and
  • Paolo Castaldi

12 December 2022

Attitude control of a novel regional truss-braced wing (TBW) aircraft with low stability characteristics is addressed in this paper using Reinforcement Learning (RL). In recent years, RL has been increasingly employed in challenging applications, par...

  • Article
  • Open Access
8 Citations
2,743 Views
18 Pages

An Improved Q-Learning-Based Sensor-Scheduling Algorithm for Multi-Target Tracking

  • Zhiyi Qu,
  • Xue Zhao,
  • Huihui Xu,
  • Hongying Tang,
  • Jiang Wang and
  • Baoqing Li

15 September 2022

Target tracking is an essential issue in wireless sensor networks (WSNs). Compared with single-target tracking, how to guarantee the performance of multi-target tracking is more challenging because the system needs to balance the tracking resource fo...

  • Article
  • Open Access
5 Citations
1,855 Views
21 Pages

22 November 2022

This paper presents a power distribution system that prioritizes the reliability of power to critical loads within a community. The proposed system utilizes reinforcement learning methods (Q-learning) to train multi-port power electronic interface (M...

  • Article
  • Open Access
18 Citations
2,873 Views
21 Pages

11 November 2023

The path planning problem of nuclear environment robots refers to finding a collision-free path under the constraints of path length and an accumulated radiation dose. To solve this problem, the Improved Dueling Deep Double Q Network algorithm (ID3QN...

  • Article
  • Open Access
1 Citations
1,822 Views
18 Pages

14 August 2025

This study proposes a novel framework, Mamba-DQN, which integrates the state space-based time-series encoder Mamba-SSM into the Deep Q-Network (DQN) architecture to improve reinforcement learning performance in dynamic environments. Conventional rein...

  • Article
  • Open Access
16 Citations
3,954 Views
21 Pages

A Multi-Area Task Path-Planning Algorithm for Agricultural Drones Based on Improved Double Deep Q-Learning Net

  • Jian Li,
  • Weijian Zhang,
  • Junfeng Ren,
  • Weilin Yu,
  • Guowei Wang,
  • Peng Ding,
  • Jiawei Wang and
  • Xuen Zhang

With the global population growth and increasing food demand, the development of precision agriculture has become particularly critical. In precision agriculture, accurately identifying areas of nitrogen stress in crops and planning precise fertiliza...

  • Communication
  • Open Access
2 Citations
4,290 Views
12 Pages

18 December 2023

This paper studies the tactical decision-making model of short track speed skating based on deep reinforcement learning, so as to improve the competitive performance of corresponding short track speed skaters. Short track speed skating, a traditional...

  • Article
  • Open Access
13 Citations
3,171 Views
28 Pages

27 May 2023

With the accelerated development of smart cities, the concept of a “smart industrial park” in which unmanned ground vehicles (UGVs) have wide application has entered the industrial field of vision. When faced with multiple tasks and heter...

  • Article
  • Open Access
22 Citations
4,429 Views
23 Pages

14 December 2021

Unmanned aerial vehicle (UAV) clusters usually face problems such as complex environments, heterogeneous combat subjects, and realistic interference factors in the course of mission assignment. In order to reduce resource consumption and improve the...

  • Article
  • Open Access
31 Citations
5,349 Views
21 Pages

14 June 2023

In order to improve the efficiency and adaptability of cognitive radar jamming decision-making, a fusion algorithm (Ant-QL) based on ant colony and Q-Learning is proposed in this paper. The algorithm does not rely on a priori information and enhances...

  • Article
  • Open Access
19 Citations
3,780 Views
14 Pages

11 March 2021

In this paper, we address the application of the flying Drone Base Stations (DBS) in order to improve the network performance. Given the high degrees of freedom of a DBS, it can change its position and adapt its trajectory according to the users move...

  • Article
  • Open Access
845 Views
10 Pages

29 August 2025

Vehicle routing improvement has become a vital topic in modern transport digitalization projects. Presently, there are no fully adapted techniques to offer optimal solutions for finding the best routes that include all visiting locations, considering...

  • Article
  • Open Access
2 Citations
1,554 Views
15 Pages

2 December 2024

Due to the severe threats posed by smart jammers, anti-jamming decision making has become an essential technology for wireless communications. Most of the existing anti-jamming decision-making approaches have adopted Q-Learning to improve accuracy. H...

  • Article
  • Open Access
6 Citations
2,404 Views
18 Pages

Intelligent Robot in Unknown Environments: Walk Path Using Q-Learning and Deep Q-Learning

  • Mouna El Wafi,
  • My Abdelkader Youssefi,
  • Rachid Dakir and
  • Mohamed Bakir

Autonomous navigation is essential for mobile robots to efficiently operate in complex environments. This study investigates Q-learning and Deep Q-learning to improve navigation performance. The research examines their effectiveness in complex maze c...

  • Article
  • Open Access
6 Citations
3,154 Views
16 Pages

12 February 2022

Compared to model-based Reinforcement Learning (RL) approaches, model-free RL algorithms, such as Q-learning, require less space and are more expressive, since specifying value functions or policies is more flexible than specifying the model for the...

  • Article
  • Open Access
37 Citations
4,513 Views
16 Pages

A Self-Adaptive Reinforcement-Exploration Q-Learning Algorithm

  • Lieping Zhang,
  • Liu Tang,
  • Shenglan Zhang,
  • Zhengzhong Wang,
  • Xianhao Shen and
  • Zuqiong Zhang

11 June 2021

Directing at various problems of the traditional Q-Learning algorithm, such as heavy repetition and disequilibrium of explorations, the reinforcement-exploration strategy was used to replace the decayed ε-greedy strategy in the traditional Q-Learning...

  • Article
  • Open Access
5 Citations
4,500 Views
20 Pages

4 June 2025

This paper introduces a pioneering hybrid framework that integrates Q-learning/deep Q-network (DQN) with a locally deployed large language model (LLM) to enhance obstacle avoidance in embedded robotic systems. The STM32WB55RG microcontroller handles...

  • Article
  • Open Access
1,872 Views
16 Pages

28 April 2023

This paper presents a Q-learning-based pending zone adjustment for received signal strength indicator (RSSI)-based proximity classification (QPZA). QPZA aims to improve the accuracy of RSSI-based proximity classification by adaptively adjusting the s...

  • Article
  • Open Access
5 Citations
3,072 Views
20 Pages

30 August 2020

In order to maximize energy efficiency in heterogeneous networks (HetNets), a turbo Q-Learning (TQL) combined with multistage decision process and tabular Q-Learning is proposed to optimize the resource configuration. For the large dimensions of acti...

  • Article
  • Open Access
12 Citations
3,453 Views
20 Pages

An accurate mathematical model is a basis for controlling and estimating the state of an Autonomous underwater vehicle (AUV) system, so how to improve its accuracy is a fundamental problem in the field of automatic control. However, AUV systems are c...

  • Article
  • Open Access
1 Citations
2,305 Views
21 Pages

24 August 2023

In high-speed railway operational monitoring network systems targeting railway infrastructure as its monitoring objective, there is a wide variety of sensor types with diverse operational requirements. These systems have varying demands on data trans...

  • Article
  • Open Access
1,151 Views
16 Pages

AI-Based Intelligent System for Personalized Examination Scheduling

  • Marco Barone,
  • Muddasar Naeem,
  • Matteo Ciaschi,
  • Giancarlo Tretola and
  • Antonio Coronato

Artificial Intelligence (AI) has brought a revolution in many areas, including the education sector. It has the potential to improve learning practices, innovate teaching, and accelerate the path towards personalized learning. This work introduces Re...

  • Article
  • Open Access
6 Citations
3,452 Views
27 Pages

27 October 2022

Dealing with the packet-routing problem is challenging in the V2X (Vehicle-to-Everything) network environment, where it suffers from the high mobility of vehicles and varied vehicle density at different times. Many related studies have been proposed...

  • Article
  • Open Access
31 Citations
5,301 Views
18 Pages

Q-LBR: Q-Learning Based Load Balancing Routing for UAV-Assisted VANET

  • Bong-Soo Roh,
  • Myoung-Hun Han,
  • Jae-Hyun Ham and
  • Ki-Il Kim

5 October 2020

Although various unmanned aerial vehicle (UAV)-assisted routing protocols have been proposed for vehicular ad hoc networks, few studies have investigated load balancing algorithms to accommodate future traffic growth and deal with complex dynamic net...

  • Article
  • Open Access
7 Citations
2,433 Views
18 Pages

18 August 2023

In recent years, integration of production scheduling and machine maintenance has gained increasing attention in order to improve the stability and efficiency of flowshop manufacturing systems. This paper proposes a Q-learning-based aquila optimizer...

  • Article
  • Open Access
2 Citations
1,995 Views
23 Pages

3 November 2023

To reduce the economic losses caused by debt evasion amongst lost-link borrowers (LBs) and improve the efficiency of finding information on LBs, this paper focuses on the cross-platform information collaborative search optimization problem for LBs. G...

  • Article
  • Open Access
6 Citations
2,870 Views
19 Pages

Real-Time Data Transmission Scheduling Algorithm for Wireless Sensor Networks Based on Deep Q-Learning

  • Aiqi Zhang,
  • Meiyi Sun,
  • Jiaqi Wang,
  • Zhiyi Li,
  • Yanbo Cheng and
  • Cheng Wang

In the industrial environment, the data transmission of Wireless Sensor Networks (WSNs) usually has strict deadline requirements. Improving the reliability and real-time performance of data transmission has become one of the critical issues in WSNs r...

  • Article
  • Open Access
15 Citations
2,486 Views
23 Pages

19 January 2024

This study focuses on the scheduling problem of heterogeneous unmanned surface vehicles (USVs) with obstacle avoidance pretreatment. The goal is to minimize the overall maximum completion time of USVs. First, we develop a mathematical model for the p...

  • Article
  • Open Access
5 Citations
2,526 Views
29 Pages

20 April 2024

Stagnation at local optima represents a significant challenge in bio-inspired optimization algorithms, often leading to suboptimal solutions. This paper addresses this issue by proposing a hybrid model that combines the Orca predator algorithm with d...

  • Article
  • Open Access
4 Citations
2,421 Views
19 Pages

14 May 2023

Path planning in complex environments remains a challenging task for unmanned vehicles. In this paper, we propose a decoupled path-planning algorithm with the help of a deep reinforcement learning algorithm that separates the evaluation of paths from...

  • Article
  • Open Access
8 Citations
2,969 Views
28 Pages

18 January 2023

Q-learning has been primarily used as one of the reinforcement learning (RL) techniques to find the optimal routing path in wireless sensor networks (WSNs). However, for the centralized RL-based routing protocols with a large state space and action s...

  • Article
  • Open Access
4 Citations
2,625 Views
12 Pages

25 March 2024

This research paper presents the Buckley-James Q-learning (BJ-Q) algorithm, a cutting-edge method designed to optimize personalized treatment strategies, especially in the presence of right censoring. We critically assess the algorithm’s effect...

  • Article
  • Open Access
21 Citations
7,158 Views
50 Pages

21 August 2024

This paper presents a comprehensive study on the optimization of electric vehicle (EV) battery management using Q-learning, a powerful reinforcement learning technique. As the demand for electric vehicles continues to grow, there is an increasing nee...

of 18