You are currently on the new version of our website. Access the old version .

83 Results Found

  • Article
  • Open Access
5 Citations
4,048 Views
20 Pages

22 March 2021

Screening for cervical cancer is a critical policy that requires clinical and managerial vigilance because of its serious health consequences. Recently the practice of conducting simultaneous tests of cytology and Human Papillomavirus (HPV)-DNA testi...

  • Article
  • Open Access
5 Citations
2,329 Views
30 Pages

The fairness problem in the IOTA (Internet of Things Application) Tangle network has significant implications for transaction efficiency, scalability, and security, particularly concerning orphan transactions and lazy tips. Traditional tip selection...

  • Article
  • Open Access
1 Citations
2,765 Views
16 Pages

5 October 2018

In this paper, we investigate jamming attacks in the physical layer against cooperative communications networks, where a jammer tries to block the data communication between the source and destination. An energy-constrained relay is able to assist th...

  • Article
  • Open Access
18 Citations
4,226 Views
18 Pages

Improved Deep Recurrent Q-Network of POMDPs for Automated Penetration Testing

  • Yue Zhang,
  • Jingju Liu,
  • Shicheng Zhou,
  • Dongdong Hou,
  • Xiaofeng Zhong and
  • Canju Lu

14 October 2022

With the development of technology, people’s daily lives are closely related to networks. The importance of cybersecurity protection draws global attention. Automated penetration testing is the novel method to protect the security of networks,...

  • Article
  • Open Access
3 Citations
3,290 Views
24 Pages

29 April 2021

Decentralized partially observable Markov decision process (DEC-POMDP) models sequential decision making problems by a team of agents. Since the planning of DEC-POMDP can be interpreted as the maximum likelihood estimation for the latent variable mod...

  • Article
  • Open Access
5 Citations
3,661 Views
21 Pages

1 February 2022

During the process of disease diagnosis, overdiagnosis can lead to potential health loss and unnecessary anxiety for patients as well as increased medical costs, while underdiagnosis can result in patients not being treated on time. To deal with thes...

  • Article
  • Open Access
8 Citations
5,176 Views
18 Pages

27 February 2023

Current electronic warfare jammers and radar countermeasures are characterized by dynamism and uncertainty. This paper focuses on a decision-making framework of radar anti-jamming countermeasures. The characteristics and implementation process of rad...

  • Feature Paper
  • Article
  • Open Access
7 Citations
6,797 Views
26 Pages

UAV4PE: An Open-Source Framework to Plan UAV Autonomous Missions for Planetary Exploration

  • Julian Galvez-Serna,
  • Fernando Vanegas,
  • Shahzad Brar,
  • Juan Sandino,
  • David Flannery and
  • Felipe Gonzalez

2 December 2022

Autonomous Unmanned Aerial Vehicles (UAV) for planetary exploration missions require increased onboard mission-planning and decision-making capabilities to access full operational potential in remote environments (e.g., Antarctica, Mars or Titan). Ho...

  • Review
  • Open Access
4,873 Views
36 Pages

Decision-Making for Path Planning of Mobile Robots Under Uncertainty: A Review of Belief-Space Planning Simplifications

  • Vineetha Malathi,
  • Pramod Sreedharan,
  • Rthuraj P R,
  • Vyshnavi Anil Kumar,
  • Anil Lal Sadasivan,
  • Ganesha Udupa,
  • Liam Pastorelli and
  • Andrea Troppina

15 September 2025

Uncertainty remains a central challenge in robotic navigation, exploration, and coordination. This paper examines how Partially Observable Markov Decision Processes (POMDPs) and their decentralized variants (Dec-POMDPs) provide a rigorous foundation...

  • Review
  • Open Access
8 Citations
8,824 Views
16 Pages

The two-part series of papers provides a survey on recent advances in Deep Reinforcement Learning (DRL) for solving partially observable Markov decision processes (POMDP) problems. Reinforcement Learning (RL) is an approach to simulate the human’s na...

  • Review
  • Open Access
58 Citations
18,471 Views
28 Pages

The first part of a two-part series of papers provides a survey on recent advances in Deep Reinforcement Learning (DRL) applications for solving partially observable Markov decision processes (POMDP) problems. Reinforcement Learning (RL) is an approa...

  • Article
  • Open Access
21 Citations
4,803 Views
23 Pages

21 August 2020

The problem of multi-agent remote sensing for the purposes of finding survivors or surveying points of interest in GPS-denied and partially observable environments remains a challenge. This paper presents a framework for multi-agent target-finding us...

  • Article
  • Open Access
33 Citations
10,649 Views
30 Pages

Drone-Based Autonomous Motion Planning System for Outdoor Environments under Object Detection Uncertainty

  • Juan Sandino,
  • Frederic Maire,
  • Peter Caccetta,
  • Conrad Sanderson and
  • Felipe Gonzalez

8 November 2021

Recent advances in autonomy of unmanned aerial vehicles (UAVs) have increased their use in remote sensing applications, such as precision agriculture, biosecurity, disaster monitoring, and surveillance. However, onboard UAV cognition capabilities for...

  • Article
  • Open Access
39 Citations
5,500 Views
17 Pages

Autonomous Search of Radioactive Sources through Mobile Robots

  • Jianwen Huo,
  • Manlu Liu,
  • Konstantin A. Neusypin,
  • Haojie Liu,
  • Mingming Guo and
  • Yufeng Xiao

19 June 2020

The research of robotic autonomous radioactivity detection or radioactive source search plays an important role in the monitoring and disposal of nuclear safety and biological safety. In this paper, a method for autonomously searching for radioactive...

  • Article
  • Open Access
46 Citations
13,450 Views
17 Pages

10 May 2016

Unmanned Aerial Vehicles (UAV) can navigate with low risk in obstacle-free environments using ground control stations that plan a series of GPS waypoints as a path to follow. This GPS waypoint navigation does however become dangerous in environments...

  • Article
  • Open Access
7 Citations
3,362 Views
21 Pages

Wireless Body Area Network Control Policies for Energy-Efficient Health Monitoring

  • Yair Bar David,
  • Tal Geller,
  • Ilai Bistritz,
  • Irad Ben-Gal,
  • Nicholas Bambos and
  • Evgeni Khmelnitsky

21 June 2021

Wireless body area networks (WBANs) have strong potential in the field of health monitoring. However, the energy consumption required for accurate monitoring determines the time between battery charges of the wearable sensors, which is a key performa...

  • Article
  • Open Access
2 Citations
2,531 Views
25 Pages

A POMDP Approach to Map Victims in Disaster Scenarios

  • Pedro Gabriel Villani and
  • Paulo Sergio Cugnasca

7 November 2024

Background: The rise in natural and man-made disasters has increased the need for effective search-and-rescue tools, particularly in resource-limited areas. Unmanned Aerial Vehicles (UAVs) are increasingly used for this purpose due to their flexibili...

  • Article
  • Open Access
2,451 Views
21 Pages

14 June 2022

The operation of a variety of natural or man-made systems subject to uncertainty is maintained within a range of safe behavior through run-time sensing of the system state and control actions selected according to some strategy. When the system is ob...

  • Article
  • Open Access
2 Citations
4,434 Views
19 Pages

27 April 2020

We consider a robot that must sort objects transported by a conveyor belt into different classes. Multiple observations must be performed before taking a decision on the class of each object, because the imperfect sensing sometimes detects the incorr...

  • Article
  • Open Access
3,710 Views
19 Pages

Multi-Agent Planning under Uncertainty with Monte Carlo Q-Value Function

  • Jian Zhang,
  • Yaozong Pan,
  • Ruili Wang,
  • Yuqiang Fang and
  • Haitao Yang

4 April 2019

Decentralized partially observable Markov decision processes (Dec-POMDPs) are general multi-agent models for planning under uncertainty, but are intractable to solve. Doubly exponential growth of the search space as the horizon increases makes a brut...

  • Article
  • Open Access
19 Citations
5,994 Views
28 Pages

Energy-Efficient Channel Handoff for Sensor Network-Assisted Cognitive Radio Network

  • Muhammad Usman,
  • Muhammad Sajjad Khan,
  • Hiep Vu-Van and
  • Koo Insoo

23 July 2015

The visiting and less-privileged status of the secondary users (SUs) in a cognitive radio network obligates them to release the occupied channel instantly when it is reclaimed by the primary user. The SU has a choice to make: either wait for the chan...

  • Article
  • Open Access
732 Views
27 Pages

7 October 2025

Content delivery networks (CDNs) face steadily rising, uneven demand, straining heuristic cache replacement. Reinforcement learning (RL) is promising, but most work assumes a fully observable Markov Decision Process (MDP), unrealistic under delayed,...

  • Article
  • Open Access
83 Citations
12,855 Views
31 Pages

UAV Framework for Autonomous Onboard Navigation and People/Object Detection in Cluttered Indoor Environments

  • Juan Sandino,
  • Fernando Vanegas,
  • Frederic Maire,
  • Peter Caccetta,
  • Conrad Sanderson and
  • Felipe Gonzalez

16 October 2020

Response efforts in emergency applications such as border protection, humanitarian relief and disaster monitoring have improved with the use of Unmanned Aerial Vehicles (UAVs), which provide a flexibly deployed eye in the sky. These efforts have been...

  • Article
  • Open Access
2 Citations
2,892 Views
28 Pages

30 July 2023

Coordinating multiple unmanned aerial vehicles (UAVs) for the purposes of target finding or surveying points of interest in large, complex, and partially observable environments remains an area of exploration. This work proposes a modeling approach a...

  • Article
  • Open Access
13 Citations
4,225 Views
20 Pages

Deep Reinforcement Learning for Attacking Wireless Sensor Networks

  • Juan Parras,
  • Maximilian Hüttenrauch,
  • Santiago Zazo and
  • Gerhard Neumann

12 June 2021

Recent advances in Deep Reinforcement Learning allow solving increasingly complex problems. In this work, we show how current defense mechanisms in Wireless Sensor Networks are vulnerable to attacks that use these advances. We use a Deep Reinforcemen...

  • Article
  • Open Access
11 Citations
5,935 Views
14 Pages

27 January 2023

Condition-based maintenance (CBM) scheduling of an aircraft fleet in a disruptive environment while considering health prognostics for a set of systems is a very complex combinatorial problem, which is becoming more challenging in light of the uncert...

  • Article
  • Open Access
1 Citations
1,171 Views
21 Pages

Assisted-Value Factorization with Latent Interaction in Cooperate Multi-Agent Reinforcement Learning

  • Zhitong Zhao,
  • Ya Zhang,
  • Siying Wang,
  • Yang Zhou,
  • Ruoning Zhang and
  • Wenyu Chen

27 April 2025

With the development of value decomposition methods, multi-agent reinforcement learning (MARL) has made significant progress in balancing autonomous decision making with collective cooperation. However, the collaborative dynamics among agents are con...

  • Article
  • Open Access
5 Citations
4,630 Views
25 Pages

15 July 2018

The full-duplex transmission protocol has been widely investigated in the literature in order to improve radio spectrum usage efficiency. Unfortunately, due to the effect of imperfect self-interference suppression, the change in transmission power an...

  • Article
  • Open Access
323 Views
25 Pages

29 December 2025

Moving-target detection under strict sensing constraints is a recurring subproblem in surveillance, search-and-rescue, and autonomous robotics. We study a canonical one-dimensional finite grid in which a sensor probes one location per time step with...

  • Article
  • Open Access
1,117 Views
23 Pages

FRMA: Four-Phase Rapid Motor Adaptation Framework

  • Xiangbei Liu,
  • Chang Lu,
  • Hui Wu,
  • Bo Hu,
  • Xutong Li,
  • Zongyuan Li and
  • Xian Guo

25 September 2025

In many real-world control tasks, agents operate under partial observability, where access to complete state information is limited or corrupted by noise. This poses significant challenges for reinforcement learning algorithms, as methods relying on...

  • Article
  • Open Access
8 Citations
1,835 Views
14 Pages

9 October 2024

Aiming to improve the efficiency of the online process in path planning, a novel searching method is proposed based on environmental information analysis. Firstly, a search and rescue (SAR) environmental model and an unmanned ground vehicle (UGV) mot...

  • Article
  • Open Access
4 Citations
5,060 Views
29 Pages

17 November 2024

Deep reinforcement learning (DRL)-based navigation in an environment with dynamic obstacles is a challenging task due to the partially observable nature of the problem. While DRL algorithms are built around the Markov property (assumption that all th...

  • Feature Paper
  • Article
  • Open Access
3 Citations
6,105 Views
33 Pages

Introducing ActiveInference.jl: A Julia Library for Simulation and Parameter Estimation with Active Inference Models

  • Samuel William Nehrer,
  • Jonathan Ehrenreich Laursen,
  • Conor Heins,
  • Karl Friston,
  • Christoph Mathys and
  • Peter Thestrup Waade

12 January 2025

We introduce a new software package for the Julia programming language, the library ActiveInference.jl. To make active inference agents with Partially Observable Markov Decision Process (POMDP) generative models available to the growing research comm...

  • Article
  • Open Access
16 Citations
3,442 Views
20 Pages

16 February 2022

Compressed sensing (CS)-based frequency agile radar (FAR) is attractive due to its superior data rate and target measurement performance. However, traditional frequency strategies for CS-based FAR are not cognitive enough to adapt well to the increas...

  • Article
  • Open Access
2 Citations
1,194 Views
17 Pages

29 September 2024

Inducing learning strategies is a crucial component of intelligent tutoring systems. Previous research has predominantly focused on the induction of offline learning strategies. Although the existing offline learning strategy induction methods can al...

  • Article
  • Open Access
3 Citations
1,808 Views
27 Pages

12 September 2024

Directional unmanned aerial vehicle (UAV) ad hoc networks (DUANETs) are widely applied due to their high flexibility, strong anti-interference capability, and high transmission rates. However, within directional networks, complex mutual interference...

  • Article
  • Open Access
1 Citations
1,378 Views
24 Pages

Enhancing Geomagnetic Navigation with PPO-LSTM: Robust Navigation Utilizing Observed Geomagnetic Field Data

  • Xiaohui Zhang,
  • Wenqi Bai,
  • Jun Liu,
  • Songnan Yang,
  • Ting Shang and
  • Haolin Liu

13 June 2025

Geospatial navigation in GPS-denied environments presents significant challenges, particularly for autonomous vehicles operating in complex, unmapped regions. We explore the Earth’s geomagnetic field, a globally distributed and naturally occurr...

  • Article
  • Open Access
16 Citations
4,465 Views
18 Pages

4 June 2021

During the last decades, collaborative robots capable of operating out of their cages are widely used in industry to assist humans in mundane and harsh manufacturing tasks. Although such robots are inherently safe by design, they are commonly accompa...

  • Article
  • Open Access
3 Citations
2,197 Views
24 Pages

16 November 2023

Autonomous shuttles have been used as end-mile solutions for smart mobility in smart cities. The urban driving conditions of smart cities with many other actors sharing the road and the presence of intersections have posed challenges to the use of au...

  • Article
  • Open Access
2,047 Views
20 Pages

A Secure GNN Training Framework for Partially Observable Graph

  • Dongdong An,
  • Yi Yang,
  • Wenyan Liu,
  • Qin Zhao,
  • Jing Liu,
  • Hongda Qi and
  • Jie Lian

Graph Neural Networks (GNNs) are susceptible to adversarial injection attacks, potentially compromising the model integrity, reducing accuracy, and posing security risks. However, most of the current countermeasures focus on enhancing the robustness...

  • Article
  • Open Access
43 Citations
6,851 Views
17 Pages

UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy

  • Jingyi Xie,
  • Xiaodong Peng,
  • Haijiao Wang,
  • Wenlong Niu and
  • Xiao Zheng

1 October 2020

Unmanned aerial vehicle (UAV) autonomous tracking and landing is playing an increasingly important role in military and civil applications. In particular, machine learning has been successfully introduced to robotics-related tasks. A novel UAV autono...

  • Article
  • Open Access
1 Citations
2,456 Views
20 Pages

18 July 2023

This paper proposes a continuous state partially observable Markov decision process (POMDP) model for the corrosion maintenance of oil and gas pipelines. The maintenance operations include complex and extensive activities to detect the corrosion type...

  • Article
  • Open Access
111 Citations
20,791 Views
23 Pages

20 December 2019

Penetration testing (also known as pentesting or PT) is a common practice for actively assessing the defenses of a computer network by planning and executing all possible attacks to discover and exploit existing vulnerabilities. Current penetration t...

  • Article
  • Open Access
1 Citations
1,896 Views
27 Pages

The warehouse picking process is one of the most critical components of logistics operations. Human–robot collaboration (HRC) is seen as an important trend in warehouse picking, as it combines the strengths of both humans and robots in the pick...

  • Article
  • Open Access
1 Citations
1,567 Views
19 Pages

20 February 2025

Spacecraft approaching maneuver control normally uses traditional control methods such as Proportional–Integral–Derivative (PID) or Model Predictive Control (MPC), which require meticulous system design and lack robustness against unknown...

  • Article
  • Open Access
3 Citations
5,930 Views
12 Pages

We present a deep reinforcement learning framework for an automatic trading of contracts for difference (CfD) on indices at a high frequency. Our contribution proves that reinforcement learning agents with recurrent long short-term memory (LSTM) netw...

  • Article
  • Open Access
2 Citations
1,276 Views
20 Pages

Energy Management for Distributed Carbon-Neutral Data Centers

  • Wenting Chang,
  • Chuyi Liu,
  • Guanyu Ren and
  • Jianxiong Wan

30 May 2025

With the continuous expansion of data centers, their carbon emission has become a serious issue. A number of studies are committing to reduce the carbon emission of data centers. Carbon trading, carbon capture, and power-to-gas technologies are promi...

  • Article
  • Open Access
1,472 Views
16 Pages

9 December 2024

As network systems become larger and more complex, there is an increasing focus on how to verify the security of systems that are at risk of being attacked. Automated penetration testing is one of the effective ways to achieve this. Uncertainty cause...

of 2