Next Article in Journal
Scale Considerations and the Quantification of the Degree of Fracturing for Geological Strength Index (GSI) Assessments
Previous Article in Journal
Special Issue on Production, Treatment, Utilization, and Future Opportunities of Biogas
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Managing Redundancy in a Multiple-Robot Orienteering Problem for Equilibrium and Robustness

College of Mechanical Engineering, Chongqing University of Technology, Chongqing 400054, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Appl. Sci. 2025, 15(15), 8217; https://doi.org/10.3390/app15158217
Submission received: 23 June 2025 / Revised: 16 July 2025 / Accepted: 17 July 2025 / Published: 24 July 2025
(This article belongs to the Special Issue Embodied Intelligence and Its Application in Robotics)

Abstract

In our work, the Robust Multiple-robot Orienteering Problem with Workload Balancing is constructed for the first time. Our primary contribution lies in the rigorous formulation of this problem as a three-stage optimization task. It leverages the Robust Multiple-robot Orienteering Problem (RMOP) as the initial stage. The Path Replanning stage and the workload balancing stage are introduced to minimize walk redundancy and achieve workload equilibrium. The resultant solution upholds the optimality inherent to the original RMOP. Additionally, we craft a suite of heuristic strategies to mitigate redundancy and employ Monte Carlo sampling to tackle the problem. Our algorithm analysis indicates that the method has asymptotic convergence properties and a feasible time complexity under certain conditions. Local parallelization of the algorithm can further improve its performance. Our simulation studies demonstrate that our approach can efficaciously attain a balance between robustness and workload without compromising performance in the presence of adversarial challenges.

1. Introduction

In various missions, whether drone is conducting post-disaster search and rescue operations [1,2,3,4] or a ground mobile robot is exploring and perceiving the surrounding environment [5,6,7,8,9], the initial goal usually involves exploring and mapping the environment. The task of an agent exploring its surroundings can be modeled as the Orienteering Problem (OP) in many scenarios [7,8,9]. OP refers to determining a walk with a given budget from a given starting point, maximizing the total reward collected along the walk [7]. The reward depends on the vertices visited along the walk. The focus of robotic exploring tasks is gradually shifting to unmanned systems composed of multiple robots. Such autonomous systems can help solve new problems or improve existing solutions. In exploring unknown environments, using multiple robots can shorten the overall exploration time, improve accuracy, and be more robust to failures [10,11,12,13]. In [14], a Multiple-robot Orienteering Problem (MOP) aims to design paths for n robots, maximizing the sum of rewards collected by all robots. Recently, Shi et al. proposed the Robust Multiple-robot Orienteering Problem (RMOP) [15], which further considers robots operating in adversarial or fail-prone environments.
The applications of this methodology extend far beyond military domains, holding significant potential in numerous civilian scenarios as well. For instance, in a large-scale post-disaster search and rescue operation, as illustrated in Figure 1, the disaster area can be abstracted as a network where nodes and links represent the critical elements of the rescue mission. The nodes are categorized into two types: target points for searching for survivors (mission points) and waypoints that serve as relays. In the complex and hazardous post-disaster environment, direct communication and passage between mission points are often obstructed, necessitating that the drone fleet first establishes a reliable communication relay network. These relay points can also be assigned additional functions, such as serving as temporary drone maintenance, recharging, or data-uplink stations. By assigning differentiated reward values to different types of nodes (both mission and relay points), they can be handled uniformly. Considering the urgency of disaster rescue missions and the dynamic nature of the environment (e.g., aftershocks, building collapses), the planning of rescue routes must possess a high degree of robustness. Therefore, the RMOP methodology can provide powerful decision support for such tasks, ensuring that in a worst-case scenario where some drones fail or lose contact, the overall mission reward (such as search and rescue coverage and communication network integrity) is still maximized. Beyond disaster response [4,16,17], this robust multi-robot path planning method shows broad application prospects in various other fields, including environmental monitoring [15,18], precision agriculture [18], logistics and transportation [7,19], and tourist route planning [7,19].
RMOP focuses on selecting collections that can withstand the worst-case scenario of removing certain subsets of projects. Under certain circumstances, RMOP can effectively balance the trade-off between excessive overlap (resulting in insufficient reward) and insufficient overlap (resulting in insufficient redundancy). However, its effectiveness diminishes for sparse graphs—a common characteristic of networks formed by drones operating deep within enemy territory with minimal connections. To overcome these challenges, we propose the Robust Multiple-robot Orienteering Problem with Workload Balancing (RMOPWB). It can realize the redundancy balance of the system well.

1.1. Related Works

Although the OP is NP-hard [14], there are some approximation algorithms for many variants of OP. The review by Vansteenwegen et al. [7] provides a summary of various algorithms for OP and its variants. Blum et al. [14] have presented a polynomial-time 4–approximation for OP when the objective function is modular, which is extended to yield a 5–approximation for the MOP assuming all robots start at different vertices. If the reward function is submodular, Chekuri and Pal [20] present a recursive greedy algorithm for a single robot. In [21], Singh et al. demonstrated the utilization of OP and MOP for active information gathering. Their algorithms sequentially determine paths for each robot using the single-robot algorithms by Blum et al. [14] and Chekuri and Pal [20] as subroutines. Atanasov et al. [22] recently presented a decentralized version for multi-robot information gathering following similar principles to [21]. Recently, Jorgensen et al. introduced the Matroids Team-Surviving Orienteers Problem (MTSO) [8], which takes into account individual robot failures by assuming a given probability of failure associated with every edge in the environment. Robust submodular maximization [23,24,25,26,27] focuses on selecting sets that can withstand the worst-case removal of some subset of items. Using robust submodular maximization, RMOP considers worst-case failures and is better suited for operation in adversarial or stochastic conditions when seeking worst-case guarantees.
The assignment of multiple targets to each robot is often treated as a partitioning problem in multi-robot exploration research. A simplified variant of this problem is to assign a single task to each robot, which has been widely studied in other fields. The Hungarian method can optimally solve this problem in polynomial time, and the authors of [28] have applied this method to multi-robot exploration. As described in [29], the issue for the effective coordination of multi-robot systems is the allocation of target points so that each robot can explore different regions of the environment. In [30], the authors use the MiniMax criterion to minimize overall exploration time and use minimum spanning trees (MSTs) to approximate the path costs of robots.

1.2. Contributions

This paper, for the first time, simultaneously considers robustness and load balancing in orienteering tasks in multi-robot systems, presenting a comprehensive approach that enhances the resilience and efficiency of these systems. Our method is validated in a high-fidelity simulation environment. A sample of task allocation results is shown in Figure 2, wherein the environment is explored by six UAVs with the proposed method.
Our principal contributions are highlighted as follows:
  • Path Replanning for Workload Management. We introduce a Path Replanning stage, specifically designed to optimize the total workload of multi-robot systems, which we mathematically formalize as Problem 2.
  • Load Balancing for Equitable Distribution. We introduce a workload balancing stage, which we formulate as Problem 3. The complete framework, comprising three sequential stages (Problem 1, Problem 2, and Problem 3), is termed RMOPWB. This framework has been empirically validated through numerical simulations, demonstrating its effectiveness in achieving balanced workload distribution.
  • Efficient Algorithm for Complex Optimization. To address the complexity of this optimization problem, we propose an efficient algorithm that combines heuristic methods with Monte Carlo sampling. This approach dramatically reduces redundant operational paths while preserving essential redundancy to enhance system robustness.
The remainder of this paper is organized as follows. Section 2 establishes the theoretical foundation for our work. We begin in Section 2.1 with a formal review of the existing RMOP. In Section 2.2, we introduce our novel problem formulation, the RMOPWB, which explicitly addresses load distribution among agents. Section 2.3 is then dedicated to detailing our proposed algorithm for solving the RMOPWB. Subsequently, Section 3 presents the results of extensive numerical simulations designed to validate the performance and efficiency of our proposed method. Finally, Section 4 concludes the paper with a summary of our key findings and a discussion of potential avenues for future research.

2. Materials and Methods

2.1. Review of RMOP

We use calligraphic letters to denote sets (e.g., A ). Given a set B , A B denotes the set of elements in A but not in B . For elements u and v, { u , v } denotes an unordered pair and u , v denotes an ordered pair.
Definition 1
(Submodularity). For a set V , a function f : 2 V R is submodular if and only if for any sets A V and B V , we have f ( A ) + f ( B ) f ( A B ) + f ( A B ) .
Definition 2
(Normalized Monotonicity). For a set V , a function f : 2 V R is called normalized, monotonically non-decreasing if and only if for any A B V , we have f ( A ) f ( B ) and A = if and only if f ( A ) = 0 .
As a shorthand, we refer to a normalized, monotonically non-decreasing function as a monotone function. Let G ( V , E ) be a graph where V is a nonempty finite set and E is a set of unordered pairs of distinct elements of V . The vertex u and the edge e are incident if u e . A walk of a graph G is an alternating sequence of vertices and edges v 0 , e 1 , v 1 , e 2 , v 2 , , e m , v m beginning and ending with vertices such that each edge e i is incident to v i 1 and v i . The walk is also denoted by W = v 0 , v 1 , , v m . Let T denote the set of all walks. The set of all connected induced subgraphs of W is denoted by I ( W ) . We use V W to represent the vertices of W and E W : = { { v 0 , v 1 } , , { v m 1 , v m } } to denote its edges. Thus, W corresponds to the subgraph G ( V W , E W ) , and for convenience, we may also refer to W itself as this subgraph. The cost of a walk W , denoted by C ( W ) , is the sum of the edge weights along the walk. The edge weights are assumed to be metric. The walk of robot i, denoted by W i , must begin at a specific vertex v s i . The reward function g ( V ) : 2 V R + , V V is assumed to be a monotone submodular function. Let S be the union of all n walks of the robots in the team, i.e., S = { W 1 , W 2 , , W n } . The team reward f ( S ) : 2 S R + , S S is given by
f ( S ) = g W i S V W i .
Multiple robots can visit the same vertex, but only one visit is accounted for when computing the reward of the team. The reward of a vertex cannot be accumulated twice for the reward of the team. In [15], there is the mathematical representation of the RMOP:
Problem 1
(Stage #1). Given a graph G ( V , E ) , n robots with starting positions { v s 1 , v s 2 , , v s n } , a budget constraint B, a robot reward function g ( V ) : 2 V R + , and a team reward function f as defined in (1), the Robust Multiple-robot Orienteering Problem seeks to find a collection of n walks S 1 = { W 1 , W 2 , , W n } that are robust to the worst-case failure of α robots:
max S T min A S f ( S A ) s . t . | S | = n , | A | α , 0 < α < n C ( W j ) B , j
where v s j must additionally be the starting vertex when constructing a walk W j for robot j.

2.2. Problem Formulation

The concept of system workload is pivotal in ensuring the longevity and sustainability of a system, particularly within the context of multi-robot operations. This consideration encompasses two principal facets: the aggregate workload, referred to as total workload, and the distribution of this workload, known as Workload Balance. While robustness in system design often necessitates redundancy to enhance resilience, the efficient management of the system load demands a reduction in such redundancy. Striking an equilibrium between these two objectives is the key to achieving redundancy balance. Moreover, the resilience of the system in the face of adversarial attacks is quantified by a function denoted as
R d ( S , α ) = min A S f ( S A ) , | A | α
which represents the cumulative rewards garnered by the robots after an attack by α adversaries.

2.2.1. Minimizing Total Workload

To minimize the total workload, the initial step is termed Path Replanning (stage #2 in Figure 3). This stage must satisfy certain criteria to be formally represented. Following stage #1, there emerges a set of n walks, denoted as { W 1 , W 2 , , W n } . For each walk W i , the objective is to eliminate unnecessary vertices while ensuring that the resulting condensed connected graph remains a subgraph of W i . Drawing on the concept of an induced subgraph, the post-replanning walk W i should correspond to a variant of the Traveling Salesman Problem (TSP) walk on some induced subgraph. Let T ( I i ) denote a variant TSP walk on a connected undirected graph I i . The walk T ( I i ) starts at vertex v s i and traverses all vertices in I i without returning to the starting vertex, optimizing the budget expenditure for robot i. The set of reconfigured walks { W 1 , W 2 , , W n } must satisfy the following four conditions:
(1)
Budget constraint: C ( W i ) B , i { 1 , , n } .
(2)
TSP-variant walk: W i = T ( I i ) , I i I ( W i ) , i .
(3)
Reward preservation: R d ( W i , α ) = R d ( W i , α ) , i .
It can be shown that the walks { W i } must be optimal for stage #1. Through the above discussion, we formalize the Path Replanning stage:
Problem 2
(Stage #2). After stage #1, we obtain a collection of n walks, denoted as S 1 = { W 1 , W 2 , , W n } . The objective function for stage #1 reaches its optimal value R. The Path Replanning problem is formulated as follows:
min I i I ( W i ) C ( T ( I 1 ) ) + C ( T ( I 2 ) ) + + C ( T ( I n ) ) s . t . min A f ( D A ) = R , | A | α , A D D = { T ( I 1 ) , , T ( I n ) } , C ( T ( I i ) ) B . i
The optimal solution { I 1 , I 2 , , I n } yields a new set of n walks denoted as S 2 = { T ( I 1 ) , , T ( I n ) } . For notational convenience, we may refer to { I 1 , I 2 , , I n } simply as S 2 .
Example 1.
As illustrated in Figure 3, the red robot in subfigure (1-1) traverses a path in 5 steps. However, the final step proves redundant as it duplicates the path of the yellow robot, creating unnecessary overlap. By eliminating this superfluous movement, the solution maintains its optimality for Problem 1. This demonstrates that removing such redundancies preserves the solution’s optimal characteristics.
Remark 1.
To mitigate excessive redundancy, one could impose budget constraints B on individual robots. Although this approach effectively prevents path overpopulation, it requires a priori budget allocation for each robot. Furthermore, in multi-robot systems with heterogeneous budget requirements, this method demonstrates limited adaptability to dynamic environments with frequent budget variations. In contrast, Problem 2’s solution achieves superior redundancy balancing.

2.2.2. Achieve Workload Balancing

Now considering workload balancing (represented as stage #3 in Figure 3), we proceed with the formalization. Let II ( S 1 ) denote the set of optimal solutions from stage #2 for a given S 1 S 1 * , where S 1 * represents the optimal solution set of stage #1. The workload balancing stage is then formalized using the MiniMax criterion:
Problem 3
(Stage #3). Following stages #1 and #2, we obtain a set of n walks defined as S * : = S 1 S 1 * II ( S 1 ) . Employing the MiniMaxoptimization strategy coupled with a load function h : T R + , we identify solutions that achieve optimal load balancing:
min S 2 S * max W i S 2 h ( W i ) .
We may define the load function as the total budget expenditure of a single robot; i.e., let h ( W ) = C ( W ) . The integrated problem comprising stages #1–#3 is designated as the Robust Multiple Orienteering Problem with Workload Balancing (RMOPWB).
Example 2.
The budget variations of robotic walks from stage #1 to stage #2 are obtained by solving Problem 2, as shown in Figure 3:
subfigure ( 2 - 1 ) : ( 5 , 5 ) ( 4 , 3 ) , subfigure ( 2 - 2 ) : ( 5 , 5 ) ( 5 , 2 ) .
Subsequent solution of Problem 3 yields the configuration presented in subfigure (3-1).

2.3. Algorithm for RMOPWB

This section proposes an efficient method for solving RMOPWB. Figure 4 illustrates the entire process of the algorithm. In Algorithm 1, the main procedure of the algorithm for solving RMOPWB is shown. To achieve workload balancing, perform Monte Carlo sampling on the solution set S * . After setting the Monte Carlo sampling times K and the load function h, the algorithm executes K loops, and each loop obtains a sample (i.e., a solution to the first two stages of RMOPWB) through two functions (RMOP and PR). After sampling, calculate the load of each solution according to the load function h and select the most balanced solution.
The RMOP function serves as the solver for the RMOP (Stage #1). This stochastic solver typically generates multiple optimal RMOP solutions. The PR function, designed for stage #2, reconstructs input walks to minimize path redundancy through an optimization process. For a formal algorithm description, let L denote the workload balancing metric, derived from the load function h. The metric is formally expressed as L ( S ) = max W i S h ( W i ) .
Algorithm 1: Solve RMOPWB
Applsci 15 08217 i001
Algorithm 2: Function PR (Path Replanning)
Applsci 15 08217 i002
Algorithm 3: CutCircle and BackWalks
Applsci 15 08217 i003

2.3.1. Path Replan

Two heuristic methodologies are explored: one for solitary robotic operations and the other for collaborative robotic endeavors. These approaches are implemented through the CutCircle and BackWalks functions, respectively. Algorithm 2 formalizes the Path Replanning process using a while-loop structure, where each iteration processes a walk S 1 ( i ) via the CutCircle function. Following loop termination, the complete walk set undergoes optimization through the BackWalks function. Implementation details for both functions are provided in Algorithm 3.
  • Implementation of the CutCircle function. When a robot exhausts its traversal budget, it may return to a previously visited vertex. This revisit does not necessarily constitute redundancy—in some cases (e.g., example 3 in Figure 5), the robot may restart exploration in a new direction to access high-reward vertices. A cycle is identified when the current vertex connects back to a previously visited counterpart. Such a cycle is deemed redundant and removable only when all constituent vertices are revisits (as demonstrated in example 2 of Figure 5).
  • Implementation of the BackWalks function. In crowded environments with multiple robots, walk redundancy primarily manifests as inter-robot path overlaps. The solution involves backtracking walks from terminal vertices to starting points (see examples 1, 4, and 5 in Figure 5). The algorithm sequentially checks each walk’s terminal vertex for presence in other robots’ walks (example 4). For implementation, we must verify whether removing overlapping terminal vertices preserves solution optimality relative to Problem 1.

2.3.2. Algorithm Analysis

The guarantee of optimality for our proposed algorithm is conditional upon the characteristics of the employed RMOP Solver. As illustrated in the workflow of Figure 4 and the procedure in Algorithm 1, if the RMOP Solver is optimal, each execution will yield a random optimal solution S 1 , where S 1 S 1 * , the set of all optimal solutions for stage #1. Under this condition, our Monte Carlo sampling framework is theoretically guaranteed to identify the globally optimal RMOPWB solution. Specifically, for each random optimal solution drawn from the RMOP solver, there is a probability of at least 1 / | S 1 * | of obtaining the solution that corresponds to the global optimum of the RMOPWB problem. While this provides a theoretical guarantee, achieving the optimal trade-off between the sample size and the associated computational cost remains an essential practical consideration, particularly when the cardinality of S 1 * is large.
In Algorithm 1, the main computational expenses come from the execution of the functions denoted as RMOP and PR. The total computation time is K × ( R 1 + R 2 ) , where K symbolizes the quantity of Monte Carlo samples. R 1 and R 2 correspond to the respective computational time required for RMOP and PR. Upon the implementation of parallel sampling, the computation time is significantly reduced, culminating in an expression of R 1 + R 2 . R 2 consists of two processes: the CutCircle function and the BackWalks function. Suppose that basic operations like sorting and comparison take one unit of time. CutCircle determines for each vertex whether there is a repeated cycle. If there is, remove the cycle. So there is
C C ( W i ) = l = 1 M i ( l + l ) = M i ( M i + 1 ) .
In the above formula, M i represents the length of W i . C C ( W i ) represents the time taken by CutCircle to process the W i . Then consider an upper bound on the computation time of the BackWalks function:
B W = j = 1 n M j k 1 = 1 j 1 M k 1 + ( M j 1 ) + k 2 = j + 1 n M k 2 j = 1 n M j k = 1 n M k = j = 1 n M j 2 .
Utilizing (5) and (6), deduce the upper bound of the time complexity when employing parallelization techniques:
O ( R 1 + R 2 ) = O ( R 1 + i = 1 n C C ( W i ) + B W ) = O R 1 + 2 j = 1 n M j 2 .

3. Results

This section validates our proposed algorithm through a series of numerical simulations. All simulations are conducted on a workstation running Ubuntu 22.04, equipped with a 24-core Intel i9-13900HX CPU (2.20 GHz) and 32 GB RAM. The implementation is developed in Python 3.10.13 under the ROS2 Humble framework.

3.1. Simulation Setup

Simulation Environment: Our foundational simulations are conducted using randomly generated graphs. For more realistic validation, we also implement a higher-fidelity simulation environment in ROS2 Humble on Ubuntu 22.04. In these ROS2 scenarios, the mission topology graphs are predefined to model specific operational environments. It is important to note that these are not merely abstract graphs with uniform edge weights. For instance, the ’City’ environment (Figure 2) models a dense urban area where travel costs between tasks are non-uniform and are estimated via trajectory planning that accounts for obstacles and no-fly zones. Similarly, the ’Maze’ environment (Figure 6) introduces highly constrained navigation challenges, where direct paths are often impossible. This underlying complexity ensures our validation is performed on non-trivial, realistic problem instances.
Baseline algorithms: Given that the RMOPWB (Algorithm 1) is introduced in this paper, there exists no pre-existing efficient algorithm for comparative analysis. Consequently, the RMOPWB algorithm, which is an evolution of the established RMOP algorithm, is juxtaposed against its predecessor as a benchmark. This comparative framework serves to demonstrate the effectiveness of our proposed method. The Sequential Greedy Assignment (SGA) proposed in [15] is used as our RMOP Solver. The GCB algorithm [31] is employed as a subroutine to address the OP due to its efficiency and ease of implementation.

3.2. Simulation Instances

Figure 7a shows the first example. Due to the walks overlapping, we also show the walks in Table 1 in detail and calculate the total reward (Reward), total budget (Budget), and system Workload Balance value (WBV). In Table 1 and Figure 7(a2), the walks obtained by planning have enough redundancy to allow the robot system to deal with adversarial attacks. In Table 1, it can be observed that, while ensuring the total reward remains unchanged, our algorithm is capable of finding better walks. Figure 7(a3) presents a Path Replanning process that significantly reduces the redundancy of the original walks. The walks for robots 2 and 5 have been substantially shortened. These changes have directly led to a reduction in the total budget from 131.47 to 73.54 , while the workload is not well balanced. Figure 7(a4) demonstrates the outcome after workload balancing. At this stage, our algorithm shortens the walks for robot 3 from [ 13 , 8 , 20 , 5 , 20 , 3 ] to [ 13 , 8 ] by adding some walks to robot 2 and 5. Consequently, all walks are shortened to some extent, greatly balancing the workload of the system (the WB value drops from 21.69 to 13.73 ). Overall, from Figure 7a and Table 1, it can be seen that our algorithm is highly effective in achieving a balance between system robustness and system workload.
A second example is illustrated in Figure 7b and detailed in Table 2. Our algorithm effectively diminishes the total workload and achieves workload balancing, all while maintaining the original reward. A key observation is the variation in the set of attacked robots, as depicted in Figure 7b. In the scenario shown in Figure 7(b2), robots 0 and 5 are targeted. In contrast, Figure 7(b3,b4) show attacks on robot sets { 3 , 4 } and { 1 , 4 } , respectively. This indicates that the attacker’s actions in stages #2 and #3 are responsive to our path planning strategy. The attackers within our model are assumed to be rational, meaning they possess prior knowledge of our RMOPWB strategy. Conversely, in a situation involving irrational attackers, the optimization for stages #2 and #3 would disregard their presence, operating under the assumption that these attackers are only aware of the RMOP strategy.
We further validate our approach through simulations conducted in ROS2 for both the Maze environment shown in Figure 6 and the City environment from Figure 2. In these scenarios, a ground station assigns mission targets to the drone fleet, and the resulting assignments are visualized by the color of the target points. A more detailed comparison of the methods’ performance is presented in Table 3. The results in Table 3 clearly demonstrate that our RMOPWB algorithm consistently enables the system to collect greater rewards while simultaneously maintaining a lower total workload and achieving balanced load distribution. More importantly, the parallelized implementation of RMOPWB significantly enhances computational efficiency, though this typically requires a computational platform with greater parallel processing capabilities.

3.3. Simulation Results

Our multi-robot planning scenario is characterized by five key parameters, which fall into three distinct categories: system parameters (the number of robots and the individual robot budget), topological parameters (the number of vertices and edges), and an adversarial parameter (the number of attackers).
Figure 8 illustrates the influence of two critical parameters on the performance of the robotic system. In Figure 8a, the number of robots is varied from 2 to 6. The rewards remain consistent without a decrease. Notably, the RMOPWB algorithm demonstrates a substantial reduction in both the total budget and the load balancing value. Specifically, in the second subfigure of Figure 8a, as the number of robots increases, the budget required by the RMOP algorithm surges significantly, while the budget for the RMOPWB algorithm exhibits only a modest increment. This trend is attributed to the fact that, with other variables held constant, an increased number of robots leads to scene congestion and spatial limitations. A similar phenomenon can be observed in the workload balancing value. In Figure 8b, there are analogous results. As the budget expands, robots are permitted longer walks, suggesting an increasingly congested operational environment. Our RMOPWB reduces system redundancy and becomes increasingly more effective as the budget grows.
Figure 9 illustrates the influence of two key parameters of the task topology: the number of vertices and the number of edges. First, we examine the impact of the number of vertices in Figure 9a. In this setup, the environment’s topology is constructed as a minimum spanning tree (MST), which is a planar graph. Thus, the number of edges is determined by the number of vertices. In contrast, Figure 9b shows the results as we directly increase the number of edges from 10 to 18. Both Figure 9a,b demonstrate that with our proposed algorithm, the total rewards collected by the robot team do not decrease. In both scenarios, increasing either the vertices or the edges leads to a sparser spatial distribution of the robots. This contrasts with the results shown in Figure 8 and causes the performance of the RMOPWB and RMOP algorithms to converge. Despite this convergence, our proposed RMOPWB algorithm still demonstrates a consistent, albeit smaller, improvement over RMOP. Lastly, Figure 10 presents the analysis of the impact of the number of attackers. As the number of attackers increases, the number of available robots decreases. This creates an experimental condition analogous to that in Figure 8a, and thus, the results show a similar trend.
Taken together, these results lead to a clear conclusion: our RMOPWB algorithm is most impactful in dense, confined, or resource-limited scenarios where it can significantly reduce system redundancy and improve workload balancing while ensuring mission rewards are not compromised. However, it is important to critically acknowledge the associated trade-offs. The two additional optimization stages in RMOPWB mean it is computationally more intensive than the simpler RMOP algorithm. While this makes RMOP inherently faster, the results presented in Table 3 demonstrate that our parallelized implementation of RMOPWB yields acceptable computation times for practical applications. This makes RMOPWB a highly effective solution, balancing superior planning quality with manageable computational demands.

4. Discussion

The paper simultaneously considers robustness and workload equilibrium in orienteering tasks for the first time. It is strictly expressed as a three-stage optimization problem called RMOPWB. The approach extends the traditional MOP framework by integrating robust optimization principles and workload balancing. Our contributions include the development of heuristic strategies and the utilization of Monte Carlo sampling techniques to mitigate redundancy. The numerical simulations show that the algorithm can maintain the total rewards, while effectively reducing the excessive redundancy. Building on the comprehensive validation presented here, our immediate future work will focus on deploying and evaluating the RMOPWB algorithm on a physical fleet of UAVs. The reliability and performance demonstrated in our high-fidelity ROS2 environment establish the necessary foundation for this important next step.

Author Contributions

Conceptualization, T.J. and Z.M.; methodology, Z.M.; software, T.J.; validation, T.J., Z.M. and W.L.; formal analysis, T.J.; investigation, Z.M.; resources, T.J.; data curation, Z.M.; writing—original draft preparation, T.J.; writing—review and editing, Z.M.; visualization, T.J.; supervision, Y.L.; project administration, Z.M.; funding acquisition, Z.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author due to privacy or ethical restrictions.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Tutsoy, O.; Asadi, D.; Ahmadi, K.; Nabavi-Chashmi, S.Y.; Iqbal, J. Minimum Distance and Minimum Time Optimal Path Planning With Bioinspired Machine Learning Algorithms for Faulty Unmanned Air Vehicles. IEEE Trans. Intell. Transp. Syst. 2024, 25, 9069–9077. [Google Scholar] [CrossRef]
  2. Mohd Daud, S.M.S.; Mohd Yusof, M.Y.P.; Heo, C.C.; Khoo, L.S.; Chainchel Singh, M.K.; Mahmood, M.S.; Nawawi, H. Applications of drone in disaster management: A scoping review. Sci. Justice 2022, 62, 30–42. [Google Scholar] [CrossRef] [PubMed]
  3. Baeck, P.; Lewyckyj, N.; Beusen, B.; Horsten, W.; Pauly, K. Drone based near real-time human detection with geographic localization. Int. Arch. Photogramm. Remote. Sens. Spat. Inf. Sci. 2019, 42, 49–53. [Google Scholar] [CrossRef]
  4. Yamazaki, Y.; Premachandra, C.; Perea, C.J. Audio-Processing-Based Human Detection at Disaster Sites With Unmanned Aerial Vehicle. IEEE Access 2020, 8, 101398–101405. [Google Scholar] [CrossRef]
  5. Zohaib, M.; Pasha, S.M.; Javaid, N.; Iqbal, J. IBA: Intelligent Bug Algorithm—A Novel Strategy to Navigate Mobile Robots Autonomously. In Communication Technologies, Information Security and Sustainable Development: Third International Multi-topic Conference, IMTIC 2013, Jamshoro, Pakistan, 18–20 December 2013, Revised Selected Papers; Shaikh, F.K., Chowdhry, B.S., Zeadally, S., Hussain, D.M.A., Memon, A.A., Uqaili, M.A., Eds.; Springer: Cham, Switzerland, 2014; pp. 291–299. [Google Scholar]
  6. Wu, C.Y.; Lin, H.Y. Autonomous Mobile Robot Exploration in Unknown Indoor Environments Based on Rapidly-exploring Random Tree. In Proceedings of the 2019 IEEE International Conference on Industrial Technology (ICIT), Melbourne, VIC, Australia, 13–15 February 2019; pp. 1345–1350. [Google Scholar] [CrossRef]
  7. Vansteenwegen, P.; Souffriau, W.; Oudheusden, D.V. The orienteering problem: A survey. Eur. J. Oper. Res. 2011, 209, 1–10. [Google Scholar] [CrossRef]
  8. Jorgensen, S.; Chen, R.H.; Milam, M.B.; Pavone, M. The Matroid Team Surviving Orienteers problem: Constrained routing of heterogeneous teams with risky traversal. In Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, BC, Canada, 24–28 September 2017; pp. 5622–5629. [Google Scholar] [CrossRef]
  9. Thakur, D.; Likhachev, M.; Keller, J.; Kumar, V.; Dobrokhodov, V.; Jones, K.; Wurz, J.; Kaminer, I. Planning for opportunistic surveillance with multiple robots. In Proceedings of the 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, Tokyo, Japan, 3–7 November 2013; pp. 5750–5757. [Google Scholar] [CrossRef]
  10. Prorok, A. Redundant Robot Assignment on Graphs with Uncertain Edge Costs. In Distributed Autonomous Robotic Systems: The 14th International Symposium, Boulder, CO, USA, 15–17 October 2018; Correll, N., Schwager, M., Otte, M., Eds.; Springer: Cham, Switzerland, 2019; pp. 313–327. [Google Scholar]
  11. Guerrero-Bonilla, L.; Prorok, A.; Kumar, V. Formations for Resilient Robot Teams. IEEE Robot. Autom. Lett. 2017, 2, 841–848. [Google Scholar] [CrossRef]
  12. Prorok, A.; Sadler, B.M.; Egerstedt, M.; Kumar, V. Guest Editorial: Special section on “Foundations of resilience for networked robotic systems”. Auton. Robot. 2019, 43, 741. [Google Scholar] [CrossRef]
  13. Saulnier, K.; Saldaña, D.; Prorok, A.; Pappas, G.J.; Kumar, V. Resilient Flocking for Mobile Robot Teams. IEEE Robot. Autom. Lett. 2017, 2, 1039–1046. [Google Scholar] [CrossRef]
  14. Blum, A.; Chawla, S.; Karger, D.; Lane, T.; Meyerson, A.; Minkoff, M. Approximation algorithms for orienteering and discounted-reward TSP. In Proceedings of the 44th Annual IEEE Symposium on Foundations of Computer Science, Cambridge, MA, USA, 11–14 October 2003; pp. 46–55. [Google Scholar] [CrossRef]
  15. Shi, G.; Zhou, L.; Tokekar, P. Robust Multiple-Path Orienteering Problem: Securing Against Adversarial Attacks. IEEE Trans. Robot. 2023, 39, 2060–2077. [Google Scholar] [CrossRef]
  16. Assunção, L.; Santos, A.C. The Robust Steiner Team Orienteering Problem with Decreasing Priorities under budgeted uncertainty. Oper. Res. Perspect. 2025, 15, 100344. [Google Scholar] [CrossRef]
  17. Yu, Q.; Cheng, C.; Zhu, N. Robust Team Orienteering Problem with Decreasing Profits. INFORMS J. Comput. 2022, 34, 3215–3233. [Google Scholar] [CrossRef]
  18. Mansfield, A.; Manjanna, S.; Macharet, D.G.; Ani Hsieh, M. Multi-robot Scheduling for Environmental Monitoring as a Team Orienteering Problem. In Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic, 27 September–1 October 2021; pp. 6398–6404. [Google Scholar] [CrossRef]
  19. Vansteenwegen, P.; Souffriau, W.; Vanden Berghe, G.; Van Oudheusden, D. Iterated local search for the team orienteering problem with time windows. Comput. Oper. Res. 2009, 36, 3281–3290. [Google Scholar] [CrossRef]
  20. Chekuri, C.; Pal, M. A recursive greedy algorithm for walks in directed graphs. In Proceedings of the 46th Annual IEEE Symposium on Foundations of Computer Science (FOCS’05), Pittsburgh, PA, USA, 23–25 October 2005; pp. 245–253. [Google Scholar] [CrossRef]
  21. Singh, A.; Krause, A.; Guestrin, C.; Kaiser, W.J. Efficient informative sensing using multiple robots. J. Artif. Int. Res. 2009, 34, 707–755. [Google Scholar] [CrossRef]
  22. Atanasov, N.; Le Ny, J.; Daniilidis, K.; Pappas, G.J. Decentralized active information acquisition: Theory and application to multi-robot SLAM. In Proceedings of the 2015 IEEE International Conference on Robotics and Automation (ICRA), Seattle, WA, USA, 26–30 May 2015; pp. 4775–4782. [Google Scholar] [CrossRef]
  23. Orlin, J.B.; Schulz, A.S.; Udwani, R. Robust monotone submodular function maximization. Math. Program. 2018, 172, 505–537. [Google Scholar] [CrossRef]
  24. Tzoumas, V.; Gatsis, K.; Jadbabaie, A.; Pappas, G.J. Resilient monotone submodular function maximization. In Proceedings of the 2017 IEEE 56th Annual Conference on Decision and Control (CDC), Melbourne, VIC, Australia, 12–15 December 2017; pp. 1362–1367. [Google Scholar] [CrossRef]
  25. Tzoumas, V.; Jadbabaie, A.; Pappas, G.J. Resilient Monotone Sequential Maximization. In Proceedings of the 2018 IEEE Conference on Decision and Control (CDC), Miami, FL, USA, 17–19 December 2018; pp. 7261–7268. [Google Scholar] [CrossRef]
  26. Schlotfeldt, B.; Tzoumas, V.; Thakur, D.; Pappas, G.J. Resilient Active Information Gathering with Mobile Robots. In Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain, 1–5 October 2018; pp. 4309–4316. [Google Scholar] [CrossRef]
  27. Zhou, L.; Tzoumas, V.; Pappas, G.J.; Tokekar, P. Resilient Active Target Tracking With Multiple Robots. IEEE Robot. Autom. Lett. 2019, 4, 129–136. [Google Scholar] [CrossRef]
  28. Wurm, K.M.; Stachniss, C.; Burgard, W. Coordinated multi-robot exploration using a segmentation of the environment. In Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, Nice, France, 22–26 September 2008; pp. 1160–1165. [Google Scholar] [CrossRef]
  29. Burgard, W.; Moors, M.; Stachniss, C.; Schneider, F. Coordinated multi-robot exploration. IEEE Trans. Robot. 2005, 21, 376–386. [Google Scholar] [CrossRef]
  30. Klodt, L.; Willert, V. Equitable workload partitioning for Multi-Robot exploration through pairwise optimization. In Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Hamburg, Germany, 28 September–2 October 2015; pp. 2809–2816. [Google Scholar] [CrossRef]
  31. Zhang, H.; Vorobeychik, Y. Submodular optimization with routing constraints. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI’16), Phoenix, AZ, USA, 12–17 February 2016; AAAI Press: Cambridge, MA, USA, 2016; pp. 819–825. [Google Scholar]
Figure 1. In a sophisticated combat zone, an autonomous multi-drone system is deployed to execute rescue operations. The terrain is delineated into two distinct regions: the regular zone and the perilous zone. Within this landscape, Unmanned Aerial Vehicles (UAVs) establish two categories of operational nodes: the relay node and the rescue node. Relay nodes are generally used to relay UAVs in the path from the start node to the rescue node. The rescue node is the node where the injured person is located.
Figure 1. In a sophisticated combat zone, an autonomous multi-drone system is deployed to execute rescue operations. The terrain is delineated into two distinct regions: the regular zone and the perilous zone. Within this landscape, Unmanned Aerial Vehicles (UAVs) establish two categories of operational nodes: the relay node and the rescue node. Relay nodes are generally used to relay UAVs in the path from the start node to the rescue node. The rescue node is the node where the injured person is located.
Applsci 15 08217 g001
Figure 2. An instance of the proposed multi-UAV task allocation, where a ground station assigns respective tasks to 6 UAVs in an urban environment using RMOPWB. Based on a given topological map of the urban environment, the Ground Control Station (GCS) first performs trajectory planning to estimate the travel costs between task points. Our proposed method is then used to allocate these points to the drone swarm. The different colors distinguish the target assignments for each UAV.
Figure 2. An instance of the proposed multi-UAV task allocation, where a ground station assigns respective tasks to 6 UAVs in an urban environment using RMOPWB. Based on a given topological map of the urban environment, the Ground Control Station (GCS) first performs trajectory planning to estimate the travel costs between task points. Our proposed method is then used to allocate these points to the drone swarm. The different colors distinguish the target assignments for each UAV.
Applsci 15 08217 g002
Figure 3. All processes of solving our orienteering problem. There are four processes, which are the graph model of a scene, stage #1, stage #2, and stage #3. The RMOP Solver obtains the walks of stage #1. These three stages correspond to (2), (3), and (4), respectively. There is a loop between stage #1 and stage #2. When the loop is over, obtain the optimal solution of stage #3 by (4). The upper right corner of the subgraph shows the total budget for the two drones.
Figure 3. All processes of solving our orienteering problem. There are four processes, which are the graph model of a scene, stage #1, stage #2, and stage #3. The RMOP Solver obtains the walks of stage #1. These three stages correspond to (2), (3), and (4), respectively. There is a loop between stage #1 and stage #2. When the loop is over, obtain the optimal solution of stage #3 by (4). The upper right corner of the subgraph shows the total budget for the two drones.
Applsci 15 08217 g003
Figure 4. Description of the entire process of the algorithm. Our algorithm has three main modules, which correspond to the three stages of the RMOPWB. The three modules are (1) RMOP Solver, (2) Path Replan, and (3) Workload Balance. The input information has three blocks: map information, robot cluster information, and attacker information. The RMOP Solver takes the three input information and some configuration information as inputs and outputs the walks of the multi-robot system. The Path Replanning module replans the walks. After sampling these two steps several times, the workload balancing module processes the sampling results.
Figure 4. Description of the entire process of the algorithm. Our algorithm has three main modules, which correspond to the three stages of the RMOPWB. The three modules are (1) RMOP Solver, (2) Path Replan, and (3) Workload Balance. The input information has three blocks: map information, robot cluster information, and attacker information. The RMOP Solver takes the three input information and some configuration information as inputs and outputs the walks of the multi-robot system. The Path Replanning module replans the walks. After sampling these two steps several times, the workload balancing module processes the sampling results.
Applsci 15 08217 g004
Figure 5. Robots α and β start at the blue and yellow S vertices, respectively. The table presents five examples illustrating how our algorithm reduces path redundancy when the two robots, α and β , perform tasks in different regions.
Figure 5. Robots α and β start at the blue and yellow S vertices, respectively. The table presents five examples illustrating how our algorithm reduces path redundancy when the two robots, α and β , perform tasks in different regions.
Applsci 15 08217 g005
Figure 6. The top figure shows our maze simulation environment. The bottom figure shows the resulting task assignments for three UAVs produced by our RMOPWB algorithm. Operating on a given task topology, the GCS assigns targets whose interconnecting path costs are approximated via trajectory planning. The three colors distinguish the target assignments for each UAV.
Figure 6. The top figure shows our maze simulation environment. The bottom figure shows the resulting task assignments for three UAVs produced by our RMOPWB algorithm. Operating on a given task topology, the GCS assigns targets whose interconnecting path costs are approximated via trajectory planning. The three colors distinguish the target assignments for each UAV.
Applsci 15 08217 g006
Figure 7. Two mission topology graphs. Six robots are set and numbered in order as 0 , 1 , 2 , 3 , 4 , 5 . The budget of each robot is 24. The initial positions of the robots are randomly distributed and | A | = 2 . For all vertices in the two graphs, different colors are used to represent rewards and the reward values are labeled in black font on the upper right corner of each vertex. The vertex number is labeled in gray font. For the two scenarios, the first subgraph, (a1) and (b1), comprehensively displays the task topology information and the initial state of the robots. The subsequent subgraphs illustrate the three stages of the RMOPWB algorithm. Specifically, (a2) and (b2) demonstrate the task allocation results for the six robots in Stage #1. Following this, (a3) and (b3) depict the task allocation results for each robot in Stage #2. Finally, (a4) and (b4) show the final paths and the rewards obtained by each robot after completing Stage #3.
Figure 7. Two mission topology graphs. Six robots are set and numbered in order as 0 , 1 , 2 , 3 , 4 , 5 . The budget of each robot is 24. The initial positions of the robots are randomly distributed and | A | = 2 . For all vertices in the two graphs, different colors are used to represent rewards and the reward values are labeled in black font on the upper right corner of each vertex. The vertex number is labeled in gray font. For the two scenarios, the first subgraph, (a1) and (b1), comprehensively displays the task topology information and the initial state of the robots. The subsequent subgraphs illustrate the three stages of the RMOPWB algorithm. Specifically, (a2) and (b2) demonstrate the task allocation results for the six robots in Stage #1. Following this, (a3) and (b3) depict the task allocation results for each robot in Stage #2. Finally, (a4) and (b4) show the final paths and the rewards obtained by each robot after completing Stage #3.
Applsci 15 08217 g007
Figure 8. Performance analysis of the robotic system under varying parameters. (a) The influence of the number of robots. (b) The influence of the individual budget allocated to each robot.
Figure 8. Performance analysis of the robotic system under varying parameters. (a) The influence of the number of robots. (b) The influence of the individual budget allocated to each robot.
Applsci 15 08217 g008
Figure 9. Impact of environmental parameters on algorithm performance. (a) Performance as a function of the number of vertices in the graph. (b) Performance as a function of the number of edges, which reflects graph density.
Figure 9. Impact of environmental parameters on algorithm performance. (a) Performance as a function of the number of vertices in the graph. (b) Performance as a function of the number of edges, which reflects graph density.
Applsci 15 08217 g009
Figure 10. Influence of the number of attackers on algorithm performance. Results from 50 simulations, | S | = 6 , B = 26 , | V | = 14 and | E | = 13 .
Figure 10. Influence of the number of attackers on algorithm performance. Results from 50 simulations, | S | = 6 , B = 26 , | V | = 14 and | E | = 13 .
Applsci 15 08217 g010
Table 1. The detailed path planning information of the robot system in Figure 7a.
Table 1. The detailed path planning information of the robot system in Figure 7a.
Figure 7aFigure 7(a2)Figure 7(a3)Figure 7(a4)
PathsRobot 06,19,6,17,0,166,19,6,176,19,6,17
Robot 120,8,13,1,13,120,8,13,120,8,13,1
Robot 20,16,4,15,14,110,160,16,4,15
Robot 313,8,20,5,20,313,8,20,5,20,313,8
Robot 44,15,14,11,14,44,15,14,114,15,14,11
Robot 516,4,15,14,11,1416,4,1516,4,15,14
IndicatorsReward124124124
Budget131.4773.5469.51
WBV23.9321.6913.73
Table 2. The detailed path planning information of the robot system in Figure 7b.
Table 2. The detailed path planning information of the robot system in Figure 7b.
Figure 7bFigure 7(b2)Figure 7(b3)Figure 7(b4)
PathsRobot 07,20,11,3,97,20,117,20,11,3,9
Robot 14,16,14,0,13,154,16,14,0,13,154,16,14
Robot 20,14,16,6,12,60,14,16,60,13,12
Robot 35,21,8,15,12,65,21,85,21,8
Robot 414,16,6,12,15,814,16,6,12,15,814,16,6,12,15
Robot 53,9,3,11,203,9,3,113,20,17,5
IndicatorsReward189189189
Budget135.8693.7882.38
WBV23.8222.8720.85
Table 3. Performance comparison of different methods in terms of total reward, total budget spent, and workload balancing value in the City (Figure 2) and Maze (Figure 6) scenarios.
Table 3. Performance comparison of different methods in terms of total reward, total budget spent, and workload balancing value in the City (Figure 2) and Maze (Figure 6) scenarios.
SceneUAV NumMethodRewardBudgetWBVRuntime (ms)
City3RMOP121.872.424.351
| V | = 18 RMOPWB1121.866.523.4192
| E | = 22 RMOPWB2121.864.823.676
City7RMOP228.5125.326.195
| V | = 18 RMOPWB1228.595.922.9403
| E | = 22 RMOPWB2228.591.423.0151
City11RMOP298.7187.625.6176
| V | = 18 RMOPWB1298.7122.521.1783
| E | = 22 RMOPWB2298.7128.021.9212
Maze3RMOP82.156.728.342
| V | = 15 RMOPWB182.150.127.9141
| E | = 20 RMOPWB282.151.827.159
Maze7RMOP155.679.630.878
| V | = 15 RMOPWB1155.664.226.8286
| E | = 20 RMOPWB2155.662.525.6109
Maze11RMOP218.9116.832.7116
| V | = 15 RMOPWB1218.972.225.8529
| E | = 20 RMOPWB2218.970.927.3161
The distinction between RMOPWB1 and RMOPWB2 is whether the K Monte Carlo sampling steps in Algorithm 1 are performed with parallel processing. RMOPWB2 utilizes parallelization, while RMOPWB1 does not.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mi, Z.; Jiang, T.; Leng, W.; Lei, Y. Managing Redundancy in a Multiple-Robot Orienteering Problem for Equilibrium and Robustness. Appl. Sci. 2025, 15, 8217. https://doi.org/10.3390/app15158217

AMA Style

Mi Z, Jiang T, Leng W, Lei Y. Managing Redundancy in a Multiple-Robot Orienteering Problem for Equilibrium and Robustness. Applied Sciences. 2025; 15(15):8217. https://doi.org/10.3390/app15158217

Chicago/Turabian Style

Mi, Zengzhen, Tong Jiang, Wenwen Leng, and Yuchengzhi Lei. 2025. "Managing Redundancy in a Multiple-Robot Orienteering Problem for Equilibrium and Robustness" Applied Sciences 15, no. 15: 8217. https://doi.org/10.3390/app15158217

APA Style

Mi, Z., Jiang, T., Leng, W., & Lei, Y. (2025). Managing Redundancy in a Multiple-Robot Orienteering Problem for Equilibrium and Robustness. Applied Sciences, 15(15), 8217. https://doi.org/10.3390/app15158217

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop