Next Article in Journal
Evaluation of the Impact of External Conditions on Arm Positioning During Punches in MMA Fighters: A Comparative Analysis of 2D and 3D Methods
Previous Article in Journal
Real-Time Seam Extraction Using Laser Vision Sensing: Hybrid Approach with Dynamic ROI and Optimized RANSAC
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-Robot System for Cooperative Tidying Up with Mobile Manipulators and Transport Agents

Department of Electrical Engineering, Pusan National University, Busan 46241, Republic of Korea
*
Author to whom correspondence should be addressed.
Sensors 2025, 25(11), 3269; https://doi.org/10.3390/s25113269
Submission received: 17 April 2025 / Revised: 14 May 2025 / Accepted: 21 May 2025 / Published: 22 May 2025

Abstract

:
This paper presents a system in which mobile manipulators and transport agents cooperate to solve a multi-agent pickup and delivery (MAPD) problem. The primary objective is to allocate appropriate tasks to heterogeneous robots by considering their capabilities and states. Unlike previous studies that focused on homogeneous teams or assigned distinct roles to heterogeneous robots, this work emphasizes synergy through cooperative task execution. A key feature of the proposed system is that mobile manipulators behave differently depending on whether they are paired with a transport agent. Additionally, rather than generating a full trajectory from start to end, the system plans partial trajectories, allowing dynamic re-pairing of transport agents through an auction algorithm. After re-pairing, new starting nodes are defined, and the following trajectory is updated accordingly. The proposed system is validated through simulations, and its effectiveness is demonstrated by comparing it against a baseline system without dynamic pairing.

1. Introduction

In recent years, multi-robot systems are utilized in various fields, such as logistics [1,2], rescue [3,4], medical care [5,6], and agriculture [7,8]. For these domains, which require the coordination of multiple robots, solving MAPD problems is an important part of the process. Current research in MAPD focuses on developing efficient algorithms for task allocation and path planning to optimize efficiency [9,10]. These studies aim to minimize travel distance, task completion time, and resource consumption while ensuring a collision-free trajectory for multiple agents [11,12,13].
However, most existing approaches assume homogeneous robot teams [7,14] or assigning heterogeneous robots to distinct tasks [15,16,17,18]. Although some studies address heterogeneous robots performing the same task, they rarely focus on performing assigned tasks cooperatively [3,5,19].
In this context, we propose a multi-robot system that performs tidying-up tasks by integrating two heterogeneous robot types: mobile manipulators and transport agents. The system is designed to support synergistic collaboration between these robots.
Figure 1 shows the capabilities of mobile manipulators and transport agents. In this system, a mobile manipulator can pick up an object and deliver it to the destination or hand it to a transport agent. In contrast, a transport agent cannot pick up objects themselves but can receive them from manipulators. However, unlike mobile manipulators, they can deliver multiple objects to the destination in a single trip.
Unlike previous approaches that assign independent tasks to each robot type, this system enables pairing these two types to form collaborative units and create complementary capabilities by integrating trajectory optimization and auction-based task allocation.
In generating an optimized trajectory, the objective function consists of traveling and object-handling time without considering the process of finding new partners. The planning conditions are different depending on whether a transport agent is paired (carrying multiple objects) or unpaired (carrying only one object). Additionally, the system generates a partial trajectory corresponding to the number of objects to deliver in a single expectation.
After matching a new pairing partner and generating the sequence trajectory, new starting nodes are defined at the positions where robots are expected to be located after a certain amount of time. From these nodes, a new trajectory is generated.

1.1. Related Work

Several works have been proposed to solve the MAPD problem. Authors in [14] present the MAPD approach that robots can run in a peer-to-peer fashion. By distributing computation to multiple robots using primal decomposition [20,21,22], this method enables a reduction in computations without a central node. However, this work assumes the use of homogeneous robots. Additionally, duplicate access to delivery points is not considered in the optimization, and all robots should start their task at the same starting point. Furthermore, they conducted only abbreviated experiments, without pick-and-place, using mobile robots without manipulators. Authors in [23] show a game-theoric multi-robot task allocation framework for multi-robot trash collection in dynamically changing environments. In this work, robots choose their working area based on a decision-making algorithm consisting of a payoff (reward) mechanism, Poisson clock, and task revision protocol. Another MAPD framework introduced in [7] shows a practical example of using a multi-objective discrete artificial bee colony (MODABC) algorithm in orchards. However, both [7,23] favor the use of homogeneous robots in solving MAPD problems.
Meanwhile, other studies deal with multi-robot systems with heterogeneous robots. Authors in [3] show task allocation with heterogenous robots by considering their capabilities and states in a rescue scenario, and they prove its validity by applying it to Capture the Flag game. Authors in [5] present task allocation and path planning of heterogeneous robots in medical scenarios. This work introduces Intensive Inter-task Relationship Tree Search (IIRTS) to perform fast task allocation and enable real-time implementation. However, they only present the methods to place heterogeneous robots in appropriate places and do not present how to perform the tasks at the arrival point.
Research in [15,16,17,18] deals with a topic similar to this work that solves the MAPD problem with mobile manipulators and transport agents. Although they solve this problem in various ways (e.g., weighted block sequence graph, constructive heuristics, and recurrent open shop scheduling), these works have limitations in restricting the movement range of mobile manipulators to near storage units. For this reason, these robots only take out items from storage at designated points. Furthermore, as this work assumes that all robots are operated in a structured factory setting with a grid path, applying them in non-structured environments is challenging.

1.2. Contributions

In order to implement the MAPD using only optimization, the timing and location of pairings should be considered in the optimization condition. However, as this method requires an extremely high amount of computation, it is impossible to use in practice. To address this issue, we introduce a multi-robot control system that combines optimization and auction algorithms. This work provides the following contributions.
1.
We propose a novel MAPD system that enables cooperative task execution between mobile manipulators and transport agents in the same space.
2.
The system integrates trajectory optimization and an auction-based task allocation mechanism to adjust the pairing of robots dynamically.
3.
Unlike previous full-trajectory approaches, our method uses a horizon-based partial trajectory planning strategy that enables dynamic pairing and re-optimization during execution with reduced computational costs.
4.
The system applies different planning conditions to mobile manipulators depending on whether they are currently paired with a transport agent or are operating independently. On the other hand, many previous works assume robots have fixed roles or behave in the same way.
5.
The proposed system is validated through realistic simulations, demonstrating the algorithm’s effectiveness.

2. System Overview

The overall system structure, shown in Figure 2, is composed of four main components: the initializer, trajectory planner, pairing planner, and predictor.
At the beginning of an operation, the initializer processes the current states of the environment—such as robot positions and object distributions derived from GPS sensors—and derives the initial states necessary for trajectory planning. These are passed to the trajectory planner, which generates a robot’s optimal path up to a predefined planning horizon without considering pairing updates.
Next, the pairing planner uses an auction-based algorithm to determine new pairing partners for mobile manipulators and transport agents. It evaluates pairing options based on cost and availability and generates a trajectory that reflects these new pairings. These are then merged with the initial pre-auction trajectory to form the post-auction trajectory, which is passed to the predictor.
The predictor analyzes the post-auction trajectory and estimates predictive nodes, which are used by the initializer in the next planning cycle. It also contributes to determining the target behaviors of the robots by combining trajectory information with the current environmental context.
Finally, using the combined trajectory and current state information, the system derives the target actions for each robot.

3. Pre-Auction Trajectory Planning

We formulate the problem using the notations defined in Table 1 to generate optimal trajectories up to a predefined planning horizon without considering pairing updates.
An example of such a trajectory is illustrated in Figure 3, which includes three types of nodes: s for starting positions, j for object locations, and k for delivery destinations. In this example, mobile manipulators i 1 and i 3 , which start at s 1 and s 3 , are initially paired with transport agents, while i 2 , starting at s 2 , is not.
The destination node k 1 is assigned to the objects from j 1 to j 3 , k 2 to the objects from j 4 to j 6 , and k 3 to those from j 7 to j 9 .

3.1. Initializer

Before computing the optimized trajectory, it is necessary to initialize the state of each mobile manipulator i. Algorithm 1 outlines this process, where the initial cost c i init , starting node s i , paired transport agent τ i , and initially loaded objects l i init are determined.
Algorithm 1 Initializer.
1:
for i I do
2:
   if predictive nodes exist then
3:
     c i init G E T I N I T I A L C O S T ( s i pred , Π i prev )
4:
     s i G E T L A S T N O D E I N P R E D I C T I O N ( s i pred , Π i prev )
5:
     τ i G E T P A I R E D T R A N S P O R T A G E N T ( τ i pred , Π i prev )
6:
     l i init l i pred
7:
   else
8:
     c i init , s i , τ i , l i init 0 , p i now , τ i now , l i now
9:
end for
If predictive states are available, c i init is computed based on the predictive node s i pred in the previous trajectory Π i prev ; s i is defined as the last accessing node during the prediction period; τ i is considered paired with i if τ i pred arrives at the pairing node within the prediction range; and l i init is set to its predictive value l i pred . (This process is explained in more detail in Section 4.2).
In contrast, if predictive states are unavailable, c i init is set to zero; s i is assigned to the data received at the current position from an embedded GPS p i now ; and τ i and l i init are also set to their current states τ i now and l i now , respectively.

3.2. Trajectory Planner

After determining the initial states, the following objective function and constraints are defined to generate the trajectory.

3.2.1. Objective Function

min i I , j J { w max i I C i + ( 1 w ) i I C i }
Equation (1) describes the objective function used to minimize the completion time and the total cost of all robots. These two components can be balanced using the weight parameter w. A higher value of w emphasizes minimizing the maximum completion time, while a lower value prioritizes the reduction of the total cost.

3.2.2. Traveling Constraints

x i j a k j b = 0 , x i j a j b = 0 , x i k j a j b = 0 , where j a = j b , i I
x i j a k j b x i j b k j a = 0 , x i j a j b x i j b j a = 0 , x i k j a j b x i k j b j a = 0 , i I
x ¬ i p j a j b = 0 , x ¬ i p k j a j b = 0 , i p I p
x i p j a j b = 0 , i p I p , where k j a k j b
x i p s j a x i p j a k j b = 0 , j a , j b J x i p k j a j b 1 , i p I p
Equations (2) and (3) show that the mobile manipulators cannot stop at a specific node and cannot return to the previously visited node.
Equations (4) and (5) describe the conditions when a mobile manipulator travels directly between object nodes. Equation (4) describes that performing such direct travels is restricted for the mobile manipulators not paired with the transport agents. However, even if paired, as described in Equation (5), object nodes j a and j b should have the same destination node in performing direct travels without separation.
Equation (6) describes the conditions for the mobile manipulators paired with transport agents: they cannot travel to an object node via a destination node immediately after its first movement. Additionally, separating tasks can only be performed once for the paired mobile manipulators.

3.2.3. Manipulation Constraints

j J x i s j 1 , i I
i I x i j k 1 , j J
x i s j = x i s j j J x i j k , i I , j J
{ i I x i s j a + i I j b J ( x i j b j a + x i k j b j a + x i j b k j a ) } 1 , j a J
j a J { i I x i s j a + i I j b J ( x i j b j a + x i k j b j a + x i j b k j a ) } = H
Equations (7)–(9) describe that if a mobile manipulator starts a task, it can only execute one starting task x i s j and one final task x i j k . However, these constraints alone may result in multiple robots performing the same task. This issue is addressed by Equation (10), which prevents such overlaps by enforcing exclusivity in task execution. Moreover, Equation (10) is extended to Equation (11) by summing over all object nodes, thereby defining the horizon that the number of objects handled in a single planning. In Equation (11), the horizon is determined based on available computing power.

3.2.4. Connecting Constraints

x i s j a = x i s j a { j b J ( x i k j a j b + x i j a j b + x i j a k j b ) + x i j a k } , j a J , i I
x i j a k = x i j a k { j b J ( x i k j b j a + x i j b j a + x i j b k j a ) + x i s j a } , j a J , i I
x i j a k j b = x i j a k j b { x i s j a + j c J ( x i k j c j a + x i j c k j a ) } , j a J , i I
x i j b k j a = x i j b k j a { x i j a k + j c J x i j a k j c } , j a J , i I
x i j a j b = x i j a j b { x i s j a + j c J x i j c j a } , j a J , i I
x i j b j a = x i j b j a { x i j a k + j c J ( x i j a j c + x i k j a j c ) } , j a J , i I
x i k j a j b = x i k j a j b { x i s j a + j c J x i j c j a } , j a J , i I
x i k j b j a = x i k j b j a { x i j a k + j c J x i j a k j c } , j a J , i I
Equations (12)–(19) define the connecting constraint that connected edges should share the same node. These constraints enforce the continuity of movement by requiring the connections between nodes to align with respective task assignments. Additionally, Equation (19) specifies that if the mobile manipulators separate from their paired transport agents, they are no longer allowed to move directly between object nodes.

3.2.5. Ordering and Capacity Constraints

o i j b = o i j a + 1 n ( J ) ( 1 x i j a j b x i j a k j b ) , j a J , i I
j a J j b J x i j a j b < ( L max l i init ) , i I
Equation (20) defines the ordering constraints to prevent the generation of infinite loops in the trajectory (e.g., x i j a k j b = 1 , x i j b k j c = 1 , and x i j c k j a = 1 ), by assigning an execution order using the Miller–Tucker–Zemlin (MTZ) formulation [24].
Additionally, since the number of loaded objects increases by one whenever the robot travels directly between two object locations, the number of such direct travels for i is bounded by its maximum carrying capacity and the number of initially loaded objects, as expressed in Equation (21).

3.2.6. Total Cost

C i = c i init + j J c i s j x i s j + j J c j k x i j k + j a , j b J ( c j a k + c k j b ) x i j a k j b + j a , j b J c j a j b x i j a j b + j a , j b J ( w τ c j a k + c j a j b ) x i k j a j b , i I
Equation (22) represents the deriving total cost of i. When x i k j a j b is performed, the cost of the separated transport agent is adjusted by weight w τ .

4. Post-Auction Trajectory Planning

After generating the pre-auction trajectory, each transport agent selects a new pairing partner and node and regenerates its trajectory accordingly.

4.1. Auction-Based Pairing Planner

Algorithm 2 shows the overall process. After initializing the best pairing candidate set P best to an empty state, the set of pairing candidates P is generated from the pre-auction trajectory Π pre . Each candidate comprises a transport agent separated from its original partner, a target mobile manipulator, and a pairing index indicating a node on the target manipulator’s trajectory.
Algorithm 2 Pairing planner.
  1:
P best , P G E T P A I R I N G C A N D I D A T E ( Π pre )
  2:
while P do
  3:
   p best , J best done S E L E C T B E S T P A I R ( P , P best , Π pre , J , H )
  4:
   P E X C L U D E S E L E C T E D A G E N T S ( P , p best . i , p best . τ )
  5:
   J J J best done , H H n ( J best done ) ,
  6:
   P best P best + p best
  7:
end while
  8:
C init , S new G E T I N I T I A L S T A T E A F T E R P A I R I N G ( P best )
  9:
Π new T R J P L A N N E R ( P best , C init , S new , H , J )
10:
Π post R E P L A C E T R J ( Π pre , S new , Π new )
The algorithm iteratively selects the best available candidate until all separated transport agents in P are paired with new mobile manipulators. Once the best candidate p best is identified, the robots included in p best and the objects handled at SelectBestPair J best done are excluded from further consideration. The process is then repeated with the remaining pairings to determine the next optimal pairing state.
After all pairings are finalized, the new pairing trajectory Π new is generated from new starting nodes S new . By replacing the pre-auction trajectory after S new to Π new , the post-auction trajectory Π post is generated.

4.1.1. Pairing Candidates

Algorithm 3 demonstrates the procedure for generating pairing candidates. For each transport agent τ and mobile manipulator i, the algorithm evaluates every node π i n along i’s trajectory Π i . If i is initially unpaired, the evaluation begins from the first node; otherwise, it starts from the separation index n sep .
Algorithm 3 GetPairingCandidate
  1:
P
  2:
for τ T , i I do
  3:
   P i , τ
  4:
   for n = 0 , 1 , . . . , N Π i 1 do
  5:
     if i ¬ I p or n n sep then
  6:
      t τ π i n t τ pre - sep + t τ j sep , k sep + t handle + t τ k sep , π i n
  7:
      if t τ π i n t i π i n + t A then
  8:
        P i , τ P i , τ + ( τ , i , n , t τ π i n )
  9:
      end if
10:
     end if
11:
   end for
12:
   P P + F I L T E R F A S T E S T M E E T I N G P A I R S ( P i , τ , N )
13:
end for
14:
return P
At each node, a pairing with τ is considered feasible if the total estimated time for the transport agent to arrive—including the pre-separation time t τ pre - sep , travel time from the separation point to the destination node t τ j sep , k sep , object handling time t handle , and travel time from the destination node to the n-th node on the trajectory t τ k sep , π i n —does not exceed the sum of the arrival/handling time of the target mobile manipulator at the n-th node t i π i n , and the allowable waiting time t A .
Among all feasible candidates in P i , τ , the top N candidates with the earliest meeting times t τ π i n are selected using F I L T E R F A S T E S T M E E T I N G P A I R S , and the corresponding values ( τ , i , n ) are added to the global candidate set P.

4.1.2. Selecting the Best Pair with Auction

To determine the optimal pairs, each candidate is evaluated using the process described in Algorithm 4.
In each evaluation, i in the subject candidate p is assumed to be paired with a transport agent at the n-th node on its pre-auction trajectory Π p . i pre . Additionally, the previously selected candidates P best are also assumed to be paired at their respective pairing nodes. Moreover, C eval init and S eval represent the initial cost and starting positions of all agents, and J p . i done represents the set of objects already handled by p . i before the n-th node, and they are derived using G E T I N I T I A L S T A T E S .
Figure 4 shows an example of how the initial states are set when evaluating the subject candidate p 1 . In this example, the mobile manipulator i 1 is paired with a transport agent at node j 2 . Other relevant robots, such as i 2 and i 3 , are also considered in the evaluation: i 2 is a non-subject mobile manipulator, and i 3 is a previously selected mobile manipulator that is paired with a transport agent at node j 5 . When i 1 arrives at j 2 , each of these robots is positioned at its respective coincident node, which is marked with a red line in the figure.
Algorithm 4 SelectBestPair
  1:
C eval
  2:
for p P do
  3:
   S E T P A I R I N G S ( p . i , p . n , P best , Π p . i pre )
  4:
   C eval init , S eval , J p . i done G E T I N I T I A L S T A T E S ( p , P best , Π pre )
  5:
   J eval J J p . i done , H eval H n ( J p . i done )
  6:
   Π p T R J P L A N N E R ( p , P best , C eval init , S eval , H eval , J eval )
  7:
   C eval C eval + Π p . c
  8:
end for
  9:
p best , J best done G E T M I N C O S T P A I R ( C eval )
10:
return p best , J best done
In this scenario, j 1 is the object handled by the subject candidate, and j 2 becomes the starting node for i 1 . For each non-subject agent, the starting node is defined as the first node on its trajectory after passing the coincident point. However, if the agent belongs to a previously selected candidate, its pairing node is used as its starting point.
Based on these updated starting nodes, the initial cost of each candidate is calculated as the sum of travel and handling times from their original starting node to the updated one, presented with blue arrows.
After defining the initial states, the candidate trajectory Π p is generated using T R J P L A N N E R (Section 3.2), assuming that non-subject mobile manipulators not in the previously selected set are unpaired. The resulting cost Π p . c is then computed from Π p and stored in the evaluation set C eval .
This evaluation process is repeated for all candidates. Among them, the candidate with the lowest cost is selected as the best pair p best , and its handled object set J best done is excluded from consideration in the subsequent sequence auctions.

4.1.3. Post-Auction Trajectory

The mobile manipulators included in P best are designated as paired at their respective pairing nodes. Conversely, those not included in P best are designated as unpaired, starting from their separating node if they were initially paired or from their original starting node if they were initially unpaired.
At these new starting nodes S new , the cost from each robot’s original starting node to its new starting node is set as the initial cost C init . Additionally, the object nodes between these starting nodes are omitted when planning the new trajectory.
With these initial states, the trajectory Π new is generated using T R J P L A N N E R , and the post-auction trajectory Π post is obtained by replacing the segment after S new in Π pre with Π new .

4.2. Predictor

After generating the post-auction trajectory, we can create a new trajectory from the points concerning the specific time that has passed in reference to the previous starting points. As shown in Figure 5, the starting nodes move through the trajectory for the predefined prediction time, which is indicated with a dotted arrow, and the predictive starting node is set as s i pred . From these nodes, a new trajectory is generated.
As performed in Section 4.1.2, each mobile manipulator designates its last accessing node during the prediction time as its predictive initial accessing nodes, and the sum of the traveling time and handling time from each predictive starting node to the initial accessing node is defined as the initial cost c i init . For instance, in Figure 5, j 2 , k 1 , and j 7 are assigned as the new starting nodes for i 1 , i 2 , and i 3 , respectively, and the costs from each s i pred to its corresponding starting node are set as c i init .
If the initial accessing node for i is an object node j, then x i s j is set to 1. In addition, object nodes handled by each paired robot within the prediction range are added to its initially loaded objects l i init in the new trajectory. As shown in Figure 5, j 1 and j 3 are added to the loaded objects of i 1 and i 2 , respectively, and x i 1 s j 2 and x i 3 s j 7 are set to 1.

5. Experiments

The experiments were conducted using Webots [25], an open-source robotic simulator integrated with ROS2 [26].
To generate navigation trajectories, we employed the A* [27] algorithm to compute travel paths and estimate the costs between waypoints, including object positions, destinations, and robot locations. Based on the computed costs, the Gurobi Optimizer [28] was used to generate optimized trajectories as described in Section 3.2, where the weight parameter w was set to 0.7. In addition, ORCA (Optimal Reciprocal Collision Avoidance) [29] was applied during navigation to prevent collisions between robots.
The experiments are designed to evaluate the proposed system from two perspectives—its scalability in different task settings, and the impact of the auction-based pairing algorithm on overall performance.
In evaluating its scalability, the experiments were conducted in two environments: a simplified environment and a kindergarten-themed environment—both shown in Figure 6. The simplified environment consists of cube-shaped objects and three destinations without obstacles. In contrast, the kindergarten-themed environment includes objects commonly found in kindergartens, four destinations, and various obstacles.
Meanwhile, to evaluate the effectiveness of the auction-based pairing algorithm, we compared the proposed system to a baseline configuration that does not include this mechanism. In the baseline system, each mobile manipulator operates independently with fixed storage, and no dynamic role reassignment with transport agents occurs.
Both experiments were conducted under varying numbers of objects N j and transport agents N τ , assuming L max = 2 for all transport agents.
For a visual representation of the experiments, all corresponding video recordings can be accessed at the following link: (https://youtube.com/playlist?list=PLB1pUAsYGpRHK9mhHIeu827JLSoLhcznD&si=97sXqhfHEqkMkMc0, accessed on 20 May 2025).

5.1. Scalability Evaluation

5.1.1. Experimental Scenarios

To evaluate the scalability of the proposed system, we test its performance under two different environments:
  • Simplified Environment: A minimal environment shown in Figure 6a, where three mobile manipulators and a varying number of transport agents collaborate to tidy up the workspace by placing cube-shaped objects into the destination zones of matching colors.
  • Kindergarten-Themed Environment: A realistic environment shown in Figure 6b, which consists of four distinct areas: a playroom, a corridor, a classroom, and a teacher’s room. In this environment, five mobile manipulators and a varying number of transport agents work together to tidy up four different types of objects—balls, toys, teaching aids, and stationery—by delivering them to their designated locations. To ensure consistent handling and loading behaviors, all objects were standardized to have a weight of 50 g and were encapsulated in a cubic bounding box with a side length of 0.07 m.

5.1.2. Performance Comparison

We measured task completion time in both environments while increasing N j and N τ . The results, summarized in Figure 7 and Table 2, show that the proposed system scales well in both structured and complex environments.
In the simplified environment, the system shows relatively minor gains when N j is small. For example, at N j = 9 , increasing N τ from 2 to 3 results in only a 1-s increase, suggesting possible coordination overhead. However, as the number of objects increases to N j = 15 , the benefit becomes more evident, with up to 44 s of reduction when increasing N τ from 2 to 3.
In the kindergarten-themed environment, which introduces additional complexity, such as obstacles and spatial constraints, similar trends are observed. For N j = 12 , increasing the number of transport agents from one to two reduces the completion time significantly by 36 s. A further increase to three units continues to reduce the time to 98 s. However, increasing N τ from 3 to 4 does not yield any improvement, and adding a fifth transport agent increases the completion time by 5 s, possibly due to the coordination overhead in a low-demand scenario.
For larger workloads, N j = 20 and N j = 24 , the performance improves more consistently as the number of transport agents increases, following a similar trend observed in the simplified environment. For N j = 20 , task completion time drops from 229 s with one unit to 155 s with five units, with the average decrease being 18.5 s per unit. A similar pattern is observed at N j = 24 , where time reduces from 272 to 182 s as the number of transport agents increases, with the average decrease being 22.5 s per unit.
These results demonstrate that the proposed system scales effectively in both simplified and realistic settings, and that the degree of performance improvement becomes more substantial as the task complexity and workload increase.

5.2. Effectiveness of Auction-Based Pairing

5.2.1. Experimental Scenarios

To evaluate the effectiveness of the auction-based pairing algorithm, we define two experimental scenarios:
  • With auction-based pairing: Robots use the proposed dynamic pairing strategy, where transport agents and mobile manipulators are paired through an auction-based algorithm. Pairings can change dynamically based on task status and robot availability.
  • Without auction-based pairing: Robots operate without the auction-based pairing mechanism. Instead of dynamic pairing, each mobile manipulator, assumed to be paired with a transport agent, is equipped with a container that has the same L max as a transport agent and performs the task without any pairing mechanism. The trajectory planner employed in this work is based on the approach proposed in [14].
Both scenarios are tested under identical environment settings, including object placements and robot configurations. Task completion time is measured and compared to analyze the effectiveness of the auction-based pairing strategy.

5.2.2. Performance Comparison

To assess the effectiveness of the proposed auction-based pairing strategy, we compare its performance against a baseline configuration in which each mobile manipulator is equipped with a fixed container and no dynamic pairing is allowed. Both systems are evaluated in the kindergarten-themed environment.
As shown in Figure 8 and Table 3, the auction-based system achieves better performance in most settings, particularly when the number of transport agents increases.
For example, at N j = 12 , the completion time decreases from 156 s to 103 s as N τ increases from 1 to 5 without an auction-based pairing; with the proposed method, the time drops from 155 s to 98 s over the same range.
At N j = 16 , the difference becomes even more pronounced. The baseline shows a reduction of 32 s (from 191 s to 159 s), while the auction-based method achieves a 92 s reduction (from 211 s to 119 s), demonstrating more effective resource utilization under moderate task loads.
Even in higher workloads, the auction-based method consistently maintains a more significant margin of improvement. When N j = 24 , the completion time drops from 274 s to 218 s without an auction-based pairing and from 272 s to 182 s with an auction-based pairing, yielding a 36 s advantage.
These results indicate that the proposed auction-based pairing enables better adaptation to varying task distributions. By allowing dynamic reallocation of roles between mobile manipulators and transport agents, the system achieves consistently shorter completion times across different conditions.

6. Discussion

This study proposes an MAPD system that enables efficient collaboration between mobile manipulators and transport agents. Unlike conventional approaches that treat heterogeneous agents separately, our system integrates trajectory optimization with auction-based dynamic pairing to support flexible role adjustment and synergy.
Experimental results validate the approach, showing that the proposed method improves task completion time, especially in high-demand situations. These improvements highlight the value of strategic dynamic pairing and partial trajectory planning in optimizing heterogeneous multi-robot collaboration.
However, the current system assumes that all transport agents have identical capabilities and ignores energy limitations. These assumptions simplify the problem but may limit applicability in real-world deployments. Future improvements will focus on addressing these constraints by incorporating robot heterogeneity and energy-aware task allocation.

7. Conclusions

This work presented an MAPD framework that enables heterogeneous multi-robot collaboration through dynamic pairing and partial trajectory planning. By integrating an auction-based planner with trajectory optimization, the system achieves improved task efficiency and flexibility. Experimental validation in both simplified and realistic environments confirmed the effectiveness of the proposed method under various workloads.
For future work, we plan to extend the system to consider energy-aware scheduling [30,31,32], which is essential for real-world robot operation, by incorporating battery levels and standby agents in charging stations, allowing for dynamic deployment based on resource availability and demand. Furthermore, the current system assumes uniform robot capabilities. Our future research will explore heterogeneous agents with varying speeds and capacities to enhance generalizability.

Author Contributions

Conceptualization, J.-B.Y. and S.-J.Y.; methodology, J.-B.Y.; software, J.-B.Y.; validation, J.-B.Y., J.K., S.N. and D.S.; formal analysis, J.-B.Y.; investigation, J.-B.Y.; resources, S.-J.Y.; data curation, J.-B.Y.; writing—original draft preparation, J.-B.Y.; writing—review and editing, J.-B.Y., J.K, S.N., D.S. and S.-J.Y.; visualization, J.-B.Y.; supervision, S.-J.Y.; project administration, S.-J.Y.; funding acquisition, S.-J.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by BK21FOUR, Creative Human Resource Education and Research Programs for ICT Convergence in the 4th Industrial Revolution and 2-Year Research Grant of Pusan National University.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
MAPDMulti-Agent Pickup and Delivery
MTZMiller–Tucker–Zemlin
ROSRobot operating system
A*A-star
ORCAOptimal Reciprocal Collision Avoidance

References

  1. Li, J.T. Design Optimization of Amazon Robotics. Autom. Control Intell. Syst. 2016, 4, 48. [Google Scholar] [CrossRef]
  2. Farinelli, A.; Boscolo, N.; Zanotto, E.; Pagello, E. Advanced approaches for multi-robot coordination in logistic scenarios. Robot. Auton. Syst. 2017, 90, 34–44. [Google Scholar] [CrossRef]
  3. Fu, B.; Smith, W.; Rizzo, D.M.; Castanier, M.; Ghaffari, M.; Barton, K. Robust Task Scheduling for Heterogeneous Robot Teams Under Capability Uncertainty. IEEE Trans. Robot. 2023, 39, 1087–1105. [Google Scholar] [CrossRef]
  4. Osooli, H.; Robinette, P.; Jerath, K.; Azadeh, R. A Multi-Robot Task Assignment Framework for Search and Rescue with Heterogeneous Teams. arXiv 2023, arXiv:2309.12589. [Google Scholar]
  5. Li, L.; Chen, Z.; Wang, H.; Kan, Z. Fast Task Allocation of Heterogeneous Robots with Temporal Logic and Inter-Task Constraints. IEEE Robot. Autom. Lett. 2023, 8, 4991–4998. [Google Scholar] [CrossRef]
  6. Wu, Y.; Gao, Y.; Song, W. Multi-robot Task Assignment Algorithm for Medical Service System. In Proceedings of the 2022 IEEE International Conference on Robotics and Biomimetics (ROBIO), Jinghong, China, 5–9 December 2022; pp. 1204–1209. [Google Scholar] [CrossRef]
  7. Dai, L.L.; Pan, Q.K.; Miao, Z.H.; Suganthan, P.N.; Gao, K.Z. Multi-Objective Multi-Picking-Robot Task Allocation: Mathematical Model and Discrete Artificial Bee Colony Algorithm. IEEE Trans. Intell. Transp. Syst. 2024, 25, 6061–6073. [Google Scholar] [CrossRef]
  8. Ju, C.; Kim, J.; Seol, J.; Son, H.I. A review on multirobot systems in agriculture. Comput. Electron. Agric. 2022, 202, 107336. [Google Scholar] [CrossRef]
  9. Chen, Z.; Alonso-Mora, J.; Bai, X.; Harabor, D.D.; Stuckey, P.J. Integrated Task Assignment and Path Planning for Capacitated Multi-Agent Pickup and Delivery. IEEE Robot. Autom. Lett. 2021, 6, 5816–5823. [Google Scholar] [CrossRef]
  10. Xu, G.; Wu, Y.; Tao, S.; Yang, Y.; Liu, T.; Huang, T.; Wu, H.; Liu, Y. Multi-robot Task Allocation and Path Planning with Maximum Range Constraints. arXiv 2024, arXiv:2409.06531. [Google Scholar]
  11. Zhang, Y.; Bastani, F.B.; Yen, I.L. Decentralized Path Planner for Multi-robot Systems. In Proceedings of the 2009 21st IEEE International Conference on Tools with Artificial Intelligence, Newark, NJ, USA, 2–4 November 2009; pp. 171–175. [Google Scholar] [CrossRef]
  12. Hou, J.; Zhou, X.; Gan, Z.; Gao, F. Enhanced Decentralized Autonomous Aerial Robot Teams with Group Planning. IEEE Robot. Autom. Lett. 2022, 7, 9240–9247. [Google Scholar] [CrossRef]
  13. Cecchi, M.; Paiano, M.; Mannucci, A.; Palleschi, A.; Pecora, F.; Pallottino, L. Priority-Based Distributed Coordination for Heterogeneous Multi-Robot Systems with Realistic Assumptions. IEEE Robot. Autom. Lett. 2021, 6, 6131–6138. [Google Scholar] [CrossRef]
  14. Camisa, A.; Testa, A.; Notarstefano, G. Multi-Robot Pickup and Delivery via Distributed Resource Allocation. IEEE Trans. Robot. 2021, 39, 1106–1118. [Google Scholar] [CrossRef]
  15. Wang, H.; Chen, W.; Wang, J. Coupled task scheduling for heterogeneous multi-robot system of two robot types performing complex-schedule order fulfillment tasks. Robot. Auton. Syst. 2020, 131, 103560. [Google Scholar] [CrossRef]
  16. Wang, H.; Chen, W. Task scheduling for transport and pick robots in logistics: A comparative study on constructive heuristics. Auton. Intell. Syst. 2021, 1, 17. [Google Scholar] [CrossRef]
  17. Wang, H.; Chen, W. Task scheduling for heterogeneous agents pickup and delivery using recurrent open shop scheduling models. Robot. Auton. Syst. 2024, 172, 104604. [Google Scholar] [CrossRef]
  18. Thilagavathy, R.; Sumithradevi, K. ROS-Enabled Collaborative Navigation and Manipulation with Heterogeneous Robots. SN Comput. Sci. 2023, 4. [Google Scholar] [CrossRef]
  19. Aswale, A.; Pinciroli, C. Heterogeneous Coalition Formation and Scheduling with Multi-Skilled Robots. In Proceedings of the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Detroit, MI, USA, 1–5 October 2023; pp. 5402–5409. [Google Scholar] [CrossRef]
  20. Camisa, A.; Notarnicola, I.; Notarstefano, G. A Primal Decomposition Method with Suboptimality Bounds for Distributed Mixed-Integer Linear Programming. In Proceedings of the 2018 IEEE Conference on Decision and Control (CDC), Miami, FL, USA, 17–19 December 2018; pp. 3391–3396. [Google Scholar] [CrossRef]
  21. Camisa, A.; Notarnicola, I.; Notarstefano, G. Distributed Primal Decomposition for Large-Scale MILPs. IEEE Trans. Autom. Control 2022, 67, 413–420. [Google Scholar] [CrossRef]
  22. Vujanic, R.; Esfahani, P.; Goulart, P.; Mariethoz, S.; Morari, M. A Decomposition Method for Large Scale MILPs, with Performance Guarantees and a Power System Application. Automatica 2014, 67, 144–156. [Google Scholar] [CrossRef]
  23. Park, S.; Zhong, Y.D.; Leonard, N.E. Multi-Robot Task Allocation Games in Dynamically Changing Environments. In Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi’an, China, 30 May–5 June 2021; pp. 8678–8684. [Google Scholar] [CrossRef]
  24. Sawik, T. A note on the Miller-Tucker-Zemlin model for the asymmetric traveling salesman problem. Pol. Akad. Nauk. Pol. Acad. Sci. 2016, 64, 517–520. [Google Scholar] [CrossRef]
  25. Michel, O. Cyberbotics Ltd. Webots™: Professional Mobile Robot Simulation. Int. J. Adv. Robot. Syst. 2004, 1, 5. [Google Scholar] [CrossRef]
  26. Macenski, S.; Foote, T.; Gerkey, B.; Lalancette, C.; Woodall, W. Robot Operating System 2: Design, Architecture, and Uses In The Wild. arXiv 2022. [Google Scholar] [CrossRef] [PubMed]
  27. Hart, P.; Nilsson, N.; Raphael, B. A Formal Basis for the Heuristic Determination of Minimum Cost Paths. IEEE Trans. Syst. Sci. Cybern. 1968, 4, 100–107. [Google Scholar] [CrossRef]
  28. Gurobi Optimization, LLC. Gurobi Optimizer Reference Manual; Gurobi Optimization, LLC: Beaverton, OR, USA, 2024. [Google Scholar]
  29. van den Berg, J.P.; Guy, S.J.; Lin, M.C.; Manocha, D. Reciprocal n-Body Collision Avoidance. In Robotics Research, Proceedings of the 14th International Symposium ISRR, Lucerne, Switzerland, 31 August–3 September 2009; Springer: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
  30. Bagchi, M.J.; Nair, S.B.; Das, P.K. On a dynamic and decentralized energy-aware technique for multi-robot task allocation. Robot. Auton. Syst. 2024, 180, 104762. [Google Scholar] [CrossRef]
  31. Mokhtari, M.; Mohamadi, P.H.A.; Aernouts, M.; Singh, R.K.; Vanderborght, B.; Weyn, M.; Famaey, J. Energy-aware multi-robot task scheduling using meta-heuristic optimization methods for ambiently-powered robot swarms. Robot. Auton. Syst. 2025, 186, 104898. [Google Scholar] [CrossRef]
  32. Fouad, H.; Beltrame, G. Energy Autonomy for Resource-Constrained Multi Robot Missions. In Proceedings of the 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA, 24 October 2020–24 January 2021; pp. 7006–7013. [Google Scholar] [CrossRef]
Figure 1. Capabilities of the mobile manipulator (a,b) and the transport agent (c,d): (a) picking up an object; (b) delivering an object at the destination; (c) receiving an object from a paired mobile manipulator; (d) delivering multiple objects to the destination.
Figure 1. Capabilities of the mobile manipulator (a,b) and the transport agent (c,d): (a) picking up an object; (b) delivering an object at the destination; (c) receiving an object from a paired mobile manipulator; (d) delivering multiple objects to the destination.
Sensors 25 03269 g001
Figure 2. Overview of system structure: yellow arrows, green arrows, and blue arrows present the state flows starting from the environment, initializer, and predictor, respectively.
Figure 2. Overview of system structure: yellow arrows, green arrows, and blue arrows present the state flows starting from the environment, initializer, and predictor, respectively.
Sensors 25 03269 g002
Figure 3. Pre-auction trajectory with H = 8 : blue arrows, purple arrows, and red arrows represent the trajectory of mobile manipulators i 1 , i 2 , and i 3 , respectively, and yellow and green arrows represent the trajectory of transport agents separated from i 1 and i 3 , respectively.
Figure 3. Pre-auction trajectory with H = 8 : blue arrows, purple arrows, and red arrows represent the trajectory of mobile manipulators i 1 , i 2 , and i 3 , respectively, and yellow and green arrows represent the trajectory of transport agents separated from i 1 and i 3 , respectively.
Sensors 25 03269 g003
Figure 4. Evaluation of the subject candidate: Initial cost trajectories are depicted with blue arrows, while evaluation trajectories are represented with purple arrows.
Figure 4. Evaluation of the subject candidate: Initial cost trajectories are depicted with blue arrows, while evaluation trajectories are represented with purple arrows.
Sensors 25 03269 g004
Figure 5. Post-auction trajectory generated from new starting nodes.
Figure 5. Post-auction trajectory generated from new starting nodes.
Sensors 25 03269 g005
Figure 6. Simulation environment used in the experiments: (a) simplified environment; (b) kindergarten-themed environment.
Figure 6. Simulation environment used in the experiments: (a) simplified environment; (b) kindergarten-themed environment.
Sensors 25 03269 g006
Figure 7. Changes in task completion time depend on the number of transport agents and objects: (a) simplified environment; (b) kindergarten-themed environment.
Figure 7. Changes in task completion time depend on the number of transport agents and objects: (a) simplified environment; (b) kindergarten-themed environment.
Sensors 25 03269 g007
Figure 8. Visual comparison of task completion times according to the number of transport agents and objects.
Figure 8. Visual comparison of task completion times according to the number of transport agents and objects.
Sensors 25 03269 g008
Table 1. Definitions of the notations.
Table 1. Definitions of the notations.
Parameters
I = { i 1 , i 2 , . . , i n } Set of mobile manipulator i.
I p I Set of mobile manipulators paired with transport agent.
J = { j 1 , j 2 , . . , j m } Set of object node j.
w [ 0 , 1 ] Completion time weight.
w τ [ 0 , 1 ] Separated transport agent cost weight.
kDestination node.
H N Object number horizon.
Constraints
l i init Z 0 Initial loaded object of the paired transport agent of the mobile manipulator i.
L max N Maximum carrying capacity of a transport agent.
c i init R 0 Initial cost of the mobile manipulator i.
c i s j R 0 The cost when traveling ( s i , j ) and handling j.
c j k R 0 The cost when traveling ( j , k ) and placing the object j at its destination node k.
c k j R 0 The cost when traveling ( k , j ) and handling j.
c j j R 0 The cost when traveling ( j , j ) and handling j .
Decision variables
x i s j { 0 , 1 } 1 if i travels ( s i , j ) and picks up j, 0 otherwise.
x i j k j { 0 , 1 } 1 if i travels ( j , k ) , places the j at its destination node k, travels ( k , j ) , and picks up j , 0 otherwise.
x i j j { 0 , 1 } 1 if i travels ( j , j ) and picks up j , 0 otherwise.
x i k j j { 0 , 1 } 1 if i separates its paired transport and the transport agent travels to k, the destination node of j, and travels ( j , j ) and picks up j , 0 otherwise.
x i j k { 0 , 1 } 1 if i travels ( j , k ) at last and places the j at its destination node k, 0 otherwise.
o i j R 0 The order of picking up j among the tasks of i.
C i R 0 The total cost of i.
Table 2. Detailed task completion times by the number of transport agents and objects.
Table 2. Detailed task completion times by the number of transport agents and objects.
N τ Simplified EnvironmentKindergarten-Themed Environment
N j = 9 N j = 12 N j = 15 N j = 12 N j = 16 N j = 20 N j = 24
1129213 s278 s155 s211 s229 s272 s
2124 s193 s240 s119 s179 s210 s253 s
3125 s167 s196 s98 s177 s187 s225 s
498 s132 s170 s201 s
5103 s119 s155 s182 s
Table 3. Detailed task completion times by the number of transport agents and objects.
Table 3. Detailed task completion times by the number of transport agents and objects.
N τ Without Auction-Based PairingWith Auction-Based Pairing
N j = 12 N j = 16 N j = 20 N j = 24 N j = 12 N j = 16 N j = 20 N j = 24
1156 s191 s229 s274 s155 s211 s229 s272 s
2150 s185 s212 s262 s119 s179 s210 s253 s
3137 s180 s205 s246 s98 s177 s187 s225 s
4138 s170 s191 s234 s98 s132 s170 s201 s
5136 s159 s176 s218 s103 s119 s155 s182 s
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yi, J.-B.; Nasrat, S.; Song, D.; Kim, J.; Yi, S.-J. Multi-Robot System for Cooperative Tidying Up with Mobile Manipulators and Transport Agents. Sensors 2025, 25, 3269. https://doi.org/10.3390/s25113269

AMA Style

Yi J-B, Nasrat S, Song D, Kim J, Yi S-J. Multi-Robot System for Cooperative Tidying Up with Mobile Manipulators and Transport Agents. Sensors. 2025; 25(11):3269. https://doi.org/10.3390/s25113269

Chicago/Turabian Style

Yi, Jae-Bong, Shady Nasrat, Dongwoon Song, Joonyoung Kim, and Seung-Joon Yi. 2025. "Multi-Robot System for Cooperative Tidying Up with Mobile Manipulators and Transport Agents" Sensors 25, no. 11: 3269. https://doi.org/10.3390/s25113269

APA Style

Yi, J.-B., Nasrat, S., Song, D., Kim, J., & Yi, S.-J. (2025). Multi-Robot System for Cooperative Tidying Up with Mobile Manipulators and Transport Agents. Sensors, 25(11), 3269. https://doi.org/10.3390/s25113269

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop