Multi-Robot System for Cooperative Tidying Up with Mobile Manipulators and Transport Agents

Yi, Jae-Bong; Nasrat, Shady; Song, Dongwoon; Kim, Joonyoung; Yi, Seung-Joon

doi:10.3390/s25113269

Open AccessArticle

Multi-Robot System for Cooperative Tidying Up with Mobile Manipulators and Transport Agents

by

Jae-Bong Yi

,

Shady Nasrat

,

Dongwoon Song

,

Joonyoung Kim

and

Seung-Joon Yi

^*

Department of Electrical Engineering, Pusan National University, Busan 46241, Republic of Korea

^*

Author to whom correspondence should be addressed.

Sensors 2025, 25(11), 3269; https://doi.org/10.3390/s25113269

Submission received: 17 April 2025 / Revised: 14 May 2025 / Accepted: 21 May 2025 / Published: 22 May 2025

(This article belongs to the Special Issue Advances in Multi-Agent Systems: Cooperative and Intelligent Control Strategies for Complex Applications)

Download

Browse Figures

Versions Notes

Abstract

This paper presents a system in which mobile manipulators and transport agents cooperate to solve a multi-agent pickup and delivery (MAPD) problem. The primary objective is to allocate appropriate tasks to heterogeneous robots by considering their capabilities and states. Unlike previous studies that focused on homogeneous teams or assigned distinct roles to heterogeneous robots, this work emphasizes synergy through cooperative task execution. A key feature of the proposed system is that mobile manipulators behave differently depending on whether they are paired with a transport agent. Additionally, rather than generating a full trajectory from start to end, the system plans partial trajectories, allowing dynamic re-pairing of transport agents through an auction algorithm. After re-pairing, new starting nodes are defined, and the following trajectory is updated accordingly. The proposed system is validated through simulations, and its effectiveness is demonstrated by comparing it against a baseline system without dynamic pairing.

Keywords:

heterogeneous multi-agent pickup and delivery; cooperating robots; multi-robot task allocation

1. Introduction

In recent years, multi-robot systems are utilized in various fields, such as logistics [1,2], rescue [3,4], medical care [5,6], and agriculture [7,8]. For these domains, which require the coordination of multiple robots, solving MAPD problems is an important part of the process. Current research in MAPD focuses on developing efficient algorithms for task allocation and path planning to optimize efficiency [9,10]. These studies aim to minimize travel distance, task completion time, and resource consumption while ensuring a collision-free trajectory for multiple agents [11,12,13].

However, most existing approaches assume homogeneous robot teams [7,14] or assigning heterogeneous robots to distinct tasks [15,16,17,18]. Although some studies address heterogeneous robots performing the same task, they rarely focus on performing assigned tasks cooperatively [3,5,19].

In this context, we propose a multi-robot system that performs tidying-up tasks by integrating two heterogeneous robot types: mobile manipulators and transport agents. The system is designed to support synergistic collaboration between these robots.

Figure 1 shows the capabilities of mobile manipulators and transport agents. In this system, a mobile manipulator can pick up an object and deliver it to the destination or hand it to a transport agent. In contrast, a transport agent cannot pick up objects themselves but can receive them from manipulators. However, unlike mobile manipulators, they can deliver multiple objects to the destination in a single trip.

Unlike previous approaches that assign independent tasks to each robot type, this system enables pairing these two types to form collaborative units and create complementary capabilities by integrating trajectory optimization and auction-based task allocation.

In generating an optimized trajectory, the objective function consists of traveling and object-handling time without considering the process of finding new partners. The planning conditions are different depending on whether a transport agent is paired (carrying multiple objects) or unpaired (carrying only one object). Additionally, the system generates a partial trajectory corresponding to the number of objects to deliver in a single expectation.

After matching a new pairing partner and generating the sequence trajectory, new starting nodes are defined at the positions where robots are expected to be located after a certain amount of time. From these nodes, a new trajectory is generated.

1.1. Related Work

Several works have been proposed to solve the MAPD problem. Authors in [14] present the MAPD approach that robots can run in a peer-to-peer fashion. By distributing computation to multiple robots using primal decomposition [20,21,22], this method enables a reduction in computations without a central node. However, this work assumes the use of homogeneous robots. Additionally, duplicate access to delivery points is not considered in the optimization, and all robots should start their task at the same starting point. Furthermore, they conducted only abbreviated experiments, without pick-and-place, using mobile robots without manipulators. Authors in [23] show a game-theoric multi-robot task allocation framework for multi-robot trash collection in dynamically changing environments. In this work, robots choose their working area based on a decision-making algorithm consisting of a payoff (reward) mechanism, Poisson clock, and task revision protocol. Another MAPD framework introduced in [7] shows a practical example of using a multi-objective discrete artificial bee colony (MODABC) algorithm in orchards. However, both [7,23] favor the use of homogeneous robots in solving MAPD problems.

Meanwhile, other studies deal with multi-robot systems with heterogeneous robots. Authors in [3] show task allocation with heterogenous robots by considering their capabilities and states in a rescue scenario, and they prove its validity by applying it to Capture the Flag game. Authors in [5] present task allocation and path planning of heterogeneous robots in medical scenarios. This work introduces Intensive Inter-task Relationship Tree Search (IIRTS) to perform fast task allocation and enable real-time implementation. However, they only present the methods to place heterogeneous robots in appropriate places and do not present how to perform the tasks at the arrival point.

Research in [15,16,17,18] deals with a topic similar to this work that solves the MAPD problem with mobile manipulators and transport agents. Although they solve this problem in various ways (e.g., weighted block sequence graph, constructive heuristics, and recurrent open shop scheduling), these works have limitations in restricting the movement range of mobile manipulators to near storage units. For this reason, these robots only take out items from storage at designated points. Furthermore, as this work assumes that all robots are operated in a structured factory setting with a grid path, applying them in non-structured environments is challenging.

1.2. Contributions

In order to implement the MAPD using only optimization, the timing and location of pairings should be considered in the optimization condition. However, as this method requires an extremely high amount of computation, it is impossible to use in practice. To address this issue, we introduce a multi-robot control system that combines optimization and auction algorithms. This work provides the following contributions.

1.: We propose a novel MAPD system that enables cooperative task execution between mobile manipulators and transport agents in the same space.
2.: The system integrates trajectory optimization and an auction-based task allocation mechanism to adjust the pairing of robots dynamically.
3.: Unlike previous full-trajectory approaches, our method uses a horizon-based partial trajectory planning strategy that enables dynamic pairing and re-optimization during execution with reduced computational costs.
4.: The system applies different planning conditions to mobile manipulators depending on whether they are currently paired with a transport agent or are operating independently. On the other hand, many previous works assume robots have fixed roles or behave in the same way.
5.: The proposed system is validated through realistic simulations, demonstrating the algorithm’s effectiveness.

2. System Overview

The overall system structure, shown in Figure 2, is composed of four main components: the initializer, trajectory planner, pairing planner, and predictor.

At the beginning of an operation, the initializer processes the current states of the environment—such as robot positions and object distributions derived from GPS sensors—and derives the initial states necessary for trajectory planning. These are passed to the trajectory planner, which generates a robot’s optimal path up to a predefined planning horizon without considering pairing updates.

Next, the pairing planner uses an auction-based algorithm to determine new pairing partners for mobile manipulators and transport agents. It evaluates pairing options based on cost and availability and generates a trajectory that reflects these new pairings. These are then merged with the initial pre-auction trajectory to form the post-auction trajectory, which is passed to the predictor.

The predictor analyzes the post-auction trajectory and estimates predictive nodes, which are used by the initializer in the next planning cycle. It also contributes to determining the target behaviors of the robots by combining trajectory information with the current environmental context.

Finally, using the combined trajectory and current state information, the system derives the target actions for each robot.

3. Pre-Auction Trajectory Planning

We formulate the problem using the notations defined in Table 1 to generate optimal trajectories up to a predefined planning horizon without considering pairing updates.

An example of such a trajectory is illustrated in Figure 3, which includes three types of nodes: s for starting positions, j for object locations, and k for delivery destinations. In this example, mobile manipulators

i_{1}

and

i_{3}

, which start at

s_{1}

and

s_{3}

, are initially paired with transport agents, while

i_{2}

, starting at

s_{2}

, is not.

The destination node

k_{1}

is assigned to the objects from

j_{1}

to

j_{3}

,

k_{2}

to the objects from

j_{4}

to

j_{6}

, and

k_{3}

to those from

j_{7}

to

j_{9}

.

3.1. Initializer

Before computing the optimized trajectory, it is necessary to initialize the state of each mobile manipulator i. Algorithm 1 outlines this process, where the initial cost

c_{i}^{init}

, starting node

s_{i}

, paired transport agent

τ_{i}

, and initially loaded objects

l_{i}^{init}

are determined.

Algorithm 1 Initializer.

1:: $for i \in I do$
2:: $if predictive nodes exist then$
3:: $c_{i}^{init} \leftarrow G E T I N I T I A L C O S T (s_{i}^{pred}, Π_{i}^{prev})$
4:: $s_{i} \leftarrow G E T L A S T N O D E I N P R E D I C T I O N (s_{i}^{pred}, Π_{i}^{prev})$
5:: $τ_{i} \leftarrow G E T P A I R E D T R A N S P O R T A G E N T (τ_{i}^{pred}, Π_{i}^{prev})$
6:: $l_{i}^{init} \leftarrow l_{i}^{pred}$
7:: $else$
8:: $c_{i}^{init}, s_{i}, τ_{i}, l_{i}^{init} \leftarrow 0, p_{i}^{now}, τ_{i}^{now}, l_{i}^{now}$
9:: $end for$

If predictive states are available,

c_{i}^{init}

is computed based on the predictive node

s_{i}^{pred}

in the previous trajectory

Π_{i}^{prev}

;

s_{i}

is defined as the last accessing node during the prediction period;

τ_{i}

is considered paired with i if

τ_{i}^{pred}

arrives at the pairing node within the prediction range; and

l_{i}^{init}

is set to its predictive value

l_{i}^{pred}

. (This process is explained in more detail in Section 4.2).

In contrast, if predictive states are unavailable,

c_{i}^{init}

is set to zero;

s_{i}

is assigned to the data received at the current position from an embedded GPS

p_{i}^{now}

; and

τ_{i}

and

l_{i}^{init}

are also set to their current states

τ_{i}^{now}

and

l_{i}^{now}

, respectively.

3.2. Trajectory Planner

After determining the initial states, the following objective function and constraints are defined to generate the trajectory.

3.2.1. Objective Function

min_{i \in I, j \in J} {w max_{i \in I} C_{i} + (1 - w) \sum_{i \in I} C_{i}}

(1)

Equation (1) describes the objective function used to minimize the completion time and the total cost of all robots. These two components can be balanced using the weight parameter w. A higher value of w emphasizes minimizing the maximum completion time, while a lower value prioritizes the reduction of the total cost.

3.2.2. Traveling Constraints

x_{i}^{j_{a} k j_{b}} = 0, x_{i}^{j_{a} j_{b}} = 0, x_{i}^{k j_{a} j_{b}} = 0, where j_{a} = j_{b}, \forall i \in I

(2)

x_{i}^{j_{a} k j_{b}} x_{i}^{j_{b} k j_{a}} = 0, x_{i}^{j_{a} j_{b}} x_{i}^{j_{b} j_{a}} = 0, x_{i}^{k j_{a} j_{b}} x_{i}^{k j_{b} j_{a}} = 0, \forall i \in I

(3)

x_{\neg i_{p}}^{j_{a} j_{b}} = 0, x_{\neg i_{p}}^{k j_{a} j_{b}} = 0, \forall i_{p} \in I_{p}

(4)

x_{i_{p}}^{j_{a} j_{b}} = 0, \forall i_{p} \in I_{p}, where k_{j_{a}} \neq k_{j_{b}}

(5)

x_{i_{p}}^{s j_{a}} x_{i_{p}}^{j_{a} k j_{b}} = 0, \sum_{\begin{matrix} j_{a}, j_{b} \in J \end{matrix}} x_{i_{p}}^{k j_{a} j_{b}} \leq 1, \forall i_{p} \in I_{p}

(6)

Equations (2) and (3) show that the mobile manipulators cannot stop at a specific node and cannot return to the previously visited node.

Equations (4) and (5) describe the conditions when a mobile manipulator travels directly between object nodes. Equation (4) describes that performing such direct travels is restricted for the mobile manipulators not paired with the transport agents. However, even if paired, as described in Equation (5), object nodes

j_{a}

and

j_{b}

should have the same destination node in performing direct travels without separation.

Equation (6) describes the conditions for the mobile manipulators paired with transport agents: they cannot travel to an object node via a destination node immediately after its first movement. Additionally, separating tasks can only be performed once for the paired mobile manipulators.

3.2.3. Manipulation Constraints

\sum_{\begin{matrix} j \in J \end{matrix}} x_{i}^{s j} \leq 1, \forall i \in I

(7)

\sum_{\begin{matrix} i \in I \end{matrix}} x_{i}^{j k} \leq 1, \forall j \in J

(8)

x_{i}^{s j} = x_{i}^{s j} \sum_{\begin{matrix} j^{'} \in J \end{matrix}} x_{i}^{j^{'} k}, \forall i \in I, \forall j \in J

(9)

{\sum_{\begin{matrix} i \in I \end{matrix}} x_{i}^{s j_{a}} + \sum_{\begin{matrix} i \in I \end{matrix}} \sum_{\begin{matrix} j_{b} \in J \end{matrix}} (x_{i}^{j_{b} j_{a}} + x_{i}^{k j_{b} j_{a}} + x_{i}^{j_{b} k j_{a}})} \leq 1, \forall j_{a} \in J

(10)

\sum_{\begin{matrix} j_{a} \in J \end{matrix}} {\sum_{\begin{matrix} i \in I \end{matrix}} x_{i}^{s j_{a}} + \sum_{\begin{matrix} i \in I \end{matrix}} \sum_{\begin{matrix} j_{b} \in J \end{matrix}} (x_{i}^{j_{b} j_{a}} + x_{i}^{k j_{b} j_{a}} + x_{i}^{j_{b} k j_{a}})} = H

(11)

Equations (7)–(9) describe that if a mobile manipulator starts a task, it can only execute one starting task

x_{i}^{s j}

and one final task

x_{i}^{j k}

. However, these constraints alone may result in multiple robots performing the same task. This issue is addressed by Equation (10), which prevents such overlaps by enforcing exclusivity in task execution. Moreover, Equation (10) is extended to Equation (11) by summing over all object nodes, thereby defining the horizon that the number of objects handled in a single planning. In Equation (11), the horizon is determined based on available computing power.

3.2.4. Connecting Constraints

x_{i}^{s j_{a}} = x_{i}^{s j_{a}} {\sum_{\begin{matrix} j_{b} \in J \end{matrix}} (x_{i}^{k j_{a} j_{b}} + x_{i}^{j_{a} j_{b}} + x_{i}^{j_{a} k j_{b}}) + x_{i}^{j_{a} k}}, \forall j_{a} \in J, \forall i \in I

(12)

x_{i}^{j_{a} k} = x_{i}^{j_{a} k} {\sum_{\begin{matrix} j_{b} \in J \end{matrix}} (x_{i}^{k j_{b} j_{a}} + x_{i}^{j_{b} j_{a}} + x_{i}^{j_{b} k j_{a}}) + x_{i}^{s j_{a}}}, \forall j_{a} \in J, \forall i \in I

(13)

x_{i}^{j_{a} k j_{b}} = x_{i}^{j_{a} k j_{b}} {x_{i}^{s j_{a}} + \sum_{\begin{matrix} j_{c} \in J \end{matrix}} (x_{i}^{k j_{c} j_{a}} + x_{i}^{j_{c} k j_{a}})}, \forall j_{a} \in J, \forall i \in I

(14)

x_{i}^{j_{b} k j_{a}} = x_{i}^{j_{b} k j_{a}} {x_{i}^{j_{a} k} + \sum_{\begin{matrix} j_{c} \in J \end{matrix}} x_{i}^{j_{a} k j_{c}}}, \forall j_{a} \in J, \forall i \in I

(15)

x_{i}^{j_{a} j_{b}} = x_{i}^{j_{a} j_{b}} {x_{i}^{s j_{a}} + \sum_{\begin{matrix} j_{c} \in J \end{matrix}} x_{i}^{j_{c} j_{a}}}, \forall j_{a} \in J, \forall i \in I

(16)

x_{i}^{j_{b} j_{a}} = x_{i}^{j_{b} j_{a}} {x_{i}^{j_{a} k} + \sum_{\begin{matrix} j_{c} \in J \end{matrix}} (x_{i}^{j_{a} j_{c}} + x_{i}^{k j_{a} j_{c}})}, \forall j_{a} \in J, \forall i \in I

(17)

x_{i}^{k j_{a} j_{b}} = x_{i}^{k j_{a} j_{b}} {x_{i}^{s j_{a}} + \sum_{\begin{matrix} j_{c} \in J \end{matrix}} x_{i}^{j_{c} j_{a}}}, \forall j_{a} \in J, \forall i \in I

(18)

x_{i}^{k j_{b} j_{a}} = x_{i}^{k j_{b} j_{a}} {x_{i}^{j_{a} k} + \sum_{\begin{matrix} j_{c} \in J \end{matrix}} x_{i}^{j_{a} k j_{c}}}, \forall j_{a} \in J, \forall i \in I

(19)

Equations (12)–(19) define the connecting constraint that connected edges should share the same node. These constraints enforce the continuity of movement by requiring the connections between nodes to align with respective task assignments. Additionally, Equation (19) specifies that if the mobile manipulators separate from their paired transport agents, they are no longer allowed to move directly between object nodes.

3.2.5. Ordering and Capacity Constraints

o_{i}^{j_{b}} = o_{i}^{j_{a}} + 1 - n (J) (1 - x_{i}^{j_{a} j_{b}} - x_{i}^{j_{a} k j_{b}}), \forall j_{a} \in J, \forall i \in I

(20)

\sum_{\begin{matrix} j_{a} \in J \end{matrix}} \sum_{\begin{matrix} j_{b} \in J \end{matrix}} x_{i}^{j_{a} j_{b}} < (L^{\max} - l_{i}^{init}), \forall i \in I

(21)

Equation (20) defines the ordering constraints to prevent the generation of infinite loops in the trajectory (e.g.,

x_{i}^{j_{a} k j_{b}} = 1

,

x_{i}^{j_{b} k j_{c}} = 1

, and

x_{i}^{j_{c} k j_{a}} = 1

), by assigning an execution order using the Miller–Tucker–Zemlin (MTZ) formulation [24].

Additionally, since the number of loaded objects increases by one whenever the robot travels directly between two object locations, the number of such direct travels for i is bounded by its maximum carrying capacity and the number of initially loaded objects, as expressed in Equation (21).

3.2.6. Total Cost

\begin{matrix} C_{i} = c_{i}^{init} + \sum_{j \in J} c_{i}^{s j} x_{i}^{s j} + \sum_{j \in J} c^{j k} x_{i}^{j k} + \sum_{\begin{matrix} j_{a}, j_{b} \in J \end{matrix}} (c^{j_{a} k} + c^{k j_{b}}) x_{i}^{j_{a} k j_{b}} + \\ \sum_{\begin{matrix} j_{a}, j_{b} \in J \end{matrix}} c^{j_{a} j_{b}} x_{i}^{j_{a} j_{b}} + \sum_{\begin{matrix} j_{a}, j_{b} \in J \end{matrix}} (w_{τ} c^{j_{a} k} + c^{j_{a} j_{b}}) x_{i}^{k j_{a} j_{b}}, \forall i \in I \end{matrix}

(22)

Equation (22) represents the deriving total cost of i. When

x_{i}^{k j_{a} j_{b}}

is performed, the cost of the separated transport agent is adjusted by weight

w_{τ}

.

4. Post-Auction Trajectory Planning

After generating the pre-auction trajectory, each transport agent selects a new pairing partner and node and regenerates its trajectory accordingly.

4.1. Auction-Based Pairing Planner

Algorithm 2 shows the overall process. After initializing the best pairing candidate set

P_{best}

to an empty state, the set of pairing candidates P is generated from the pre-auction trajectory

Π^{pre}

. Each candidate comprises a transport agent separated from its original partner, a target mobile manipulator, and a pairing index indicating a node on the target manipulator’s trajectory.

Algorithm 2 Pairing planner.

1:: $P_{best} \leftarrow \emptyset, P \leftarrow G E T P A I R I N G C A N D I D A T E (Π^{pre})$
2:: $while P \neq \emptyset do$
3:: $p_{best}, J_{best}^{done} \leftarrow S E L E C T B E S T P A I R (P, P_{best}, Π^{pre}, J, H)$
4:: $P \leftarrow E X C L U D E S E L E C T E D A G E N T S (P, p_{best} . i, p_{best} . τ)$
5:: $J \leftarrow J - J_{best}^{done}, H \leftarrow H - n (J_{best}^{done})$ ,
6:: $P_{best} \leftarrow P_{best} + p_{best}$
7:: $end while$
8:: $C^{init}, S^{new} \leftarrow G E T I N I T I A L S T A T E A F T E R P A I R I N G (P_{best})$
9:: $Π^{new} \leftarrow T R J P L A N N E R (P_{best}, C^{init}, S^{new}, H, J)$
10:: $Π^{post} \leftarrow R E P L A C E T R J (Π^{pre}, S^{new}, Π^{new})$

The algorithm iteratively selects the best available candidate until all separated transport agents in P are paired with new mobile manipulators. Once the best candidate

p_{best}

is identified, the robots included in

p_{best}

and the objects handled at SelectBestPair

J_{best}^{done}

are excluded from further consideration. The process is then repeated with the remaining pairings to determine the next optimal pairing state.

After all pairings are finalized, the new pairing trajectory

Π^{new}

is generated from new starting nodes

S^{new}

. By replacing the pre-auction trajectory after

S^{new}

to

Π^{new}

, the post-auction trajectory

Π^{post}

is generated.

4.1.1. Pairing Candidates

Algorithm 3 demonstrates the procedure for generating pairing candidates. For each transport agent

τ

and mobile manipulator i, the algorithm evaluates every node

π_{i}^{n}

along i’s trajectory

Π_{i}

. If i is initially unpaired, the evaluation begins from the first node; otherwise, it starts from the separation index

n_{sep}

.

Algorithm 3 GetPairingCandidate

1:: $P \leftarrow \emptyset$
2:: $for τ \in T, i \in I do$
3:: $P_{i, τ} \leftarrow \emptyset$
4:: $for n = 0, 1, . . ., N_{Π_{i}} - 1 do$
5:: $if i \in \neg I_{p} or n \geq n_{sep} then$
6:: $t_{τ}^{π_{i}^{n}} \leftarrow t_{τ}^{pre - sep} + t_{τ}^{j_{sep}, k_{sep}} + t_{handle} + t_{τ}^{k_{sep}, π_{i}^{n}}$
7:: $if t_{τ}^{π_{i}^{n}} \leq t_{i}^{π_{i}^{n}} + t_{A} then$
8:: $P_{i, τ} \leftarrow P_{i, τ} + (τ, i, n, t_{τ}^{π_{i}^{n}})$
9:: $end if$
10:: $end if$
11:: $end for$
12:: $P \leftarrow P + F I L T E R F A S T E S T M E E T I N G P A I R S (P_{i, τ}, N)$
13:: $end for$
14:: $return P$

At each node, a pairing with

τ

is considered feasible if the total estimated time for the transport agent to arrive—including the pre-separation time

t_{τ}^{pre - sep}

, travel time from the separation point to the destination node

t_{τ}^{j_{sep}, k_{sep}}

, object handling time

t_{handle}

, and travel time from the destination node to the n-th node on the trajectory

t_{τ}^{k_{sep}, π_{i}^{n}}

—does not exceed the sum of the arrival/handling time of the target mobile manipulator at the n-th node

t_{i}^{π_{i}^{n}}

, and the allowable waiting time

t_{A}

.

Among all feasible candidates in

P_{i, τ}

, the top N candidates with the earliest meeting times

t_{τ}^{π_{i}^{n}}

are selected using

F I L T E R F A S T E S T M E E T I N G P A I R S

, and the corresponding values

(τ, i, n)

are added to the global candidate set P.

4.1.2. Selecting the Best Pair with Auction

To determine the optimal pairs, each candidate is evaluated using the process described in Algorithm 4.

In each evaluation, i in the subject candidate p is assumed to be paired with a transport agent at the n-th node on its pre-auction trajectory

Π_{p . i}^{pre}

. Additionally, the previously selected candidates

P_{best}

are also assumed to be paired at their respective pairing nodes. Moreover,

C_{eval}^{init}

and

S_{eval}

represent the initial cost and starting positions of all agents, and

J_{p} . i^{done}

represents the set of objects already handled by

p . i

before the n-th node, and they are derived using

G E T I N I T I A L S T A T E S

.

Figure 4 shows an example of how the initial states are set when evaluating the subject candidate

p_{1}

. In this example, the mobile manipulator

i_{1}

is paired with a transport agent at node

j_{2}

. Other relevant robots, such as

i_{2}

and

i_{3}

, are also considered in the evaluation:

i_{2}

is a non-subject mobile manipulator, and

i_{3}

is a previously selected mobile manipulator that is paired with a transport agent at node

j_{5}

. When

i_{1}

arrives at

j_{2}

, each of these robots is positioned at its respective coincident node, which is marked with a red line in the figure.

Algorithm 4 SelectBestPair

1:: $C_{eval} \leftarrow \emptyset$
2:: $for p \in P do$
3:: $S E T P A I R I N G S (p . i, p . n, P_{best}, Π_{p . i}^{pre})$
4:: $C_{eval}^{init}, S^{eval}, J_{p . i}^{done} \leftarrow G E T I N I T I A L S T A T E S (p, P_{best}, Π^{pre})$
5:: $J_{eval} \leftarrow J - J_{p . i}^{done}, H_{eval} \leftarrow H - n (J_{p . i}^{done})$
6:: $Π_{p} \leftarrow T R J P L A N N E R (p, P_{best}, C_{eval}^{init}, S_{eval}, H_{eval}, J_{eval})$
7:: $C_{eval} \leftarrow C_{eval} + Π_{p} . c$
8:: $end for$
9:: $p_{best}, J_{best}^{done} \leftarrow G E T M I N C O S T P A I R (C^{eval})$
10:: $return p_{best}, J_{best}^{done}$

In this scenario,

j_{1}

is the object handled by the subject candidate, and

j_{2}

becomes the starting node for

i_{1}

. For each non-subject agent, the starting node is defined as the first node on its trajectory after passing the coincident point. However, if the agent belongs to a previously selected candidate, its pairing node is used as its starting point.

Based on these updated starting nodes, the initial cost of each candidate is calculated as the sum of travel and handling times from their original starting node to the updated one, presented with blue arrows.

After defining the initial states, the candidate trajectory

Π_{p}

is generated using

T R J P L A N N E R

(Section 3.2), assuming that non-subject mobile manipulators not in the previously selected set are unpaired. The resulting cost

Π_{p} . c

is then computed from

Π_{p}

and stored in the evaluation set

C_{eval}

.

This evaluation process is repeated for all candidates. Among them, the candidate with the lowest cost is selected as the best pair

p_{best}

, and its handled object set

J_{best}^{done}

is excluded from consideration in the subsequent sequence auctions.

4.1.3. Post-Auction Trajectory

The mobile manipulators included in

P_{best}

are designated as paired at their respective pairing nodes. Conversely, those not included in

P_{best}

are designated as unpaired, starting from their separating node if they were initially paired or from their original starting node if they were initially unpaired.

At these new starting nodes

S^{new}

, the cost from each robot’s original starting node to its new starting node is set as the initial cost

C^{init}

. Additionally, the object nodes between these starting nodes are omitted when planning the new trajectory.

With these initial states, the trajectory

Π^{new}

is generated using

T R J P L A N N E R

, and the post-auction trajectory

Π^{post}

is obtained by replacing the segment after

S^{new}

in

Π^{pre}

with

Π^{new}

.

4.2. Predictor

After generating the post-auction trajectory, we can create a new trajectory from the points concerning the specific time that has passed in reference to the previous starting points. As shown in Figure 5, the starting nodes move through the trajectory for the predefined prediction time, which is indicated with a dotted arrow, and the predictive starting node is set as

s_{i}^{pred}

. From these nodes, a new trajectory is generated.

As performed in Section 4.1.2, each mobile manipulator designates its last accessing node during the prediction time as its predictive initial accessing nodes, and the sum of the traveling time and handling time from each predictive starting node to the initial accessing node is defined as the initial cost

c_{i}^{init}

. For instance, in Figure 5,

j_{2}

,

k_{1}

, and

j_{7}

are assigned as the new starting nodes for

i_{1}

,

i_{2}

, and

i_{3}

, respectively, and the costs from each

s_{i}^{pred}

to its corresponding starting node are set as

c_{i}^{init}

.

If the initial accessing node for i is an object node j, then

x_{i}^{s j}

is set to 1. In addition, object nodes handled by each paired robot within the prediction range are added to its initially loaded objects

l_{i}^{init}

in the new trajectory. As shown in Figure 5,

j_{1}

and

j_{3}

are added to the loaded objects of

i_{1}

and

i_{2}

, respectively, and

x_{i_{1}}^{s j_{2}}

and

x_{i_{3}}^{s j_{7}}

are set to 1.

5. Experiments

The experiments were conducted using Webots [25], an open-source robotic simulator integrated with ROS2 [26].

To generate navigation trajectories, we employed the A* [27] algorithm to compute travel paths and estimate the costs between waypoints, including object positions, destinations, and robot locations. Based on the computed costs, the Gurobi Optimizer [28] was used to generate optimized trajectories as described in Section 3.2, where the weight parameter w was set to 0.7. In addition, ORCA (Optimal Reciprocal Collision Avoidance) [29] was applied during navigation to prevent collisions between robots.

The experiments are designed to evaluate the proposed system from two perspectives—its scalability in different task settings, and the impact of the auction-based pairing algorithm on overall performance.

In evaluating its scalability, the experiments were conducted in two environments: a simplified environment and a kindergarten-themed environment—both shown in Figure 6. The simplified environment consists of cube-shaped objects and three destinations without obstacles. In contrast, the kindergarten-themed environment includes objects commonly found in kindergartens, four destinations, and various obstacles.

Meanwhile, to evaluate the effectiveness of the auction-based pairing algorithm, we compared the proposed system to a baseline configuration that does not include this mechanism. In the baseline system, each mobile manipulator operates independently with fixed storage, and no dynamic role reassignment with transport agents occurs.

Both experiments were conducted under varying numbers of objects

N_{j}

and transport agents

N_{τ}

, assuming

L^{\max} = 2

for all transport agents.

For a visual representation of the experiments, all corresponding video recordings can be accessed at the following link: (https://youtube.com/playlist?list=PLB1pUAsYGpRHK9mhHIeu827JLSoLhcznD&si=97sXqhfHEqkMkMc0, accessed on 20 May 2025).

5.1. Scalability Evaluation

5.1.1. Experimental Scenarios

To evaluate the scalability of the proposed system, we test its performance under two different environments:

Simplified Environment: A minimal environment shown in Figure 6a, where three mobile manipulators and a varying number of transport agents collaborate to tidy up the workspace by placing cube-shaped objects into the destination zones of matching colors.
Kindergarten-Themed Environment: A realistic environment shown in Figure 6b, which consists of four distinct areas: a playroom, a corridor, a classroom, and a teacher’s room. In this environment, five mobile manipulators and a varying number of transport agents work together to tidy up four different types of objects—balls, toys, teaching aids, and stationery—by delivering them to their designated locations. To ensure consistent handling and loading behaviors, all objects were standardized to have a weight of 50 g and were encapsulated in a cubic bounding box with a side length of 0.07 m.

5.1.2. Performance Comparison

We measured task completion time in both environments while increasing

N_{j}

and

N_{τ}

. The results, summarized in Figure 7 and Table 2, show that the proposed system scales well in both structured and complex environments.

In the simplified environment, the system shows relatively minor gains when

N_{j}

is small. For example, at

N_{j} = 9

, increasing

N_{τ}

from 2 to 3 results in only a 1-s increase, suggesting possible coordination overhead. However, as the number of objects increases to

N_{j} = 15

, the benefit becomes more evident, with up to 44 s of reduction when increasing

N_{τ}

from 2 to 3.

In the kindergarten-themed environment, which introduces additional complexity, such as obstacles and spatial constraints, similar trends are observed. For

N_{j} = 12

, increasing the number of transport agents from one to two reduces the completion time significantly by 36 s. A further increase to three units continues to reduce the time to 98 s. However, increasing

N_{τ}

from 3 to 4 does not yield any improvement, and adding a fifth transport agent increases the completion time by 5 s, possibly due to the coordination overhead in a low-demand scenario.

For larger workloads,

N_{j} = 20

and

N_{j} = 24

, the performance improves more consistently as the number of transport agents increases, following a similar trend observed in the simplified environment. For

N_{j} = 20

, task completion time drops from 229 s with one unit to 155 s with five units, with the average decrease being 18.5 s per unit. A similar pattern is observed at

N_{j} = 24

, where time reduces from 272 to 182 s as the number of transport agents increases, with the average decrease being 22.5 s per unit.

These results demonstrate that the proposed system scales effectively in both simplified and realistic settings, and that the degree of performance improvement becomes more substantial as the task complexity and workload increase.

5.2. Effectiveness of Auction-Based Pairing

5.2.1. Experimental Scenarios

To evaluate the effectiveness of the auction-based pairing algorithm, we define two experimental scenarios:

With auction-based pairing: Robots use the proposed dynamic pairing strategy, where transport agents and mobile manipulators are paired through an auction-based algorithm. Pairings can change dynamically based on task status and robot availability.
Without auction-based pairing: Robots operate without the auction-based pairing mechanism. Instead of dynamic pairing, each mobile manipulator, assumed to be paired with a transport agent, is equipped with a container that has the same $L^{\max}$ as a transport agent and performs the task without any pairing mechanism. The trajectory planner employed in this work is based on the approach proposed in [14].

Both scenarios are tested under identical environment settings, including object placements and robot configurations. Task completion time is measured and compared to analyze the effectiveness of the auction-based pairing strategy.

5.2.2. Performance Comparison

To assess the effectiveness of the proposed auction-based pairing strategy, we compare its performance against a baseline configuration in which each mobile manipulator is equipped with a fixed container and no dynamic pairing is allowed. Both systems are evaluated in the kindergarten-themed environment.

As shown in Figure 8 and Table 3, the auction-based system achieves better performance in most settings, particularly when the number of transport agents increases.

For example, at

N_{j} = 12

, the completion time decreases from 156 s to 103 s as

N_{τ}

increases from 1 to 5 without an auction-based pairing; with the proposed method, the time drops from 155 s to 98 s over the same range.

At

N_{j} = 16

, the difference becomes even more pronounced. The baseline shows a reduction of 32 s (from 191 s to 159 s), while the auction-based method achieves a 92 s reduction (from 211 s to 119 s), demonstrating more effective resource utilization under moderate task loads.

Even in higher workloads, the auction-based method consistently maintains a more significant margin of improvement. When

N_{j} = 24

, the completion time drops from 274 s to 218 s without an auction-based pairing and from 272 s to 182 s with an auction-based pairing, yielding a 36 s advantage.

These results indicate that the proposed auction-based pairing enables better adaptation to varying task distributions. By allowing dynamic reallocation of roles between mobile manipulators and transport agents, the system achieves consistently shorter completion times across different conditions.

6. Discussion

This study proposes an MAPD system that enables efficient collaboration between mobile manipulators and transport agents. Unlike conventional approaches that treat heterogeneous agents separately, our system integrates trajectory optimization with auction-based dynamic pairing to support flexible role adjustment and synergy.

Experimental results validate the approach, showing that the proposed method improves task completion time, especially in high-demand situations. These improvements highlight the value of strategic dynamic pairing and partial trajectory planning in optimizing heterogeneous multi-robot collaboration.

However, the current system assumes that all transport agents have identical capabilities and ignores energy limitations. These assumptions simplify the problem but may limit applicability in real-world deployments. Future improvements will focus on addressing these constraints by incorporating robot heterogeneity and energy-aware task allocation.

7. Conclusions

This work presented an MAPD framework that enables heterogeneous multi-robot collaboration through dynamic pairing and partial trajectory planning. By integrating an auction-based planner with trajectory optimization, the system achieves improved task efficiency and flexibility. Experimental validation in both simplified and realistic environments confirmed the effectiveness of the proposed method under various workloads.

For future work, we plan to extend the system to consider energy-aware scheduling [30,31,32], which is essential for real-world robot operation, by incorporating battery levels and standby agents in charging stations, allowing for dynamic deployment based on resource availability and demand. Furthermore, the current system assumes uniform robot capabilities. Our future research will explore heterogeneous agents with varying speeds and capacities to enhance generalizability.

Author Contributions

Conceptualization, J.-B.Y. and S.-J.Y.; methodology, J.-B.Y.; software, J.-B.Y.; validation, J.-B.Y., J.K., S.N. and D.S.; formal analysis, J.-B.Y.; investigation, J.-B.Y.; resources, S.-J.Y.; data curation, J.-B.Y.; writing—original draft preparation, J.-B.Y.; writing—review and editing, J.-B.Y., J.K, S.N., D.S. and S.-J.Y.; visualization, J.-B.Y.; supervision, S.-J.Y.; project administration, S.-J.Y.; funding acquisition, S.-J.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by BK21FOUR, Creative Human Resource Education and Research Programs for ICT Convergence in the 4th Industrial Revolution and 2-Year Research Grant of Pusan National University.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

MAPD	Multi-Agent Pickup and Delivery
MTZ	Miller–Tucker–Zemlin
ROS	Robot operating system
A*	A-star
ORCA	Optimal Reciprocal Collision Avoidance

References

Li, J.T. Design Optimization of Amazon Robotics. Autom. Control Intell. Syst. 2016, 4, 48. [Google Scholar] [CrossRef]
Farinelli, A.; Boscolo, N.; Zanotto, E.; Pagello, E. Advanced approaches for multi-robot coordination in logistic scenarios. Robot. Auton. Syst. 2017, 90, 34–44. [Google Scholar] [CrossRef]
Fu, B.; Smith, W.; Rizzo, D.M.; Castanier, M.; Ghaffari, M.; Barton, K. Robust Task Scheduling for Heterogeneous Robot Teams Under Capability Uncertainty. IEEE Trans. Robot. 2023, 39, 1087–1105. [Google Scholar] [CrossRef]
Osooli, H.; Robinette, P.; Jerath, K.; Azadeh, R. A Multi-Robot Task Assignment Framework for Search and Rescue with Heterogeneous Teams. arXiv 2023, arXiv:2309.12589. [Google Scholar]
Li, L.; Chen, Z.; Wang, H.; Kan, Z. Fast Task Allocation of Heterogeneous Robots with Temporal Logic and Inter-Task Constraints. IEEE Robot. Autom. Lett. 2023, 8, 4991–4998. [Google Scholar] [CrossRef]
Wu, Y.; Gao, Y.; Song, W. Multi-robot Task Assignment Algorithm for Medical Service System. In Proceedings of the 2022 IEEE International Conference on Robotics and Biomimetics (ROBIO), Jinghong, China, 5–9 December 2022; pp. 1204–1209. [Google Scholar] [CrossRef]
Dai, L.L.; Pan, Q.K.; Miao, Z.H.; Suganthan, P.N.; Gao, K.Z. Multi-Objective Multi-Picking-Robot Task Allocation: Mathematical Model and Discrete Artificial Bee Colony Algorithm. IEEE Trans. Intell. Transp. Syst. 2024, 25, 6061–6073. [Google Scholar] [CrossRef]
Ju, C.; Kim, J.; Seol, J.; Son, H.I. A review on multirobot systems in agriculture. Comput. Electron. Agric. 2022, 202, 107336. [Google Scholar] [CrossRef]
Chen, Z.; Alonso-Mora, J.; Bai, X.; Harabor, D.D.; Stuckey, P.J. Integrated Task Assignment and Path Planning for Capacitated Multi-Agent Pickup and Delivery. IEEE Robot. Autom. Lett. 2021, 6, 5816–5823. [Google Scholar] [CrossRef]
Xu, G.; Wu, Y.; Tao, S.; Yang, Y.; Liu, T.; Huang, T.; Wu, H.; Liu, Y. Multi-robot Task Allocation and Path Planning with Maximum Range Constraints. arXiv 2024, arXiv:2409.06531. [Google Scholar]
Zhang, Y.; Bastani, F.B.; Yen, I.L. Decentralized Path Planner for Multi-robot Systems. In Proceedings of the 2009 21st IEEE International Conference on Tools with Artificial Intelligence, Newark, NJ, USA, 2–4 November 2009; pp. 171–175. [Google Scholar] [CrossRef]
Hou, J.; Zhou, X.; Gan, Z.; Gao, F. Enhanced Decentralized Autonomous Aerial Robot Teams with Group Planning. IEEE Robot. Autom. Lett. 2022, 7, 9240–9247. [Google Scholar] [CrossRef]
Cecchi, M.; Paiano, M.; Mannucci, A.; Palleschi, A.; Pecora, F.; Pallottino, L. Priority-Based Distributed Coordination for Heterogeneous Multi-Robot Systems with Realistic Assumptions. IEEE Robot. Autom. Lett. 2021, 6, 6131–6138. [Google Scholar] [CrossRef]
Camisa, A.; Testa, A.; Notarstefano, G. Multi-Robot Pickup and Delivery via Distributed Resource Allocation. IEEE Trans. Robot. 2021, 39, 1106–1118. [Google Scholar] [CrossRef]
Wang, H.; Chen, W.; Wang, J. Coupled task scheduling for heterogeneous multi-robot system of two robot types performing complex-schedule order fulfillment tasks. Robot. Auton. Syst. 2020, 131, 103560. [Google Scholar] [CrossRef]
Wang, H.; Chen, W. Task scheduling for transport and pick robots in logistics: A comparative study on constructive heuristics. Auton. Intell. Syst. 2021, 1, 17. [Google Scholar] [CrossRef]
Wang, H.; Chen, W. Task scheduling for heterogeneous agents pickup and delivery using recurrent open shop scheduling models. Robot. Auton. Syst. 2024, 172, 104604. [Google Scholar] [CrossRef]
Thilagavathy, R.; Sumithradevi, K. ROS-Enabled Collaborative Navigation and Manipulation with Heterogeneous Robots. SN Comput. Sci. 2023, 4. [Google Scholar] [CrossRef]
Aswale, A.; Pinciroli, C. Heterogeneous Coalition Formation and Scheduling with Multi-Skilled Robots. In Proceedings of the 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Detroit, MI, USA, 1–5 October 2023; pp. 5402–5409. [Google Scholar] [CrossRef]
Camisa, A.; Notarnicola, I.; Notarstefano, G. A Primal Decomposition Method with Suboptimality Bounds for Distributed Mixed-Integer Linear Programming. In Proceedings of the 2018 IEEE Conference on Decision and Control (CDC), Miami, FL, USA, 17–19 December 2018; pp. 3391–3396. [Google Scholar] [CrossRef]
Camisa, A.; Notarnicola, I.; Notarstefano, G. Distributed Primal Decomposition for Large-Scale MILPs. IEEE Trans. Autom. Control 2022, 67, 413–420. [Google Scholar] [CrossRef]
Vujanic, R.; Esfahani, P.; Goulart, P.; Mariethoz, S.; Morari, M. A Decomposition Method for Large Scale MILPs, with Performance Guarantees and a Power System Application. Automatica 2014, 67, 144–156. [Google Scholar] [CrossRef]
Park, S.; Zhong, Y.D.; Leonard, N.E. Multi-Robot Task Allocation Games in Dynamically Changing Environments. In Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi’an, China, 30 May–5 June 2021; pp. 8678–8684. [Google Scholar] [CrossRef]
Sawik, T. A note on the Miller-Tucker-Zemlin model for the asymmetric traveling salesman problem. Pol. Akad. Nauk. Pol. Acad. Sci. 2016, 64, 517–520. [Google Scholar] [CrossRef]
Michel, O. Cyberbotics Ltd. Webots™: Professional Mobile Robot Simulation. Int. J. Adv. Robot. Syst. 2004, 1, 5. [Google Scholar] [CrossRef]
Macenski, S.; Foote, T.; Gerkey, B.; Lalancette, C.; Woodall, W. Robot Operating System 2: Design, Architecture, and Uses In The Wild. arXiv 2022. [Google Scholar] [CrossRef] [PubMed]
Hart, P.; Nilsson, N.; Raphael, B. A Formal Basis for the Heuristic Determination of Minimum Cost Paths. IEEE Trans. Syst. Sci. Cybern. 1968, 4, 100–107. [Google Scholar] [CrossRef]
Gurobi Optimization, LLC. Gurobi Optimizer Reference Manual; Gurobi Optimization, LLC: Beaverton, OR, USA, 2024. [Google Scholar]
van den Berg, J.P.; Guy, S.J.; Lin, M.C.; Manocha, D. Reciprocal n-Body Collision Avoidance. In Robotics Research, Proceedings of the 14th International Symposium ISRR, Lucerne, Switzerland, 31 August–3 September 2009; Springer: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
Bagchi, M.J.; Nair, S.B.; Das, P.K. On a dynamic and decentralized energy-aware technique for multi-robot task allocation. Robot. Auton. Syst. 2024, 180, 104762. [Google Scholar] [CrossRef]
Mokhtari, M.; Mohamadi, P.H.A.; Aernouts, M.; Singh, R.K.; Vanderborght, B.; Weyn, M.; Famaey, J. Energy-aware multi-robot task scheduling using meta-heuristic optimization methods for ambiently-powered robot swarms. Robot. Auton. Syst. 2025, 186, 104898. [Google Scholar] [CrossRef]
Fouad, H.; Beltrame, G. Energy Autonomy for Resource-Constrained Multi Robot Missions. In Proceedings of the 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA, 24 October 2020–24 January 2021; pp. 7006–7013. [Google Scholar] [CrossRef]

Figure 1. Capabilities of the mobile manipulator (a,b) and the transport agent (c,d): (a) picking up an object; (b) delivering an object at the destination; (c) receiving an object from a paired mobile manipulator; (d) delivering multiple objects to the destination.

Figure 2. Overview of system structure: yellow arrows, green arrows, and blue arrows present the state flows starting from the environment, initializer, and predictor, respectively.

Figure 3. Pre-auction trajectory with

H = 8

: blue arrows, purple arrows, and red arrows represent the trajectory of mobile manipulators

i_{1}

,

i_{2}

, and

i_{3}

, respectively, and yellow and green arrows represent the trajectory of transport agents separated from

i_{1}

and

i_{3}

, respectively.

Figure 3. Pre-auction trajectory with

H = 8

: blue arrows, purple arrows, and red arrows represent the trajectory of mobile manipulators

i_{1}

,

i_{2}

, and

i_{3}

, respectively, and yellow and green arrows represent the trajectory of transport agents separated from

i_{1}

and

i_{3}

, respectively.

Figure 4. Evaluation of the subject candidate: Initial cost trajectories are depicted with blue arrows, while evaluation trajectories are represented with purple arrows.

Figure 5. Post-auction trajectory generated from new starting nodes.

Figure 6. Simulation environment used in the experiments: (a) simplified environment; (b) kindergarten-themed environment.

Figure 7. Changes in task completion time depend on the number of transport agents and objects: (a) simplified environment; (b) kindergarten-themed environment.

Figure 8. Visual comparison of task completion times according to the number of transport agents and objects.

Table 1. Definitions of the notations.

Parameters
$I = {i_{1}, i_{2}, . ., i_{n}}$	Set of mobile manipulator i.
$I_{p} \subseteq I$	Set of mobile manipulators paired with transport agent.
$J = {j_{1}, j_{2}, . ., j_{m}}$	Set of object node j.
$w \in [0, 1]$	Completion time weight.
$w_{τ} \in [0, 1]$	Separated transport agent cost weight.
k	Destination node.
$H \in N$	Object number horizon.
Constraints
$l_{i}^{init} \in Z_{\geq 0}$	Initial loaded object of the paired transport agent of the mobile manipulator i.
$L^{\max} \in N$	Maximum carrying capacity of a transport agent.
$c_{i}^{init} \in R_{\geq 0}$	Initial cost of the mobile manipulator i.
$c_{i}^{s j} \in R_{\geq 0}$	The cost when traveling $(s_{i}, j)$ and handling j.
$c^{j k} \in R_{\geq 0}$	The cost when traveling $(j, k)$ and placing the object j at its destination node k.
$c^{k j} \in R_{\geq 0}$	The cost when traveling $(k, j)$ and handling j.
$c^{j j^{'}} \in R_{\geq 0}$	The cost when traveling $(j, j^{'})$ and handling $j^{'}$ .
Decision variables
$x_{i}^{s j} \in {0, 1}$	1 if i travels $(s_{i}, j)$ and picks up j, 0 otherwise.
$x_{i}^{j k j^{'}} \in {0, 1}$	1 if i travels $(j, k)$ , places the j at its destination node k, travels $(k, j^{'})$ , and picks up $j^{'}$ , 0 otherwise.
$x_{i}^{j j^{'}} \in {0, 1}$	1 if i travels $(j, j^{'})$ and picks up $j^{'}$ , 0 otherwise.
$x_{i}^{k j j^{'}} \in {0, 1}$	1 if i separates its paired transport and the transport agent travels to k, the destination node of j, and travels $(j, j^{'})$ and picks up $j^{'}$ , 0 otherwise.
$x_{i}^{j k} \in {0, 1}$	1 if i travels $(j, k)$ at last and places the j at its destination node k, 0 otherwise.
$o_{i}^{j} \in R_{\geq 0}$	The order of picking up j among the tasks of i.
$C_{i} \in R_{\geq 0}$	The total cost of i.

Table 2. Detailed task completion times by the number of transport agents and objects.

$N_{τ}$	Simplified Environment			Kindergarten-Themed Environment
$N_{τ}$	$N_{j} = 9$	$N_{j} = 12$	$N_{j} = 15$	$N_{j} = 12$	$N_{j} = 16$	$N_{j} = 20$	$N_{j} = 24$
1	129	213 s	278 s	155 s	211 s	229 s	272 s
2	124 s	193 s	240 s	119 s	179 s	210 s	253 s
3	125 s	167 s	196 s	98 s	177 s	187 s	225 s
4	–			98 s	132 s	170 s	201 s
5	–			103 s	119 s	155 s	182 s

Table 3. Detailed task completion times by the number of transport agents and objects.

$N_{τ}$	Without Auction-Based Pairing				With Auction-Based Pairing
$N_{τ}$	$N_{j} = 12$	$N_{j} = 16$	$N_{j} = 20$	$N_{j} = 24$	$N_{j} = 12$	$N_{j} = 16$	$N_{j} = 20$	$N_{j} = 24$
1	156 s	191 s	229 s	274 s	155 s	211 s	229 s	272 s
2	150 s	185 s	212 s	262 s	119 s	179 s	210 s	253 s
3	137 s	180 s	205 s	246 s	98 s	177 s	187 s	225 s
4	138 s	170 s	191 s	234 s	98 s	132 s	170 s	201 s
5	136 s	159 s	176 s	218 s	103 s	119 s	155 s	182 s

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yi, J.-B.; Nasrat, S.; Song, D.; Kim, J.; Yi, S.-J. Multi-Robot System for Cooperative Tidying Up with Mobile Manipulators and Transport Agents. Sensors 2025, 25, 3269. https://doi.org/10.3390/s25113269

AMA Style

Yi J-B, Nasrat S, Song D, Kim J, Yi S-J. Multi-Robot System for Cooperative Tidying Up with Mobile Manipulators and Transport Agents. Sensors. 2025; 25(11):3269. https://doi.org/10.3390/s25113269

Chicago/Turabian Style

Yi, Jae-Bong, Shady Nasrat, Dongwoon Song, Joonyoung Kim, and Seung-Joon Yi. 2025. "Multi-Robot System for Cooperative Tidying Up with Mobile Manipulators and Transport Agents" Sensors 25, no. 11: 3269. https://doi.org/10.3390/s25113269

APA Style

Yi, J.-B., Nasrat, S., Song, D., Kim, J., & Yi, S.-J. (2025). Multi-Robot System for Cooperative Tidying Up with Mobile Manipulators and Transport Agents. Sensors, 25(11), 3269. https://doi.org/10.3390/s25113269

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Multi-Robot System for Cooperative Tidying Up with Mobile Manipulators and Transport Agents

Abstract

1. Introduction

1.1. Related Work

1.2. Contributions

2. System Overview

3. Pre-Auction Trajectory Planning

3.1. Initializer

3.2. Trajectory Planner

3.2.1. Objective Function

3.2.2. Traveling Constraints

3.2.3. Manipulation Constraints

3.2.4. Connecting Constraints

3.2.5. Ordering and Capacity Constraints

3.2.6. Total Cost

4. Post-Auction Trajectory Planning

4.1. Auction-Based Pairing Planner

4.1.1. Pairing Candidates

4.1.2. Selecting the Best Pair with Auction

4.1.3. Post-Auction Trajectory

4.2. Predictor

5. Experiments

5.1. Scalability Evaluation

5.1.1. Experimental Scenarios

5.1.2. Performance Comparison

5.2. Effectiveness of Auction-Based Pairing

5.2.1. Experimental Scenarios

5.2.2. Performance Comparison

6. Discussion

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI