Next Article in Journal
Systematic HLS Co-Design: Achieving Scalable and Fully-Pipelined NTT Acceleration on FPGAs
Previous Article in Journal
Low-Rank Matrix Completion via Nonconvex Rank Approximation for IoT Network Localization
Previous Article in Special Issue
Quaternion-Based Velocity Scheduling for Robotic Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-Agent Coverage Path Planning Using Graph-Adapted K-Means in Road Network Digital Twin

1
Research Institute of Computer and Information Communication, Pusan National University, Busan 46241, Republic of Korea
2
School of Computer Science and Engineering, Pusan National University, Busan 46241, Republic of Korea
*
Author to whom correspondence should be addressed.
Electronics 2025, 14(19), 3921; https://doi.org/10.3390/electronics14193921
Submission received: 25 August 2025 / Revised: 25 September 2025 / Accepted: 25 September 2025 / Published: 1 October 2025

Abstract

In this paper, we research multi-robot coverage path planning (MCPP), which generates paths for agents to visit all target areas or points. This problem is common in various fields, such as agriculture, rescue, 3D scanning, and data collection. Algorithms to solve MCPP are generally categorized into online and offline methods. Online methods work in an unknown area, while offline methods generate a path for the known. Recently, offline MCPP has been researched through various approaches, such as graph clustering, DARP, genetic algorithms, and deep learning models. However, many previous algorithms can only be applied on grid-like environments. Therefore, this study introduces an offline MCPP algorithm that applies graph-adapted K-means and spanning tree coverage for robust operation in non-grid-structure maps such as road networks. To achieve this, we modify a cost function based on the travel distance by adjusting the referenced clustering algorithm. Moreover, we apply bipartite graph matching to reflect the initial positions of agents. We also introduce a cluster-level graph to alleviate local minima during clustering updates. We compare the proposed algorithm with existing methods in a grid environment to validate its stability, and evaluation on a road network digital twin validates its robustness across most environments.

1. Introduction

Digital twin and immersive environments are increasingly used to synchronize and mirror physical spaces in real time [1]. To maintain such synchronization, continuous sensing of the real-world environment is required [2]. One effective approach is to employ multiple agents that can collectively perform tasks across a wide area [3,4,5,6]. In order for these agents to efficiently and periodically cover the environment, multi-agent coverage path planning (MCPP) is essential.
MCPP involves generating paths for agents to visit all given points of interest or areas. The path is generated to efficiently traverse the points of interest while avoiding collisions with obstacles and other agents. This problem has broad applications across various domains, such as disaster areas [7], data collection [8,9], monitoring [10,11], agriculture [12], 3D scanning [13], and autonomous robots [14].
MCPP is primarily divided into online and offline methods based on the prior knowledge and situation of the target environment. Online MCPP generates paths for either unknown or dynamic areas. For instance, when agents investigate an unexplored area, path modification is necessary to avoid unexpected obstacles or explore unvisited regions [15]. Meanwhile, during the task, when the number of agents changes, path redesign becomes essential [16].
Due to these conditions, the algorithm must operate in real time. Therefore, online MCPP has been studied through reinforcement learning [17], artificial potential fields (APFs) [18], etc. Moreover, integrating these approaches with immersive VR/AR interfaces can support human-in-the-loop monitoring and the intuitive visualization of agent coverage and coordination.
Conversely, offline MCPP generates a complete coverage path for a known environment. Unlike online methods, it does not require real-time path modification. Moreover, due to prior knowledge of the environment, this method tends to generate a more optimized path than the former approach. It is mainly researched using divide and conquer methods such as graph partitioning [19], clustering [20], spanning tree coverage (STC) [21], and genetic algorithms [22].
Previous studies on graph-based methods typically employ grid graphs, where the target area is divided into uniform cells. This structure features uniform vertex degrees and a regular arrangement of edges. However, these characteristics are not always guaranteed. Consider a scenario where agents are traversing a non-grid road network digital twin, as in Figure 1. To address this, the environment should be represented as a non-grid graph with weighted edges. Therefore, we present a clustering-based offline MCPP method applicable to non-grid graphs. The contributions of this paper are as follows.
1.
We implement an objective function based on STC by modifying the weights of graph-adapted K-means [23].
2.
We apply an optimized merging method on the update step based on [23] to prevent the graph separation problem.
3.
We develop a cluster propagation method using a cluster-level graph to alleviate the local minima problem.

2. Related Works

With advances in computational power and communication technologies, multi-agent systems have been drawing significant attention. These systems enable the manipulation of agents in a digital twin to perform collaborative tasks in the real world. For instance, there are cases where multiple virtual agents are employed to efficiently generate paths for each agent in order to conduct large-scale 3D scanning. For example, ref. [3] extracts a skeleton of the central traversable paths in the map and divides it into multiple agent paths while considering physical constraints. Meanwhile, ref. [5] generates paths for unmanned aerial vehicles (UAVs) to scan occluded parts of a structure by scheduling exploration plans on a coarse-terrain digital twin via solving the traveling salesman problem. Similarly, there are also studies on swarm robot control within digital twin environments. For example, ref. [1] demonstrates a control approach where humans issue commands only to a small set of virtual agents in the VR. Each agent controls subordinate robot groups via its hierarchical digital twin, while path coordination ensures collision avoidance. These approaches commonly reinterpret continuous digital twin spaces into alternative forms and generate paths via vertex traversal problems.
MCPP can be classified into centralized and decentralized approaches based on how agents coordinate [20]. In a centralized approach, a single solver with global information generates paths and controls the agents. Conversely, in decentralized methods, each agent has local information and constructs its own paths through cooperation with other agents. Consequently, the centralized method typically requires substantial computational resources but yields more efficient paths. For these reasons, offline methods are typically employed, coupled with centralized approaches. Previous studies have implemented this method by partitioning the environment into discrete units and then generating paths.
Graph-based approaches rasterize the environment [10,24] into a grid map or an adjacency graph through cellular decomposition [25,26]. Then, independent subgraphs are formed via clustering, and a path is generated for each cluster. In this process, cellular decomposition employs path patterns such as back and forth or zigzag [26]. When using a grid map, graph traversal methods such as Euler circuits [27] and spanning tree coverage (STC) [21] are commonly adopted for coverage tasks.
Ref. [10] converts the entire area into a grid of square cells and then applies K-means using the centers of the Voronoi diagram as cluster centroids. Because the boundaries of this diagram are similar to the criteria for cluster partitioning, cells that include a boundary become ‘conflict points’ that are not clearly assigned to any cluster. In this study, these cells are allocated to neighboring clusters with lower weights, creating clusters in which the workload is equally distributed. Nikolaos [18] employed affinity propagation clustering. After forming initial clusters, the method calculates the similarity based on a four-grid distance to revise the clusters so that travel distances are evenly balanced.
STC [21] first constructs a minimum spanning tree (MST) of the given graph and then generates a path that circumnavigates it. Expanding on this, multi-robot STC (MSTC) [24,28], for multiple agents, creates a traversal path in the same manner and allocates segments of the tree between adjacent agents. However, if agents are positioned too closely, they all move in the same direction, causing the allocation to fail. Although the same study proposed a backtracking approach that revisits previously covered paths to mitigate this issue, optimal allocation was still not achieved.
To address this problem, multi-robot forest coverage (MFC) divides the entire tree into several balanced subgraphs [29]. Later, MSTC* [30] was introduced to incorporate physical constraints—such as terrain traversability and material load capacity—into a cost-based framework. Similar to MFC, this method partitions the entire tree equally and propagates coverage among neighboring agents. More recently, researchers have explored minimum-turn MSTC* (TMSTC*) [31], which uses linear block segments based on tree branches to reduce the number of agent rotations, as well as an online method that adapts to real-time weight changes [32].
Divide Area based on Robot’s Initial Positions (DARP) [33] performs area partitioning by optimizing a cost function defined using the distance matrix between robots’ initial positions and all vertices. This cost function is designed based on two criteria, namely cluster connectivity and workload balance, and is computed according to the distances from the initial positions to the vertices. The optimization proceeds by iteratively reassigning vertices adjacent to clusters, which allows DARP to achieve optimal solutions compared to other algorithms. However, it suffers from a high computational cost and may lead to imbalanced partitions in cases where bottlenecks occur around the initial positions.
Reinforcement learning can be deployed to solve this problem. For example, [34] addresses decentralized multi-robot coverage path planning by framing subarea allocation as a sequential decision-making task with reinforcement learning. Robots expand their territories using local observations enriched with structural and neighbor information. A neural network policy directs these expansions, while robots may pause expansion to prevent unnecessary overlaps. The method yields balanced subareas with minimal overlap and scales to larger maps and more agents, although training is difficult due to delayed rewards and can be unstable, occasionally producing imbalanced partitions.
Ref. [35] performs cooperative coverage path planning by first partitioning the environment with an improved K-means clustering algorithm that incorporates MST to achieve more balanced divisions. After partitioning, each robot is assigned to a subarea and applies deep reinforcement learning with a dueling network structure. An improved reward function guides robots away from redundant paths and toward unexplored regions, enabling more efficient coverage. This framework achieves higher coverage ratios and reduced path duplication relative to single-robot methods. However, these approaches remain sensitive to the reward design, which can lead to imbalanced paths.

3. Problem Statement

The target environment graph is defined as G = ( V , E , W ) , where V = { v 1 v n } is the set of vertices and E = { e 1 e m } is the set of edges. W = { w 1 w m } is the set of edge weights, and w x represents the weight assigned to edge e x . The number of agents is represented by k. The initial positions of each agent are defined as S = { s 1 s k | s V } . The path generation algorithm F is a function that outputs a set of paths by taking a graph and an initial position, as shown in Equation (1). P = { p 1 p k } represents the collision-free paths assigned to each agent. Meanwhile, p i is a set of path vertices with length l, p i = { v i 1 v i l } .
F ( G , S ) = P
Path group P must visit every vertex of the graph at least once. Furthermore, consider that each path must have the optimal traversal time. Therefore, we set the goal as the ideal path group P * in Equation (2), where T ( p i ) indicates the total traversal time of p i .
P * = argmin P max ( T ( P ) ) , ( V P p i )

4. Proposed Methods

4.1. Graph Adapted K-Means

Graph-adapted K-means [23], proposed by Sieranoja and Fränti, is a clustering method that considers the edge weights of the graph. This algorithm utilizes three types of costs with iterative clustering and merging. Specifically, we focus on the inverse internal weight (IIW) cost.
To explain the IIW, a graph G = ( V , E ) is defined. The weight of the edge between vertices i and j is defined as w i j . At this time, if a cluster X consists of n x vertices, the sum of the weights of the internal edges is defined as W x . The sum of the weights of the edges connected to the other cluster is expressed as E x . For example, calculating the values according to the above definition for the graph in which all edge weights are one, as shown in Figure 2, gives the metrics listed in Table 1.
IIW is the average of the reciprocals of the sums of W x in each cluster, as shown in Equation (3), where M is the sum of the weights of all edges in the graph. This cost is scaled to the range [ 1 , ] , with smaller values meaning better clustering. As a result, it aims to assign the same weight to all clusters. In the case of optimal clustering, all clusters have a ’mean weight of a perfectly balanced cluster’ ( M / k ).
IIW = M k 1 k i = 1 k 1 W i
As mentioned above [23], the method consists of two algorithms: the K-algorithm (clustering) and the M-algorithm (merging). The K-algorithm tracks the cost by delta calculation, as shown in (4), when assigning a vertex j in cluster X to a neighbor cluster Y . C X and C Y are the subgraphs of each cluster before the change, while C X and C Y are those after the change. Then, the vertex is assigned to the cluster with the optimal Δ f ( j , X , Y ) . This approach reduces the computational cost by avoiding the calculation of the average value of the cluster in each iteration.
Δ f ( j , X , Y ) = Δ f ( C X ) + Δ f ( C Y ) = f ( C X ) + f ( C Y ) f ( C X ) f ( C Y )
The M-algorithm operates by repeating these operations until the cost converges or the maximum number of iterations is reached.
1.
Select and merge two randomly selected adjacent clusters.
2.
Split one random cluster into two.
3.
Apply the K-algorithm.
However, the split–merge process can cause substantial shifts in cluster centroids, which increases the errors in path generation. Therefore, the proposed algorithm incorporates the K-algorithm and the merging method in the M-algorithm.

4.2. Distance-Based Cost Function

To enable the use of STC in non-grid environments, we design the cost function as follows. While the conventional spiral STC operates only on grid graphs [21], it is possible to construct a coverage path based on MST [30]. This enables the derivation of a traversal path in weighted non-grid graphs using algorithms such as depth-first search. In the ideal case, each agent would be assigned an equally partitioned subtree. However, when the initial positions of agents are distant from their assigned cluster, additional travel costs are incurred, which may result in an imbalance in workload distribution. To address this, we divide each agent’s route into an initial travel path and a coverage path and define the target cluster size of each agent in proportion to the length of these paths. Finally, the cost is computed as the error between the ratio of the desired cluster sizes and the ratio of the actual path lengths.
In this paper, we assume that the movement speed of all agents is the same, denoted as v. For this reason, the travel distance of path D ( p i ) is expressed as D ( p i ) = T ( p i )   v , and Equation (2) can be written as Equation (5). This implies that all paths in P * will have the same travel distance. To calculate the distance, we separate p i into the initial path p t i n i t and the traversal path p t t r a v e r s a l . As shown in Figure 3, p i i n i t denotes the path from an initial position to the cluster, whereas p i t r a v e r s a l refers to the Euler circuit within it. Thus, D ( p i ) equals the sum of the distances of the two parts, which matches Equation (6).
P * = argmin P max ( D ( P ) ) , ( V p i p i )
D ( p i ) = D ( p i i n i t ) + D ( p i t r a v e r s a l )
Considering Equation (6), to satisfy P * , the relative distances of the two parts between clusters must be taken into account, i.e., if D ( p i i n i t ) is relatively high, then D ( p i t r a v e r s a l ) should be low. Additionally, we utilize IIW for the cost function, which is ideal when all clusters have the same sum of internal weights ( M / k ) . According to this, we calculate the target ratio of the traversal distance R i as in Equation (7), where d * is the optimally distributed traversal distance. We define this as the traversal length of a global minimum spanning tree equally divided into k segments, as in Equation (8). Then, we calculate R i as
R i = z = 1 k D ( p z i n i t ) D ( p i i n i t ) + d * j = 1 k ( z = 1 k D ( p z i n i t ) D ( p j i n i t ) + d * ) , i = 1 k R i = 1
d * = 2 M S T ( G ) w i k
Finally, we define the cost function C ( P ) as the absolute sum of the errors between the actual ratio of D ( p i t r a v e r s a l ) and R i , as in Equation (9). This cost ranges within [ 0 , k ) , and a lower value indicates better clustering. Accordingly, the target path set P * , denoted as in Equation (5), can be expressed as in Equation (10).
C ( P ) = i = 1 k D ( p i t r a v e r s a l ) z = 1 k D ( p z t r a v e r s a l ) R i
P * = a r g m i n P C ( P ) , ( V p i p i )

4.3. Initialization Step

In the initialization step, the initial clusters and R i are computed based on the agent’s position. The pipeline is described in Algorithm 1. First, we apply region-growing base initialization [23]. The extension of cluster i starts at s i , targeting its neighboring vertices, and continues until the cluster contains at least | V | / k 0.8 [23] of vertices or when the internal weight reaches d * . However, it blocks other s i , making it impossible to expand. In this case, we change the blocked position to a new vertex that has the highest density as calculated via d e n s i t y ( v x ) = j n e i g h b o r ( v x ) w x j . Despite efforts to avoid blocking, some vertices may remain unassigned. Consequently, they are assigned to the adjacent cluster with the highest edge weight.
Algorithm 1 Initialization Step
1:
Input:  G , S
2:
Output: cluster, R , M B
3:
   cluster = initialize cluster
4:
   cluster = assign remaining vertices
5:
    B = construct bipartite graph
6:
    M B = perfect minimum matching on B
7:
    R = cal target ratio by M B
As we update s i , there is a D ( p i i n i t ) because the agent was not contained in the cluster. To obtain an optimized R i , we perform perfect minimum bipartite matching. Therefore, we define a complete bipartite graph as B = ( V a g e n t , V c l u s t e r , E , W ) , where | V a g e n t | = | V c l u s t e r | = k . An edge e i j in E denotes the path from s i to cluster j. The weight of this edge, which is an element of W , corresponds to the distance of its path. A visualization of B is provided in Figure 4. As a result, we compute the bipartite matching M B , followed by the calculation of R .

4.4. Update Step

In the update step, clusters are iteratively refined according to cost C ( P ) . This step consists of a K-step and M-step, as mentioned in Section 4.1. The algorithm follows the process outlined in Algorithm 2. The K-step applies to all vertices that are on the cluster’s boundary. If a decrease in cost is observed, the update is accepted. To incorporate delta calculation into this process, we reformulate the cost function as in Equation (11), where ’−’ indicates the cluster from which a vertex is removed and ’+’ denotes the one to which it is added. Moreover, Δ W x represents the change in cluster weight, while W x p r e v indicates the prior value.
C ( P ) = i = 1 k 2 W i p r e v + Δ W i z = 1 k 2 W z p r e v + Δ W + + Δ W R i
Algorithm 2 Update Step
1:
Input:  G , cluster, R , MAX_ITER, THRESHOLD
2:
Output: cluster
3:
for iter = 0 : MAX_ITER do
4:
   update by K-step
5:
   merge isolated component by M-step
6:
   if C(P) < THRESHOLD then
7:
     break
8:
   end if
9:
end for
However, in the K-step, a single cluster may fragment into multiple connected components. For example, as seen in Figure 5, if a vertex in the blue cluster is updated to the red cluster, the former inevitably splits into two disconnected components, making pathfinding more difficult. In the M-step, these fragments are merged into another cluster. We define a cluster’s connected components as ’isolated’, except for the largest one. These isolates are merged into adjacent clusters. If multiple neighboring ones exist, the one with the lowest W i after merging is selected. This process is applied to all isolated components. The two steps are repeated until the cost converges below a certain threshold (THRESHOLD) or the maximum number of iterations (MAX_ITER) is reached.

4.5. Path Generation

After clustering, traversal paths are generated using STC. Since clusters do not share vertices, collisions between agents over p t r a v e r s a l do not occur. However, p i n i t may overlap with other clusters, potentially causing collisions. To mitigate this, we adapt Windowed Hierarchical Cooperative A* (WHCA*) [36] for initial path generation. The modified algorithm finds the nearest vertex t i within an agent’s assigned cluster from s i . Then, we calculate the A* path between two vertices. After the agent reaches t i , its p t r a v e r s a l is incorporated into the window to ensure a collision-free path.

5. Cluster Propagation Using Cluster-Level Graph

5.1. Local Minima Problem

The K-step in Section 4.4 considers only neighboring clusters when updating a vertex. This is not an issue in a simple environment, i.e., with fewer agents and a simple graph structure. In contrast, the local minima problem may arise due to failed cluster propagation. For instance, consider two situations where G is a 3 × 3 4-grid graph with k = 2 (a) and k = 4 (b), as shown in Figure 6. At this point, assume that transferring a vertex from cluster 2 to cluster 1 is required to achieve the optimal R . Additionally, clusters 3 and 4 in (b) are defined as having the optimal weight. In case (a), the update will succeed since the two clusters are adjacent. On the other hand, in scenario (b), clusters 3 and 4 lie between them, preventing direct vertex transfer. However, as the K-step rejects updates that increase the cost, propagation may fail, potentially leading to the local minima problem. To address this issue, we designate the global source and destination clusters from the cluster-level graph in each update.

5.2. Update Step Using Cluster-Level Graph

We define a cluster-level graph as Q = { V Q , E Q , VW Q } , where vertices V Q = { v i Q } indicate cluster i. The edge e i j Q is formed when the two clusters i and j are neighbors. The vertex weight v w i Q is calculated as in (12), which represents the deviation of R for the cluster. A negative value indicates insufficient weight, a positive value indicates excess, and, lastly, an optimally balanced cluster has a value of zero. An example of Q is illustrated in Figure 7b.
Figure 7. The K-step process with cluster-level graph Q . Each color of the vertices represents a distinct cluster. (a) Target environment graph G ; (b) cluster-level graph Q , where the numbers on each vertex indicates the vertex weight v w i Q ; (c) the shortest path HS between the global source ( v max Q ) and destination ( v min Q ); (d) the K-step process follows the update rule derived from HS . Despite applying this process, isolated components may still emerge, requiring correction in the M-step.
Figure 7. The K-step process with cluster-level graph Q . Each color of the vertices represents a distinct cluster. (a) Target environment graph G ; (b) cluster-level graph Q , where the numbers on each vertex indicates the vertex weight v w i Q ; (c) the shortest path HS between the global source ( v max Q ) and destination ( v min Q ); (d) the K-step process follows the update rule derived from HS . Despite applying this process, isolated components may still emerge, requiring correction in the M-step.
Electronics 14 03921 g007
v w i Q = W i z = 1 k W z R i
We set the goal of cluster propagation as setting all v w i Q to zero. To this end, we extract two vertices v max Q , v min Q from Q with the maximum and minimum weights. Then, we designate v max Q as a global source cluster, where propagation starts, and v min Q as the destination. Subsequently, we compute the shortest path HS on Q between the source and the destination as depicted in (c). This path represents the sequence of cluster updates. In the given example, propagation follows the order of clusters 1 → 2 → 3 → 5. Additionally, to enforce each update, we transfer the vertex that yields the greatest cost reduction or the one that results in the smallest cost increase if no such vertex exists. The K-step process with HS is illustrated in (d). In conclusion, by leveraging the cluster-level graph, updates can account for non-adjacent clusters, alleviating the local minima problem.

6. Results

6.1. Experimental Setup

Since the proposed algorithm operates on grid graphs, we evaluated its performance in these environments to compare it with previous research. We constructed grid graphs using the multi-agent pathfinding (MAPF) benchmarking graphs provided by the Moving AI Lab [37]. This dataset offers unweighted grid graphs based on real city layouts, video game maps, random obstacles, and a maze. In this study, we selected game maps (den312d, ht_chantry) and random obstacle data (random-64-64-20). In addition, to check the performance on non-grid graphs, we used a road network digital twin graph where the edge weights corresponded to the distances between two vertices. We chose the networks of Pusan National University (pnu) and Jangjeon-dong in Busan, South Korea (jangjeon). The structure and information of the map used are shown in Figure 8.
We implemented two methods to generate the initial positions of agents. First, ’clutter’ refers to cases where agents are densely distributed at the start, which indicates high complexity. We randomly selected a vertex and then generated the agents’ initial positions by region growing. In contrast, ’arbitrary’ indicates that all agents are placed at random vertices. To avoid overlap with the clutter scenario, we reject cases where all agents’ initial positions form a single connected component. Finally, to account for randomness in initial positions, we executed the algorithm 10 times each for k and compared the path lengths with those of MFC [29] and Naive MSTC* (MSTC*-NB) [30]. We used an Intel i5-11600K CPU with 64 GB RAM on the Ubuntu 22.04 LTS for the test environment, and the algorithms were implemented in Python 3.9 using rustworkX [38]. To allow a 0.5% deviation from the optimally distributed distance, we set THRESHOLD = 0.005, and we empirically determined MAX_ITER = 2000.

6.2. Evaluation

The evaluation results on grid graphs are summarized in Table 2. In general, under arbitrary settings, our algorithm shows improved performance over the baselines across most graphs. An exception is observed in den312d and ht_chantry when k = 2 , where the result falls short of that of MSTC*-NB by approximately 4%. In clutter, the distance is overall higher than in arbitrary conditions. Moreover, as the complexity of the graph increases, specifically in random-64-64-20, the efficiency degrades by around 7%. We also report the standard deviations of each algorithm in the grid environment in Table 3. The proposed algorithm exhibits larger deviations than MSTC*-NB when the number of agents is small. Additionally, it shows higher values in cluttered environments. However, as k increases, the values converge to those of the baseline. Nevertheless, these results demonstrate that our method remains effective even in grid environments. The results for all tested values of k are reported in Figure A1 and Figure A2.
The results for non-grid graphs are presented in Table 4 and Table 5. As k increases, the generated paths become more distributed across all environments. Moreover, similarly to the grid case, clutter yields longer paths. However, as indicated in Figure A3 and Figure A4, as the number of agents increases, the path generation process becomes unstable. We will discuss the causes of these degradations and instabilities in the Conclusions.
The runtime comparison of the algorithms is given in Table 6 and Table 7, and we present the results for ht_chantry and jangjeon in Figure 9. In these cases, the proposed algorithm completed execution in under one minute. Compared to MFC, it operated more than 10 times faster, but it remained about eight times slower than MSTC*-NB. As the number of graph nodes increased, the execution time consistently grew. Increasing k had contrasting effects on the runtime, becoming longer in grid but shorter in non-grid environments. Moreover, the algorithm exhibited approximately 5% longer runtimes in cluttered conditions compared to arbitrary ones. This indicates that the execution time rises with increasing environmental complexity.
Figure 9. Comparison of runtime on grid graph (ht_chantry) and non-grid graph (jangjeon).
Figure 9. Comparison of runtime on grid graph (ht_chantry) and non-grid graph (jangjeon).
Electronics 14 03921 g009
Table 6. Comparison of maximum path lengths on non-grid graphs.
Table 6. Comparison of maximum path lengths on non-grid graphs.
TypeGraphMethodk
2510203040
arbitrarypnuProposed48242666224016251816
jangjeon38,42517,36410,468768060605682
clutterpnu54022791233720641794
jangjeon38,60618,76211,010786763356027
Table 7. Comparison of standard deviations of path lengths on non-grid graphs.
Table 7. Comparison of standard deviations of path lengths on non-grid graphs.
TypeGraphMethodk
2510203040
arbitrarypnuProposed316393484398412
jangjeon5759871256115810301140
clutterpnu657404528562538
jangjeon6111243120210459561035

6.3. Ablation Study

We conducted a comparative analysis of the proposed algorithm under both non-grid and grid environments by varying two primary parameters. For MAX_ITER, we employed values of 100, 250, 500, 1000, and 2000, while setting THRESHOLD = 0.005. The corresponding results are presented in Figure 10 and Table 8. With respect to the maximum path length, it became shorter with more iterations. However, over 1000 iterations, no significant improvement was observed. As shown in Figure 11 and Table 9, a similar pattern was found for the deviation, indicating that the algorithm operates more stably as the number of iterations increases.
Table 8. Comparison of maximum path lengths with respect to MAX_ITER. Bold indicates the minimum path length among methods for each k.
Table 8. Comparison of maximum path lengths with respect to MAX_ITER. Bold indicates the minimum path length among methods for each k.
TypeGraphMAX_ITERk
2510203040
arbitraryden312d100308915621045561429305
25029261329741430340292
50027111143646402343299
100026561099635394343299
200026561102631394343299
jangjeon10041,20623,80212,415786962565837
25040,50920,60111,130706061295769
50039,76318,03310,793701961245725
100038,04417,24010,630698361195725
200037,99017,23510,611698361195725
Table 9. Comparison of standard deviations of path lengths with respect to MAX_ITER. Bold indicates the minimum deviation among methods for each k.
Table 9. Comparison of standard deviations of path lengths with respect to MAX_ITER. Bold indicates the minimum deviation among methods for each k.
TypeGraphMAX_ITERk
2510203040
arbitraryden312d1006383903131228861
250476224135775849
50025410865494846
10001987653484746
20001987352484746
jangjeon100360662582386159111371129
250287435111494121010221031
500209916751216109310341031
100039110201174108410361031
200034010031199108610361031
Figure 10. Comparison of runtime with respect to MAX_ITER on den312d and jangjeon.
Figure 10. Comparison of runtime with respect to MAX_ITER on den312d and jangjeon.
Electronics 14 03921 g010
Figure 11. Comparison of mean and standard deviation of path length with respect to MAX_ITER on den312d and jangjeon.
Figure 11. Comparison of mean and standard deviation of path length with respect to MAX_ITER on den312d and jangjeon.
Electronics 14 03921 g011
To further examine the impact of THRESHOLD on performance, we tested values of 0.005 (0.5%), 0.01 (1%), 0.03 (3%), 0.05 (5%), and 0.1 (10%), while fixing MAX_ITER = 1000. As summarized in Table 10, the results show that higher threshold values led to approximately a 5% difference in the maximum path length when k was relatively small. However, this difference diminished as k increased.
The impact of the parameters on the runtime was evaluated, as presented in Figure 12. For MAX_ITER, the runtime increased as the parameter value increased. At low values of k, MAX_ITER = 2000 took approximately four times longer than MAX_ITER = 100. This effect became more pronounced with a larger k, where the runtime difference reached up to nine times. In contrast, for THRESHOLD, smaller values resulted in longer execution times.

7. Discussion

We present an offline MCPP algorithm for non-grid graphs. We employ a graph-adapted K-means method and modify its weights based on decomposing the path into initial and traversal paths. In this process, we explain the isolated component and local minima problems. To mitigate these issues, we introduce a merge step and cluster propagation via a cluster-level graph. Finally, paths are generated using STC and WHCA*. Through comparisons with prior work (MFC, MSTC*), we demonstrate that our algorithm works effectively on both grid and non-grid environments.
However, we observed degraded performance when the number of agents was small or the complexity was high. We attribute these outcomes to two primary limitations of the proposed algorithm. First, the optimally distributed traversal distance d * was designed for STC, and it is not directly applicable to weighted graphs. Determining the STC lengths of large clusters in weighted environments is challenging and often yields misleading results. As a result, when k is small, inaccuracies may arise, potentially yielding incorrect paths. We expect this problem to be mitigated by leveraging techniques for approximating the total weight of a minimum spanning tree [39]. Another limitation occurs in the merge step, where centroid shifts cause discrepancies between the estimated and actual initial path lengths. This effect is particularly pronounced in cluttered environments and accounts for the instability observed in the evaluation. We anticipate that the periodic recomputation of the initial path with bipartite graph matching will alleviate this issue.
The applicability of non-grid MCPP has significant implications for the design of interactive experiences in VR/AR environments. In particular, establishing a foundation that enables multiple agents to efficiently explore spaces and act cooperatively would enhance both the realism and the utility of immersive content. Furthermore, this study demonstrates the capability to explore paths within digital twins represented through diverse data structures. Non-grid graphs are well suited to represent environments where movement between adjacent cells is constrained. For example, when an agent travels by vehicle on a road, traffic regulations may impose directional restrictions on movement. These limitations are difficult to model with grid-based cells but can be represented easily via edges in non-grid graphs. Therefore, the proposed method can be applied to scenarios involving structured digital twins that can be used for urban, pedestrian-driven citizen science investigations or the vehicle-based monitoring of road networks. These contributions are expected to play a pivotal role in advancing large-scale multi-user virtual simulations, education and training systems, and intelligent collaborative virtual environments.

Author Contributions

Conceptualization, H.L. and M.L.; Methodology, H.L.; Software, H.L.; Supervision, M.L.; Writing—original draft, H.L.; Writing—review and editing, H.L. and M.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Institute of Information & Communications Technology Planning & Evaluation (IITP) under the Artificial Intelligence Convergence Innovation Human Resources Development (IITP-2025-RS-2023-00254177) grant funded by the Korea government (MSIT).

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Evaluation Results

Figure A1. Comparison of maximum path length (left) and standard deviation with mean (right) on grid graphs under arbitrary settings.
Figure A1. Comparison of maximum path length (left) and standard deviation with mean (right) on grid graphs under arbitrary settings.
Electronics 14 03921 g0a1
Figure A2. Comparison of maximum path length (left) and standard deviation with mean (right) on grid graphs under clutter settings.
Figure A2. Comparison of maximum path length (left) and standard deviation with mean (right) on grid graphs under clutter settings.
Electronics 14 03921 g0a2
Figure A3. Comparison of maximum path length (left) and standard deviation with mean (right) on non-grid graphs under arbitrary settings.
Figure A3. Comparison of maximum path length (left) and standard deviation with mean (right) on non-grid graphs under arbitrary settings.
Electronics 14 03921 g0a3
Figure A4. Comparison of maximum path length (left) and standard deviation with mean (right) on non-grid graphs under clutter settings.
Figure A4. Comparison of maximum path length (left) and standard deviation with mean (right) on non-grid graphs under clutter settings.
Electronics 14 03921 g0a4

References

  1. Nguyen, H.; Hussein, A.; Garratt, M.A.; Abbass, H.A. Swarm Metaverse for Multi-Level Autonomy Using Digital Twins. Sensors 2023, 23, 4892. [Google Scholar] [CrossRef]
  2. Sharma, A.; Kosasih, E.; Zhang, J.; Brintrup, A.; Calinescu, A. Digital Twins: State of the art theory and practice, challenges, and open research questions. J. Ind. Inf. Integr. 2022, 30, 100383. [Google Scholar] [CrossRef]
  3. Denk, M.; Bickel, S.; Steck, P.; Götz, S.; Völkl, H.; Wartzack, S. Generating Digital Twins for Path-Planning of Autonomous Robots and Drones Using Constrained Homotopic Shrinking for 2D and 3D Environment Modeling. Appl. Sci. 2022, 13, 105. [Google Scholar] [CrossRef]
  4. Ahmed, M.F.; Maragliano, M.; Frémont, V.; Recchiuto, C.T. Efficient Multi-robot Active SLAM. J. Intell. Robot. Syst. 2025, 111, 64. [Google Scholar] [CrossRef]
  5. Zhang, S.; Zhang, W.; Liu, C. Model-based Multi-UAV path planning for high-quality 3D reconstruction of buildings. Int. Arch. Photogramm. Remote. Sens. Spat. Inf. Sci. 2023, 48, 1923–1928. [Google Scholar] [CrossRef]
  6. Ivić, S.; Crnković, B.; Grbčić, L.; Matleković, L. Multi-UAV trajectory planning for 3D visual inspection of complex structures. Autom. Constr. 2023, 147, 104709. [Google Scholar] [CrossRef]
  7. Xiong, T.; Liu, F.; Liu, H.; Ge, J.; Li, H.; Ding, K.; Li, Q. Multi-Drone Optimal Mission Assignment and 3D Path Planning for Disaster Rescue. Drones 2023, 7, 394. [Google Scholar] [CrossRef]
  8. Luna, M.A.; Isaac, M.S.A.; Ragab, A.R.; Cervera, P.C.; Peña, P.F.; González, M.M. Fast Multi-UAV Path Planning for Optimal Area Coverage in Aerial Sensing Applications. Sensors 2022, 22, 2297. [Google Scholar] [CrossRef]
  9. Huang, X.; Sun, M.; Zhou, H.; Liu, S. A multi-robot coverage path planning algorithm for the environment with multiple land cover types. IEEE Access 2020, 8, 198101–198117. [Google Scholar] [CrossRef]
  10. Collins, L.; Ghassemi, P.; Esfahani, E.T.; Doermann, D.; Dantu, K.; Chowdhury, S. Scalable Coverage Path Planning of Multi-Robot Teams for Monitoring Non-Convex Areas. In Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi’an, China, 30 May–5 June 2021; Institute of Electrical and Electronics Engineers Inc.: New York, NY, USA, 2021; pp. 7393–7399. [Google Scholar] [CrossRef]
  11. Nigam, N.; Bieniawski, S.; Kroo, I.; Vian, J. Control of multiple UAVs for persistent surveillance: Algorithm and flight test results. IEEE Trans. Control. Syst. Technol. 2012, 20, 1236–1251. [Google Scholar] [CrossRef]
  12. Botteghi, N.; Kamilaris, A.; Sinai, L.; Sirmacek, B. Multi-Agent Path Planning of Robotic Swarms in Agricultural Fields. Copernic. Gmbh 2020, 5, 361–368. [Google Scholar] [CrossRef]
  13. Almadhoun, R.; Taha, T.; Seneviratne, L.; Zweiri, Y. A survey on multi-robot coverage path planning for model reconstruction and mapping. SN Appl. Sci. 2019, 1, 847. [Google Scholar] [CrossRef]
  14. Galceran, E.; Campos, R.; Palomeras, N.; Ribas, D.; Carreras, M.; Ridao, P. Coverage Path Planning with Real-time Replanning and Surface Reconstruction for Inspection of Three-dimensional Underwater Structures using Autonomous Underwater Vehicles. J. Field Robot. 2015, 32, 952–983. [Google Scholar] [CrossRef]
  15. Bouman, A.; Ott, J.; Kim, S.K.; Chen, K.; Kochenderfer, M.J.; Lopez, B.; Agha-Mohammadi, A.A.; Burdick, J. Adaptive Coverage Path Planning for Efficient Exploration of Unknown Environments. In Proceedings of the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 23–27 October 2022; Institute of Electrical and Electronics Engineers Inc.: New York, NY, USA, 2022; pp. 11916–11923. [Google Scholar] [CrossRef]
  16. Nair, V.G.; Dileep, M.V.; Guruprasad, K.R. Robust Online Multi-Robot Simultaneous Exploration and Coverage Path Planning. IEEE Access 2024, 12, 72990–73003. [Google Scholar] [CrossRef]
  17. Tang, J.; Gao, Y.; Lam, T.L. Learning to Coordinate for a Worker-Station Multi-Robot System in Planar Coverage Tasks. IEEE Robot. Autom. Lett. 2022, 7, 12315–12322. [Google Scholar] [CrossRef]
  18. Wang, Z.; Zhao, X.; Zhang, J.; Yang, N.; Wang, P.; Tang, J.; Zhang, J.; Shi, L. APF-CPP: An Artificial Potential Field Based Multi-robot Online Coverage Path Planning Approach. IEEE Robot. Autom. Lett. 2024, 9, 9199–9206. [Google Scholar] [CrossRef]
  19. Ann, S.; Kim, Y.; Ahn, J. Area allocation algorithm for multiple UAVs area coverage based on clustering and graph method. IFAC-PapersOnLine 2015, 28, 204–209. [Google Scholar] [CrossRef]
  20. Tang, Y.; Zhou, R.; Sun, G.; Di, B.; Xiong, R. A Novel Cooperative Path Planning for Multirobot Persistent Coverage in Complex Environments. IEEE Sens. J. 2020, 20, 4485–4495. [Google Scholar] [CrossRef]
  21. Gabriely, Y.; Rimon, E. Spanning-tree based coverage of continuous areas by a mobile robot. Ann. Math. Artif. Intell. 2001, 31, 1927–1933. [Google Scholar] [CrossRef]
  22. Jimenez, P.A.; Shirinzadeh, B.; Nicholson, A.; Alici, G. Optimal area covering using genetic algorithms. In Proceedings of the 2007 IEEE/ASME International Conference on Advanced Intelligent Mechatronics, Zurich, Switzerland, 4–7 September 2007; pp. 1–5. [Google Scholar] [CrossRef]
  23. Sieranoja, S.; Fränti, P. Adapting k-means for graph clustering. Knowl. Inf. Syst. 2022, 64, 115–142. [Google Scholar] [CrossRef]
  24. Hazon, N.; Kaminka, G. Redundancy, Efficiency and Robustness in Multi-Robot Coverage. In Proceedings of the 2005 IEEE International Conference on Robotics and Automation, Barcelona, Spain, 18–22 April 2005; pp. 735–741. [Google Scholar] [CrossRef]
  25. Fazli, P.; Davoodi, A.; Pasquier, P.; Mackworth, A.K. Complete and robust cooperative robot area coverage with limited range. In Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, Taipei, Taiwan, 18–22 October 2010; Volume 10, pp. 5577–5582. [Google Scholar] [CrossRef]
  26. Majeed, A.; Hwang, S.O. A multi-objective coverage path planning algorithm for uavs to cover spatially distributed regions in urban environments. Aerospace 2021, 8, 343. [Google Scholar] [CrossRef]
  27. Christofides, N. Worst-Case Analysis of a New Heuristic for the Travelling Salesman Problem. Oper. Res. Forum 2022, 3, 20. [Google Scholar] [CrossRef]
  28. Hazon, N.; Mieli, F.; Kaminka, G. Towards robust on-line multi-robot coverage. In Proceedings of the 2006 IEEE International Conference on Robotics and Automation (ICRA 2006), Orlando, FL, USA, 15–19 May 2006; pp. 1710–1715. [Google Scholar] [CrossRef]
  29. Zheng, X.; Jain, S.; Koenig, S.; Kempe, D. Multi-robot forest coverage. In Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, Edmonton, AB, Canada, 2–6 August 2005; pp. 3852–3857. [Google Scholar] [CrossRef]
  30. Tang, J.; Sun, C.; Zhang, X. MSTC*: Multi-robot Coverage Path Planning under Physical Constrain. In Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi’an, China, 30 May–5 June 2021; Volume 5, pp. 2518–2524. [Google Scholar] [CrossRef]
  31. Lu, J.; Zeng, B.; Tang, J.; Lam, T.L.; Wen, J. TMSTC*: A Path Planning Algorithm for Minimizing Turns in Multi-Robot Coverage. IEEE Robot. Autom. Lett. 2023, 8, 5275–5282. [Google Scholar] [CrossRef]
  32. Mo, W.; Lin, Z. MDSTC: A Dynamic Approach to Multi-Robot Coverage Path Planning. In Proceedings of the 2024 18th International Conference on Control, Automation, Robotics and Vision (ICARCV), Dubai, United Arab Emirates, 12–15 December 2024; Volume 12, pp. 1100–1105. [Google Scholar] [CrossRef]
  33. Kapoutsis, A.C.; Chatzichristofis, S.A.; Kosmatopoulos, E.B. DARP: Divide Areas Algorithm for Optimal Multi-Robot Coverage Path Planning. J. Intell. Robot. Syst. 2017, 86, 663–680. [Google Scholar] [CrossRef]
  34. Liu, Y.; Hu, J.; Dong, W. Decentralized Coverage Path Planning with Reinforcement Learning and Dual Guidance. arXiv 2022, arXiv:cs.RO/2210.07514. [Google Scholar] [CrossRef]
  35. Ni, J.; Gu, Y.; Tang, G.; Ke, C.; Gu, Y. Cooperative Coverage Path Planning for Multi-Mobile Robots Based on Improved K-Means Clustering and Deep Reinforcement Learning. Electronics 2024, 13, 944. [Google Scholar] [CrossRef]
  36. Silver, D. Cooperative Pathfinding. In Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, Virtual, 11–15 October 2021; Volume 1, pp. 117–122. [Google Scholar] [CrossRef]
  37. Stern, R.; Sturtevant, N.; Felner, A.; Koenig, S.; Ma, H.; Walker, T.; Li, J.; Atzmon, D.; Cohen, L.; Kumar, T.K.; et al. Multi-Agent Pathfinding: Definitions, Variants, and Benchmarks. Proc. Int. Symp. Comb. Search 2021, 10, 151–158. [Google Scholar] [CrossRef]
  38. Treinish, M.; Carvalho, I.; Tsilimigkounakis, G.; Sá, N. rustworkx: A High-Performance Graph Library for Python. J. Open Source Softw. 2022, 7, 3968. [Google Scholar] [CrossRef]
  39. Chazelle, B.; Rubinfeld, R.; Trevisan, L. Approximating the Minimum Spanning Tree Weight in Sublinear Time. SIAM J. Comput. 2005, 34, 1370–1379. [Google Scholar] [CrossRef]
Figure 1. Example environment represented as a graph. The grid graph (left) exhibits regular connections between adjacent vertices, whereas the non-grid graph (right) lacks such structural regularity.
Figure 1. Example environment represented as a graph. The grid graph (left) exhibits regular connections between adjacent vertices, whereas the non-grid graph (right) lacks such structural regularity.
Electronics 14 03921 g001
Figure 2. Example graph with two clusters. Each color of the vertices represents a distinct cluster.
Figure 2. Example graph with two clusters. Each color of the vertices represents a distinct cluster.
Electronics 14 03921 g002
Figure 3. The overall path is divided into the initial path and the traversal path. s A denotes the starting vertex of agent A. p A i n i t refers to the path from agent A’s initial point to the cluster A. p A t r a v e r s e represents the traversal path that visits all vertices within cluster A. Each color of the vertices represents a distinct cluster and star indicates agent’s initial position.
Figure 3. The overall path is divided into the initial path and the traversal path. s A denotes the starting vertex of agent A. p A i n i t refers to the path from agent A’s initial point to the cluster A. p A t r a v e r s e represents the traversal path that visits all vertices within cluster A. Each color of the vertices represents a distinct cluster and star indicates agent’s initial position.
Electronics 14 03921 g003
Figure 4. Grid graph G (left) and its corresponding bipartite graph representation B (right). In the grid graph, SA and SB denote the initial vertices of agents A and B, respectively, and are shown as stars on the vertices. Each color of the vertices represents a distinct cluster. In the bipartite graph, vertices A and B represent agents A and B, while X and Y denote clusters X and Y, respectively. The numbers on the edges indicate the path lengths from each agent’s initial vertex to each cluster.
Figure 4. Grid graph G (left) and its corresponding bipartite graph representation B (right). In the grid graph, SA and SB denote the initial vertices of agents A and B, respectively, and are shown as stars on the vertices. Each color of the vertices represents a distinct cluster. In the bipartite graph, vertices A and B represent agents A and B, while X and Y denote clusters X and Y, respectively. The numbers on the edges indicate the path lengths from each agent’s initial vertex to each cluster.
Electronics 14 03921 g004
Figure 5. Isolated component generated in the K-step (left middle) and the merged result from the M-step (right). Each color of the vertices represents a distinct cluster.
Figure 5. Isolated component generated in the K-step (left middle) and the merged result from the M-step (right). Each color of the vertices represents a distinct cluster.
Electronics 14 03921 g005
Figure 6. Issues during cluster updates. Each color of the vertices represents a distinct cluster. (a,b) shows graph with k = 2 and k = 4. Given the current updating rule, (a,b) can be updated since there are adjacent vertices between the clusters (represented by a red cross in c). However, in (c), another cluster lies between the two clusters, requiring updates to pass through them.
Figure 6. Issues during cluster updates. Each color of the vertices represents a distinct cluster. (a,b) shows graph with k = 2 and k = 4. Given the current updating rule, (a,b) can be updated since there are adjacent vertices between the clusters (represented by a red cross in c). However, in (c), another cluster lies between the two clusters, requiring updates to pass through them.
Electronics 14 03921 g006
Figure 8. Graphs used for evaluation, with the numbers of vertices and edges. The upper graph has a grid structure, while the lower graph has a non-grid structure.
Figure 8. Graphs used for evaluation, with the numbers of vertices and edges. The upper graph has a grid structure, while the lower graph has a non-grid structure.
Electronics 14 03921 g008
Figure 12. Comparison of runtime with respect to MAX_ITER and THRESHOLD on den312d and jangjeon.
Figure 12. Comparison of runtime with respect to MAX_ITER and THRESHOLD on den312d and jangjeon.
Electronics 14 03921 g012
Table 1. Variables of the example graph.
Table 1. Variables of the example graph.
Cluster n x W x E x
A482
B362
Table 2. Comparison of maximum path lengths on grid graphs. Bold indicates the minimum path length among methods for each k.
Table 2. Comparison of maximum path lengths on grid graphs. Bold indicates the minimum path length among methods for each k.
TypeGraphMethodk
2510203040
arbitraryden312dMFC30921282780501372278
MSTC*-NB25801114677442367331
Proposed26541110628387330275
ht_chantryMFC82853636225913941019818
MSTC*-NB7627323917821056820725
Proposed7744323016921013766664
random-64-64-20MFC36531700968571439340
MSTC*-NB33611415780486373331
Proposed32871423770465354315
clutterden312dMFC31601424758457430324
MSTC*-NB25181119626427352279
Proposed27011153625417351282
ht_chantryMFC8376363419971352940712
MSTC*-NB7563321317591033804686
Proposed7767329518391050778682
random-64-64-20MFC36561808993573408355
MSTC*-NB33311424793468376315
Proposed33411489803461385305
Table 3. Comparison of standard deviations of path lengths on grid graphs. Bold indicates the minimum deviation among methods for each k.
Table 3. Comparison of standard deviations of path lengths on grid graphs. Bold indicates the minimum deviation among methods for each k.
TypeGraphMethodk
2510203040
arbitraryden312dMFC599191153837342
MSTC*-NB253348504950
Proposed2038453434739
ht_chantryMFC760313373266169135
MSTC*-NB176073848288
Proposed26814674947877
random-64-64-20MFC352315210977355
MSTC*-NB152837343436
Proposed138259464442
clutterden312dMFC67629614077136105
MSTC*-NB13936465037
Proposed2329247495339
ht_chantryMFC850290237215159103
MSTC*-NB16070748184
Proposed264214161968090
random-64-64-20MFC3534081671085264
MSTC*-NB13338323835
Proposed4310262545139
Table 4. Comparison of runtime (s) on grid graphs. Bold indicates the minimum runtime among methods for each k.
Table 4. Comparison of runtime (s) on grid graphs. Bold indicates the minimum runtime among methods for each k.
TypeGraphMethodk
2510203040
arbitraryden312dMFC13.113.812.614.317.818.8
MSTC*-NB0.140.150.210.290.410.49
Proposed5.23.43.12.23.35
ht_chantryMFC142.8197.8168.6115.6171.8155.9
MSTC*-NB0.460.841.031.081.841.77
Proposed6.918.317.514.117.315.5
random-64-64-20MFC28.542.445.45456.158.7
MSTC*-NB0.20.340.450.650.810.91
Proposed1.82.62.13.44.46
clutterden312dMFC27.223.328.241.25657.2
MSTC*-NB0.240.260.320.610.760.71
Proposed6.23.63.93.84.54.6
ht_chantryMFC223228.7239169.1241.4217.9
MSTC*-NB0.740.961.161.051.871.82
Proposed8.923.517.517.316.819.1
random-64-64-20MFC46.742.747.160.168.757.7
MSTC*-NB0.310.340.440.630.930.73
Proposed2.13.43.23.84.85.8
Table 5. Comparison of runtime (s) on non-grid graphs.
Table 5. Comparison of runtime (s) on non-grid graphs.
TypeGraphMethodk
2510203040
arbitrarypnuProposed0.050.070.10.150.34
jangjeon20.520.518.616.513.916.4
clutterpnu0.050.080.10.150.36
jangjeon27.518.619.61818.318.6
Table 10. Comparison of maximum path lengths with respect to THRESHOLD. Bold indicates the minimum path length among methods for each k.
Table 10. Comparison of maximum path lengths with respect to THRESHOLD. Bold indicates the minimum path length among methods for each k.
TypeGraphTHRESHOLDk
2510203040
arbitraryden312d0.00526681092633393338293
0.0126681094634393338294
0.0326681110639391337296
0.0526681119642390337295
0.126891102650383329294
jangjeon0.00538,21917,52610,465763065635893
0.0138,28017,53510,472764565635893
0.0338,44917,52710,490757065615893
0.0538,48717,52210,524753965615893
0.138,74818,11710,521762365285925
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lee, H.; Lee, M. Multi-Agent Coverage Path Planning Using Graph-Adapted K-Means in Road Network Digital Twin. Electronics 2025, 14, 3921. https://doi.org/10.3390/electronics14193921

AMA Style

Lee H, Lee M. Multi-Agent Coverage Path Planning Using Graph-Adapted K-Means in Road Network Digital Twin. Electronics. 2025; 14(19):3921. https://doi.org/10.3390/electronics14193921

Chicago/Turabian Style

Lee, Haeseong, and Myungho Lee. 2025. "Multi-Agent Coverage Path Planning Using Graph-Adapted K-Means in Road Network Digital Twin" Electronics 14, no. 19: 3921. https://doi.org/10.3390/electronics14193921

APA Style

Lee, H., & Lee, M. (2025). Multi-Agent Coverage Path Planning Using Graph-Adapted K-Means in Road Network Digital Twin. Electronics, 14(19), 3921. https://doi.org/10.3390/electronics14193921

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop