Clustering Allocation for Large-Scale Multi-Agent Systems: A Coalitional Game Method

Sun, Lu; Qian, Puhua

doi:10.3390/electronics15020304

Open AccessArticle

Clustering Allocation for Large-Scale Multi-Agent Systems: A Coalitional Game Method

by

Lu Sun

¹ and

Puhua Qian

^2,*

¹

School of Electrical and Automation Engineering, Nanjing Normal University, Nanjing 210023, China

²

School of Energy and Power Engineering, Nanjing University of Science and Technology, Nanjing 210094, China

^*

Author to whom correspondence should be addressed.

Electronics 2026, 15(2), 304; https://doi.org/10.3390/electronics15020304

Submission received: 14 December 2025 / Revised: 7 January 2026 / Accepted: 9 January 2026 / Published: 9 January 2026

(This article belongs to the Special Issue Advanced Control Strategies and Applications of Multi-Agent Systems)

Download

Browse Figures

Versions Notes

Abstract

Motivated by the inefficiencies where multi-agent systems fail to reconcile individual agent self-interest with global optimality and accommodate dynamic tasks as its population increases, this paper investigates a clustering allocation problem for large-scale multi-agent systems. A novel coalitional game clustering allocation scheme that can simultaneously reconcile individual agent self-interest and adapt to dynamic tasks is proposed. In this scheme, a coalition switching strategy is newly constructed and incorporated to select optimal switching operation and obtain stable coalition partition. Simulation and comparative results are provided to verify the effectiveness of the developed allocation scheme. It is shown both theoretically and simulation experimentally that in the case of large-scale multi-agent systems, the generated clustering allocation strategy is Nash stable using the proposed scheme.

Keywords:

large-scale; multi-agent systems; clustering allocation; coalitional game; coalition switching

1. Introduction

For the scenarios with a large number of agents in modern military operations (e.g., UAV swarms for cooperative reconnaissance), intelligent transportation systems (e.g., connected autonomous vehicle fleets), and large-scale industrial IoT (e.g., distributed sensor networks for smart manufacturing), information transmission congestion and the increasing difficulty of management have become urgent problems to be solved. Clustering, as an innovative hierarchical management strategy, has attracted extensive attention from academic and practical circles in recent years [1,2,3]. With its outstanding capabilities, this strategy has shown remarkable results in solving network management problems and enhancing network stability. By implementing clustering management, large-scale multi-agent systems (LS-MAS) not only significantly improve operational efficiency, enhance survivability, optimize resource utilization, but also promote the rapid development of related technologies. Specifically, the clustering strategy enables more efficient allocation and execution of operational tasks, enhancing the flexibility and response speed of the overall system; at the same time, by dispersing risks and improving concealment, it significantly improves the survivability of agents; furthermore, the optimal allocation and utilization of resources are realized, reducing operational costs and maintenance difficulties.

In MAS, clustering allocation has become a fundamental and widely adopted strategy to tackle scalability, collaborative task execution, and resource optimization challenges, with substantial research advancements spanning diverse application scenarios. Early studies lay the groundwork for adaptive and utility-driven clustering in mobile networks, such as the adaptive clustering mechanism for wireless ad hoc networks [4] and the on-demand weighted clustering algorithm [5], which are further enhanced by load-balancing clustering frameworks [6] and connectivity-centric k-hop clustering [7]. For multi-robot MAS, clustering has been integrated with auction-based task allocation [8], heuristic grouping [9], and game-theoretic optimization [10] to improve collaboration efficiency. In UAV-based MAS, clustering allocation has been deployed to minimize insecure communication ranges [11] and energy provision [12] and enhance cluster stability via mobility-aware strategies [13]. Meanwhile, in cloud/fog computing and data center MAS, modified k-means clustering [14], hierarchical clustering [15], and workflow-aware clustering [16] have enabled balanced resource scheduling [17,18]. Extensions to heterogeneous networks [19], ultra-dense networks [20], and non-ideal non-orthogonal multiple access (NOMA) systems [21] have leveraged clustering for user grouping and resource allocation, with reinforcement learning-aided clustering further optimizing performance [22]. Beyond traditional MAS, clustering allocation has also been applied to asset allocation [23], facility location [24], and regional resource network optimization [25], demonstrating its versatility across domains. By analyzing these references, balancing conflicting objectives remains a core dilemma as most algorithms prioritize specific criteria at the expense of others, such as communication efficiency [11], real-time responsiveness [22], and energy consumption [26]. Furthermore, scalability limitations persist in large-scale deployments. Existing methods often struggle to adapt dynamic re-clustering to rapid topology changes caused by agent mobility [13] or suffer from exponential computational overhead when the number of agents grows exponentially [18].

Despite the broad applicability of clustering allocation, it is faced with two critical challenges when applied to LS-MAS. Firstly, existing clustering allocation strategies lack effective mechanisms to reconcile individual agent self-interest with global system optimality, leading to unstable and inefficient cluster formations. Most conventional approaches prioritize global objectives such as load balancing [6] or connectivity [7] without adequately accounting for agents’ autonomous utility maximization. This conflict is exacerbated in LS-MAS scenarios where agents may deviate from cluster assignments to pursue local gains, undermining long-term cluster stability. Secondly, current methods fail to achieve decentralized, dynamic clustering and resource allocation with scalable fairness in LS-MAS. Centralized clustering approaches suffer from excessive communication overhead and single-point vulnerabilities when scaling to massive agent deployments, while distributed heuristic methods lack systematic frameworks to ensure equitable resource distribution across clusters. This limitation is evident in cloud computing [14], asset allocation [23], and facility location [24] scenarios, where static clustering structures cannot adapt to real-time changes in agent states or task demands.

Motivated by these observations, this paper uses coalition game to solve the clustering allocation problem of large-scale multi-agent systems. Coalition game theory inherently models strategic agent interactions, enables self-organized coalition formation, and balances individual and collective utilities, such that the optimality–stability trade-off and decentralized fairness gap in LS-MAS clustering allocation is achieved. In this paper, a large-scale multi-agent clustering model is firstly established. Then, the target allocation result of agents is obtained according to the cost of agents countering targets. Based on the allocation result, a coalition game solution algorithm is designed, and repeated simulation cases are designed to verify the applicability of the algorithm. The main contributions of this paper are summarized as below.

A coalitional game clustering allocation scheme is developed for large-scale multi-agent systems. This scheme can effectively reconcile individual agent self-interest with global optimality and adapt to dynamic tasks, as it models each cluster as a cooperative alliance where agents voluntarily form stable partitions to maximize collective benefits while satisfying their local preferences. By integrating dynamic switching strategies, the scheme enables agents to adjust their coalition memberships in response to changes in task requirements, ensuring sustained performance in dynamic operational scenarios.
A new coalitional switching strategy is designed and incorporated into the proposed allocation scheme to generate stable coalition partition for the clustering allocation process. Related Nash stable analysis is also provided. The game-theoretic analysis framework inherently provides a rigorous basis for analyzing the stability of cluster structures, guaranteeing that the final partitions are Nash-stable and free from unilateral deviations that could undermine system-wide efficiency.

The remaining part of this article is arranged as follows. Section 2 formulates the problem considered in this paper. Section 3 presents a clustering allocation algorithm based on coalitional game theory. Section 4 shows simulation results for the developed allocation scheme. Section 5 concludes this work.

2. Problem Formulation

When the number of agents in the cluster is large, to facilitate communication network management, all agents in the cluster will form

N_{r}

clusters, where

C_{r}

represents the set of agents contained in the r-th cluster. On the one hand, since the efficiency of inter-cluster communication is lower than that of intra-cluster communication, too many clusters will lead to high communication delay. On the other hand, agents performing the same task need to exchange task information more frequently. Therefore, how to balance the communication efficiency of the multi-agent system and the characteristics of task execution to obtain the optimal clustering scheme, thereby effectively managing the communication network, is currently a difficult problem.

Since this work focuses solely on communication efficiency and the impact of agent dynamics on the core results is negligible, no specific constraints are imposed on the system model, which is thus omitted from the problem formulation in this paper. However, it does not mean that the proposed method lacks the fidelity required for real-world deployment where dynamic physical limitations are critical. As can be seen later, dynamic physical limitations affect initializing coalition partition. A different initialization strategy of grouping agents may generate distinct coalition partition results, e.g., for consensus-based auction strategy taking flight distance and energy consumption as the trajectory cost, dynamic physical limitations can influence the calculation of energy consumption, and then it will affect the generation of initial coalition partition.

2.1. Constraint Condition

Due to limited communication resources, the number of agents in a single cluster should not be excessive to ensure effective intra-cluster communication. Therefore, the number of agents in a cluster is constrained as shown below:

n_{r} \leq n_{\max}, \forall r \in [1, N_{r}]

(1)

where

n_{r}

represents the number of agents in the r-th cluster, and

n_{\max}

represents the maximum number of agents allowed in a single cluster.

2.2. Performance Metric

Considering that multiple agents performing the same task need to frequently exchange task information, a feasible clustering method is to group agents performing the same task into one cluster. However, when the number of agents performing the same task is small, it may lead to an excessive number of clusters, reducing communication efficiency of agents. To control the number of clusters and improve communication performance, we comprehensively consider the communication efficiency and task attributes of agents and establish the performance metric shown below:

F (C_{r}) = f_{1} (C_{r}) ε + f_{2} (C_{r}) (1 - ε), \forall r \in [1, N_{r}]

(2)

where

ε

is a trade-off factor,

f_{1} (C_{r})

is related to the communication efficiency of agents and

f_{2} (C_{r})

is related to the task attributes of agents.

For

f_{1} (C_{r})

, since inter-cluster communication is less efficient than intra-cluster communication, excessive clusters cause high latency, and dynamic agent movement requires frequent cluster structure updates. To reduce the number of clusters and improve network stability,

f_{1} (C_{r})

is defined as

f_{1} (C_{r}) = \frac{{(n_{r})}^{2}}{{(n_{\max})}^{2}} min_{(i, k) \in ε_{r}} P_{i k}

(3)

where

ε_{r}

is the set of direct communication links between any two agents in cluster

C_{r}

, and

P_{i k}

is the predicted link survival probability between agent i and its adjacent agent k in cluster

C_{r}

. For

P_{i k}

, we calculate the following:

P_{i k} = \{\begin{matrix} 0, & L_{i k} > 2 L \\ 1 - \frac{L_{i k} - L}{L}, & L < L_{i k} < 2 L \\ 1, & L_{i k} < L \end{matrix}

(4)

where

L_{i k}

is the distance between agent i and its adjacent agent k in cluster

C_{r}

, and L is the rated distance for unobstructed communication between agents.

For

f_{2} (C_{r})

, considering that agents performing the same task need frequent task information exchange, they should be grouped into the same cluster as much as possible.

f_{2} (C_{r})

is defined as

f_{2} (C_{r}) = \frac{1}{N_{t}} \sum_{j \in J_{r}} \frac{{({\tilde{n}}_{r}^{j})}^{2}}{{({\tilde{n}}_{j})}^{2}}

(5)

where

{\tilde{n}}_{j}

represents the total number of agents performing the j-th task,

{\tilde{n}}_{r}^{j}

represents the number of agents performing the j-th task in cluster

C_{r}

, and

J_{r}

represents the task number executed by agents in cluster

C_{r}

.

2.3. Optimization Model

The optimization model for agent clustering problem can be written as follows:

\begin{matrix} max_{C_{1}, C_{2}, \dots, C_{N_{r}}} & \sum_{r = 1}^{N_{r}} F (C_{r}) \\ s . t . & (1) \end{matrix}

(6)

The goal of the above optimization problem is to find a suitable clustering structure that maximizes the overall network performance while satisfying the constraints on the number of agents, considering agent communication efficiency and task attributes.

3. Clustering Allocation Scheme Design

Coalition game refers to the process where decision-makers form stable alliances with other decision-makers through alliance and cooperation. A coalition is formed when all rational decision-makers are willing to cooperate and hope to achieve better results by establishing a cooperative organization. Therefore, the main problem to be solved by coalition game is how to form an appropriate cooperative organization to achieve expected outcomes. From the above definition, the problem solved by coalition game is consistent with the agent clustering problem in Section 2. In agent clustering, the concepts of “coalition” and “cluster” are equivalent—each cluster corresponds to a coalition. In other words, agent swarms will eventually form multiple non-overlapping coalitions, a structure called a coalition partition in coalition game. In this section, a new coalition game clustering allocation scheme for large-scale multi-agent systems is designed.

3.1. Overall Scheme Design

Since the problem to be solved by coalition game is highly similar to the clustering problem of multi-agent systems, the main idea to address the discussed problem is to construct an appropriate cooperation mechanism to ensure the achievement of expected positive results. This means that in the final state, the multi-agent system will form multiple non-overlapping coalitions, i.e., coalition partition, in a coalition game.

In the above algorithm idea, agents assigned to perform the same task will automatically gather to form initial clusters, completing the initialization of clustering. Subsequently, each agent in the cluster will periodically execute three key steps in turn: (1) the generation of a switching set; (2) the establishment of a switching operation; (3) the selection of the optimal switching operation. This process will continue until a stable coalition partition is finally formed.

The specific steps are given below.

(1) Initialize coalition partition, where agents performing the same task form a cluster. Specifically, agents assigned to execute the same task are grouped into an initial cluster, as they need frequent interactions to share task-related information and coordinate operational actions. This initial grouping not only simplifies the subsequent optimization process by reducing unnecessary computational overhead but also ensures that the basic collaborative needs of agents are met at the early stage. Such a task-based initialization strategy aligns with the core requirements of multi-agent system operations and provides a reasonable starting point for further alliance adjustment and optimization.

(2) For any agent i, assuming it performs the j-th task and belongs to cluster

C_{r}

, periodically execute the three steps of “switching set generation–switching operation establishment–optimal switching operation selection” as follows.

Switching set generation. First, initialize the agent switching set $F = {i, k}$ , and $(i, k) \in ε_{r}$ ; then, establish different switching sets according to whether the cluster $C_{r}$ meets the constraint conditions before and after switching: when the cluster $C_{r}$ where agent i is located does not meet the constraint conditions, the agent switching set remains unchanged; when clusters $C_{r}$ and $C_{r} ∖ {i}$ meet the constraint conditions, the agent switching set becomes: $F = F \cup {i}$ .
Switching operation establishment. Agents can only switch between adjacent clusters (including empty clusters). Only when the benefit brought by the switching operation is greater than zero, such a switching will be considered effective, and a series of actually executable switching operations will be generated accordingly.
Optimal switching operation selection. According to the switching benefit, find the optimal agent switching set $P^{*}$ and cluster $C_{r^{*}}$ . If all agents in $P^{*}$ are effective, then the agent set $P^{*}$ leaves cluster $C_{r}$ and joins cluster $C_{r^{*}}$ , and the coalition partition is updated according to $C = (C_{r} ∖ P^{*}) \cup (C_{r^{*}} \cup P^{*})$ .

3.2. Algorithm Design

Specific flow of the proposed clustering allocation algorithm is shown in Figure 1. Agents performing the same task initially form a cluster. For any agent i performing task m and belonging to cluster

C_{r}

, it initiates with three core initialization steps: coalition partition initialization, setting the switching set

\tilde{P} = {i, k}

where

k \in O_{m}

with

O_{m}

the set of agents who performs task m, and initializing the switching operation

U = ⌀

. It then enters an iterative loop focused on constraint verification and coalition adjustment. Firstly, it checks if the new coalition

C_{r} ∖ i

meets preset constraints. If

C_{r} ∖ i

fails the check, the algorithm further verifies whether the merged coalition

C_{r^{*}} \cup \tilde{P}

satisfies constraints where

r^{*}

is an iteration flag for any agent in cluster

C_{r}

. When this condition holds, it judges the switch gain of

σ_{r, r^{*}} (\tilde{P})

which is denoted as

ρ (σ_{r, r^{*}} (\tilde{P}))

and is used to evaluate the quality of a switch operation. Mathematically, for two coalitions labeled by l and k, the switch gain

ρ (σ_{k, l} (P))

is defined as

ρ (σ_{k, l} (P)) = F (C_{l^{'}}) + F (C_{k^{'}}) - F (C_{l}) - F (C_{k})

(7)

where

C_{k}

and

C_{l}

are coalitions before the switch operation, and

C_{k^{'}}

and

C_{l^{'}}

are new coalitions formed after the switch, expressed as

C_{k^{'}} = C_{k} ∖ P

and

C_{l^{'}} = C_{l} \cup P

. Now, proceed to elaborate on the algorithm. If

ρ (σ_{r, r^{*}} (\tilde{P}))

is positive, U is updated to

U = U \cup σ_{r, r^{*}} (\tilde{P})

. When

r^{*}

is larger than the agent amount

N_{r}

of cluster

C_{r}

, it follows by reassigning

P^{*}

to the element maximizing

σ_{r, r^{*}} (\tilde{P})

where

P^{*}

is a temporary switching set symbol. Meanwhile, it conducts the coalition switching operation

C_{r} \mapsto C_{r} ∖ P^{*}, C_{r^{*}} \mapsto C_{r^{*}} \cup P^{*}

before incrementing the index i. Above the coalition switching operation, symbol ↦ has a meaning of replacement.

The iteration continues until

i > N_{u}

where

N_{u}

represents the total number of agents. In this point, the algorithm checks if the current coalition partition has reached a Nash equilibrium. If Nash equilibrium is achieved, the algorithm terminates; if not, the iterative process of constraint checking, coalition adjustment, and switching set/operation updates restarts, repeating the logical sequence until the equilibrium condition is satisfied to obtain a stable coalition structure.

Remark 1.

The superiority of the proposed method over established heuristic clustering techniques like k-means or auction-based protocols mentioned in the literature are described as below. For the k-means technique, it can only guarantee clustering without considering allocation. For auction-based protocols, only allocation is taken into account without fully considering clustering. The proposed coalitional game based clustering allocation method can not only ensure clustering but also generate an optimal allocation result, which represents a major innovation of this work.

3.3. Stability Analysis

This subsection presents a theorem that indicates the stability for the designed clustering allocation scheme as summarized in the following Theorem 1.

Theorem 1.

Under the assumption that the algorithm completes the iteration without exceeding the maximum iteration number N, using the coalition switching operation

C = (C_{r} ∖ P^{*}) \cup (C_{r^{*}} \cup P^{*})

, the proposed clustering allocation algorithm in Figure 1 ensures that after a period of time, the cluster structure tends to be stable and the formed coalition partition Π is Nash stable.

Proof.

The above theorem can be proved by contradiction. If the final coalition partition

Π

is not Nash stable, there exists a switching operation

ρ (σ_{r, r^{*}} ({i})) > 0

that causes agent i in cluster

C_{r}

to trigger the switching operation, leave the current cluster, and join cluster

C_{r^{*}}

, which contradicts the stability of the cluster structure. Since the assumption that the algorithm completes the iteration without exceeding the maximum iteration number N, it is obvious that the algorithm cannot be trapped in a local optimum. This is mainly supported by the fact that the switch gain

ρ (\cdot)

is a monotonic function which can be observed from the definition as below. For a participant, if the switch gain

ρ (σ_{k, l} (P))

of switch operation

σ_{k, l} (P)

is greater than the switch gain

ρ (σ_{k^{'}, l^{'}} (P^{'}))

of another switch operation

σ_{k^{'}, l^{'}} (P^{'})

, the participant prefers

σ_{k, l} (P)

over

σ_{k^{'}, l^{'}} (P^{'})

. Thus, the participant’s preference relation “≻” for switch operations can be expressed as

σ_{k, l} (P) ≻ σ_{k^{'}, l^{'}} (P^{'}) \Leftrightarrow ρ (σ_{k, l} (P)) > ρ (σ_{k^{'}, l^{'}} (P^{'}))

where “⇔” denotes an equivalence relation. For a monotonically switching gain, the local optimal solution and the global optimal solution are equivalent. Therefore, through the proposed algorithm, the finally formed coalition partition

Π

is Nash stable. □

Theorem 1 reveals that the proposed coalition game clustering allocation algorithm can obtain stable coalition partition

Π

after a limited number of iterations. It can also be inferred that if the algorithm completes at the maximum iteration number, the algorithm becomes trapped at a local optimum during iterative coalition switches and the final result is not a Nash equilibrium. Furthermore, from the flowchart of clustering allocation algorithm in Figure 1, we can see that compared with the enumeration method, the proposed coalitional game method can reduce the complexity of clustering allocation for multi-agent systems from

O (N_{u}^{2})

to

O (N_{u} l o g (N_{u}))

.

4. Simulation Results

This section uses the coalition game algorithm to solve the clustering allocation problem of large-scale multi-agent systems. Two groups of repeated experiments are set up by changing the initial positions of agents and targets. Detailed analysis is presented to illustrate the effectiveness of the proposed scheme in solving clustering allocation problem of large-scale multi-agent systems. Meanwhile, simulation results compared with the clustering allocation problem using the enumeration method are also presented.

Consider the scenario of the communication link planning problem when multiple unmanned aerial vehicles (UAVs) collaborate to scout multiple targets. The number of targets and UAVs are denoted as

N_{t}

and

N_{u}

, respectively. The target can be a fixed or moving target, but they are all considered as particles, and for moving targets, their velocity is taken into account. For UAV agents, a universal longitudinal motion model is considered, where the height, speed, inclination angle, and yaw angle are included. Model description is omitted here for simplicity and one can refer to [27] for details. Committed to establishing the optimal communication link, UAV agents need to perform clustering, where in each cluster, they are divided into two roles: cluster heads and cluster members. Cluster heads are responsible for managing intra-cluster members, finding routes to the ground station, and completing resource allocation. The clustering process of UAV swarms is similar to coalition game, which refers to the process where decision-makers form stable coalitions with other decision-makers through alliance and cooperation. Therefore, a coalition game algorithm is adopted to solve such clustering problem.

Next, how the optimization problem in Section 2 is instantiated in the experimental scenario of communication link planning problem is explained. For the constraint condition, each cluster allows up to 15 UAV agents, i.e.,

n_{max} = 15

. For the objective function, since the rated distance for unobstructed communication between agents is

20 m

, i.e.,

L = 20

, then

L_{i k}

is obtained by substituting the clustering results for each round, thus

f_{1}

and

f_{2}

can be calculated. For the interaction topology, valid communication distance is set to be

100 m

, within which two UAV agents are treated as connected. Due to the changes in the positions between agents, their communication topology is also changing, requiring real-time computation.

In simulations, the consensus-based auction algorithm is used to solve the allocation problem of agents scouting targets, after then, the algorithm is initialized. The inputs of the consensus-based auction algorithm, i.e., the trajectory cost, mainly consist of flight distance and energy consumption. Therefore, the initialization strategy of grouping agents solely by task assignment fully considers spatial proximity and it will not induce high initial communication latency that requires excessive iterations to resolve. Since the trajectory cost calculation is not the core part of this work, it is omitted here. Subsequently, the coalition game algorithm is used to solve the clustering results. The simulation environment is DESKTOP-8DOTIFE, with a processor of AMD Ryzen 7 4800H with Radeon Graphics 2.90 GHz, installed RAM of 16.0 GB, and a 64-bit operating system. Collectively, this hardware–software configuration balances computational power, memory capacity, and system compatibility, creating a reliable and efficient platform to accurately assess the algorithm’s performance in terms of coalition benefit, allocation time, and stability, while also ensuring the reproducibility and generalizability of the validation results. The two groups of simulations are presented and compared to verify the adaptability of the algorithm as follows.

Case 1: The number of our agents

N_{u}

is 100, and there are 10 targets; the initial position information of agents and targets is randomly set as shown in Figure 2, where the height of all agents is 300 m, the speed is 100 m/s, the initial inclination angle and yaw angle are 0°, and the height of all targets is 0 m.

Case 2: The number of our agents

N_{u}

is 100, and there are 10 targets; the initial position information of agents and targets is randomly set as shown in Figure 3, where the speed of all agents is 100 m/s, the initial inclination angle and yaw angle are 0°, and the height of all targets is 0 m.

The allocation of agents countering targets is carried out. The allocation results of the two cases are shown in Table 1 and Table 2.

Based on the above initial allocation results, the benefits corresponding to different agent clusters are calculated according to the model. According to the calculated benefits, the coalition partition results are obtained, as shown in Table 3 and Table 4. Meanwhile, the results using the proposed algorithm are compared with clustering using the enumeration algorithm. Comparative coalition partition results are given in Table 5.

Based on the results in Table 3, Table 4 and Table 5, a comprehensive analysis of the coalition partition outcomes and algorithm performance is presented here. In both Cases 1 and 2, the proposed algorithm yields well-structured coalition partitions with nine non-overlapping clusters formed respectively to cover all 100 agents, and the overall coalition benefits reach 2.80 and 2.38, respectively. This demonstrates that the proposed algorithm effectively balances communication efficiency and task collaboration required in the UAV agent swarm. Meanwhile, it is shown that although the difference of geographic locations is significant, Nash equilibrium is achieved in both cases. More importantly, comparative results in Table 5 highlight the superior performance of the proposed algorithm. While achieving the same optimal coalition benefit as the enumeration algorithm (2.8 for Case 1 and 2.38 for Case 2), it drastically reduces the allocation time from 51.382 s and 46.423 s with the enumeration algorithm to only 5.296 s and 4.969 s, respectively. This indicates that the proposed algorithm not only ensures the optimality of clustering results but also significantly improves computational efficiency, making it more suitable for practical large-scale multi-agent system applications. It is worth noting that the proposed algorithm is fast enough for the real-time control of UAVs where topology changes occur in milliseconds. In fact, our algorithm for clustering allocation of large-scale multi-agent systems is deployed on the cloud or high-performance edge devices, where the update of instructions is conducted in seconds. The cluster allocation results need to be updated only when multiple new tasks emerge, which is not affected by the real-time control of UAVs. This process will not be in milliseconds, but at least in seconds or longer.

To show the impact of the number of agents on the performance of the proposed clustering scheme, the simulation is expanded to include larger agent populations, ranging from 100 to 1200, where massive swarm scenarios involving thousands of nodes are also encompassed. Related results are shown in Figure 4, in which the scalability of the proposed method for industrial IoT or massive swarm scenarios involving thousands of nodes are sufficiently demonstrated. From Figure 4, it can also be seen that the round of iterations required for convergence increases slowly as the cluster size expands. This is because as the cluster size increases, the number of switch operations UAV agents can perform increases, requiring more iterations to achieve network stability.

5. Conclusions

To address the clustering allocation problem of large-scale multi-agent systems, this paper proposes a solution based on coalition game. A large-scale multi-agent clustering model is established, and a coalition game solution algorithm based on coalition switching is designed. By changing the positions of agents and targets, repeated simulation cases are designed for solution. Finally, a suitable clustering allocation result for large-scale multi-agent systems is obtained. The algorithm calculation time is fast, the required goals are successfully achieved, and the applicability of the algorithm is verified. Future work includes extending the proposed method to adapt in more complicated communication model which considers realistic channel impairments like multi-path fading or interference that would destabilize clusters in actual swarms.

Author Contributions

Conceptualization, L.S.; methodology, L.S.; software, L.S.; validation, L.S.; formal analysis, L.S.; investigation, L.S.; resources, L.S. and P.Q.; data curation, L.S.; writing—original draft preparation, L.S.; writing—review and editing, P.Q.; visualization, L.S.; supervision, L.S.; project administration, P.Q.; funding acquisition, L.S. and P.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by Research Start-up Foundation of Nanjing Normal University, Natural Science Foundation of Higher Education Institutions in Jiangsu Province under Grant 23KJB470021, Natural Science Foundation of China under Grant 62303225.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to privacy.

Acknowledgments

The authors would like to thank anonymous reviewers for their valuable comments to enhance the quality of this article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Wang, K.; Liang, W.; Yuan, Y.; Liu, Y.; Ma, Z.; Ding, Z. User clustering and power allocation for hybrid non-orthogonal multiple access systems. IEEE Trans. Veh. Technol. 2019, 68, 12052–12065. [Google Scholar] [CrossRef]
Yemini, M.; Goldsmith, A.J. Virtual cell clustering with optimal resource allocation to maximize capacity. IEEE Trans. Wirel. Commun. 2021, 20, 5099–5114. [Google Scholar] [CrossRef]
Lin, Y.; Zhang, R.; Li, C.; Yang, L.; Hanzo, L. Graph-based joint user-centric overlapped clustering and resource allocation in ultradense networks. IEEE Trans. Veh. Technol. 2017, 67, 4440–4453. [Google Scholar] [CrossRef]
Lin, C.R.; Gerla, M. Adaptive clustering for mobile wireless networks. IEEE J. Sel. Areas Commun. 2002, 15, 1265–1275. [Google Scholar] [CrossRef]
Chatterjee, M.; Sas, S.; Turgut, D. An ondemand weighted clustering algorithm (WCA) for ad hoc networks. In Proceedings of the IEEE Global Telecommunications Conference (GLOBECOM’00), San Francisco, CA, USA, 27 November–1 December 2000; Volume 3, pp. 1697–1701. [Google Scholar]
Amis, A.D.; Prakash, R. Load-balancing clusters in wireless ad hoc networks. In Proceedings of the 3rd IEEE Symposium on Application-Specific Systems and Software Engineering Technology, Richardson, TX, USA, 24–25 March 2000; pp. 25–32. [Google Scholar]
Nocetti, F.G.; Gonzalez, J.S.; Stojmenovic, I. Connectivity based k-hop clustering in wireless networks. Telecommun. Syst. 2003, 22, 205–220. [Google Scholar] [CrossRef]
Zhang, K.; Collins, E.G., Jr.; Shi, D. Centralized and distributed task allocation in multi-robot teams via a stochastic clustering auction. ACM Trans. Auton. Adapt. Syst. (TAAS) 2012, 7, 21. [Google Scholar] [CrossRef]
Janati, F.; Abdollahi, F.; Ghidary, S.S.; Jannatifar, M.; Baltes, J.; Sadeghnejad, S. Multi-robot task allocation using clustering method. In Proceedings of the Robot Intelligence Technology and Applications 4: Results from the 4th International Conference on Robot Intelligence Technology and Applications; Springer: Berlin/Heidelberg, Germany, 2016; pp. 233–247. [Google Scholar]
Martin, J.G.; Muros, F.J.; Maestre, J.M.; Camacho, E.F. Multi-robot task allocation clustering based on game theory. Robot. Auton. Syst. 2023, 161, 104314. [Google Scholar] [CrossRef]
Wu, J.; Zou, L.; Zhao, L.; Al-Dubai, A.; Mackenzie, L.; Min, G. A multi-UAV clustering strategy for reducing insecure communication range. Comput. Netw. 2019, 158, 132–142. [Google Scholar] [CrossRef]
Liu, X.; Chen, A.; Zheng, K.; Chi, K.; Yang, B.; Taleb, T. Distributed Computation Offloading for Energy Provision Minimization in WP-MEC Networks With Multiple HAPs. IEEE Trans. Mob. Comput. 2025, 24, 2673–2689. [Google Scholar] [CrossRef]
Bhandari, S.; Wang, X.; Lee, R. Mobility and location-aware stable clustering scheme for UAV networks. IEEE Access 2020, 8, 106364–106372. [Google Scholar] [CrossRef]
Sharma, V.; Bala, M. An improved task allocation strategy in cloud using modified k-means clustering technique. Egypt. Inform. J. 2020, 21, 201–208. [Google Scholar] [CrossRef]
Raffinot, T. Hierarchical clustering-based asset allocation. J. Portf. Manag. 2018, 44, 89–99. [Google Scholar] [CrossRef]
Shang, Q. A dynamic resource allocation algorithm in cloud computing based on workflow and resource clustering. J. Internet Technol. 2021, 22, 403–411. [Google Scholar]
Shooshtarian, L.; Lan, D.; Taherkordi, A. A clustering-based approach to efficient resource allocation in fog computing. In Proceedings of the International Symposium on Pervasive Systems, Algorithms and Networks; Springer: Berlin/Heidelberg, Germany, 2019; pp. 207–224. [Google Scholar]
Wang, K.; Zhou, Q.; Guo, S.; Luo, J. Cluster frameworks for efficient scheduling and resource allocation in data center networks: A survey. IEEE Commun. Surv. Tutor. 2018, 20, 3560–3580. [Google Scholar] [CrossRef]
Abdelnasser, A.; Hossain, E.; Kim, D.I. Clustering and resource allocation for dense femtocells in a two-tier cellular OFDMA network. IEEE Trans. Wirel. Commun. 2014, 13, 1628–1641. [Google Scholar] [CrossRef]
Tian, X.; Jia, W. Improved clustering and resource allocation for ultra-dense networks. China Commun. 2020, 17, 220–231. [Google Scholar] [CrossRef]
Celik, A.; Tsai, M.C.; Radaydeh, R.M.; Al-Qahtani, F.S.; Alouini, M.S. Distributed user clustering and resource allocation for imperfect NOMA in heterogeneous networks. IEEE Trans. Commun. 2019, 67, 7211–7227. [Google Scholar] [CrossRef]
Zhou, S.; Cheng, Y.; Lei, X.; Peng, Q.; Wang, J.; Li, S. Resource allocation in UAV-assisted networks: A clustering-aided reinforcement learning approach. IEEE Trans. Veh. Technol. 2022, 71, 12088–12103. [Google Scholar] [CrossRef]
Duarte, F.G.; De Castro, L.N. A framework to perform asset allocation based on partitional clustering. IEEE Access 2020, 8, 110775–110788. [Google Scholar] [CrossRef]
Pooja; Kumar, R.; Viriyasitavat, W.; Yadav, K.; Dhiman, G. Analysis of clustering algorithms for facility location allocation problems. In Proceedings of the Third International Conference on Advances in Computer Engineering and Communication Systems: ICACECS 2022; Springer: Berlin/Heidelberg, Germany, 2023; pp. 597–605. [Google Scholar]
Jain, S.; Chin, H.H.; Bandyopadhyay, S.; Klemeš, J.J. Clustering and optimising regional segregated resource allocation networks. J. Environ. Manag. 2022, 322, 116030. [Google Scholar] [CrossRef]
Sharma, N.; Kumar, K. Energy efficient clustering and resource allocation strategy for ultra-dense networks: A machine learning framework. IEEE Trans. Netw. Serv. Manag. 2022, 20, 1884–1897. [Google Scholar] [CrossRef]
Lu, Y.; Gu, J.; Sun, R. Adaptive Performance Guaranteed Formation Control for Unmanned Aerial Vehicles under Anti-collision Constraints. In Proceedings of the 2022 17th International Conference on Control, Automation, Robotics and Vision (ICARCV), Singapore, 11–13 December 2022; pp. 814–819. [Google Scholar]

Figure 1. Flowchart of clustering allocation algorithm.

Figure 2. Initial position Z-X coordinates of agents and targets (Case 1).

Figure 3. Initial position coordinates of agents and targets (Case 2).

Figure 4. The relationship between the round of iterations and the agent population.

Table 1. Final allocation results of Case 1.

Agent Allocation Result	Target	Total Cost	Allocation Time
4, 11, 12, 23, 35, 38, 43, 47, 72, 90	1
2, 8, 25, 31, 52, 66, 76, 81, 82, 97	2
1, 7, 30, 39, 56, 59, 60, 79, 80, 94	3
6, 13, 20, 32, 50, 54, 55, 71, 74, 100	4
9, 28, 36, 40, 41, 61, 65, 70, 83, 98	5
22, 33, 34, 37, 53, 57, 58, 75, 77, 78	6	183.797	6.765 s
18, 24, 29, 44, 68, 73, 84, 85, 88, 93	7
3, 16, 19, 42, 46, 86, 87, 89, 91, 96	8
5, 10, 15, 26, 45, 49, 62, 67, 92, 99	9
14, 17, 21, 27, 48, 51, 63, 64, 69, 95	10

Table 2. Final allocation results of Case 2.

Agent Allocation Result	Target	Total Cost	Allocation Time
1, 11, 13, 32, 39, 70, 71, 73, 74, 96	1
7, 16, 18, 26, 27, 38, 47, 48, 67, 95	2
20, 24, 34, 35, 37, 52, 57, 58, 64, 75	3
5, 14, 60, 61, 78, 79, 84, 85, 98, 100	4
29, 36, 45, 53, 54, 76, 90, 92, 94, 97	5
19, 33, 44, 49, 56, 77, 81, 82, 89, 93	6	183.446	4.173 s
8, 15, 21, 31, 41, 50, 51, 59, 68, 87	7
2, 4, 10, 12, 28, 55, 69, 72, 83, 99	8
3, 6, 9, 22, 30, 42, 46, 86, 88, 91	9
17, 23, 25, 40, 43, 62, 63, 65, 66, 80	10

Table 3. Coalition partition results of Case 1.

Coalition	Agents	Coalition	Allocation
Partition		Benefit	Time
1	6, 11, 15, 17, 31, 32, 38, 43, 47, 53, 55, 58, 77, 90, 92
2	29, 33, 34, 39, 40, 48, 52, 64, 76, 81, 82, 89, 94, 96, 97
3	7, 16, 30, 59, 73, 79, 80, 84, 85, 91, 93
4	86
5	2, 3, 9, 13, 18, 19, 24, 28, 36, 41, 46, 61, 65, 70, 83	2.80	5.296 s
6	35
7	14, 21, 22, 25, 27, 37, 42, 44, 54, 56, 60, 66, 67, 68, 98
8	1, 4, 8, 10, 50, 63, 71, 78, 87, 88, 95, 99
9	5, 12, 20, 23, 26, 45, 49, 51, 57, 62, 69, 72, 74, 75, 100

Table 4. Coalition partition results of Case 2.

Coalition	Agents	Coalition	Allocation
Partition		Benefit	Time
1	9, 18, 19, 38, 39, 42, 49, 52, 53, 55, 56, 62, 63, 96, 99
2	16, 26, 27, 47, 66, 69, 74, 75, 76, 77, 78, 83, 84, 91, 95
3	20, 32, 34, 35, 41, 57, 58, 60, 64, 68, 71, 73, 81, 89, 98
4	70
5	8, 11, 13, 21, 22, 24, 25, 29, 30, 36, 45, 54, 92, 93, 94	2.38	4.969 s
6	37, 50, 61, 79, 85
7	2, 3, 4, 5, 6, 7, 10, 12, 15, 28, 65, 72, 80, 87, 100
8	1, 17, 23, 33, 40, 43, 44, 46, 48, 51, 59, 82, 86, 88, 97
9	14, 31, 67, 90

Table 5. Comparative coalition partition results.

Case Number	Method	Coalition Benefit	Allocation Time
1	The Enumeration Algorithm	2.8	51.382 s
1	The Proposed Algorithm	2.8	5.296 s
2	The Enumeration Algorithm	2.38	46.423 s
2	The Proposed Algorithm	2.38	4.969 s

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sun, L.; Qian, P. Clustering Allocation for Large-Scale Multi-Agent Systems: A Coalitional Game Method. Electronics 2026, 15, 304. https://doi.org/10.3390/electronics15020304

AMA Style

Sun L, Qian P. Clustering Allocation for Large-Scale Multi-Agent Systems: A Coalitional Game Method. Electronics. 2026; 15(2):304. https://doi.org/10.3390/electronics15020304

Chicago/Turabian Style

Sun, Lu, and Puhua Qian. 2026. "Clustering Allocation for Large-Scale Multi-Agent Systems: A Coalitional Game Method" Electronics 15, no. 2: 304. https://doi.org/10.3390/electronics15020304

APA Style

Sun, L., & Qian, P. (2026). Clustering Allocation for Large-Scale Multi-Agent Systems: A Coalitional Game Method. Electronics, 15(2), 304. https://doi.org/10.3390/electronics15020304

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Clustering Allocation for Large-Scale Multi-Agent Systems: A Coalitional Game Method

Abstract

1. Introduction

2. Problem Formulation

2.1. Constraint Condition

2.2. Performance Metric

2.3. Optimization Model

3. Clustering Allocation Scheme Design

3.1. Overall Scheme Design

3.2. Algorithm Design

3.3. Stability Analysis

4. Simulation Results

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI