Abstract
5G technology represents a transformative shift in mobile communications, delivering improved ultra-low latency, data throughput, and the capacity to support huge device connectivity, surpassing the capabilities of LTE systems. As global telecommunication operators shift toward widespread 5G implementation, ensuring optimal network performance and intelligent resource management has become increasingly obvious. To address these challenges, this study explored the role of advanced clustering methods in optimizing cellular networks under heterogeneous and dynamic conditions. A systematic literature review (SLR) was conducted by analyzing 40 peer-reviewed and non-peer-reviewed studies selected from an initial collection of 500 papers retrieved from the Semantic Scholar Open Research Corpus. This review examines a diversity of clustering approaches, including spectral clustering with Bayesian non-parametric models and K-means, density-based clustering such as DBSCAN, and deep representation-based methods like Differential Evolution Memetic Clustering (DEMC) and Domain Adaptive Neighborhood Clustering via Entropy Optimization (DANCE). Key performance outcomes reported across studies include anomaly detection accuracy of up to 98.8%, delivery rate improvements of up to 89.4%, and handover prediction accuracy improvements of approximately 43%, particularly when clustering techniques are combined with machine learning models. In addition to summarizing their effectiveness, this review highlights methodological trends in clustering parameters, mechanisms, experimental setups, and quality metrics. The findings suggest that advanced clustering models play a crucial role in intelligent spectrum sensing, adaptive mobility management, and efficient resource allocation, thereby contributing meaningfully to the development of intelligent 5G/6G mobile network infrastructures.
1. Introduction
1.1. Background
The convergence of artificial intelligence (AI) and data-centric technologies has become pivotal for next-generation communication systems, enabling intelligent and adaptive network optimization. Modern mobile networks must dynamically interpret complex traffic patterns, respond to user mobility, and adapt to evolving environmental conditions to ensure high performance and reliability, particularly in heterogeneous and densely deployed infrastructures. To meet these demands and enhance users’ quality of experience (QoE), advanced clustering techniques have emerged as essential tools for mobile network optimization.
In this study, clustering refers explicitly to unsupervised machine learning techniques used for analysing and managing mobile network data. It excludes the physical grouping of network nodes, such as base stations or data centres, and instead focuses on algorithmic approaches such as spectral clustering (e.g., Bayesian non-parametric models), density-based clustering (e.g., DBSCAN), and deep representation-based methods (e.g., DEMC and DANCE). These techniques enable feature discovery, anomaly detection, and dynamic quality-of-service (QoS) management within intelligent mobile infrastructures.
The ongoing global deployment of fifth generation (5G) networks and the anticipated evolution toward sixth generation (6G) networks introduce new dimensions of complexity in mobile communications. 5G supports enhanced mobile broadband (eMBB), massive machine-type communications (mMTC), and ultra-reliable low-latency communications (URLLC), collectively enabling services and automation at unprecedented scales. By contrast, 6G is envisioned to push beyond these capabilities, targeting terabit-per-second peak data rates, sub-millisecond end-to-end latencies, and native integration of sensing and communication. The 6G vision emphasizes the use of the upper mid-band spectrum (7–20 GHz) for enhanced capacity, open radio access networks (O-RANs) for architectural flexibility, and AI-native designs for self-optimizing network behaviour [1,2].
While integrated sensing and communications (ISAC) and dual-functional radar-communication (DFRC) are related concepts, they are not synonymous. ISAC represents a broader design paradigm in which communication and sensing functionalities are jointly integrated within a unified wireless infrastructure. In ISAC, spectrum, waveforms, and hardware are shared to support both data transmission and environmental perception, making it a key enabler of 6G’s intelligent, perceptive networking capabilities.
In contrast, DFRC refers to a specific system-level realization of this concept, typically at the physical layer, where a transceiver performs both radar sensing and data communication simultaneously using shared or co-designed waveforms. DFRC research focuses on waveform optimization, precoding, and beamforming to achieve dual functionality with minimal performance trade-offs. Thus, DFRC can be viewed as a subset or implementation approach within the broader ISAC framework.
Recent work on hybrid radar fusion demonstrates how monostatic DFRC base stations can fuse uplink and downlink sensing measurements to improve range and angle estimation accuracy [3]. Further, waveform and precoder design for DFRC systems has been extensively studied [4], while secure full-duplex ISAC architectures have recently been explored to enhance confidentiality and interference mitigation [5]. Together, these developments underscore the importance of integrating ISAC (as a system-level framework) and DFRC (as a physical-layer realization) into future discussions on spectrum-aware clustering and sensing-aware resource allocation in next-generation networks [6].
Moreover, the International Telecommunication Union (ITU) has identified six key pillars defining the 6G framework: (i) massive communications, (ii) ubiquitous connectivity, (iii) hyper-reliable and low-latency communication, (iv) AI and communication, (v) immersive communication, and (vi) integrated sensing and communication [7]. These pillars guide the evolution of intelligent, flexible, and perceptive communication networks that align with future digital transformation objectives.
As these networks grow more complex, optimization objectives extend beyond traditional load balancing and signal strength enhancement. Current research increasingly focuses on dynamic resource allocation, fault detection, traffic classification, anomaly detection, intelligent handovers, and service orchestration. To address these challenges, clustering algorithms such as DBSCAN, K-means, Bayesian non-parametric models, and deep representation-based methods like DEMC and DANCE have gained widespread adoption. Table 1 summarizes the main clustering algorithms, outlining their fundamental principles, objectives, and mathematical formulations used in mobile network optimization.
Table 1.
Summary of the main clustering algorithms.
Coupled with advances in machine learning, clustering enables autonomous network adaptation with minimal human intervention, thereby improving operational efficiency and resilience under dynamic conditions.
The economic implications of these advances are significant. For example, the World Economic Forum estimates that 5G alone could contribute more than USD 13 trillion to the global economy by 2030 and USD 13.2 trillion by 2035 [8]. The mobile network analytics market, valued at USD 6.53 billion in 2024, is projected to reach USD 18.97 billion due to automation and optimization priorities [9]. Empirical studies have shown that clustering integrated with machine learning can achieve up to 98.8% accuracy in anomaly detection [10], 89.4% improvement in data delivery [11], and a 43% enhancement in handover prediction [12], demonstrating the practical value of clustering-based methods in operational networks. It is important to note that these results were obtained from existing peer-reviewed literature and did not originate from the original experimental work in this review. They demonstrated the effectiveness of advanced clustering methods when integrated with machine learning models.
Moreover, it should be noted that a distinction must be made between narrative reviews and systematic literature reviews (SLRs). Narrative reviews provide broad, qualitative insights but often lack transparency and reproducibility, leading to selective reporting. In contrast, SLRs follow a rigorous and reproducible approach for identifying, selecting, and analyzing relevant studies. This study followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) methodology [13,14] to ensure objectivity and reliability through well-defined search strategies, selection criteria, and data extraction processes.
1.2. Research Gaps
Despite substantial contributions to cellular network optimization, most existing literature reviews either adopt a narrow technical focus or follow a narrative approach that lack methodological rigor. Specifically, there is no unified framework that systematically compares clustering techniques across different network generations, traffic types, or deployment scenarios. Algorithm-specific parameters, performance metrics, and real-world evaluation environments have rarely been reported in an integrated manner, limiting the ability of researchers to benchmark approaches, evaluate cross-context applicability, and design generalizable clustering solutions. Several existing surveys, such as those on clustering in wireless sensor networks and anomaly detection in 5G, have provided foundational insights. However, they often focus on narrow domains or fail to comprehensively explore deep-representation-based techniques. Unlike prior studies, this review offers a unified analysis across heterogeneous 5G/6G environments, integrates deep learning and federated learning clustering models, and assesses their utility in adaptive QoS management, filling a key research gap.
Furthermore, evaluating clustering algorithms across both 4G and 5G contexts offers unique insights into their scalability and adaptability. With 5G introducing architectural innovations such as network slicing, edge orchestration, and AI-native management, and 6G emphasizing integrated sensing, pervasive intelligence, and immersive services, cross-generational and forward-looking analyses are essential for identifying algorithmic strengths and designing robust solutions aligned with 6G vision.
1.3. Objectives
This study conducts a systematic review of advanced clustering techniques, including deep representation-based, spectrum-based, and density-based approaches, for feature discovery, anomaly detection, and adaptive quality of service (QoS) estimation in heterogeneous and dynamic mobile network environments.
The major contributions of this study are as follows:
- (i.)
- Classification of clustering methods and their applications in mobile networking;
- (ii.)
- Review of algorithmic parameters and performance metrics;
- (iii.)
- Exploration of integration with deep learning, edge computing, and federated learning;
- (iv.)
- Identification of open challenges and research direction; and
- (v.)
- Consolidated reference for practitioners and researchers exploring clustering for adaptive QoS in 5G/6G mobile networks
2. Research Methodology
For clarity, the clustering techniques covered in this review are grouped into three primary categories: (i) spectrum-based clustering, including Bayesian and K-means methods; (ii) density-based clustering such as DBSCAN; and (iii) deep representation-based clustering including DEMC and DANCE. This study adopts the SPICE framework (Setting, Perspective, Intervention, Comparison, Evaluation) to structure the research question and guide the methodology.
2.1. Research Question and Framework
The main research questions guiding this systematic review are: “How can advanced clustering techniques, including spectral clustering, density-based, and deep representation-based clustering, be applied to enhance feature discovery, anomaly detection, and adaptive quality-of-service (QoS) prediction in heterogeneous and dynamic mobile network environments?”
To ensure a structured, comprehensive, and contextually relevant study of the challenges in intelligent mobile network management, the SPICE framework was employed to formulate this research question. SPICE is widely used to structure focused and application-oriented research questions suitable for practical deployment.
In this study, the setting was defined as a mixed and dynamic cellular network environment, capturing the non-static and heterogeneous nature of modern mobile networks. The Perspective focused on researchers and system designers, ensuring that the question reflected stakeholder priorities and real-world concerns. The interventions included spectral clustering, density-based methods, and deep representation-based clustering, highlighting the core analytical strategies under investigation. Although a direct comparison was not incorporated, given the emphasis on innovation rather than benchmarking, the framework remained effective in supporting methodological advancement. Finally, the Evaluation emphasized measurable outcomes such as feature discovery, anomaly detection, and adaptive QoS prediction, aligning the research objectives with performance-driven and practically relevant goals. These elements of the SPICE framework are summarized in Table 2, which outlines the Setting, Perspective, Intervention, Comparison, and Evaluation components used to guide this systematic review.
Table 2.
SPICE Protocol Framework.
The main research question developed for this study is supported by three sub-research questions designed to provide a structured pathway for detailed analysis.
First, it seeks to identify the main categories of clustering techniques and examine how each category aligns with specific mobile network use cases. This includes evaluating the suitability of spectrum-based, density-based, and deep representation-based clustering methods for key tasks such as anomaly detection, feature discovery, and adaptive QoS management.
Second, the review explores how algorithmic parameters and evaluation metrics influence clustering performance. By analyzing the impact of parameter choices (e.g., cluster size and, distance metrics) and performance indicators (e.g., accuracy, latency, and throughput), this study aims to highlight the critical design factors that affect the reliability and efficiency of clustering outcomes.
Third, the review investigates how clustering approaches are integrated with advanced technologies, including deep learning, and machine learning. The objective is to understand how these integrations enable the intelligent, real-time, and scalable optimization of mobile networks in increasingly complex and heterogeneous environments.
Collectively, these sub-questions support the main research aim and help to uncover emerging trends and evolving practices in the field. The sub-research questions were as follows:
- What are the major categories of clustering techniques and how do they align with mobile network use cases?
- How do algorithmic parameters and evaluation metrics influence clustering performance?
- In what ways are clustering approaches integrated with deep learning, edge computing, and machine learning to support intelligent, real-time, and scalable network optimization?
2.2. Review Protocol
This review fundamentally emphasizes the way in which these techniques have been used to tackle the key challenges of feature discovery, anomaly detection, and adaptive quality-of-service (QoS) prediction in dynamic and heterogeneous mobile network environments. The objective is to synthesize the existing body of knowledge and pinpoint evolving trends, performance outcomes, and methodological patterns that can inform future research and engineering practices in mobile network optimization.
2.2.1. Search Strategy and Eligibility Criteria
The search strategy used multiple databases (IEEE Xplore, Scopus, Web of Science, SpringerLink), with the following keywords: “clustering,” “QoS prediction,” “5G,” “6G,” “adaptive networks,” “machine learning,” “unsupervised learning,” and “heterogeneous mobile networks.” The inclusion criteria were as follows: (i) peer-reviewed, (ii) English, (iii) published between 2000 and 2024, with priority on recent 5G/6G papers; and (iv) direct relevance to clustering in mobile networks. Dissertations, patents, non-peer-reviewed materials, or studies lacking empirical evaluation were excluded. After screening 10,000+ entries, 500 were retained, and 40 met all high-quality inclusion criteria.
2.2.2. Study Selection Process
The remaining 40 studies were subjected to full-text assessment, and all were deemed eligible and retained for the final synthesis. The study identification, screening, eligibility, and inclusion phases were visually documented using the PRISMA 2020 [13,14] flow diagram reported in Figure 1.
Figure 1.
PRISMA flow diagram outlining the systematic review protocol and decision flow.
To contextualize these publication trends, Table 3 summarizes the PRISMA stages and corresponding study counts, illustrating the identification, screening, eligibility, and inclusion processes that led to the final set of 40 studies included in this review.
Table 3.
PRISMA stages explained.
A notable increase in research output was observed from 2017 onwards, with peak publications recorded in 2018 and 2020, suggesting heightened research interest and activity during this period. In contrast, earlier years (2000–2010) showed relatively sporadic and low publication rates, indicating limited early exploration of the topic. The post-2017 surge may reflect the impact of emerging technologies, methodological advancements, or the growing recognition of the importance of the research area. Following is Figure 2, which illustrates the annual distribution of peer-reviewed journal and conference publications in English related to clustering in mobile networks, spanning the period from 2000 to 2024.
Figure 2.
Annual distribution of peer-reviewed journal and conference publications in English related to clustering in mobile networks, covering the period from 2000 to 2024.
2.2.3. Data Extraction and Management
Table 4 summarizes the reasons for excluding studies during the full-text eligibility assessment phase. Each paper was evaluated against the predefined inclusion and exclusion criteria to ensure methodological rigor and thematic relevance. The exclusion counts reflect the dominant reasons why studies did not meet the review’s eligibility requirements, supporting the transparency and reproducibility of the selection process.
Table 4.
Exclusion reasons and counts.
As summarized in Table 4, a total of 460 studies were excluded during the full-text review phase. The most common exclusion reason (n = 210) was that the study did not focus on clustering within mobile network environments, often addressing unrelated application domains such as IoT, healthcare, or social media. A further 120 papers were excluded for lacking empirical or quantitative evaluation, while 85 studies fell outside the defined publication period (2000–2024) or were otherwise out of scope. Additionally, 30 publications were removed due to non-peer-reviewed status or inaccessibility, and 15 entries were excluded as duplicates or due to data inconsistencies.
2.2.4. Risk of Bias and Quality Assessment
Quality assessment of the included studies was performed using the AMSTAR (A Measurement Tool to Assess Systematic Reviews) checklist. The evaluation focused on design rigor, empirical validation, and reproducibility of each study. The results of the appraisal process are summarized in Table 5, which includes assessments of the 11 AMSTAR criteria.
Table 5.
AMSTAR quality appraisal summary.
Why AMSTAR? AMSTAR provides a concise, well-known checklist for assessing methodological transparency, search comprehensiveness, and reporting quality of systematic reviews. Items such as whether priori criteria were specified, whether a comprehensive search was performed, and whether study quality was considered in conclusions, are broadly applicable beyond health sciences and help to establish reproducible review practice. We used AMSTAR for engineering studies as a framework to assess reporting and transparency rather than as a prescriptive medical instrument. Where AMSTAR items were not directly applicable to algorithmic/empirical engineering work (for example, items focused on patient outcomes), we:
- (a)
- explicitly noted non-applicability and documented the reason, and
- (b)
- replaced or supplemented those items with engineering-relevant checks (e.g., whether datasets, code, or experimental settings were reported, whether performance metrics and computational costs were provided).
Table 5 presents the AMSTAR quality appraisal summary.
2.2.5. Data Synthesis
Owing to methodological and application heterogeneity, a narrative synthesis approach was adopted instead of a meta-analysis. The findings were thematically grouped based on the type of clustering technique and associated application area (e.g., feature discovery or anomaly detection). This allowed for a detailed interpretation of the literature while preserving methodological distinctions.
2.2.6. Protocol Registration
To ensure transparency and reproducibility, the full protocol for this systematic literature review was registered in the Open Science Framework (OSF).
3. Results and Thematic Analyses
In this section, the findings of the systematic literature review, combining both quantitative and qualitative intuitions, provide a comprehensive understanding of how clustering techniques are applied in mobile network optimization. The first part of the section emphasizes quantitative results, combining the publications per year, distribution of network environments, clustering techniques, and application domains. Following this, thematic analysis identifies and interprets recurring patterns and conceptual trends within the studies, shedding light on the motivations, challenges, and innovations that shape the current research landscape.
3.1. Quantitative Results
Table 6 presents the characteristics of the included studies.
Table 6.
Characteristics of the Included Studies.
3.1.1. Network Environment Studied
The selected studies have employed a wide range of network environments. 5G networks were the most frequently studied, appearing in 10 studies, followed by general mobile or cellular networks in 9 studies. Ad hoc environments, including Mobile Ad Hoc Networks (MANETs) and general ad hoc setups, were considered in 5 studies. LTE/4G networks were featured in three studies, whereas IoT and vehicular ad-hoc networks (VANETs) were the focus of two studies each. Several specialized environments, including cognitive radio networks, vehicular cloud networks, and mobile edge computing, have appeared in only one study. Remarkably, one study did not specify a network environment.
The reviewed studies were conducted in a wide range of network environments, showing a growing pattern of mobile and wireless communication systems. 5G networks have evolved as the most frequently studied environment, presenting 10 of the 40 studies, underlining their relevance to modern mobile network optimization challenges. While foundational clustering methods from earlier decades were included for historical completeness, the majority of the selected studies (60%) were from post-2018, reflecting current trends in 5G/6G network research. Older studies are briefly summarized to contextualize the evolution of clustering approaches in mobile networks.
The bar chart in Figure 3 shows the number of studies per network environment category.
Figure 3.
Distribution of network environments in selected studies.
3.1.2. Clustering Techniques Used
This review reveals diverse clustering approaches. Both K-means clustering and deep learning/representation-based methods were the most common methods used in seven studies. These were followed by spectral clustering and hybrid techniques, each of which was found in four studies. Adaptive and hierarchical clustering approaches were applied in three studies. Techniques such as density-based, prediction-based, and federated learning appeared in two studies. Other specialized approaches (e.g., ensemble, time-aware, dynamic, and geographical clustering) were found in one to two studies. Five studies did not specify the clustering technique. Figure 4 shows the clustering methods with the corresponding study counts.
Figure 4.
Frequency of clustering techniques employed.
Several performance metrics critical to clustering in 5G/6G networks were identified, including the accuracy, precision, recall, F1-score, silhouette coefficient, spectral efficiency, energy efficiency, latency, and throughput. Spectral efficiency and latency are crucial for meeting the demands of next-generation networks. Future studies should therefore balance quantitative evaluation with qualitative insights, including interpretability, adaptability, and robustness, to achieve a more holistic understanding of clustering effectiveness in next-generation networks.
3.1.3. Application Domains
A substantial portion of the reviewed work focused on anomaly detection, appearing in 19 of the 40 studies, demonstrating its central role in mobile network security and performance monitoring. Quality of service (QoS)-related applications were the second most prevalent and addressed in 11 studies. Quality of Experience (QoE) appeared in four studies, indicating a moderate focus on user-centric optimization. Other applications such as data clustering/collection (three studies) and spectrum management (two studies) were less common. A small number of studies (one each) focused on niche areas such as gateway management, handover prediction, and energy efficiency. Figure 5 shows the number of studies per application.
Figure 5.
Application areas addressed by reviewed studies.
It is important to note that this review does not aim to simulate specific 5G/6G scenarios but rather consolidates the literature on how clustering techniques support adaptive QoS in heterogeneous mobile environments. Therefore, detailed simulation parameters (e.g., network topologies) were beyond the scope of this study.
3.2. Thematic Analysis
Thematic analysis involves identifying, analyzing, and interpreting patterns within qualitative data. To complement the quantitative findings, a thematic analysis was conducted to uncover recurring conceptual patterns and emerging trends in the literature.
3.2.1. Clustering Techniques and Their Applications
This section categorizes the clustering techniques based on their technical formulation, implementation approaches, and network specific applications as presented in Table 7.
Table 7.
Clustering techniques and their applications.
- -
- Implementation approaches
Our review identified 14 distinct implementation approaches distributed across nine clustering techniques. Among these, K-means clustering emerged as the only technique mentioned across the two different clustering types, underscoring its popularity and adaptability in mobile network optimization scenarios. All other implementation methods were unique to their respective technique types, indicating a high degree of customization tailored to specific research contexts or performance goals. Figure 6 shows a visual representation of the implemented approaches.
Figure 6.
Distribution of implementation approaches adopted across different clustering methods, illustrating the relative reliance on common algorithms (e.g., K-means) versus technique-specific or uniquely tailored implementations.
- -
- Performance metrics
The studies employed 12 unique performance metrics to evaluate the clustering effectiveness. Notably, anomaly detection was the most prevalent, appearing in the three different techniques, reflecting its central role in securing and monitoring mobile networks. Accuracy was the second most common metric, mentioned for both techniques. The remaining metrics were associated with only one technique, highlighting a fragmented landscape where performance evaluation is highly contextual and often technique specific. Figure 7 shows a visualization of the performance metrics across techniques.
Figure 7.
Frequency distribution of performance metrics across clustering technique types.
- -
- Adaptation capabilities
We identified nine distinct adaptation capabilities across clustering techniques, each uniquely tied to their respective approaches. These capabilities range from adapting to channel conditions, traffic variability, and node mobility, to dynamically responding to environmental changes. Interestingly, K-means clustering was singled out for its limited adaptation capabilities, reinforcing its perception as a baseline technique that may lack the responsiveness required in highly dynamic environments. These differences in adaptability across clustering techniques are visually summarized in Figure 8, highlighting how most methods demonstrate unique adaptation strengths, whereas K-means remains notably limited.
Figure 8.
Adaptation capabilities of clustering methods, showing that most exhibit unique adaptability, while K-means remains limited.
3.2.2. Network Performance Enhancement
This paragraph organizes clustering applications around ten major enhancement areas and evaluates their implementation methods, benefits, and limitations. Table 8 presents the network performance enhancement.
Table 8.
Network performance enhancement.
Sixteen distinct implementation methods were identified. Among these, certain techniques, most notably Deep Learning, Federated Learning, and K-means, have repeatedly appeared, demonstrating their versatility and adaptability across different domains of network optimization. These methods were not confined to single-purpose use; instead, they spanned multiple enhancement areas such as anomaly detection, QoS prediction, and handover management. Figure 9 provides the distribution of clustering implementation methods categorized by performance evaluation domains.
Figure 9.
Distribution of clustering implementation methods categorized by performance evaluation domains, illustrating the extent to which different techniques are applied across distinct performance areas.
Recurring benefits were evident throughout the reviewed implementations. Many approaches have reported gains in prediction accuracy and Quality of Service (QoS), suggesting that clustering-based models are instrumental in addressing performance variability in mobile networks. However, the associated limitations were predominantly domain specific. This highlights a persistent contextual challenge: while clustering techniques show promise in enhancing network functionality, their practical deployment often requires customization to fit the unique constraints of each application area.
3.2.3. Adaptive Mechanisms and Dynamic Response
Clustering techniques in mobile networks are increasingly incorporating adaptive mechanisms to address the dynamic nature of real-world environments. These mechanisms are essential for maintaining performance under conditions such as user mobility, fluctuating traffic patterns, and shifting spectrum availability. Several studies have demonstrated real-time adaptation capabilities. For instance, Ali et al. [15] and Gajic et al. [16] exhibited responsiveness to evolving network conditions by processing and adjusting streaming data in real time. Mobility-aware clustering is another critical adaptation domain. Techniques such as prediction-based clustering proposed by Sivavakeesar and Pavlou [17] and federated clustering for handover prediction explored by Nivitha et al. [18] focus on preserving service quality despite the continual movement of users or nodes across network boundaries. In the context of traffic-adaptive clustering, methods such as the self-adaptive deep learning model introduced by Fernández Maimó et al. [19] and the adaptive vehicular clustering network model by Kaleibar and St-Hilaire [20] adjust the clustering behavior in response to varying traffic loads and patterns, thereby enabling stable and efficient communication. Adaptation to spectrum dynamics was also explored. Notable contributions include adaptive cooperative sensing mechanisms by Pêrez and Santamaría [21] and time-variant spectral clustering approaches such as those developed by Sun et al. [22], both of which dynamically adjust spectrum usage strategies based on real-time measurements. QoS and QoE remain central to adaptive efforts. Yin et al. [23] applied fuzzy clustering for QoS prediction, whereas John and Thangaraj [24] implemented QoE-driven anomaly detection, demonstrating how adaptive clustering can align with user experience metrics in complex environments. From a resource management perspective, techniques such as adaptive DBSCAN combined with deep reinforcement learning by Elsayed and Erol-Kantarci [25] and dynamic DBSCAN-based methods by Ren and Xu [26] adapt to the availability and demand for computational or network resources.
Finally, the hybrid adaptive framework represents the convergence of the multiple adaptation strategies. These include reinforcement learning-enhanced clustering by Kim et al. [27] and co-clustering techniques combined with logistic regression by Kassan et al. [28], which aim to improve responsiveness while maintaining the overall system stability. However, despite these advances, several challenges remain. These include managing the computational complexity, balancing stability with responsiveness, ensuring scalability across large heterogeneous networks, and validating techniques in real-world scenarios. Together, these insights emphasize the growing sophistication and importance of adaptive mechanisms in clustering-based mobile network optimization.
3.2.4. Integration Challenges and Solutions
Clustering techniques offer promising advancements in mobile network optimization; however, their integration into real-world systems remains fraught with technical and operational challenges. This theme synthesizes the key barriers to implementation and highlights innovative solutions proposed in the literature. One of the foremost obstacles is the heterogeneity of network environments. Seamless operations across various platforms, such as vehicular ad hoc networks (VANETs) and legacy mobile systems (UMTS), require significant adaptation. Benslimane et al. [29] and Xu et al. [30] illustrated approaches to unify VANET and WSN/IoT clustering within the broader 5G ecosystem.
Scalability and real-time performance are critical concerns, particularly for high-velocity data streams and dense user environments. Techniques such as NetWalk [15] and DBSCAN combined with LSTM-driven deep reinforcement learning [25] have demonstrated how adaptive algorithms can help maintain responsiveness and reliability at scale. Privacy and data security have become increasingly pertinent. Federated learning-based clustering methods, such as those proposed by Fernández Maimó et al. [19] and extended by Stenhammar et al. [31], preserve user privacy by decentralizing data processing while enabling efficient model training and cluster formation. Another significant challenge is the adaption to network changes. Real-time fluctuations in topology, user behavior, and service demand require clustering mechanisms that are responsive and robust. Incremental time-aware clustering [16] and dynamic clustering for vehicular cloud networks [20] offer effective methods for continuous adaptation to evolving scenarios. The issue of legacy integration, incorporating advanced clustering into established network infrastructures, has been addressed by Fernández Maimó et al. [21] through MEC-based anomaly detection, and by Ren and Xu [26] with DBSCAN adaptations for ultra-dense networks.
As clustering increasingly engages with multi-dimensional datasets, ensuring an accurate representation and low-noise inputs becomes vital. Approaches involving spectrum feature vector clustering [22] and denoising auto-encoders [23] have been employed to address these data-related complexities. The literature also reveals efforts to balance optimization trade-offs, particularly between the accuracy, latency, and energy consumption. Solutions include the QoE-driven anomaly detection frameworks by Murudkar and Gitlin [28] and energy-efficient VANET clustering techniques by Padmanabhan et al. [32]. Despite these advances, the interpretability of clustering decisions remains a challenge. Few studies have provided transparent frameworks for understanding or validating cluster outputs, which is a critical area for future research.
Finally, in efforts toward cross-layer optimization, studies such as those by Ali et al. [15] proposed multi-channel cognitive radio networks that integrate clustering across different OSI layers to improve overall network efficiency. Similarly, energy efficiency, particularly in resource-constrained environments such as WSNs, is addressed using energy-aware clustering approaches [30]. Researchers advocate the development of standardized benchmarking frameworks, deployment of hybrid and flexible clustering designs, broader integration into edge computing platforms, and incorporation of clustering into comprehensive AI/ML pipelines. These advancements should be validated through real-world pilot implementations to bridge the gap between theory and practice.
4. Conclusions
This systematic review highlights an evolving landscape in the application of clustering techniques for mobile network optimization. While traditional algorithms such as K-means and hierarchical clustering remain widely utilized, there has been a marked shift toward more sophisticated approaches, including adaptive, hybrid, and deep representation-based clustering. These methods are increasingly being adopted to address the multifaceted challenges posed by next-generation network environments, particularly within 5G, IoT, and vehicular networks, which are characterized by high user mobility, diverse service requirements, and dynamic spectrum conditions.
Despite these advancements, several gaps remain evident across the literature. A primary concern is the considerable variation in implementation strategies, performance metrics, and experimental settings across studies. This methodological heterogeneity undermines cross-comparison and weakens the external validity of reported outcomes. Moreover, many studies continue to emphasize algorithmic novelty at the expense of practical deployment considerations, such as scalability, real-time adaptability, and energy efficiency. Consequently, the operational maturity and deployment readiness of many proposed models remain limited, especially in edge computing and resource-constrained environments.
To strengthen the scientific rigor and practical impact of clustering research in mobile network optimization, future studies should prioritize the development of standardized benchmarking frameworks. In practice, such frameworks should incorporate shared open datasets, common performance indicators (e.g., spectral efficiency, latency, clustering stability, and energy consumption), and reproducible experimental configurations using containerized simulation environments (e.g., Docker, OMNeT++, or NS-3). Establishing such unified platforms would enable consistent cross-study evaluation and provide a foundation for quantitative and qualitative benchmarking of clustering algorithms under realistic network conditions.
Equally important is the need to balance competing performance objectives, notably accuracy, adaptability, and computational efficiency. Achieving this balance requires the integration of multi-objective optimization and context-aware clustering frameworks capable of dynamically adjusting to changing network conditions. Hybrid edge-cloud architectures, online learning mechanisms, and hierarchical clustering pipelines may offer viable pathways for maintaining high accuracy without compromising latency or energy efficiency.
Finally, as deep learning-based clustering methods continue to gain prominence, interpretability and transparency must be treated as first-class design objectives. Incorporating explainable AI (XAI) techniques, such as SHAP values, Layer-wise Relevance Propagation (LRP), and attention visualization, can significantly enhance model transparency by revealing how specific features (e.g., signal strength, interference, or mobility patterns) influence clustering outcomes. Furthermore, adopting prototype-based neural clustering and self-explainable architectures will improve user trust and facilitate integration into mission-critical, real-time network operations.
In conclusion, advancing clustering for mobile network optimization requires a shift from isolated, algorithm-centric research toward a systematic, standardized, and interpretable paradigm. By establishing common benchmarks, balancing performance trade-offs, and embedding explainability into model design, the research community can accelerate progress toward scalable, adaptive, and transparent clustering solutions for next-generation networks. Table 9 presents the identified gaps and future research recommendations in clustering for mobile network optimization.
Table 9.
Identified gaps and future research recommendations in clustering for mobile network optimization.
Moreover, we conclude that the scalability limitations are particularly pronounced across several clustering paradigms. Density-based algorithms (e.g., DBSCAN, OPTICS) exhibit high computational complexity during neighborhood searches, making them impractical for large-scale, real-time datasets. Spectral clustering suffers from the cubic cost of eigen-decomposition, restricting its use to small or moderate datasets. Deep representation-based clustering, though effective in feature extraction, demands extensive computation and memory, challenging its deployment in edge or fog environments. In contrast, centroid-based methods like K-means are scalable but often fail to adapt to dynamic or high-dimensional network conditions. Addressing these challenges will require distributed, incremental, and edge-cloud cooperative clustering frameworks that maintain accuracy while supporting the scalability demands of next-generation mobile networks.
Author Contributions
Conceptualization, C.M.N. and P.A.O.; methodology, C.M.N.; software, C.M.N.; validation, C.M.N., P.A.O. and T.M.W.; formal analysis, C.M.N.; investigation, C.M.N.; resources, P.A.O.; data curation, C.M.N.; writing, original draft preparation, C.M.N.; writing, review and editing, P.A.O. and T.M.W.; visualization, C.M.N.; supervision, P.A.O. and T.M.W.; project administration, P.A.O. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Data Availability Statement
The datasets presented in this article are not readily available because the data are part of an ongoing study. Requests to access the datasets should be directed to claudenwj@gmail.com.
Acknowledgments
The authors gratefully acknowledge the academic and administrative support provided by the University of KwaZulu-Natal and the Tshwane University of Technology during the preparation of this research. During the preparation of this manuscript, the authors used ChatGPT (GPT-5, OpenAI, 2025) for text refinement, grammar improvement, and formatting alignment with MDPI author guidelines. The authors have carefully reviewed and edited the generated text and take full responsibility for the scientific accuracy and integrity of the final manuscript.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Tariq, F.; Khandaker, M.R.A.; Wong, K.; Imran, M.; Bennis, M.; Debbah, M. A speculative study on 6G. IEEE Wirel. Commun. 2020, 27, 118–125. [Google Scholar] [CrossRef]
- ITU-T Focus Group on Network 2030. Framework and Overall Objectives of the Future Network-Network 2030; International Telecommunication Union: Geneva, Switzerland, 2020. [Google Scholar]
- Chowdary, A.; Bazzi, A.; Chafii, M. On hybrid radar fusion for integrated sensing and communication. IEEE Trans. Wirel. Commun. 2024, 23, 8984–9000. [Google Scholar] [CrossRef]
- Liu, F.; Zhou, L.; Masouros, C.; Li, A.; Luo, W.; Petropulu, A. Toward dual-functional radar-communication systems: Optimal waveform design. IEEE Trans. Signal Process. 2020, 68, 3878–3891. [Google Scholar] [CrossRef]
- Bazzi, A.; Chafii, M. Secure full duplex integrated sensing and communications. IEEE Trans. Inf. Forensics Secur. 2024, 19, 2082–2097. [Google Scholar] [CrossRef]
- Zhang, J.; Lu, W.; Xing, C.; Zhao, N.; Al-Dhahir, N.; Karagiannidis, G.K.; Yang, X. Integrated sensing and communication: A survey. Sci. China Inf. Sci. 2025, 68, 131301. [Google Scholar] [CrossRef]
- ITU-R. Framework and Overall Objectives of the Future Development of IMT for 2030 and Beyond; Recommendation ITU-R M.2083-1; International Telecommunication Union: Geneva, Switzerland, 2021. [Google Scholar]
- World Economic Forum. The Impact of 5G: Creating New Value Across Industries and Society; World Economic Forum: Geneva, Switzerland, 2020. [Google Scholar]
- MarketsandMarkets. Mobile Network Analytics Market by Component-Global Forecast to 2027; MarketsandMarkets: Pune, India, 2024. [Google Scholar]
- Khan, R.; Taj, S.; Ma, X.; Noor, A.; Zhu, H.; Khan, J.; Khan, Z.U.; Khan, S.U. Advanced federated ensemble internet of learning approach for cloud-based medical healthcare monitoring system. Sci. Rep. 2024, 14, 26068. [Google Scholar] [CrossRef]
- Jour, T.Y.; Zhang, A.U.; Jiaheng, X. Improving predictive accuracy of customer churn models through data cleaning in the telecommunications industry. Adv. Econ. Manag. Political Sci. 2025, 154, 58–61. [Google Scholar] [CrossRef]
- Abdullah, R.M.; Zukarnain, Z.A. Enhanced handover decision algorithm in heterogeneous wireless networks. Sensors 2017, 17, 1626. [Google Scholar] [CrossRef]
- Parums, D.V. Review articles, systematic reviews, meta-analysis, and updated preferred reporting items for systematic reviews and meta-analyses (PRISMA) 2020 guidelines. Med. Sci. Monit. 2021, 27, e934475-1. [Google Scholar] [CrossRef]
- Page, M.J.; Moher, D.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. PRISMA 2020 explanation and elaboration: Updated guidance and exemplars for reporting systematic reviews. BMJ 2021, 372, n160. [Google Scholar] [CrossRef]
- Ali, A.; Yaqoob, I.; Ahmed, E.; Imran, M.; Kwak, K.; Ahmad, A.; Hussain, S.; Ali, Z. Channel clustering and QoS level identification scheme for multi-channel cognitive radio networks. IEEE Commun. Mag. 2018, 56, 162–168. [Google Scholar] [CrossRef]
- Gajic, B.; Nováczki, S.; Mwanje, S.S. Improved anomaly detection in mobile networks using incremental time-aware clustering. In Proceedings of the IFIP/IEEE Symposium on Integrated Network Management, Ottawa, ON, Canada, 11–15 May 2015; pp. 948–953. [Google Scholar]
- Sivavakeesar, G.; Pavlou, G.; Bohoris, C.; Liotta, A. Effective management through prediction-based clustering approach in next-generation ad hoc networks. In Proceedings of the IEEE International Conference on Communications (ICC), Paris, France, 20–24 June 2004; pp. 4085–4090. [Google Scholar]
- Nivitha, K.; Solaiappan, A.; Pabitha, P. Robust service selection through intelligent clustering in an uncertain environment. In Proceedings of the International Conference on Emerging Trends in Information Technology and Engineering (ic-ETITE), Vellore, India, 24–25 February 2020; pp. 1–6. [Google Scholar]
- Fernández Maimó, L.; Huertas Celdrán, A.; Gil Pérez, M.; García Clemente, F.J.; Martínez Pérez, G. Dynamic management of a deep-learning-based anomaly detection system for 5G networks. J. Ambient Intell. Humaniz. Comput. 2018, 9, 1691–1701. [Google Scholar] [CrossRef]
- Kaleibar, F.J.; St-Hilaire, M. An adaptive clustering approach for dynamic service provisioning in vehicular cloud networks. In Proceedings of the IEEE World Forum on Internet of Things (WF-IoT), Aveiro, Portugal, 10–13 October 2024; pp. 1–6. [Google Scholar]
- Fernández Maimó, L.; Perales Gómez, Á.L.; García Clemente, F.J.; Gil Pérez, M.; Martínez Pérez, G. A self-adaptive deep learning-based system for anomaly detection in 5G networks. IEEE Access 2018, 6, 7700–7712. [Google Scholar] [CrossRef]
- Sun, J.; Wang, J.; Chen, J.; Ding, G.; Lin, F. Clustering analysis for Internet of spectrum devices: Real-world data analytics and applications. IEEE Internet Things J. 2020, 7, 4128–4141. [Google Scholar] [CrossRef]
- Yin, Y.; Cao, Z.; Xu, Y.; Gao, H.; Li, R.; Mai, Z. QoS prediction for service recommendation with feature learning in mobile edge computing environment. IEEE Trans. Cogn. Commun. Netw. 2020, 6, 1159–1173. [Google Scholar] [CrossRef]
- Caleb, J.; Thangaraj, J. Anomaly detection in self-organizing mobile networks motivated by quality of experience. In Proceedings of the IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT), Coimbatore, India, 15–17 February 2023; pp. 1–6. [Google Scholar]
- Elsayed, M.H.M.; Erol-Kantarci, M. Radio resource and beam management in 5G mmWave systems: Clustering and deep reinforcement learning. In Proceedings of the IEEE Global Communications Conference (GLOBECOM), Taipei, Taiwan, 7–11 December 2020; pp. 1–6. [Google Scholar]
- Ren, Y.; Xu, R. An adaptive clustering scheme based on modified density-based spatial clustering of applications with noise algorithm in ultra-dense networks. In Proceedings of the IEEE Vehicular Technology Conference (VTC), Kuala Lumpur, Malaysia, 28 April–1 May 2019; pp. 1–5. [Google Scholar]
- Kim, Y.; Al Hakim, E.; Haraldson, J.; Eriksson, H.; Da Silva, J.M.B.; Fischione, C. Dynamic clustering in federated learning. In Proceedings of the IEEE International Conference on Communications (ICC), Montreal, QC, Canada, 14–18 June 2021; pp. 1–6. [Google Scholar]
- Kassan, S.; Hadj-Kacem, I.; Ben Jemaa, S.; Allio, S. Robustness analysis of hybrid machine learning model for anomaly forecasting in radio access networks. In Proceedings of the International Symposium on Computers and Communications (ISCC), Tunis, Tunisia, 10–13 July 2023; pp. 220–225. [Google Scholar]
- Benslimane, A.; Taleb, T.; Sivaraj, R. Dynamic clustering-based adaptive mobile gateway management in integrated VANET/3G heterogeneous wireless networks. IEEE J. Sel. Areas Commun. 2011, 29, 559–570. [Google Scholar] [CrossRef]
- Xu, L.; Collier, R.W.; O’Hare, G.M.P. A survey of clustering techniques in WSNs and consideration of the challenges of applying such methods to 5G IoT scenarios. IEEE Internet Things J. 2017, 4, 1229–1249. [Google Scholar] [CrossRef]
- Stenhammar, O.; Fodor, G.; Fischione, C. Clustered predictive quality of service for connected vehicles using federated learning. In Proceedings of the IEEE International Conference on Communications Workshops (ICC Workshops), Denver, CO, USA, 9–13 June 2024; pp. 1–6. [Google Scholar]
- Padmanabhan, K.; Jeyasubramanian, I.; Pandian, J.S.; Rajendran, J.V. Improving QoS in VANET using dynamic clustering technique. In Proceedings of the IEEE International Conference on Advanced Computing (ICoAC), Chennai, India, 19–21 December 2016; pp. 313–318. [Google Scholar]
- Ali, A.A.; Hussein, M.K.; Subhi, M. A classifier-driven deep learning clustering approach to enhance data collection in MANETs. Mesopotamian J. Cybersecur. 2024, 3, 44–52. [Google Scholar] [CrossRef]
- Aljadhai, A.R.; Znati, T. Predictive mobility support for QoS provisioning in mobile wireless environments. IEEE J. Sel. Areas Commun. 2001, 19, 1882–1895. [Google Scholar] [CrossRef]
- Almobaideen, W.A.; Hushaidan, K.; Sleit, A.; Qatawneh, M. A cluster-based approach for supporting QoS in mobile ad hoc networks. Int. J. Comput. Sci. Netw. Secur. 2011, 11, 1–8. [Google Scholar]
- Aziz, Z.; Bestak, R. Insight into anomaly detection and prediction and mobile network security enhancement leveraging K-means clustering on call detail records. In Proceedings of the Italian National Conference on Sensors (SENSORS), Rome, Italy, 12–14 March 2024; pp. 88–94. [Google Scholar]
- Balakrishnan, N.; Rajendran, A.; Unk, A.P. Deep embedded median clustering for routing misbehaviour and attack detection in ad-hoc networks. Ad Hoc Netw. 2021, 119, 102524. [Google Scholar]
- Casas, P.; Fiadino, P.; D’Alconzo, A. When smartphones become the enemy: Unveiling mobile apps anomalies using clustering techniques. In Proceedings of the Workshop on All Things Cellular: Operations, Applications, and Challenges (ATC@MobiCom), Paris, France, 7–11 September 2016; pp. 19–24. [Google Scholar]
- Cretu-Ciocarlie, G.F.; Lindqvist, U.; Nováczki, S.; Sanneck, H. Detecting anomalies in cellular networks using an ensemble method. In Proceedings of the IEEE Conference on Network and Service Management (CNSM), Zurich, Switzerland, 14–18 October 2013; pp. 77–84. [Google Scholar]
- Hussain, B.; Du, Q.; Zhang, S.; Imran, A.; Imran, M. Mobile edge computing-based data-driven deep learning framework for anomaly detection. IEEE Access 2019, 7, 77521–77533. [Google Scholar] [CrossRef]
- Kajo, M.; Schnellbach, J.; Mwanje, S.; Carle, G. Decorrelating Adversarial Nets for Clustering Mobile Network Data. arXiv 2021, arXiv:2103.08348. [Google Scholar] [CrossRef]
- Kajo, M.; Schnellbach, J.; Mwanje, S.S.; Carle, G. Clustering Mobile Network Data with Decorrelating Adversarial Nets. In Proceedings of the IEEE/IFIP Network Operations and Management Symposium, Budapest, Hungary, 25–29 April 2022. [Google Scholar]
- Moulay, M.; Leiva, R.G.; Rojo Maroni, P.J.; Lazaro, J.; Mancuso, V.; Fernández, A. A Novel Methodology for the Automated Detection and Classification of Networking Anomalies. In Proceedings of the IEEE Conference on Computer Communications Workshops (INFOCOM Workshops), Toronto, ON, Canada, 6–9 July 2020. [Google Scholar]
- Moysen, J.; Giupponi, L.; Mangues-Bafalluy, J. On the Potential of Ensemble Regression Techniques for Future Mobile Network Planning. In Proceedings of the International Symposium on Computers and Communications, Messina, Italy, 27–30 June 2016. [Google Scholar]
- Moysen, J.; Ahmed, F.; García-Lozano, M.; Niemelä, J. Unsupervised Learning for Detection of Mobility Related Anomalies in Commercial LTE Networks. In Proceedings of the European Conference on Networks and Communications, Dubrovnik, Croatia, 15–18 June 2020. [Google Scholar]
- Murudkar, C.V.; Gitlin, R. Machine Learning for QoE prediction and anomaly detection in self-organizing mobile networking systems. Int. J. Wirel. Mob. Netw. 2019, 11. [Google Scholar] [CrossRef]
- Murudkar, C.V.; Gitlin, G.D. QoE-Driven Anomaly Detection in Self-Organizing Mobile Networks Using Machine Learning. In Proceedings of the Wireless Telecommunications Symposium, New York, NY, USA, 9–12 April 2019. [Google Scholar]
- Oldmeadow, J.; Ravinutala, S.; Leckie, C. Adaptive Clustering for Network Intrusion Detection. In Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining, Sydney, Australia, 26–28 May 2004. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).