Optimizing Kubernetes with Multi-Objective Scheduling Algorithms: A 5G Perspective
Abstract
1. Introduction
- Comprehensive Literature Review: A thorough examination of existing work on Kubernetes scheduling, with a focus on 5G use cases and networking. The review is organized into two sub-categories: generic scheduling and multi-objective optimization-based scheduling;
- Critical Analysis: A meticulous evaluation of the strengths and weaknesses of current Kubernetes schedulers for 5G, offering insights into their effectiveness and limitations;
- Gap Identification and Open Questions: Identification of key research gaps and open questions in the literature, highlighting areas that require further investigation and innovation;
- Multi-objective Scheduling Approaches: A focused discussion on bi-objective, tri-objective, and multi-objective scheduling strategies identified in the existing research.
1.1. Motivation
1.2. Search Strategy
1.3. Eligibility Criteria
1.4. Data Extraction
1.5. Sources of Data
- (1)
- IEEE Xplore (<www.ieeexplore.ieee.org>);
- (2)
- Science Direct (<www.scidirect.com>);
- (3)
- Springer (<www.springer.com>);
- (4)
- ResearchGate (<www.researchgate.net>);
- (5)
- Google Scholar (<scholar.google.com>);
- (6)
- Scopus (<www.scopus.com>);
- (7)
- Taylor & Francis (<taylorandfrancis.com>);
- (8)
- The Wiley Library (<www.onlinelibrary.wiley.com>).
2. Background
2.1. Microservices
2.2. Containers
2.3. Kubernetes Architecture
2.4. Network Slicing
3. Literature Review
3.1. Containerized 5G Network Functions
3.1.1. CNF Orchestration with Kubernetes-Based Approaches
3.1.2. CNF Orchestration with Artificial Intelligence and Machine Learning
3.1.3. CNF Orchestration with Service Function Chains (SFCs)
3.1.4. CNF Orchestration with Slicing
3.2. Containerized 5G Applications
3.3. Performance Metrics of the Kubernetes Scheduler in 5G
Metric | Description |
---|---|
Availability | Availability refers to the system’s ability to ensure consistent accessibility of applications [12,29,35]. |
Reliability | Reliability refers to the consistent and dependable performance of the network in providing connectivity, data transmission, and other services to users [40]. |
Performance | Performance is achieved by balancing loads, optimizing resources, and applying affinity rules to ensure effective allocation for meeting the low-latency and high-throughput demands of 5G applications, enhancing the overall system responsiveness and scalability [12]. |
Bandwidth | The bandwidth requirement involves efficient data compression, minimizing unnecessary transfers and optimizing communication patterns [55,56]. |
Response Time | The response time refers to the duration taken for the system to process and respond to a request. It is achieved by minimizing workload allocation delays, optimizing resource management, and prioritizing low-latency communication [29,55]. |
Flexibility | Flexibility involves dynamically allocating resources, adapting scaling, and supporting diverse workloads, enabling rapid adjustment to changing demands and ensuring versatility in handling various applications within the 5G environment [33]. |
Deployment Time | Deployment time refers to the duration required for efficient orchestration, streamlined container deployment processes, and optimized resource allocation, ensuring swift and timely deployment of applications within the dynamic environment [48]. |
Cost | Cost refers to the economic considerations associated with resource utilization, infrastructure expenses, and efficiency in managing computing resources [33]. |
Resilience | Resilience refers to the system’s capability to withstand and recover from failures or disruptions by employing mechanisms such as fault tolerance, automatic workload recovery, and dynamic resource reallocation [33]. |
Scheduling Duration | Scheduling duration refers to the duration taken for the execution of scheduling processes, including workload allocation decisions, resource assignments, and real-time decision making [54]. |
Cluster Temperature | Cluster temperature metaphorically describes the overall load and resource utilization within the cluster. It refers to the strategies, such as load balancing and dynamic scaling, employed to maintain optimal temperatures by preventing resource bottlenecks and ensuring efficient performance [54]. |
Energy Consumption | Energy consumption refers to the amount of power utilized by the system, which is optimized through strategies like dynamic resource allocation and load-aware scheduling [61]. |
Fault Tolerance | Fault tolerance the system’s ability to withstand and recover from failures by employing mechanisms such as pod replication and dynamic workload rescheduling, ensuring uninterrupted operation and reliability in the face of node or pod failures within the dynamic 5G environment [12]. |
Load Balancing | Load balancing involves distributing workloads evenly across cluster nodes, optimizing resource usage, and preventing bottlenecks, ensuring efficient allocation of computing resources and maintaining optimal performance in the dynamic 5G environment [55]. |
Quality of Service (QoS) | QoS (quality of service) refers to the system’s ability to meet the performance expectations of applications, ensuring optimal resource allocation, low latency, and high throughput to deliver a reliable and responsive user experience [55]. |
Latency | It represents the delay experienced during communication and is typically measured in milliseconds (ms) [57,59]. |
Served Traffic | This refers to the volume of data transmitted over the network to and from users or devices within a specific period. It encompasses all the data packets sent and received, including voice calls, video streams, text messages, and internet browsing, among others [44,62]. |
Throughput | Refers to the rate at which data can be transferred over the network within a given period, typically measured in bits per second (bps) or megabits per second (Mbps) [57]. |
Computation time | This refers to the duration it takes for network devices or systems to process and analyze data, execute tasks, and generate responses. It encompasses various processes, such as data transmission, reception, signal processing, protocol handling, and application execution [42]. |
Quality of Experience (QoE) | This refers to the overall satisfaction and perception of users regarding the performance, reliability, and usability of services and applications delivered over 5G networks [27]. |
Execution Time | This refers to the duration taken to complete a specific task or operation within the network infrastructure or on end-user devices. It encompasses the time required for processing user requests, executing network functions, and delivering services or applications over the 5G network [59]. |
Security | This refers to the measures and protocols implemented to safeguard the integrity, confidentiality, and availability of data and communications within the 5G network [31]. |
QoV | Quality of video (QoV) refers to the level of excellence or fidelity in visual content delivered over 5G networks. It encompasses factors such as resolution, frame rate, compression efficiency, latency, and reliability of the video stream [39]. |
MOS Mean Opinion Score | This refers to a subjective measure used to assess the perceived quality of communication services, including voice and video, experienced by users [39]. |
Downtime | Ensuring minimal downtime is crucial for maintaining high availability and performance levels, especially in situations where uninterrupted service is essential [52]. |
Makespan | This is the period between the starting time of the execution and the completion time of a set of tasks [24,26]. |
Efficiency | Efficiency refers to the optimal orchestration and management of network resources to achieve high performance and service quality [58]. |
Reference | Availability | Performance | Bandwidth | Response Time | Flexibility | Deployment Time | Cost | Downtime | Resilience | Scheduling Duration | Cluster Temperature | Energy Consumption | Fault Tolerance | Load Balancing | QoS | Makespan | Efficiency | Served Traffic | Latency | Throughput | Computation Time | QoE | Execution Time | Security | QoV | Resource Utilization | Scalability | Reliability | Mean Opinion Score | Single/Multi-Objective | Scheduling Strategy |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
[32] | √ | √ | √ | Multi | Dynamic | ||||||||||||||||||||||||||
[33] | √ | √ | √ | Multi | Dynamic | ||||||||||||||||||||||||||
[35] | √ | √ | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||
[36] | √ | √ | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||
[37] | √ | √ | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||
[38] | √ | √ | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||
[27] | √ | √ | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||
[40] | √ | √ | √ | √ | √ | √ | Multi | Dynamic | |||||||||||||||||||||||
[41] | √ | √ | √ | √ | √ | Multi | Dynamic | ||||||||||||||||||||||||
[43] | √ | √ | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||
[44] | √ | √ | √ | Multi | Dynamic | ||||||||||||||||||||||||||
[45] | √ | √ | √ | Multi | Dynamic | ||||||||||||||||||||||||||
[46] | √ | √ | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||
[47] | √ | √ | √ | √ | √ | √ | √ | Multi | Dynamic | ||||||||||||||||||||||
[49] | √ | √ | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||
[50] | √ | Single | Dynamic | ||||||||||||||||||||||||||||
[51] | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||||
[59] | √ | √ | √ | Multi | Dynamic | ||||||||||||||||||||||||||
[60] | √ | √ | √ | Multi | Dynamic | ||||||||||||||||||||||||||
[53] | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||||
[48] | √ | Single | Dynamic | ||||||||||||||||||||||||||||
[54] | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||||
[55] | √ | √ | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||
[56] | √ | √ | √ | Multi | Dynamic | ||||||||||||||||||||||||||
[63] | √ | √ | √ | Multi | Dynamic | ||||||||||||||||||||||||||
[61] | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||||
[28] | √ | √ | Multi | Static | |||||||||||||||||||||||||||
[34] | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||||
[57] | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||||
[42] | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||||
[52] | √ | √ | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||
[59] | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||||
[31] | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||||
[39] | √ | √ | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||
[26] | √ | Single | Dynamic | ||||||||||||||||||||||||||||
[25] | √ | √ | Multi | Dynamic | |||||||||||||||||||||||||||
[58] | √ | √ | Multi | Dynamic |
3.4. Limitations of the Current Literature
4. Advantages and Disadvantages of Integrating 5G Technology with Kubernetes
- Dynamic Scalability for 5G Traffic: Kubernetes enables operators to dynamically add/remove nodes to manage fluctuating 5G traffic, such as ultra-reliable low-latency communication (URLLC) and enhanced mobile broadband (eMBB) services [64];
- Automation for CNFs: Kubernetes automates CNF deployment, scaling, and recovery across distributed 5G infrastructure, reducing manual intervention in handling core and RAN functions [65];
- High Availability for 5G Services: Ensures uninterrupted 5G services by restarting failed pods, reallocating workloads, and distributing slices to maintain ultralow latency requirements [66];
- Flexibility Across 5G Deployments: Supports hybrid and edge–cloud environments, enabling orchestration of 5G functions across on-premises sites, cloud, and edge nodes for network slicing [44];
- Security for Telecom-Grade Workloads: Role-based access control, encrypted node communication, and fine-grained policies help to secure 5G CNF deployments against evolving threats [67].
- Complexity for Telecom Operators: Running Kubernetes in multi-domain 5G networks requires deep expertise in telecom protocols and distributed systems, creating adoption barriers [68];
- Resource Overhead in 5G Core/RAN: Kubernetes’ control plane consumes significant CPU/memory, which may reduce efficiency in dense 5G core deployments [69];
- Operational Complexity in Large 5G Networks: Maintaining Kubernetes clusters across heterogeneous RAN, edge, and core locations adds monitoring and troubleshooting burdens [5];
- Security Risks in 5G Environments: Expanded attack surfaces, including insecure container images or poorly configured policies, pose risks to critical 5G workloads [67];
- Vendor Lock-in for Telecom Clouds: Reliance on cloud-managed Kubernetes services risks limiting portability of 5G network functions across multi-cloud environments [23].
- Policy Control Across Protocols: Difficulty in applying consistent scheduling and policy control across mixed traffic (4G protocols like SIP and SCTP vs. 5G NR slicing) [64];
- Security Enforcement Across Layers: Ensuring end-to-end security across multi-layer 5G CNF architectures remains a challenge [67];
- Limited Visibility of Network Flows: Difficulty in monitoring intra-slice traffic, making optimization of 5G CNF scheduling less transparent [38];
- Revenue Model Limitations: Dependence on legacy 4G billing systems while deploying standalone 5G cores delays ROI for operators [70].
5. Comparing Container Orchestration Solutions for Modern Deployment Needs
6. Applications Enhanced by Kubernetes in 5G Environments
- Service Orchestration and Automation: 5G networks require complex service orchestration to efficiently manage resources and deliver services to end-users. Kubernetes can orchestrate and automate various network functions, such as (VNFs) and containerized network functions (CNFs), ensuring smooth service delivery and dynamic scaling to meet changing demands;
- Network Slicing: 5G enables network slicing, where a single physical network is partitioned into multiple virtual networks to support different use cases with varying requirements. Kubernetes can manage the deployment and scaling of applications within each network slice, ensuring isolation and efficient resource utilization;
- Edge Computing: Kubernetes is well suited for deploying and managing applications at the network edge, where low latency and high performance are critical. Edge applications in 5G networks, such as IoT services, video analytics, and augmented reality (AR) applications, benefit from Kubernetes’s ability to manage containers close to end-users, reducing latency and improving responsiveness;
- Workflow Schedulers: Workflow schedulers, such as Airflow or Argo, are essential for orchestrating complex data-processing pipelines and workflows in 5G environments. Kubernetes can manage the execution of workflow tasks across distributed environments, ensuring reliability, scalability, and efficient resource utilization;
- Real-time Analytics and Monitoring: 5G networks generate large amounts of data that require real-time analytics and monitoring to optimize network performance and ensure quality of service (QoS). Kubernetes can deploy and manage analytics and monitoring applications, such as Prometheus and Grafana, to collect, analyze, and visualize network data in real time;
- Content Delivery Networks (CDNs): CDNs in 5G networks benefit from Kubernetes’s ability to dynamically scale and distribute content delivery nodes closer to end-users. Kubernetes can deploy and manage CDN nodes across distributed edge locations, optimizing content delivery and reducing latency for multimedia and streaming services.
7. Research Gaps and Future Directions
7.1. Key Challenges in Multi-Objective Scheduling for 5G with Kubernetes
- Heterogeneous Infrastructure: The presence of diverse hardware in 5G networks requires scheduling algorithms to account for heterogeneity for optimal resource allocation. Failure to address hardware diversity can result in inefficient resource utilization and hinder the achievement of scheduling objectives [28,38];
- Real-time Responsiveness: The need for real-time responsiveness in 5G applications requires scheduling algorithms to make rapid decisions based on changing conditions. Delays in decision making can lead to performance bottlenecks, especially for applications with stringent latency requirements [25,54];
- Energy Efficiency: Efficient energy consumption is crucial in 5G networks, necessitating scheduling algorithms to consider the energy impacts of resource allocation decisions. Inefficient energy usage can contribute to increased operational costs and environmental concerns [68].
7.2. Future Directions in Multi-Objective Scheduling for 5G with Kubernetes
- Machine-Learning Integration: Integrating machine-learning techniques for dynamic workload prediction and optimization can significantly improve the adaptability of scheduling algorithms. Machine-learning models offer valuable insights into workload patterns, empowering the system to make proactive and intelligent scheduling decisions. By discerning patterns and trends, the system can anticipate fluctuations and efficiently scale Kubernetes clusters in preparation for expected increases or decreases in demand [26,71];
- Continuous Monitoring and Adaptation: Continuous monitoring enables dynamic adjustment of scheduling algorithms in real time to optimize performance in dynamic environments. This adaptability enhances responsiveness by assessing performance and resource utilization across heterogeneous infrastructure components. Leveraging real-time data, Kubernetes can allocate resources dynamically, optimizing the overall system performance based on the specific characteristics and capabilities of each component [72,73];
- Quantum Computing Exploration: The exploration of quantum computing’s potential impacts on complex scheduling scenarios in 5G networks offers new possibilities. Quantum computing’s parallel processing capabilities may help to address multi-objective scheduling challenges more efficiently than classical computers. Quantum algorithms could enhance QoS optimization in resource allocation, scheduling, and routing, thereby improving the overall efficiency in 5G networks [74,75];
- Standardization and Interoperability: Establishing industry standards is essential for ensuring seamless integration between 5G components and Kubernetes, fostering interoperability for efficient scheduling. Standardized interfaces enable dynamic service orchestration within Kubernetes, allowing for automatic resource allocation and configurations to meet real-time responsiveness requirements. This reduces compatibility issues and promotes widespread adoption by providing a common foundation for deployment and management [76,77];
- Autonomic Computing Principles: Applying autonomic computing principles to scheduling algorithms in 5G with Kubernetes enables self-configuring and self-optimizing systems. These systems enhance adaptability and reduce the need for manual intervention. By continuously monitoring workloads in a 5G environment, autonomic systems dynamically scale resources based on real-time conditions, optimizing energy consumption while meeting service-level objectives [74,78].
8. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Wijethilaka, S.; Liyanage, M. Survey on Network Slicing for Internet of Things Realization in 5G Networks. IEEE Commun. Surv. Tutor. 2021, 23, 957–994. [Google Scholar] [CrossRef]
- Kaur, K.; Mangat, V.; Kumar, K. A review on Virtualized Infrastructure Managers with management and orchestration features in NFV architecture. Comput. Netw. 2022, 217, 109281. [Google Scholar] [CrossRef]
- Senjab, K.; Abbas, S.; Ahmed, N.; Khan, A. ur R. A survey of Kubernetes scheduling algorithms. J. Cloud Comput. 2023, 12, 87. [Google Scholar] [CrossRef]
- Casalicchio, E. Container Orchestration: A Survey. In Systems Modeling: Methodologies and Tools; EAI/Springer Innovations in Communication and Computing; Springer: Cham, Switzerland, 2019; Volume 91, pp. 221–235. [Google Scholar] [CrossRef]
- Pahl, C.; Brogi, A.; Soldani, J.; Jamshidi, P. Cloud container technologies: A state-of-the-art review. IEEE Trans. Cloud Comput. 2019, 7, 677–692. [Google Scholar] [CrossRef]
- Rodriguez, M.A.; Buyya, R. Container-based cluster orchestration systems: A taxonomy and future directions. Softw. Pract. Exp. 2019, 49, 698–719. [Google Scholar] [CrossRef]
- Truyen, E.; Van Landuyt, D.; Preuveneers, D.; Lagaisse, B.; Joosen, W. A comprehensive feature comparison study of open-source container orchestration frameworks. Appl. Sci. 2019, 9, 931. [Google Scholar] [CrossRef]
- Arunarani, A.R.; Manjula, D.; Sugumaran, V. Task scheduling techniques in cloud computing: A literature survey. Future Gener. Comput. Syst. 2019, 91, 407–415. [Google Scholar] [CrossRef]
- Vijindra; Shenai, S. Survey on scheduling issues in cloud computing. Procedia Eng. 2012, 38, 2881–2888. [Google Scholar] [CrossRef]
- Hosseinioun, P.; Kheirabadi, M.; Kamel Tabbakh, S.R.; Ghaemi, R. aTask scheduling approaches in fog computing: A survey. Trans. Emerg. Telecommun. Technol. 2022, 33, e3792. [Google Scholar] [CrossRef]
- Wang, K.; Zhou, Q.; Guo, S.; Luo, J. Cluster frameworks for efficient scheduling and resource allocation in data center networks: A survey. IEEE Commun. Surv. Tutor. 2018, 20, 3560–3580. [Google Scholar] [CrossRef]
- Rejiba, Z.; Chamanara, J. Custom Scheduling in Kubernetes: A Survey on Common Problems and Solution Approaches. ACM Comput. Surv. 2022, 55, 151. [Google Scholar] [CrossRef]
- Carrión, C. Kubernetes Scheduling: Taxonomy, Ongoing Issues and Challenges. ACM Comput. Surv. 2022, 55, 138. [Google Scholar] [CrossRef]
- Tang, S.; Yu, Y.; Wang, H.; Wang, G.; Chen, W.; Xu, Z.; Guo, S.; Gao, W. A Survey on Scheduling Techniques in Computing and Network Convergence. IEEE Commun. Surv. Tutor. 2024, 26, 160–195. [Google Scholar] [CrossRef]
- Ahmad, I.; AlFailakawi, M.G.; AlMutawa, A.; Alsalman, L. Container scheduling techniques: A Survey and assessment. J. King Saud. Univ. Comput. Inf. Sci. 2022, 34, 3934–3947. [Google Scholar] [CrossRef]
- Sánchez, J.A.H.; Casilimas, K.; Rendon, O.M.C. Deep Reinforcement Learning for Resource Management on Network Slicing: A Survey. Sensors 2022, 22, 3031. [Google Scholar] [CrossRef]
- Attaoui, W.; Sabir, E.; Elbiaze, H.; Guizani, M. VNF and CNF Placement in 5G: Recent Advances and Future Trends. IEEE Trans. Netw. Serv. Manag. 2023, 20, 4698–4733. [Google Scholar] [CrossRef]
- Laghrissi, A.; Taleb, T. A Survey on the Placement of Virtual Resources and Virtual Network Functions. IEEE Commun. Surv. Tutor. 2019, 21, 1409–1434. [Google Scholar] [CrossRef]
- Ali, A.Q.; Sultan, A.B.M.; Ghani, A.A.A.; Zulzalil, H. A Systematic Mapping Study on the Customization Solutions of Software as a Service Applications. IEEE Access 2019, 7, 88196–88217. [Google Scholar] [CrossRef]
- Botez, R.; Pasca, A.G.; Dobrota, V. Kubernetes-Based Network Functions Orchestration for 5G Core Networks with Open Source MANO. In Proceedings of the 2022 International Symposium on Electronics and Telecommunications (ISETC), Timisoara, Romania, 10–11 November 2022; pp. 1–4. [Google Scholar] [CrossRef]
- Delnat, W.; Truyen, E.; Rafique, A.; Van Landuyt, D.; Joosen, W. K8-scalar: A workbench to compare autoscalers for container-orchestrated database clusters. In Proceedings of the 40th International Conference on Software Engineering (ICSE), Gothenburg, Sweden, 27 May–3 June 2018; pp. 33–39. [Google Scholar] [CrossRef]
- Cérin, C.; Menouer, T.; Saad, W.; Abdallah, W. Ben A New Docker Swarm Scheduling Strategy. In Proceedings of the 2017 IEEE 7th International Symposium on Cloud and Service Computing (SC2), Kanazawa, Japan, 22–25 November 2017; Volume 2018, pp. 112–117. [Google Scholar] [CrossRef]
- Bernstein, D. Containers and cloud: From LXC to docker to kubernetes. IEEE Cloud Comput. 2014, 1, 81–84. [Google Scholar] [CrossRef]
- Ogbuachi, M.C.; Gore, C.; Reale, A.; Suskovics, P.; Kovacs, B. Context-aware K8S scheduler for real time distributed 5G edge computing applications. In Proceedings of the 2019 International Conference on Software, Telecommunications and Computer Networks (SoftCOM), Split, Croatia, 19–21 September 2019; pp. 1–6. [Google Scholar] [CrossRef]
- Mondal, S.K.; Pan, R.; Kabir, H.M.D.; Tian, T.; Dai, H.N. Kubernetes in IT administration and serverless computing: An empirical study and research challenges. J. Supercomput. 2022, 78, 2937–2987. [Google Scholar] [CrossRef]
- Boudi, A.; Bagaa, M.; Poyhonen, P.; Taleb, T.; Flinck, H. AI-Based Resource Management in beyond 5G Cloud Native Environment. IEEE Netw. 2021, 35, 128–135. [Google Scholar] [CrossRef]
- Schneider, S. Conventional and Machine Learning Approaches for Network and Service Coordination. In Proceedings of the 2023 IEEE Conference on Network Function Virtualization and Software Defined Networks (NFV-SDN), Dresden, Germany, 7–9 November 2023; pp. 201–206. [Google Scholar] [CrossRef]
- Tsolkas, D.; Charsmiadis, A.S.; Xenakis, D.; Merakos, L. Service and network function placement in the edge-cloud continuum. In Proceedings of the 2022 IEEE Conference on Standards for Communications and Networking (CSCN), Thessaloniki, Greece, 28–30 November 2022; pp. 188–193. [Google Scholar] [CrossRef]
- Chen, Y. Optimal Configuration of Distributed Device Container Resources considering Virtual Aggregation Rules and Edge Node Constraints. In Proceedings of the 2023 8th Asia Conference on Power and Electrical Engineering (ACPEE), Tianjin, China, 14–16 April 2023; pp. 1844–1851. [Google Scholar] [CrossRef]
- Charismiadis, A.S.; Tsolkas, D.; Passas, N.; Xenakis, D.; Merakos, L. Metaheuristics as Enablers for VNF Scheduling in the Network Slice Set Up Process. J. Commun. Netw. 2022, 24, 742–753. [Google Scholar] [CrossRef]
- Lee, P.-H.; Lin, F.J. Tackling IoT Scalability with 5G NFV-Enabled Network Slicing. Adv. Internet Things 2021, 11, 123–139. [Google Scholar] [CrossRef]
- Alemany, P.; Roman, A.; Vilalta, R.; Pol, A.; Bonnet, J.; Kapassa, E.; Touloupou, M.; Kyriazis, D.; Karkazis, P.; Trakadas, P.; et al. A KPI-Enabled NFV MANO Architecture for Network Slicing with QoS. IEEE Commun. Mag. 2021, 59, 44–50. [Google Scholar] [CrossRef]
- Qian, D.; Guo, S.; Sun, L.; Hao, Q.; Song, Y.; Wang, M. An Integrity Measurement Scheme for Containerized Virtual Network Function. In Proceedings of the 2021 5th International Conference on Electrical, Mechanical and Computer Engineering (ICEMCE 2021), Xi’an, China, 29–31 October 2021; Volume 2137. [Google Scholar] [CrossRef]
- Nguyen, D.T.; Dao, N.L.; Tran, V.T.; Lang, K.T. Enhancing CNF performance for 5G core network using SR-IOV in Kubernetes. In Proceedings of the 2022 24th International Conference on Advanced Communication Technology (ICACT), PyeongChang, Republic of Korea, 13–16 February 2022; pp. 501–506. [Google Scholar] [CrossRef]
- Vu, D.D.; Tran, M.N.; Kim, Y. Predictive Hybrid Autoscaling for Containerized Applications. IEEE Access 2022, 10, 109768–109778. [Google Scholar] [CrossRef]
- Pino, A.; Khodashenas, P.; Hesselbach, X.; Coronado, E.; Siddiqui, S. Validation and Benchmarking of CNFs in OSM for pure Cloud Native applications in 5G and beyond. In Proceedings of the 2021 International Conference on Computer Communications and Networks (ICCCN), Athens, Greece, 19–22 July 2021; Volume 2021, pp. 1–9. [Google Scholar] [CrossRef]
- Arora, S.; Ksentini, A. Dynamic Resource Allocation and Placement of Cloud Native Network Services. In Proceedings of the ICC 2021-IEEE International Conference on Communications, Montreal, QC, Canada, 14–23 June 2021; pp. 1–6. [Google Scholar] [CrossRef]
- Dion, J.; Lallet, J.; Beaulieu, L.; Savelli, P.; Bertin, P. Cloud Native Hardware Accelerated 5G virtualized Radio Access Network. In Proceedings of the 2021 IEEE 32nd Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), Helsinki, Finland, 13–16 September 2021; Volume 2021, pp. 1061–1066. [Google Scholar] [CrossRef]
- Amogh, P.C.; Veeramachaneni, G.; Rangisetti, A.K.; Tamma, B.R.; Antony, F.A. A cloud native solution for dynamic auto scaling of MME in LTE. In Proceedings of the 2017 IEEE 28th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC), Montreal, QC, Canada, 8–13 October 2017; Volume 2017, pp. 1–7. [Google Scholar] [CrossRef]
- Zeb, S.; Rathore, M.A.; Mahmood, A.; Hassan, S.A.; Kim, J.; Gidlund, M. Edge Intelligence in Softwarized 6G: Deep Learning-enabled Network Traffic Predictions. In Proceedings of the 2021 IEEE Globecom Workshops (GC Wkshps), Madrid, Spain, 7–11 December 2021. [Google Scholar]
- Da Silva, D.C.; Batista, J.O.R.; de Sousa, M.A.F.; Mostaço, G.M.; Monteiro, C.d.C.; Bressan, G.; Cugnasca, C.E.; Silveira, R.M. A Novel Approach to Multi-Provider Network Slice Selector for 5G and Future Communication Systems. Sensors 2022, 22, 6066. [Google Scholar] [CrossRef]
- Tsourdinis, T.; Chatzistefanidis, I.; Makris, N.; Korakis, T.; Nikaein, N.; Fdida, S. Service-aware real-time slicing for virtualized beyond 5G networks. Comput. Netw. 2024, 247, 110445. [Google Scholar] [CrossRef]
- Mudvari, A.; Makris, N.; Tassiulas, L. ML-driven scaling of 5G Cloud-Native RANs. In Proceedings of the GLOBECOM 2021-2021 IEEE Global Communications Conference, Madrid, Spain, 7–11 December 2021; pp. 1–6. [Google Scholar] [CrossRef]
- Watanabe, H.; Yasumori, R.; Kondo, T.; Kumakura, K.; Maesako, K.; Zhang, L.; Inagaki, Y.; Teraoka, F. ContMEC: An Architecture of Multi-access Edge Computing for Offloading Container-Based Mobile Applications. In Proceedings of the ICC 2022-IEEE International Conference on Communications, Seoul, Republic of Korea, 16–20 May 2022; Volume 2022, pp. 3647–3653. [Google Scholar] [CrossRef]
- Wauters, T.; Volckaert, B.; Turck, F. De Towards delay-aware container-based Service Function Chaining in Fog Computing. In Proceedings of the 2020 IEEE/IFIP Network Operations and Management Symposium (NOMS 2020), Budapest, Hungary, 20–24 April 2020. [Google Scholar] [CrossRef]
- Dab, B.; Fajjari, I.; Rohon, M.; Auboin, C.; Diquelou, A. An Efficient Traffic Steering for Cloud-Native Service Function Chaining. In Proceedings of the 2020 23rd Conference on Innovation in Clouds, Internet and Networks and Workshops (ICIN), Paris, France, 24–27 February 2020; pp. 71–78. [Google Scholar] [CrossRef]
- De Simone, L.; Di Mauro, M.; Natella, R.; Postiglione, F. Performance and Availability Challenges in Designing Resilient 5G Architectures. IEEE Trans. Netw. Serv. Manag. 2024, 21, 5291–5303. [Google Scholar] [CrossRef]
- Nikaein, N.; Schiller, E.; Favraud, R.; Katsalis, K.; Stavropoulos, D.; Alyafawi, I.; Zhao, Z.; Braun, T.; Korakis, T. Network store: Exploring slicing in future 5G networks. In Proceedings of the MobiCom’15: The 21th Annual International Conference on Mobile Computing and Networking, Paris, France, 7–11 September 2015; pp. 8–13. [Google Scholar] [CrossRef]
- Katsalis, K.; Nikaein, N.; Schiller, E.; Favraud, R.; Braun, T.I. 5G Architectural Design Patterns. In Proceedings of the 2016 ICC-2016 IEEE International Conference on Communications Workshops (ICC), Kuala Lumpur, Malaysia, 23–27 May 2016; pp. 32–37. [Google Scholar] [CrossRef]
- Luong, D.H.; Outtagarts, A.; Ghamri-Doudane, Y. Multi-level resource scheduling for network slicing toward 5G. In Proceedings of the 2019 10th International Conference on Networks of the Future (NoF), Rome, Italy, 1–3 October 2019; pp. 25–31. [Google Scholar] [CrossRef]
- Aba, M.A.; Kassis, M.; Elkael, M.; Araldo, A.; Al Khansa, A.; Castel-Taleb, H.; Jouaber, B. Efficient Network Slicing Orchestrator for 5G Networks using a Genetic Algorithm-based Scheduler with Kubernetes: Experimental Insights. In Proceedings of the 2024 IEEE 10th International Conference on Network Softwarization (NetSoft), Saint Louis, MO, USA, 24–28 June 2024; pp. 82–90. [Google Scholar] [CrossRef]
- Mukuhi, D.K.; Outtagarts, A. LLRS: A Low Latency Resource Scheduling in Cloud Continuum Orchestration. In Proceedings of the 2024 27th Conference on Innovation in Clouds, Internet and Networks (ICIN), Paris, France, 11–14 March 2024; pp. 81–87. [Google Scholar] [CrossRef]
- Mekki, M.; Arora, S.; Ksentini, A. A Scalable Monitoring Framework for Network Slicing in 5G and beyond Mobile Networks. IEEE Trans. Netw. Serv. Manag. 2022, 19, 413–423. [Google Scholar] [CrossRef]
- Aleyadeh, S.; Moubayed, A.; Heidari, P.; Shami, A. Optimal Container Migration/Re-Instantiation in Hybrid Computing Environments. IEEE Open J. Commun. Soc. 2022, 3, 15–30. [Google Scholar] [CrossRef]
- Wojciechowski, L.; Opasiak, K.; Latusek, J.; Wereski, M.; Morales, V.; Kim, T.; Hong, M. NetMARKS: Network metrics-AwaRe kubernetes scheduler powered by service mesh. In Proceedings of the IEEE INFOCOM 2021-IEEE Conference on Computer Communications, Vancouver, BC, Canada, 10–13 May 2021; Volume 2021. [Google Scholar] [CrossRef]
- Pusztai, T.; Nastic, S.; Morichetta, A.; Pujol, V.C.; Raith, P.; Dustdar, S.; Vij, D.; Xiong, Y.; Zhang, Z. Polaris Scheduler: SLO- and Topology-aware Microservices Scheduling at the Edge. In Proceedings of the 2022 IEEE/ACM 15th International Conference on Utility and Cloud Computing (UCC), Portland, OR, USA, 6–9 December 2022; pp. 61–70. [Google Scholar] [CrossRef]
- Wadatkar, P.V.; Garroppo, R.G.; Nencioni, G.; Volpi, M. Joint multi-objective MEH selection and traffic path computation in 5G-MEC systems. Comput. Netw. 2024, 240, 110168. [Google Scholar] [CrossRef]
- Rahali, M.; Phan, C.; Rubino, G.; Rahali, M.; Phan, C.; Rubino, G.; Kubernetes, K.R.S.; Scheduler, R.; Rahali, M.; Phan, C.; et al. KRS: Kubernetes Resource Scheduler for resilient NFV networks. In Proceedings of the GLOBECOM 2021—2021 IEEE Global Communications Conference, Madrid, Spain, 7–11 December 2021. [Google Scholar]
- Dimolitsas, I.; Dechouniotis, D.; Papavassiliou, S. Time-efficient distributed virtual network embedding for round-trip delay minimization. J. Netw. Comput. Appl. 2023, 217, 103691. [Google Scholar] [CrossRef]
- Caminero, A.C. Quality of Service Provision in Fog Computing: Network-Aware Scheduling of Containers. Sensors 2021, 21, 3978. [Google Scholar] [CrossRef]
- Botez, R.; Costa-Requena, J.; Ivanciu, I.A.; Strautiu, V.; Dobrota, V. Sdn-based network slicing mechanism for a scalable 4g/5g core network: A kubernetes approach. Sensors 2021, 21, 3773. [Google Scholar] [CrossRef]
- Ojaghi, B.; Adelantado, F.; Verikoukis, C. SO-RAN: Dynamic RAN Slicing via Joint Functional Splitting and MEC Placement. IEEE Trans. Veh. Technol. 2023, 72, 1925–1939. [Google Scholar] [CrossRef]
- Srirama, S.N. A decade of research in fog computing: Relevance, challenges, and future directions. Softw. Pract. Exp. 2023, 54, 3–23. [Google Scholar] [CrossRef]
- Biran, Y.; Pasricha, S.; Collins, G.; Dubow, J. Enabling green content distribution network by cloud orchestration. In Proceedings of the 2016 3rd Smart Cloud Networks & Systems (SCNS), Dubai, United Arab Emirates, 19–21 December 2016. [Google Scholar] [CrossRef]
- Xu, Z.; Gong, Y.; Zhou, Y.; Bao, Q.; Qian, W. Enhancing Kubernetes Automated Scheduling with Deep Learning and Reinforcement Techniques for Large-Scale Cloud Computing Optimization. In Proceedings of the Ninth International Symposium on Advances in Electrical, Electronics, and Computer Engineering (ISAEECE 2024), Changchun, China, 15–17 March 2024. [Google Scholar]
- Oluebube, B.; Egbuna, P.; Member, R.T.; Org, K. Machine Learning Applications in Kubernetes for Autonomous Container Management. J. Artif. Intell. Res. 2024, 4, 196–219. [Google Scholar]
- Mao, Y.; Fu, Y.; Zheng, W.; Cheng, L.; Liu, Q.; Tao, D. Speculative Container Scheduling for Deep Learning Applications in a Kubernetes Cluster. IEEE Syst. J. 2021, 16, 3770–3781. [Google Scholar] [CrossRef]
- F5, “5G Makes a Cloud-Native Application Architecture Vital,” F5 Resources. [Online]. Available online: https://www.f5.com/pt_br/resources/articles/5g-makes-a-cloud-native-application-architecture-vital (accessed on 12 September 2025).
- SID Global Solutions. Exploring the top 10 Kubernetes Alternatives: Which One Is Right for You? Medium, Mar. 7, 2023. [Online]. Available online: https://sidglobalsolutions.medium.com/exploring-the-top-10-kubernetes-alternatives-which-one-is-right-for-you-7249faf724b (accessed on 12 September 2025).
- Kaur, K.; Garg, S.; Kaddoum, G.; Ahmed, S.H.; Atiquzzaman, M. KEIDS: Kubernetes-Based Energy and Interference Driven Scheduler for Industrial IoT in Edge-Cloud Ecosystem. IEEE Internet Things J. 2020, 7, 4228–4237. [Google Scholar] [CrossRef]
- Chen, Z.; Hu, J.; Min, G.; Zomaya, A.Y.; El-Ghazawi, T. Towards Accurate Prediction for High-Dimensional and Highly-Variable Cloud Workloads with Deep Learning. IEEE Trans. Parallel Distrib. Syst. 2020, 31, 923–934. [Google Scholar] [CrossRef]
- Ogbuachi, M.C.; Reale, A.; Suskovics, P.; Kovács, B. Context-Aware kubernetes scheduler for edge-native applications on 5g. J. Commun. Softw. Syst. 2020, 16, 85–94. [Google Scholar] [CrossRef]
- Perez, R.; Benedetti, P.; Pergolesi, M.; Garcia-Reinoso, J.; Zabala, A.; Serrano, P.; Femminella, M.; Reali, G.; Steenhaut, K.; Banchs, A. Monitoring Platform Evolution Toward Serverless Computing for 5G and Beyond Systems. IEEE Trans. Netw. Serv. Manag. 2022, 19, 1489–1504. [Google Scholar] [CrossRef]
- Gill, S.S.; Xu, M.; Ottaviani, C.; Patros, P.; Bahsoon, R.; Shaghaghi, A.; Golec, M.; Stankovski, V.; Wu, H.; Abraham, A.; et al. AI for next generation computing: Emerging trends and future directions. Internet Things 2022, 19, 100514. [Google Scholar] [CrossRef]
- Zeb, S.; Rathore, M.A.; Hassan, S.A.; Raza, S.; Dev, K.; Fortino, G. Toward AI-Enabled NextG Networks with Edge Intelligence-Assisted Microservice Orchestration. IEEE Wirel. Commun. 2023, 30, 148–156. [Google Scholar] [CrossRef]
- Apostolakis, K.C.; Margetis, G.; Stephanidis, C.; Duquerrois, J.M.; Drouglazet, L.; Lallet, A.; Delmas, S.; Cordeiro, L.; Gomes, A.; Amor, M.; et al. Cloud-native 5G infrastructure and network applications (NetApps) for public protection and disaster relief: The 5G-EPICENTRE Project. In Proceedings of the 2021 Joint European Conference on Networks and Communications & 6G Summit (EuCNC/6G Summit), Porto, Portugal, 8–11 June 2021; pp. 235–240. [Google Scholar] [CrossRef]
- Varga, P.; Peto, J.; Franko, A.; Balla, D.; Haja, D.; Janky, F.; Soos, G.; Ficzere, D.; Maliosz, M.; Toka, L. 5G Support for Industrial Iot Applications—Challenges, Solutions, and Research Gaps. Sensors 2020, 20, 828. [Google Scholar] [CrossRef] [PubMed]
- Huang, S.Y.; Chen, C.Y.; Chen, J.Y.; Chao, H.C. A Survey on Resource Management for Cloud Native Mobile Computing: Opportunities and Challenges. Symmetry 2023, 15, 538. [Google Scholar] [CrossRef]
Ref. | Survey |
---|---|
A Survey on Common Problems and Solution Approaches [12] | The article reviews advances in Kubernetes scheduling, explaining that the default scheduler handles general tasks well but struggles with modern demands, like machine learning, deep learning, and edge computing. It classifies research by objectives, workload types, and environments and highlights optimization strategies. |
A Survey on Scheduling Techniques in Computing and Network Convergence [14] (2024) | The survey comprehensively reviews literature on scheduling across various scenarios, including computing and network convergence from heterogeneous resources, multiple-objective optimization, and diverse task environments. It explores potential explanations and implications arising from these studies and identifies important challenges for future research. |
Container Scheduling Techniques: A Survey and Assessment [15] (2022) | The survey categorizes scheduling techniques into four groups based on the optimization algorithm used: mathematical modeling, heuristics, meta-heuristics, and machine learning. For each category, it analyses the strengths, weaknesses, and challenges of the techniques, focusing on their performance metrics. The paper concludes by outlining promising avenues for future research aimed at maximizing the benefits of container technology. |
Deep Reinforcement Learning for Resource Management on Network Slicing: A Survey [16] (2022) | This survey identifies key stages in network-slicing resource management and examines how reinforcement-learning (RL) and deep reinforcement-learning (DRL) algorithms can independently handle each stage. It evaluates these approaches based on their optimization goals, state and action spaces, algorithmic choices, deep neural network structures, exploration–exploitation strategies, and specific use cases (vertical applications). Additionally, the survey outlines future research paths concerning RL/DRL-based resource management for network slicing. |
Kubernetes Scheduling: Taxonomy, Ongoing Issues, and Challenges [13] (2022) | This survey specifically focuses on the scheduler, a critical orchestrator responsible for allocating physical resources to containers. Scheduling strategies are tailored around various quality-of-service (QoS) parameters. The paper seeks to provide comprehensive insights into Kubernetes scheduling, identify key gaps, and offer guidance for future research directions. It conducts an empirical study on Kubernetes-scheduling techniques and introduces a novel taxonomy for categorizing them. The survey also addresses challenges, outlines future directions, and explores research opportunities in this field. |
VNF and CNF Placement in 5G: Recent Advances and Future Trends [17] (2022) | This study categorizes optimization techniques for VNF placement, organizing papers by metrics, methods, algorithms, and deployment contexts. Virtualization spans beyond VMs and VNFs to encompass microservices, containers, and cloud-native systems. The article’s second part focuses on CNF placement in edge/fog computing, addressing challenges like traffic congestion, resource use, energy efficiency, and performance degradation. |
A Survey on the Placement of Virtual Resources and Virtual Network Functions [18] (2019) | This survey extensively investigates existing VNF placement strategies and algorithms, categorizing them into network function chain placement, VNF-forwarding graphs, and VNF replications. It covers generic VM-based NFV frameworks and specific VNF placement strategies. The survey reviews relevant protocols, heuristics, algorithms, and architectures with the aim of proposing efficient strategies for network slicing in future work, ensuring satisfaction for end-users and verticals while respecting various constraints. |
This work | This paper offers a comprehensive review of Kubernetes advancements specific to orchestrating 5G network functions and use cases. It focuses on key metrics studied in this domain and explores various methods applicable to Kubernetes scheduling in 5G scenarios. The taxonomy categorizes different aspects of Kubernetes resource scheduling, extending beyond algorithms and methods. |
Reference, Year | Algorithm/Method | Type of Algorithm | Type of Dataset | Testing Tool | Advantage | Disadvantage |
---|---|---|---|---|---|---|
[28] (2022) | Kubernetes-based network function orchestration with open-source MANO | KNF-based containerized approach | Service instances | Open-Source MANO (OSM), including Kubernetes | Kubernetes-based 5G core deployment offers a low latency, high availability, and flexible business model, reducing downtime fivefold compared to virtualization. | The cloud-native Kubernetes approach increases complexity, resource demands, and operational costs. |
[31] (2021) | Container integrity measurement (CIM) | Static integrity measurement | Comprises experimental data from deploying OAI-based 4G core network functions in bare-metal and VM containers. | NFV MANO and Docker or Kubernetes | The proposed scheme secures containerized VNFs, enhancing trust in 5G SBA while ensuring flexibility, scalability, and efficient resource management. | The CIM scheme increases average CPU usage by 26% and slightly raises memory usage, balancing security with performance at an acceptable cost. |
[25] (2022) | ILP (integer linear programming) | Mathematical programming | Various scenarios provide structured datasets for investigating service and function placement optimization in edge–cloud environments. | IBM ILOG CPLEX Studio | ILP solvers excel with small inputs, producing optimal solutions for edge–cloud placement, while continuity constraints enhance realism and modeling accuracy. | IBM ILOG CPLEX Studio incurs high licensing costs, a steep learning curve, and high computational demands for large-scale problems. |
[24] (2021) | Dynamic resource allocation and placement of (DRAP) algorithm | Heuristic | Service instances | Kubernetes | The cloud-native approach enhances resilience and reduces costs, using multiple replicas and dynamic vCPU allocation. | The algorithm has only been tested with simple services; complex network services may present unaddressed challenges. |
[32] (2022) | SR-IOV | Hardware virtualization technology | Voice and messaging services | Kubernetes | Offloading packet processing to user space with SR-IOV and DPDK reduces overhead on the operating system’s kernel, leading to faster processing times and improved responsiveness. | Implementing SR-IOV and DPDK requires expertise and careful configuration, potentially increasing deployment complexity. |
[33] (2022) | Modular design for network functions. | Statistical approach for modeling and resolution of resource allocation | Uses Kubernetes infrastructure hosting different network services | Kubernetes | Addresses the diversity of 5G use cases with maximum flexibility and cost effectiveness; improves network functions’ availability and resilience | Limited information provided on experimental setup and specific results obtained |
[34] (2021) | Adapting the ETSI NFV reference architecture and leveraging open-source MANO initiatives, specifically OSM | Full container technology deployment—via Kubernetes—in an NFV architecture | Service instances | OSM-K8s’ testbed | Kubernetes, as a key component of the CN approach in 5G systems, drives operational and capital expenditure savings, promoting its adoption. | Early Kubernetes deployments in 5G systems are non-standalone and are often integrated with VNF-based solutions, like OpenStack. |
[35] (2021) | Automatic activation of hardware-programmable acceleration resources for Cloud RAN | Resource allocation/optimization | Data related to radio physical layer processes | Kubernetes-based orchestration | Improves performance and energy efficiency by offloading demanding functions to hardware accelerators. | May not fully address the variability in computing demands across different network scenarios. |
[36] (2017) | Cloud-native solution for LTE MME (CNS-MME) | Microservice-based cloud-native architecture | Mobile network data | Kubernetes, Docker, Prometheus, and L7 load balancer | Provides 7% higher MME throughput and reduces processing resource consumption by 26%. Offers scalability, autoscaling, and load-balancing features. | Additional complexity of implementing microservices and managing cloud-native infrastructure. |
Reference, Year | Algorithm/Method | Type of Algorithm | Type of Dataset | Testing Tool | Advantage | Disadvantage |
---|---|---|---|---|---|---|
[37] (2021) | AI-based resource-aware orchestration (AIRO) framework in CNE | AIRO framework leverages ZSM concept, cloud-native approach, and ML | Service instances | AIRO framework for training ML agents within Kubernetes | The AIRO framework offers robust resource management and service enforcement for telco-grade systems. | The simulation platform introduces uncertainties due to stochastic behaviors, potentially affecting the accuracy of performance predictions in real deployments. |
[38] (2021) | Edge intelligence method | Deep learning model | Real network traffic dataset | Kubernetes-based edge cluster with GIST playground control center and packet tracing | Enhances 6G network management with AI and SDN/NFV; accurately predicts edge device data traffic properties | Complex implementation and data-preprocessing challenges; potential for future automation and scalability improvements |
[27] (2023) | ML and deep reinforcement learning (DRL) for network and service coordination | ML and DRL | Wireless network data | Kubernetes | Enhances resource allocation; optimizes scaling, placement, and scheduling; reduces wasted resources; and allows autonomous decision making | Some existing DRL methods rely on a priori knowledge or partial observations, which can reduce the model’s adaptability to real-world scenarios. |
[39] (2022) | Network slice selection function decision-aid framework (NSSF DAF) | Dynamic and nonlinear | Data collected from testbed experiments conducted to validate the proposed framework | Kubernetes based on OpenAirInterface (OpenAir–K8s) | The study shows the proposed 5G slice selection solution is standard-compliant and suits IoT and multimedia applications. | Implementing NSSF DAF requires integrating multicriteria decision-making methods and ML algorithms, increasing complexity and resource demands. |
[40] (2024) | ML models for traffic classification and prediction in cellular networks | ML algorithm for traffic prediction and slice reallocation in network management | Cellular network traffic flow data used for training and testing the models | Kubernetes | Enables proactive slice reallocation, resulting in up to four times lower latency (jitter) and nearly four times higher throughput compared to those of standard methods while optimizing resource allocation. | The framework’s efficiency may depend on accurate real-time predictions, which could be challenging under highly dynamic and unpredictable network conditions. |
[41] (2021) | LSTM RNN model to predict network load | ML | Real-world traffic datasets | OAI Kubernetes | Enables automatic scaling based on monitored metrics. | Requires large datasets for training and validation |
Reference, Year | Algorithm/Method | Type of Algorithm | Type of Dataset | Testing Tool | Advantage | Disadvantage |
---|---|---|---|---|---|---|
[43] (2020) | SFC controller | Network service management | Container-based service chains | Kubernetes | The proposed SFC controller reduces network latency and conserves bandwidth in fog–cloud environments, optimizing resource provisioning for Smart City use cases. | The scheduling decision time is increased by 6 ms per pod, potentially affecting the overall system performance. |
[44] (2020) | Optimized network-aware load-balancing algorithm | Load-balancing algorithm | Traffic data and infrastructure states | Kubernetes-based platform | Efficient traffic steering and load balancing in cloud-native SFCs, reducing CAPEX and OPEX while ensuring network performance | Error proneness in service function chaining due to many microservices |
[45] (2024) | Non-product-form queueing networks and RBDs/SRN | Proactive scaling using machine learning | Real-world traffic datasets | Kubernetes | Enables efficient management and automatic scaling of network functions | Dependency on effective fault injection setup and analysis |
Reference, Year | Algorithm/Method | Type of Algorithm | Type of Dataset | Testing Tool | Advantage | Disadvantage |
---|---|---|---|---|---|---|
[46] (2015) | Network store with SDN/NFV-based network slicing | Architecture/platform design | Real-time LTEaaS testbed | OpenStack (with Heat and LXC), OAI, and USRP B210 | Enables programmable 5G network slices; supports service-specific customization; leverages NFV/SDN for flexibility | Limited real-world deployment; challenges in meeting real-time deadlines without proper hardware provisioning |
[47] (2016) | 5G architectural design patterns (distributed shared memory, dedicated data plane, shared control plane, agent-based VNF/VNA software, and SLA-driven RAN) | Architectural design methodology | Not applicable (design focused) | Theoretical framework, SDN/NFV orchestration, and cloud simulation references | Modular, scalable, supports SLA-driven RAN, stateless VNFs, improved fault tolerance, and flexible slicing | High complexity in implementation, dependency on SDN/NFV maturity, and lacks empirical performance metrics |
[48] (2019) | Multi-level (global slice) scheduling for network slicing | Heuristics | Measurement data from an in-house developed testbed | Kubernetes and OpenStack cluster | Multi-level orchestration | Limited optimization |
[49] (2024) | Setpod scheduler | Scheduling algorithm for network slicing within Kubernetes (K8s) | Virtualized resource data related to network slices within a Kubernetes environment | Kubernetes (K8s) as the platform for virtualization and resource management | Maximizes acceptance ratio for deploying slices while minimizing energy consumption; shows better performance in energy efficiency and average deployment time compared to those of Kube scheduler | Tested only with a small cluster of three machines; does not account for additional constraints, like bandwidth and latency between pods; requires further scalability and adaptability for larger clusters and varying use cases |
[50] (2024) | Low-latency resource scheduler (LLRS) | Scheduling algorithm | Resource allocation and scheduling data within a cloud continuum environment | Docker Swarm and Kubernetes as orchestration platforms | Reduces container-scheduling time by nearly 50% in comparison to those of default schedulers in Docker Swarm and Kubernetes, improving the overall system responsiveness and performance. | While it reduces scheduling delay, it may not fully address the stringent QoS demands of URLLC in 5G/6G networks without further optimization or enhancements. |
[51] (2022) | Scalable architecture with lifecycle management | Distributed monitoring | Network slice performance metrics | Kubernetes | Supports high scalability for numerous network slices | Needs continuous testing to maintain performance under varying loads |
Algorithm/Scheduler | Scalability | Latency | Energy Efficiency | Operational Overhead | Real-Time Responsiveness |
---|---|---|---|---|---|
ILP-based solvers [24] | Low (NP, hard; impractical at scale) | Moderate | Neutral | High (computational cost) | Poor (slow convergence) |
DRAP heuristics [25] | Moderate | Improved | Neutral | Low–moderate | Limited (heuristic approximations) |
SR-IOV CNF placements [32] | High (hardware assisted) | Very low (fast throughput) | Moderate | Hardware dependent | Good (if hardware is available) |
DRL-based orchestrators [27] | High (learning adaptation) | Improved | Varies (depending on training) | High (training cost and instability) | Moderate (depending on observability) |
AIRO [37] | Moderate | Improved | Improved | High (complex orchestration) | Limited under dynamic conditions |
NetMARKS [53] | Moderate | Improved | Improved | Low–Moderate | Context dependent (testbed specific) |
Setpod scheduler [49] | Moderate | Improved | Improved | Moderate | Context dependent |
Polaris2 [56] | High | 30% latency reduction | Neutral | Moderate (approximation methods) | Limited predictability |
IPA-E [60] | High | Reduced latency | Neutral | Moderate–high (network aware) | Possible scheduling delays |
Reference, Year | Algorithm | Type of Algorithm | Type of Dataset | Testing Tool | Advantage | Disadvantage |
---|---|---|---|---|---|---|
[52] (2022) | Edge-computing-enabled container migration/re-instantiation (EC2-MRI) | Real-time heuristic | IOT applications | Testbed based | By optimizing container placement, applications can execute faster, leading to better performance and shorter execution times. | Underutilized container instances lead to inefficient use of resources. |
[53] (2021) | NetMARKS represents an innovative approach to Kubernetes pod scheduling, utilizing advanced techniques. | Dynamically collected network metrics through the Istio Service Mesh | Different workloads and processing layouts | Kubernetes | NetMARKS optimizes Kubernetes scheduling for 5G workflows, saving up to 50% of the inter-node bandwidth and reducing response times by 37%. | Its drawback is the lack of a detailed experimental setup and specific results, hindering evaluation and reproducibility. |
[54] (2019) | Dynamic orchestration of edge-native applications on 5G | The algorithm dynamically coordinates edge-native 5G applications, considering physical, operational, network, and software parameters. | Real-time information about edge infrastructure components | Kubernetes | The strategy broadens node capabilities but needs improvement in scheduling speed and efficiency for demanding deployments. | Enhancements could include empowering users to fine-tune scheduling strictness by specifying the degree of influence that assigned parameters have on the final score. |
[55] (2023) | Two-population ant colony algorithm to solve the container resource allocation problem | Dynamic multi-objective algorithm | Real-time sensing data | Kubernetes and Docker | The goal is to attain the best allocation of edge container resources for distributed devices in edge-computing scenarios, ensuring effective utilization. | The proposed model and algorithm introduce complexity, requiring advanced implementation and oversight. |
[56] (2022) | Polaris Scheduler is designed to be SLO aware. | Heuristic algorithm | Real-time collected data | Kubernetes | The scheduler models edge topology as a cluster topology graph, capturing network quality for accurate environment representation and improved microservice placement. | The added network QoS and dependency considerations increase resource requirements, impacting scheduling efficiency. |
[57] (2023) | Multi-objective Dijkstra algorithm (MDA) | Multi-objective Dijkstra algorithm (MDA) | VideoLan application between multiple multi-access edge-computing (MEC) hosts (MEHs) | Testbed based on AdvantEDGE and Kubernetes | The results show that MDA can migrate applications with minimal impacts on network performance and user experience, addressing the challenges of mobility and resource allocation in MEC systems. | Implementing MEC entails managing edge infrastructure, incurring high costs, and raising security and privacy concerns due to processing sensitive data at the edge. |
[42] (2022) | Container orchestration system for the PoC implementation of ContMEC | ContMEC’s algorithm involves creating computing clusters per edge station, implementing hierarchical resource management, and deploying overlapped computing clusters. | The control traffic in Kubernetes for container deployment and health checks occurs between the cluster master and worker nodes, as well as between the cluster master and UEs when joining the computing cluster in ContMEC. | Kubernetes | The ContMEC architecture efficiently offloads applications to MEC servers, promoting scalability and resource sharing, as demonstrated in a Kubernetes-based PoC. | Implementing applications as container clusters can introduce management and maintenance complexity, potentially increasing overhead in orchestration and monitoring. |
[58] (2022) | KRS: Kubernetes resource scheduler for resilient NFV networks | Kubernetes resource scheduler | 10 to 20 CNFs deployed at each node | Kubernetes | The KRS algorithm allocates memory and CPUs for critical CNFs, leverages Kubernetes customization, and optimizes CNF coexistence to improve stability post deployment. | Optimizing KRS involves mastering statistical methods, GAs, parameter tuning, continuous adaptation for resilience. |
[59] (2023) | sV-VNM algorithm | Round-trip delay minimization | IoT-based applications | Kubernetes and OpenStack | The proposed algorithm offers time-efficient DVNE solutions for IoT applications with strict delay requirements, optimizing resource usage and ensuring timely service delivery, cost savings, and improved network performance. | The worst-case complexity of the proposed algorithm is O(kn3), which may limit scalability or numerous VNE requests, necessitating efficient complexity management. |
[60] (2021) | IPerf algorithm (IPA) | Network-aware scheduling algorithm | 200 jobs | Kubernetes | The IPA improves QoS by executing applications within deadlines, considering the network status, and estimating job completion times. | The current limitations and complexity of IPA necessitate improvements in prediction calculations, advance scheduling, and the inclusion of additional network metrics. |
Feature | Development | Application Suitability | Infrastructure | Setup |
---|---|---|---|---|
Docker Swarm | Simple setup and user-friendly interface | Smaller deployments and teams with less experience | Docker | Simple and easy to set up |
Nomad | Easy to use, lightweight, and flexible | Smaller teams; does not require complex setup/configuration | Any infrastructure (public/private clouds and bare metal) | Straightforward setup: no complex configuration needed |
Mesos | Modular architecture integrated with different runtimes | Teams with advanced needs and expertise to manage complexity | Any infrastructure (flexible) | Requires expertise for setup and configuration |
OpenShift | Additional features/tools for simplified management | Teams looking for a more complete container platform | Kubernetes-based platform can run on any infrastructure | May require more setup and configuration effort |
Rancher | Complete container management platform that is easy to use | Teams needing a complete container platform and preferring managed service | Kubernetes and other container orchestration tools as a service | Easy to set up and user-friendly interface |
Amazon ECS | Fully managed container service that is simple and cost effective | Teams already using AWS and looking for fully managed solution | AWS cloud | Fully managed by AWS; simple setup and management |
Google Kubernetes Engine (GKE) | Fully managed Kubernetes service that is scalable and secure | Teams already using GCP and looking for fully managed Kubernetes | Google Cloud Platform (GCP) | Fully managed by Google; easy deployment and management |
Microsoft Azure Kubernetes Service (AKS) | Fully managed Kubernetes service; simple deployment on Azure | Teams already using Azure and looking for fully managed Kubernetes | Microsoft Azure cloud | Fully managed by Microsoft; easy deployment and management |
IBM Cloud Kubernetes Service | Fully managed Kubernetes service; scalable and secure | Teams already using IBM Cloud and looking for fully managed Kubernetes | IBM Cloud | Fully managed by IBM; easy deployment and management |
Apache Aurora | Advanced and customizable option for container orchestration | Teams with advanced needs and expertise | Flexible: runs on any infrastructure | Requires expertise for setup and configuration |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Farid, M.; Lim, H.S.; Lee, C.P.; Zarakovitis, C.C.; Chien, S.F. Optimizing Kubernetes with Multi-Objective Scheduling Algorithms: A 5G Perspective. Computers 2025, 14, 390. https://doi.org/10.3390/computers14090390
Farid M, Lim HS, Lee CP, Zarakovitis CC, Chien SF. Optimizing Kubernetes with Multi-Objective Scheduling Algorithms: A 5G Perspective. Computers. 2025; 14(9):390. https://doi.org/10.3390/computers14090390
Chicago/Turabian StyleFarid, Mazen, Heng Siong Lim, Chin Poo Lee, Charilaos C. Zarakovitis, and Su Fong Chien. 2025. "Optimizing Kubernetes with Multi-Objective Scheduling Algorithms: A 5G Perspective" Computers 14, no. 9: 390. https://doi.org/10.3390/computers14090390
APA StyleFarid, M., Lim, H. S., Lee, C. P., Zarakovitis, C. C., & Chien, S. F. (2025). Optimizing Kubernetes with Multi-Objective Scheduling Algorithms: A 5G Perspective. Computers, 14(9), 390. https://doi.org/10.3390/computers14090390