Next Article in Journal
A Batch-Based VNF Deployment Mechanism for Privacy-Preserving Multi-Domain SFC Deployment Using Deep Reinforcement Learning
Previous Article in Journal
TLS-Aware Anomaly Detection for Encrypted IoT Traffic Using a β-Variational Autoencoder with ANOVA–Mutual Information Feature Selection
Previous Article in Special Issue
Context-Aware Software-Defined Wireless Networks: An AI-Based Approach to Deal with QoS
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Topology-Aware Joint Control Plane Placement and Assignment for Resilient Hierarchical Cloud–Edge Networks

by
Samer Mohammed Rasool
1,
Yassine Boujelben
2,* and
Faouzi Zarai
2
1
Technical Computer Engineering Department, Mazaya University College, Nasiriyah 64001, Iraq
2
NTS’Com Research Laboratory, National School of Electronics and Telecoms of Sfax (ENET’Com), University of Sfax, Sfax 3018, Tunisia
*
Author to whom correspondence should be addressed.
Future Internet 2026, 18(6), 311; https://doi.org/10.3390/fi18060311
Submission received: 30 April 2026 / Revised: 3 June 2026 / Accepted: 4 June 2026 / Published: 8 June 2026

Abstract

Hierarchical cloud–edge networks rely on distributed control planes to manage large-scale heterogeneous infrastructures, where controller placement and node assignment strongly affect latency, load balancing, and resilience. Existing methods typically decouple these decisions and provide limited guarantees under controller failures or topology constraints. We introduce a topology-aware joint optimization framework for controller placement and node assignment in hierarchical cloud–edge networks. The problem is formulated as a multi-objective integer linear program capturing latency, load balancing, and control continuity. To ensure scalability, we design a two-phase heuristic: structurally important controller candidates are selected using graph-based metrics, including node degree and k-core decomposition, followed by a redundancy-aware proximity assignment strategy that preserves connectivity under single-controller failures. Experiments on synthetic hierarchical and random topologies with up to 500 nodes show that the proposed approach achieves optimality gaps below 10% with execution times under 10 ms. It improves load distribution and reduces control latency compared to baseline methods while maintaining resilience under controller failures. Results show that exploiting topological structure in joint placement and assignment enables efficient and resilient control plane design for hierarchical cloud–edge networks, supporting near-real-time reconfiguration.

Graphical Abstract

1. Introduction

The separation of the control plane from the data plane and the introduction of centralized network management have established Software-Defined Networking (SDN) as a transformative paradigm for overcoming the rigidity of traditional network architectures. The ability to dynamically configure, monitor, and optimize network behavior through software control is particularly beneficial for large-scale applications such as cloud computing, the Internet of Things (IoT), and 5G/6G networks. However, the centralized nature of the control plane introduces new challenges related to reliability, scalability, and performance [1,2].
Controller Placement Problem (CPP) is one of the most critical design challenges in SDN, as it determines the number and locations of controllers to minimize control latency, maximize reliability, and maintain a balanced workload distribution. Improper placement leads to high switch-to-controller communication delay, controller overload, and reduced network resilience. Heller et al. [3] demonstrated that this latency has a significant impact on network performance, establishing controller placement as a fundamental design factor in SDN architectures. Subsequent studies extended CPP to additional objectives including fault tolerance [4] and energy efficiency [5].
In multi-controller SDN environments, CPP is inherently coupled with the switch assignment problem, which determines how switches are mapped to controllers under latency and capacity constraints. Addressing placement without jointly optimizing assignment leads to suboptimal performance, particularly in large-scale or mission-critical networks [6,7]. Inefficient assignment strategies result in controller congestion, increased control overhead, and degraded fault tolerance [5].
As SDN deployments scale and diversify, resilience has emerged as the primary design requirement. Networks must maintain operation during controller failures, link failures, and unexpected traffic surges [3,7,8]. A widely adopted strategy is redundant multi-controller deployment, where each switch is connected to multiple controllers so that backup controllers take over upon primary failure. However, resilience cannot be achieved by placement alone; the assignment of switches to controllers determines the backup relationships that exist in practice, making joint optimization essential [8,9].
The urgency of this joint design is further amplified as SDN deployments extend beyond fixed terrestrial infrastructures into inherently dynamic environments, including LEO satellite constellations, vehicular networks, and large-scale IoT deployments [10,11,12]. In such settings, topology changes and controller failures are routine operational events rather than exceptional conditions, and controller reassignment must complete within the network’s coherence time, namely the interval during which current topology and load conditions remain operationally valid. A control plane that cannot adapt at this speed offers only nominal fault tolerance, making fast and topology-aware joint optimization a practical necessity rather than a theoretical concern.
Recent work has addressed resilience-aware CPP through a variety of optimization and metaheuristic approaches, including genetic algorithms, particle swarm optimization, and integer linear programming [13]. Despite this progress, most existing methods treat controller placement and switch assignment as sequential or independent subproblems, limiting their effectiveness in dynamic or failure-prone environments. Furthermore, many approaches focus narrowly on latency minimization without fully accounting for redundancy requirements, controller capacity limits, and the topological structure of the network.
In this paper, we propose a joint optimization framework for controller placement and switch assignment in resilient SDN environments. The framework is grounded in a multi-objective Integer Linear Program (ILP) that jointly optimizes controller placement and switch assignment under resilience, capacity, and QoS constraints, incorporates topological centrality measures based on node degree and k-core decomposition into the objective function, and serves as an exact benchmark for evaluating heuristic solutions.
To overcome the computational limitations of exact optimization at scale, a two-phase heuristic framework was proposed. In the first phase, five topology-aware candidate selection strategies of polynomial complexity exploit structural graph properties to identify a compact set of controller locations. In the second phase, switches are assigned to the selected controllers through a greedy proximity-driven algorithm that enforces redundancy and respects capacity constraints. By construction, every switch has guaranteed connections to a minimum number of controllers, ensuring control-plane continuity under individual controller failures.
The main contributions of this paper are as follows:
  • We propose a multi-objective ILP that integrates k-core decomposition and node degree as explicit optimization criteria alongside latency minimization and assignment coverage, enabling the network’s structural backbone to directly guide resilient controller placement decisions.
  • We introduce five topology-aware candidate selection strategies of polynomial complexity and provide a systematic comparison of structural criteria, from local degree to global core number, for controller placement, together with a complete complexity analysis.
  • We design a two-phase heuristic framework that separates candidate selection from switch assignment, guarantees per-switch redundancy through minimum assignment constraints, and enforces controller capacity limits using a proximity-driven greedy assignment algorithm.
  • We perform a systematic scalability study comparing exact and heuristic controller placement across two structurally distinct topologies, namely flat random and hierarchical multi-core networks, up to 500 nodes. The results quantify optimality gaps and demonstrate that richer hierarchical k-core structures significantly improve heuristic solution quality by reducing placement symmetry.
The remainder of this paper is organized as follows. Section 2 reviews related work on controller placement and resilient SDN. Section 3 presents the system model and relevant graph-theoretic concepts. Section 4 formalizes the optimization problem. Section 5 describes the proposed heuristic framework. Section 6 presents the numerical evaluation. Section 7 concludes the paper and outlines directions for future research.

2. Literature Review

CPP has been extensively studied since the early formalization by Heller et al. [3], who demonstrated that the geographic distribution of controllers significantly affects switch-to-controller latency and established CPP as a foundational SDN design challenge. Comprehensive surveys have confirmed that the problem has since expanded well beyond latency minimization to encompass resilience, capacity, energy efficiency, and joint assignment objectives [7,10,14,15]. This section organizes existing contributions into four themes: latency-aware placement, resilience and fault tolerance, switch assignment, and joint optimization approaches, before positioning the present work with respect to the identified gaps.

2.1. Latency-Aware Controller Placement

Early CPP formulations primarily focused on minimizing the propagation delay between switches and controllers. Subsequent work extended this to multi-controller settings, where controller-to-controller synchronization delay must also be accounted for [16]. Naseri et al. [17] proposed a binary linear programming model that jointly minimizes the setup cost and average control packet latency in wide-area networks, demonstrating favorable trade-offs over prior methods. Metaheuristic approaches have also been applied to the latency minimization objective: Ma et al. [18] proposed an improved artificial bee colony algorithm that simultaneously minimizes the average and worst-case propagation latency while balancing the controller load, achieving solutions within 4.37% of the optimum on real-world topologies. More recently, deep reinforcement learning has been applied to dynamic controller placement in IoT and vehicular networks [19], where traffic fluctuations render static placement strategies insufficient. Despite these advances, latency-only formulations remain insufficient when redundancy requirements and controller capacity constraints are jointly present.

2.2. Resilience and Fault Tolerance

Resilience-aware placement has received increasing attention as SDN deployments move toward mission-critical applications. Sallahi and St-Hilaire [4] formulated an optimal placement model that guarantees switch connectivity under single controller failures. Vizarreta et al. [20] proposed two strategies based on disjoint control paths and distinct controller replicas, showing that both significantly improve control-plane resilience with a limited penalty on the average path length. Perrot and Reynaud [21] formulated an ILP for resilient placement by incorporating QoS and load balancing constraints at multiple backup levels. Kumari et al. [22] showed through a survey that redundancy alone does not guarantee reliability unless switches are properly distributed among the controllers. At larger scales, Mogyorósi et al. [23] addressed resilient control-plane design for virtualized 6G core networks, proposing latency-aware dual hypervisor placement protected against single-link and node failures and proved that finding the minimum number of hypervisors is NP-hard and difficult to approximate.
More recently, the resilience challenge has expanded to non terrestrial and integrated network environments. Chiti et al. [12] propose a multi controller placement strategy for integrated terrestrial and LEO satellite networks in which satellites dynamically operate as SDN controllers despite frequent topology variations. Their results show that topology aware placement substantially reduces latency while improving control plane load balancing. Choi et al. [11] address the scalability aspect of this problem in LEO mega constellations. They introduce a topology virtualization framework that isolates the control plane from continuous topological changes and achieves efficient load balancing across deployments at the scale of Starlink.

2.3. Switch Assignment

The switch assignment problem has received comparatively less attention as an independent problem. Tanha et al. [6] showed that assigning switches to multiple controllers significantly improves the reliability of wide-area SDN, and formulated capacity-constrained assignment as an optimization model. Lange et al. [5] demonstrated that the nearest-controller assignment leads to an uneven load distribution under heterogeneous traffic, thereby motivating load-aware heuristics. Dynamic assignment strategies have more recently been explored through reinforcement learning: Bouzidi et al. [24] proposed a deep Q-network approach that dynamically clusters switches and places controllers to minimize response time and intra-cluster delay, while Kumari et al. [25] used online reinforcement learning to rebalance switch-to-controller mappings under dynamic traffic and failure events.

2.4. Joint Placement and Assignment

A growing body of work has recognized that placement and assignment must be optimized jointly because their interdependence makes sequential approaches inherently suboptimal [7,13]. Dong and Xue [13] formulated a mixed-integer joint optimization model that minimizes the control delay while balancing the controller workload. Metaheuristic approaches have also been applied to the joint problem: Liao and Leung [26] proposed a multi-objective genetic algorithm with particle swarm optimization-based mutation to simultaneously minimize switch-to-controller delay, controller-to-controller delay, and load imbalance, while Radam et al. [27] combined harmony search and particle swarm optimization for multi-controller placement. Sirjani et al. [28] combined reinforcement learning with metaheuristics for joint placement and demonstrated improved scalability over pure ILP approaches. The graph-theoretic properties have also been explored as structural proxies for controller placement. Kumar and Malik [29] proposed a k-core decomposition-based articulation point method for identifying critical controller locations in IoT-SDN networks, combined with a metaheuristic optimization stage.

2.5. Positioning of This Work

Despite extensive research on controller placement, three gaps remain.
Firstly, while k-core decomposition has been used as a preprocessing or ranking heuristic [29], it has not been systematically integrated as an explicit optimization component within a unified formulation that jointly considers node degree, distance-aware costs, and resilience-oriented design objectives.
Secondly, most existing formulations treat redundancy requirements and controller capacity constraints in isolation, and rarely embed them within a single consistent optimization framework that also accounts for structural centrality and communication cost.
Thirdly, the joint scalability analysis of exact and heuristic approaches across structurally diverse network topologies remains limited, particularly beyond a few hundred nodes and across heterogeneous structures such as random and hierarchical multi-core networks.
This paper addresses these gaps through a unified ILP formulation that integrates structural centrality and performance objectives, a set of five topology-aware candidate selection strategies with formal complexity analysis, and a systematic scalability study up to 500 nodes across random and hierarchical multi-core topologies.

3. System Model

3.1. Network Model

We model the underlying physical infrastructure as an undirected graph G = ( V , E ) , where V is the set of network nodes and E V × V is the set of bidirectional communication links. Each node v V represents a physical device that hosts a switch responsible for forwarding data-plane traffic. A subset L V identifies the nodes that are additionally eligible to host an SDN controller, so that S = V and L S . In the simulation study presented in Section 6, we adopted the common assumption that every node in V is a candidate controller location, that is, L = V , so that the placement strategies operate over the full node set.

3.1.1. Local Connectivity: Node Degree

The degree of node l V is defined as the number of its direct neighbors:
deg ( l ) = | { j V ( l , j ) E } | .
Nodes with a higher degree offer a larger number of alternative communication paths, thereby reducing the risk of isolation under link failures. In the context of controller placement, high-degree nodes can serve a larger number of switches over short paths, enhancing the local fault tolerance and reducing the average control-plane latency.

3.1.2. Global Centrality: Core Number

The k-core of graph G is the maximal subgraph in which every node has degree at least k [30]. We denote by C k ( G ) the k-core of G. The core number of node l is defined as the index of the deepest k-core to which the node belongs.
κ ( l ) = max { k l C k ( G ) } .
Nodes with a high core number, κ ( l ) , reside in the densest and most interconnected regions of the network, thus forming a structural backbone. Controllers placed at such nodes maintain connectivity to a large fraction of the network, even under partial failures, thereby enhancing the global robustness of the control plane.
Figure 1 illustrates the k-core decomposition of the Karate Club network [31], which is a widely used benchmark in graph theory. Each panel shows the subgraph induced by nodes belonging to the k-core at the corresponding level, with node colors consistent across the panels. The color of a node reflects its core number and is preserved as decomposition deepens. Nodes surviving into the innermost core ( κ = 4 here) form the densest and most interconnected subgraph and are precisely those prioritized as candidate controller locations by the Core-Only and Hybrid Centrality strategies introduced in Section 5.

3.1.3. Topological Motivation for Placement

Together, deg ( l ) and κ ( l ) capture the complementary aspects of the node centrality. A node may have a high degree but low core number, indicating many neighbors in a sparse periphery, or a high core number but a moderate degree, indicating deep embeddedness in a dense subgraph. Controllers placed at nodes that score highly on both metrics are simultaneously well connected locally and structurally robust globally, which is the guiding principle of the placement strategies developed in Section 5.

3.2. SDN Control Plane Architecture

In SDN, the control plane is physically decoupled from the data plane. Switches handle packet forwarding according to flow rules, whereas controllers maintain a global view of the network and issue configuration commands to the switches via a dedicated control channel. Therefore, the placement of controllers and assignment of switches to controllers are critical design decisions that directly affect the performance, reliability, and scalability of the control plane.

3.2.1. Switch-Controller Assignment

Each switch s S must be assigned to at least one controller l L through which it receives control instructions. Communication between switches and controllers is mediated by a southbound interface, for which several protocols have been proposed. OpenFlow [1] is among the most widely adopted, explicitly supporting the notion of a primary controller handling active control decisions, and one or more backup controllers that take over upon primary failure. Other protocols, such as NETCONF and RESTCONF, offer alternative interaction models that are oriented toward device configuration and telemetry. While the choice of the southbound protocol is not the focus of this work, it is worth noting that placement strategies are not entirely protocol-agnostic: latency-sensitive protocols such as OpenFlow favor distributed, proximity-aware placements, whereas configuration-driven or telemetry-oriented protocols may shift the optimization emphasis toward scalability and load balancing. In this study, we adopt a latency-aware model consistent with the OpenFlow interaction paradigm, using the shortest-path distance as a proxy for the control-plane propagation delay.

3.2.2. Resilience Through Redundant Assignment

A single-controller assignment leaves the network vulnerable to controller failures; if the sole controller of a switch becomes unavailable, the switch loses all control-plane connectivity. To mitigate this risk, each switch must be connected to a minimum number of controllers, R min 2 , ensuring that at least one backup controller is always available. Conversely, assigning a switch to too many controllers introduces unnecessary synchronization overheads and control traffic. Therefore, an upper bound R max is imposed on the number of controllers per switch. Together, R min and R max define the redundancy requirements that are formally expressed as assignment bounds in the problem formulation.

3.2.3. Controller Capacity

Each controller has a finite processing capacity and can reliably manage at most C max switches simultaneously. Exceeding this limit degrades the control responsiveness and increases the risk of control-plane congestion. This capacity limitation is enforced as a constraint in the optimization model, and its interaction with the redundancy requirement R min governs the global feasibility of any valid assignment, a condition explicitly verified in the heuristic procedure in Section 5.

3.2.4. Number of Controllers

Deploying more controllers improves coverage and reduces switch-to-controller distances, but increases infrastructure and management costs. The budget constraint limits the total number of active controllers to N max c t r l . Therefore, the joint optimization of controller placement and switch assignment must balance resilience, efficiency, and deployment cost within these structural constraints, all of which are formalized in the following section.

3.3. Communication Cost Model

In SDN, the responsiveness of the control plane depends critically on the latency of the communication channel between each switch and its assigned controllers. A switch that is topologically distant from its controller experiences longer round-trip times for control message exchanges, thereby degrading the flow setup latency and reaction time to network events. Therefore, accurately modeling this cost is central to any placement optimization framework.

3.3.1. Distance as a Proxy for Propagation Delay

We quantify the communication cost between switch s S and candidate controller location l L using the shortest-path distance in graph G:
d ( s , l ) = length   of   the   shortest   path   between   s   and   l   in   G .
This metric serves as a proxy for the propagation delay under the assumption that each hop introduces a uniform and additive latency contribution. Although more refined delay models incorporating link capacities, queuing delays, or geographic distances can be substituted without altering the problem structure, the hop-count metric is widely adopted in the controller placement literature because it captures the essential topology-dependent cost in a tractable and parameter-free manner.

3.3.2. Distance and QoS Requirements

In practice, SDN deployments impose implicit or explicit QoS requirements on control-plane latency. Beyond a certain switch-to-controller distance, the resulting propagation delay becomes operationally unacceptable; the flow setup times increase, fault detection is delayed, and network responsiveness degrades. This motivates treating d ( s , l ) not merely as a cost to minimize but also as a QoS-relevant quantity that should be kept below an acceptable level for every active switch-controller pair. In the optimization model, this is captured softly through a distance penalty term in the objective function, which discourages long-range assignments without imposing a hard cut-off. In the heuristic assignment procedure, a derived distance threshold is used to explicitly reject assignments whose cost outweighs their benefit to the objective. This is formalized in Section 5 once the objective parameters have been introduced.

4. Problem Formulation

Building on the system model introduced in the previous section, we formalize the joint controller placement and switch assignment problem as an ILP. The model captures the resilience, efficiency, and load-balancing requirements identified in Section 3 through a set of binary decision variables, a collection of structural constraints, and a multi-component objective function that balances competing design criteria.

4.1. Decision Variables

Placement and assignment decisions are encoded using two binary variables and one integer variable:
y l = 1 if   a   controller   is   placed   at   node   l L , 0 otherwise . x l s = 1 if   switch   s S   is   assigned   to   controller   l L , 0 otherwise .
Variable y l encodes the controller placement decision, whereas x l s encodes the assignment of switches to the active controllers.

4.2. Parameters

Table 1 summarizes the key parameters used in our controller placement and switch assignment model.

4.3. Constraints

4.3.1. Assignment Bounds per Switch

Each switch must be connected to at least R min and at most R max controllers. These bounds define the admissible redundancy range for each switch: the lower bound guarantees resilient multi-controller connectivity, whereas the upper bound limits synchronization overhead and control-plane signaling complexity.
R min l L x l s R max , s S .

4.3.2. Valid Assignments Only to Active Controllers

A switch can only be assigned to a controller activated by the placement decision.
x l s y l , s S , l L .

4.3.3. Controller Capacity Constraint

Each controller can simultaneously manage at most C max switches and prevent a control-plane overload.
s S x l s C max , l L .

4.3.4. Limit on Total Number of Controllers

The total number of deployed controllers is bounded by the available infrastructure budget.
l L y l N max c t r l .

4.4. Objective Function

We decompose the objective into four components, each capturing a distinct structural or performance criterion for the joint controller placement and switch assignment problem.

4.4.1. Assignment Coverage

z assign = s S l L x l s
This term aggregates the total number of switch-controller associations induced by a feasible assignment. The admissible range of redundancy is enforced through the constraint in (4), which guarantees that each switch is assigned at least R min and at most R max controllers. Within these feasibility limits, the term contributes to the objective by favoring configurations that fully utilize the available redundancy budget across the network, thereby supporting resilient multi-controller connectivity.

4.4.2. Local Connectivity Preference

z degree = l L deg ( l ) y l
This component favors controller locations with higher node degrees, promoting locally well-connected placements that reduce the average hop count to neighboring switches.

4.4.3. Global Centrality Preference

z core = l L κ ( l ) y l
This term rewards the selection of controllers at nodes with a high core number, thereby placing the control plane in the structurally robust backbone of the network.

4.4.4. Distance Penalty

z distance = s S l L d ( s , l ) x l s
This term penalizes long communication paths between switches and their assigned controllers, acting as a soft QoS constraint on control-plane latency.

4.4.5. Overall Objective

The four components are combined into a single weighted objective:
max α z assign + β z degree + γ z core δ z distance
where α , β , γ , δ 0 control the relative importance of each criterion. Note that z distance has a negative sign, as it represents the cost to be minimized rather than the benefit to be maximized.
The proposed formulation balances structural robustness and operational QoS objectives through comparable weighting coefficients. In the experimental configuration, the assignment coverage, degree centrality, k-core centrality, and distance penalty terms are assigned weights of similar magnitude ( α = 15 , β = 10 , γ = 12 , and δ = 10 ). The larger value assigned to α reflects the resilience-oriented design objective of the framework, where redundant controller assignments are prioritized slightly more strongly than communication distance minimization. At the same time, the coefficients remain sufficiently balanced to avoid the dominance of any single objective component. Consequently, controller placement decisions emerge from a joint trade-off between coverage, resilience, connectivity, and communication cost rather than being driven solely by topological or performance-related factors.

4.5. Complete ILP Formulation

The model seeks a joint deployment of controllers and assignment of switches that simultaneously ensures resilience through redundant assignments, favors topologically central controller locations, penalizes distant switch-controller pairs, and respects both the controller capacity and deployment budget. The complete integer linear program is as follows:
max x , y α s S l L x l s + β l L deg ( l ) y l + γ l L κ ( l ) y l δ s S l L d ( s , l ) x l s s . t . R min l L x l s R max , s S , x l s y l , s S , l L , s S x l s C max , l L , l L y l N max c t r l , x l s , y l { 0 , 1 } .

4.6. Scalability Challenges and Heuristic Design Rationale

The joint placement and assignment problem formulated above belongs to the class of facility location problems, which is known to be NP-hard in general [32]. The additional resilience constraints, requiring each switch to be assigned to at least R min controllers, further expand the combinatorial search space, making exact ILP solving tractable only for small-to-medium networks within reasonable time budgets. State-of-the-art solvers such as Gurobi or CPLEX are retained here as an exact benchmark, but as shown in Section 6, their runtime grows sharply with network size, rendering them unsuitable for large-scale or time-sensitive deployments.
In dynamic environments such as LEO satellite constellations, vehicular networks, and large-scale IoT infrastructures, reconfiguration must complete within the network’s coherence time, the interval during which current topology and load conditions remain operationally valid, making fast solution methods an operational requirement rather than a convenience.
The two-phase heuristic design proposed in Section 5 is a deliberate architectural choice grounded in the structure of the problem itself. The joint placement and assignment problem admits a natural decomposition: identifying where controllers should be placed is structurally separable from determining how switches should be assigned to those controllers, even though the two decisions remain coupled in the objective function.
This separability motivates a first phase that exploits graph-structural metrics, namely node degree and k-core decomposition, to identify a compact set of candidate controller locations at polynomial cost. These metrics serve as computationally efficient proxies for the placement component of the ILP objective. High-degree nodes are structurally central and tend to reduce the average hop distance, while k-core decomposition identifies the densely connected backbone of the network, where controller placement improves both coverage and resilience.
The second phase then solves the assignment problem over this reduced candidate set using a proximity-driven greedy algorithm that enforces redundancy and capacity constraints by construction.
Beyond computational efficiency, this design offers two practical advantages over black-box metaheuristics such as simulated annealing or genetic algorithms. Firstly, it is interpretable: an operator can directly understand why a controller is placed at a given node based on its graph-structural role. Secondly, it is incrementally re-executable, meaning that a topology change or failure event triggers only a local reassignment rather than a complete re-optimization from scratch.
Reinforcement learning (RL) and multi-agent reinforcement learning (MARL) have demonstrated promising results in dynamic network management [24,25,28], and MARL in particular offers low inference latency once trained. However, several practical considerations motivate the heuristic approach adopted in this work. First, RL-based methods typically require a large and representative training corpus covering diverse topology instances and operating conditions. In environments involving arbitrary or evolving topologies, a pre-trained policy may not generalize effectively without additional retraining, introducing a deployment overhead that the proposed framework avoids. Second, RL policies do not generally provide explicit optimality guarantees with respect to the underlying ILP objective. In contrast, the proposed two-phase heuristic can be directly benchmarked against the exact ILP solution through the optimality gap metric, enabling a transparent assessment of solution quality. Third, interpretability remains an important consideration in control plane design. Placement decisions derived from graph-structural metrics such as degree centrality or k-core decomposition are readily explainable, whereas the internal decision process of a learned policy is typically less transparent. Investigating RL-based adaptive reconfiguration as a complement to the proposed framework, particularly in highly dynamic environments, constitutes an interesting direction for future work.

5. Heuristic Controller Placement and Switch Assignment

The proposed framework addresses the joint controller placement and switch assignment problem using a two-phase heuristic procedure. In the first phase, a restricted set of candidate controller locations L V is identified by exploiting the topological structure of the network. In the second phase, switches are assigned to the selected controllers according to QoS-driven criteria. Proximity is a proxy for propagation delay and resilience requirements, whereby each switch must be assigned to at least R min controllers to ensure fault tolerance in the control plane. This separation between placement and assignment allows the two phases to be analyzed independently, while their combination yields a complete, computationally tractable solution whose quality is benchmarked against the exact ILP optimum via the gap metric (17).
The proposed heuristics exploit graph-structural properties such as node degree and k-core centrality, together with proximity and coverage-driven criteria, to construct computationally efficient approximations of the underlying multi-objective placement problem. This design ensures that the heuristics are not solving a fundamentally different optimization problem, but rather providing scalable surrogate solutions that preserve the trade-off between resilience, topological centrality, and communication distance. Consequently, the comparison with the ILP is performed under identical modeling assumptions, allowing us to quantify how closely lightweight topology-aware methods approximate the exact solution under scalability constraints.

5.1. Controller Candidate Selection

5.1.1. Motivation

Solving the ILP directly over the full node set V becomes computationally expensive for medium- and large-scale networks, as established in Section 4. To address this limitation, we first introduce Algorithm 1, which defines a candidate preselection phase in which each strategy constructs a restricted set L V such that | L | = N max c t r l . Five strategies are considered, each exploiting a different structural property of the network, namely Core-Only, Degree-Based, Hybrid Centrality, Distance-Sum, and Greedy Coverage, which are used throughout the paper under these names for simplicity.

5.1.2. Core-Only

Nodes with a high core number κ ( l ) belong to dense and structurally central regions of the network, making them natural controller candidates. The selection starts from the deepest core level κ start [ κ min , κ max ] and accumulates candidates by descending through the core levels until N max c t r l nodes are collected:
L L { l V κ ( l ) k } , for   k = κ start , κ start 1 , , 1
until | L | N max c t r l or k < 1 . By default, κ start = κ max , so the selection begins from the innermost core and expands outward only as needed. If | L | > N max c t r l after accumulation, the set is trimmed by retaining the top- N max c t r l nodes ranked by descending degree, with the node index used as a deterministic tie breaker:
L = Top - N max c t r l deg ( l ) , id ( l ) l L .

5.1.3. Degree-Based

This strategy prioritizes locally well-connected nodes, under the assumption that high-degree nodes can reach more switches with fewer hops. Nodes are ranked by degree in descending order, and the top min ( N max c t r l , | V | ) nodes are selected as follows:  
L = Top - min ( N max c t r l , | V | ) deg ( l ) l V .
This formulation ensures that the selection remains valid when the network contains fewer nodes than the required number of controllers.

5.1.4. Hybrid Centrality

To combine local and global structural importance, a weighted composite score is defined as
ϕ ( l ) = ω κ κ ( l ) + ω d C d ( l ) + ω b b ( l ) + ω c c ( l ) ,
where C d ( l ) , b ( l ) , and c ( l ) denote the degree, betweenness [33], and closeness centrality of node l respectively, and ω κ + ω d + ω b + ω c = 1 . Note that the hybrid score uses degree centrality C d ( l ) = deg ( l ) / ( | V | 1 ) , a normalized quantity, rather than a raw degree, ensuring that all four components are comparable in scale before weighting.
The candidate set selects the top- min ( N max c t r l , | V | ) nodes ranked by ϕ ( l ) in descending order as follows:
L = Top - min ( N max c t r l , | V | ) ϕ ( l ) l V .

5.1.5. Distance-Sum

A controller placed at a node with a small total distance from all other nodes minimizes the average control-plane latency. For each candidate node l V , the distance-sum score is computed using single-source shortest-path lengths as follows:
σ ( l ) = u V d ( l , u ) ,
where d ( l , u ) is the shortest-path distance between l and u in G. Nodes are then ranked by σ ( l ) in ascending order, and the top- N max c t r l nodes with the smallest scores are selected:
L = Top - N max c t r l argmin σ ( l ) l V .

5.1.6. Greedy Coverage

This strategy iteratively selects the node that covers the largest number of uncovered nodes within a maximum distance threshold d 0 , which is interpretable as the maximum acceptable propagation delay. For each candidate node l V , the coverage gain over the current uncovered set U is
gain ( l ) = { u U d ( u , l ) d 0 } .
Starting from U V and L , at each iteration, the node that maximizes the coverage gain is selected as
l * = arg max l V L gain ( l ) ,
and the uncovered set is updated accordingly:
L L { l * } , U U { u V d ( u , l * ) d 0 } .
The procedure terminates when | L | = N max c t r l or no further coverage gain is possible.
Algorithm 1 Candidate Controller Selection Strategies
  • Require: Graph G = ( V , E ) , desired number of candidate controllers N max c t r l , distance function d ( · , · ) , coverage threshold d 0
  • Ensure: Candidate controller set L V
  • Strategy 1: Core-Only
1:
Compute κ ( v ) for each v V            ▹k-core decomposition [30]
2:
k κ start ,     L                ▹ default: κ start = max v V κ ( v )
3:
while  k 1   and  | L | < N max c t r l   do
4:
     L L { v V κ ( v ) k }
5:
     k k 1
6:
end while
7:
if  | L | > N max c t r l   then
8:
     L Top - N max c t r l nodes in L ranked by ( deg ( v ) , id ( v ) ) descending
9:
end if
  • Strategy 2: Degree-Based
10:
L Top - min ( N max c t r l , | V | ) nodes in V ranked by deg ( v ) descending
  • Strategy 3: Distance-Sum
11:
for each v V  do
12:
     σ ( v ) u V d ( v , u )             ▹ all-pairs hop count via BFS
13:
end for
14:
L Top - N max c t r l nodes in V ranked by σ ( v ) ascending
  • Strategy 4: Hybrid Centrality
15:
Compute C d ( v ) , b ( v ) [33], c ( v ) , κ ( v ) for each v V
16:
for each v V  do
17:
     ϕ ( v ) ω κ κ ( v ) + ω d C d ( v ) + ω b b ( v ) + ω c c ( v )     ▹ ω κ + ω d + ω b + ω c = 1
18:
end for
19:
L Top - min ( N max c t r l , | V | ) nodes in V ranked by ϕ ( v ) descending
  • Strategy 5: Greedy Coverage
20:
U V ,     L
21:
while  | L | < N max c t r l do
22:
     * arg max V L { u U d ( u , ) d 0 }
23:
    if no improvement possible then
24:
        break
25:
    end if
26:
     L L { * }
27:
     U U { u V d ( u , * ) d 0 }
28:
end while

5.2. Complexity of Candidate Selection

Let n = | V | and m = | E | denote the number of nodes and edges of G respectively, and let N max c t r l denote the desired number of candidate controllers.

5.2.1. Core-Only: Complexity Analysis

The k-core decomposition is computed in O ( n + m ) using the peeling algorithm [30]. The accumulation loop iterates over at most κ max core levels, where κ max n , and each iteration scans all nodes, yielding O ( κ max · n ) in the worst case. In practice, κ max n for sparse networks; therefore, the dominant cost is reduced to O ( n + m ) . The final trimming step ranks the nodes by ( deg ( v ) , id ( v ) ) and costs O ( n log n ) . Overall: O ( n log n + m ) .

5.2.2. Degree-Based: Complexity Analysis

The node degrees are computed in O ( m ) by scanning the edge list. Sorting min ( N max c t r l , | V | ) nodes by deg ( v ) costs O ( n log n ) . Overall: O ( n log n + m ) .

5.2.3. Hybrid Centrality: Complexity Analysis

The hybrid score ϕ ( v ) aggregates four centrality measures. Degree centrality C d ( v ) costs O ( n + m ) . Betweenness centrality b ( v ) requires O ( n m + n 2 log n ) using Brandes’ algorithm [33]. Closeness centrality c ( v ) requires one BFS per node, costing O ( n ( n + m ) ) . k-core decomposition costs O ( n + m ) . Score aggregation and sorting of min ( N max c t r l , | V | ) nodes cost O ( n log n ) . The bottleneck is betweenness centrality. Overall: O ( n m + n 2 log n ) .

5.2.4. Distance-Sum: Complexity Analysis

Computing σ ( v ) = u V d ( v , u ) for a single node requires a Breadth-First Search (BFS) in O ( n + m ) for unweighted graphs, where d ( v , u ) is the hop count. Repeating over all n nodes yields O ( n ( n + m ) ) . Sorting the resulting scores results in a cost O ( n log n ) , which is dominated. Overall: O ( n ( n + m ) ) .

5.2.5. Greedy Coverage: Complexity Analysis

At each of the N max c t r l iterations, the algorithm evaluates every remaining candidate V L and counts how many uncovered nodes u U satisfy d ( u , ) d 0 , where d 0 = 1 hop in the experiments. Assuming d ( · , · ) is precomputed at cost O ( n ( n + m ) ) via BFS from all nodes, each coverage gain evaluation costs O ( n ) , and each iteration scans at most n candidates. Therefore, the greedy loop costs O ( N max c t r l · n 2 ) . The early termination condition, triggered when no further coverage gain is possible, may reduce the average number of iterations to below N max c t r l in practice. Overall (excluding pre-computation): O ( N max c t r l · n 2 ) .

5.2.6. Overall Complexity of Candidate Selection

Table 2 summarizes the time complexity of each selection strategy. The overall heuristic pipeline, selection followed by assignment, is dominated by the betweenness centrality computation in Strategy 4 ( O ( n m + n 2 log n ) ), or by the all-pairs BFS required for Strategies 3 and 5 and the assignment phase ( O ( n ( n + m ) ) ), both of which are computed once and shared across all downstream steps.

5.3. Switch Assignment Algorithm

The switch-to-controller assignment heuristic, detailed in Algorithm 2, operates in two successive steps. Given the candidate controller set L V produced by Algorithm 1, each switch s S must be assigned to at least R min and at most R max controllers, whereas no controller can serve more than C max switches. A global feasibility check is first performed to verify that the available controller capacity is sufficient to satisfy the minimum redundancy requirement across all switches.
In the first step, each switch greedily selects its R min closest available controllers, ensuring that the minimum redundancy constraint is satisfied for every switch.
In the second step, additional controllers are assigned up to R max , but only if the candidate controller falls within the objective-derived distance threshold:
d max = α δ ,
where α and δ are the objective weights defined in (12), associated with assignment coverage and distance penalty, respectively. Beyond d max , an additional assignment would incur a distance cost that outweighs the marginal coverage benefit, yielding a net negative contribution to the objective.
This two-phase structure separates feasibility from quality: the first step guarantees a valid assignment, whereas the second refines it under proximity and capacity constraints.
Algorithm 2 Heuristic Switch-to-Controller Assignment
  • Require: Candidate controller set L V , set of switches S, distance function d ( · , · ) , parameters R min , R max , C max , α , δ
  • Ensure: Assignment map A : S 2 L
  • Global Feasibility Check
1:
if  | S | · R min > | L | · C max   then
2:
    raise Infeasibility error
3:
end if
4:
A ( s )   for all  s S
5:
load ( ) 0   for all  L
  • Step 1: Minimum Redundancy Assignment ( R min )
6:
for each s S  do
7:
    Sort L by d ( s , ) ascending L s
8:
    for each L s  do
9:
        if  load ( ) < C max  then
10:
            A ( s ) A ( s ) { }
11:
            load ( ) load ( ) + 1
12:
        end if
13:
        if  | A ( s ) | = R min  then
14:
           break
15:
        end if
16:
    end for
17:
    if  | A ( s ) | < R min  then
18:
        raise Infeasibility error for switch s
19:
    end if
20:
end for
  • Step 2: Extended Assignment up to R max
21:
d max α / δ
22:
for each s S  do
23:
    while  | A ( s ) | < R max  do
24:
         C { L load ( ) < C max , A ( s ) }
25:
        if  C =  then
26:
           break
27:
        end if
28:
         * arg min C d ( s , )
29:
        if  d ( s , * ) < d max  then
30:
            A ( s ) A ( s ) { * }
31:
            load ( * ) load ( * ) + 1
32:
        else
33:
           break
34:
        end if
35:
    end while
36:
end for
37:
return  A

5.4. Complexity Analysis of the Switch Assignment Heuristic

Let n = | S | denote the number of switches and p = | L | be the number of placed controllers, with p n in practice.

5.4.1. Global Feasibility Check

The condition | S | · R min > | L | · C max is evaluated in O ( 1 ) .

5.4.2. Step 1: Minimum Redundancy Assignment

For each switch s S , the algorithm sorts the p controllers by d ( s , ) , costing O ( p log p ) , then scans the sorted list until R min controllers are assigned. Because R min p , the scan cost is at most O ( p ) . Over all n switches:
O ( n · p log p ) .

5.4.3. Step 2: Extended Assignment

For each switch s S , the while loop runs for at most R max R min iterations. At each iteration, the candidate set C is built by filtering L in O ( p ) , and arg min is computed in O ( p ) . Each iteration therefore costs O ( p ) . Over all n switches:
O ( n · ( R max R min ) · p ) .
Since R max R min p in the worst case, this is bounded by O ( n · p 2 ) , but in practice R max R min p , reducing it to O ( n p ) .

5.4.4. Overall Complexity

Step 1 dominates Step 2 when ( R max R min ) p log p , which holds in all practical configurations, where the redundancy bounds are small relative to the number of controllers.
O ( n · p log p ) .
If d ( · , · ) is precomputed via all-pairs BFS at cost O ( n ( n + m ) ) , this constitutes the true bottleneck for sparse graphs where p log p n + m .
The complexity of each step of the switch assignment heuristic is summarized in Table 3.

5.5. Overall Complexity of the Heuristic Framework

The overall two-phase heuristic, candidate selection followed by switch assignment, is dominated by the betweenness centrality computation in the hybrid strategy ( O ( n m + n 2 log n ) ), or by the all-pairs BFS required for Distance-Sum, Greedy Coverage, and assignment phases ( O ( n ( n + m ) ) ), whichever applies. Both are computed once and are shared across all downstream steps; therefore, the marginal cost of the assignment phase remains O ( n p log p ) .

6. Numerical Results

The aim of the studies presented in this section is to assess the performance of the proposed heuristics relative to the exact ILP solution, using the optimality gap defined in (17) as the primary performance indicator. Each heuristic is named according to its controller placement strategy, whereas the assignment algorithm (Algorithm 2) is kept identical across all methods. Optimal solutions are obtained using the Gurobi solver. All numerical experiments are executed on a workstation equipped with a 13th Gen Intel® Core™ i7-1355U × 12 processor and 16 GB of RAM.
The primary metric for solution quality is the optimality gap, defined as:
Δ ( % ) = z ILP z heuristic z ILP × 100 ,
where z ILP is the optimal ILP objective value and z heuristic is the value achieved by the heuristic. The four performance metrics evaluated across all studies are the optimality gap (17), solver time, average switch-to-controller path length, and average number of assignments per switch.
All experiments share a common set of fixed parameters, as summarized in Table 4. The study-specific parameters, namely the controller capacity C max and the maximum number of controllers N max c t r l , are varied independently in each subsection and described therein.

6.1. Placement Philosophy: Coverage vs. Centrality

Before presenting the quantitative results, we illustrate the qualitative differences in placement behavior between the coverage-based strategy and the four centrality-based strategies. Figure 2 shows the controller placement produced by each heuristic on a representative random network instance, compared side by side with the exact ILP solution. In each row, the left panel shows the exact placement as a reference, and the right panel shows the heuristic placement. The red nodes are selected by both the exact and heuristic solutions, the green nodes are selected by the exact solution only, the orange nodes are selected by the heuristic only, and the blue nodes are regular switches.
Two distinct placement behaviors were observed. The core-only, degree-based, hybrid, and distance-sum strategies concentrate controllers in the densely connected central region of the network, producing placements that substantially overlap with the exact solution, as evidenced by the large proportion of red nodes. In contrast, the greedy coverage strategy scatters controllers across the periphery of the network, with very few nodes in common with the exact placement. This behavior is a direct consequence of the coverage objective: by greedily maximizing the number of one-hop neighbors covered at each step, the algorithm favors nodes that are spatially spread out rather than structurally central because peripheral nodes tend to cover disjoint neighborhoods.
Although this strategy may be appropriate in scenarios where geographic coverage is the primary concern, it is poorly suited to the joint placement and assignment objective defined in (12), which rewards topological centrality, proximity, and redundancy simultaneously. Therefore, the coverage strategy is retained in the comparison for completeness and to validate this design insight but is excluded from the main axis scaling of the performance figures that follow, where its outlier behavior would obscure the differences among the remaining strategies.

6.2. Heuristic vs. Exact Controller Placement

In this study, we compare the proposed heuristics with an exact ILP solution. To obtain statistically robust results, five independent instances are generated for each network type, each with 100 nodes and comparable number of edges. All metrics are reported as averages over these five instances. Representative examples are shown in Figure 3 and Figure 4.
Two network topologies are considered:
  • Random network: Generated using the Erdős–Rényi model, with approximately 295 edges and three k-core levels ( κ { 1 , 3 , 4 } ), yielding a relatively flat structure (Figure 3).
  • Multi-core network: Generated using a custom procedure designed to maximize hierarchical depth, with approximately 298 edges and seven k-core levels ( κ { 1 , , 7 } ), providing a markedly more hierarchical structure (Figure 4).
For all experiments in this subsection, the number of controllers is fixed at N max c t r l = 16 , with the controller capacity C max varying from 15 to 30. These values guarantee the feasibility with respect to the minimum assignment constraint ( R min = 2 ) and remain below the maximum value ( R max = 4 ). For 100 switches, the chosen C max values correspond to a total assignment capacity ranging from approximately 240 to 480, comfortably covering the expected 200–400 assignments.
Figure 5 reports the optimality gap of each heuristic as a function of C max . Overall, all heuristics produce near-optimal solutions, with gaps decreasing as C max increases. At C max = 15 , corresponding to a controller load slightly above the minimum feasibility threshold, multi-core networks already exhibit substantially lower gaps (6.7–8.5%) than random networks.
On random networks, a clear structure sensitivity effect is observed when increasing controller capacity. They exhibit a stronger relative improvement as C max increases, with gaps decreasing rapidly and reaching approximately 2.4% for the hybrid, core-only, and degree-based heuristics at C max = 30 , while the distance-sum heuristic remains an outlier. Note that the core-only and degree-based curves overlap exactly in Figure 5, making the core-only curve visually indistinguishable. This performance becomes comparable to the best result observed in multi-core networks.
In contrast, multi-core networks maintain consistently low gaps across all capacity regimes, indicating that their hierarchical structure already guides the heuristics toward near-optimal solutions even under tighter constraints. Among all methods, the degree-based strategy consistently achieves the lowest gaps across all values of C max , reaching 1.9% at C max = 30 . This can be attributed to the alignment between node-level structural properties and the hierarchical topology of multi-core networks, which naturally favors high-quality placements.
Figure 6 shows the solver times as a function of C max . All heuristics complete in less than 1 ms, which is several orders of magnitude faster than the exact ILP, which requires approximately 140 ms on average and exceeds 3 s at C max = 15 on the multi-core network, the configuration with the highest optimality gap, confirming that combinatorial difficulty correlates with both solution quality and solver effort. These results demonstrate that the heuristics achieve near-optimal solutions at a fraction of the computational cost.
Figure 7 shows the average switch-to-controller path length, which serves as a proxy for control-plane latency (Section 3). Multi-core networks consistently exhibit slightly longer average paths than random networks across all heuristics, which is a direct consequence of their hierarchical structure forcing switches in peripheral layers to reach controllers over longer paths. The exact solution yields the shortest paths on random networks, confirming its optimality. In multi-core networks, the exact solution produces slightly longer paths than in random networks, comparable to heuristics, reflecting the inherent topological cost of the hierarchical structure rather than any suboptimality.
Figure 8 reports the average number of controller assignments per switch, which directly reflects the resilience level achieved by each method. Multi-core networks consistently achieve higher average assignments than random networks, and the degree-based heuristic produces assignment levels closest to the exact solution across both topologies. Although R min = 2 and R max = 4 , the observed averages remain in the range 2.2–2.6, indicating that capacity constraints prevent full redundancy for all switches simultaneously. This is consistent with the assignment algorithm’s design: Algorithm 2 prioritizes feasibility and capacity balance in Step 1, and only extends assignments in Step 2 when proximity and capacity conditions are jointly satisfied.

6.3. Effect of the Number of Controllers N max c t r l

This study analyzes the impact of the maximum number of controllers N max ctrl on the four performance metrics. The network size is fixed at 100 nodes, while C max is varied such that N max ctrl × C max = 400 , ensuring a constant total assignment budget of 400, corresponding to the maximum number of assignments R max per switch.
Figure 9 shows the optimality gap as a function of N max ctrl . The gap exhibits a clear non-monotonic behavior, remaining low at both extremes of the controller budget and peaking at intermediate values. At N max ctrl = 2 , all strategies achieve gaps below 10% on both topologies, reflecting a strongly constrained regime in which limited placement options reduce the divergence between heuristic and ILP solutions. At higher budgets, N max ctrl { 8 , 10 , 16 } , gaps decrease again and remain mostly below 10%, as the increased redundancy in controller placement reduces the sensitivity of the objective to exact placement decisions. Between these regimes, gaps increase significantly. In random networks, this increase is highly pronounced, with core-only and degree-based methods peaking near 48% at N max ctrl = 4 , while distance-sum reaches approximately 74%, indicating a strong sensitivity of the heuristic to intermediate controller budgets. In multi-core networks, the same non-monotonic pattern is observed but with a lower amplitude, where core-only peaks above 33% at N max ctrl = 4 , and degree-based remains the most robust strategy at approximately 15%.
This pattern can be interpreted through three interaction regimes between topology and combinatorial complexity. At low controller budgets, the solution space is highly constrained, leading to similar decisions across heuristic and exact approaches. At intermediate budgets, placement decisions have maximal marginal impact: the problem is neither over- nor under-constrained, and locally driven heuristics are more likely to diverge from the global optimum identified by the ILP. At high budgets, the system becomes increasingly insensitive to the exact controller locations, which reduces performance differences between strategies.
Two additional structural observations emerge. Firstly, core-only and degree-based approaches yield identical gap values in random networks across all controller budgets, confirming that degree and coreness rankings largely coincide in this topology class. In multi-core networks, the two strategies diverge, with degree-based consistently achieving lower gaps, suggesting that local connectivity is a slightly more informative criterion than global coreness in hierarchical structures. Secondly, multi-core networks exhibit systematically higher and sharper gap peaks than random networks, highlighting that structured topologies amplify the difficulty of the intermediate regime where placement decisions have the strongest influence on performance.
Figure 10 shows that all heuristics are complete in under 1 ms across all values of N max c t r l , with the exception of multi-core networks at N max c t r l = 5 , which corresponds to the peak combinatorial difficulty identified above. The exact ILP exhibits a similar peak in solver time for the same configuration, corroborating the interpretation that the intermediate controller counts induce the most difficult instances.
Figure 11 shows that the average path length decreases monotonically with N max c t r l , consistently across all heuristics and for both network types. This trend is intuitive: a larger number of deployed controllers reduces the average topological distance between any switch and its nearest controller, thereby directly improving control-plane latency.
Figure 12 shows that the average number of assignments per switch increases monotonically with N max c t r l . A larger controller pool relaxes the capacity constraints, enabling Algorithm 2 to extend the assignments further in Step 2, thereby increasing the resilience. This confirms that N max c t r l is a key design parameter governing the trade-off between deployment cost and fault tolerance.

6.4. Scalability Analysis

This study evaluates the behavior of the heuristics as the network size | V | increases from 50 to 500 nodes, with the exact ILP serving as the benchmark where tractable. To ensure structural consistency across scales, the number of k-core hierarchy levels grows logarithmically with | V | as follows:
levels = 5 + log 2 ( | V | ) ,
subject to the feasibility condition
= 1 levels ( + 1 ) | V | .
Edge density is controlled by scaling the edge factor as
edge _ factor = base _ factor · log ( | V | ) log ( n ref ) ,
where n ref = 100 and base _ factor = 0.6 , preserving a comparable sparsity across scales while maintaining the hierarchical k-core structure. The controller capacity is fixed at C max = 50 and N max c t r l is scaled proportionally to | V | to maintain a constant controller density. Ten independent instances per network type are generated for each scale to ensure statistical robustness.
Figure 13 shows the evolution of the optimality gap with network size. Across both topology types and all strategies, gaps remain generally below 20%, confirming that the proposed heuristics achieve near-optimal solutions at a fraction of the computational cost of the exact ILP. In random networks, core-only and degree-based strategies remain below 8%, hybrid reaches approximately 12%, and distance-sum stabilizes around 18% beyond 200 nodes. In multi-core networks, degree-based consistently achieves the lowest gap, remaining below 12% up to 500 nodes, while core-only, hybrid, and distance-sum remain in the 10–15% range at larger scales. Overall, both topology types exhibit similar gap profiles across most strategies, with the notable exception of distance-sum, which performs comparatively worse on random networks.
A closer examination of individual strategies reveals two structural phenomena. First, in random networks, core-only and degree-based strategies produce identical gap values at every scale, making their curves indistinguishable in the figure. This is a structural consequence rather than a numerical artifact: in random graphs, node degree and coreness become strongly correlated, leading to highly similar rankings of candidate nodes and therefore nearly identical placement decisions. The two metrics consequently carry largely redundant information in this topology class, causing both heuristics to converge toward the same controller placements. In multi-core networks, by contrast, the hierarchical structure introduces greater differentiation between local connectivity and global coreness, allowing the two strategies to select partially distinct candidate sets and resulting in progressively different gaps at larger scales.
Second, the optimality gap in multi-core networks exhibits a steeper increase at 500 nodes than in random networks. This behavior can be explained by the expansion of structurally prominent candidate nodes in hierarchical topologies, which enlarges the decision space available to the ILP. The heuristic, however, remains guided by local selection rules and therefore becomes less competitive as the candidate space grows. In random networks, the absence of strong hierarchical organization limits this expansion, resulting in a more stable relative performance. Overall, these observations indicate that the optimality gap depends not only on network size but also on the interaction between topological structure and locality-driven decision rules.
Figure 14 shows that all heuristics remain highly efficient, with execution times below 10 ms even at 500 nodes, whereas the exact ILP exceeds 10 s for the largest instances and becomes computationally prohibitive beyond 200–300 nodes.
In a static planning context, where controller placement is decided once at the network design time, a solver time of a few seconds may be acceptable, and the computational advantage of heuristics is less critical. However, solver time is a decisive factor in three practically relevant scenarios. Firstly, in failure-driven reconfiguration: when a controller becomes unavailable and placement must be recomputed to restore the required redundancy level R min , the reconfiguration latency directly affects service continuity and the duration of the resilience degradation window. Secondly, in dynamically evolving networks such as data center fabrics or campus SDN infrastructures, topology changes such as node additions, link failures, or traffic-driven reconfigurations occur frequently and trigger repeated re-optimization. Thirdly, in large-scale planning studies requiring the evaluation of many configurations, such as sensitivity analyses over objective weights ( α , β , γ , δ ) or topology parameters, the ILP must be solved hundreds or thousands of times, making its per-instance cost prohibitive even when a single run is acceptable. In all three scenarios, the proposed heuristics provide a tractable alternative that scales gracefully with the network size while maintaining near-optimal solution quality.
Figure 15 presents the average path length as a function of the network size. In multi-core networks, the path length increases with | V | , reflecting a deeper hierarchical structure: as the number of k-core levels increases, switches in peripheral layers must traverse progressively longer paths to reach controllers in the inner core. This growth in path length also contributes to the widening optimality gap observed in Figure 13 because heuristics place controllers further from the optimal positions at larger scales. In contrast, random networks exhibit a stable average path length of approximately 1.6 regardless of | V | , which is consistent with their flat k-core structure and the small-world property of Erdős–Rényi graphs at the considered density.
Figure 16 shows that multi-core networks achieve higher average assignments overall, but exhibit a slight decreasing trend with | V | . As the network size and path lengths increase, additional assignments become less beneficial relative to their distance cost, and the threshold d max (13) increasingly rejects distant candidates in Step 2 of Algorithm 2. Random networks maintain a nearly constant assignment level close to R min = 2 , reflecting their limited structural depth and the correspondingly low incentive for additional redundancy under Objective (12).

7. Conclusions and Future Work

This paper proposed a joint optimization framework for controller placement and switch assignment in resilient SDNs, formulated as a multi-objective ILP integrating topological centrality, latency, load balancing, and redundancy constraints. A two-phase heuristic framework was developed to overcome the computational limitations of exact optimization, combining topology-aware candidate selection with a capacity-driven assignment algorithm that guarantees per-switch redundancy by construction.
Experiments on random and hierarchical multi-core networks demonstrate near-optimal solutions with optimality gaps below 10%, runtimes under 10 ms enabling near real-time reconfiguration, and confirm that hierarchical k-core topologies significantly improve heuristic solution quality by reducing placement symmetry. The degree-based strategy consistently achieves the best trade-off between quality and simplicity.
Future work includes evaluating a heuristic-restricted ILP mode, extending the framework to dynamic environments with time-varying traffic particularly for IoT, VANETs, and satellite networks, and incorporating heterogeneous link capacities for broader practical applicability.

Author Contributions

Methodology: S.M.R. and Y.B.; formal analysis and investigation: S.M.R. and Y.B.; writing—original draft preparation: S.M.R. and Y.B.; writing—review and editing: S.M.R., Y.B. and F.Z.; supervision: Y.B. and F.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. McKeown, N.; Anderson, T.; Balakrishnan, H.; Parulkar, G.; Peterson, L.; Rexford, J.; Shenker, S.; Turner, J. OpenFlow: Enabling innovation in campus networks. ACM SIGCOMM Comput. Commun. Rev. 2008, 38, 69–74. [Google Scholar] [CrossRef]
  2. Kreutz, D.; Ramos, F.M.; Verissimo, P.E.; Rothenberg, C.E.; Azodolmolky, S.; Uhlig, S. Software-defined networking: A comprehensive survey. Proc. IEEE 2014, 103, 14–76. [Google Scholar] [CrossRef]
  3. Heller, B.; Sherwood, R.; McKeown, N. The controller placement problem. ACM SIGCOMM Comput. Commun. Rev. 2012, 42, 473–478. [Google Scholar] [CrossRef]
  4. Sallahi, A.; St-Hilaire, M. Optimal model for the controller placement problem in software defined networks. IEEE Commun. Lett. 2014, 19, 30–33. [Google Scholar] [CrossRef]
  5. Lange, S.; Gebert, S.; Zinner, T.; Tran-Gia, P.; Hock, D.; Jarschel, M.; Hoffmann, M. Heuristic approaches to the controller placement problem in large scale SDN networks. IEEE Trans. Netw. Serv. Manag. 2015, 12, 4–17. [Google Scholar] [CrossRef]
  6. Tanha, M.; Sajjadi, D.; Ruby, R.; Pan, J. Capacity-aware and delay-guaranteed resilient controller placement for software-defined WANs. IEEE Trans. Netw. Serv. Manag. 2018, 15, 991–1005. [Google Scholar] [CrossRef]
  7. Mojez, H.; Kamel, H.; Zanjani, R.; Bidgoli, A.M. Controller placement issue in software-defined networks with different goals: A comprehensive survey. J. Supercomput. 2024, 80, 19127–19209. [Google Scholar] [CrossRef]
  8. Hu, Y.; Wang, W.; Gong, X.; Que, X.; Cheng, S. On reliability-optimized controller placement for software-defined networks. China Commun. 2014, 11, 38–54. [Google Scholar] [CrossRef]
  9. Xing, L. Cascading failures in Internet of Things: Review and perspectives on reliability and resilience. IEEE Internet Things J. 2020, 8, 44–64. [Google Scholar] [CrossRef]
  10. Darwish, T.; Alhaj, T.A.; Elhaj, F.A. Controller placement in software defined emerging networks: A review and future directions. Telecommun. Syst. 2025, 88, 18. [Google Scholar] [CrossRef]
  11. Choi, I.; Kim, S.; Lee, Y.; Choi, J.P. A Scalable Multicontroller SDN Framework for LEO Mega-Constellation via Topology Virtualization. IEEE Internet Things J. 2025, 12, 33311–33327. [Google Scholar] [CrossRef]
  12. Chiti, F.; Morosi, S.; Pierucci, L. Multiple SDN Controllers Placement for Integrated Satellite/Terrestrial Network. IEEE Trans. Netw. Serv. Manag. 2026, 23, 3341–3350. [Google Scholar] [CrossRef]
  13. Dong, X.; Xue, L.; Zhang, Z.; Zhang, Y.; Li, T.; You, Z.; Shen, Y. Joint controller placement and control-service connection in hybrid-band control. IEEE Trans. Cloud Comput. 2023, 11, 3139–3152. [Google Scholar] [CrossRef]
  14. Rasool, S.; Boujelben, Y.; Zarai, F. Optimizing high availability multi-controller placement in SDN/NFV 5G networks: A survey. Indones. J. Electr. Eng. Comput. Sci. 2024, 34, 1800. [Google Scholar] [CrossRef]
  15. Isong, B.; Molose, R.R.S.; Abu-Mahfouz, A.M.; Dladlu, N. Comprehensive review of SDN controller placement strategies. IEEE Access 2020, 8, 170070–170092. [Google Scholar] [CrossRef]
  16. Bari, M.F.; Chowdhury, S.R.; Ahmed, R.; Boutaba, R. On orchestrating virtual network functions. In Proceedings of the 2015 11th International Conference on Network and Service Management (CNSM); IEEE: Piscataway, NJ, USA, 2015; pp. 50–56. [Google Scholar]
  17. Naseri, A.; Ahmadi, M.; PourKarimi, L. Placement of SDN controllers based on network setup cost and latency of control packets. Comput. Commun. 2023, 208, 15–28. [Google Scholar] [CrossRef]
  18. Ma, Y.; Chen, J.; Lv, W.; Qiu, X.; Zhang, Y.; Liu, W. An improved artificial bee colony algorithm to minimum propagation latency and balanced load for controller placement in Software Defined Network. Comput. Netw. 2024, 250, 110600. [Google Scholar] [CrossRef]
  19. Li, B.; Deng, X.; Chen, X.; Deng, Y.; Wan, S.; Zhang, H. DDPG-Based Load-Aware QoS Guaranteed SDN Controller Placement for Internet of Vehicles. IEEE Internet Things J. 2025, 12, 52575–52590. [Google Scholar] [CrossRef]
  20. Vizarreta, P.; Machuca, C.M.; Kellerer, W. Controller placement strategies for a resilient SDN control plane. In 2016 8th International Workshop on Resilient Networks Design and Modeling (RNDM); IEEE: Piscataway, NJ, USA, 2016; pp. 253–259. [Google Scholar] [CrossRef]
  21. Perrot, N.; Reynaud, T. Optimal placement of controllers in a resilient SDN architecture. In 2016 12th International Conference on the Design of Reliable Communication Networks (DRCN); IEEE: Piscataway, NJ, USA, 2016; pp. 145–151. [Google Scholar] [CrossRef]
  22. Kumari, A.; Sairam, A.S. A survey of controller placement problem in software defined networks. arXiv 2019, arXiv:1905.04649. [Google Scholar] [CrossRef]
  23. Mogyorósi, F.; Babarczi, P.; Zerwas, J.; Blenk, A.; Pašić, A. Resilient Control Plane Design for Virtualized 6G Core Networks. IEEE Trans. Netw. Serv. Manag. 2022, 19, 2453–2467. [Google Scholar] [CrossRef]
  24. Bouzidi, E.H.; Outtagarts, A.; Langar, R.; Boutaba, R. Dynamic clustering of software defined network switches and controller placement using deep reinforcement learning. Comput. Netw. 2022, 207, 108852. [Google Scholar] [CrossRef]
  25. Kumari, A.; Roy, A.; Singh Sairam, A. Optimizing SDN Controller Load Balancing Using Online Reinforcement Learning. IEEE Access 2024, 12, 131591–131604. [Google Scholar] [CrossRef]
  26. Liao, L.; Leung, V.C.M. Genetic algorithms with particle swarm optimization based mutation for distributed controller placement in SDNs. In 2017 IEEE Conference on Network Function Virtualization and Software Defined Networks (NFV-SDN); IEEE: Piscataway, NJ, USA, 2017; pp. 1–6. [Google Scholar] [CrossRef]
  27. Radam, N.S.; Al-Janabi, S.T.F.; Jasim, K.S. Multi-controllers placement optimization in SDN by the hybrid HSA-PSO algorithm. Computers 2022, 11, 111. [Google Scholar] [CrossRef]
  28. Sirjani, M.S.; Maleki, A.; Pakmehr, A.; Abedini Bagha, M.; Ghaffari, A.; Pour Haji Kazem, A.A. Controller placement in software-defined networks using reinforcement learning and metaheuristics. Clust. Comput. 2025, 28, 660. [Google Scholar] [CrossRef]
  29. Kumar, S.; Malik, A. Optimized Controller Placement and Load Balancing in SDN-Enabled IoT Networks. Microsyst. Technol. 2026, 32, 47. [Google Scholar] [CrossRef]
  30. Batagelj, V.; Zaversnik, M. An O(m) Algorithm for Cores Decomposition of Networks. arXiv 2003, arXiv:cs/0310049v1. [Google Scholar] [CrossRef]
  31. Zachary, W.W. An Information Flow Model for Conflict and Fission in Small Groups. J. Anthropol. Res. 1977, 33, 452–473. [Google Scholar] [CrossRef]
  32. Garey, M.R.; Johnson, D.S. Computers and Intractability: A Guide to the Theory of NP-Completeness; W. H. Freeman: New York, NY, USA, 1979. [Google Scholar]
  33. Brandes, U. A Faster Algorithm for Betweenness Centrality. J. Math. Sociol. 2001, 25, 163–177. [Google Scholar] [CrossRef]
Figure 1. k-core decomposition of the Karate Club network [31] across four levels ( κ = 1 , 2 , 3 , 4 ).
Figure 1. k-core decomposition of the Karate Club network [31] across four levels ( κ = 1 , 2 , 3 , 4 ).
Futureinternet 18 00311 g001
Figure 2. Controller placement comparison on a representative random network instance. Each row corresponds to one heuristic strategy; left panel shows the exact ILP placement (reference), right panel shows the heuristic placement. Red: controllers common to both; green: exact only; orange: heuristic only; blue: switches. Centrality-based strategies (rows 2–5) concentrate controllers in the dense core of the network, closely matching the exact solution. The coverage strategy (row 1) scatters controllers across the periphery, resulting in minimal overlap with the exact placement.
Figure 2. Controller placement comparison on a representative random network instance. Each row corresponds to one heuristic strategy; left panel shows the exact ILP placement (reference), right panel shows the heuristic placement. Red: controllers common to both; green: exact only; orange: heuristic only; blue: switches. Centrality-based strategies (rows 2–5) concentrate controllers in the dense core of the network, closely matching the exact solution. The coverage strategy (row 1) scatters controllers across the periphery, resulting in minimal overlap with the exact placement.
Futureinternet 18 00311 g002
Figure 3. Representative random network instance: 100 nodes, 295 edges.
Figure 3. Representative random network instance: 100 nodes, 295 edges.
Futureinternet 18 00311 g003
Figure 4. Representative multi-core network instance: 100 nodes, 298 edges.
Figure 4. Representative multi-core network instance: 100 nodes, 298 edges.
Futureinternet 18 00311 g004
Figure 5. Optimality gap vs. controller capacity C max .
Figure 5. Optimality gap vs. controller capacity C max .
Futureinternet 18 00311 g005
Figure 6. Solver time vs. controller capacity C max .
Figure 6. Solver time vs. controller capacity C max .
Futureinternet 18 00311 g006
Figure 7. Average switch-to-controller path length vs. C max .
Figure 7. Average switch-to-controller path length vs. C max .
Futureinternet 18 00311 g007
Figure 8. Average assignments per switch vs. C max .
Figure 8. Average assignments per switch vs. C max .
Futureinternet 18 00311 g008
Figure 9. Optimality gap vs. N max c t r l .
Figure 9. Optimality gap vs. N max c t r l .
Futureinternet 18 00311 g009
Figure 10. Solver time vs. N max c t r l .
Figure 10. Solver time vs. N max c t r l .
Futureinternet 18 00311 g010
Figure 11. Average path length vs. N max c t r l .
Figure 11. Average path length vs. N max c t r l .
Futureinternet 18 00311 g011
Figure 12. Average assignments per switch vs. N max c t r l .
Figure 12. Average assignments per switch vs. N max c t r l .
Futureinternet 18 00311 g012
Figure 13. Optimality gap vs. network size | V | .
Figure 13. Optimality gap vs. network size | V | .
Futureinternet 18 00311 g013
Figure 14. Solver time vs. network size | V | .
Figure 14. Solver time vs. network size | V | .
Futureinternet 18 00311 g014
Figure 15. Average switch-to-controller path length vs. | V | .
Figure 15. Average switch-to-controller path length vs. | V | .
Futureinternet 18 00311 g015
Figure 16. Average assignments per switch vs. | V | .
Figure 16. Average assignments per switch vs. | V | .
Futureinternet 18 00311 g016
Table 1. Key parameters of the controller placement and switch assignment model.
Table 1. Key parameters of the controller placement and switch assignment model.
ParameterDescription
R min Minimum number of controllers per switch (resilience requirement)
R max Maximum number of controllers per switch
C max Maximum number of switches managed by a single controller
N max c t r l Maximum number of controllers to deploy
d ( s , l ) Shortest-path distance between switch s and controller node l
deg ( l ) Degree of node l in graph G
κ ( l ) Core number of node l in G
α , β , γ , δ Non-negative weights controlling the relative importance of each objective component
Table 2. Worst-case computational complexity of the candidate selection strategies.
Table 2. Worst-case computational complexity of the candidate selection strategies.
StrategyComplexityBottleneck
Core-Only O ( n log n + m ) Sorting
Degree-Based O ( n log n + m ) Sorting
Distance-Sum O ( n ( n + m ) ) All-pairs BFS
Hybrid Centrality O ( n m + n 2 log n ) Betweenness centrality
Greedy Coverage O ( N max c t r l · n 2 ) Greedy selection loop
Table 3. Worst-case computational complexity of the switch assignment heuristic.
Table 3. Worst-case computational complexity of the switch assignment heuristic.
StepComplexityCondition
Feasibility check O ( 1 )
Step 1: Minimum redundancy O ( n p log p ) ( R max R min ) p log p
Step 2: Extended assignment O ( n ( R max R min ) p ) R max R min p
Overall O ( n p log p ) Precomputed d ( · , · )
Table 4. Fixed simulation parameters.
Table 4. Fixed simulation parameters.
ParameterValueDescription
Objective weights
α 15Assignment coverage and resilience weight
β 10Controller degree weight
γ 12Controller core number weight
δ 10Distance penalty weight
Redundancy constraints
R min 2Minimum controllers per switch
R max 4Maximum controllers per switch
Strategy-specific
ω κ , ω d , ω b , ω c 0.30 , 0.25 , 0.25 , 0.20 Hybrid centrality scoring weights
d 0 1 hopCoverage distance threshold
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Rasool, S.M.; Boujelben, Y.; Zarai, F. Topology-Aware Joint Control Plane Placement and Assignment for Resilient Hierarchical Cloud–Edge Networks. Future Internet 2026, 18, 311. https://doi.org/10.3390/fi18060311

AMA Style

Rasool SM, Boujelben Y, Zarai F. Topology-Aware Joint Control Plane Placement and Assignment for Resilient Hierarchical Cloud–Edge Networks. Future Internet. 2026; 18(6):311. https://doi.org/10.3390/fi18060311

Chicago/Turabian Style

Rasool, Samer Mohammed, Yassine Boujelben, and Faouzi Zarai. 2026. "Topology-Aware Joint Control Plane Placement and Assignment for Resilient Hierarchical Cloud–Edge Networks" Future Internet 18, no. 6: 311. https://doi.org/10.3390/fi18060311

APA Style

Rasool, S. M., Boujelben, Y., & Zarai, F. (2026). Topology-Aware Joint Control Plane Placement and Assignment for Resilient Hierarchical Cloud–Edge Networks. Future Internet, 18(6), 311. https://doi.org/10.3390/fi18060311

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop