A Three-Stage Approach for the Multi-Depot VRP with Priority Requests

Bouchbout, Yehya; Farou, Brahim; Molnár, Bálint; Benrazek, Ala-Eddine; Bouafia, Khawla; Seridi, Hamid

doi:10.3390/app16115188

Open AccessArticle

A Three-Stage Approach for the Multi-Depot VRP with Priority Requests

by

Yehya Bouchbout

^1,*

,

Brahim Farou

¹

,

Bálint Molnár

^2,*

,

Ala-Eddine Benrazek

^1,3

,

Khawla Bouafia

²

and

Hamid Seridi

¹

Labstic Laboratory, Department of Computer Science, Guelma University, Guelma 24000, Algeria

²

Information Systems Department, Faculty of Informatics, Eötvös Loránd University (ELTE), Pázmány Péter 1/C, 1117 Budapest, Hungary

³

CSAAIL Laboratory, Department of Computer Science, Ziane Achour University, Djelfa 17000, Algeria

^*

Authors to whom correspondence should be addressed.

Appl. Sci. 2026, 16(11), 5188; https://doi.org/10.3390/app16115188

Submission received: 8 April 2026 / Revised: 13 May 2026 / Accepted: 19 May 2026 / Published: 22 May 2026

(This article belongs to the Section Computing and Artificial Intelligence)

Download

Browse Figures

Versions Notes

Abstract

Field-service operations for utility companies require routing technicians across multiple depots while guaranteeing same-day response to critical infrastructure customers, a constraint that standard multi-depot routing methods cannot structurally enforce. We introduce the MDVRP with Priority Requests (MDVRP-PR), formalised as a lexicographic optimisation problem that guarantees service to priority customers before maximising coverage and minimising route duration. A three-stage pipeline is proposed: hybrid DBSCAN-Hierarchical clustering for topology-aware depot assignment, an Enhanced Max-Min Ant System (MMAS) with priority-driven construction, lexicographic solution selection, and repair, and a Boundary Relocate post-optimisation stage with global cross-depot recovery. The approach is evaluated on a real-world applied case study from Algérie Télécom (Guelma, Algeria), comprising a single four-depot field-service instance scaled to three sizes (55, 90, and 150 customers) and assessed over 2135 controlled runs. On this case study, the proposed clustering method outperforms the MDVRP-adapted Sweep baseline by

22.9

percentage points on the largest instance (n = 150; Friedman p < 0.001). The priority mechanisms sustain

100 %

feasibility across all configurations, compared to complete collapse without them (

0 / 10

seeds at

\geq 40 %

priority), at a route-time overhead below

5 %

. Relative to the company’s current manual practice, the framework improves customer coverage by

16.1

percentage points within 28 s, confirming its practical utility for daily deployment in this capacity-constrained, priority-sensitive routing context.

Keywords:

multi-depot vehicle routing problem; priority requests; DBSCAN clustering; max-min ant system

1. Introduction

The Vehicle Routing Problem (VRP), first formalized by Dantzig and Ramser [1] as a generalization of the Traveling Salesman Problem, has given rise to a rich taxonomy of variants. The Capacitated VRP (CVRP) introduces vehicle-load limits and is the canonical reference variant for which exact and metaheuristic methods continue to advance [2]; the VRP with Time Windows (VRPTW) adds customer-side temporal feasibility and remains a workhorse for both academic benchmarks and applied delivery problems [3]; and the Multi-Depot VRP (MDVRP) couples customer-to-depot assignment with route construction across a fleet split among several geographically distinct depots [4,5]. Each captures specific structural features of real-world logistics and service operations.

Beyond these foundational variants, several research threads currently extend the VRP toward emerging operational realities. Two-echelon and satellite-based formulations decompose urban delivery into city-level transfers and last-mile distribution [6]; green and electric VRPs incorporate energy consumption, charging infrastructure, and emissions into the objective and constraints [7]; synchronization VRPs handle interdependent operations between vehicles, drivers, or facilities that must coincide in time and space [8]; drone-aided routing combines ground vehicles with unmanned aerial vehicles to exploit complementary mobility profiles [9]; and time-dependent VRPs allow travel times to vary with traffic conditions across the planning horizon [10]. These threads collectively illustrate that the VRP literature is rapidly diversifying along technological and sustainability axes.

Despite this breadth, applied VRP research remains concentrated in developed-economy contexts [11,12,13,14], leaving a gap for settings where asymmetric road networks, heterogeneous fleet distributions, and rigid administrative partitioning create operational challenges that standard benchmarks do not reflect.

This paper is motivated by a concrete instance of that gap: the daily field-service routing operations of Algérie Telecom, the state-owned telecommunications provider of Algeria. The company maintains infrastructure across all 69 provinces through a network of Technical Centers (TCs), each dispatching dedicated Technical Teams (TTs) to handle customer maintenance requests. Under the current practice, each TC partitions its service territory into fixed sub-zones assigned permanently to individual TTs, and teams plan their own routes based on field experience. This rigid scheme has two measurable consequences: teams in adjacent zones cannot collaborate even when one is overloaded and the other is idle, and more critically, no formal mechanism exists to guarantee that high-priority customers are served before the working day ends.

The urgency of the latter failure is significant. Algérie Telecom serves two operationally distinct customer classes: Regular Customers (RCs), private residential users covered by a 48-h response commitment, and Priority Customers (PCs), comprising government agencies, security services, and critical enterprises whose operations cannot sustain prolonged outages and who require same-day service. Routing a team to a residential customer before attending to a nearby government agency is not merely suboptimal—it constitutes a service-level breach with administrative consequences. Yet distance-minimizing formulations are structurally unable to prevent it, and to our knowledge no existing MDVRP formulation makes this distinction explicit at the customer level.

Formally, the problem maps to an MDVRP in which TCs are depots. Standard cluster-first, route-second MDVRP methods [15,16] decompose the problem into a spatial assignment phase followed by independent per-depot routing. This decomposition introduces a structural tension: customers near depot boundaries are frequently misassigned by spatial clustering, yet the routing phase treats assignments as fixed and cannot correct them, even when doing so would reduce cost. Under real-road asymmetry and hard priority constraints, this limitation is particularly damaging.

We address both gaps—the absence of a priority-aware MDVRP formulation and the misassignment problem inherent to the cluster-then-route paradigm—through an applied problem variant and a three-stage solution framework. The Multi-Depot VRP with Priority Requests (MDVRP-PR) models service urgency as a lexicographic hard constraint: every priority customer must be served before coverage and travel time are optimized. This is a stronger guarantee than weighted or soft-penalty approaches, because it structurally prevents any solution that leaves a priority customer unserved from being accepted, regardless of the travel-time savings it might offer. The solution framework integrates: (i) a road-distance-aware DBSCAN clustering stage with capacity-constrained merging, which distributes workload across the full fleet rather than producing geometrically compact but operationally imbalanced assignments; (ii) an enhanced Max-Min Ant System (MMAS) with a priority-first construction rule that reserves route capacity for priority customers during ant construction rather than relying on post-hoc repair; and (iii) a Boundary Relocate post-optimization operator that corrects depot misassignments using actual insertion and removal costs, explicitly bridging the assignment-routing gap that the clustering phase leaves open.

The primary contributions of this work are:

MDVRP-PR: to the best of our knowledge, the first MDVRP formulation in which priority is enforced as a structural admissibility constraint over the feasible region (lexicographic feasibility) rather than as a soft penalty, weighted reward, or time-window encoding. The formulation is motivated by and validated on real operational data from Algérie Telecom’s field service in the Guelma province, Algeria.
Topology-aware clustering: a hybrid DBSCAN-hierarchical assignment procedure that uses asymmetric road distances and capacity-constrained merging to achieve full fleet utilization, improving large-instance coverage by up to $+ 22.9$ percentage points over sweep-based assignment on the studied case.
Priority-first MMAS: an enhanced Max-Min Ant System in which must-serve constraints are enforced structurally during construction, achieving 100% priority feasibility across all tested configurations at a route-time overhead below 5%.
Boundary Relocate operator: a cross-depot post-optimization move that evaluates relocation candidates by actual insertion and removal costs rather than geometric proximity, correcting the systematic misassignments that spatial clustering produces near depot boundaries.

The remainder of this paper is organized as follows: Section 2 reviews related work; Section 3 presents the formal problem definition; Section 4 details the proposed methodology; Section 5 and Section 6 describe the experimental setup and results; and Section 7 concludes.

2. Related Work

The literature directly relevant to this work spans three interconnected areas: the classical MDVRP and its benchmark methods, which establish the baseline against which our formulation is positioned; clustering-based decomposition strategies for MDVRP, whose structural limitations motivate our topology-aware assignment and Boundary Relocate operator; and ACO/MMAS-based solution frameworks, whose uniform construction rules motivate our priority-first extension. A fourth thread, priority and service-class modeling in VRP, is reviewed separately, as it most directly concerns our core problem novelty. Table 1 summarizes the works reviewed across these threads, highlighting the features most relevant to our contributions.

2.1. Classical MDVRP: Formulations and Benchmark Methods

The classical MDVRP requires simultaneously determining customer-to-depot assignments and vehicle route sequences, two interdependent decisions that render it NP-hard. Early work established the solution templates that subsequent research has built upon. Renaud et al. [16] proposed the first dedicated tabu search heuristic (FIND) for the MDVRP, structured into fast improvement and diversification phases, which remained competitive for nearly a decade. Cordeau et al. [4] introduced a unified tabu search capable of solving the MDVRP, the Periodic VRP, and the Periodic MDVRP within a single framework, and their benchmark instances have since become the standard evaluation testbed for the problem. Ho et al. [17] demonstrated that hybrid genetic algorithms benefit substantially from informed initialization: seeding with Clarke–Wright savings routes produced consistently stronger solutions than random initialization, pointing to the value of constructive heuristics within population-based methods.

The most competitive reference method for the classical problem remains the Hybrid Genetic Search with Adaptive Diversity Control (HGSADC) of Vidal et al. [5]. It combines a biased fitness function that jointly ranks solution cost and population diversity, a periodic crossover operator for pattern assignment, and a dual-population management scheme. HGSADC achieves near-optimal solutions on the Cordeau benchmark instances and has served as the primary comparison baseline for over a decade. Azad et al. [18] extended the base formulation to the Multi-Depot Periodic VRP with Interdependent Depot Operations, solving it with a memetic algorithm that combines pattern-assignment initialization and selective simulated annealing, achieving new best-known solutions on 7 of 10 benchmark instances. Mancini [19] addressed a Multi-Depot Multi-Period VRP with a heterogeneous fleet using an ALNS matheuristic that solves an exact MIP subproblem at each iteration, reporting 8–10% cost improvements over a commercial solver. Lim et al. [20] introduced the first complete MIP formulation for the Multi-Depot Split-Delivery VRP with heterogeneous vehicles; permitting multiple depot visits per customer reduces total travel distance by up to 16%. Chen and Yang [21] reformulated the MDVRP as a bilevel optimisation that integrates urban traffic equilibrium, reducing delivery time by nearly 30% over distance-minimising methods, though the model requires a full origin-destination traffic matrix that is rarely available in practice.

Across all of these contributions, customer requests are treated as operationally homogeneous: no distinction is made between service classes that differ in response urgency, and no structural mechanism exists to prevent a lower-priority customer from being served before a higher-priority one.

2.2. Clustering-Based Decomposition for MDVRP

Given the NP-hardness of the MDVRP, the cluster-first, route-second paradigm is one of the most widely adopted solution strategies. It decomposes the problem into a depot-assignment phase followed by independent single-depot routing. While this approach substantially reduces computational complexity, solution quality depends critically on the clustering decisions, and purely spatial partitioning introduces two recurring structural limitations that are directly relevant to the present work.

The first is workload imbalance. When the spatial distribution of customers does not align with depot capacities, nearest-depot or geometry-based assignment produces overloaded depots alongside underutilised ones. Gu et al. [22] applied nearest-depot assignment as the first stage of a three-phase framework using an Artificial Bee Colony algorithm and explicitly acknowledged that fixing assignments prior to route construction may preclude globally optimal solutions. Tao et al. [23] addressed a Multi-Depot Capacitated VRP with Order Split and Allocation using a Variable Neighbourhood Search with seven specialised operators, but relied on a pre-determined depot structure without adaptive load rebalancing. Wang et al. [24] and Wang et al. [25] both employ k-medoids clustering as a preprocessing step for collaborative multi-depot routing under dynamic demands; neither accounts for asymmetric road distances in the assignment phase, so workload imbalances persist whenever road geometry departs from Euclidean proximity.

The second is boundary misassignment. Customers near service territory boundaries are frequently allocated to suboptimal depots by spatial clustering, and the routing phase—treating assignments as fixed—cannot correct these misallocations even when routing costs clearly favour an alternative depot. Stodola and Nohel [26] proposed AACO-NC, currently the most competitive ACO-based method for the classical MDVRP, matching or exceeding best-known solutions on 18 of 23 Cordeau benchmark instances and establishing four new best-known results. Despite this strong performance, its clustering component is purely spatial, does not account for depot workload balance, and provides no mechanism to revise depot assignments once the clustering phase is complete. Yu et al. [27] and Xu et al. [28] likewise assign customers to the nearest depot without considering routing efficiency, and report that this oversimplification degrades solution quality on instances where customers are not spatially well-separated by depot.

These two weaknesses jointly motivate the topology-aware DBSCAN-based clustering with capacity-constrained merging proposed in Stage 1 of our framework, and the Boundary Relocate post-optimization operator in Stage 3, which corrects residual misassignments using actual route insertion and removal costs rather than geometric proximity.

A complementary line of recent work explores dynamic decomposition that operates inside the metaheuristic loop rather than as a static first phase. Santini et al. [29] repeatedly break the current elite solution into smaller subproblems by clustering its routes (e.g., on geographic barycenters), solve each subproblem from scratch with the same solver, and reassemble the improved fragments, reporting consistently stronger results than using decomposition only as pre- or post-processing. Their decomposition is therefore solution-driven and adaptive, whereas our Stage 1 decomposes at the customer level once before any routing, with the cross-depot Boundary Relocate operator in Stage 3 acting as the corrective mechanism. The two strategies are complementary rather than substitutes: the dynamic decomposition ofSantini et al. [29] reorganizes routes during search, while our pipeline reorganizes depot assignments after construction. We do not claim methodological superiority over dynamic decomposition; we focus on the assignment side of the cluster-first paradigm, which is the locus of the boundary-misassignment failures observed in our applied case. Recent reviews of related research threads, including time-dependent vehicle routing [10], indicate that adaptive within-loop decomposition and depot-aware structural correction are likely to be combined in future MDVRP work.

2.3. ACO and MMAS for Multi-Depot Routing

Ant Colony Optimisation, and the Max-Min Ant System (MMAS) in particular, has demonstrated strong performance across VRP variants. This is largely due to the natural alignment between pheromone-guided construction and the sequential, interdependent nature of routing decisions. MMAS improves upon basic ACO by bounding pheromone concentrations within a dynamic interval

[τ_{min}, τ_{max}]

and restricting updates to the best solutions found, thereby preventing premature stagnation while preserving exploitation of high-quality routes [30].

Ting and Chen [31] introduced the first ACO-based method for the MDVRPTW, proposing a dual pheromone system with separate matrices for customer-depot assignment and route sequencing, combined with simulated annealing for intensification; this yielded six new best-known solutions on Cordeau instances. Their design illustrates the value of separating assignment and sequencing signals in pheromone management, though the construction rule remains uniform across all customers. Niu et al. [32] applied MMAS-based decomposition to the Multi-Depot Cumulative Capacitated VRP, maintaining a master pheromone matrix alongside subproblem-specific matrices and using pheromone trail smoothing combined with local search for perturbation; their analysis identified initialisation bias toward nearest-depot assignments and fixed subproblem boundaries as the principal limiting factors. Stodola and Nohel [26] demonstrated that adaptive pheromone management calibrated to population entropy outperforms fixed-parameter ACO variants in the MDVRP context, establishing entropy-driven evaporation as a practically effective anti-stagnation mechanism.

A limitation shared by all existing ACO and MMAS formulations for multi-depot routing is that route construction applies a uniform probabilistic selection rule to all customers, with no mechanism to reserve route capacity for urgent service requests during construction. When must-serve constraints are handled at all, they are delegated to post-hoc repair, where insertion slots may already be scarce. The priority-first construction rule introduced in Stage 2 of our framework addresses this gap directly: whenever any priority customer remains feasible, the candidate set is restricted to priority customers only, so that must-serve constraints are enforced structurally during construction rather than corrected after the fact.

2.4. Priority and Service-Class Modeling in VRP

Priority and service-class differentiation in VRP has received limited but meaningful attention, primarily in single-depot settings and adjacent routing domains. Four main mechanisms appear in the literature. Time-window tightening assigns narrow delivery windows to high-priority customers [33]. It requires translating urgency into a numerical time commitment and does not constitute a hard must-serve guarantee: a solution that misses a tight window is penalised but not rejected. Weighted objectives penalise late or missed service through cost coefficients [12]. These formulations are structurally unable to prevent priority-infeasible solutions: a route that leaves a priority customer unserved may still be accepted when the associated travel-time saving is large enough. Quantity-based priority encodes urgency through delivery rewards rather than scheduling: Wu et al. [34] formulate a workload-consistent VRP under demand uncertainty in which each demand site i carries a weight coefficient

e_{i}

that influences how much supply it receives, so priority is expressed as distributional fairness rather than service order. Tardiness-driven priority embeds urgency in the search procedure rather than in the model: Bock [35] maintains a min-heap of partial solutions inside a branch-and-bound algorithm; at each insertion, a heuristic completion repeatedly appends the unserviced order with the highest priority

ρ_{i}

, which is derived from the earliest completion time attainable by some vehicle. The priority rule there acts as a tie-breaking device for fast feasibility, not as a structural admissibility constraint over the solution space. Hard sequencing constraints, which force priority customers to be inserted before regular ones within each route, are the mechanism most closely aligned with our formulation, but their application in the multi-depot context has not, to our knowledge, been combined with a lexicographic admissibility rule that filters entire solutions out.

In field-service and emergency logistics, service urgency is operationally prevalent but rarely modeled as a formal routing constraint. Maintenance dispatch and telecommunications repair operations classify customers by response-time commitment [14], yet the routing literature has not translated this classification into a structural priority constraint within an MDVRP framework. Emergency vehicle routing models do enforce hard response-time deadlines, but these are single-depot or network-flow formulations that do not address the assignment–routing interdependence of the multi-depot setting.

Our positioning relative to these threads is therefore narrower and more precise than a generic claim of novelty would suggest. Quantity-based priority [34] answers a different question, namely how much to deliver under demand uncertainty in a single-depot consistent VRP, with priority entering as a hierarchical reward in the objective rather than as a feasibility rule. Tardiness-driven priority [35] concerns single-vehicle routing through connected service areas and embeds urgency as a heuristic completion device inside a branch-and-bound algorithm rather than as a constraint over the feasible region. Time-window-based priority encodings [33] are most often deployed in multi-depot settings, but a missed window is penalised, not rejected. To the best of our knowledge, the MDVRP-PR introduced in this paper is the first MDVRP formulation in which priority is enforced as a structural admissibility constraint defining the feasible region (lexicographic feasibility): it combines this hard admissibility rule on priority customers with a topology-aware depot assignment and a cross-depot repair operator within a single operational pipeline. The resulting guarantee, that solutions leaving any priority customer unserved are rejected regardless of the travel-time savings they might offer, is motivated by the field-service operations of Algérie Telecom, where routing a vehicle to a residential customer before attending to a nearby government agency constitutes an administrative service-level breach whose consequences cannot be traded off against travel-time savings.

Table 1. Summary of related MDVRP literature. PC = priority-customer treatment (No = none, Quantity = priority via delivered amount, Tardiness = priority as search-time tie-breaker, Hard lex. = lexicographic admissibility constraint); Cluster = clustering strategy (ND = nearest-depot, Km = k-medoids/k-means, Sp = spatial sectorization, DB+H = DBSCAN with capacity-constrained hierarchical merging, Dyn. = dynamic within-loop route decomposition, − = no clustering phase, · = decomposition without spatial clustering); XD-corr. = cross-depot correction after initial clustering; Pri.-cons. = construction rule that structurally enforces priority, during route building (Heur. = priority used only as a heuristic completion device, not as a hard rule).

Reference	Variant	PC	Cluster	XD-corr.	Pri.-cons.	Algorithm
Classical MDVRP and benchmark methods
Renaud et al. [16]	MDVRP	No	−	No	No	Tabu Search (FIND)
Cordeau et al. [4]	MDVRP/PVRP	No	−	No	No	Unified Tabu Search
Ho et al. [17]	MDVRP	No	−	No	No	Hybrid GA
Vidal et al. [5]	MDVRP/PVRP	No	−	No	No	HGSADC
Azad et al. [18]	MD Periodic+Interdep.	No	−	No	No	Memetic Algorithm
Mancini [19]	MD Multi-Period	No	−	No	No	ALNS Matheuristic
Lim et al. [20]	MD Split-Delivery	No	−	No	No	MIP + GA
Chen and Yang [21]	MDVRP + Traffic Eq.	No	−	No	No	Bilevel GA
Clustering-based decomposition
Yu et al. [27]	Dynamic MDVRP	No	ND	No	No	Improved ACO
Xu et al. [28]	Dynamic MDVRP	No	ND	No	No	Hybrid ACO
Gu et al. [22]	MDVRP	No	ND	No	No	ABC + Coevolution
Tao et al. [23]	MD-CVRP-OSA	No	·	No	No	VNS (7 operators)
Wang et al. [24]	Collab. MD Dynamic	No	Km	No	No	k-Medoids + Hybrid
Wang et al. [25]	Collab. MD Dynamic	No	Km	No	No	k-Medoids + Hybrid
ACO and MMAS-based methods
Ting and Chen [31]	MDVRPTW	No	·	No	No	MACS-SA (dual pheromone)
Niu et al. [32]	MD Cumul. CVRP	No	ND	No	No	MMAS + Decomposition
Stodola and Nohel [26]	MDVRP	No	Sp	No	No	AACO-NC (entropy PH)
Recent priority and decomposition formulations
Wu et al. [34]	VRP + workload/uncert.	Quantity	−	No	No	Branch-and-cut
Bock [35]	Connected service areas	Tardiness	·	No	Heur.	Branch-and-bound
Santini et al. [29]	VRP (dyn. decomp.)	No	Dyn.	N/A	No	Within-loop decomposition
This work
MDVRP-PR (Ours)	MD + Priority Req.	Hard lex.	DB+H	Yes	Yes	Our Method

3. Problem Formulation

3.1. Problem Definition

We consider a Multi-Depot Vehicle Routing Problem with Priority Requests (MDVRP-PR) motivated by a field-service operation in the province of Guelma, Algeria. A set

N = {1, \dots, N}

of geographically dispersed customers must be served by vehicles dispatched from K depots (technical centers), indexed by

K = {1, \dots, K}

. Depot k operates a homogeneous fleet of

m_{k}

vehicles over a common planning horizon

[T_{s}, T_{e}]

, yielding a maximum route duration of

T_{max} = T_{e} - T_{s}

. Each customer

i \in N

requires a service of duration

s_{i} > 0

. A designated subset

P \subseteq N

of priority customers must be served in every feasible solution.

The road-network data, distances

d_{i j}

(km), and travel times

t_{i j}

(min), are obtained from the Google Maps Distance Matrix API: https://developers.google.com/maps/documentation/distance-matrix/overview (accessed on 1 December 2025). Both matrices are asymmetric (

t_{i j} \neq t_{j i}

in general), reflecting one-way streets, turning restrictions, and other real-world road characteristics.

Because fleet capacity and working-hour limits may preclude serving all customers, the problem admits partial coverage: non-priority customers may remain unserved. The objective is therefore lexicographic: first, guarantee service to every priority customer; second, maximize the number of served customers (coverage); third, among solutions with equal coverage, minimize the total route duration (travel plus service time). The decision variables determine (i) an assignment of customers to depots, (ii) a partitioning of each depot’s customers into vehicle routes, and (iii) the visiting sequence within each route.

3.2. Notation

The notation used throughout the paper is summarized in Table 2.

3.3. Mathematical Model

The MDVRP-PR is formulated with a lexicographic objective that reflects the three-level priority structure of the problem. Let

S = {i \in N : \sum_{k} y_{i}^{k} = 1}

denote the set of served customers. The objective is:

lex-opt (\underset{\min : unserved priority}{\underset{⏟}{| P ∖ S |}}, \underset{\min (i . e ., \max coverage)}{\underset{⏟}{- | S |}}, \underset{\min : total time}{\underset{⏟}{Z}})

(1)

where the total route duration is defined as:

Z = \sum_{k \in K} \sum_{v \in V_{k}} \sum_{i \in L_{k}} \sum_{j \in L_{k}} x_{i j}^{k v} (t_{i j} + s_{j})

(2)

That is, a solution with fewer unserved priority customers is always preferred; among solutions serving all priority customers, one with higher total coverage dominates; and among coverage-equivalent solutions, the one with the smallest total duration is selected.

The model is subject to the following constraints:

\begin{matrix} \sum_{k \in K} y_{i}^{k} & \leq 1, & \forall i \in N \end{matrix}

(3)

\begin{matrix} \sum_{k \in K} y_{i}^{k} & = 1, & \forall i \in P \end{matrix}

(4)

\begin{matrix} \sum_{v \in V_{k}} \sum_{j \in L_{k}} x_{i j}^{k v} & = y_{i}^{k}, & \forall i \in N, \forall k \in K \end{matrix}

(5)

\begin{matrix} \sum_{i \in L_{k}} x_{i j}^{k v} & = \sum_{i \in L_{k}} x_{j i}^{k v}, & \forall j \in N_{k}, \forall v \in V_{k}, \forall k \in K \end{matrix}

(6)

\begin{matrix} \sum_{j \in N_{k}} x_{o_{k} j}^{k v} & \leq 1, & \forall v \in V_{k}, \forall k \in K \end{matrix}

(7)

\begin{matrix} \sum_{i \in N_{k}} x_{i o_{k}}^{k v} & \leq 1, & \forall v \in V_{k}, \forall k \in K \end{matrix}

(8)

\begin{matrix} \sum_{i \in L_{k}} \sum_{j \in L_{k}} x_{i j}^{k v} (t_{i j} + s_{j}) & \leq T_{max}, & \forall v \in V_{k}, \forall k \in K \end{matrix}

(9)

\begin{matrix} \sum_{v \in V_{k}} \sum_{j \in N_{k}} x_{o_{k} j}^{k v} & \leq m_{k}, & \forall k \in K \end{matrix}

(10)

\begin{matrix} \sum_{i \in S} \sum_{j \in S} x_{i j}^{k v} & \leq | S | - 1, & \forall S \subseteq N_{k}, | S | \geq 2, \forall v \in V_{k}, \forall k \in K \end{matrix}

(11)

\begin{matrix} x_{i j}^{k v} \in {0, 1}, y_{i}^{k} & \in {0, 1} \end{matrix}

(12)

Constraints (3) ensure that each customer is assigned to at most one depot. Constraint (3) is an inequality, permitting non-priority customers to remain unserved when route-duration limits preclude full coverage; by contrast, constraints (4) enforce mandatory service for all priority customers via equality. Together, these two constraints encode the first level of the lexicographic objective: any feasible solution must serve every priority customer, while non-priority customers are served to the extent permitted by the fleet. Constraints (5) link the assignment and routing variables: a customer is visited if and only if it is assigned to that depot. Flow conservation at customer nodes is imposed by (6). Constraints (7) and (8) guarantee that each vehicle performs at most one route, departing from and returning to its home depot. The working-hour limit is enforced by (9), bounding the total duration of each route to

T_{max}

. Fleet-size constraints (10) ensure that no depot dispatches more vehicles than available. Sub-tour elimination constraints (11) prevent disconnected cycles. Finally, (12) imposes binary integrality.

Remark 1.

The lexicographic structure (1) reflects the operational reality at Algérie Telecom: failing to serve a priority customer (a government agency or critical enterprise) is categorically worse than leaving a regular residential customer unserved, and leaving any customer unserved is worse than a modest increase in travel time. This hierarchy is enforced throughout the solution framework: the MMAS uses lexicographic solution selection (Section 4.2), and the post-optimization stage rejects any move that increases the number of unserved priority customers (Section 4.3).

3.4. Problem Characteristics

The instance under study exhibits several features that distinguish it from standard MDVRP benchmarks and warrant a tailored solution approach.

(i): Asymmetric, real-world travel data. Unlike Euclidean or symmetric benchmark instances, the travel-time and distance matrices are fully asymmetric, derived from actual road-network queries. The triangle inequality does not hold in general; consequently, route reversal yields a different cost, and standard symmetric neighbourhood operators (e.g., 2-opt) must account for directionality.
(ii): Heterogeneous fleet distribution. The depots possess unequal fleet capacities, creating an inherently imbalanced workload-distribution problem, further complicating the customer-to-depot assignment.
(iii): Duration-constrained routes. Vehicles are uncapacitated in terms of load. The sole binding constraint on route length is the working day length.
(iv): Heterogeneous service times. Service durations vary considerably from one customer to another, and constitute a substantial share of total route time. This variability makes service time a primary factor in feasibility assessment and objective evaluation, unlike many benchmark problems where it is assumed uniform or negligible.
(v): Priority customers. A configurable proportion of customers may be designated as priority ( $P$ ). Their mandatory-service requirement (constraint (4)) constitutes the first level of the lexicographic objective (1), and must be satisfied before coverage maximization or time minimization are considered.
(vi): Non-uniform spatial distribution. The N customers span numerous municipalities, with inter-customer distances ranging from less than 1 km (urban clusters) to over 110 km (remote areas). This dispersion produces dense urban pockets interspersed with sparse rural regions, a structure that purely distance-based assignment heuristics may fail to exploit.

4. Methodology

This section details each stage of the proposed framework. Figure 1 illustrates the overall pipeline.

4.1. Stage 1: Hybrid DBSCAN-Hierarchical Clustering

Standard clustering methods (K-Means, K-Medoids) optimize geometric compactness, producing spherical clusters that may misalign with the road network. The proposed method instead uses road-distance-based density clustering to produce topology-aware depot assignments. Algorithm 1 summarizes the seven-step procedure.

The medoid of a cluster C (line 2) minimizes total intra-cluster road distance:

m_{C} = arg min_{c \in C} \sum_{c^{'} \in C} d (c, c^{'})

(13)

where

d (c, c^{'})

denotes the road distance (km) between customers c and

c^{'}

. The entire clustering stage is fully deterministic: DBSCAN, the two-phase geographic merging, and the capacity rebalancing have no random components, ensuring reproducible depot assignments.

Algorithm 1 Hybrid DBSCAN-Hierarchical Clustering

Require: Customers $N$ , depots $K$ , road-distance matrix D, vehicles per depot ${m_{k}}$
Ensure: Depot assignments ${N_{k}}_{k \in K}$

1:: Apply DBSCAN [36] on D ( $ε$ = 10.88 km, $minPts$ = 2) → micro-clusters ${C_{1}, \dots, C_{q}}$ + noise set
2:: Compute medoid $m_{C}$ for each micro-cluster via Equation (13)
3:: Assign each noise point to the micro-cluster chosen by k-NN majority vote (k = 3) over medoids
4:: Build medoid-to-medoid distance matrix $\bar{D} \in R^{q \times q}$ from D
5:: Set expected capacity ${\hat{n}}_{k} \leftarrow ⌈ (m_{k} / \sum_{k^{'}} m_{k^{'}}) \cdot N ⌉$ for each depot
6:: Phase 1 (E–W axis). Identify the westernmost depot $d_{W}$ and easternmost depot $d_{E}$ by depot longitude. Sort micro-clusters by medoid longitude. Round-robin assign from the western end toward $d_{W}$ and from the eastern end toward $d_{E}$ until the total expected route time at each depot reaches its capacity bound $m_{k} \cdot T_{max}$
7:: Phase 2 (N–S axis). Among the remaining $K - 2$ depots, identify the northernmost $d_{N}$ and southernmost $d_{S}$ by latitude. Assign the remaining micro-clusters by medoid latitude until depot capacities are met
8:: if any depot k has $| N_{k} | > {\hat{n}}_{k} (1 + δ)$ then ▹ $δ = 0.20$
9:: Specificity pass: re-assign customers sorted by $ϕ_{i} = t (c_{i}, d_{2 nd}) / t (c_{i}, d_{nearest})$ ; low-specificity customers assigned last
10:: Overflow pass: move excess customers to nearest under-capacity depot
11:: Swap refinement: relocate customers $> 20 %$ closer to an alternative depot
12:: end if
13:: return ${N_{k}}$

Scope of the Geographic Merging Step

The two-phase merging procedure (Algorithm 1, lines 6–7) uses geographic priors specific to the four-depot field-service configuration of Algérie Telecom in the Guelma region: the depots separate cleanly along an east–west axis (TC3 west, TC4 east) and a north–south axis (TC1, TC2), and the operational service area maps to a roughly rectangular geographic envelope. Phase 1 exploits this layout by saturating the two extreme depots first, since their service territories are the most constrained by geography; Phase 2 then resolves the interior. This is a deliberate operational design choice rather than a domain-general clustering routine: applying the pipeline to MDVRP instances with arbitrary depot counts (

K \neq 4

), depot layouts that do not separate along orthogonal geographic axes, or service areas with strongly non-convex boundaries, would require substituting Algorithm 1, lines 6–7, with a generic capacity-constrained agglomerative merger on

\bar{D}

. The remaining stages of the pipeline (Section 4.2 and Section 4.3) are independent of this choice and are domain-general; we return to this scope distinction in the Limitations.

4.2. Stage 2: Max-Min Ant System (MMAS)

Each depot subproblem is solved independently using the Max-Min Ant System [30]. The MMAS operates within pheromone bounds

[τ_{min}, τ_{max}]

to prevent premature convergence while preserving learned solution structure. All pheromone trails are initialized to

τ_{max}

, the standard MMAS choice that maximizes early exploration [30]. Algorithm 2 gives the overall iteration loop; each phase is detailed below.

Algorithm 2 Enhanced MMAS for depot subproblem k

Require: Customers $N_{k}$ , depot $o_{k}$ , vehicles $m_{k}$ , parameters $α, β, ρ, Q, I_{max}$
Ensure: Best solution $S^{*}$

1:: Initialize $τ_{i j} \leftarrow τ_{max}$ for all arcs; $S^{*} \leftarrow NULL$ ; $stag \leftarrow 0$
2:: for $iter = 1, \dots, I_{max}$ do
3:: Construct $n_{k}$ solutions via Equations (14) and (15); if priority-aware, restrict $F$ per Equation (16)
4:: Sort solutions by lexicographic criterion (17)
5:: Apply 2-opt to iteration-best $S^{ib}$ ; apply cheapest-insertion repair to $S^{ib}$
6:: $S^{ib} \leftarrow$ best solution in current iteration
7:: if $S^{ib} ≻_{lex} S^{*}$ then
8:: $S^{*} \leftarrow S^{ib}$ ; $stag \leftarrow 0$
9:: else
10:: $stag \leftarrow stag + 1$
11:: end if
12:: Evaporate and deposit pheromone via Equation (18); depositing solution follows ib → gb schedule
13:: Clamp $τ_{i j} \in [τ_{min}, τ_{max}]$ per Equation (19)
14:: if $stag \geq ⌊ n_{k} / 2 ⌋$ then
15:: PTS: $τ_{i j} \leftarrow τ_{i j} + 0.3 (τ_{max} - τ_{i j})$ ; $stag \leftarrow 0$
16:: end if
17:: end for
18:: Apply final repair to $S^{*}$
19:: return $S^{*}$

4.2.1. Construction Phase

Each of the

m = n_{k}

ants (one per customer assigned to the depot) builds a solution by sequentially assigning customers to vehicle routes. At each step, the ant selects the next customer

c_{j}

from the set of feasible (duration-respecting) candidates with probability:

p_{i j} = \frac{{[τ_{i j}]}^{α} {[η_{i j}]}^{β}}{\sum_{l \in F} {[τ_{i l}]}^{α} {[η_{i l}]}^{β}}

(14)

where

τ_{i j}

is pheromone,

η_{i j}

is heuristic information, and

F

is the feasible set. A customer is feasible if inserting it and returning to the depot does not violate the route-duration constraint (9). When all vehicles are exhausted, remaining customers are marked as unserved.

Savings-Based Heuristic

Instead of the standard inverse-distance visibility

η_{i j} = 1 / t_{i j}

, we use a savings-aware heuristic inspired by Clarke and Wright [37]:

η_{i j} = \frac{max (s_{i j}, ϵ)}{t_{i j}}, s_{i j} = t_{i, o_{k}} + t_{o_{k}, j} - t_{i j}

(15)

where

ϵ = 0.1

prevents zero visibility. This biases ants toward edges that save the most depot-detour, naturally producing tighter route loops. For depot ↔ customer edges, the standard

η_{i j} = 1 / t_{i j}

is used.

Priority-First Construction Rule

When priority-aware mode is active, if any feasible customer in the candidate set is a priority customer, the set

F

is restricted to priority customers only:

F^{'} = \{\begin{matrix} {c \in F : c \in P} & if F \cap P \neq Ø \\ F & otherwise \end{matrix}

(16)

This ensures priority customers are served during construction rather than deferred to the repair phase, where insertion slots may be scarce.

4.2.2. Local Search

Following Dorigo and Stützle [38], 2-opt local search is applied to the iteration-best solution before pheromone deposit. The 2-opt uses best-improvement with prefix-sum acceleration for asymmetric travel time evaluation: for each pair

(i, j)

, the cost of reversing the segment

[i, j]

is computed in

O (1)

via precomputed forward and reverse prefix sums, yielding an

O (n_{k}^{2})

-per-pass procedure that repeats until no improving swap is found.

4.2.3. Repair Phase

Cheapest-insertion repair is applied to the iteration-best solution after 2-opt (line 5 of Algorithm 2), following the same principle of concentrating expensive post-processing on the most promising candidate [39]. Each unserved customer is tested at every position in every route; the feasible insertion with the smallest travel-time increase is selected. When priority-aware, priority customers are processed first, giving them access to the most favorable slots.

4.2.4. Solution Selection

When priority-aware, solutions are compared according to the lexicographic objective (1):

unserved priority ≻ unserved customers ≻ total time

(17)

A solution with fewer unserved priority customers is always preferred; among solutions with equal priority feasibility, higher coverage (fewer total unserved) dominates; total time breaks remaining ties. In priority-blind mode, solutions are ranked by total time only.

4.2.5. Pheromone Update

After each iteration, pheromone trails are evaporated and a single solution deposits pheromone:

τ_{i j} \leftarrow (1 - ρ) τ_{i j} + Δ τ_{i j}^{best}, Δ τ_{i j}^{best} = \frac{Q}{T^{best}}

(18)

where

ρ = 0.02

is the evaporation rate,

Q = 100

is the deposit constant, and

T^{best}

is the total time of the depositing solution. Following the original MMAS schedule [30], the identity of the depositing solution transitions gradually from iteration-best to global-best as the search progresses:

Progress (% of Iterations)	Depositing Solution
$[0, 25)$	Iteration-best (always)
$[25, 50)$	Iteration-best $\frac{2}{3}$ , global-best $\frac{1}{3}$
$[50, 75)$	Iteration-best $\frac{1}{3}$ , global-best $\frac{2}{3}$
$[75, 100]$	Global-best (always)

This schedule maximizes exploration early on and gradually shifts toward exploitation of the best-known solution. All trails are clamped to

[τ_{min}, τ_{max}]

, where:

τ_{max} = \frac{1}{ρ \cdot T_{gb}}, τ_{min} = τ_{max} \cdot \frac{1 - p_{best}^{1 / n_{k}}}{(n_{k} / 2 - 1) p_{best}^{1 / n_{k}}}

(19)

with

p_{best} = 0.05

following Stützle and Hoos [30]. When no improvement is found for

⌊ n_{k} / 2 ⌋

consecutive iterations, Pheromone Trail Smoothing (PTS) is triggered:

τ_{i j} \leftarrow τ_{i j} + 0.3 \cdot (τ_{max} - τ_{i j})

, pulling all trails toward

τ_{max}

to restore exploration.

4.3. Stage 3: Cross-Depot Post-Optimization

After the per-depot MMAS, a post-optimization stage refines the solution at the multi-depot level. It consists of two operators applied sequentially.

4.3.1. Boundary Relocate

Boundary customers are customers who are approximately equidistant to multiple depots, and they may have been misassigned by the clustering. A customer

c_{i}

is classified as boundary if:

\frac{t (d_{alt}^{*}, c_{i})}{t (d_{assigned}, c_{i})} < θ and t (d_{assigned}, c_{i}) - t (d_{alt}^{*}, c_{i}) > Δ_{min}

(20)

where

θ = 1.25

and

Δ_{min} = 5

min. The ratio threshold

θ = 1.25

flags customers whose alternative depot is within 25% of the assigned depot’s travel time, a moderate criterion that captures genuinely contested assignments without flooding the candidate pool. The absolute threshold

Δ_{min} = 5

min filters out relocations whose net saving is negligible relative to the mean depot–customer travel time in the study area (≈40 min). Algorithm 3 details the iterative relocation procedure.

Algorithm 3 Boundary Relocate Post-Optimization

Require: Per-depot solutions ${S_{k}}$ , parameters $θ$ , $Δ_{min}$
Ensure: Improved solutions ${S_{k}}$

1:: Identify boundary customers B via Equation (20)
2:: $no_imp \leftarrow 0$ ; $iter \leftarrow 0$ ; $S^{*} \leftarrow {S_{k}}$
3:: while $no_imp < 50$ and $iter < 500$ do
4:: if $iter mod 10 = 0$ then re-identify B from current solutions
5:: end if
6:: $candidates \leftarrow Ø$
7:: for all boundary customer $c_{i} \in B$ currently served do
8:: $savings \leftarrow$ travel time saved by removing $c_{i}$ from its route
9:: $cost \leftarrow$ cheapest feasible insertion of $c_{i}$ into any alternative-depot route
10:: if $cost - savings < 0$ and insertion respects duration constraint then
11:: Add $(c_{i}, cost - savings)$ to $candidates$
12:: end if
13:: end for
14:: if $candidates = Ø$ then
15:: $no_imp \leftarrow no_imp + 1$ ; $iter \leftarrow iter + 1$ ; continue
16:: end if
17:: Select move from top-5 candidates by net change (randomized)
18:: Execute relocation; apply 2-opt and Or-opt to affected routes
19:: if total time improved and priority unserved ≤ current and all routes feasible then
20:: Accept; $no_imp \leftarrow 0$ ; update $S^{*}$ if best overall
21:: else
22:: Reject; $no_imp \leftarrow no_imp + 1$
23:: end if
24:: $iter \leftarrow iter + 1$
25:: end while
26:: return $S^{*}$

Moves that would increase the number of unserved priority customers are rejected (line 17), preserving priority feasibility throughout the post-optimization.

Relation to Classical Inter-Depot Operators

Inter-route exchange and relocate operators are standard in MDVRP metaheuristics. Cordeau et al. [4] and Renaud et al. [16] move single customers between any two routes (relocate) and swap pairs of customers (exchange), accepting any feasible improving move. More recent VNS schemes [40] fix a neighbourhood ladder (relocate, exchange, 2-opt*, Or-opt) and iterate over all candidate pairs. These operators are unfiltered: every (customer, target route) pair is considered in each pass, giving an

O (n \cdot R)

candidate set per iteration with n customers and R routes. Boundary Relocate is an instance of this family—single-customer cross-depot relocate with strict-improvement acceptance—but adds a topological prefilter: a customer is admitted to the candidate set only if Equation (20) holds, that is, if its travel-time ratio between the alternative and assigned depots is below

θ

and the absolute gap exceeds

Δ_{min}

. The filter selects exactly the customers that lie on the geographic boundary between two depot clusters, where reassignment is most likely to pay off, and prunes interior customers whose home depot dominates by construction. The transformation itself is otherwise unchanged from a classical relocate; the contribution is in the filter, not the move. Table 3 summarises the comparison.

4.3.2. Cross-Depot Repair

Globally unserved customers (those not served by any depot) are re-inserted using cheapest insertion across all depot routes, not only the depot they were originally assigned to. Priority customers are processed first. For each candidate, every position in every route across all depots is evaluated, and the feasible insertion with the smallest travel-time increase is selected. Modified routes are improved with 2-opt and Or-opt.

4.4. Priority Integration

Priority constraints are enforced through three complementary mechanisms that operate at different stages of the pipeline:

Construction-phase restriction (Stage 2): The candidate set is restricted to feasible priority customers whenever any are available, as defined in Equation (16).
Lexicographic solution selection (Stage 2): Solutions are ranked by unserved priority count before total unserved count and total time, as defined in Equation (17).
Priority-first repair ordering (Stages 2 and 3): In both the MMAS repair and the cross-depot repair, priority customers are inserted before non-priority customers, giving them access to the most favorable remaining route slots.

These mechanisms are jointly termed the priority-aware framework. Their combined contribution is evaluated in Experiment 3b (Section 5.5).

5. Numerical Experiments

This section describes the benchmark instances and the design of six experiments (with Experiment 3 expanded into five sub-experiments, 3a through 3e) conducted to evaluate the proposed framework. Results are presented in Section 6.

5.1. Problem Instances

The approach is evaluated on a real-world case study involving a utility company in northeastern Algeria. The problem consists of scheduling daily maintenance visits across 47 municipalities served by four depots, each with a fixed vehicle fleet and an 8-h operating window (480 min). Customer locations, service-time requirements, and pre-existing depot/team assignments were provided by the company. Pairwise road distances (km) and travel times (min) were obtained via the Google Maps Distance Matrix API, capturing the actual road network rather than Euclidean approximations.

Table 4 summarizes the instance characteristics. Three dataset sizes were constructed by progressive sampling to test scalability under increasing fleet utilization. Service times average approximately 30 min per customer. Each customer record includes a pre-assigned depot and technician team, used in Experiments 4 and 5.

The vehicle fleet is distributed unevenly across depots (ranging from 2 to 5 vehicles per depot), reflecting the company’s actual operational structure.

Metrics and Protocol

Solutions are compared using the lexicographic objective (1): a solution is preferred if it serves more priority customers; ties are broken by overall coverage, then by total time (travel + service). Auxiliary metrics reported include travel time, total distance, customer coverage (%), and priority feasibility (fraction of seeds in which all priority customers are served). Each experiment is replicated over 10 random seeds; results are reported as mean ± standard deviation.

The MMAS optimizer uses the balanced preset (

α

= 1,

β

= 2,

ρ

= 0.02,

m = n_{k}

ants, 100 iterations) throughout, where

n_{k}

is the number of customers assigned to depot k. The parameter values follow the canonical recommendations of Stützle and Hoos [30], who established

α

= 1 and

β

= 2 as the standard pheromone–heuristic balance for MMAS across routing problems, and

ρ

= 0.02 as a low evaporation rate suited to the bounded

[τ_{min}, τ_{max}]

update scheme that prevents premature convergence without requiring per-instance tuning. The number of ants is set to m =

n_{k}

, matching the customer count at each depot, which Stützle and Hoos identify as a robust default that ensures adequate solution diversity relative to the search space size. The iteration limit of 100 was selected to balance solution quality and runtime within the operational constraint of a morning planning session: preliminary trials confirmed that the global-best solution stabilises well before iteration 100 on all three instance sizes, with no improvement observed beyond iteration 70 on average. The

p_{best}

=

0.05

value used in the pheromone bound calculations (Equation (19)) likewise follows the original MMAS recommendation [30].

All experiments were executed on a single machine equipped with an Intel(R) Core(TM) i7-10700KF CPU @ 3.8 GHz and 16 GB RAM, running Ubuntu 24.04 LTS. The framework was implemented in Python 3.12. Per-depot MMAS instances were solved sequentially (no parallelism).

5.2. Experiment 1: Clustering Method Comparison

Five first-stage clustering methods are compared, each followed by the same MMAS and post-optimization pipeline:

DBSCAN-Hierarchical (proposed): road-distance-based DBSCAN micro-clustering, k-NN noise assignment (k = 3), agglomerative merging into four depot groups, and capacity-aware rebalancing ( $\pm 20 %$ tolerance). Fully deterministic.
K-Means (Euclidean) [41] (K = 4): Euclidean coordinates, random initialization. This is the canonical K-Means baseline; it ignores network topology.
K-Means (Network) (K = 4): K-Means run on a classical multidimensional-scaling embedding of the road-distance matrix used throughout the pipeline. This variant removes the metric mismatch between the baseline and our method, so any remaining performance gap reflects the grouping principle (centroid-based vs. density-based) rather than the input metric.
K-Medoids (PAM) [24] (K = 4): precomputed road-distance matrix.
Sweep (MDVRP) [26]: Gillett–Miller angular sweep adapted for MDVRP following Tillman [42]; each customer is assigned to its nearest depot by road distance, then sorted by polar angle. Fully deterministic.

All three datasets are tested with 10 seeds each (

5 \times 3 \times 10 = 150

runs).

5.3. Experiment 2: Post-Optimization Ablation

The marginal value of the Boundary Relocate and Cross-Depot Repair phases is assessed by comparing the full pipeline against a truncated variant that stops after MMAS. Both variants share identical clustering and seeds, enabling paired comparisons (

3 \times 2 \times 10 = 60

runs).

5.4. Experiment 3a: Priority-Customer Sensitivity

The proportion of hard-priority customers is varied from 0% to 50% in 5% increments. Priority customers are selected uniformly at random per seed. The hard constraint is enforced via a priority-first construction rule that restricts the ant’s candidate set to unvisited priority customers whenever any remain feasible. The full pipeline is executed for each configuration (

3 \times 11 \times 10 = 330

runs).

5.5. Experiment 3b: Priority-Aware vs. Priority-Blind MMAS

To quantify the combined contribution of the three priority mechanisms, each configuration is run twice under identical clustering and seeds. The priority-aware variant uses all three mechanisms: (i) candidate-set restriction to feasible priority customers during ant construction, (ii) lexicographic best-solution selection (unserved priority ≻ total unserved ≻ total time), and (iii) priority-first repair ordering. The priority-blind baseline is a standard MMAS that treats all feasible customers equally during construction, selects solutions by total time only, and repairs without priority ordering. Five priority levels (10–50% in 10% steps) are tested; 0% is excluded as both variants are identical without priority customers. This yields paired comparisons on priority feasibility, unserved priority count, total time, and coverage (

3 \times 5 \times 2 \times 10 = 300

runs).

5.6. Experiment 3c: Isolated Mechanism Ablation

Experiment 3b confirms the combined contribution of the three priority mechanisms, but it does not isolate individual effects. Experiment 3c addresses this by testing each mechanism in isolation. Five variants are evaluated on the Large instance at priority levels 20%, 30%, and 40%: blind (no mechanisms), C_only (candidate-set restriction only), S_only (lexicographic selection only), R_only (priority-first repair ordering only), and aware (all three combined). This yields

5 \times 3 \times 10 = 150

runs.

5.7. Experiments 3d and 3e: Stress Tests for the S_only Variant

Experiment 3c shows that S_only marginally dominates the combined aware variant on coverage in the 20–

40 %

priority window with uniformly random priority assignment. Two stress experiments verify whether this advantage extends to operational regimes that the design of Experiment 3c does not exercise.

Experiment 3d (extended priority range). Same five variants as Experiment 3c, evaluated on the Large instance at priority levels

50 %

,

60 %

,

70 %

,

80 %

over 10 seeds (

5 \times 4 \times 10 = 200

runs). Tests whether S_only retains feasibility when the priority load is high enough that the unmodified MMAS construction frequently produces no priority-feasible solution.

Experiment 3e (spatially clustered priority). Same five variants on the Large instance at priority levels

10 %

,

20 %

,

30 %

, but priority customers are sampled exclusively from the territory of the dominant TC (the depot serving the largest customer share,

66 / 150

in our case), rather than uniformly across all customers. This simulates an outage or campaign concentrated in one geographic zone.

5 \times 3 \times 10 = 150

runs.

5.8. Experiment 4: Assignment Flexibility

Three assignment scenarios are compared at 0% priority:

Baseline: algorithmic depot assignment (DBSCAN-Hierarchical) with cross-depot post-optimization.
Center-Fixed: customers locked to their pre-assigned depot; four independent single-depot subproblems.
Team-Fixed: customers locked to a specific (depot, vehicle) pair; one route per team with no inter-vehicle flexibility.

These scenarios represent a decreasing gradient of flexibility (

3 \times 3 \times 10 = 90

runs).

5.9. Experiment 5: Priority × Flexibility Interaction

A full factorial design crosses the three assignment scenarios with six priority levels (0–50% in 10% steps) across all datasets (

3 \times 6 \times 3 \times 10 = 540

runs). This design estimates both main effects and their interaction, testing whether flexibility becomes more or less valuable as priority constraints tighten.

5.10. Experiment 6: Threshold Sensitivity

Question: Are the pipeline’s reported gains driven by implicit tuning of fixed thresholds, or are they robust across reasonable parameter ranges? Design: A one-at-a-time (OAT) sensitivity sweep over the seven main thresholds: the DBSCAN radius multiplier (around the data-driven default),

minPts

, the Boundary Relocate filter parameters

θ

and

Δ_{min}

, and the MMAS exploration coefficients

α

,

β

,

ρ

. For each parameter we sweep four–five levels around the default while holding all other parameters fixed, and report the resulting variation in total time relative to the seed-to-seed noise floor estimated from a baseline pool of 10 independent runs at the default configuration. We complement the descriptive analysis with a Kruskal–Wallis test across levels per parameter. The Medium dataset is used (5 seeds per level) to keep computational cost bounded; the headline pipeline behaviour scales monotonically with instance size, so trends observed on Medium transfer to Small and Large.

5.11. Summary

Table 5 summarizes the experimental design.

6. Results and Discussion

6.1. Experiment 1: Clustering Method Comparison

Table 6 summarizes the full-pipeline results for the five clustering methods.

All five methods achieve ≥99% coverage on the Small and Medium instances. On the Large instance, where fleet utilization approaches saturation, DBSCAN-Hierarchical reaches

97.4 %

coverage, substantially higher than K-Means (Eucl.) (

87.5 %

,

+ 9.9

percentage points), K-Means (Net.) (

86.7 %

,

+ 10.7

percentage points), K-Medoids (

80.4 %

,

+ 17.0

percentage points), and Sweep (

74.5 %

,

+ 22.9

percentage points). A Friedman test across the five methods confirms that these differences are statistically significant (

χ^{2} = 38.7

, df

= 4

,

p < 0.001

), with Kendall’s

W = 0.97

indicating near-perfect rank consistency across the 10 seeds (a large effect by Cohen’s conventions). Post-hoc pairwise Wilcoxon signed-rank tests with Holm–Bonferroni correction show that every pair of methods differs significantly (all adjusted

p < 0.05

, effect size

r = 0.75

–

0.91

, all classified as large).

The two K-Means variants differ only in the metric used by the centroid algorithm: the Euclidean variant operates on raw

(lat, lon)

coordinates, while the Network variant operates on a classical-MDS embedding of the same road-distance matrix used by our pipeline, eliminating the metric bias of the Euclidean baseline. Their large-instance coverage differs by only

0.8

percentage points (median paired

Δ = 0.67

pp, Wilcoxon

r = 0.75

); both remain over 10 percentage points below DBSCAN-Hierarchical. This rules out the explanation that the proposed method’s advantage stems from comparing against a metrically-biased baseline.

The Routes column reveals the mechanism behind this advantage: on the Large instance, DBSCAN-Hierarchical activates all 13 available vehicles, whereas K-Means (Eucl.) uses

11.1

, K-Means (Net.)

11.0

, K-Medoids

10.0

, and Sweep only

9.0

. By producing topology-aware clusters that match the road network and capacity-balanced merging, DBSCAN-Hierarchical distributes workload across the full fleet, enabling it to reach peripheral customers that centroid-based methods leave unserved.

The silhouette–coverage trade-off is evident in Figure 2c: both K-Means variants and K-Medoids produce the most geometrically compact clusters (silhouette

\approx 0.40

–

0.43

, panel (b)) but leave peripheral customers unservable within the 8-h window, as shown by their coverage collapse on the Large instance in panel (a). DBSCAN-Hierarchical’s lower silhouette (≈0.20) reflects topology-aware groupings that follow the road network, yielding the most graceful scalability (

100 % \to 100 % \to 97.4 %

) compared to K-Medoids (

100 % \to 100 % \to 80.4 %

) and Sweep (

100 % \to 99 % \to 74.5 %

), as illustrated by the diverging curves in Figure 2a. Importantly, the silhouette–coverage inversion persists when the road-distance metric is used inside K-Means itself (Network variant), confirming that the pathology lies in the centroid-based grouping principle rather than in the input metric.

6.2. Experiment 2: Post-Optimization Ablation

Table 7 compares the full pipeline against a variant truncated after MMAS, reporting solution quality, fleet utilization, and the execution time of each pipeline phase.

The number of routes is identical in both variants, post-optimization does not alter fleet size. On Small and Medium, post-optimization saves 10–12 min of total route time without changing coverage (Wilcoxon

p < 0.01

for both,

r = 0.94

on Small and

r = 0.89

on Medium, both classified as large effects by Cohen’s conventions). On the Large instance, total time increases by

103.2

min because the cross-depot repair rescues

1.6

previously unserved customers (Wilcoxon

p < 0.01

,

r = 0.89

, large effect; Cohen’s

d = 2.29

), trading a modest time increase for

+ 1.1

percentage points of coverage on the most capacity-constrained instance.

Table 8 details the wall-clock time contribution and objective change of each pipeline stage. The MMAS dominates runtime across all sizes (86–

96 %

of total wall time), while clustering and post-optimization together add at most

0.7

s and

0.3

s, respectively. The objective improvement attributable to each stage is reported as the reduction in total route time (negative = improvement) and the gain in coverage. Clustering drives coverage by determining fleet utilization; MMAS produces feasible routes; post-optimization trims residual time and rescues unserved customers on the hardest instance.

6.3. Experiment 3a: Priority-Customer Sensitivity

Across all 330 runs, the pipeline achieves 100% priority feasibility: every priority customer is served in every run. Table 9 reports coverage and total route time across all priority levels.

Figure 3a shows that total route time grows approximately linearly with the priority proportion across all three datasets, while panel (b) confirms that coverage remains stable: on Small and Medium it stays at 100% throughout, and on Large it declines by only

3.1

percentage points (from

97.4 %

to

94.3 %

), with all unserved customers being non-priority. The cost of priority grows approximately linearly with the priority proportion, remaining below 5% even at 50% priority.

6.4. Experiment 3b: Priority-Aware vs. Priority-Blind MMAS

Table 10 reports the paired comparison on the Large instance.

On Small and Medium, both variants achieve full priority service in all seeds at all priority levels (all 10/10), the repair phase alone suffices when fleet capacity is ample. Table 10 therefore reports only the Large instance, where the variants diverge.

The Blind variant produces identical solutions regardless of the priority proportion (coverage

= 93.0 %

, route time

= 5740.7

min at every level), because it ignores priority tags entirely during construction; only the random seed determines its output. By contrast, the Aware variant adapts its construction and solution selection to the priority structure, yielding

94.3

–

95.5 %

coverage. The coverage difference is statistically significant at all priority levels (Wilcoxon

p < 0.02

, Cohen’s

d = 0.99

–

2.02

; Wilcoxon effect size

r = 0.80

–

0.89

, all classified as large).

At 10% priority, only 3 out of 10 Blind seeds serve every priority customer; at ≥40%, none do, leaving up to

4.9

priority customers unserved on average. The Aware variant achieves 10/10 across all levels, at a route-time overhead of 3.6–4.6%.

Remark. The comparison bundles three mechanisms (candidate-set restriction, lexicographic selection, priority-first repair). The observed improvement therefore reflects their combined effect; isolating individual contributions is left to future work.

Figure 4a shows the unserved priority count for both variants across priority levels: the Blind variant leaves up to

4.9

customers unserved on average at 50% priority, while the Aware variant consistently reaches zero. Panel (b) quantifies the route-time overhead of priority-awareness (

+ 208

–262 min), which panel (c) contextualises as a by-product of serving more customers rather than of inefficiency, given that the Aware variant also achieves between

+ 1.3

and

+ 2.5

percentage points of additional coverage at every priority level.

6.5. Experiment 3c: Isolated Mechanism Ablation

Experiment 3b establishes that the three priority mechanisms together are necessary for feasibility, but it does not identify the individual contribution of each. Experiment 3c tests each mechanism in isolation to attribute the gains. Five variants are compared on the Large instance (n = 150) at priority levels 20%, 30%, and 40% over 10 seeds each (150 runs total): blind (no mechanisms), C_only (candidate-set restriction), S_only (lexicographic selection), R_only (priority-first repair), and aware (all three combined).

Table 11 reports feasibility counts and coverage per variant and priority level.

Three findings stand out. First, mechanism C alone suffices for full feasibility: enforcing priority-first candidate-set restriction during MMAS construction achieves 10/10 feasibility at every priority level. Second, mechanism S provides the largest coverage gain on the tested cells: lexicographic selection alone yields

97.0

–

97.3 %

coverage,

+ 4.0

pp above blind and

+ 2.5

pp above the full aware variant, at an equivalent

3.6

–

3.7 %

overhead. This is because S screens entire solutions on priority feasibility before accepting them, which incidentally retains the solutions covering the most regular customers. Third, mechanism R alone is insufficient: priority-first repair achieves only 1–4 feasible seeds, performing only marginally above blind, confirming that repair alone cannot compensate for a construction phase that ignores priority structure.

The apparent dominance of S_only over the combined aware variant on the cells of Experiment 3c warrants a careful interpretation. Two stress experiments (3d and 3e, Section 6.6) directly probe regimes outside the 20–

40 %

uniform-priority window of Experiment 3c, and the results identify the operational boundary at which S_only’s coverage advantage breaks down.

6.6. Experiments 3d and 3e: Stress Tests for the S_only Variant

Table 12 and Table 13 report the stress experiments described in Section 5.7.

Findings on Experiment 3d. S_only’s feasibility degrades monotonically with priority load, from

10 / 10

at

50 %

to

9 / 10

at

60 %

,

7 / 10

at

70 %

, and

3 / 10

at

80 %

. The combined aware variant retains

10 / 10

feasibility at 50–

70 %

and

9 / 10

at

80 %

. The feasibility gap between S_only and aware widens from 0 seeds at

50 %

to

- 6

seeds at

80 %

. The coverage advantage of S_only over aware compresses in parallel, from

+ 2.6

pp at

50 %

to

+ 0.7

pp at

80 %

. This is the predicted failure mode: at high priority loads, the unmodified MMAS construction underlying S_only fails to produce a priority-feasible solution on a non-trivial fraction of seeds, and S has nothing to filter on. Mechanism C, by enforcing priority-first candidate-set restriction during construction, preserves feasibility in this regime; this is what aware delivers and S_only does not.

Findings on Experiment 3e. Under spatial clustering of priority customers in the dominant TC, both S_only and aware retain

10 / 10

feasibility at all tested priority levels (10–

30 %

); S_only retains a small coverage advantage of approximately

+ 1

pp. Spatial clustering at moderate priority loads alone does not break S_only on this configuration, although the residual blind feasibility (3–

8 / 10

) is markedly worse than under uniform sampling at the same priority levels in Experiment 3c (compare

1 / 10

blind feasibility at

20 %

uniform vs.

4 / 10

at

20 %

clustered—the difference reflects the smaller absolute number of priority customers needed to be served when they are concentrated in a single TC’s territory).

Implication for the proposed configuration. The two stress experiments together resolve the interpretation of Experiment 3c. (i) Under high priority load (Experiment 3d), S_only loses up to $6 / 10$ seeds in feasibility while aware holds at 9–

10 / 10

. (ii) Under spatial clustering at moderate priority load (Experiment 3e), the two variants are tied on feasibility and S_only’s coverage advantage is small. The combined aware variant is therefore retained as the proposed pipeline configuration: it absorbs a coverage cost of 1–

2.5

pp in benign regimes (Experiment 3c) in exchange for measured robustness under high-load priority distributions (Experiment 3d). S_only remains a competitive, lower-overhead variant for deployments with empirically bounded priority loads (typically ≤50%); we discuss this explicitly in the conclusion as a deployment option.

6.7. Experiment 4: Assignment Flexibility

Table 14 compares the three assignment scenarios at 0% priority.

On the Large instance, the Baseline achieves

97.4 %

coverage versus

85.5 %

for Center-Fixed (

+ 11.9

percentage points) and

81.3 %

for Team-Fixed (

+ 16.1

percentage points). A Friedman test confirms significant differences (

χ^{2} = 20.0

,

p < 0.001

, Kendall’s

W = 1.00

, large effect with perfect rank consistency across seeds), and all pairwise comparisons are significant after Holm–Bonferroni correction (adjusted

p < 0.01

, Wilcoxon

r = 0.89

–

0.92

, all large effects). The constrained scenarios show lower total time, but only because they serve fewer customers. The coverage gap between Baseline and Team-Fixed grows from 0 (Small) to

5.6

(Medium) to

16.1

percentage points (Large), confirming that the value of flexible assignment increases with fleet saturation.

Figure 5a makes the coverage–flexibility relationship explicit: on the Small instance all three scenarios overlap at 100%, but the lines diverge progressively as the instance grows, with Team-Fixed trailing the Baseline by

16.1

percentage points on Large. Panel (b) shows that the constrained scenarios report lower total operational time precisely because they serve fewer customers, a misleading advantage that the coverage panel corrects.

6.8. Experiment 5: Priority × Flexibility Interaction

Table 15 reports coverage and priority feasibility on the Large instance across all scenario–priority combinations.

Priority and assignment rigidity interact super-additively. The coverage drop from 0% to 50% priority is

3.1

percentage points for Baseline but

4.3

percentage points for Center-Fixed. The lower panel counts in how many of the 10 seeds every priority customer was served (“—” at 0% indicates no priority customers exist). Baseline serves all priority customers in every seed at all priority levels (10/10). Center-Fixed fails in 1 seed at 50% priority (9/10). Team-Fixed degrades sharply: at 40% priority it fails in 1 seed, and at 50% it fails in 8 out of 10 seeds (only 2/10), while also achieving only

78.6 %

coverage.

Figure 6a makes the super-additive interaction visible: the Baseline coverage curve is nearly flat as priority rises (a drop of only

3.1

percentage points from 0% to 50%), whereas the Team-Fixed curve steepens sharply, reaching

78.6 %

at 50% priority. Panel (b) shows the corresponding total-time trajectories; Team-Fixed’s lower time again reflects reduced coverage rather than routing efficiency, and its steep rise at high priority levels indicates the growing cost of forced feasibility under rigid assignment.

6.9. Experiment 6: Threshold Sensitivity

We assess robustness of the pipeline’s performance to its fixed thresholds by sweeping each of the seven parameters described in Section 5.10 one-at-a-time around its default value, holding the others fixed. The seed-noise floor at the default configuration is

σ_{seed} = 9.54

min on a mean total time of

4044.6

min (coefficient of variation

0.24 %

across the 10 baseline seeds). Table 16 reports for each parameter the range of mean total times observed across levels, normalised by

σ_{seed}

, together with the Kruskal–Wallis p-value testing whether levels differ. A ratio

Range / σ_{seed} ≲ 1

indicates that the parameter has no effect distinguishable from random seed variation; values

p > 0.05

confirm that no statistically significant level effect is detected.

Three findings emerge. First, the two clustering thresholds (DBSCAN

ε

multiplier and

minPts

) produce identical depot assignments across the swept range, hence zero variation in total time, because the data-driven default

ε

formula and the capacity-constrained merging absorb local perturbations of the radius; extreme settings outside this range do break the merging pipeline (recorded as failures in the raw results), but those configurations lie far from any reasonable choice and are not in the table. Second, three of the remaining five parameters (MMAS

α

, MMAS

ρ

, in addition to the two clustering thresholds) have

Range / σ_{seed} \leq 1

and Kruskal–Wallis

p > 0.05

: their effect on total time is empirically indistinguishable from random seed variation, so the corresponding default choices contribute no measurable advantage. Third, the Boundary Relocate filter parameters

θ

and

Δ_{min}

do produce a statistically detectable effect (

p = 0.037

and

p = 0.019

), but the practical magnitude is small: the worst-to-best spread is

24.4

min on

θ

and

33.4

min on

Δ_{min}

, that is,

2.6

to

3.5 \times

the seed-noise floor and below

1 %

of the mean total time. MMAS

β

shows a comparable spread but no significant level effect (

p = 0.18

). Even the most sensitive thresholds therefore move the objective by less than the seed-to-seed difference between two clustering methods on the same instance, confirming that the pipeline’s headline gains are not the product of implicit tuning over the single applied case.

A confirmation run of the same OAT protocol on the Small dataset reproduces the qualitative finding: the seed-noise floor is even tighter (

σ_{seed} = 1.27

min on a baseline mean of

2751.8

min,

CV = 0.05 %

), and none of the seven parameters produces a statistically significant level effect (all Kruskal–Wallis

p > 0.05

). Absolute parameter-induced ranges remain below 13 min, that is, below

0.5 %

of the mean total time on this instance. The pipeline is therefore robust to threshold choices across instance scales.

Figure 7 visualises these findings. Panel (a) is a tornado plot of

Range / σ_{seed}

per parameter, with bars below the dashed

σ_{seed}

line falling within seed-noise; bars are coloured red where Kruskal–Wallis detects a level effect (

p \leq 0.05

) and grey otherwise. Panel (b)–(h) plot the per-parameter response curves with the baseline

\pm σ_{seed}

band shaded: five of the seven response curves remain inside the noise band over the entire swept range, and the two parameters with detectable level effects (

θ

,

Δ_{min}

) move the objective by less than

1 %

of the baseline mean.

6.10. Discussion

The six experiments yield a coherent picture of how clustering, optimization, priority handling, and assignment flexibility jointly determine service quality. We organize the discussion around cross-cutting findings, then address limitations.

6.10.1. Clustering Drives Scalability

DBSCAN-Hierarchical’s topology-aware clusters enable full fleet utilization (13/13 vehicles on Large), which is the proximate cause of its

+ 22.9

percentage points coverage advantage over Sweep. The silhouette–coverage inversion in Figure 2c demonstrates that geometric compactness is a misleading proxy for operational quality in road-network settings. All coverage differences on the Large instance are statistically significant (Friedman

p < 0.001

; all pairwise Holm-corrected

p < 0.02

).

6.10.2. Post-Optimization Adapts to Saturation

Post-optimization saves 10–12 min on unsaturated instances (Small, Medium) and rescues

1.6

unserved customers where fleet capacity is tight (Large), with both effects statistically significant (

p < 0.01

). Its role thus shifts from route smoothing to coverage recovery as the problem becomes harder.

6.10.3. Priority Robustness Requires All Three Mechanisms

The pipeline achieves 100% priority feasibility across all 330 tested configurations (Exp. 3a; Figure 3b), at a route-time cost below 5%. Exp. 3b shows this guarantee is necessary: without the three priority mechanisms, feasibility collapses to 0/10 seeds on the Large instance at ≥40% priority (Figure 4a; Wilcoxon

p < 0.02

for coverage at every level, Figure 4c). Exp. 3c isolates individual contributions: mechanism C (candidate-set restriction) alone achieves full feasibility; mechanism S (lexicographic selection) delivers the largest coverage gain (

97.0

–

97.3 %

,

+ 4

pp above blind); mechanism R (repair ordering) alone is insufficient (1–4/10 feasible seeds). Exps. 3d and 3e probe the operational boundary of the lower-overhead S_only variant: at high priority loads (Exp. 3d,

80 %

priority), S_only’s feasibility degrades to

3 / 10

seeds while the combined aware variant retains

9 / 10

, confirming that mechanism C’s structural guarantee is necessary at high load and not just defensive theory. Under spatially clustered priority (Exp. 3e), both variants retain full feasibility on the tested levels. The combined framework fuses these guarantees at a stable overhead below 5%.

6.10.4. Flexibility and Priority Interact Super-Additively

Experiments 1 and 4 together demonstrate that both clustering quality and assignment flexibility are needed: neither alone is sufficient (Figure 2a; Figure 5a). Exp. 5 reveals that priority constraints amplify the damage of rigid assignment (Figure 6a,b). The coverage drop from 0% to 50% priority is only

3.1

percentage points under flexible assignment but

4.3

percentage points under Center-Fixed. Team-Fixed at 50% priority collapses to 2/10 seeds with full priority service, the combined stress exceeds what either constraint imposes alone. These results show that the proposed framework’s value increases precisely when operational demands are highest.

6.10.5. Computational Performance

All experiments complete within practical time limits: the full pipeline executes in

2.4

s for the Small instance (

n = 55

),

7.8

s for Medium (

n = 90

), and

28.0

s for Large (

n = 150

). DBSCAN-Hierarchical clustering itself adds negligible overhead (<0.6 s). The Baseline consistently deploys

7.4

,

11.0

, and

13.0

routes on Small, Medium, and Large, respectively. These timings confirm suitability for operational deployment, where daily route plans are computed during morning planning sessions.

6.10.6. Comparison with Current Practice

Relative to the company’s manual Team-Fixed approach, the proposed pipeline improves coverage by

+ 16.1

percentage points on the Large instance (

97.4 %

vs.

81.3 %

) and achieves full priority service in all 10 seeds where Team-Fixed succeeds in only 2 out of 10. All computations complete in under 30 s per instance.

6.10.7. Limitations

Several limitations should be noted, and we organize them by which stage of the pipeline they affect. First, the two-phase geographic merging procedure (Stage 1, Algorithm 1, lines 6–7) is tailored to the four-depot configuration of Algérie Telecom in the Guelma region, which separates along an east–west and a north–south axis. Applying the pipeline to MDVRP instances with a different number of depots, depot layouts that do not separate along orthogonal geographic axes, or service areas with strongly non-convex boundaries would require substituting these two steps with a generic capacity-constrained agglomerative merger on the medoid distance matrix

\bar{D}

; the remainder of Stage 1 (DBSCAN density clustering, medoid computation, k-NN noise assignment, capacity rebalancing) is domain-general. Second, Stages 2 and 3 (the priority-aware MMAS construction, the lexicographic acceptance rule, and the Boundary Relocate operator) are independent of the depot layout and apply to any MDVRP-PR instance, but their broader benchmark validation against published MDVRP metaheuristics is left to future work, as standard MDVRP benchmarks do not include priority constraints or the service-time structure inherent to our problem. Third, the MMAS hyperparameters (

α

= 1,

β

= 2,

ρ

= 0.02, 100 iterations) were fixed at the canonical MMAS defaults of Stützle and Hoos [30] and not tuned per instance, which may leave room for further improvement. Fourth, priority customers are selected uniformly at random in most of our experiments; Experiment 3e probes spatially clustered priority within a single TC’s territory, but other clustering patterns (multi-zone outbreaks, temporally bunched arrivals) are not exercised here. Fifth, the model is deterministic and static; stochastic travel times, dynamic customer arrivals, and vehicle breakdowns are not considered. Sixth, per-depot MMAS instances are solved sequentially; parallelizing across depots would reduce wall-clock time roughly proportionally. The evaluation accordingly reports an applied case study rather than a domain-general benchmark, and the internal ablation design (2135 runs across controlled variants) provides rigorous evidence for each proposed component within that scope.

7. Conclusions

This paper addressed the Multi-Depot Vehicle Routing Problem with Priority Requests (MDVRP-PR), a practically motivated variant in which a subset of customers must be guaranteed service while overall coverage is maximized subject to duration-constrained routes and an asymmetric, real-world road network. The problem was formalized with a lexicographic objective that encodes the operational priority hierarchy of a utility company (Algérie Télécom, Guelma, Algeria), and a three-stage pipeline was developed to solve it: hybrid DBSCAN-Hierarchical clustering for topology-aware depot assignment, an Enhanced Max-Min Ant System (MMAS) for per-depot route construction with integrated priority-aware mechanisms, and a Boundary Relocate post-optimization stage with global cross-depot repair.

Across 2135 controlled runs, four findings stand out. Topology-aware clustering drives scalability: on the Large instance (n = 150), the proposed method achieves

97.4 %

coverage by activating the full 13-vehicle fleet, against

87.5 %

for K-Means with Euclidean coordinates,

86.7 %

for the unbiased K-Means variant operating on the same road-distance metric as our method,

80.4 %

for K-Medoids on road distance, and

74.5 %

for the MDVRP-adapted Sweep baseline (Friedman p < 0.001, Kendall’s W = 0.97; all pairwise Holm-corrected p < 0.05, Wilcoxon r = 0.75–

0.91

). The silhouette–coverage inversion across these methods confirms that geometric cluster compactness is a misleading proxy for operational quality in road-network settings. Post-optimization shifts from route smoothing to coverage recovery as fleet utilization tightens, rescuing

1.6

unserved customers on the hardest instance (p < 0.01). The three priority mechanisms together guarantee

100 %

priority feasibility across all tested configurations, a guarantee that collapses entirely without them (

0 / 10

seeds at

\geq 40 %

priority), at a route-time overhead of only

3.6

–

4.6 %

. Finally, flexibility and priority interact super-additively: rigid Team-Fixed assignment at

50 %

priority degrades to

78.6 %

coverage and

2 / 10

seeds with full priority service, while the proposed framework maintains

97.4 %

coverage with full priority service in all seeds. Against current company practice, the pipeline delivers a

+ 16.1

percentage point coverage gain within 28 s.

The evaluation is an applied case study on a single real instance rather than a domain-general benchmark; the internal ablation design (2135 runs across controlled variants) provides rigorous evidence for each proposed component within that scope. Future work should extend the evaluation to standard MDVRP benchmarks with diverse depot layouts, incorporate automated hyperparameter configuration, and address stochastic travel times and dynamic arrivals to further broaden applicability.

Author Contributions

Conceptualization, Y.B., B.F., B.M., A.-E.B., K.B. and H.S.; Methodology, Y.B., B.F., B.M. and A.-E.B.; Software, Y.B.; Validation, Y.B., B.F., B.M. and A.-E.B.; Formal Analysis, Y.B.; Investigation, Y.B.; Resources, B.M.; Data Curation, Y.B.; Writing—Original Draft Preparation, Y.B., B.F., B.M., A.-E.B., K.B. and H.S.; Writing—Review & Editing, Y.B., B.F., B.M. and A.-E.B.; Visualization, Y.B.; Supervision, Y.B., B.F. and B.M.; Project Administration, Y.B., B.F. and B.M.; Funding Acquisition, B.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The operational data used in this study are the property of Algérie Télécom and are subject to confidentiality restrictions that preclude public release. The data are not publicly available. Aggregated instance characteristics sufficient to understand the experimental setup are reported in Table 4. Researchers wishing to replicate the methodology may contact the corresponding author for further details regarding the experimental protocol, subject to approval by the data owner.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Dantzig, G.B.; Ramser, J.H. The truck dispatching problem. Manag. Sci. 1959, 6, 80–91. [Google Scholar] [CrossRef]
Accorsi, L.; Vigo, D. A fast and scalable heuristic for the solution of large-scale capacitated vehicle routing problems. Transp. Sci. 2021, 55, 832–856. [Google Scholar] [CrossRef]
Christiaens, J.; Vanden Berghe, G. Slack induction by string removals for vehicle routing problems. Transp. Sci. 2020, 54, 417–433. [Google Scholar] [CrossRef]
Cordeau, J.F.; Gendreau, M.; Laporte, G. A tabu search heuristic for periodic and multi-depot vehicle routing problems. Netw. Int. J. 1997, 30, 105–119. [Google Scholar] [CrossRef]
Vidal, T.; Crainic, T.G.; Gendreau, M.; Lahrichi, N.; Rei, W. A hybrid genetic algorithm for multidepot and periodic vehicle routing problems. Oper. Res. 2012, 60, 611–624. [Google Scholar] [CrossRef]
Sluijk, N.; Florio, A.M.; Kinable, J.; Dellaert, N.; Van Woensel, T. Two-echelon vehicle routing problems: A literature review. Eur. J. Oper. Res. 2023, 304, 865–886. [Google Scholar] [CrossRef]
Moghdani, R.; Salimifard, K.; Demir, E.; Benyettou, A. The green vehicle routing problem: A systematic literature review. J. Clean. Prod. 2021, 279, 123691. [Google Scholar] [CrossRef]
Drexl, M. Synchronization in vehicle routing—A survey of VRPs with multiple synchronization constraints. Transp. Sci. 2012, 46, 297–316. [Google Scholar] [CrossRef]
Macrina, G.; Di Puglia Pugliese, L.; Guerriero, F.; Laporte, G. Drone-aided routing: A literature review. Transp. Res. Part C Emerg. Technol. 2020, 120, 102762. [Google Scholar] [CrossRef]
Adamo, T.; Gendreau, M.; Ghiani, G.; Guerriero, E. A review of recent advances in time-dependent vehicle routing. Eur. J. Oper. Res. 2024, 319, 1–15. [Google Scholar] [CrossRef]
Bouchbout, Y.; Benrazek, A.E.; Farou, B.; Molnár, B.; Seridi, H. Optimizing Local Mail Delivery Routes: A Case Study of Algérie Poste. In Proceedings of the 2025 International Conference on Intelligent Computer Systems, Data Science and Applications (IC2SDA); IEEE: Piscataway, NJ, USA, 2025; pp. 1–7. [Google Scholar]
Oyola, J.; Arntzen, H.; Woodruff, D.L. The stochastic vehicle routing problem, a literature review, part II: Solution methods. EURO J. Transp. Logist. 2017, 6, 349–388. [Google Scholar] [CrossRef]
Oyola, J.; Arntzen, H.; Woodruff, D.L. The stochastic vehicle routing problem, a literature review, part I: Models. EURO J. Transp. Logist. 2018, 7, 193–221. [Google Scholar] [CrossRef]
Kumar, S.N. A survey on the vehicle routing problem and its variants. Intell. Inf. Manag. 2012, 4, 66–74. [Google Scholar] [CrossRef]
Gillett, B.E.; Miller, L.R. A heuristic algorithm for the vehicle-dispatch problem. Oper. Res. 1974, 22, 340–349. [Google Scholar] [CrossRef]
Renaud, J.; Laporte, G.; Boctor, F.F. A tabu search heuristic for the multi-depot vehicle routing problem. Comput. Oper. Res. 1996, 23, 229–235. [Google Scholar] [CrossRef]
Ho, W.; Ho, G.T.; Ji, P.; Lau, H.C. A hybrid genetic algorithm for the multi-depot vehicle routing problem. Eng. Appl. Artif. Intell. 2008, 21, 548–557. [Google Scholar] [CrossRef]
Azad, A.S.; Islam, M.; Chakraborty, S. A heuristic initialized stochastic memetic algorithm for MDPVRP with interdependent depot operations. IEEE Trans. Cybern. 2017, 47, 4302–4315. [Google Scholar] [CrossRef]
Mancini, S. A real-life multi depot multi period vehicle routing problem with a heterogeneous fleet: Formulation and adaptive large neighborhood search based matheuristic. Transp. Res. Part C Emerg. Technol. 2016, 70, 100–112. [Google Scholar] [CrossRef]
Lim, H.; Lee, G.M.; Singgih, I.K. Multi-depot split-delivery vehicle routing problem. IEEE Access 2021, 9, 112206–112220. [Google Scholar] [CrossRef]
Chen, D.; Yang, Z. Multiple depots vehicle routing problem in the context of total urban traffic equilibrium. J. Adv. Transp. 2017, 2017, 8524960. [Google Scholar] [CrossRef]
Gu, Z.; Zhu, Y.; Wang, Y.; Du, X.; Guizani, M.; Tian, Z. Applying artificial bee colony algorithm to the multidepot vehicle routing problem. Softw. Pract. Exp. 2022, 52, 756–771. [Google Scholar] [CrossRef]
Tao, Y.; Lin, C.; Wei, L. Metaheuristics for A Large-Scale Vehicle Routing Problem of Same-Day Delivery in E-Commerce Logistics System. J. Adv. Transp. 2022, 2022, 8253175. [Google Scholar] [CrossRef]
Wang, Y.; Zhe, J.; Wang, X.; Fan, J.; Wang, Z.; Wang, H. Collaborative multicenter reverse logistics network design with dynamic customer demands. Expert Syst. Appl. 2022, 206, 117926. [Google Scholar] [CrossRef]
Wang, Y.; Zhe, J.; Wang, X.; Sun, Y.; Wang, H. Collaborative multidepot vehicle routing problem with dynamic customer demands and time windows. Sustainability 2022, 14, 6709. [Google Scholar] [CrossRef]
Stodola, P.; Nohel, J. Adaptive ant colony optimization with node clustering for the multidepot vehicle routing problem. IEEE Trans. Evol. Comput. 2022, 27, 1866–1880. [Google Scholar] [CrossRef]
Yu, B.; Ma, N.; Cai, W.; Li, T.; Yuan, X.; Yao, B. Improved ant colony optimisation for the dynamic multi-depot vehicle routing problem. Int. J. Logist. Res. Appl. 2013, 16, 144–157. [Google Scholar] [CrossRef]
Xu, H.; Pu, P.; Duan, F. A hybrid ant colony optimization for dynamic multidepot vehicle routing problem. Discret. Dyn. Nat. Soc. 2018, 2018, 3624728. [Google Scholar] [CrossRef]
Santini, A.; Schneider, M.; Vidal, T.; Vigo, D. Decomposition strategies for vehicle routing heuristics. INFORMS J. Comput. 2023, 35, 543–559. [Google Scholar] [CrossRef]
Stützle, T.; Hoos, H.H. MAX–MIN ant system. Future Gener. Comput. Syst. 2000, 16, 889–914. [Google Scholar] [CrossRef]
Ting, C.J.; Chen, C.H. Combination of multiple ant colony system and simulated annealing for the multidepot vehicle-routing problem with time windows. Transp. Res. Rec. 2008, 2089, 85–92. [Google Scholar] [CrossRef]
Niu, M.; Liu, R.; Wang, H. A max-min ant system based on decomposition for the multi-depot cumulative capacitated vehicle routing problem. In Proceedings of the 2021 IEEE Congress on Evolutionary Computation (CEC); IEEE: Piscataway, NJ, USA, 2021; pp. 620–627. [Google Scholar]
Cordeau, J.F.; Gendreau, M.; Laporte, G.; Potvin, J.Y.; Semet, F. A guide to vehicle routing heuristics. J. Oper. Res. Soc. 2002, 53, 512–522. [Google Scholar] [CrossRef]
Wu, S.; Jin, C.; Bo, H. Exact solution of workload consistent vehicle routing problem with priority distribution and demand uncertainty. Comput. Ind. Eng. 2025, 202, 110940. [Google Scholar] [CrossRef]
Bock, S. Vehicle routing for connected service areas—A versatile approach covering single, hierarchical, and bi-criteria objectives. Eur. J. Oper. Res. 2024, 313, 905–925. [Google Scholar] [CrossRef]
Ester, M.; Kriegel, H.P.; Sander, J.; Xu, X. A density-based algorithm for discovering clusters in large spatial databases with noise. In Kdd; AAAI: Palo Alto, CA, USA, 1996; Volume 96, pp. 226–231. [Google Scholar]
Clarke, G.; Wright, J.W. Scheduling of vehicles from a central depot to a number of delivery points. Oper. Res. 1964, 12, 568–581. [Google Scholar] [CrossRef]
Dorigo, M.; Stützle, T. Ant colony optimization, 2004. Mass. Inst. Technol. 2004, 10. [Google Scholar]
Dorigo, M.; Birattari, M.; Stutzle, T. Ant colony optimization. IEEE Comput. Intell. Mag. 2006, 1, 28–39. [Google Scholar] [CrossRef]
Kytöjoki, J.; Nuortio, T.; Bräysy, O.; Gendreau, M. An efficient variable neighborhood search heuristic for very large scale vehicle routing problems. Comput. Oper. Res. 2007, 34, 2743–2757. [Google Scholar] [CrossRef]
Geetha, S.; Poonthalir, G.; Vanathi, P. Nested particle swarm optimisation for multi-depot vehicle routing problem. Int. J. Oper. Res. 2013, 16, 329–348. [Google Scholar] [CrossRef]
Tillman, F.A. The multiple terminal delivery problem with probabilistic demands. Transp. Sci. 1969, 3, 192–204. [Google Scholar] [CrossRef]

Figure 1. The overall pipeline of our proposed three-stage approach.

Figure 2. Experiment 1 results: (a) customer coverage by method and dataset, (b) clustering quality (silhouette), (c) silhouette–coverage trade-off showing that compact clusters do not translate to better coverage.

Figure 3. Experiment 3a: (a) total route time and (b) coverage as a function of priority proportion (shaded bands =

\pm 1 σ

).

Figure 3. Experiment 3a: (a) total route time and (b) coverage as a function of priority proportion (shaded bands =

\pm 1 σ

).

Figure 4. Experiment 3b: Aware vs. Blind comparison—(a) unserved priority customers, (b) time cost of priority-awareness (

Δ

total time), (c) coverage across priority levels.

Figure 4. Experiment 3b: Aware vs. Blind comparison—(a) unserved priority customers, (b) time cost of priority-awareness (

Δ

total time), (c) coverage across priority levels.

Figure 5. Experiment 4: (a) service coverage and (b) total operational time by assignment flexibility scenario across dataset sizes.

Figure 6. Experiment 5: (a) coverage and (b) total operational time vs. priority proportion by assignment scenario (Large instance,

n = 150

).

Figure 6. Experiment 5: (a) coverage and (b) total operational time vs. priority proportion by assignment scenario (Large instance,

n = 150

).

Figure 7. Experiment 6: One-at-a-time threshold sensitivity on the Medium instance. (a) tornado plot of

Range / σ_{seed}

per parameter; bars below the dashed line lie within the seed-noise floor. (b–h) per-parameter response curves (mean ± std across 5 seeds) with the baseline

\pm σ_{seed}

band shaded in blue. KW p-values shown in panel titles.

Figure 7. Experiment 6: One-at-a-time threshold sensitivity on the Medium instance. (a) tornado plot of

Range / σ_{seed}

per parameter; bars below the dashed line lie within the seed-noise floor. (b–h) per-parameter response curves (mean ± std across 5 seeds) with the baseline

\pm σ_{seed}

band shaded in blue. KW p-values shown in panel titles.

Table 2. Sets, parameters, and decision variables.

Sets
$K = {1, \dots, K}$	Set of depots; $o_{k}$ denotes the location of depot k
$N = {1, \dots, N}$	Set of customers
$P \subseteq N$	Subset of priority (mandatory) customers
$V_{k} = {1, \dots, m_{k}}$	Set of vehicles stationed at depot k
$N_{k}$	Subset of customers assigned to depot k
$S \subseteq N$	Set of served customers ( $P \subseteq S$ )
$L_{k} = N_{k} \cup {o_{k}}$	Node set for depot k (customers plus depot)
Parameters
$t_{i j}$	Travel time from node i to node j (min); asymmetric
$d_{i j}$	Distance from node i to node j (km); asymmetric
$s_{i}$	Service duration at customer i (min); $s_{o_{k}} = 0 \forall k$
$T_{max}$	Maximum route duration (min)
$m_{k}$	Fleet size at depot k
Decision variables
$x_{i j}^{k v} \in {0, 1}$	1 if vehicle v of depot k traverses arc $(i, j)$ ; 0 otherwise
$y_{i}^{k} \in {0, 1}$	1 if customer i is assigned to depot k; 0 otherwise

Table 3. Comparison of inter-depot reassignment operators.

	Cordeau et al. Relocate [4]	Kytöjoki et al. VNS $N_{1}$ [40]	Boundary Relocate (Ours)
Move granularity	Single customer	Single customer/segment	Single customer
Candidate scope	All cust. × all routes	All cust. × all routes	Boundary cust. only
Filter criterion	None (full scan)	None (full scan)	$t (d^{'}, c) / t (d, c)$ < $θ$ and $t (d, c) - t (d^{'}, c)$ > $Δ_{min}$
Acceptance rule	Strict improvement	Strict/first-improvement	Strict improvement, priority feasibility preserved
Per-pass complexity	$O (n \cdot R)$	$O (n \cdot R)$	$O (\| B \| \cdot K)$ , $\| B \| ≪ n$

Table 4. Instance characteristics.

Category	Parameter	Value
Infrastructure	Depots	4
	Vehicles	13
	Fleet per depot	2–5
Customers	Small (S)	55
	Medium (M)	90
	Large (L)	150
Operations	Service time (min)	15–45 (mean ≈ 30)
	Time horizon (min)	480
	Inter-customer distance (km)	0.2–112.7 (mean ≈ 40)
	Inter-customer travel time (min)	1.0–139.6 (mean ≈ 52)

Table 5. Overview of experimental design.

Exp.	Research Question	Factors	Runs
1	Clustering method impact	5 methods × 3 datasets	150
2	Post-optimization value	2 variants × 3 datasets	60
3a	Priority cost	11 levels × 3 datasets	330
3b	Priority-aware vs. blind MMAS	5 levels × 2 variants × 3 datasets	300
3c	Isolated mechanism ablation	5 variants × 3 levels	150
3d	S_only stress: extended priority	5 variants × 4 levels	200
3e	S_only stress: clustered priority	5 variants × 3 levels	150
4	Assignment flexibility	3 scenarios × 3 datasets	90
5	Priority × Flexibility	3 scenarios × 6 levels × 3 datasets	540
6	Threshold sensitivity (OAT)	7 parameters × ~5 levels × 5 seeds	165
Grand total			2135

Table 6. Experiment 1: Full-pipeline results by clustering method (mean ± std, 10 seeds). Bold cells mark the best result per column within each instance size; for Coverage, Silhouette, and Routes higher is better, and for Total Time and the timing columns lower is better. K-Means (Net.) applies K-Means to a classical-MDS embedding of the road-distance matrix, eliminating the metric bias of the Euclidean variant.

Dataset	Method	Total Time (min)	Coverage (%)	Silhouette	Routes	Clust. (s)	Optim. (s)	Total (s)
Small	DBSCAN-Hier.	$2751.8 \pm 1.3$	$100.0$	$0.175$	$7.5$	$0.23$	$2.21$	$2.44$
	K-Means (Eucl.)	$2637.3 \pm 7.4$	$100.0$	$0.333$	$7.0$	$0.01$	$2.00$	$2.01$
	K-Means (Net.)	$2678.5 \pm 5.0$	$100.0$	$0.392$	$8.7$	$0.01$	$2.34$	$2.34$
	K-Medoids	$2662.6 \pm 4.3$	$100.0$	$0.339$	$8.0$	$0.00$	$2.39$	$2.39$
	Sweep	$2593.2 \pm 1.6$	$100.0$	$0.126$	$8.0$	$0.00$	$2.51$	$2.51$
Medium	DBSCAN-Hier.	$4044.6 \pm 9.5$	$100.0$	$0.164$	$11.0$	$0.33$	$7.53$	$7.86$
	K-Means (Eucl.)	$3812.6 \pm 1.7$	$100.0$	$0.372$	$9.1$	$0.01$	$6.87$	$6.88$
	K-Means (Net.)	$4014.8 \pm 15.9$	$100.0$	$0.416$	$9.9$	$0.09$	$8.53$	$8.61$
	K-Medoids	$3874.9 \pm 9.8$	$100.0$	$0.396$	$10.0$	$0.00$	$8.52$	$8.53$
	Sweep	$3879.5 \pm 70.5$	$99.0$	$0.146$	$9.0$	$0.00$	$9.71$	$9.71$
Large	DBSCAN-Hier.	$5943.1 \pm 41.0$	$97.4$	$0.198$	$13.0$	$0.54$	$27.81$	$28.35$
	K-Means (Eucl.)	$5106.6 \pm 142.6$	$87.5$	$0.406$	$11.1$	$0.01$	$28.88$	$28.89$
	K-Means (Net.)	$5102.4 \pm 33.2$	$86.7$	$0.425$	$11.0$	$0.07$	$33.37$	$33.44$
	K-Medoids	$4636.4 \pm 27.2$	$80.4$	$0.406$	$10.0$	$0.01$	$31.18$	$31.19$
	Sweep	$4223.6 \pm 28.7$	$74.5$	$0.164$	$9.0$	$0.00$	$34.14$	$34.14$

Table 7. Experiment 2: Pipeline with vs. without post-optimization (mean over 10 seeds). Bold marks the best value in the comparison.

Dataset	Variant	Total Time (min)	Coverage (%)	Routes	Rescued
Small	Without	$2761.4$	$100.0$	$7.5$	—
Small	With	$2751.8$	$100.0$	$7.5$	$0.0$
Medium	Without	$4056.6$	$100.0$	$11.0$	—
Medium	With	$4044.6$	$100.0$	$11.0$	$0.0$
Large	Without	$5839.9$	$96.3$	$13.0$	—
Large	With	$5943.1$	$97.4$	$13.0$	$1.6$

Table 8. Per-stage timing and objective contribution (mean over 10 seeds, full pipeline with post-optimization).

Δ

Time: change in total route time (min) attributable to the stage.

Δ

Cov: change in coverage (pp) attributable to the stage. Clustering sets coverage by distributing workload; no direct time change is computed for it. Post-opt on Large trades

+ 103

min for

+ 1.1

pp coverage recovery.

Table 8. Per-stage timing and objective contribution (mean over 10 seeds, full pipeline with post-optimization).

Δ

Time: change in total route time (min) attributable to the stage.

Δ

Cov: change in coverage (pp) attributable to the stage. Clustering sets coverage by distributing workload; no direct time change is computed for it. Post-opt on Large trades

+ 103

min for

+ 1.1

pp coverage recovery.

Dataset	Stage	Wall Time (s)	% of Total	Δ Time (min)	Δ Cov (pp)
Small (n = 55)	Clustering	$0.22$	$9.1 %$	—	sets coverage
	MMAS	$2.11$	$87.2 %$	baseline	$100.0$
	Post-opt	$0.09$	$3.7 %$	$- 9.6$	$0.0$
Medium (n = 90)	Clustering	$0.33$	$4.2 %$	—	sets coverage
	MMAS	$7.20$	$92.2 %$	baseline	$100.0$
	Post-opt	$0.28$	$3.6 %$	$- 12.0$	$0.0$
Large (n = 150)	Clustering	$0.53$	$1.9 %$	—	sets coverage
	MMAS	$26.77$	$95.6 %$	baseline	$96.3$
	Post-opt	$0.70$	$2.5 %$	$+ 103.2$	$+ 1.1$

Table 9. Experiment 3a: Coverage and total route time by priority proportion (mean over 10 seeds). Bold marks the best value in each column.

	Coverage (%)						Total Time (min)
Dataset	0%	10%	20%	30%	40%	50%	0%	10%	20%	30%	40%	50%
Small	100.0	100.0	100.0	100.0	100.0	100.0	2743	2752	2789	2814	2807	2804
Medium	100.0	100.0	100.0	100.0	100.0	100.0	4045	4106	4166	4193	4267	4247
Large	97.4	95.5	94.7	94.2	94.5	94.3	5944	5950	5987	5988	6003	5983

Table 10. Experiment 3b: Priority-Aware vs. Priority-Blind on Large (

n = 150

, 10 seeds).

Δ

Route Time is positive because the Aware variant serves more customers, increasing total service and travel time.

Table 10. Experiment 3b: Priority-Aware vs. Priority-Blind on Large (

n = 150

, 10 seeds).

Δ

Route Time is positive because the Aware variant serves more customers, increasing total service and travel time.

			# Seeds All Priority
			Customers Served (/10)
Priority	$Δ$ Route Time (min)	$Δ$ Coverage (%)	Aware	Blind	$Δ$ Pri. Unsrv.
10%	$+ 208.8 \pm 57.4$	$+ 2.5$	10/10	3/10	$- 1.0$
20%	$+ 246.0 \pm 71.3$	$+ 1.7$	10/10	1/10	$- 1.7$
30%	$+ 247.4 \pm 60.6$	$+ 1.2$	10/10	1/10	$- 2.6$
40%	$+ 262.3 \pm 61.7$	$+ 1.5$	10/10	0/10	$- 4.0$
50%	$+ 242.4 \pm 44.7$	$+ 1.3$	10/10	0/10	$- 4.9$

Table 11. Experiment 3c: Isolated mechanism ablation on the Large instance (n = 150), 10 seeds per cell. Feasible = seeds/10 in which all priority customers are served. Coverage = mean customer coverage (%). Time overhead = mean total-time increase vs. blind (%). Mechanisms: C = candidate-set restriction, S = lexicographic selection, R = priority-first repair. Bold marks the best result per comparison group.

		Priority 20%			Priority 30%			Priority 40%
Variant	Mech.	Feas.	Cov.	Ovhd	Feas.	Cov.	Ovhd	Feas.	Cov.	Ovhd
blind	$\cdot \cdot \cdot$	1/10	93.0%	—	1/10	93.0%	—	0/10	93.0%	—
C_only	$C \cdot \cdot$	10/10	90.1%	$+ 3.1 %$	10/10	89.7%	$+ 2.6 %$	10/10	90.8%	$+ 3.8 %$
S_only	$\cdot S \cdot$	10/10	97.3%	$+ 3.6 %$	10/10	97.3%	$+ 3.7 %$	10/10	97.0%	$+ 3.7 %$
R_only	$\cdot \cdot R$	4/10	93.3%	$+ 1.0 %$	3/10	93.7%	$+ 1.9 %$	1/10	93.5%	$+ 0.8 %$
aware	$C S R$	10/10	94.7%	$+ 4.3 %$	10/10	94.2%	$+ 4.3 %$	10/10	94.5%	$+ 4.6 %$

Table 12. Experiment 3d: Extended priority range on the Large instance (n = 150), 10 seeds per cell. Feasible = seeds/10 in which all priority customers are served. Coverage = mean customer coverage (%). Bold marks the best feasibility per priority level.

	Pri. 50%		Pri. 60%		Pri. 70%		Pri. 80%
Variant	Feas.	Cov.	Feas.	Cov.	Feas.	Cov.	Feas.	Cov.
blind	0/10	93.0%	0/10	93.0%	0/10	93.0%	0/10	93.0%
C_only	10/10	90.4%	10/10	91.1%	9/10	92.1%	4/10	92.7%
S_only	10/10	96.9%	9/10	96.8%	7/10	96.9%	3/10	96.6%
R_only	1/10	93.7%	0/10	93.3%	0/10	93.3%	0/10	93.3%
aware	10/10	94.3%	10/10	94.7%	10/10	95.5%	9/10	95.9%

Table 13. Experiment 3e: Spatially clustered priority on the Large instance (n = 150), priority customers drawn exclusively from the dominant TC’s territory (66 candidates), 10 seeds per cell. Bold marks the best result per comparison group.

	Pri. 10%		Pri. 20%		Pri. 30%
Variant	Feas.	Cov.	Feas.	Cov.	Feas.	Cov.
blind	8/10	93.0%	4/10	93.0%	3/10	93.0%
C_only	10/10	91.9%	10/10	90.9%	10/10	90.6%
S_only	10/10	97.3%	10/10	97.4%	10/10	97.2%
R_only	8/10	93.5%	8/10	93.7%	4/10	93.4%
aware	10/10	96.4%	10/10	96.1%	10/10	96.1%

Table 14. Experiment 4: Assignment flexibility (0% priority, mean ± std, 10 seeds). Bold values mark the best result in each column within each instance size; lower Total Time and Travel are not meaningful when Coverage differs, since they reflect fewer customers served.

Dataset	Scenario	Total Time (min)	Coverage (%)	Travel (min)	Routes
Small	Baseline	$2751.8 \pm 1$	$100.0$	$1091.8$	$7.4$
	Center-Fixed	$2670.8 \pm 0$	$100.0$	$1010.8$	$8.2$
	Team-Fixed	$2891.0 \pm 0$	$100.0$	$1231.0$	$12.0$
Medium	Baseline	$4044.6 \pm 10$	$100.0$	$1331.6$	$11.0$
	Center-Fixed	$3830.4 \pm 4$	$98.9$	$1155.4$	$10.0$
	Team-Fixed	$3772.1 \pm 0$	$94.4$	$1234.1$	$12.0$
Large	Baseline	$5944.1 \pm 42$	$97.4$	$1532.3$	$13.0$
	Center-Fixed	$4972.5 \pm 21$	$85.5$	$1153.4$	$11.0$
	Team-Fixed	$4818.7 \pm 4$	$81.3$	$1224.8$	$13.0$

Table 15. Experiment 5: Coverage (%) and priority feasibility rate on Large (

n = 150

). Bold marks the best coverage per scenario.

Table 15. Experiment 5: Coverage (%) and priority feasibility rate on Large (

n = 150

). Bold marks the best coverage per scenario.

Scenario	0%	10%	20%	30%	40%	50%
Coverage (%)
Baseline	97.4	95.5	94.7	94.2	94.5	94.3
Center-Fixed	85.5	84.3	82.8	82.1	81.9	81.2
Team-Fixed	81.3	80.9	79.9	79.3	78.9	78.6
# Seeds with all priority customers served (out of 10)
Baseline	—	10	10	10	10	10
Center-Fixed	—	10	10	10	10	9
Team-Fixed	—	10	10	10	9	2

Table 16. Experiment 6: One-at-a-time threshold sensitivity on the Medium dataset (5 seeds per level; baseline pool of 10 seeds,

σ_{seed} = 9.54

min, mean total time

4044.6

min).

Range / σ_{seed}

expresses the parameter-induced spread in mean total time as a multiple of the seed-noise floor; values

\leq 1

indicate insensitivity within the tested range. Bold parameter names mark thresholds whose induced spread is below the seed-noise floor.

Table 16. Experiment 6: One-at-a-time threshold sensitivity on the Medium dataset (5 seeds per level; baseline pool of 10 seeds,

σ_{seed} = 9.54

min, mean total time

4044.6

min).

Range / σ_{seed}

expresses the parameter-induced spread in mean total time as a multiple of the seed-noise floor; values

\leq 1

indicate insensitivity within the tested range. Bold parameter names mark thresholds whose induced spread is below the seed-noise floor.

Parameter	Default	Levels Swept	Range (min)	$Range / σ_{Seed}$	KW p
DBSCAN $ε$ multiplier	$1.00$	${0.70, 0.85, 1.00, 1.15, 1.30}$	$0.00$	$0.00$	$1.0000$
DBSCAN $minPts$	2	${1, 2, 3, 4}$	$0.00$	$0.00$	$1.0000$
$θ$ (Boundary Reloc.)	$1.25$	${1.05, 1.15, 1.25, 1.35, 1.50}$	$24.44$	$2.56$	$0.0370$
$Δ_{min}$ (Boundary Reloc.)	$5.0$	${2.0, 5.0, 8.0, 10.0, 15.0}$	$33.38$	$3.50$	$0.0193$
MMAS $α$	$1.0$	${0.5, 1.0, 1.5, 2.0}$	$8.01$	$0.84$	$0.2536$
MMAS $β$	$2.0$	${1.0, 2.0, 3.0, 5.0}$	$26.56$	$2.79$	$0.1845$
MMAS $ρ$	$0.020$	${0.005, 0.020, 0.050, 0.100}$	$4.44$	$0.47$	$0.7819$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bouchbout, Y.; Farou, B.; Molnár, B.; Benrazek, A.-E.; Bouafia, K.; Seridi, H. A Three-Stage Approach for the Multi-Depot VRP with Priority Requests. Appl. Sci. 2026, 16, 5188. https://doi.org/10.3390/app16115188

AMA Style

Bouchbout Y, Farou B, Molnár B, Benrazek A-E, Bouafia K, Seridi H. A Three-Stage Approach for the Multi-Depot VRP with Priority Requests. Applied Sciences. 2026; 16(11):5188. https://doi.org/10.3390/app16115188

Chicago/Turabian Style

Bouchbout, Yehya, Brahim Farou, Bálint Molnár, Ala-Eddine Benrazek, Khawla Bouafia, and Hamid Seridi. 2026. "A Three-Stage Approach for the Multi-Depot VRP with Priority Requests" Applied Sciences 16, no. 11: 5188. https://doi.org/10.3390/app16115188

APA Style

Bouchbout, Y., Farou, B., Molnár, B., Benrazek, A.-E., Bouafia, K., & Seridi, H. (2026). A Three-Stage Approach for the Multi-Depot VRP with Priority Requests. Applied Sciences, 16(11), 5188. https://doi.org/10.3390/app16115188

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Three-Stage Approach for the Multi-Depot VRP with Priority Requests

Abstract

1. Introduction

2. Related Work

2.1. Classical MDVRP: Formulations and Benchmark Methods

2.2. Clustering-Based Decomposition for MDVRP

2.3. ACO and MMAS for Multi-Depot Routing

2.4. Priority and Service-Class Modeling in VRP

3. Problem Formulation

3.1. Problem Definition

3.2. Notation

3.3. Mathematical Model

3.4. Problem Characteristics

4. Methodology

4.1. Stage 1: Hybrid DBSCAN-Hierarchical Clustering

Scope of the Geographic Merging Step

4.2. Stage 2: Max-Min Ant System (MMAS)

4.2.1. Construction Phase

Savings-Based Heuristic

Priority-First Construction Rule

4.2.2. Local Search

4.2.3. Repair Phase

4.2.4. Solution Selection

4.2.5. Pheromone Update

4.3. Stage 3: Cross-Depot Post-Optimization

4.3.1. Boundary Relocate

Relation to Classical Inter-Depot Operators

4.3.2. Cross-Depot Repair

4.4. Priority Integration

5. Numerical Experiments

5.1. Problem Instances

Metrics and Protocol

5.2. Experiment 1: Clustering Method Comparison

5.3. Experiment 2: Post-Optimization Ablation

5.4. Experiment 3a: Priority-Customer Sensitivity

5.5. Experiment 3b: Priority-Aware vs. Priority-Blind MMAS

5.6. Experiment 3c: Isolated Mechanism Ablation

5.7. Experiments 3d and 3e: Stress Tests for the S_only Variant

5.8. Experiment 4: Assignment Flexibility

5.9. Experiment 5: Priority × Flexibility Interaction

5.10. Experiment 6: Threshold Sensitivity

5.11. Summary

6. Results and Discussion

6.1. Experiment 1: Clustering Method Comparison

6.2. Experiment 2: Post-Optimization Ablation

6.3. Experiment 3a: Priority-Customer Sensitivity

6.4. Experiment 3b: Priority-Aware vs. Priority-Blind MMAS

6.5. Experiment 3c: Isolated Mechanism Ablation

6.6. Experiments 3d and 3e: Stress Tests for the S_only Variant

6.7. Experiment 4: Assignment Flexibility

6.8. Experiment 5: Priority × Flexibility Interaction

6.9. Experiment 6: Threshold Sensitivity

6.10. Discussion

6.10.1. Clustering Drives Scalability

6.10.2. Post-Optimization Adapts to Saturation

6.10.3. Priority Robustness Requires All Three Mechanisms

6.10.4. Flexibility and Priority Interact Super-Additively

6.10.5. Computational Performance

6.10.6. Comparison with Current Practice

6.10.7. Limitations

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI