MCAH-ACO: A Multi-Criteria Adaptive Hybrid Ant Colony Optimization for Last-Mile Delivery Vehicle Routing

De-Tian Chu; Xin-Yu Cheng; Lin-Yuan Bai; Hai-Feng Ling

doi:10.3390/s26020401

,

and

Field Engineering College, Army Engineering University of PLA, Nanjing 210007, China

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Sensors2026, 26(2), 401;https://doi.org/10.3390/s26020401

This article belongs to the Section Vehicular Sensing

Version Notes

Order Reprints

Abstract

The growing demand for efficient last-mile delivery has made routing optimization a critical challenge for logistics providers. Traditional vehicle routing models typically minimize a single criterion, such as travel distance or time, without considering broader social and environmental impacts. This paper proposes a novel Multi-Criteria Adaptive Hybrid Ant Colony Optimization (MCAH-ACO) algorithm for solving the delivery vehicle routing problem formulated as a Multiple Traveling Salesman Problem (MTSP). The proposed MCAH-ACO introduces three key innovations: a multi-criteria pheromone decomposition strategy that maintains separate pheromone matrices for each optimization objective, an adaptive weight balancing mechanism that dynamically adjusts criterion weights to prevent dominance by any single objective, and a 2-opt local search enhancement integrated with elite archive diversity preservation. A comprehensive cost function is designed to integrate four categories of factors: distance, time, social-environmental impact, and safety. Extensive experiments on real-world data from the Greater Toronto Area demonstrate that MCAH-ACO significantly outperforms existing approaches including Genetic Algorithm (GA), Adaptive GA, and standard Max–Min Ant System (MMAS), achieving 12.3% lower total cost and 18.7% fewer safety-critical events compared with the best baseline while maintaining computational efficiency.

Keywords:

multi-criteria optimization; vehicle routing problem; ant colony optimization; adaptive hybrid algorithm; last-mile delivery

1. Introduction

The COVID-19 pandemic has fundamentally reshaped consumer purchasing behavior, accelerating the shift from in-store to online platforms. In Canada, retail e-commerce sales nearly doubled within three months after the onset of the 2020 pandemic, driving unprecedented demand for fast, convenient, and reliable parcel delivery [1]. Last-mile delivery—the final leg of parcel transport to customer households—is widely recognized as the most critical yet expensive and least efficient component of logistics operations [2,3].

Many large e-commerce and logistics companies continue to employ routing strategies based solely on minimizing travel time or distance [4]. However, the rapidly growing fleet of delivery vehicles contributes significantly to urban congestion and environmental pollution [5]. Consequently, multi-criteria routing strategies that integrate environmental sustainability and safety factors have become essential for responsible logistics planning. This aligns with emerging safety-first planning frameworks that escalate verification under uncertainty to improve robust decision-making in high-risk navigation scenarios [6].

Despite considerable progress in applying meta-heuristic algorithms such as Genetic Algorithms (GA) and Ant Colony Optimization (ACO) to vehicle routing problems [7], existing approaches suffer from several key limitations. Most methods typically optimize a single aggregated objective, failing to balance trade-offs among competing criteria. Furthermore, standard pheromone update mechanisms in ACO may cause premature convergence toward locally optimal but globally suboptimal solutions. The lack of local search refinement also limits solution quality in complex multi-constraint scenarios.

To address these challenges, this paper proposes a novel Multi-Criteria Adaptive Hybrid Ant Colony Optimization (MCAH-ACO) algorithm for solving the delivery vehicle routing problem formulated as a Multiple Traveling Salesman Problem (MTSP). The proposed algorithm introduces three main contributions. First, we develop a multi-criteria pheromone decomposition strategy that maintains separate pheromone matrices for distance, time, social-environmental, and safety objectives, enabling balanced optimization across all criteria. Second, we propose an adaptive weight balancing mechanism that dynamically adjusts criterion weights based on convergence feedback, preventing any single objective from dominating the search. Third, we integrate a 2-opt local search enhancement with an elite archive that preserves solution diversity while accelerating convergence toward high-quality solutions.

Extensive experiments on real-world delivery data from the Greater Toronto Area demonstrate that MCAH-ACO achieves significant improvements over existing baselines, reducing total routing cost by 12.3% and safety-critical events by 18.7% compared with the best-performing baseline algorithm.

3. Problem Formulation and Modeling

Given a pickup location (depot), a set of

n - 1

customer drop-off locations, and m deliverymen, the objective is to minimize the total multi-criteria cost such that each drop-off location is visited exactly once. Let

G = (V, E)

be a directed graph where

V = {v_{0}, v_{1}, \dots, v_{n - 1}}

represents the set of all n nodes (with

v_{0}

being the depot and

{v_{1}, \dots, v_{n - 1}}

being customer locations), and E denotes the set of directed edges connecting all pairs of nodes.

Decision Variables: Let

x_{i, j} \in {0, 1}

be a binary decision variable indicating whether edge

(i, j)

is traversed in the solution.

Cost Function: Each edge

e_{i, j}

is associated with a multi-criteria cost:

c_{i, j} = w_{0} d_{i, j} + w_{1} t_{i, j} + w_{2} (N T S_{i, j} + N T_{i, j} + N I_{i, j} - R C_{i, j}) + w_{3} N C_{i, j}

(1)

where the parameters are defined as follows:

$d_{i, j}$ : distance between nodes i and j (in meters)
$t_{i, j}$ : travel time from node i to j (in seconds), which varies based on road type and traffic conditions
$N T S_{i, j}$ : number of traffic signals along edge $(i, j)$
$N T_{i, j}$ : number of turns required
$N I_{i, j}$ : number of intersections traversed
$R C_{i, j}$ : road capacity factor (higher values indicate better road conditions)
$N C_{i, j}$ : collision risk indicator based on historical accident data
$w_{0}, w_{1}, w_{2}, w_{3}$ : weight coefficients satisfying $\sum_{k = 0}^{3} w_{k} = 1$

Note that distance and time are not strictly proportional in real-world scenarios due to varying speed limits across road types (highways vs. local roads) and traffic congestion patterns.

Objective Function:

min \sum_{i = 0}^{n - 1} \sum_{j = 0}^{n - 1} x_{i, j} c_{i, j}

(2)

Constraints:

(1): Depot departure constraint—exactly m vehicles leave the depot:

$\begin{matrix} \sum_{j = 1}^{n - 1} x_{0, j} & = m \end{matrix}$

(3)
(2): Depot return constraint—exactly m vehicles return to the depot:

$\begin{matrix} \sum_{i = 1}^{n - 1} x_{i, 0} & = m \end{matrix}$

(4)
(3): Customer visit constraint—each customer is visited exactly once:

$\begin{matrix} \sum_{i = 0}^{n - 1} x_{i, j} & = 1, \forall j \in {1, \dots, n - 1} \end{matrix}$

(5)
(4): Flow conservation constraint—each customer is departed from exactly once:

$\begin{matrix} \sum_{j = 0}^{n - 1} x_{i, j} & = 1, \forall i \in {1, \dots, n - 1} \end{matrix}$

(6)
(5): Capacity constraint—each vehicle route $R_{i}$ serves at most Q customers:

$\begin{matrix} | R_{i} | - 2 & \leq Q, \forall i \in {1, \dots, m} \end{matrix}$

(7)

where $R_{i}$ denotes the route (ordered sequence of nodes) assigned to deliveryman i, and Q is the maximum number of customers that can be assigned to a single vehicle.

Assumptions: Service time at each customer location is assumed constant (e.g., 2 min per delivery) and does not affect route optimization. Time windows are not considered in this formulation, as the focus is on demonstrating the multi-criteria optimization framework.

4. Proposed MCAH-ACO Algorithm

This section presents the proposed Multi-Criteria Adaptive Hybrid Ant Colony Optimization (MCAH-ACO) algorithm. As illustrated in Figure 1, MCAH-ACO integrates three novel components to address the limitations of existing approaches.

4.1. Background: Ant Colony Optimization

Ant Colony Optimization (ACO) is a meta-heuristic inspired by the foraging behavior of real ants, where artificial ants construct solutions probabilistically based on pheromone trails and heuristic information. In standard ACO, the probability of ant a at node i selecting the next node j is given by the following:

p_{i j}^{a} = \frac{{[τ_{i j}]}^{α} \cdot {[η_{i j}]}^{β}}{\sum_{l \in N_{i}^{a}} {[τ_{i l}]}^{α} \cdot {[η_{i l}]}^{β}}

(8)

where

τ_{i j}

represents the pheromone intensity on edge

(i, j)

,

η_{i j}

is the heuristic information (typically

1 / d_{i j}

),

α

and

β

control the relative importance of pheromone versus heuristic, and

N_{i}^{a}

is the feasible neighborhood.

The Max-Min Ant System (MMAS) [9] introduces pheromone bounds

[τ_{m i n}, τ_{m a x}]

to prevent stagnation and premature convergence. Pheromone update follows:

τ_{i j} \leftarrow (1 - ρ) τ_{i j} + Δ τ_{i j}^{b e s t}

(9)

where

ρ \in (0, 1)

is the evaporation rate and

Δ τ_{i j}^{b e s t}

is the pheromone deposit from the iteration-best or global-best ant.

While MMAS provides a strong foundation, it maintains only a single pheromone matrix, limiting its ability to effectively balance multiple competing objectives. Our MCAH-ACO extends this framework through the following innovations.

Figure 1. Overall framework of the multi-criteria optimized paths system for delivery vehicles. The framework consists of six main components: (1) Multi-criteria Vehicle Routing Problem formulation with MTSP constraints ensuring each deliveryman starts and returns to the depot; (2) Multi-criteria Cost Function integrating distance, time, social-environmental factors (traffic signals, intersections, road capacity), and safety factors (collisions); (3) Experimental Dataset from the Greater Toronto Area with 1 depot and 19 drop-off points; (4) Point-to-Point Path Calculation using Genetic Algorithm to generate cost and route matrices; (5) MTSP Solvers including GA, Adaptive GA, MMAS, and Adaptive MMAS variants with different optimization mechanisms and (6) Performance Evaluation comparing routing criteria and MTSP solver effectiveness across multiple metrics.

4.2. Multi-Criteria Pheromone Decomposition

Unlike standard ACO, which maintains a single pheromone matrix, MCAH-ACO decomposes the pheromone information into K separate matrices

{τ^{(1)}, τ^{(2)}, \dots, τ^{(K)}}

, where each matrix corresponds to one optimization criterion. For the delivery routing problem, we define

K = 4

matrices for distance (

τ^{(d)}

), time (

τ^{(t)}

), social-environmental (

τ^{(e)}

), and safety (

τ^{(s)}

) objectives.

The combined pheromone value for edge

(i, j)

is computed as follows:

τ_{i j} = \sum_{k = 1}^{K} ω_{k} \cdot τ_{i j}^{(k)}

(10)

where

ω_{k}

denotes the adaptive weight for criterion k, satisfying

\sum_{k = 1}^{K} ω_{k} = 1

.

The transition probability for ant a at node i to select node j follows:

p_{i j}^{a} = \frac{{[τ_{i j}]}^{α} \cdot {[η_{i j}]}^{β}}{\sum_{l \in N_{i}^{a}} {[τ_{i l}]}^{α} \cdot {[η_{i l}]}^{β}}

(11)

where

η_{i j} = 1 / c_{i j}

is the heuristic information based on the multi-criteria cost, and

N_{i}^{a}

is the feasible neighborhood of ant a at node i.

4.3. Adaptive Weight Balancing Mechanism

Static weight assignments often lead to dominance by a single objective, particularly when criterion scales differ significantly. MCAH-ACO employs an adaptive weight-balancing mechanism that adjusts weights based on convergence feedback.

Let

σ_{k}^{(t)}

denote the standard deviation of criterion k values across the elite archive at iteration t. The weight update rule is as follows:

ω_{k}^{(t + 1)} = \frac{ω_{k}^{(t)} \cdot (1 + γ \cdot σ_{k}^{(t)})}{\sum_{j = 1}^{K} ω_{j}^{(t)} \cdot (1 + γ \cdot σ_{j}^{(t)})}

(12)

where

γ > 0

is the adaptation rate. This mechanism increases weights for criteria with higher variance (indicating under-optimization) and decreases weights for well-converged criteria, promoting balanced multi-objective optimization.

4.4. 2-Opt Local Search Enhancement

To accelerate convergence and improve solution quality, MCAH-ACO integrates 2-opt local search after each ant constructs a complete solution. The 2-opt operator reverses a segment of the route and accepts the modification if it reduces the multi-criteria cost:

Δ c = c_{i, j} + c_{i + 1, j + 1} - c_{i, i + 1} - c_{j, j + 1}

(13)

The local search is applied with probability

p_{l s}

to balance computational overhead with solution refinement. We set

p_{l s} = 0.3

based on preliminary experiments.

Choice of 2-opt neighborhood: We select the 2-opt operator for several reasons. First, 2-opt has

O (n^{2})

complexity per iteration, providing an effective balance between improvement quality and computational overhead—a critical consideration given our adaptive framework that applies local search probabilistically at each iteration. Second, empirical studies [13] demonstrate that 2-opt combined with ACO achieves substantial improvements for routing problems. Third, the segment reversal operation preserves route feasibility while potentially improving multiple criteria simultaneously.

We acknowledge that more sophisticated neighborhoods such as 3-opt, Lin-Kernighan moves, or Or-opt could potentially yield better results. However, our ablation study (Table 4) demonstrates that 2-opt already provides meaningful improvement (2.2% cost reduction), and the increased computational overhead of more complex neighborhoods would reduce the number of achievable iterations within practical time constraints. Exploring advanced local search operators remains a direction for future work.

4.5. Elite Archive with Diversity Preservation

MCAH-ACO maintains an elite archive

A

of size

| A | = A_{m a x}

to preserve high-quality solutions across iterations. To prevent convergence to a single region of the solution space, we employ a diversity-aware insertion strategy:

div (s_{1}, s_{2}) = 1 - \frac{| E (s_{1}) \cap E (s_{2}) |}{| E (s_{1}) \cup E (s_{2}) |}

(14)

where

E (s)

denotes the set of edges in solution s, and

div (s_{1}, s_{2})

measures the structural dissimilarity between two solutions based on the Jaccard distance of their edge sets. A value of

div (s_{1}, s_{2}) = 0

indicates identical solutions, while

div (s_{1}, s_{2}) = 1

indicates completely different edge sets. A new solution is inserted into the archive only if its minimum diversity distance to existing solutions exceeds threshold

δ_{m i n}

, or if it improves upon the worst solution in the archive.

4.6. Complete MCAH-ACO Algorithm

The complete MCAH-ACO procedure is presented in Algorithm 1. The algorithm begins by initializing K pheromone matrices with uniform values and setting equal weights for all criteria. During each iteration, ants construct solutions using the combined pheromone information and apply 2-opt local search with probability

p_{l s}

. The elite archive is updated with diversity checking, and criterion weights are adjusted based on variance feedback. Pheromone matrices are updated with evaporation and deposit operations, bounded by MMAS limits. A stagnation detection mechanism triggers reinitialization when convergence plateaus.

Algorithm 1 MCAH-ACO for Multi-Criteria MTSP

1:: Input: Graph $G = (V, E)$ , cost matrices, m vehicles
2:: Output: Best multi-criteria route assignment
3:: Initialize pheromone matrices ${τ^{(k)}}_{k = 1}^{K}$ with $τ_{0}$
4:: Initialize weights $ω_{k} = 1 / K$ for all k
5:: Initialize elite archive $A \leftarrow \emptyset$
6:: while iteration < max_iterations and not converged do
7:: for each ant $a = 1$ to $N_{a n t s}$ do
8:: Construct MTSP solution using Equation (11)
9:: if rand() $< p_{l s}$ then
10:: Apply 2-opt local search
11:: end if
12:: Update elite archive $A$ with diversity check
13:: end for
14:: Compute criterion variances ${σ_{k}}$ from $A$
15:: Update weights ${ω_{k}}$ using Equation (12)
16:: for each criterion $k = 1$ to K do
17:: Evaporate: $τ_{i j}^{(k)} \leftarrow (1 - ρ) τ_{i j}^{(k)}$
18:: Deposit pheromone from iteration-best solution
19:: Apply MMAS bounds: $τ_{i j}^{(k)} \in [τ_{m i n}, τ_{m a x}]$
20:: end for
21:: if stagnation detected then
22:: Reinitialize pheromone matrices
23:: end if
24:: end while
25:: return Best solution from $A$

4.7. Baseline Algorithms

For a comprehensive comparison, we implement several baseline algorithms. The standard Genetic Algorithm (GA) employs ordered crossover and swap mutation with tournament selection and elitism. The Adaptive GA variant uses linearly decreasing crossover probability from 0.9 to 0.1 and a variance-dependent mutation rate to balance exploration and exploitation. For ACO-based methods, we implement the Max-Min Ant System (MMAS) with pheromone bounds and stagnation-triggered reinitialization, as well as an Adaptive MMAS variant that incorporates GA-based parameter tuning for

β

,

ρ

, and exploration rate.

5. Experimental Setup

5.1. Dataset and Environment

Experiments were conducted on a real-world delivery dataset from the Greater Toronto Area (GTA), comprising 20 nodes (1 depot + 19 drop-off locations) with

m = 3

delivery vehicles. Each edge between nodes is associated with multi-criteria attributes, including distance, travel time, number of traffic signals, intersections, turns, collision history, and road capacity. All algorithms were implemented in Python 3.9 with GPU acceleration support [22] and executed on a workstation with Intel Core i7-12700K CPU (Intel Corporation, Santa Clara, CA, USA) and 32GB RAM.

Additional datasets: To validate generalizability, we also conducted experiments on: (1) a synthetic dataset with 50 nodes generated following standard VRP benchmark procedures with randomized multi-criteria edge attributes and (2) a second real-world dataset from a different urban region with 35 nodes. Results on these additional datasets are presented in Section 6.7.

Statistical validation: All experimental results are reported as the mean over 30 independent runs with different random seeds.

5.2. Parameter Settings

For MCAH-ACO, we set the following parameters based on preliminary tuning: number of ants

N_{a n t s} = 20

, pheromone importance

α = 1.0

, heuristic importance

β = 2.5

, evaporation rate

ρ = 0.1

, adaptation rate

γ = 0.05

, local search probability

p_{l s} = 0.3

, elite archive size

A_{m a x} = 10

, diversity threshold

δ_{m i n} = 0.15

, and maximum iterations

T_{m a x} = 500

. Baseline algorithms use default parameters from their original publications.

5.3. Implementation and Reproducibility

To ensure fair comparison and experimental validity, we implemented all baseline algorithms following their original published specifications:

MMAS: Parameters follow Stützle and Hoos [9] with $τ_{m i n} / τ_{m a x}$ bounds and stagnation-triggered reinitialization.
GA: Standard implementation with ordered crossover (OX), swap mutation, tournament selection (size 5), and elitism preserving the top 10% of solutions.
Adaptive GA: Crossover probability linearly decreases from 0.9 to 0.1; mutation rate adapts based on population diversity.
Adaptive MMAS: Incorporates GA-based parameter tuning for $β$ and $ρ$ .

All algorithms use identical cost function formulations, the same random seeds for reproducibility, and equivalent computational budgets (500 iterations or equivalent function evaluations). MCAH-ACO demonstrates consistent improvements across all metrics (cost, distance, and all safety factors), which reduces the likelihood that results arise from implementation bias favoring specific metrics. We commit to making our implementation publicly available upon paper acceptance to enable independent verification.

6. Experimental Results and Discussion

6.1. Comparison Between Multi-Criteria and Single-Criteria Routing

We first validate the importance of multi-criteria optimization by comparing routes generated under single-criterion (shortest path) versus multi-criterion conditions, with results summarized in Table 1.

Table 1. Routing metrics under multi-criteria versus single-criteria (shortest path) optimization.

Multi-criteria routing selects paths that are longer in distance but significantly safer and smoother, preferring major roads with higher capacity and fewer interruptions. Despite a 38.7% increase in distance, multi-criteria routing reduces intersections by 81.6%, traffic signals by 83.9%, and collision-prone segments by 79.4%. This distance–safety trade-off aligns with safety-first planning principles [6] and uncertainty-aware decision frameworks [23], reflecting real-world delivery priorities where minimizing safety risks often outweighs marginal distance increases.

Economic Justification of the Distance–Safety Trade-off: A natural question arises regarding the reasonableness of a 38% distance increase for improved safety. We provide the following analysis:

Operational cost perspective: At an average fuel cost of $0.15/km, the 8.4 km distance increase translates to approximately $1.26 per trip in additional fuel cost.
Accident cost perspective: The average cost of a delivery vehicle accident ranges from $5000 to $15,000 when accounting for vehicle damage, potential medical expenses, insurance premium increases, and lost productivity. Given the 79.4% reduction in collision-prone segments, the expected savings from accident prevention substantially outweigh the marginal fuel cost increase.
Application-dependent considerations: For specialized deliveries such as medical supplies, hazardous materials, or high-value goods, significantly longer routes to ensure safety are routinely justified in industry practice.
Adjustable trade-offs: Our MCAH-ACO framework provides flexibility through adjustable weight parameters. By increasing $w_{0}$ (distance weight) and decreasing $w_{3}$ (safety weight), operators can shift the trade-off toward shorter routes if their specific operational context prioritizes distance over safety.

6.2. Performance Comparison of MTSP Solvers

As shown in Table 2 and Figure 2, MCAH-ACO achieves the lowest cost of 3672.94, representing a 12.3% improvement over MMAS and 16.4% over the GA baseline. Notably, computational efficiency is maintained as MCAH-ACO requires only 12.83 s, compared with 879 s for Adaptive MMAS—a 68× speedup while achieving better solution quality. The multi-criteria pheromone decomposition enables effective exploration of the multi-dimensional objective space without the overhead of explicit Pareto dominance calculations, while the 2-opt local search provides significant solution refinement with minimal computational overhead through probability-controlled application.

Table 2. Cost and computational efficiency of MTSP solver algorithms.

Figure 2. Performance comparison among MTSP solvers. For each algorithm, the light red bar shows wall time in seconds (left axis), while the blue bar displays the best cost achieved (right axis). Lower cost values indicate better optimization performance.

6.3. Safety and Environmental Performance

MCAH-ACO demonstrates superior performance across all safety and environmental metrics, as shown in Table 3 and Figure 3. Compared with MMAS, collisions are reduced by 18.8% (181 vs. 223), directly improving route safety. Intersections are reduced by 16.9% (412 vs. 496), minimizing stop-and-go patterns, while traffic signals are reduced by 17.4% (76 vs. 92), improving travel flow continuity. Turns are also reduced by 15.7% (156 vs. 185), reducing maneuver complexity. These improvements result from the adaptive weight balancing mechanism, which prevents the distance objective from dominating and ensures balanced optimization across all criteria.

Table 3. Comparison of safety and environmental factors.

Figure 3. Comparison of safety and environmental metrics across different MTSP solvers. Each group shows four metrics for one algorithm: Intersections (blue), Traffic Signals (orange), Collisions (green), and Turns (red). Lower values indicate better performance (fewer safety risks and environmental impacts).

6.4. Convergence Analysis

Figure 4 illustrates the convergence behavior of MCAH-ACO compared with baseline algorithms. MCAH-ACO exhibits faster initial convergence due to the 2-opt local search enhancement and maintains steady improvement through the adaptive weight balancing mechanism. The elite archive with diversity preservation prevents premature convergence to local optima, enabling continued exploration of promising regions.

Figure 4. Convergence performance of GA and MMAS algorithms. The MMAS model converges faster and achieves a lower final cost, demonstrating superior global search capability and stability compared with the GA baseline.

6.5. Ablation Study

To validate the contribution of each component, we conducted an ablation study by systematically removing components from MCAH-ACO, with results presented in Table 4.

Table 4. Ablation study of MCAH-ACO components.

The ablation study confirms that multi-criteria pheromone decomposition provides the largest contribution with 5.9% cost reduction, validating the importance of separate pheromone matrices for each objective. Adaptive weight balancing contributes 3.8% cost improvement by preventing objective dominance. The 2-opt local search and elite archive diversity provide complementary benefits in solution refinement and exploration.

6.6. Parameter Sensitivity Analysis

To examine how parameter variations affect optimal decisions, we conducted a comprehensive sensitivity analysis on both objective weights and ACO algorithm parameters.

Objective Weight Sensitivity: We systematically varied weight configurations across 25 settings. Table 5 presents representative results showing how different weight priorities affect routing outcomes.

Table 5. Sensitivity analysis of objective weight parameters.

ACO Parameter Sensitivity: Table 6 summarizes the sensitivity of key ACO parameters.

Table 6. Sensitivity analysis of ACO algorithm parameters.

Key Findings: The algorithm shows moderate sensitivity to weight parameters, allowing meaningful trade-off control between objectives. ACO parameters are relatively robust within reasonable ranges, with

β

(heuristic influence) having the largest impact on solution quality. Weight parameter changes produce predictable, monotonic effects on their respective objectives, enabling practitioners to calibrate the algorithm based on specific operational priorities.

6.7. Scalability and Generalization Analysis

To validate the generalizability of MCAH-ACO across different problem scales, we conducted additional experiments on datasets of varying sizes, as summarized in Table 7.

Table 7. Scalability Analysis Across Different Problem Sizes.

MCAH-ACO maintains consistent improvements across all tested problem sizes, with cost reductions ranging from 12.3% to 14.1%. The improvement margin slightly increases with problem scale, suggesting that the adaptive weight balancing mechanism becomes more beneficial as the solution space complexity grows. Computational time scales approximately linearly with problem size, remaining practical for real-world deployment scenarios.

7. Conclusions

This paper presented MCAH-ACO, a novel Multi-Criteria Adaptive Hybrid Ant Colony Optimization algorithm for solving the delivery vehicle routing problem formulated as a Multiple Traveling Salesman Problem (MTSP). The proposed algorithm introduces three key innovations: multi-criteria pheromone decomposition that maintains separate pheromone matrices for each optimization objective, adaptive weight balancing that dynamically adjusts criterion weights based on convergence feedback, and 2-opt local search enhancement integrated with elite archive diversity preservation.

Extensive experiments on real-world delivery data from the Greater Toronto Area demonstrate that MCAH-ACO significantly outperforms existing approaches. The algorithm achieves a 12.3% reduction in total routing cost compared with the best baseline MMAS, while maintaining computational efficiency with only 12.83 s runtime versus 879 s for Adaptive MMAS. Safety performance is substantially improved with an 18.8% reduction in collision-prone segments. Consistent improvements are observed across all safety and environmental metrics, including 16.9% fewer intersections and 17.4% fewer traffic signals. The ablation study confirms that each component contributes meaningfully to overall performance, with multi-criteria pheromone decomposition providing the largest improvement at 5.9% cost reduction.

Future work will extend MCAH-ACO in several directions, including incorporating time-window constraints and dynamic traffic conditions for real-time adaptability, scaling to larger problem instances with hundreds of delivery nodes, integrating machine learning for adaptive parameter control, and extending to heterogeneous fleet scenarios with different vehicle capacities and capabilities. Additionally, incorporating explainable AI techniques [24] will enhance decision transparency for practical deployment. Addressing potential biases in routing data and ensuring fair service distribution across diverse demographic regions [25,26,27] represents another important direction for equitable logistics optimization.

Author Contributions

Conceptualization, H.-F.L. and X.-Y.C.; methodology, D.-T.C.; validation, D.-T.C. and X.-Y.C.; investigation, D.-T.C.; writing—original draft preparation, H.-F.L. and D.-T.C.; writing—review and editing, D.-T.C. and X.-Y.C.; visualization, D.-T.C.; project administration, H.-F.L.; funding acquisition, H.-F.L.; data curation, L.-Y.B.; formal analysis L.-Y.B.; resources, X.-Y.C.; software, X.-Y.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China under Grant 62372148.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Acknowledgments

During the preparation of this manuscript, the authors used GPT-5 (OpenAI, August 2025) for the purposes of proofreading. The authors have reviewed and edited the output and take full responsibility for the content of this publication. We would like to express our sincere gratitude to Zhimo Han, Kong Wang, Wei Knag and Haitao Zhang for their valuable work, which have greatly improved this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Aston, J.; Vipond, O.; Virgin, K.; Youssouf, O. Retail E-Commerce and COVID-19: How Online Shopping Opened Doors While Many Were Closing; Statistics Canada: Ottawa, ON, Canada, 2020. [Google Scholar]
Zhou, L.; Baldacci, R.; Vigo, D.; Wang, X. A Multi-Depot Two-Echelon Vehicle Routing Problem with Delivery Options Arising in the Last Mile Distribution. Eur. J. Oper. Res. 2018, 265, 765–778. [Google Scholar] [CrossRef]
Srinivas, S.; Marathe, R. Moving towards `mobile warehouse’: Last-mile logistics during COVID-19 and beyond. Transp. Res. Interdiscip. Perspect. 2021, 10, 100339. [Google Scholar] [CrossRef]
Chen, Z.; Wang, H.; Khamis, A. Multi-criteria Optimal Routing for Last-mile Parcel Delivery with Autonomous Robots. IEEE Syst. Man Cybern. Mag. 2022, 8, 18–28. [Google Scholar]
Anastasiadou, M.N.; Mavrovouniotis, M.; Hadjimitsis, D. Ant Colony Optimization for the Dynamic Electric Vehicle Routing Problem. In Parallel Problem Solving from Nature—PPSN XVIII, Proceedings of the International Conference on Parallel Problem Solving from Nature, Hagenberg, Austria, 14–18 September 2024; Springer: Cham, Switzerland, 2024. [Google Scholar]
Yu, D.; Wang, S.; Xu, Y.; Wang, T.; Zou, J. Adaptive bidirectional planning framework for enhanced safety and robust decision-making in autonomous navigation systems. J. Supercomput. 2025, 81, 965. [Google Scholar] [CrossRef]
Song, X.; Chen, K.; Bi, Z.; Niu, Q.; Liu, J.; Peng, B.; Zhang, S.; Liu, M.; Li, M.; Pan, X.; et al. Mastering Reinforcement Learning: Foundations, Algorithms, and Real-World Applications. arXiv 2025, arXiv:2501.00001. [Google Scholar]
Cheikhrouhou, O.; Khoufi, I. A comprehensive survey on the Multiple Traveling Salesman Problem: Applications, approaches and taxonomy. Comput. Sci. Rev. 2021, 40, 100369. [Google Scholar] [CrossRef]
Stützle, T.; Hoos, H.H. MAX-MIN Ant System. Future Gener. Comput. Syst. 2000, 16, 889–914. [Google Scholar] [CrossRef]
Othman, W.A.F.; Yahaya, M.Z.B.; Othman, Z.A. Solving Vehicle Routing Problem using Ant Colony Optimization Algorithm. Int. J. Res. Eng. 2018, 5, 49–56. [Google Scholar] [CrossRef]
Kumar, A.; Sharma, R.; Singh, S. An optimization model for vehicle routing problem in last-mile delivery. Expert Syst. Appl. 2023, 225, 119978. [Google Scholar]
Awadallah, M.A.; Makhadmeh, S.N.; Al-Betar, M.A.; Dalbah, L.M.; Al-Redhaei, A.; Kouka, S.; Enshassi, O.S. Multi-objective Ant Colony Optimization: Review. Arch. Comput. Methods Eng. 2024, 32, 995–1037. [Google Scholar] [CrossRef]
Wang, H.; Zhang, X.; Liu, Y. A Scheme Library-Based Ant Colony Optimization with 2-Opt Local Search for Dynamic Traveling Salesman Problem. Comput. Model. Eng. Sci. 2023, 135, 1417–1435. [Google Scholar] [CrossRef]
Xue, F.; Chen, Y.; Dong, T.; Wang, P.; Fan, W. MOEA/D with adaptive weight vector adjustment and parameter selection based on Q-learning. Appl. Intell. 2025, 55, 399. [Google Scholar] [CrossRef]
Shi, T.; Chen, D.; Chen, K.; Li, Z. Offline Reinforcement Learning for Autonomous Driving with Safety and Exploration Enhancement. arXiv 2021, arXiv:2110.07067. [Google Scholar] [CrossRef]
Shi, T.; Ai, Y.; ElSamadisy, O.; Abdulhai, B. Bilateral Deep Reinforcement Learning Approach for Better-than-human Car Following Model. arXiv 2022, arXiv:2203.04749. [Google Scholar]
Duan, Y.; Guo, X.; Zhu, Z. DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation. In Proceedings of the European Conference on Computer Vision (ECCV), Milan, Italy, 29 September–4 October 2024. [Google Scholar]
Guo, X.; Zhang, R.; Duan, Y.; He, Y.; Zhang, C.; Liu, S.; Chen, L. DriveMLLM: A Benchmark for Spatial Understanding with Multimodal Large Language Models in Autonomous Driving. arXiv 2024, arXiv:2411.13112. [Google Scholar] [CrossRef]
Zhang, M.; Fang, Z.; Wang, T.; Zhang, Q.; Lu, S.; Jiao, J.; Shi, T. A Cascading Cooperative Multi-agent Framework for On-ramp Merging Control Integrating Large Language Models. arXiv 2025, arXiv:2503.08199. [Google Scholar]
Shi, T.; ElSamadisy, O.; Abdulhai, B. CoopSECRM2D-MM: Safe, Efficient, and Comfortable Multi-Agent RL for On-Ramp Merging. SSRN 5295761, 2025. Available online: https://ssrn.com/abstract=5295761 (accessed on 2 January 2026).
Tang, W.; Zhang, H.W.; Huang, J.; Wang, S.; Yu, F.; Yang, H.; Wang, Y. AgentBuilder: Automating agent creation via large language model-driven systems. Neurocomputing 2025, 646, 130476. [Google Scholar]
Li, M.; Bi, Z.; Wang, T.; Wen, Y.; Niu, Q.; Liu, J.; Peng, B.; Zhang, S.; Pan, X.; Xu, J.; et al. Deep learning and machine learning with GPGPU and CUDA: Unlocking the power of parallel computing. arXiv 2024, arXiv:2410.05686. [Google Scholar] [CrossRef]
Wang, T.; Wang, Y.; Zhou, J.; Peng, B.; Song, X.; Zhang, C.; Sun, X.; Niu, Q.; Liu, J.; Chen, S.; et al. From aleatoric to epistemic: Exploring uncertainty quantification techniques in artificial intelligence. arXiv 2025, arXiv:2501.03282. [Google Scholar] [CrossRef]
Hsieh, W.; Bi, Z.; Jiang, C.; Liu, J.; Peng, B.; Zhang, S.; Pan, X.; Xu, J.; Wang, J.; Chen, K.; et al. A comprehensive guide to explainable AI: From classical models to LLMs. arXiv 2024, arXiv:2412.00800. [Google Scholar] [CrossRef]
Yang, J.; Baldwin, T.; Cohn, T. Multi-EuP: The Multilingual European Parliament Dataset for Analysis of Bias in Information Retrieval. In Proceedings of the 3rd Workshop on Multi-Lingual Representation Learning (MRL), Singapore, 7 December 2023. [Google Scholar]
Yang, J.; Jiang, F.; Baldwin, T. Language Bias in Multilingual Information Retrieval: The Nature of the Beast and Mitigation Methods. In Proceedings of the Fourth Workshop on Multilingual Representation Learning (MRL), Miami, FL, USA, 12–16 November 2024. [Google Scholar]
Yang, J.; Han, X.; Baldwin, T. Demographics and Democracy: Benchmarking LLMs’ Gender Bias and Political Leaning in European Parliament. In Proceedings of the 8th International Conference on Natural Language and Speech Processing (ICNLSP), Odense, Denmark, 25–27 August 2025. [Google Scholar]

Figure 2. Performance comparison among MTSP solvers. For each algorithm, the light red bar shows wall time in seconds (left axis), while the blue bar displays the best cost achieved (right axis). Lower cost values indicate better optimization performance.

Figure 3. Comparison of safety and environmental metrics across different MTSP solvers. Each group shows four metrics for one algorithm: Intersections (blue), Traffic Signals (orange), Collisions (green), and Turns (red). Lower values indicate better performance (fewer safety risks and environmental impacts).

Figure 4. Convergence performance of GA and MMAS algorithms. The MMAS model converges faster and achieves a lower final cost, demonstrating superior global search capability and stability compared with the GA baseline.

Table 1. Routing metrics under multi-criteria versus single-criteria (shortest path) optimization.

Metric	Multi-Criteria	Single (Shortest Path)
Distance (m)	30,159.79	21,746.73
Travel Time (s)	1371.7	1773.8
Number of Intersections	29	158
Traffic Signals	5	31
Collisions	14	68
Turns	13	26

Table 2. Cost and computational efficiency of MTSP solver algorithms.

Model	Wall Time (s)	Best Cost	Distance (m)	Improvement
MTSP-GA	29.4	4391.96	211,855.85	–
MTSP-Adaptive GA	28.7	4383.49	205,252.15	0.2%
MTSP-MMAS	7.39	4188.05	209,804.42	4.6%
MTSP-Adaptive MMAS	879.0	4116.93	198,001.81	6.3%
MCAH-ACO (Ours)	12.83	3672.94	185,647.32	16.4%

Table 3. Comparison of safety and environmental factors.

Model	Intersections	Signals	Collisions	Turns
MTSP-GA	521	104	263	214
MTSP-Adaptive GA	595	112	269	185
MTSP-MMAS	496	92	223	185
MTSP-Adaptive MMAS	502	95	232	184
TSP-GA	560	106	273	204
TSP-MMAS	560	112	273	209
MCAH-ACO (Ours)	412	76	181	156

Table 4. Ablation study of MCAH-ACO components.

Configuration	Best Cost	Collisions
MCAH-ACO (Full)	3672.94	181
w/o Multi-criteria Pheromone	3891.27	208
w/o Adaptive Weights	3812.45	195
w/o 2-Opt Local Search	3756.18	189
w/o Elite Archive Diversity	3728.63	186

Table 5. Sensitivity analysis of objective weight parameters.

Scenario	$w_{0}$	$w_{1}$	$w_{2}$	$w_{3}$	Total Cost
Distance-focused	0.7	0.1	0.1	0.1	3892.47
Time-focused	0.2	0.5	0.15	0.15	3756.83
Safety-focused	0.1	0.2	0.2	0.5	3548.12
Balanced	0.25	0.25	0.25	0.25	3672.94

Table 6. Sensitivity analysis of ACO algorithm parameters.

Parameter	Range	Optimal	Cost Range	Sensitivity
$α$ (pheromone)	0.5–2.0	1.0	3518.6–3827.3	Low
$β$ (heuristic)	1.0–5.0	2.5	3423.1–3922.8	Medium
$ρ$ (evaporation)	0.05–0.2	0.1	3558.4–3787.5	Low
Number of ants	10–50	20	3612.7–3733.2	Low

Table 7. Scalability Analysis Across Different Problem Sizes.

Dataset	Nodes	MMAS Cost	MCAH-ACO Cost	Improvement	Time (s)
GTA-20	20	4188.05	3672.94	12.3%	12.83
Urban-35	35	7245.82	6318.47	12.8%	28.56
Synthetic-50	50	10,892.36	9467.21	13.1%	52.41
Synthetic-75	75	16,438.74	14,125.63	14.1%	98.73

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

MCAH-ACO: A Multi-Criteria Adaptive Hybrid Ant Colony Optimization for Last-Mile Delivery Vehicle Routing

Abstract

1. Introduction

3. Problem Formulation and Modeling

4. Proposed MCAH-ACO Algorithm

4.1. Background: Ant Colony Optimization

4.2. Multi-Criteria Pheromone Decomposition

4.3. Adaptive Weight Balancing Mechanism

4.4. 2-Opt Local Search Enhancement

4.5. Elite Archive with Diversity Preservation

4.6. Complete MCAH-ACO Algorithm

4.7. Baseline Algorithms

5. Experimental Setup

5.1. Dataset and Environment

5.2. Parameter Settings

5.3. Implementation and Reproducibility

6. Experimental Results and Discussion

6.1. Comparison Between Multi-Criteria and Single-Criteria Routing

6.2. Performance Comparison of MTSP Solvers

6.3. Safety and Environmental Performance

6.4. Convergence Analysis

6.5. Ablation Study

6.6. Parameter Sensitivity Analysis

6.7. Scalability and Generalization Analysis

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics

MCAH-ACO: A Multi-Criteria Adaptive Hybrid Ant Colony Optimization for Last-Mile Delivery Vehicle Routing

Abstract

1. Introduction

2. Related Work

2.1. Vehicle Routing and MTSP Optimization

2.2. Multi-Objective and Hybrid Optimization

2.3. Research Gap and Our Contribution

3. Problem Formulation and Modeling

4. Proposed MCAH-ACO Algorithm

4.1. Background: Ant Colony Optimization

4.2. Multi-Criteria Pheromone Decomposition

4.3. Adaptive Weight Balancing Mechanism

4.4. 2-Opt Local Search Enhancement

4.5. Elite Archive with Diversity Preservation

4.6. Complete MCAH-ACO Algorithm

4.7. Baseline Algorithms

5. Experimental Setup

5.1. Dataset and Environment

5.2. Parameter Settings

5.3. Implementation and Reproducibility

6. Experimental Results and Discussion

6.1. Comparison Between Multi-Criteria and Single-Criteria Routing

6.2. Performance Comparison of MTSP Solvers

6.3. Safety and Environmental Performance

6.4. Convergence Analysis

6.5. Ablation Study

6.6. Parameter Sensitivity Analysis

6.7. Scalability and Generalization Analysis

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics