1. Introduction
Over recent decades, urban waste generation has surged worldwide due to population growth and rapid urbanization. The World Bank forecasts that global municipal solid waste will reach 3.4 billion tons by 2050 [
1]. China, as the most populous country, generated 254 million tons of waste in 2024 [
2]. Facing such challenges, waste management authorities must urgently develop effective and sustainable approaches to handle the growing volume of waste.
The entire waste management process mainly includes four stages: waste collection, waste transfer, secondary collection, and final disposal. Among them, waste collection and secondary collection are important issues because they account for 60–80% of the total cost of waste management [
3]. Therefore, optimizing waste collection is not merely a logistics problem but a key entry point for improving the efficiency of the entire waste management process. It would not only reduce collection costs but also provide decision support for the optimal layout of waste facilities. As this study focuses on municipal waste collection, we ignored the subsequent waste processing procedures after the transfer station. Specifically, we focus on the collection process from waste collection points (WCPs) to the transfer station.
In response to the growing pressure on waste management, China’s central government has recently initiated a mandatory waste sorting policy, with Shanghai serving as the most representative city in its implementation [
4]. It states that the waste collection system must be restructured from the traditional mixed collection model to a separate collection model that maintains waste sorting. Additionally, an innovative initiative known as “Removal of Bins and Consolidation of Points” is being implemented in most cities across China. This initiative aims to transform scattered bins into centralized waste collection points. Shanghai served as an early leader of this initiative, and other Chinese cities subsequently followed suit [
5]. For example, Shanghai emphasizes a model combining centralized WCPs with fixed time slots and “green account” point incentives. Shenzhen has introduced smart devices at centralized WCPs accompanied by heavy fines, while Zhuhai relies on human supervision and guidance at such centralized WCPs. Although these cities have significant differences in their implementation models and intensities for this initiative, all of them have significantly reduced the number of WCPs, demonstrated that it is possible to achieve economies of scale in waste collection. This is a critical factor in waste collection, but it still faces barriers.
On the one hand, waste sorting is currently achieved by assigning dedicated vehicles to specific waste categories in most cities. Currently, separate vehicles are used for different waste types to keep waste sorted. However, this approach leads to multiple vehicle trips to the same site, raising collection costs and environmental burdens.
On the other hand, influenced by factors such as residents’ living habits, commuting patterns, and business activity cycles, the amount of domestic waste exhibits a distinct bimodal distribution pattern. Specifically, the amount of waste generated shows two distinct peaks during the morning and evening periods, creating a peak for collection demand. To cope with this heavy pressure, collection companies usually need to allocate excess transportation resources, including dispatching more vehicles, increasing the number of shifts, and extending working hours. However, this resource allocation model oriented towards peak demand leads to severe underutilization of collection capacity during off-peak hours. This brings considerable economic pressure to the sustainable operation of the collection companies and constitutes a critical challenge in waste collection operations.
To enhance resource utilization and collection efficiency, hybrid transportation capacity types combining self-operation, outsourcing and crowdsourcing has emerged in logistics practices. These types, which differ in working hours, shift scheduling practices, and compensation structures, can be strategically integrated to improve the overall efficiency of capacity resources. However, this study focuses on the collection of domestic waste. As a public service, it faces multiple risks such as uneven service coverage, illegal dumping and difficult supervision. Therefore, it is usually regulated and implemented by the government so it is characterized by a strong public character and a low degree of marketization. Therefore, this study does not consider the application of the crowdsourcing model in waste collection. In contrast, outsourced vehicles are usually not directly managed by government departments, which eliminates the costs associated with vehicle purchase and maintenance, thus effectively reducing fixed asset investments. Therefore, this study proposes a hybrid transport capacity model that combines self-operation and outsourcing as an optimal solution to enhance responsiveness and collection efficiency.
The waste collection problem is often regarded as a variant of the vehicle routing problem. In early studies, scholars mainly relied on historical data for analysis and seldom took real-time information into account [
6,
7,
8]. The development of IoT technology, particularly smart waste bins, enabled real-time monitoring of waste fill levels, thereby supporting dynamic collection [
9]. Hannan et al. [
10] reviewed the existing information and communication technologies, analyzed their application in smart waste management, and discussed potential challenges. Subsequently, Hannan et al. [
3] designed an improved particle swarm optimization algorithm for solid waste collection using smart waste bins. Akhtar et al. [
11] established a mathematical model based on bin filling level thresholds to minimize total collection route distance. Nowakowski et al. [
12] proposed a harmony search algorithm for recycling waste electrical and electronic equipment to optimize collection routes and validated its superiority through comparison with other algorithms. Idwan et al. [
13] employed a two-stage heuristic algorithm to address the reverse logistics vehicle routing optimization problem for smart bins. They also conducted a comparative analysis between traditional and smart collection methods. Jorge et al. [
14] adopted a hybrid meta-heuristic algorithm to prospectively determine which bins to collect based on their real-time and predicted filling levels. de Morais et al. [
15] utilized real-time information on filling levels to construct a dynamic reverse inventory routing optimization model. Kim et al. [
16] introduced two management methods to determine dynamic optimal routes. They combined ant colony optimization with the k-means clustering algorithm to solve the vehicle routing problem for large-scale waste collection. Although the above studies have considered dynamic collection scenarios, none of them addressed the vehicle routing problem when multiple types of waste in multi-compartment vehicles are involved. In the context of smart bin applications, Yang et al. [
17] considered the uncertainty in waste generation rates and introduced a chance-constrained approach to solve a multi-compartment electric vehicle routing problem. Subsequently, Mohammadi et al. [
18] first applied the discrete choice models to optimize dynamic multi-compartment vehicle routing problems, providing a new research perspective to the field. Despite these valuable contributions, neither study considered the impact of heterogeneous multi-compartment vehicles under dynamic demands in waste collection routing problems.
To the best of our knowledge, the existing studies on heterogeneous fleets in waste collection are predominantly framed within multi-level waste collection networks. Markov et al. [
19] considered intermediate facilities and introduced a heterogeneous fleet and flexible destination allocation to address the recyclable waste collection problem. Asefi et al. [
20] employed heterogeneous vehicles to construct a three-level collection network, aiming to optimize the fleet size and determine the optimal routes, thereby minimizing the waste collection cost. Ghiani et al. [
21] focused on transfer stations in waste collection systems. The study indicated that waste was transferred from small vehicles to large ones at transfer stations. To address this, they proposed a two-stage optimization method to solve the routing problems of small and large vehicles. Cao et al. [
22] focused on the electric vehicle routing problem in a two-level reverse logistics network. They adopted a heterogeneous hybrid fleet to reduce carbon emissions and save resources. Wei et al. [
23] explored integrated optimization of a multi-level routing problem, aiming to plan a set of routes for manually driven vehicles and those equipped with compressors to minimize travel costs. Beyond the field of waste collection, heterogeneous fleets have been extensively adopted across diverse application domains. Nucamendi-Guillén et al. [
24] studied a multi-depot location-routing problem with a heterogeneous fleet, aiming to minimize total costs by selecting contracted carriers, the types of vehicles used by each carrier, and the corresponding collection routes. Xu et al. [
25] investigated the collaborative issue between ground vehicles and multiple drones and proposed a two-stage heuristic algorithm for its solution. Lin et al. [
26] considered the optimization of task allocation and operational routing for a heterogeneous fleet of unmanned aerial vehicles to enable effective delivery of emergency medical services. While the majority of the above studies employed heterogeneous fleets for delivery tasks, they failed to systematically address the dynamic routing problem when both heterogeneous and multi-compartment vehicles are involved.
The introduction of heterogeneous vehicles and real-time changes in waste bin filling levels increase the complexity of the waste collection problem while also imposing higher requirements for the solution methods. Introduced by Heidari et al. [
27], the Harris Hawks Optimization (HHO) algorithm is a recently developed metaheuristic algorithm. Its appeal stems from a lean parameter configuration, intuitive underlying mechanism, high implementation feasibility, and effective balance between exploration and exploitation. These attributes have facilitated its successful deployment in multiple fields [
28,
29,
30]. Gupta et al. [
31] improved the HHO algorithm using four strategies: a nonlinear energy formula, a greedy selection mechanism, opposition-based learning, and an attenuation factor. These improvements addressed the imbalance between exploration and exploitation. Nivethitha et al. [
32] introduced an adaptive collaborative and decentralized foraging strategy in the HHO algorithm to guide the update of population positions. Fahmy et al. [
33] improved the HHO algorithm by integrating an enhanced Chimpanzee Optimization Algorithm. They also proposed a new position update formula to calculate the prey’s escape energy. The above improvement strategies enhance the performance of the single HHO algorithm from multiple perspectives, including parameter tuning, population update mechanisms, and energy formula reformulation. However, such strategies are typically still confined to parameter or mechanism optimization within the HHO framework itself, making them prone to premature convergence to local optima. To overcome this limitation, researchers have begun exploring hybrid algorithms that combine multiple algorithms to break through the performance bottlenecks of single algorithms. Bao et al. [
34] enhanced the search capability of HHO by adopting the differential evolution algorithm. This algorithm has been successfully applied to image segmentation, verifying its performance advantages. Sehgal et al. [
35] combined the slime mold optimization algorithm with the HHO algorithm, thereby effectively avoiding convergence to local optima. Sharma et al. [
36] constructed seven enhanced variants of the HHO algorithm. By incorporating adaptive parameter adjustment, chaotic mapping, elite individual retention, and multi-algorithm collaboration, these algorithms were designed to achieve a more effective balance between global exploration and local exploitation.
In summary, a significant research gap exists in the current research on smart waste collection. On the one hand, in both waste collection and other delivery domains, most models only consider single-compartment vehicles, multi-compartment vehicles, or heterogeneous vehicles in isolation. Few studies have simultaneously addressed the routing optimization of both multi-compartment and heterogeneous vehicles under dynamic demand. On the other hand, the existing algorithms struggle to balance global exploration and local exploitation when tackling complex optimization problems, making them prone to premature convergence and resulting in low search efficiency.
To address the aforementioned research gap, the main questions addressed by this study are as follows: (1) Under dynamic demand, how should we optimize the multi-compartment vehicle collection cost when using heterogeneous fleets? (2) In the face of dynamic collection requirements, how can efficient collection decisions be made in an extremely short time? (3) What are the sensitive parameters in this problem, and how do they affect the collection route planning?
Therefore, this study investigates the dynamic collection route optimization problem with multi-compartment vehicles in a mixed fleet of self-operated and outsourced vehicles. The main contributions are as follows:
A two-stage model is constructed to minimize the dynamic collection cost of multi-compartment vehicles with heterogeneous fleets.
To respond in a timely manner to collection demands, a hybrid strategy combining continuous and periodic optimization is designed. Meanwhile, a greedy insertion algorithm is adopted for local optimization, and an improved HHO algorithm is employed for global optimization. The improved HHO algorithm integrates multiple heuristic algorithms to ensure a balance between exploration and exploitation and avoid local optima.
The validity of the proposed model and method is verified through case studies, and the sensitivity of key parameters is systematically analyzed. This provides a quantifiable decision-making basis and management implications for the dynamic collection of domestic waste.
The rest of this study is organized as follows.
Section 2 describes the operational problems. In
Section 3, we present the pre-optimization and re-optimization models.
Section 4 elaborates on the design of the model solving algorithm. Numerical experiments are detailed in
Section 5.
Section 6 provides the sensitivity analysis. Finally,
Section 7 is devoted to the conclusions and further work.
2. Problem Description
This study considers WCPs within residential areas, related transfer stations, and multi-compartment collection vehicles consisting of both self-owned and outsourced fleets. In China, domestic waste is classified into four categories: kitchen waste, recyclable waste, hazardous waste and other waste. Recyclable waste and hazardous waste each have their own specialized collection systems, and thus are not considered in this study.
The problem can be described as shown in
Figure 1. A two-compartment waste collection vehicle departs from the depot every day and sequentially visits WCPs that have reached the collection threshold to collect kitchen waste and other waste. When any compartment cannot accommodate additional waste, the vehicle will go to the nearest waste transfer station corresponding to that waste type to unload. It then continues collecting waste at the remaining WCPs until all waste is gathered. Following this, the vehicle sequentially unloads at both transfer stations and then returns to the depot. Each WCP must be served by exactly one vehicle.
Influenced by residents’ commuting and daily routines, the amount of waste generated at each waste collection point exhibits a bimodal distribution pattern. The morning peak occurs between 7:00 and 9:00, while the evening peak occurs between 19:00 and 21:00. During the off-peak period from 9:00 to 19:00, waste generation is relatively stable with random fluctuations. In particular, the amount of waste generated during the periods of 0:00–7:00 and 21:00–24:00 is extremely low, so these time slots are not considered in this study. In other words, only the waste generated from 7:00 to 21:00 is considered.
Due to the bimodal distribution characteristics of waste generation, there are some differences in the collection strategy. Specifically, collection demands that arise during the two critical time slots of 7:00–9:00 and 19:00–21:00 are considered static demands, while those arising during other time slots are regarded as dynamic demands. Static demands are fulfilled exclusively by self-owned vehicles, while outsourced vehicles are introduced to collaborate on dynamic demands to achieve responsiveness to collection demands. If waste is not collected in a timely manner, the risk of overflow will increase, which will in turn raise the cost of manual secondary sorting. The risk of overflow here is measured by the retention volume and the retention time of the waste. It is worth noting that different types of waste have different risk coefficients.
4. Solution Methods
In this section, we discuss the development of a two-stage algorithm to solve the aforementioned model.
4.1. Pre-Optimization Algorithm Design
The HHO algorithm draws inspiration from the foraging and attack strategies observed in nature. However, the traditional HHO algorithm suffers from several limitations, particularly its inability to effectively balance exploitation and exploration, which often leads to premature convergence to local optima.
In the pre-optimization phase, for the static demands generated during two critical time periods, namely 7:00–9:00 and 19:00–21:00, an improved HHO algorithm that integrates multiple heuristic algorithms is used to optimize the collection routes of self-owned vehicles and obtain an initial collection plan.
The traditional HHO adopts a greedy update mechanism, where newly generated solutions replace the current ones regardless of their quality. While this greedy update strategy accelerates convergence, it also tends to trap the algorithm in local optima and undermines global exploration. To overcome this drawback, this study incorporates the Metropolis criterion from Simulated Annealing (SA) as the solution acceptance mechanism, thereby enhancing the algorithm’s ability to escape local optima. The specific improvements are outlined below.
In each iteration, the fitness of a newly generated solution is compared with that of the current solution :
Case 1: If , the new solution is directly accepted, replacing the current one.
Case 2: If , the worse solution is accepted with a probability given by , where denotes the fitness difference, and is the current temperature parameter.
Furthermore, the traditional HHO exhibits a substantial loss of population diversity in the later optimization stages, making it prone to local convergence. To alleviate this deficiency, an elite–worst individual perturbation strategy derived from the Genetic Algorithm (GA) is introduced. This strategy enhances population diversity to bolster global search. The specific modifications are presented below.
After each iteration, the selection, crossover, and mutation operations of GA are applied.
Identifying elite and worst individuals: The 10 best-performing and 10 worst-performing solutions in the current population are selected to form a candidate pool for perturbation.
Performing directional crossover and mutation: A two-point crossover is conducted between the elite and worst solutions, where corresponding segments are swapped. Mutation is then applied to the resulting offspring.
4.2. Re-Optimization Algorithm Design
The initial route plans from the pre-optimization phase provide two fundamental constraints for the re-optimization phase: capacity constraints and cost constraints. Capacity constraints refer to the maximum load capacity and remaining capacity of self-owned vehicles in each time period, as determined in the pre-optimization phase. These constraints serve as the upper bound for inserting new tasks into self-owned vehicle routes during the re-optimization phase. No insertion operation may cause a self-owned vehicle to exceed its capacity. Cost constraints are defined based on the total cost of the pre-optimized routes as a baseline. The cost increment resulting from inserting a dynamic demand into an existing self-owned vehicle route is calculated and must not exceed the cost of dispatching an outsourced vehicle to independently complete that demand.
Considering the dynamic characteristics of the volume of waste in the re-optimization phase, existing research on optimization strategies mainly falls into two categories: periodic re-optimization and continuous re-optimization. The core issue of these strategies lies in when to update the path after the arrival of new information. Periodic re-optimization partitions the time horizon into fixed intervals and updates routes at the end of each interval, which may lead to delayed responses. Continuous re-optimization adjusts routes immediately upon receiving new demands, offering timely responses but requiring high real-time performance under frequent demand arrivals. To address the above issues, this study adopts a hybrid strategy that integrates periodic and continuous re-optimization. Continuous re-optimization means that when a dynamic demand arrives, the system immediately evaluates whether it satisfies both the capacity constraints and the cost constraints. If both are met, the demand is instantly inserted into the existing self-owned vehicle route, enabling a real-time response. Under periodic re-optimization, if a demand does not satisfy either of the above constraints, it is not processed immediately but is instead temporarily placed in a pending queue and will be uniformly handled by outsourced vehicles in the next decision period.
The detailed procedure of the algorithm for the waste collection scenario described above is as follows:
Step 1: Pre-optimization is performed at 9:00 and 21:00 by self-owned vehicles due to the bimodal distribution pattern.
Step 2: When a new waste collection point is added, it is necessary to check whether it satisfies the capacity constraint and the cost constraint of the self-owned vehicles.
Step 3: If Step 2 is satisfied, the system attempts to insert the waste collection point into each existing self-owned vehicle route and computes the associated cost increment [
37]. Finally, the route with the minimum cost increment is selected as the optimal insertion route. If Step 2 is not satisfied, proceed to Step 4.
Step 4: The newly added WCP is allocated to the next period and serviced by outsourced vehicles, and the corresponding cost increment is calculated.
Step 5: Repeat Steps 2 to 4 until all WCPs have been serviced.
The flowchart of the algorithm is shown in
Figure 2. Here,
represents the start time of period
.