1. Introduction
The transportation sector’s heavy reliance on fossil fuels has made it one of the fastest-growing sources of Global Greenhouse Gas (GHG) emissions. Reducing GHG emissions, particularly carbon dioxide (CO
2), has become an urgent and common objective in mitigating the impacts of climate change [
1,
2]. According to the Intergovernmental Panel on Climate Change 2022 report, the transportation sector accounted for 15% of global anthropogenic GHG emissions and 23% of energy-related emissions in 2019. The sector produced approximately 870 million tonnes of CO
2 equivalent in that year, and this is projected to grow to between 16% and 50% by 2050.
Compared to conventional transportation, cold chain logistics exacerbates emission challenges due to its continuous refrigeration demands for maintaining low temperatures to preserve product quality. Adakemaya et al. [
3] estimate that approximately 40% of transported food requires refrigeration, accounting for 53% of total energy consumption in transportation. According to Allied Market Research (2019) [
3], the global cold chain logistics market reached USD 160 billion in 2018 and is expected to exceed USD 585 billion by 2026, with a Compound Annual Growth Rate (CAGR) of 17.9%. Against the backdrop of China’s dual carbon strategy, green transformation has become an imperative, and carbon reduction in cold chain logistics is both a challenge and an opportunity for sustainable development.
Despite rapid growth, the cold chain logistics sector in China remains fragmented, with low integration across firms. This inefficiency results in poor network coordination, excessive resource consumption, and significant avoidable carbon emissions, reportedly costing tens of billions of yuan annually [
4]. Traditional “supply-driven” operational models are increasingly unsustainable. In this context, joint distribution—a model in which multiple logistics enterprises share transportation resources and customer bases under unified planning and scheduling [
5]—has emerged as a promising approach. From an industry perspective, joint distribution can significantly reduce total logistics costs and carbon emissions through optimized resource allocation [
6]. However, most VRP studies adopt a single-enterprise perspective, neglecting the systemic benefits of collaborative resource sharing in cold chain logistics. This omission has left a gap in quantifying industry-wide cost and emission reductions achievable through joint distribution.
Route planning serves as a vital lever for improving transportation system efficiency. Studies have shown that optimizing vehicle flow distribution and alleviating congestion can significantly reduce fuel consumption, emissions, and time-related costs without requiring additional infrastructure investment [
7]. For cold chain logistics, where delivery delays can directly cause product spoilage, route optimization must not only minimize travel time but also incorporate energy efficiency, carbon control, and industry-level collaboration.
The Vehicle Routing Problem (VRP), initially developed to optimize delivery schedules and minimize transportation costs [
8], has evolved to include multiple variations such as time windows, multiple depots, and heterogeneous fleets. In cold chain contexts, additional constraints such as spoilage rate, refrigeration cost, and carbon emissions have been incorporated into routing models [
9,
10], while some studies introduce carbon trading and taxation mechanisms into the optimization framework [
11,
12,
13]. Nonetheless, research gaps remain: (1) Most models consider only a single enterprise, overlooking the collaborative potential of joint distribution in cold chain logistics. (2) Many models assume static traffic conditions and fail to account for time-dependent urban congestion. (3) Scale flexibility is rarely addressed, limiting adaptability to dynamic operational demands. (4) Single-objective optimization dominates, whereas practical cold chain operations require multi-objective trade-offs among cost, time, and emissions.
To address these gaps, this paper proposes a Time-Dependent Green Vehicle Routing Problem (TDGVRP) that incorporates joint distribution and time-dependent traffic conditions into a multi-objective optimization framework. The key contributions of this study are as follows:
Developing a cross-enterprise joint distribution framework based on depot location and customer clustering.
Proposing a piecewise time-dependent travel time function to reflect real-world urban traffic dynamics.
Designing a three-phase heuristic algorithm composed of space–time clustering for zone partitioning, a greedy method for initial route allocation, and an Adaptive Large Neighborhood Search (ALNS) for route refinement and repair.
Conducting a joint cost analysis that captures the time-sensitive relationship between energy consumption and carbon emissions under various routing decisions.
Overall, this research aims to enhance cold chain logistics efficiency and promote its low-carbon transformation by integrating joint distribution mechanisms with time-dependent routing optimization.
The remainder of this paper is organized as follows:
Section 2 presents a comprehensive literature review.
Section 3 introduces the research objectives and develops the mathematical model.
Section 4 describes the proposed three-phase heuristic algorithm in detail.
Section 5 conducts the experimental design and analyzes the results. Finally,
Section 6 summarizes the main findings and outlines potential directions for future research.
2. Literature Review
The VRP remains a cornerstone of logistics optimization, with recent research extending beyond cost/distance minimization to incorporate environmental sustainability, joint distribution, and multi-objective trade-offs. This section reviews (1) time-dependent VRP, (2) low-carbon VRP, (3) multi-objective VRP, and (4) green collaborative cold-chain routing.
2.1. Time-Dependent Vehicle Routing Problem
Traditional VRP models often assume fixed travel speeds, which is insufficient for capturing real-world traffic dynamics. The TDVRP incorporates time-varying travel speeds by associating velocity with departure time through time-dependent speed functions, offering a more realistic representation of traffic conditions.
Pan et al. [
14] formulated a Multi-Trip TDVRP with Time Windows (MT-TDVRPTW) and developed a hybrid metaheuristic algorithm. Gmira et al. [
15] addressed the TDVRP with time windows by considering time-dependent shortest paths and applied a tabu search-based heuristic to solve the problem. In addition, Zhao et al. [
16] and Ma et al. [
17] introduced deep learning approaches (e.g., GRU, CNN-LSTM) to forecast short-term urban traffic speeds, providing real-time inputs for dynamic routing decisions.
2.2. Low-Carbon Vehicle Routing Problem
In response to growing environmental concerns, carbon emissions have been incorporated into VRP models, giving rise to the LCVRP. The objective is to minimize greenhouse gas emissions while maintaining logistical efficiency.
Alinaghian et al. [
18] proposed an integrated framework combining enhanced tabu search and differential evolution to address a low-carbon inventory-routing problem.
Ferreira et al. [
19] developed a heuristic and branch-and-cut method for LCVRP with split deliveries. Li et al. [
20] studied the impact of carbon taxes and emission quotas on bi-objective VRP and proposed a hybrid genetic-tabu search algorithm. Liu et al. [
21] addressed vehicle routing with heterogeneous fuel types, while Pu et al. [
22] formulated a multi-objective LCVRP for intra-city distribution considering fuel cost, emission cost, and distance, solved by an improved chemical reaction optimization algorithm.
2.3. Multi-Objective Vehicle Routing Problem
Multi-objective optimization has gained traction as real-world applications often require balancing conflicting goals such as cost, service level, and environmental impact.
Ghannadpour and Zarrabi [
23] constructed a bi-objective model focusing on fuel consumption and customer service priority, aiming to minimize total distance, fleet size, and fuel usage while maximizing customer satisfaction. Sun et al. [
24] proposed an optimization algorithm based on Economical Traveling Distance to optimize the configuration of a heterogeneous vehicle fleet for minimizing fuel costs. Zarouk et al. [
25] developed a model under soft time windows to minimize energy consumption and maximize satisfaction using a hybrid genetic-simulated annealing algorithm. These works underscore the importance of multi-objective frameworks for sustainable logistics.
2.4. Green and Collaborative Routing in Cold Chain Logistics
Cold chain logistics demands temperature-controlled transport, leading to high energy consumption and costs—over 40% of food requires refrigeration, which accounts for more than half of operational expenses [
3]. Consequently, GVRPs have been developed to integrate environmental factors, such as CO
2 emissions [
11,
12,
13,
18,
26,
27,
28].
However, most cold chain operations still use independent distribution, resulting in low vehicle utilization and high costs [
9,
29]. Joint distribution—where multiple companies share resources under centralized coordination—has shown promise in improving efficiency, reducing transport distance, cost, and emissions [
6,
27,
30,
31,
32,
33].
Despite these advances, few studies integrate joint distribution with time-dependent traffic conditions, often treating them separately. Bridging this gap is key to achieving efficient, scalable, and low-carbon cold chain logistics. This study addresses these gaps by (1) modeling time-dependent speeds across periods, (2) integrating joint distribution strategies to improve load rates and reduce emissions, and (3) proposing a three-phase heuristic tailored for large-scale, real-time cold chain routing problems. This approach supports green, efficient, and scalable logistics systems.
3. Problem Statement
3.1. Problem Description
In practical operations, multiple cold-chain logistics companies typically operate their own warehouses and serve customers within designated regions, as illustrated in
Figure 1 and
Figure 2. Generally, there are two distribution modes: independent distribution, where each company uses its own fleet to serve its customers without any collaboration between companies; and collaborative distribution, where all companies share warehouses, fleets, and customer resources. This paper focuses on the collaborative distribution model.
The formulation of the Time-Dependent Green Vehicle Routing Problem (TDGVRP) is based on the following assumptions:
- (1)
The geographic location, product demand, and preferred delivery time window of each customer are known.
- (2)
There are multiple distribution centers, and all vehicles are homogeneous refrigerated trucks.
- (3)
Vehicles may depart from a distribution center at any time as needed.
- (4)
Distribution centers have sufficient inventory and transportation capacity to fulfill delivery tasks.
- (5)
Overloading is not allowed during delivery, and customer demand cannot be split.
- (6)
The urban road network is time-dependent; that is, vehicles travel at different speeds in different traffic periods depending on the level of congestion.
- (7)
All delivery vehicles start from a distribution center and return to the same center after completing their delivery tasks.
- (8)
Vehicles have a fixed usage cost, and during distribution incur travel time costs, waiting time costs, and service time costs based on their total usage time (including travel, waiting, and service).
3.2. Travel Speed and Time-Dependent Travel Time Calculation
The traffic conditions of an urban road network are influenced by time. For example, during morning and evening peak hours, roads become heavily congested and vehicle speeds slow down, while in other periods speeds are also affected to some extent by traffic flow. Therefore, the travel speed of delivery vehicles is time-dependent. In addition, a delivery vehicle may need to cross multiple time periods to complete travel on a given road segment, making it difficult to directly calculate the travel time for that segment and requiring an appropriate handling method. Methods for calculating vehicle travel time are mainly based on the Malandraki model [
34] and the Ichoua model [
35]. Since the traffic conditions in our study satisfy the First In, First Out (FIFO) property, the Ichoua model is applicable. Moreover, several studies, such as Jabali [
36], Xiao and Konak [
37], and Franceschetti et al. [
38], have adopted this model to investigate vehicle travel times under time-dependent road network conditions. Accordingly, this study adopts the Ichoua-based approach to analyze the relationships between time periods, time-dependent vehicle speeds, road segment lengths, travel distances, and travel times. First, the operating hours of the distribution center are evenly divided into multiple time periods. Then, following the FIFO principle, a method is designed to calculate travel times for road segments that span multiple periods (as illustrated in
Figure 3 and
Figure 4):
Figure 3 shows the travel speed of delivery vehicles on segment
during different time periods.
Figure 4 shows the travel distance and travel time of delivery vehicles on segment
during different time periods. Based on the vehicle speeds in each period from
Figure 3 and the travel distances in each period from
Figure 4, the travel time for each period can be computed, thereby determining the total travel time required for a vehicle to traverse segment
.
To compute the time-dependent travel time of segment , the operating time horizon T of the distribution center is divided into H discrete periods of equal length , forming discrete time slots , where each time slot is indexed as with . Each slot represents a time interval .
When a vehicle travels along arc
, it may cross one or multiple consecutive time slots. Let
represent the travel speed on arc
during time slot
, and let
denote the service time at node
j. The time-dependent travel time function
is defined as:
Here, represents the departure time from node i. The detailed interpretation of is as follows:
When
, the travel occurs entirely within a single time slot. The travel time between nodes
i and
j is simply the arc length divided by the speed in that time slot:
When
, the travel spans two consecutive time slots. The arc
is split into two segments with different speeds. The arrival time at node
j can be expressed as illustrated in
Figure 5a:
where
and
are the distances traveled in the first and second time slots, respectively.
When
, the travel spans more than two time slots. The total travel time consists of three components: the remaining time from
to the end of slot
, the sum of full time slots crossed, calculated as
, and the partial time within the final slot
until arrival at node
j. The arrival time at node
j is then, as illustrated in
Figure 5b:
Here, the arc
is divided into
segments, as illustrated in
Figure 5b.
3.3. Symbols and Parameters
We adopt the notations summarized in
Table 1 to mathematically formulate the problem as an MILP model.
3.4. Objective Function
According to the studies by [
39,
40], the total energy consumption consists of three major components: fixed cost, driving energy consumption, and refrigeration energy consumption. Both are affected by dynamic payload variations and operational time. The corresponding carbon emission cost is computed based on the energy consumed and the carbon emission factors.
3.4.1. Fixed Cost
The fixed cost is incurred when a truck is dispatched from the depot to serve customers and is modeled as:
where
represents the fixed dispatch cost per vehicle and
K is the number of refrigerated trucks.
3.4.2. Driving Energy
According to the literature [
40,
41], the fuel consumption for a truck traveling from node
i to node
j is modeled as a linear function of payload weight:
Specifically,
represents the fuel consumption per 100 km when the truck is empty (L/100 km) and
represents the fuel consumption per 100 km when the truck is fully loaded (L/100 km).
Q denotes the maximum payload capacity of the truck (kg), and
is the actual payload on arc
(kg). The total driving energy cost, including fuel and carbon emissions, is:
3.4.3. Refrigeration Energy
According to the literature [
39,
40], the refrigeration energy depends on both travel time and service time, and linearly increases with payload weight due to thermal load:
The total refrigeration energy cost including carbon emissions is:
3.4.4. Time Window Penalty
In cold chain logistics, the condition of products upon delivery is crucial, as it directly affects customers’ revenue, inventory management, and quality control. Therefore, both early and late arrivals reduce customer satisfaction. In this model, a penalty cost is incurred whenever a vehicle arrives outside the specified time window of a customer [
42,
43,
44]. The penalty cost is formulated as:
where
represents the total time window penalty cost,
is the arrival time of vehicle
k at customer iii, and
and
are the unit penalty costs for early and late arrivals, respectively.
3.5. Mathematical Model
The complete objective function minimizing the total operational cost is:
Subtour Elimination (MTZ)
The constraints (13)–(16) represent the flow balance constraints, which ensure that each customer is visited exactly once and that route continuity is maintained for each truck. The payload balance constraints (17)–(19) dynamically track the remaining load of each vehicle along its route based on customer demands. Constraint (20) represents the relationship between the segment distance and the distance traveled by the vehicle across multiple time periods. Constraint (21) represents the time relationship of the vehicle traveling from the previous node to the next node. Constraints (22) and (23) represents the relationship between travel time, travel speed, and travel distance of a vehicle within the same time period. To prevent infeasible sub-tours, the model incorporates the MTZ constraints (24) and (25). Finally, constraints (26)–(28) specify the variable domains, including binary decision variables for routing and non-negativity requirements for arrival times and payloads.
4. A TDGVRP Heuristic
4.1. Algorithm Design
In the previous section, a Mixed-Integer Programming (MIP) model for the TDGVRP was formulated, as shown in Equations (1)–(28). Essentially, TDGVRP is a generalized form of the classical VRP, which is an NP-hard problem [
45,
46,
47]. Therefore, TDGVRP inherits this NP-hard complexity. While exact solvers like Gurobi can efficiently solve small-scale instances (e.g., fewer than 10 nodes), solving large-scale problems becomes computationally prohibitive.
To address the complexity and scale of the TDGVRP, this study proposes a novel three-phase heuristic algorithm designed to efficiently generate high-quality solutions for large-scale instances. The algorithm comprises the following three stages (see
Figure 6):
Phase 1: Spatial–Temporal Clustering and Depot Assignment. The algorithm begins by performing depot-driven spatial–temporal clustering, which jointly considers the geographic locations and time window constraints of customers. A spatial–temporal weighted clustering approach is used to assign customers to the most appropriate depot, laying the foundation for subsequent route construction.
Phase 2: Capacity-Aware Greedy Route Construction. Based on the clusters formed in Phase 1, a capacity-aware greedy algorithm is applied to construct initial truck routes for each depot. This phase takes into account vehicle capacity limitations and customer time windows. The algorithm dynamically determines whether a depot has sufficient vehicle resources to serve all assigned customers, whether support from nearby depots is required, or whether additional vehicles need to be dispatched. Although the greedy strategy enables rapid generation of feasible solutions, it may lead to suboptimal local solutions due to its heuristic nature and dependence on initial cluster quality.
Phase 3: Solution Refinement via Adaptive Large Neighborhood Search (ALNS). To overcome the limitations of the initial solution and escape local optima, the third phase integrates an Adaptive Large Neighborhood Search (ALNS) framework, as introduced by Ropke and Pisinger [
48]. This phase iteratively performs destruction and repair operations to explore a broader solution space, improving route feasibility and cost-effectiveness over successive iterations.
Together, these three phases provide a scalable and effective approach to solving the TDGVRP, balancing computational efficiency and solution quality through a structured heuristic optimization pipeline.
4.2. Depot-Driven Spatial–Temporal Weighted Clustering
In cold chain logistics, customers are not only geographically dispersed but also subject to strict time window constraints. For example, certain fresh food retailers may require deliveries in the early morning (e.g., [6:00, 9:00]), while others may prefer afternoon deliveries. This dual heterogeneity in spatial locations and time requirements makes traditional clustering methods based solely on spatial distance or time windows insufficient for generating cost-effective delivery plans.
To address this, a depot-driven spatial–temporal weighted K-means clustering method is proposed. This method fully integrates customers’ spatial coordinates and time window characteristics. Centered around depots, it partitions customers into balanced spatial–temporal clusters, each served by a designated depot. This approach effectively reduces cross-region transportation costs and minimizes time window violation penalties.
For each customer
i, the time window is first transformed into a time window center:
where
and
denote the earliest and latest service times for customer
i, respectively. The time window center
represents the customer’s temporal service preference. For example, a customer with a time window of [8:00, 10:00] has
, indicating a preference for delivery around 9:00.
To integrate spatial coordinates
and the time window center
into a unified feature space, normalization is performed to map all variables into the [0, 1] range:
The normalized feature vector for each customer is defined as:
A spatial–temporal weighted distance between customer
i and depot
m is then computed as:
where
is a user-defined weighting coefficient that balances the importance between spatial proximity and time window similarity.
Customers are assigned to the depot with the minimum weighted distance:
At this stage, each customer is associated with its nearest depot in terms of spatial–temporal similarity. The depot functions not only as a spatial center but also as a coordinator of time window compatibility.
Each cluster center is then updated based on the average of its assigned customers:
This updated center reflects the average spatial and temporal characteristics of the customers within the cluster. The algorithm checks whether or not the customer assignments remain unchanged compared to the previous iteration. If stable, or if the maximum number of iterations is reached, the clustering process terminates. Otherwise, the algorithm returns to the distance calculation step and continues iterating.
The proposed clustering method efficiently balances spatial proximity and time window compatibility. It produces depot-specific customer clusters, which significantly improve the quality of initial solutions for the MDVRPTW.
4.3. Capacity-Aware Spatial–Temporal Greedy Route Construction
After completing the spatial–temporal weighted clustering, each depot m is assigned a relatively independent customer subset . However, cold chain logistics not only requires consideration of geographical proximity and time windows but also imposes strict constraints on vehicle capacity. In multi-depot, multi-vehicle distribution scenarios, efficient capacity allocation is critical to minimizing total logistics costs.
To address this, a capacity-aware spatial–temporal greedy route construction Algorithm 1 is proposed. This algorithm jointly considers spatial distance, time window constraints, vehicle capacity, and cost factors, aiming to efficiently construct initial truck routes with minimized travel distance, time window violations, and fixed vehicle costs.
For each depot
m and its associated customer set
, a spatial–temporal priority score is computed for each customer
:
where
is a weighting coefficient balancing spatial distance and time urgency. Customers that are geographically closer to the depot and have tighter time windows are prioritized.
Algorithm 1: Capacity-Aware Greedy Route Construction with Multi-Depot |
![Sustainability 17 07515 i001 Sustainability 17 07515 i001]() |
A vehicle pool is initialized for each depot m. Customers are sorted in ascending order of their priority scores. Starting from the depot, the algorithm iteratively attempts to add customers to the current truck route while satisfying two conditions: (1) the remaining vehicle capacity must be greater than or equal to the customer’s demand, and (2) the arrival time plus service time must satisfy the customer’s time window constraints.
If no vehicle within the current depot can serve customer i, the algorithm checks whether or not neighboring depots have vehicles with sufficient remaining capacity and feasible time windows. If such vehicles exist, customer i is dynamically reassigned to that depot. If none of the existing vehicles can accommodate customer i, a new vehicle is instantiated at the current depot to serve the customer, incurring additional fixed costs, energy consumption, and carbon emissions.
After each customer insertion, the algorithm updates the remaining capacity and arrival times for the entire route. This process continues until all customers have been successfully assigned to feasible routes.
4.4. ALNS
The initial solutions for the TDGVRP heavily rely on the quality of the depot-based mVRP clustering. Consequently, the performance of the routing solution can be significantly influenced by the clustering results. To further improve the solution quality, this study introduces an ALNS metaheuristic tailored for the TDGVRP.
ALNS, initially proposed by Ropke and Pisinger [
48], extends the Large Neighborhood Search (LNS) framework introduced by Shaw [
49]. ALNS has been widely recognized for its excellent performance on the VRP and its variants. The core idea of ALNS is to iteratively destroy and repair parts of the current solution using a set of different operators. The choice of operators is guided by an adaptive, score-based roulette wheel selection mechanism that favors operators with better historical performance.
Let
represent the set of all available operators, including both removal and repair strategies. Each operator
is associated with a weight
, which determines its selection probability as follows:
Initially, all operators are assigned equal weights. As the search progresses, these weights are adaptively adjusted based on the operators’ performances. The scoring mechanism rewards operators under three conditions
, as summarized in
Table 2.
Every
iterations, the weights are updated using the accumulated scores. Let
denote the weight of operator
i at the current update, and
represent its accumulated score. The weight update rule is:
where
is a reaction factor that controls how strongly the recent performance affects the updated weight. Typical values for
range between 0.1 and 0.3.
During each iteration, the algorithm selects one removal operator and one insertion operator using the roulette wheel selection based on . The removal operator destructs part of the current solution by eliminating customers or routes, while the insertion operator repairs the solution by reinserting customers into feasible routes. The adaptive mechanism ensures that operators with better historical performance are more likely to be selected, enhancing the search efficiency and solution quality.
4.4.1. Destruction Operators
To enhance solution diversity and facilitate effective exploration, this study designs multiple destruction (removal) operators, including both random-based and cost-driven strategies. Each operator can be executed in two modes: (1) local mode, which restricts the removal within customers assigned to a single depot, and (2) global mode, which disregards the current depot assignment and allows removal across all depots.
- (1)
Random Removal
Random removal is designed to introduce stochasticity into the search process. Two types of random removal operators are considered:
Customer-Based Random Removal: Randomly select a number of customers p from the target set. In the local mode, customers are randomly selected within a single depot; in the global mode, customers are selected from the entire customer set regardless of depot assignment.
Route-Based Random Removal: Randomly select one truck route, then remove a consecutive subsequence of customers of length L from that route. For example, given the route , a possible removal could be . This strategy effectively disrupts spatial structures and enables more flexible route reconstructions.
To prevent excessive or insufficient removal, the number of customers to be removed is dynamically adjusted within predefined bounds. The update rule is:
where
and
represent the lower and upper bounds of removal quantity, ensuring a balanced trade-off between exploration and solution stability.
is the adjustment term at iteration
t.
After each removal operation, the remaining payload and time information of all affected routes are updated accordingly to maintain feasibility for subsequent repair.
- (2)
Worst Removal
This operator removes the customers that contribute most negatively to the objective function. The cost contribution of customer
i is defined as:
Customers with the highest values are prioritized for removal. This operator is particularly effective in eliminating bottleneck customers that degrade solution quality.
- (3)
Time-Window-Based Removal
This operator targets customers with significant time window violations, either arriving too early or too late. Customers with the largest deviations from their allowed time windows are removed first. This removal strategy is designed to improve the overall temporal feasibility of the solution.
4.4.2. Repair Operators
The objective of the repair operators is to effectively reintegrate the removed customers into the existing routes or create new routes when necessary, ensuring solution feasibility while minimizing the total cost.
Unlike traditional methods, the proposed insertion strategy allows cross-depot and cross-vehicle insertions. For each removed customer
i, Algorithm 2 evaluates all possible insertion positions across all available routes and depots.
Algorithm 2: Repair Operator with Capacity and Time Constraints |
![Sustainability 17 07515 i002 Sustainability 17 07515 i002]() |
The insertion cost of placing a customer subsequence
(or a customer
i) between nodes
in a route is defined as:
where:
is the additional distance cost caused by inserting
between
:
is the change in time window penalty due to insertion.
is the change in refrigeration cost, proportional to the increase in travel time.
We first attempt to insert the customer subsequence —which can represent either a single customer i or a sequence of consecutive customers—into any feasible position within the current depot’s routes. The insertion prioritizes the position with the minimal total cost while strictly satisfying both the vehicle’s capacity and the customer’s time window constraints. If no feasible insertion is found within the current depot, the algorithm extends the search to the k nearest neighboring depots based on spatial proximity, thereby maintaining a balance between solution quality and computational efficiency. If all existing routes fail to accommodate the subsequence due to capacity violations or time infeasibility, a new route is created at the current depot, which incurs a fixed vehicle cost.
To improve computational efficiency, the algorithm performs preliminary feasibility checks before calculating detailed insertion costs. It first verifies whether or not the remaining capacity of a candidate route satisfies the constraint:
If not, the insertion attempt at this position is immediately discarded. Additionally, the algorithm estimates the expected arrival time at the subsequence when placed between nodes ; if the arrival time falls outside the valid time window , the insertion is rejected. Upon a successful insertion, the algorithm updates the remaining vehicle capacity, recalculates the arrival and departure times for all affected nodes, and adjusts the time window penalties and refrigeration costs accordingly.
5. Computation and Experiments
The proposed algorithm for solving the TDGVRP was implemented in Python 3.9.13, and all computational experiments were conducted on a machine equipped with an AMD Ryzen 7 6800H 3.20 GHz processor and 16 GB of RAM. Gurobi was employed as the linear programming solver, and a time limit of 3600 s was set for all tests.
5.1. Instances and Parameter Settings
In this section, we evaluate the performance of the proposed algorithm on small-, medium-, and large-scale instances, and analyze carbon emissions through a real-world case study.
For dataset 1, we utilized data from the Solomon benchmark set. Given that the C-type instances in the dataset provide relatively long service times per customer—simulating long-haul transportation scenarios—we only selected the R-type and RC-type instances for our experiments. Two sets of instances were generated based on the TDGVRP model, namely the R101-R110 and RC101-RC108 instances.
Dataset 2 is derived from the operational data of four cold-chain logistics companies located in Chengdu, China [
50]. These companies deliver the same frozen products to customers situated in downtown Chengdu. Each company operates independently. Due to the use of single-distribution models, these companies are required to make significant upfront investments in warehouses, resulting in high distribution costs and relatively low long-term profits. Additionally, because of the uncertainty in specific cold-chain products and market demands, even though customer demand per delivery is relatively low, delivery frequency remains high. Thus, these companies must distribute small quantities of cold-chain products multiple times each day. The characteristics of the empirical data used in this study realistically reflect the operational environment of the cold-chain logistics sector. Therefore, the selected data can be regarded as representative of China’s cold-chain logistics industry and its distinctive features.
The case study data were sourced from the research of Liu (2020) [
50], which focused on joint distribution strategies for minimizing the total distribution cost of cold-chain products.
However, that study overlooked the impact of traffic conditions during transportation. Therefore, this study incorporates time-varying speed profiles under the carbon trading mechanism, taking into account the actual traffic conditions in Chengdu [
51,
52]. The detailed speed profiles are presented in
Table 3 and
Figure 7. Meanwhile, the parameter settings in this study are based on standard procedures established by Wang, Chen, and Liu [
27,
43,
50], as summarized in
Table 4.
5.2. Computational Results of Experiment 1
Since no benchmark solutions are available for the TDGVRP problem, this study compares the proposed Three-Phase Heuristic Algorithm (TPH) with the exact solutions obtained using the Gurobi solver, as well as with the Genetic Algorithm (GA). The Solomon R101–R110 and RC101-RC110 instances were selected for evaluation, with the truck capacity fixed at 50. Both the TPH and the GA were run 10 times, and the best solution and the average solution were recorded.
Table 5 and
Table 6 present the results for instances of size 15, 30, and 50. The column MILP represents the best solution (denoted as
) obtained from the Gurobi solver. The columns labeled Best and Avg correspond to the best and average solutions from the respective heuristic methods. The column Gap (%) represents the percentage deviation of the best solution from the MILP result, and Time(s) refers to the average computation time.
To comprehensively evaluate the performance of the proposed TPH algorithm, comparisons are made across three aspects: solution quality, computation time, and algorithm adaptability.
1. Solution Quality (Best and Average Performance). In the R-type instances (
Figure 8), the TPH algorithm yields best solution gaps in the range of
relative to the MILP results, with an average gap of only
2.44%. For several 50-customer cases, TPH even outperforms the MILP optimal solution, demonstrating strong local search and route reconstruction capability. Compared with the GA, the TPH achieves better best solutions by over
1.0% on average, especially in medium- and large-scale instances (30 and 50 customers), indicating superior solution stability and robustness.
For the RC-type instances (
Figure 9), which involve more complex customer distributions and tighter time windows, TPH maintains competitive performance with an average gap of 1.88%, clearly outperforming the GA’s average gap of 4.12%. In many cases, the best solutions found by TPH are very close to or even better than the lower bounds provided by Gurobi, confirming its strength under complex constraints.
2. Computation Time. In terms of runtime, the TPH algorithm averages around 60 s across all instance sizes—significantly faster than Gurobi, which often requires over 3600 s for large-scale MILP problems. Compared with GA, TPH achieves comparable or better runtimes, and in some cases it is even faster (see Time(s) column). This efficiency is especially beneficial for cold-chain logistics systems requiring real-time or near-real-time decision-making.
3. Overall Assessment and Applicability. In summary, the proposed TPH algorithm balances high solution quality with computational efficiency, making it particularly suitable for large-scale and constraint-intensive problems such as cold-chain logistics. Compared with the GA, TPH offers better solution stability and faster convergence. Its modular and flexible architecture also allows integration with real-world factors such as traffic dynamics and carbon pricing mechanisms, enhancing its engineering practicality and extensibility.
Therefore, the three-phase heuristic proposed in this study demonstrates strong empirical performance and broad application potential for solving a multi-depot, GVRP under time-window constraints.
5.3. Computational Results of Experiment 2
To evaluate the comparative performance of single-mode and joint-mode distribution strategies under realistic traffic conditions, this study adopts benchmark data from [
50] and incorporates time-varying speed profiles based on actual traffic patterns in Chengdu city. A carbon pricing mechanism is implemented at a rate of 0.25 per kilogram of CO
2, and each delivery model is solved through 10 repeated runs. The optimal results are summarized in
Table 7. The experimental findings clearly demonstrate the superior performance of the joint distribution model across multiple dimensions. Economically, joint delivery reduces total operational costs by 14.4%, thus alleviating the financial burden on cold chain logistics providers. From an environmental perspective, carbon-related costs are decreased by 23.12%, indicating the model’s strong potential for supporting carbon neutrality goals. Moreover, the joint mode leads to a 25% reduction in fleet size, reflecting enhanced vehicle utilization and resource efficiency.
In conclusion, the joint distribution model achieves a dual objective of economic efficiency and environmental sustainability. It offers significant synergetic advantages and practical applicability, particularly in cold chain logistics scenarios characterized by high delivery time sensitivity and energy consumption. This highlights its promising value for broader implementation in sustainable logistics systems.
5.4. Sensitivity Analysis of Experiment 1
The sensitivity analysis of Three-Phase Heuristic (TPH) algorithm performance was conducted based on the detailed results from R-type and RC-type instances with varying sizes (15, 30, and 50 customers), as shown in
Table 5 and
Table 6.
The TPH algorithm consistently achieved high-quality solutions across all problem scales. For R-type instances, the average optimality gap relative to MILP decreased slightly from 3.31% for 15-customer cases to 2.44% for 50-customer cases, indicating that the algorithm scales well with problem size and even improves solution quality for larger instances. Moreover, the computation time increased moderately from around 18 s (15 customers) to about 63 s (50 customers), demonstrating efficient scalability. Similar trends were observed in the RC-type instances, where the average gap remained low (1.52% for 15 customers, 1.88% for 50 customers), while the computation time increased reasonably from 23 s to 87 s. These results confirm TPH’s robustness and effectiveness for both small- and large-scale scenarios.
Comparing R-type and RC-type instances reveals that TPH performs slightly better on RC-type instances, which feature more complex customer distributions and tighter time windows. The average optimality gaps for RC-type instances were generally smaller or comparable to R-type, e.g., 1.52% vs. 3.31% at 15 customers, and 1.88% vs. 2.44% at 50 customers. This suggests the heuristic’s ability to effectively manage complex temporal and spatial constraints without significant loss of solution quality.
The average gaps for TPH are consistently close to the best solution gaps across all instances, with differences generally within 1–3%, reflecting stable solution quality over multiple runs. In contrast, GA exhibits larger gaps and greater variability between best and average solutions, especially on larger instances, indicating TPH’s superior robustness and convergence reliability.
TPH maintains a favorable balance between solution quality and runtime. While MILP requires significantly longer computational times (not listed but known to be several orders of magnitude higher), TPH’s runtimes remain practical (under two minutes for 50-customer problems). Compared to GA, TPH often achieves better solution quality with comparable or less computation time, confirming its suitability for real-time or near-real-time cold chain logistics applications.
The sensitivity analysis based on varying instance sizes and types validates that the proposed TPH algorithm is scalable, robust to problem complexity, and provides consistent high-quality solutions efficiently. These characteristics make it well-suited for practical large-scale cold chain routing problems involving complex constraints.
5.5. Sensitivity Analysis of Experiment 2
- (1)
Analysis of the Impact of Carbon Pricing
In the previous section, we analyzed joint distribution using the current market average carbon price (0.25 RMB/kg CO2). Given the inherent volatility of carbon prices, which may increase or decrease in the future, this section examines 12 different carbon price levels (RMB/kg CO2) to conduct a sensitivity analysis of carbon emissions, carbon costs, total costs, and the proportion of carbon costs to total costs.
As shown in
Table 8 and
Figure 10, when the carbon price is within the range [0, 0.5], carbon emissions exhibit an upward trend, peaking at 172.15 kg when the carbon price reaches 0.5 RMB/kg. Once the carbon price enters the range [0.5, 3], emissions begin to decline significantly, with the most pronounced reduction observed in this interval. The average emission reduction rate in this range reaches approximately 2.04% compared to the peak. In the range [3, 6], the rate of decline slows, with an average reduction rate of only about 0.86%. When the carbon price exceeds 6 RMB/kg, emissions stabilize around 156–157 kg, indicating diminishing marginal abatement effects.
As the carbon price increases, carbon costs grow almost linearly with the price level, despite the overall decline in emissions. For example, when the carbon price reaches 7 RMB/kg, the carbon cost is RMB 396.55, representing 9.4% of the total logistics cost. Furthermore, total costs also rise steadily with carbon pricing, from RMB 3318.08 at zero carbon price to RMB 4218.58 at 7 RMB/kg. These results suggest that at higher carbon prices, carbon costs become a substantial component of overall operating costs, warranting close attention in enterprise decision-making.
- (2)
Joint Impact of Carbon Pricing and Carbon Quota
Table 9 and
Figure 11 reveal that, regarding carbon pricing and carbon quotas, these two policy levers interact to shape enterprise logistics costs. For a fixed carbon price, increasing the carbon quota directly reduces the carbon deficit (i.e., the shortfall between actual emissions and allocated allowances), thereby lowering the carbon cost. When the carbon quota is lower than actual emissions, enterprises must purchase additional allowances, leading to positive carbon costs that rise proportionally with the carbon price. For example, at a carbon price of 0.5 RMB/kg, raising the quota from 0 to 150 kg reduces carbon costs from RMB 86.08 to RMB 11.08, with a corresponding drop in total costs from RMB 3544.85 to RMB 3469.85.
Once the carbon quota exceeds actual emissions, the carbon cost becomes theoretically negative, indicating that the enterprise would hold surplus allowances that could be sold in the carbon market. In this study, negative carbon costs are treated as zero when calculating total costs; thus, beyond this threshold the total cost remains constant despite further increases in quota. This also implies that, under the given assumptions, surplus allowances provide no additional cost advantage unless explicitly monetized in practice.
From a policy perspective, an excessively low carbon price or an overly generous quota allocation both weaken the incentive for enterprises to reduce emissions. Conversely, high carbon prices combined with tight quotas can significantly raise operating costs, prompting stronger abatement actions but also risking cost burdens. Therefore, an optimal balance between carbon price and quota allocation is essential to achieving emission reduction targets while maintaining economic viability.
6. Conclusions
This study addresses the cold chain vehicle routing problem under carbon emission constraints, comparing single-mode and collaborative distribution strategies. A mixed-integer programming model is developed for the TDGVRP, incorporating carbon emissions, spatiotemporal distances, refrigeration costs, and product damage risks. Following the “cluster first, optimize later” principle, a Three-Phase Heuristic (TPH) algorithm is proposed and validated through extensive benchmark experiments.
The main findings and contributions are as follows:
A capacity-aware spatiotemporal route construction method is proposed to generate high-quality initial truck routes, and an ALNS framework with customized destroy-and-repair operators tailored to TDGVRP characteristics significantly improves both solution quality and computational efficiency.
The model accounts for realistic traffic conditions via time-dependent vehicle speeds, enhancing practical relevance compared to fixed-speed assumptions and improving the accuracy of urban delivery scheduling.
Experimental results show that the joint distribution model reduces total costs by 14.4%, carbon-related costs by 23.12%, and fleet size by 25%, relative to the single-mode strategy, confirming its economic and environmental advantages in high-service-level cold chain logistics.
Sensitivity analysis indicates that effective carbon policy requires balancing price signals with quota flexibility. Higher carbon prices (0–7 RMB/kg) achieve up to 7.12% emission reduction but raise compliance costs, with the most cost-effective abatement near 3 RMB/kg. Quota expansion under moderate pricing can cut deficits by over 80% and slightly lower total costs, while an optimal policy corridor of 1–3 RMB/kg and quotas of 120–180 kg minimizes cost growth while sustaining over 5% emission reductions.
In summary, the proposed model and algorithm offer a practical decision-support framework for sustainable cold chain logistics, delivering substantial cost savings and emission reductions under realistic operational constraints. Future extensions may incorporate stochastic demand, uncertain travel times, and dynamic carbon policies to further enhance robustness and adaptability in evolving regulatory environments.