Next Article in Journal
Data-Driven Strategies for Complex System Forecasts: The Role of Textual Big Data and State-Space Transformers in Decision Support
Previous Article in Journal
Framework for Assessing the Sustainability Impacts of Truck Routing Strategies
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Waiting Strategy for the Dynamic Meal Delivery Routing Problem with Time-Sensitive Customers Using a Hybrid Adaptive Genetic Algorithm and Adaptive Large Neighborhood Search Algorithm

Glorious Sun School of Business and Management, Donghua University, Shanghai 200051, China
*
Author to whom correspondence should be addressed.
Systems 2024, 12(5), 170; https://doi.org/10.3390/systems12050170
Submission received: 5 March 2024 / Revised: 19 April 2024 / Accepted: 8 May 2024 / Published: 10 May 2024

Abstract

:
In this paper, we study the dynamic meal delivery routing problem (MDRP) with time-sensitive customers. The multi-objective MDRP optimization model is developed to maximize customer satisfaction and minimize delay penalty cost and riding cost. To solve the dynamic MDRP, a novel waiting strategy is proposed to divide the dynamic problem into a series of static subproblems. This waiting strategy utilizes the decision threshold to determine rerouting points based on the number of dynamic meal orders. Meanwhile, time-sensitive priority is introduced to accelerate assignment and routing decisions for orders from customers with high time sensitivity. For each static subproblem, a hybrid AGA–ALNS algorithm that incorporates the adaptive genetic algorithm and adaptive large neighborhood search is developed to improve both the global and local search capabilities of the genetic algorithm. We validate the performance of the proposed waiting strategy and the AGA–ALNS algorithm through numerical instances. In addition, managerial insights are obtained from sensitivity analysis experiments.

1. Introduction

With the continuous expansion of e-commerce, meal delivery has made people’s lives more convenient. According to a research report issued by CCFA (China Chain Store and Franchise Association) and NSRC (Ali New Service Research Centre), the Chinese online instant meal delivery market generated annual revenue of CNY 1003.6 billion (nearly USD 144 billion) in 2021, which accounted for 21.4% of the total revenue of the Chinese catering business, with a user base of 544.16 million [1]. In addition, the worldwide online meal delivery market is expected to reach USD 0.7 trillion in revenue by 2022, and the number of customers is projected to reach 2644.2 million by 2027 [2]. Numerous platforms, including Meituan and Eleme in China, as well as global meal delivery platforms like Uber Eats, Grubhub, and Door Dash, have emerged and rapidly expanded in recent years. Meituan, one of the largest meal delivery platforms in China, generated more than CNY 76.5 billion (about USD 10.6 billion) in revenue in the third quarter of 2023 [3]. Despite the considerable profits generated by the rapidly expanding instant meal delivery market, the increasing integration of meal delivery services into people’s daily lives has led customers to express higher expectations for delivery services, presenting more substantial operational challenges for platforms.
It should be noted that different customer groups are heterogeneous in their sensitivities to order delay [4,5,6]. Customers with high time sensitivity are often willing to pay additional delivery fees to enjoy their meals faster, whereas those with lower time sensitivity may accept longer delivery times for reduced delivery fees. Due to these varying time sensitivities among customers, platforms such as Uber Eats offer three options (priority, standard, and no rush) for meal delivery service to customers, as shown in Figure 1. However, a courier usually needs to perform multiple orders during the delivery process. Thus, how to generate a reasonable route to accommodate orders from customers with heterogeneous time sensitivity is a key issue that the platform needs to address.
The meal delivery routing problem (MDRP) is characterized by dynamics and stochasticity, and how to effectively deal with such characteristics of meal orders is another important issue faced by platforms [7,8]. Customers can place orders via meal-ordering applications at any time and place throughout the day. These orders typically include details such as the customer’s order time, meal type, restaurant location for pickup, and customer location for delivery. Upon receiving orders, platforms must efficiently assign them to couriers according to customer requirements. However, if couriers still have unfinished orders from previous periods, platforms need to adjust delivery routes to accommodate new orders. Notably, the number of meal orders varies at different times of the day. The study [9] points out that delivery demands present a visible tidal effect, with a sharp increase at 10:00–11:00 and 22:00–24:00 during the day. Therefore, it is another challenge for platforms to effectively handle dynamic orders during peak meal hours to avoid order delays.
With the continuous growth of the takeout market, academics are exploring possible solutions to the meal delivery problem. These studies include, but are not limited to, problems such as uncertainty in the meal delivery process [10], order assignment and route planning for couriers [11,12], and dynamic scheduling of meal orders [7]. However, these studies are based on the assumption that customers are homogeneous. Indeed, as an instant delivery service, the time-sensitive heterogeneity of customers is a crucial factor that should be considered in meal delivery. Hence, designing efficient solution strategies to serve such orders has important research implications.
To address the aforementioned issues, the meal delivery routing problem (MDRP) is investigated, which is characterized by the time-sensitive heterogeneity of customers and the dynamics of meal orders. Firstly, considering the time-sensitive heterogeneity of customers, a multi-objective optimization model is formulated to maximize customer time satisfaction while minimizing both delay penalty cost and riding cost. Secondly, a novel waiting strategy with time-sensitive priority is developed to process dynamic meal orders, which divides the dynamic problem into a series of static subproblems. In this waiting strategy, a decision threshold is introduced to determine the rerouting moment for dynamic orders. And time-sensitive priority is introduced to speed up assignment and rerouting decisions for orders belonging to customers with high time sensitivity. Finally, to solve each static subproblem, we develop a hybrid metaheuristic algorithm based on the adaptive genetic algorithm and adaptive large neighborhood search (AGA–ALNS). In AGA–ALNS, an enhanced best route with a stochastic insertion crossover (EBRSIC) operator is used to improve the algorithm’s global search capability. In addition, adaptive large neighborhood search (ALNS) is used as a mutation operator to improve the algorithm’s local search capability. Meanwhile, adaptive crossover and mutation probabilities are introduced to optimize the parameter selection of the algorithm. With dynamic MDRP instances of different scales, we illustrate the effectiveness of the waiting strategy with time-sensitive priority and the AGA–ALNS algorithm. Moreover, managerial insights are derived from sensitivity analysis experiments.
Compared with previous studies, the main contributions of this paper are summarized in three aspects. (1) We develop a multi-objective optimization model based on time-sensitive heterogeneous customers to solve the MDRP, which considers both perspectives of improving customer time satisfaction and reducing delay penalty cost and riding cost. This study enriches previous works on MDRP [7,8,13]. (2) A novel waiting strategy with time-sensitive priority is proposed, which further extends the research methods of MDRP for handling dynamic meal orders. (3) We further develop the AGA–ALNS algorithm for solving the optimization model and find the near-optimal solution for MDRP. To the best of our knowledge, this paper is one of the first attempts to solve MDRP using this hybrid approach.
The remaining structure of this paper is organized as follows. Section 2 reviews existing studies related to MDRP. Section 3 describes MDRP and defines the mathematical model of time-sensitive customer satisfaction and the operational costs of the platform. Section 4 presents the solution method of this paper, which incorporates the concepts of the waiting strategy with time-sensitive priority and the AGA–ALNS algorithm. Section 5 presents the computational experiments. Section 6 summarizes the conclusions and discusses future research directions.

2. Literature Review

In this section, we present a literature review of MDRP. MDRP is characterized by dynamically arriving orders, sharing similarities with the dynamic pickup and delivery problem (DPDP). Additionally, as an extended case of the vehicle routing pickup and delivery problem with time window (VRPPDTW), MDRP strictly regulates the order of pickup and delivery. Each order must be transported from a specific pickup location to the corresponding delivery location within a specified time interval, increasing the complexity of solving the problem. As a result, our review focuses on MDRP, DPDP, and solution methods for VRPPDTW.

2.1. Meal Delivery Routing Problem

Among the numerous relevant studies, Reyes et al. [14] first proposed the MDRP and developed a rolling-horizon repeated matching approach to solve large-scale dynamic MDRP. Yildiz and Savelsbergh [8] conducted a further investigation based on the study of Reyes et al. [14]. In contrast to Reyes et al. [14], they designed a simultaneous column- and row-generation algorithm to solve the MDRP and assumed perfect information about the order arrival. In order to avoid order delays caused by uncertain information during dynamic delivery, Ulmer et al. [7] proposed an anticipatory customer assignment (ACA) policy, which introduced a postponement to improve the flexibility of order assignment and a time buffer to reduce decisions that may cause delays. Wang and Jiang [13] developed a two-stage optimization algorithm to study customer satisfaction with time-sensitive heterogeneity. They found that considering time-sensitive heterogeneity was effective in improving customer time satisfaction. Xue et al. [10] investigated the meal delivery service with uncertain cooking time and travel time. They proposed a solution method that incorporates the island harmony search algorithm and scenario-based simulation. Bi et al. [11] established a multi-objective mathematical model to minimize customer dissatisfaction and the delivery cost for MDRP under the shared logistics services. In order to solve the proposed model, an improved ant lion optimizer (IALO) is developed. Liao et al. [15] designed a two-stage optimization algorithm for a green meal delivery routing problem to achieve objectives that maximize customer time satisfaction, courier balance utilization, and minimize carbon footprint. Wang [16] investigated three types of logistics services for multi-trip meal delivery routing problems, i.e., exclusive service and two sharing services. Through simulation analysis of real-world instances, the author found that two sharing services generated lower costs compared to the exclusive service. Tu et al. [17] studied an MDRP with dynamic arrivals for both crowdsourced couriers and meal orders. With the optimization objective of minimizing the delivery cost, a hybrid metaheuristic algorithm combining adaptive large neighborhood search (ALNS) and tabu search (TS) was proposed to solve the problem. Chen et al. [18] investigated the dynamic scheduling problem of instant delivery and proposed an online delivery scheduling algorithm with the optimization objective of minimizing the return time of the courier as well as the travel distance. Simoni and Winkenbach [12] studied the meal order assignment and route optimization problem under crowdsourcing delivery and proposed an order batching and assignment algorithm to solve the problem. Hu et al. [19] investigated the recovery management of meal delivery routes under four types of disruption events: vehicle breakdown, traffic jams, service time variation at restaurant locations, and service time variation at customer locations.
Unlike previous works such as [7,8,11,12,14], which assumed customers’ homogeneous demands, we consider the time-sensitive heterogeneity of customers when performing meal delivery routes, and develop a multi-objective optimization model for improving customer time satisfaction and reducing delay penalty cost and riding cost. Particularly, while Wang and Jiang [13] explored the MDRP considering the time-sensitive heterogeneity of customers, their study only analyzed the delivery problem for meal orders in a static scenario. In contrast, we extend their work to investigate a dynamic scenario where orders are generated dynamically.

2.2. Dynamic Pickup and Delivery Problem

In DPDPs, not all demand information can be known before the planning horizon begins, and future demand exists under a certain probability distribution. Such DPDPs are called dynamic and stochastic pickup and delivery problems [20,21]. Over the past two decades, scholars have strived to explore dynamic strategies for solving the DPDP under unknown demand information. Numerous approaches have been developed, such as the multiple scenario approach [22,23], and the waiting strategy. The waiting strategy is considered an effective method to reduce operational costs and improve service quality [24,25].
In previous studies, various waiting strategies have been explored, all focusing on deferring real-time demands and devising more efficient routes. Mitrović-Minić and Laporte [21] proposed four waiting strategies and evaluated their benefits in the dynamic pickup and delivery problem with time windows (DPDPTW). They found that the waiting strategy was effective in reducing detours and fleet size. Wong et al. [26] investigated the demand responsive transport problem in partially dynamic environments using waiting strategies. In this problem, they examined the impact on the system’s operational efficiency when the rate of dynamic demand changes. To solve the DPDP, Vonolfen and Affenzeller [25] proposed a waiting strategy that utilized historical demand information based on intensity measurement without further processing data. In a recent study, Park et al. [24] introduced a waiting strategy based on a rerouting indicator for addressing the vehicle routing problem with simultaneous pickup and delivery (VRPSPD). They argued that it is better for new demands to wait until reaching a strategically determined rerouting point, rather than be served immediately upon their arrival.
Similar to Park et al. [24], we develop a novel waiting strategy to solve the dynamic MDRP. Different from their study without time window constraints, the meal orders in this paper need to be delivered within a 40 min time window. Furthermore, our study explores the influence of customer time-sensitive heterogeneity on the waiting strategy, which is a more realistic consideration.

2.3. Solution Methods Associated with the VRPPDTW

VRPPDTW, as a variant of the vehicle routing problem (VRP) with additional constraints, is classified as an NP-hard problem. Current solution approaches for VRPPDTW primarily involve exact and metaheuristic algorithms. Exact algorithms, such as the branch-and-price algorithm [27] and the branch-and-cut algorithm [28], can achieve high-quality solutions to small-sized problems. However, as the scale of the problem increases, these exact algorithms are subjected to a sharp decrease in solution speed. Compared to exact algorithms, metaheuristic algorithms can find an approximate optimal solution within a reasonable time and are widely acknowledged as an efficient solution method.
Since the 21st century, many scholars have used some classical metaheuristic algorithms to solve the VRPPDTW, such as the particle swarm optimization (PSO) algorithm [29], variable neighborhood search (VNS) algorithm [30], and so on. In addition, some novel metaheuristics developed in recent years have also been demonstrated to be efficient methods for solving the VRP. For example, Lu et al. [31] developed a hybrid beetle swarm optimization algorithm (HBSO) to solve the 4PL routing problem. The HBSO algorithm is proposed based on the beetle antenna search (BAS) algorithm. A population search mechanism is introduced into the HBSO algorithm to improve search efficiency by turning one beetle into multiple beetles. In addition, the Dijkstra algorithm is introduced to initialize the population of the HBSO algorithm. Among numerous metaheuristic algorithms, the genetic algorithm (GA) has gained widespread application in solving the VRPPDTW due to its straightforward operation and global search capabilities. Wang and Chen [32] developed a co-evolution GA with the cheapest insertion method to solve the VRPPDTW, which generated better solutions within a reasonable time interval than the traditional GA. To solve the VRPPDTW with multi-depot, Dubey and Tanksale [33] proposed an elite GA and employed the 3-opt heuristic to further enhance the local search capability of the algorithm. One of the most important characteristics of metaheuristic algorithms is the ability to improve search performance by combining different algorithms [34]. Bi et al. [35] proposed a GA to solve an NP-hard problem with continuous domain, in which double-point crossover and non-uniform mutation operators were introduced to enhance the global and local search capabilities of the GA. To further improve the solution quality of the GA, some scholars have attempted to combine the GA with other metaheuristic algorithms. Wang et al. [36] introduced a two-stage algorithm for solving the VRPPDTW with multiple centers. In the first stage, they utilized the 3D k means algorithm to cluster the customer nodes. Subsequently, a hybrid GA–PSO algorithm is proposed to determine the routes. This hybrid approach aimed to enhance algorithm diversity and solution efficiency by employing coordinated operators between the GA and PSO. Zarouk et al. [34] developed a hybrid metaheuristic algorithm named MOGASA, which is based on the combination of GA and simulated annealing (SA). This hybrid approach harnesses the global and local search capabilities of both metaheuristic algorithms. Ren et al. [37] hybridized the fast elitist non-dominated sorting genetic algorithm (NSGA-II) with VNS for improving the search efficiency of the multi-objective GA.
Previous studies have continuously pursued the improvement of traditional GA to efficiently solve the VRPPDTW [34,36]. In this paper, we introduce a hybrid AGA–ALNS algorithm to solve the MDRP. Based on the framework of traditional GA, we enhance the algorithm’s global and local search capabilities by refining the crossover and mutation operators. In addition, adaptive crossover and mutation probabilities are introduced to avoid solution quality degradation caused by improper probability value setting. To the best of our knowledge, none of the studies have employed this hybrid AGA–ALNS algorithm to solve the MDRP.
With the above literature review, we compare this study with previous works in terms of the studied problem, objectives, problem characteristics, and solution methods. The summarized information is presented in Table 1. Our study stands out by integrating various factors such as time-sensitive heterogeneity of customers, dynamic demands, time window, pick and delivery, multiple objectives, and waiting strategy into the MDRP. None of the previous studies included all of these factors simultaneously. In summary, this study first investigates the time-sensitive heterogeneity of customers and develops a multi-objective model from the perspectives of customer time satisfaction as well as delay penalty cost and riding cost. Secondly, a novel waiting strategy is proposed to handle dynamic meal orders. The concept of time-sensitive priority is introduced to optimize order assignment and rerouting decisions for customers with time-sensitive heterogeneity. Finally, a hybrid AGA–ALNS algorithm is proposed to optimize the delivery routes for couriers.

3. Problem Statement and Modelling

In this section, we first give a detailed problem statement of the MDRP and then present the relevant settings for the dynamic problem. Afterward, the mathematical formulation of the static subproblem is presented in detail.

3.1. Problem Statement

In this paper, we study the dynamic MDRP with time-sensitive customers. We consider three types of customers with high, medium, and low time sensitivity. The higher the time sensitivity of customers, the lower their tolerance for order delay. Meal orders are dynamically generated over time, and each meal order contains information about the type of customer’s time sensitivity, the ordering time, the restaurant location for pickup, the customer’s location for delivery, and the delivery time window. This information becomes known to the platform only after the customer has placed the order. There is no fixed distribution center in the delivery area. Each courier starts at their initial location at the beginning of the decision period and performs a route including multiple orders. Couriers pick up meals from the restaurant and deliver them to the designated customer locations.
In order to effectively respond to real-time meal orders from customers with time-sensitive heterogeneity, the entire dynamic MDRP is divided into a series of static subproblems (details about the dynamic setting are presented in Section 3.2). Each subproblem aims to efficiently reoptimize delivery routes for couriers by integrating the time-sensitive heterogeneity of customers, the generated new orders, and the current routes being performed by couriers. To achieve this goal, a multi-objective function that maximizes customer time satisfaction and minimizes delay penalty cost and riding cost is proposed for each subproblem.
One of the challenging issues in formulating the model and developing the solution strategy is how to reorganize delivery routes for couriers based on the time-sensitive heterogeneity of the customers and the dynamic characteristics of meal orders. Here, we present an explanation of the investigated problem through the example in Figure 2. In the case presented in Figure 2a, the courier needs to provide delivery service for orders 1, 2, and 3 in the initial state. These orders correspond to customers with high, medium, and low time sensitivities, respectively. In the dynamic update phase shown in Figure 2b, the courier has completed the delivery of orders 1 and 2 and has arrived at the restaurant location of order 3 for pickup. The newly generated order 4 needs to be dynamically inserted into the current route to reorganize the courier’s delivery scheme. Figure 2c illustrates the reorganized courier route. Based on the customer’s time sensitivity while ensuring that the time window constraint is not violated, the courier first visits the pickup and delivery locations of order 4, and then travels to the delivery location of order 3.
From the above analysis, we have considered the following assumptions based on the characteristics of MDRP and realistic settings:
(1)
Meal orders are completed when the courier arrives at the restaurant;
(2)
The vehicle travels at a constant speed;
(3)
Each customer’s meal order can only be served by one courier;
(4)
The number of meals a courier can perform at one time cannot exceed the vehicle’s maximum capacity;
(5)
Each customer is limited to placing one order at a time. The number of meals in each order can vary, but it does not exceed a maximum of five.

3.2. Dynamic MDRP Setting

Not all orders can become known at the initial modelling period; they are dynamically generated over time. Therefore, we cannot directly generate delivery routes for all orders at the initial stage. To address this problem, we use the waiting strategy introduced in Section 4 to divide the dynamic MDRP into a series of static subproblems. Figure 3 visualizes the timeline for solving the dynamic MDRP. In each decision period, the system first collects dynamic orders from customers with heterogeneous time sensitivity. Then, the rerouting decision is implemented through order insertion at the beginning of the next period. Finally, the courier delivers meal orders according to the updated route.
In each subproblem, two types of meal orders exist within the system. The first type comprises new orders generated in the last period, which can be inserted anywhere in the routes if the capacity constraint is not violated. The second type of orders are those that have already been assigned to couriers. These orders cannot be transferred to another courier since the delivery route has already been generated. At the beginning of the current period, the system gathers newly generated orders from the last decision period and adds them to set N. Additionally, we define set U to indicate orders that have been assigned to couriers in previous periods but have not been completed, which includes orders for which pickups have been completed but have not yet been delivered as well as orders for which neither pickups nor deliveries have been completed.
At the beginning of each decision period, it is crucial to identify couriers’ start locations. We consider three possible scenarios: (a) Courier k has completed delivery service at location m but has not yet departed. We assume that courier k’s start location is m; (b) Courier k has already left location m and is on the way to location n. Then, we define courier k’s start location as n, and the available time for delivery in this period is the time after completing the service at location n; (c) Courier k has delivered all orders and is waiting for the next assignment. For simplicity, we assume that these couriers stay at their last customer location, which is also the start location where they can participate in the current decision period.

3.3. Mathematical Model for the Static Meal Delivery Routing Subproblem

In this section, we present the mathematical model of the static meal delivery routing subproblem with time-sensitive customers. The problem we studied in this paper can be defined as an undirected graph G = (V, A). As described in Section 3.2, there are two sets, namely N and U, in the subproblem, where set N contains new orders not yet assigned and set U includes orders that have already been assigned to couriers but have not been completed. In order to distinguish the visit locations of orders with different types in the model, we further define the set W = { i + , i i N } to indicate the restaurant and customer locations of the orders in the set N. In addition, the set M = { i + , i i U } is used to represent the restaurant and customer locations of all orders contained in the set U. m M also includes information about the matched couriers k(m) for each order. We use set C n o w to store the start locations of the couriers at the beginning of each decision period. c k C n o w indicates courier k’s start location. The notions of sets, parameters, and variables related to the model are shown in Table 2.

3.3.1. Customer Satisfaction with Heterogeneous Time Sensitivity

In the MDRP, a critical aspect of maximizing customer time satisfaction is ensuring that meal orders are delivered within the estimated time window [13]. In order to evaluate whether an order is delivered at the appropriate time, we employ the fuzzy membership function to formulate the customer satisfaction model. In addition, we introduce a time-sensitive coefficient α into the model to indicate the different tolerance for order delay by time-sensitive heterogeneous customers. The customer satisfaction with heterogeneous time sensitivity is represented by Equation (1).
S i = 1 , E i T a i k F i L i T a i k L i F i α , F i < T a i k L i 0 , T a i k > L i
In Equation (1), T a i k is the time for courier k to arrive at the customer location of order i. If it is within the desired delivery time window [ E i , F i ] , there is no impact on the customer’s experience, and customer satisfaction is 1. A delivery delay occurs when T a i k exceeds F i . Particularly, if T a i k exceeds the acceptable delayed delivery time L i , the customer satisfaction is 0. Otherwise, the customer satisfaction changes with the arrival time of the courier when T a i k is between F i and L i . We introduce a time-sensitive coefficient α to simulate the tolerance of delivery delay by customers with time-sensitive heterogeneity. When 0 < α < 1 , the customer has a low time sensitivity. α = 1 indicates that the customer has a medium time sensitivity and customer satisfaction decreases linearly with delay time. α > 1 represents that the customer has a high time sensitivity. Delivery delays will have a significant impact on customer satisfaction. In this paper, we take the values of α as 0.5, 1, and 1.5 to indicate that customers’ time sensitivity is low, medium, and high, respectively.

3.3.2. Delay Penalty Cost

The delay penalty cost evaluates the impact of order delays on meal delivery from the perspective of the platform’s operational cost. It is determined by the delay penalty per time unit and the duration of order delays. When an order is delivered beyond its estimated arrival time, a delay penalty cost occurs. Moreover, the longer the delay, the greater the penalty cost. To compute the penalty cost of meal orders from time-sensitive heterogeneous customers, we assign different values to the delay penalty cost per time unit β to reflect the distinct types of time sensitivity among customers. The mathematical model of delay penalty cost is as follows.
min β 1 i N U k K max { ( T a i k F i ) , 0 } ,   i f   c u s t o m e r   w i t h   l o w   t i m e   s e n s i t i v i t y β 2 i N U k K max { ( T a i k F i ) , 0 } ,   i f   c u s t o m e r   w i t h   m e d i u m   t i m e   s e n s i t i v i t y β 3 i N U k K max { ( T a i k F i ) , 0 } ,   i f   c u s t o m e r   w i t h   h i g h   t i m e   s e n s i t i v i t y
In Equation (2), β 1 , β 2 , and β 3 represent the delay penalty cost per time unit when the time sensitivity of the customer is low, medium, and high, respectively. And β 1 < β 2 < β 3 . Specifically, customers with low time sensitivity have a relatively large tolerance for order delays. An appropriate reduction in delay penalties for these orders can alleviate couriers’ pressure to perform urgent orders. Thus, we set β 1 to be the smallest delay penalty cost per time unit. Moreover, orders from customers with medium time sensitivity are more urgent than those with low time sensitivity; thus, β 1 < β 2 . Furthermore, order delays significantly impact customers with high time sensitivity, requiring higher penalties to enhance satisfaction. Therefore, β 3 is set as the highest delay penalty cost per time unit.

3.3.3. Riding Cost

Another major operational cost in meal delivery is the riding cost. The riding cost is mainly due to vehicle consumption during the delivery process, which is proportional to the mileage travelled by the courier. In this paper, the riding cost is calculated as shown in Equation (3).
min m , n W M C n o w k K λ d m n x m n k

3.3.4. Model Formulation

This section describes the multi-objective optimization model for the static meal delivery routing subproblem. Taking into account customers’ time-sensitive heterogeneity, the model aims to maximize customer time satisfaction and minimize operational costs of the platform, including delay penalty cost and riding cost. The relevant mathematical model is as follows:
max   F 1 = 1 N O i N U S i
min F 2 = β 1 i N U k K max { ( T a i k F i ) , 0 } ,   i f   c u s t o m e r   w i t h   l o w   t i m e   s e n s i t i v i t y β 2 i N U k K max { ( T a i k F i ) , 0 } ,   i f   c u s t o m e r   w i t h   m e d i u m   t i m e   s e n s i t i v i t y β 3 i N U k K max { ( T a i k F i ) , 0 } ,   i f   c u s t o m e r   w i t h   h i g h   t i m e   s e n s i t i v i t y
min F 3 = m , n W M C n o w k K   λ d m n x m n k
Subject to
S i = 1 , E i T a i k F i L i T a i k L i F i α , F i < T a i k L i 0 , T a i k > L i
k K m W M x i + m k = 1 , i N
m W M C n o w x m n k h W M x n h k = 0 , k K , n W M
m W M x i + m k m W M x m i k = 0 , k K , i N U
n W M x m n k ( m ) = 1 , m M
( T b m T b m c k ( m ) ) T a m = T b m , m M
( T b n > T b n c k ( n ) ) ( x m n k = 1 ) T a n = T a m + s m + t m n , k K , m W M C n o w , n M
( T b m T b n ) ( k ( m ) = k ( n ) ) T a m T a n , m , n M
( x m n k = 1 ) T a n k = T a m k + s m k + t m n , k K , m W M C n o w , n W
( x i + m k = 1 ) T b c k T a i + T a i , k K , i N , c k C n o w
( x m n k = 1 ) ( n = i + ) q n k = q m k + q i , k K , i N U , m , n W M
( x m n k = 1 ) ( n = i ) q n k = q m k q i , k K , i N U , m , n W M
q m k Q , m W M C n o w , k K
x m n k = 0 , 1 , k K , m , n W M C n o w
The objective function (4) is used to maximize the average time satisfaction for all time-sensitive customers. The objective (5) represents minimizing the delay penalty cost of meal orders while the objective function (6) represents minimization of the riding cost. Constraint (7) is the fuzzy membership function for customer satisfaction with heterogeneous time sensitivity. Constraint (8) ensures that each new order can only be served by one courier. Constraint (9) indicates flow conservation, which states that, when a courier enters a location, he or she must move from this location to the next. Constraint (10) ensures that the restaurant and customer location for the same meal order must be visited by the same courier. Constraint (11) represents that orders assigned in previous periods cannot be transferred to other couriers. Constraints (12)–(14) indicate that the sequence of visited locations already determined in previous decision periods cannot be changed after the new order is inserted. Constraints (15) and (16) ensure that new orders can only be inserted after the courier k’s current position. Constraints (17)–(19) illustrate the capacity constraints of the courier’s vehicle. Constraints (20) is the value ranges of the binary decision variable.

4. Solution Method

This section introduces the hybrid method proposed for solving the MDRP, which includes a dynamic order processing approach and a route optimization algorithm. Firstly, to efficiently handle dynamically generated meal orders and consider customers’ time-sensitive heterogeneity, a waiting strategy with time-sensitive priority is proposed to determine rerouting timing and divide the dynamic MDRP into several static subproblems. Secondly, to generate efficient delivery routes for couriers, the hybrid metaheuristic algorithm of AGA–ALNS is proposed to solve the subproblem. The flowchart of the solution method in this paper is shown in Figure 4.

4.1. Waiting Strategy with Time-Sensitive Priority

In the context of the MDRP, meal orders arrive dynamically, and order information is only available when the customer places the order. To address this issue, it is essential to transform the dynamic MDRP into a series of static subproblems and reoptimize courier routes by accounting for the new orders generated in each period. Previous studies often involved splitting the overall dynamic problem into multiple time periods of fixed length and rerouting for each subperiod [14,38,39]. However, this approach faces two significant challenges. Firstly, the inherent randomness of dynamic meal orders presents a challenge in devising optimal assignment strategies, particularly when the density of orders fluctuates across decision periods and exceeds the available delivery capacity. Additionally, this method lacks efficiency to handle urgent orders. Customers with high time sensitivity have low tolerance for order delays, necessitating quick system responses for order assignment and routing decisions.
To tackle the challenges mentioned above, we propose a waiting strategy with time-sensitive priority to address the dynamic MDRP. The waiting strategy determines whether the route needs re-optimization based on the cumulative number of orders. It makes order assignment and rerouting decisions considering the number of couriers in the current delivery system. This provides better adaptability to the stochastic characteristics of meal orders. This strategy also incorporates a time-sensitive priority to address customer heterogeneity. In cases of increased urgent orders from customers with high time sensitivity, the strategy shortens the decision period to serve these orders as early as possible. Conversely, for orders from customers with low time sensitivity, it is suitable to extend the decision period, allowing for more orders to be processed in a single decision period and thereby reducing rerouting costs. The pseudocode of the waiting strategy with time-sensitive priority is shown in Algorithm 1.
Algorithm 1: The pseudocode of the waiting strategy with time-sensitive priority
Begin
1: New   order   set   N
2: Sum   of   new   order   S N 0
3: t 1
4: Rerouting   counts   p 0
5:While  t t max  do
6: N O ( t ) New customer demand generated at moment t
7: N N N O ( t )
8:for i N O ( t ) do
9:  if customer with high time sensitivity then
10:    Time - sensitive   priority   T P 2
11:  else if customer with medium time sensitivity then
12:    T P 1.5
13:  else
14:    T P 1
15:  end if
16: S N S N + o r d e r ( i ) * T P
17:end for
18:if S N D T then
19:  Identify couriers’ unfinished routes and now order set N
20:  Perform rerouting decision by AGA-ALNS algorithm
21:   N
22:   S N 0
23:   p p + 1
24:end if
25: t t + 1
26:end while
End
Note: N represents the new order set; S N represents the sum of new orders; p is the rerouting count; N O ( t ) represents the set of new orders generated at moment t; T P is the time-sensitive priority; D T is the decision threshold.
The key idea of the waiting strategy is to divide the entire dynamic decision period into multiple static subproblems. It is assumed that the entire time horizon consists of n continuous time points. At each time point, the system generates multiple stochastic meal orders. When orders arrive in the system, the waiting strategy first recognizes the time-sensitive characteristics of the orders and then accumulates the generated new orders and adds them to the set N (Algorithm 1, lines 6–17). If the number of accumulated orders reaches the decision threshold (DT), the system takes this moment as a decision point. We take the time passed from the last decision point to the current decision point as a decision period. At the beginning of each decision period, the strategy identifies the couriers’ current locations and the orders on their routes that have not been completed. It re-optimizes the couriers’ delivery routes by inserting the orders in the set N into the existing routes using the AGA–ALNS algorithm in Section 4.2 (Algorithm 1, lines 18–24).

4.1.1. Decision Threshold

The decision threshold (DT) in the waiting strategy is used to determine the time needed to reoptimize the delivery route. Upon the arrival of an order, the waiting strategy accumulates the order without immediately executing assignment or rerouting decisions. Instead, it compares the cumulative number of orders with the DT. When the accumulated orders reach the DT, orders are assigned to couriers, and delivery routes are reoptimized. A larger DT means that, the more orders are accumulated by the system, the fewer counts the system has to reorganize the routes. However, when the value of the DT is too high, it increases the risk of order delays. In particular, meal orders are limited by the time window. If the arrival time of cumulative orders exceeds the latest delivery time allowed by the customer, customer satisfaction will decrease. Therefore, the reasonable setting of the DT is a key factor affecting the waiting strategy’s performance.

4.1.2. Time-Sensitive Priority

To enhance the satisfaction of customers with heterogeneous time sensitivities, we introduce the concept of time-sensitive priority in the waiting strategy. For customers with high time sensitivity, the system needs to schedule delivery routes for their orders as early as possible because their tolerance for order delays is low. Conversely, for customers with low time sensitivity, the decision time can be extended relatively, so that as many orders as possible can be assigned to couriers in one decision period. Therefore, we assign a time-sensitive priority to different time-sensitive customers in the waiting strategy, and the higher the time sensitivity of the customer, the higher the value of the time-sensitive priority. In this paper, we set priority levels as 1, 1.5, and 2, representing customers with low, medium, and high time sensitivities, respectively. According to this setting, if the current decision period has a high percentage of orders from customers with high time sensitivity, the system will shorten the order accumulation time and expedite the rerouting decision for these orders.

4.2. AGA–ALNS Algorithm

In this section, we present a hybrid metaheuristic algorithm known as adaptive genetic algorithm and adaptive large neighborhood search (AGA–ALNS) to solve the static meal delivery routing subproblem. The AGA–ALNS algorithm is designed within the framework of the genetic algorithm (GA). The GA, rooted in natural evolution theory, exhibits exceptional robustness and flexibility, rendering it a valuable approach to address NP-hard problems. On the one hand, GA stands out as a population-based metaheuristic algorithm with excellent global search capabilities. On the other hand, it boasts strong extensibility, facilitating easy hybridization with other heuristic or metaheuristic algorithms to enhance its capacity for solving complex optimization problems.
To achieve an efficient solution to the meal delivery routing subproblem, the AGA–ALNS algorithm is developed within the traditional GA framework, thus improving its performance in three aspects: (1) To effectively improve the global search capability of the GA and reduce computational complexity, an enhanced best route with a stochastic insertion crossover (EBRSIC) operator is introduced. (2) In order to enhance the local search ability and population diversity of the algorithm, the neighborhood search mechanism of ALNS is embedded as a mutation operator within the GA. (3) Adaptive crossover and mutation probabilities are incorporated into the algorithm to prevent degradation in individual quality due to improper operator probability settings. The pseudocode for the AGA–ALNS algorithm is shown in Algorithm 2.
Algorithm 2: The pseudocode of AGA–ALNS algorithm
Begin
1: P   Initialize   population   with   population   size   P s i z e
2: g e n 1
3:while  g e n g e n _ m a x  do
4: Calculate the fitness value
5: P ( g ) Tournament selection (P)
6:for   individuals   S i , S i + 1 P ( g ) do/Perform crossover operation based on EBRSIC/
7:   P c Calculate crossover probability based on Equation (24)
8:   r n Random generate from [0, 1]
9:  if r n P c then
10:    ( C 1 , C 2 )   Crossover   ( S i , S i + 1 )
11:    ( S i , S i + 1 ) ( C 1 , C 2 )
12:  end if
13:end for
14: P ( g )   update   P ( g ) after crossover operation
15:for each   individual   S P ( g ) do/Perform mutation operation based on ALNS/
16:   P m Calculate mutation probability based on Equation (25)
17:   r n Random generate from [0, 1]
18:  if r n P m then
19:    S * A L N S ( S )
20:  end if
21:  if o b j ( S * ) o b j ( S ) then
22:    S S *
23:  end if
24:end for
25: P ( g )   update   P ( g ) after mutation operation
26: P   Update   population   according   to   P ( g ) and P
27: Update weights of removal and insertion operators of ALNS
28: g e n g e n + 1
29:end while
End
Note: P s i z e represents the population size; g e n represents the current iteration number of AGA–ALNS; g e n _ m a x denotes the maximum iteration number of the algorithm; P indicates the population containing all individuals; P ( g ) represents the offspring population after tournament selection; P ( g ) represents the offspring population after crossover operation; P ( g ) represents the offspring population after mutation operation; P c is the adaptive crossover probability calculated by Equation (24); P m is the adaptive mutation probability calculated by Equation (25); r n is a random number within the interval [0, 1]; ( C 1 , C 2 ) represents individuals after crossover operation; S is the individual selected for the mutation operation; S * is the individual after the mutation operation.
The most significant modification in AGA–ALNS involves integrating the neighborhood search mechanism of ALNS into the traditional GA. A notable challenge in hybridizing ALNS with GA is that it adds to the computational complexity since ALNS itself requires several iterations of a single solution to generate an improved solution. To address this issue, in the mutation operator of AGA–ALNS, we conduct the neighborhood search operation in ALNS only once for each solution, retaining the new solution if it has a better objective value than the current solution to prevent degradation in solution quality (Algorithm 2, lines 18–23). Furthermore, in standard ALNS, the weights of the neighborhood search operator need to be updated using an adaptive adjustment mechanism. In this paper, we record operator scores based on the search performance of the operators for the offspring population in the current iteration and conduct this updating process at the end of each iteration in AGA–ALNS (Algorithm 2, line 27).

4.2.1. Chromosome Representation

In the GA, encoding and decoding mechanisms facilitate the transformation between a chromosome and delivery routes. The natural number-encoding method is employed to represent all nodes involved in the MDRP studied in this paper. Additionally, when dealing with different orders generated at the same restaurant, it is possible that one or more couriers may need to visit the location multiple times. To handle this situation, for multiple orders generated at the same restaurant, we replicate the restaurant nodes based on the number of orders by adding dummy location nodes.
Assuming that the number of orders is n, we use P = { 1 , 3 , ... , 2 n 1 } to indicate the set of restaurant nodes and D = { 2 , 4 , ... , 2 n } to indicate the set of customer nodes. ( 2 n 1 , 2 n ) represents the restaurant and customer nodes for the same meal order. Figure 5 illustrates the chromosome consisting of four orders and three couriers, and the delivery route after the decoding operation. In this example, 0 indicates the split node for different couriers’ routes. At the beginning, each courier departs from their initial locations and then travels to the restaurant corresponding to the order for pickup and the customer location for delivery. If orders from the same courier will not be delayed in a short time period, it is unnecessary for the courier to immediately deliver to the customer after the pickup is completed, which may cause higher delivery costs. For example, the most efficient route for courier 2 is to visit nodes 3 and 7 (pickup locations for order 2 and order 4) and then nodes 8 and 4 (customer locations for order 4 and order 2). The final delivery route for courier 2 is 3-7-8-4.

4.2.2. Population Initialization

In this paper, the initial population of the AGA–ALNS algorithm is generated using the random insertion method. As outlined in Section 4.1, the waiting strategy transforms the dynamic MDRP into multiple static subproblems. At the beginning of each decision period, the system must identify the set N comprising new orders and ascertain the status of all couriers in the current delivery scenario. In the case of a courier’s route that has yet to be completed, it needs to be reorganized by inserting orders from set N into feasible positions. Assuming the set of courier routes is denoted as R, it encompasses routes of couriers with unfinished orders as well as those who have completed all delivery orders from previous periods and are awaiting assignment of new orders. The procedure for generating the initial solution of the AGA–ALNS is outlined below:
  • Copy the set N and R as N’ and R’;
  • Select an order i N and a route r R randomly;
  • Insert the restaurant location and customer location of order i into the route r and ensure that the restaurant location precedes the customer location;
  • When order i is inserted, check the capacity constraints of the route r, and if the capacity constraints are violated, reselect the insertion location or another route;
  • Remove the order i in set N’ and update the route r in set R’;
  • Repeat step 2–5 until set N’ is empty;
  • Transform the route scheme in the set R’ into a chromosome with the encoding method of Section 4.2.1.
The above process is repeated until P s i z e chromosomes are generated as the initial population.

4.2.3. Fitness Calculation

In Section 3, we introduce customer satisfaction with heterogeneous time sensitivity, delay penalty cost, and riding cost as optimization objectives for the MDRP. In order to comprehensively consider the above optimization objectives, we use Equation (21) to transform the multi-objective problem into a single objective.
F c = k 1 F 1 + k 2 F 2 + k 3 F 3
where F 1 is customer satisfaction with heterogeneous time sensitivity, F 2 and F 3 indicate the normalization of delay penalty cost and riding cost, respectively; k 1 , k 2 , and k 3 are corresponding weights, and k 1 + k 2 + k 3 = 1 . Then, the fitness function in the AGA–ALNS algorithm is represented by Equation (22).
f = 1 F c
It is critical to note that the solution is deemed infeasible if the route scheme violates the capacity limit specified in constraint (19). For such generated infeasible solutions, a substantial penalty is imposed on the objective function to significantly reduce the fitness value associated with this chromosome.

4.2.4. Selection Operation

The binary tournament selection strategy is employed to select excellent chromosomes from the parent population to generate an offspring population for subsequent genetic operations. The core method of this selection strategy is to randomly choose two chromosomes from the parent population and remain the individual with the superior fitness value for inclusion in the offspring population. This process is iteratively repeated to form a subset of chromosomes until the number of selected individuals meets the population requirement, and any duplicate chromosomes are removed from this subset.

4.2.5. Crossover Operation

The crossover operation is crucial to ensure the global search capability of the AGA–ALNS algorithm. However, classical crossover operations such as single-point crossover and multi-point crossover do not consider the vehicles’ routes which are not applicable to solve the VRP. With the intensive research, a series of crossover operations were developed for solving the VRP, such as best cost route crossover (BCRC) [40], best route with stochastic insertion crossover (BRSIC) [41], and so on. Among them, the BRSIC operation can effectively reduce computational complexity while guaranteeing solution quality by randomly removing and inserting orders in consecutive positions several times. However, BRSIC cannot be directly used to solve the static meal delivery routing subproblem. Firstly, in the subproblem, only the insertion positions of new orders in set N can be optimized. Delivery sequences determined in previous periods cannot be changed. Moreover, there are both restaurant and delivery locations for each meal order, and the sequence of visits to these locations is strictly limited. Therefore, we propose an enhanced BRSIC (EBRSIC) to solve meal delivery routing subproblems.
Figure 6 and Figure 7 illustrate our EBRSIC operator through an example. In this example, three courier routes are included. The orders (5, 6) and (7, 8) that have been assigned but not yet completed in the previous periods belong to courier 1 and courier 2, respectively. n e w 1 n e w 4 are the new orders from set N. As shown in Figure 6, EBRSIC first selects two parent chromosomes. Subsequently, it randomly selects n new orders from the set N in each parent chromosome (n = 2 in this example) and removes the restaurant location n e w + and the customer location n e w of the chosen new order from another chromosome. Figure 7 illustrates the chromosome repair process, where k random reinsertions are performed for each removed order (k = 3 in this example). The optimal insertion solution is chosen from the k results based on the fitness value. In addition, the reinsertion of an order needs to ensure that the restaurant location and the customer location are on the same route, and the restaurant location precedes the customer location. Figure 7a,b demonstrate this process. As shown in Figure 7c, when all removed orders are inserted into the chromosomes, new offspring chromosomes are finally generated. In this paper, we set the values of n and k are 2 and 10, respectively.

4.2.6. Mutation Operation Based on ALNS

The mutation operation in AGA–ALNS aims to enhance the local search capability and increase population diversity. The ALNS algorithm uses various operators to remove certain orders from the current route, reinserting them into appropriate locations. This process effectively enhances the neighborhood search capability. Additionally, an adaptive adjustment mechanism updates the weights of the removal and insertion operators based on the operators’ performance, further augmenting the search capability for high-quality solutions. Consequently, ALNS is introduced into the AGA–ALNS algorithm as a mutation operator, and certain mechanisms are adjusted to enhance the hybridization of ALNS with population-based metaheuristic algorithms. The searching method of ALNS for a single solution is shown in Algorithm 3.
Algorithm 3: The pseudocode of ALNS
Begin
1: S Current solution
2:Identify new orders in S   and   add   them   to   set   o n
3: R O Select the removal operator according to the roulette wheel method and operator weights
4: ( S * , o k ) R O ( S , o n ) /Remove k orders from S  that   belong   to   o n /
5: I O Select the insertion operator according to the roulette wheel method and operator weights
6: S * * I O ( S * , o k ) / Reinsert   the   removed   k   orders   into   S * /
7:if  o b j ( S * * ) < o b j ( S )  do
8: S S * *
9: π R O π R O + σ 1 , π I O π I O + σ 1
10:else
11: π R O π R O + σ 2 , π I O π I O + σ 2
12:end if
End
Note: S represents the current solution; o n denotes the temporary set for storing new orders; R O represents the removal operator after roulette wheel method; ( S * , o k ) denotes the order set o k removed from the current solution, and the solution S * after the removal operation; I O represents the insertion operator after roulette wheel method; S * * indicates the solution after insertion operation; π R O is the score of removal operator; π I O is the score of insertion operator; σ 1 and σ 2 indicates the score increment parameters.
In ALNS, the algorithm first identifies the orders belonging to the set N in the current solution, since the neighborhood search operator can only be performed on these orders in the subproblem. In addition, the neighborhood solution is generated by selecting the removal and insertion operators based on the weights and roulette wheel method. Finally, the algorithm determines whether to accept a neighborhood solution based on the objective value and updates the operator score. As mentioned before, to reduce computational complexity, we perform only one neighborhood search for each solution in the population, and to ensure solution quality, we only accept the solution with a better objective value. Therefore, we modify the score update mechanism for the neighborhood search operator. σ 1 > σ 2 are two operator score increment parameters. When the objective value of the neighborhood solution surpasses that of the current solution, the operator score increment parameter is σ 1 ; otherwise, it is σ 2 (Algorithm 3, line 7–12).
(1)
Neighborhood search operators
Removal and insertion operators are pivotal to ensuring neighborhood search quality in ALNS. In this paper, the removal and insertion operators are as follows:
Random removal operator: this operator randomly selects multiple orders that belong to set N and removes them from the current solution.
Worst removal operator: the operator initially computes the change in objective value after removing order i N , based on c o s t ( i , S c u r r e n t ) = f S c u r r e n t f i ( S c u r r e n t ) . Subsequently, it removes the portion of orders with the most significant change in objective value from the current solution.
Random insertion operator: similar to the random removal operator, the random insertion operator randomly inserts the removed orders into solution S, while ensuring capacity constraints.
Greedy insertion operator: the method of the greedy insertion operator is to find the position in the solution S that causes the lowest change in the value of the objective function to insert the removed order i. Let Δ f i , h represent the difference in the objective value after inserting order i into position h. The objective of greedy insertion is m i n h S { Δ f i , h } .
Regret insertion operator: unlike the greedy insertion operator, regret insertion introduces look-ahead information [42]. The operator first finds the best insertion position h1 and the second-best insertion position h2 of removed order i in the solution S using a greedy insertion operator and uses the regret value to represent the difference between the two positions. The objective of the regret insertion operator is to find an optimal position that maximizes the regret value. The formula for calculating the optimal regret value is m a x { Δ f i , h 1 Δ f i , h 2 } .
(2)
Adaptive weight adjustment mechanism
As outlined in Algorithm 3, ALNS utilizes a roulette wheel method to select removal and insertion operators based on their weights. In the standard ALNS algorithm, operators’ weights undergo adaptive updates through operator scores after several iterations. However, the AGA–ALNS algorithm in this paper incorporates ALNS as a mutation operator within the genetic operation, necessitating a modification to adapt this mechanism to population-based searches. In this algorithm, the initial weight of each operator is set to 1. In each iteration of the mutation operation, ALNS selects chromosomes in the offspring population probabilistically and executes a neighborhood search operation. The algorithm records the operators’ scores based on their performance. At the end of each iteration, the weights are adaptively updated based on the operators’ scores. The method of updating the weights is shown in Equation (23).
w i , j + 1 = w i , j , π i , j = 0 ( 1 δ ) w i , j + δ π i , j μ i , j , π i , j 0
In Equation (23), w i , j is the weight of operator i at the j-th iteration; π i , j is the score of operator i at the j-th iteration; μ i , j is the number of times the operator is used at the j-th iteration; δ 0 , 1 is the reaction factor.

4.2.7. Adaptive Crossover and Mutation Probabilities

Inadequate parameter settings for crossover and mutation operations in the GA can degrade solution quality. Chromosomes with lower fitness values need a higher crossover/mutation probability to produce better individuals. In addition, chromosomes with higher fitness values require a lower crossover/mutation probability to prevent individuals from being destroyed. To address this issue, we introduced the adaptive crossover and mutation probabilities used by Tang et al. [43] into AGA–ALNS, which determines probabilities by evaluating chromosome fitness values.
P c = P c i n i t F max     f F max     F a v g f F a v g P c i n i t   f < F a v g
P m = P m i n i t F max     f c F max     F a v g f c F a v g P m i n i t   f c < F a v g
In Equations (24) and (25), P c i n i t represents the initial crossover probability; P m i n i t is the initial mutation probability; f denotes the greater fitness value in the two parent chromosomes in crossover operation; f c represents the fitness value of chromosome c; F max is the largest fitness value in the current population; F a v g is the average fitness value of the current population.

5. Computational Experiments

In this section, we perform computational experiments to test the effectiveness of the proposed method in solving the dynamic MDRP. Firstly, we describe the instances used in the experiments. After that, we compare the AGA–ALNS algorithm with two GA-based algorithms and the simulated annealing (SA) algorithm in previous studies to validate the performance of the proposed algorithm in addressing the dynamic MDRP. In addition, the effectiveness of the waiting strategy is estimated by comparing it with different dynamic order-processing strategies. Finally, some potentially managerial insights are derived from sensitivity analysis experiments. All experiments presented in this paper were coded using MATLAB R2020a software. And all experiments were conducted on a computer with an Intel(R) Core (TM) i7-10700T CPU @ 2.00 GHz and 16 GB of RAM.

5.1. Experimental Setting

To verify the effectiveness of the proposed waiting strategy and AGA–ALNS algorithm in this paper, we generate four instance groups (ISGs) with different scales to perform numerical experiments. And each ISG includes five different instances. The experimental instances used in this paper are based on real data from Grubhub (https://github.com/grubhub/mdrplib (accessed on 12 November 2023)) [8,14] and delivery characteristics in Chinese regions. For meal delivery services in most regions of China, the typical delivery area of each courier is within 5 km [15]. Therefore, in each instance, we randomly select a 5 km area from the dataset of Grubhub to determine the coordinate locations of restaurants, customers, and couriers. Similar to the setting of Chen et al. [18] on dynamic instant delivery services in the Chinese region, the total scheduling period in each instance is half an hour, and orders are dynamically generated over time. The instance groups used for the experiments in this paper can be downloaded from https://github.com/Hellogscode/MDRP-instance (accessed on 14 April 2024).
In each instance, the latest delivery time and the acceptable delayed delivery time for each order are generated less than 40 min after the customer places the order. Additionally, following the setting of Liao et al. [15], where many customers prefer their meal orders to be delivered as soon as possible, we set the earliest delivery time to be the placement time of the order. Each customer has only one associated order and the number of meals included in the order is randomly generated in the interval [1,5]. The capacity of the courier’s vehicle is 15 meals. The service time at the customer location will not exceed 5 min. To simulate customers’ time-sensitive heterogeneity, in each instance, we set a time-sensitivity parameter for each order and use the value of [1,2,3] to indicate that customers’ time sensitivity is high, medium, and low, respectively. And to ensure the fairness of the experiment, we evenly distributed the number of different time-sensitive types of customers. On the basis of the relevant literature [12,15,17] and the stability cases of the instances, Table 3 shows the parameter settings used in this paper.
Through the above analysis, we generate four ISGs with different scales for experiments, and the settings are shown in Table 4. Following the criteria proposed by Chen et al. [18], which suggests an average of two orders generated per minute, we set instances on different scales by varying the density of order generation. The number of meal orders ranges from 25 to 100, and the number of couriers ranges from 5 to 15. For each ISG, we create five instances of the same scale. Additionally, in the waiting strategy, the DT value changes with the ISG scale, being twice the number of couriers. For example, in ISG1, where the number of couriers is 5, the DT value is set to 10.

5.2. Performance of AGA–ALNS Algorithm

To further evaluate the performance of the AGA–ALNS algorithm in solving the MDRP, we compare it with two GA-based algorithms (enhanced GA [44] and GA–LNS [45]) and the simulated annealing (SA) algorithm [46]. We chose these comparison algorithms for the following reasons: (1) Two GA-based algorithms incorporate the modified BCRC as a crossover operator, providing a basis for comparing the performance of the EBRSIC operator in the AGA–ALNS algorithm; (2) GA–LNS uses large neighborhood search (LNS) as a mutation operator, enabling us to assess the effectiveness of ALNS in enhancing the local search capability of the AGA–ALNS algorithm; (3) Enhanced GA adaptively adjusts the crossover probability using a local optimality detection strategy, which is employed to validate the performance of the adaptive probability. If the same solution is obtained in ten consecutive generations in the enhanced GA, it is recognized as reaching a local optimum. The value of crossover threshold is reduced by 10%. If the crossover threshold is reduced to 0.1, it is reset to 1; (4) In order to comprehensively validate the solution quality of the AGA–ALNS algorithm, SA is employed as a comparison algorithm. SA is commonly used to solve VRP and is also a common benchmark for algorithm performance comparison. The relevant parameters of comparison algorithms are set as follows: population size and maximum iterations for the two GA-based algorithms are set to 50 and 100, respectively; in enhanced GA, the initial crossover threshold is 1, and the mutation probability is 0.1; both crossover and mutation probability are set to 1 in GA–LNS; in SA, the number of iterations is 2000, the initial temperature is 50, and the cooling rate is 0.98. Table 5 compares the results of the four algorithms in different ISGs, including customer time satisfaction F 1 , delay penalty cost F 2 , riding cost F 3 , and computation time (CT). To ensure a comprehensive comparison of results, we employ the waiting strategy proposed in this paper to handle dynamic orders for each algorithm. Additionally, we execute each algorithm 10 times in each instance and report the optimal results from the 10 runs in Table 5.
According to the average results of the 20 instances in Table 5, the following conclusions are drawn. Firstly, the customer satisfaction of AGA–ALNS is 93.97%, which is 2.86%, 6.13%, and 16.21% higher than enhanced GA, GA–LNS, and SA, respectively. Secondly, AGA–ALNS achieves a minimum delay penalty cost of CNY 13.64, which saves CNY 7.01, CNY 17.69, and CNY 85.91 relative to enhanced GA, GA–LNS, and SA, respectively. In addition, AGA–ALNS generates a riding cost of CNY 363.24, which is a decrease of CNY 9.93 and CNY 105.31 compared with enhanced GA and SA, but an increase of CNY 17.67 compared with GA–LNS. Finally, the computation time of AGA–ALNS is 103.6 s, which is significantly reduced compared with enhanced GA (245.01 s) and SA (131.93 s), and the difference between AGA–ALNS and GA–LNS (101.48 s) is only about 2 s. Although GA–LNS produces less riding cost and CT, GA–LNS results for both customer time satisfaction and delay penalty cost are inferior to those of AGA–ALNS in all cases. In the MDRP, as a strongly timely vehicle routing problem, meal delivery punctuality is a crucial factor to measure route effectiveness. Therefore, the AGA–ALNS algorithm proposed in this paper has obvious advantages in terms of both accuracy and computation time when dealing with the dynamic MDRP.
In order to further illustrate the solution method in solving the dynamic MDRP, Table 6 presents the optimal delivery routes generated by the waiting strategy and the AGA–ALNS algorithm for the instance from ISG1 [47]. The waiting strategy categorizes orders based on customer time sensitivity and accumulates them until the decision threshold (DT) is reached. This process divides the dynamic horizon into four decision periods: (0, 9), (9, 16), (16, 28), and (28, 30). Each decision period corresponds to a static meal delivery subproblem, and the AGA–ALNS algorithm is employed to solve the subproblem. In each decision period, the system assigns orders and reoptimizes routes considering couriers’ load capacities and starting locations. For instance, in the decision period (9,16), where courier 1 and courier 5 are awaiting new assignments, the system allocates order 9 and order 15 to courier 1, while order 11, order 12, and order 16 are allocated to courier 5. The delivery routes of the two couriers are R9→C9→R15→C15 and R11→R16→R12→C11→C16→C12, respectively. Meanwhile, courier 2, courier 3, and courier 4 still have unfinished orders on their routes, with only one additional order assigned to each of them. The final delivery routes generated by the solution method are also presented in Table 6.

5.3. Performance of the Waiting Strategy with Time-Sensitive Priority

In this section, the performance of the waiting strategy with time-sensitive priority (WSTP) is further evaluated through numerical experiments. To demonstrate the effectiveness of the WSTP, we propose two comparison strategies. On the one hand, WS is developed to indicate where time-sensitive priority is not considered in the waiting strategy. On the other hand, the rolling horizon (RH) method is employed to validate the effectiveness of the two waiting strategies. The DT of the WS takes the same value as the WSTP in Table 4. In the RH method, we set the scheduling period to 5 min. The performance of the three strategies in different instances is summarized in Table 7 and Figure 8. We employ the AGA–ALNS algorithm proposed in this paper to solve each static subproblem. And we execute each strategy 10 times in each instance and report the optimal results from the 10 runs in Table 7.
According to the results in Table 7, we found the following: Firstly, both waiting strategies perform better than the RH method on all objectives. In the average results of the 20 instances, the customer satisfaction calculated by RH decreased by 4.45% and 3.39% relative to WSTP and WS, respectively; the delay penalty cost increased by CNY 11.68 and CNY 9.11, respectively; and the riding cost increased by CNY 25.85 and CNY 18.21 relative to the two waiting strategies, correspondingly. This conclusion indicates that the waiting strategy, which uses the number of meal orders as the timing of dynamic processing, is more effective than the rolling horizon method in solving MDRP. In addition, in the comparison of the two waiting strategies, WSTP produces higher customer satisfaction as well as lower delay penalty cost and riding cost. WSTP generates a 1.06% increase in customer satisfaction relative to WS. Moreover, WSTP saves CNY 2.57 and CNY 7.64 in delay penalty cost and riding cost, respectively. Thus, this conclusion provides further evidence that it is essential to consider the customer’s time-sensitive priority in the waiting strategy.
Figure 8 visualizes the above results. In Figure 8, we also compare the average rerouting counts of the three strategies over 20 instances. Relative to the RH method, the two waiting strategies generate lower rerouting counts, which indicates that postponing the assignment of more meal orders while guaranteeing customer satisfaction will generate a lower cost. In addition, accelerating the decision to reroute based on the customers’ time-sensitive type will generate better routes, which explains the that, although the rerouting counts of the WSTP increase relative to the WS, the WSTP outperforms the WS in each objective.
In order to further investigate the impact of the three dynamic processing strategies on the service efficiency of time-sensitive heterogeneous customers, we analyze the delay times of orders from customers with high, medium, and low time sensitivities by choosing one instance from each of the four ISGs. The results are presented in Figure 9a–d. As illustrated in Figure 9, it is observed that, in most cases of the WSTP, the delay time tends to decrease with higher customer time sensitivity. Conversely, the WS and RH methods do not have a significant regular pattern for reducing the delay time of orders from heterogeneous time-sensitive customers. This finding suggests that the incorporation of time-sensitive priority into the waiting strategy effectively addresses orders from heterogeneous time-sensitive customers. The decision horizon of orders is shorter for customers with higher time sensitivity, thereby effectively mitigating the risk of order delays.

5.4. Managerial Insights Based on Sensitivity Analysis

This section conducts some sensitive analysis to derive managerial insights that support platform operational decisions. To ensure representation across different problem sizes, we select one instance from each ISG for analysis.

5.4.1. Sensitivity Analysis on Different DT Values

In the waiting strategy, the DT determines the number of orders accumulated in this scheduling and the decision timing for rerouting. In Section 5.1, we set the DT value to be twice the couriers’ number K for different ISGs. In order to comprehensively analyze the impact of DT, in this subsection, we set the values of DT to K, 2 K, 3 K and 4 K, respectively.
The results of the objective values when the DT is changed in the four ISGs are shown in Figure 10a–d. Moreover, Figure 10 also records the variation in rerouting counts under different DTs. As shown in Figure 10, among the four instance groups, when the value of DT is K, the results of the three objectives are the worst and the number of reroutes is also the highest. When the DT is changed from K to 2K, the results are significantly improved. However, the results for each objective become worse when DT values are 3K and 4K, although rerouting counts are further reduced. The reason for this situation is that, when the DT takes a very small value, the waiting strategy accumulates a small number of orders. More frequent rerouting will result in higher detour costs. In addition, when the DT value is too large, although it decreases courier rerouting counts, the system accumulates more orders. As a strong timely problem, if the number of couriers is insufficient to support the delivery of these orders, more delays will occur. Therefore, it is crucial for decision-makers to optimize operational costs and enhance customer satisfaction by appropriately setting the value of DT in accordance with the current number of couriers in the delivery system.

5.4.2. Sensitivity Analysis on the Time-Sensitive Priority in the Waiting Strategy

In order to flexibly process dynamic orders from customers with heterogeneous time sensitivity, we introduce the concept of time-sensitive priority into the waiting strategy. In Section 4.1.2, we assign time-sensitive priorities of 1, 1.5, and 2 to represent orders from customers with low, medium, and high time sensitivity, respectively. To comprehensively assess the impact of time-sensitive priorities on the waiting strategy, we conduct a sensitivity analysis of different priority combinations. Specifically, we examine four sets of time-sensitive priorities: (0.5, 1, 1.5), (1, 1.5, 2), (1.5, 2, 2.5), and (2, 2.5, 3). The values in parentheses sequentially indicate that the customer’s time sensitivity is low, medium, and high, respectively. The experimental results of four ISGs are shown in Figure 11a–d.
As depicted in Figure 11, the frequency of rerouting increases with higher levels of time-sensitive priority. Optimal results across all objectives are observed in most cases when the time-sensitive priority is set to 1, 1.5, and 2, respectively. However, as the priority grows, all three objectives deteriorate incrementally. Similar to Section 5.4.1, the waiting strategy accumulates fewer orders as the time-sensitive priority rises. Nonetheless, frequent rerouting leads to increased detours and a higher likelihood of order delays. Furthermore, although the rerouting counts and riding cost in ISG1 and ISG2 are fewer when the time-sensitive priority is 0.5, 1, and 1.5, the objectives related to customer satisfaction and delay penalty cost become worse. This outcome occurs because, when the time-sensitive priority is too small, the waiting strategy extends the decision period, leading to a higher number of accumulated orders. Therefore, some urgent orders cannot receive immediate delivery, leading to order delays.

5.4.3. Sensitivity Analysis on Different Proportions of Customers with High Time Sensitivity

The sensitivity analysis in this section is about the change in the total delay time for different types of orders when the proportion of customers with high time sensitivity changes. We analyze the change in the percentage of customers with high time sensitivity from 10% to 50%, with an average distribution of the number of medium and low-sensitive types of customers. The results are shown in Figure 12a–d.
The results in Figure 12 show that, as the percentage of customers with high time sensitivity gradually increases, the total delay time of such customers tends to increase. However, it is lower than the other two categories. In addition, although there has been a gradual decrease in the number of orders from customers with medium and low time sensitivity, there has not been a significant decrease in the total delay time for these orders. This situation arises because, with no change in the number of couriers, an increase in orders from customers with high time sensitivity escalates delivery complexity. To ensure the timely delivery of urgent orders, the system may compromise the punctuality of some medium and low-sensitivity orders.

6. Conclusions

In this paper, we study the MDRP with time-sensitive customers considering the dynamic characteristics of meal orders. A multi-objective optimization model considering customers with time-sensitive heterogeneity is developed to maximize customer time satisfaction and minimize the delay penalty cost and riding cost. To solve the dynamic MDRP, we propose a novel waiting strategy with time-sensitive priority to divide the dynamic problem into a series of static subproblems. For each static meal delivery routing subproblem, the hybrid AGA–ALNS algorithm is developed. We test the effectiveness of the proposed solution method on different scales of dynamic MDRP instance groups. This paper contributes to the existing MDRP research in the following aspects.
From the modelling perspective, taking customers’ time-sensitive heterogeneity into account, we build a multi-objective optimization model combined with three objectives, including customer satisfaction, delay penalty, and riding cost, which determines the sequence of locations visited by couriers in the MDRP. Considering customers’ time-sensitive heterogeneity is meaningful and practical for meal delivery.
From the perspective of the dynamic meal order-processing strategy, we design a waiting strategy with time-sensitive priority to transform the dynamic problem into several static subproblems. The proposed waiting strategy accumulates new orders arriving at the system and uses a decision threshold to determine rerouting time. Furthermore, time-sensitive priority is introduced in the waiting strategy to accelerate assignment and rerouting decisions for orders from customers with high time sensitivity. Experimental analysis shows that the proposed waiting strategy is more suitable for addressing the dynamic MDRP than the rolling horizon method. In addition, the introduction of time-sensitive priority in the waiting strategy can effectively handle the meal orders from time-sensitive customers.
From an algorithmic perspective, we develop a hybrid metaheuristic algorithm, namely AGA–ALNS, to solve the static meal delivery routing subproblem. On the one hand, the EBRSIC operator is employed to enhance the global search capability of the algorithm. On the other hand, ALNS is used as a mutation operator to further improve the algorithm’s local search capability. In addition, adaptive crossover and mutation probabilities are introduced to optimize the algorithm probability settings. By comparing with SA and two GA-based enhanced algorithms in previous studies, our AGA–ALNS algorithm has evident advantages in terms of solution accuracy and computation time.
From a practical perspective, through sensitivity analysis we have derived some managerial implications to provide recommendations for platforms to serve time-sensitive customers. Firstly, as the DT value increases in the waiting strategy, it can effectively improve customer satisfaction and reduce operational costs. However, if the value of the DT is too high, the strategy will accumulate more orders, causing higher delays in the case of insufficient delivery capacity. Similarly, a reasonable setting of time-sensitive priority is also significant for improving the performance of the waiting strategy. When the time-sensitive priority is too large, it will increase the detour cost and decrease the delivery efficiency. Conversely, if the time-sensitive priority is too small, it will extend the decision period, leading to some urgent orders which cannot be delivered immediately. Furthermore, as the number of customers with high time sensitivity gradually increases, it will further increase the delivery complexity, and the strategy will sacrifice the punctuality of orders from customers with medium and low time sensitivity to ensure the timely delivery of high time-sensitive customers.
This study still has some limitations and further research can be conducted in the future. In the meal delivery process, there are several uncertainties affecting route planning, such as meal preparation time and real-time traffic conditions. It is worth exploring these uncertainties comprehensively for future studies, rather than focusing solely on the uncertainty of customers’ ordering time in our current study. In addition, the courier’s shifts considered in our study are certain; however, in the case of meal delivery mostly using crowdsourced couriers, their available time and delivery efficiency are different. Considering the characteristics of couriers in meal delivery is a more realistic scenario.

Author Contributions

Conceptualization, W.W. and S.G.; methodology, W.W. and S.G.; software, S.G.; formal analysis, S.G.; investigation, W.W. and S.G.; writing—original draft preparation, S.G.; writing—review and editing, W.W. and S.G.; visualization, S.G.; supervision, W.W.; funding acquisition, W.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 71872038 and 71832001.

Data Availability Statement

Relative data have been included in the paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. CCFA; NSRC. Report on the Digitization of Chinese Life Services in 2022. Available online: http://www.aliresearch.com/ch/presentation/presentiondetails?articleCode=380527674285756416&type=%E6%8A%A5%E5%91%8A&organName= (accessed on 29 February 2024).
  2. Statista. Online Food Delivery-Worldwide. Available online: https://statista.com/outlook/374/100/online-food-delivery/worldwide (accessed on 29 February 2024).
  3. Meituan. Meituan Q3 2023 Financial Report. Available online: https://www.meituan.com/news/NN231128062002354 (accessed on 29 February 2024).
  4. Afeche, P.; Pavlin, J.M. Optimal Price/Lead-Time Menus for Queues with Customer Choice: Segmentation, Pooling, and Strategic Delay. Manag. Sci. 2016, 62, 2412–2436. [Google Scholar] [CrossRef]
  5. Jin, X.; Li, K.; Sivakumar, A.I. Scheduling and optimal delivery time quotation for customers with time sensitive demand. Int. J. Prod. Econ. 2013, 145, 349–358. [Google Scholar] [CrossRef]
  6. Sainathan, A. Technical Note-Pricing and Prioritization in a Duopoly with Self-Selecting, Heterogeneous, Time-Sensitive Customers Under Low Utilization. Oper. Res. 2020, 68, 1364–1374. [Google Scholar] [CrossRef]
  7. Ulmer, M.W.; Thomas, B.W.; Campbell, A.M.; Woyak, N. The Restaurant Meal Delivery Problem: Dynamic Pickup and Delivery with Deadlines and Random Ready Times. Transp. Sci. 2021, 55, 75–100. [Google Scholar] [CrossRef]
  8. Yildiz, B.; Savelsbergh, M. Provably High-Quality Solutions for the Meal Delivery Routing Problem. Transp. Sci. 2019, 53, 1372–1388. [Google Scholar] [CrossRef]
  9. Iresearch. Report on the Trends of Chinese Immediate Delivery Industry in 2022. Available online: https://www.iresearch.com.cn/Detail/report?id=3964&isfree=0 (accessed on 29 February 2024).
  10. Xue, G.; Wang, Z.; Wang, Y. The restaurant delivery problem with uncertain cooking time and travel time. Comput. Ind. Eng. 2024, 190, 110039. [Google Scholar] [CrossRef]
  11. Bi, H.; Zhu, X.; Lu, F.; Huang, M. The Meal Delivery Routing Problem in E-commerce Platforms under the Shared Logistics Mode. J. Theor. Appl. Electron. Commer. Res. 2023, 18, 1799–1819. [Google Scholar] [CrossRef]
  12. Simoni, M.D.; Winkenbach, M. Crowdsourced on-demand food delivery: An order batching and assignment algorithm. Transp. Res. Pt. C Emerg. Technol. 2023, 149, 104055. [Google Scholar] [CrossRef]
  13. Wang, W.; Jiang, L. Two-Stage Solution for Meal Delivery Routing Optimization on Time-Sensitive Customer Satisfaction. J. Adv. Transp. 2022, 2022, 9711074. [Google Scholar] [CrossRef]
  14. Reyes, D.; Erera, A.; Savelsbergh, M.; Sahasrabudhe, S.; O’Neil, R. The meal delivery routing problem. Optim. Online 2018, 6571, 2018. [Google Scholar]
  15. Liao, W.; Zhang, L.; Wei, Z. Multi-objective green meal delivery routing problem based on a two-stage solution strategy. J. Clean. Prod. 2020, 258, 120627. [Google Scholar] [CrossRef]
  16. Wang, Z. Delivering meals for multiple suppliers: Exclusive or sharing logistics service. Transp. Res. Pt. E Logist. Transp. Rev. 2018, 118, 496–512. [Google Scholar] [CrossRef]
  17. Tu, W.; Zhao, T.; Zhou, B.; Jiang, J.; Xia, J.; Li, Q. OCD: Online crowdsourced delivery for on-demand food. IEEE Internet Things J. 2019, 7, 6842–6854. [Google Scholar] [CrossRef]
  18. Chen, J.; Fan, T.; Gu, Q.; Pan, F. Emerging technology-based online scheduling for instant delivery in the O2O retail era. Electron. Commer. Res. Appl. 2022, 51, 101115. [Google Scholar] [CrossRef]
  19. Hu, Y.; Zhang, P.; Zhao, K.; Zhang, S.; Fan, B. Disruption Recovery for the Pickup and Delivery Problem with Time Windows—A Scenario-based Approach for Online Food Delivery. Comput. Oper. Res. 2023, 159, 106337. [Google Scholar] [CrossRef]
  20. Berbeglia, G.; Cordeau, J.F.; Laporte, G. Dynamic pickup and delivery problems. Eur. J. Oper. Res. 2010, 202, 8–15. [Google Scholar] [CrossRef]
  21. Mitrović-Minić, S.; Laporte, G. Waiting strategies for the dynamic pickup and delivery problem with time windows. Transp. Res. Pt. B Methodol. 2004, 38, 635–655. [Google Scholar] [CrossRef]
  22. Bent, R.W.; Van Hentenryck, P. Scenario-based planning for partially dynamic vehicle routing with stochastic customers. Oper. Res. 2004, 52, 977–987. [Google Scholar] [CrossRef]
  23. Pillac, V.; Guéret, C.; Medaglia, A.L. An event-driven optimization framework for dynamic vehicle routing. Decis. Support Syst. 2012, 54, 414–423. [Google Scholar] [CrossRef]
  24. Park, H.; Son, D.; Koo, B.; Jeong, B. Waiting strategy for the vehicle routing problem with simultaneous pickup and delivery using genetic algorithm. Expert Syst. Appl. 2021, 165, 113959. [Google Scholar] [CrossRef]
  25. Vonolfen, S.; Affenzeller, M. Distribution of waiting time for dynamic pickup and delivery problems. Ann. Oper. Res. 2016, 236, 359–382. [Google Scholar] [CrossRef]
  26. Wong, K.I.; Han, A.F.; Yuen, C.W. On dynamic demand responsive transport services with degree of dynamism. Transp. A 2014, 10, 55–73. [Google Scholar] [CrossRef]
  27. Agius, M.; Absi, N.; Feillet, D.; Garaix, T. A branch-and-price algorithm for a routing problem with inbound and outbound requests. Comput. Oper. Res. 2022, 146, 105896. [Google Scholar] [CrossRef]
  28. Domínguez-Martín, B.; Hernández-Pérez, H.; Riera-Ledesma, J.; Rodríguez-Martín, I. A branch-and-cut algorithm for the one-commodity pickup and delivery location routing problem. Comput. Oper. Res. 2024, 161, 106426. [Google Scholar] [CrossRef]
  29. Goksal, F.P.; Karaoglan, I.; Altiparmak, F. A hybrid discrete particle swarm optimization for vehicle routing problem with simultaneous pickup and delivery. Comput. Ind. Eng. 2013, 65, 39–53. [Google Scholar] [CrossRef]
  30. Tan, Z.; Zhen, L.; Yang, Z.; Liu, L.; Fan, T. Multi-period emergency vehicle fleet redistribution and dispatching. Transp. A 2023, 1–33. [Google Scholar] [CrossRef]
  31. Lu, F.; Chen, W.; Feng, W.; Bi, H. 4PL routing problem using hybrid beetle swarm optimization. Soft Comput. 2023, 27, 17011–17024. [Google Scholar] [CrossRef]
  32. Wang, H.F.; Chen, Y.Y. A genetic algorithm for the simultaneous delivery and pickup problems with time window. Comput. Ind. Eng. 2012, 62, 84–95. [Google Scholar] [CrossRef]
  33. Dubey, N.; Tanksale, A. A Multi-Depot Vehicle Routing Problem with Time Windows, Split Pickup and Split Delivery for Surplus Food Recovery and Redistribution. Expert Syst. Appl. 2023, 232, 120807. [Google Scholar] [CrossRef]
  34. Zarouk, Y.; Mahdavi, I.; Rezaeian, J.; Santos-Arteaga, F.J. A novel multi-objective green vehicle routing and scheduling model with stochastic demand, supply, and variable travel times. Comput. Oper. Res. 2022, 141, 105698. [Google Scholar] [CrossRef]
  35. Bi, H.; Lu, F.; Duan, S.; Huang, M.; Zhu, J.; Liu, M. Two-level principal-agent model for schedule risk control of IT outsourcing project based on genetic algorithm. Eng. Appl. Artif. Intell. 2020, 91, 103584. [Google Scholar] [CrossRef]
  36. Wang, Y.; Ran, L.; Guan, X.; Fan, J.; Sun, Y.; Wang, H. Collaborative multicenter vehicle routing problem with time windows and mixed deliveries and pickups. Expert Syst. Appl. 2022, 197, 116690. [Google Scholar] [CrossRef]
  37. Ren, J.; Jin, W.; Wu, W. Multi-objective optimization for multi-depot heterogeneous first-mile transportation system considering requests’ preference ranks for pick-up stops. Transp. A 2023, 19, 2103205. [Google Scholar] [CrossRef]
  38. Zhan, X.; Szeto, W.Y.; Shui, C.S.; Chen, X.M. A modified artificial bee colony algorithm for the dynamic ride-hailing sharing problem. Transp. Res. Pt. E Logist. Transp. Rev. 2021, 150, 102124. [Google Scholar] [CrossRef]
  39. Zhan, X.; Szeto, W.Y.; Chen, X.M. The dynamic ride-hailing sharing problem with multiple vehicle types and user classes. Transp. Res. Pt. E Logist. Transp. Rev. 2022, 168, 102891. [Google Scholar] [CrossRef]
  40. Ombuki, B.; Ross, B.J.; Hanshar, F. Multi-objective genetic algorithms for vehicle routing problem with time windows. Appl. Intell. 2006, 24, 17–30. [Google Scholar] [CrossRef]
  41. Sajid, M.; Mittal, H.; Pare, S.; Prasad, M. Routing and scheduling optimization for UAV assisted delivery system: A hybrid approach. Appl. Soft. Comput. 2022, 126, 109225. [Google Scholar] [CrossRef]
  42. Ropke, S.; Pisinger, D. An adaptive large neighborhood search heuristic for the pickup and delivery problem with time windows. Transp. Sci. 2006, 40, 455–472. [Google Scholar] [CrossRef]
  43. Tang, L.; D’Ariano, A.; Xu, X.; Li, Y.; Ding, X.; Samà, M. Scheduling local and express trains in suburban rail transit lines: Mixed–integer nonlinear programming and adaptive genetic algorithm. Comput. Oper. Res. 2021, 135, 105436. [Google Scholar] [CrossRef]
  44. AbdAllah, A.M.F.; Essam, D.L.; Sarker, R.A. On solving periodic re-optimization dynamic vehicle routing problems. Appl. Soft. Comput. 2017, 55, 1–12. [Google Scholar] [CrossRef]
  45. Wang, X.; Lin, N.; Li, Y.; Shi, Y.; Ruan, J. An integrated modeling method for collaborative vehicle routing: Facilitating the unmanned micro warehouse pattern in new retail. Expert Syst. Appl. 2021, 168, 114307. [Google Scholar] [CrossRef]
  46. Foroutan, R.; Rezaeian, J.; Mahdavi, I. Green vehicle routing and scheduling problem with heterogeneous fleet including reverse logistics in the form of collecting returned goods. Appl. Soft. Comput. 2020, 94, 106462. [Google Scholar] [CrossRef]
  47. Xie, F.; Chen, Z.; Zhang, Z. Research on Dynamic Takeout Delivery Vehicle Routing Problem under Time-Varying Subdivision Road Network. Mathematics 2024, 12, 962. [Google Scholar] [CrossRef]
Figure 1. Delivery options from Uber Eats.
Figure 1. Delivery options from Uber Eats.
Systems 12 00170 g001
Figure 2. Illustration of the dynamic routing in MDRP.
Figure 2. Illustration of the dynamic routing in MDRP.
Systems 12 00170 g002
Figure 3. Dynamic optimization timeline for solving the MDRP.
Figure 3. Dynamic optimization timeline for solving the MDRP.
Systems 12 00170 g003
Figure 4. Flowchart of the solution method for the dynamic meal delivery routing problem.
Figure 4. Flowchart of the solution method for the dynamic meal delivery routing problem.
Systems 12 00170 g004
Figure 5. The method of chromosome encoding and decoding.
Figure 5. The method of chromosome encoding and decoding.
Systems 12 00170 g005
Figure 6. Select and remove new orders from parent chromosomes.
Figure 6. Select and remove new orders from parent chromosomes.
Systems 12 00170 g006
Figure 7. Crossover operator.
Figure 7. Crossover operator.
Systems 12 00170 g007
Figure 8. Comparison of performance in three strategies.
Figure 8. Comparison of performance in three strategies.
Systems 12 00170 g008
Figure 9. Comparison of order delay time in three strategies.
Figure 9. Comparison of order delay time in three strategies.
Systems 12 00170 g009
Figure 10. Sensitivity analysis on different DT values.
Figure 10. Sensitivity analysis on different DT values.
Systems 12 00170 g010
Figure 11. Sensitivity analysis on the time-sensitive priority in the waiting strategy.
Figure 11. Sensitivity analysis on the time-sensitive priority in the waiting strategy.
Systems 12 00170 g011
Figure 12. Sensitivity analysis on different proportions of customers with high time sensitivity.
Figure 12. Sensitivity analysis on different proportions of customers with high time sensitivity.
Systems 12 00170 g012
Table 1. Comparison of this study with related literature.
Table 1. Comparison of this study with related literature.
ReferenceProblemObjectivesProblem CharacteristicsSolution Methods
CSDPRCTHCDDTWPDMODDPAlgorithm
Ulmer et al. [7]MDRP MDPHeuristic
Yildiz and Savels-bergh [8]MDRP Exact
Bi et al. [11]MDRP IALO
Simoni and Winken-bach [12]MDRP RHHeuristic
Wang and Jiang [13]MDRP GA
Reyes et al. [14]MDRP RHHeuristic
Mitrović-Minić and Laporte [21]DPDP WSTS
Park et al. [24]DPDP WSGA
Vonolfen and Affen-zeller [25]DPDP WSHeuristic
Dubey and Tanksale [33]VRPPDTW GA
Wang et al. [36]VRPPDTW GA–PSO
Zarouk et al. [34]VRPPDTW MOGASA
This workMDRPWSTPAGA–ALNS
Note: CS: Customer satisfaction; DP: Delay penalty cost; RD: Riding cost; THC: Time-sensitive heterogeneity of customers; DD: Dynamic demands; TW: Time window; PD: Pickup and delivery; MO: Multiple objectives; DDP: Dynamic demand process method; RH: Rolling horizon method; MDP: Markov decision process; WS: Waiting strategy; WSTP: Waiting strategy with time-sensitive priority.
Table 2. Notions used in the model.
Table 2. Notions used in the model.
NotionDescription
Set
VSet of all locations on the route network
ASet of all arcs on the route network
NSet of new orders waiting for assignment
USet of orders that have been assigned but not yet completed
WSet that includes all restaurant and customer locations in N
MSet that includes all restaurant and customer locations in U
iMeal order i
i + Restaurant location for order i
i Customer location for order i
KSet of available couriers
C n o w Set of all couriers’ start locations for the current decision period
Parameter
S i Time satisfaction of customer i
N O Total number of meal orders for sets N and U in the current decision period
k ( m ) Matched courier k for m M before any new meal orders are inserted into route
c k Courier k’s start location at the beginning of the current decision period
v Travel speed of the courier’s vehicle
Q The maximum capacity of the vehicle
q i Number of meals in order i
d m n Euclidian distance from location m to n
T b m k Arrive time at m M before new meal orders are inserted into courier k’s route
s m k The service time of courier k at location m
t m n The travel time from location m to n
E i The earliest delivery time of order i
F i The latest delivery time of order i
L i The acceptable delayed delivery time of order i
λ The riding cost per kilometer of the vehicle
α The time-sensitive coefficient
β 1 Delay penalty cost per minute for customers with low time sensitivity
β 2 Delay penalty cost per minute for customers with medium time sensitivity
β 3 Delay penalty cost per minute for customers with high time sensitivity
Decision Variable
T a m k The arrival time of courier k at location m
q m k The current number of meals that courier k is carrying when leaving location m
x m n k Binary decision variable that takes the value of 1 if courier k travels from location m to n, otherwise 0
Table 3. Corresponding parameter setting.
Table 3. Corresponding parameter setting.
NotationDefinitionValue
Model parameters setting
v The speed of courier’s vehicle20 km/h
λ The riding cost per kilometer of the vehicle3.33 CNY/km
β 1 Delay penalty cost per minute for customers with low time sensitivity0.2 CNY/min
β 2 Delay penalty cost per minute for customers with medium time sensitivity0.3 CNY/min
β 3 Delay penalty cost per minute for customers with high time sensitivity0.5 CNY/min
k 1 Weight of customer time satisfaction with heterogeneous time sensitivity0.4
k 2 Weight of delay penalty cost0.4
k 3 Weight of riding cost0.2
Parameters in AGA–ALNS algorithm
P s i z e Population size50
g e n _ m a x Maximum number of iterations100
p c i n i t Initial crossover probability 0.9
p m i n i t Initial mutation probability0.5
δ Reaction factor0.4
σ 1 Score for the new solution that is better than the current solution5
σ 2 Score for the new solution that is worse than the current solution1
Table 4. Setting of experimental instance groups.
Table 4. Setting of experimental instance groups.
IDThe Number of OrdersThe Number of CouriersDT in Waiting Strategy
ISG125510
ISG250816
ISG3751326
ISG41001530
Table 5. Comparison results of algorithms in different instances.
Table 5. Comparison results of algorithms in different instances.
InstancesAGA–ALNSEnhanced GAGA–LNSSA
IDNumber F 1
(%)
F 2
(CNY)
F 3
(CNY)
CT
(s)
F 1
(%)
F 2
(CNY)
F 3
(CNY)
CT
(s)
F 1
(%)
F 2
(CNY)
F 3 (CNY)CT
(s)
F 1
(%)
F 2
(CNY)
F 3 (CNY)CT
(s)
ISG1193.465.19157.118.9291.339.35167.5339.0790.976.7169.9629.2683.1219.43189.1632.13
294.544.66165.1316.1393.15.01169.7639.2192.546.01167.5128.7783.8428.55185.7134.96
396.582.68164.9913.7794.735.05165.0841.8694.764.24165.7931.8693.854.85183.3731.91
493.75.54176.4315.9989.798.59180.5738.1489.9911.5177.2431.5281.420.22215.5546.43
596.353.17182.7713.9595.473.79186.4230.1687.8210.3184.9529.4479.6132.69191.0138.8
ISG2695.896.4289.6446.8994.598.54289.72111.2790.7420.37274.4877.8473.8790.87362.1873.23
791.7216.6323.6264.0289.3521.9324.38106.6684.2733.77310.1373.5375.44105.72348.2471.69
896.445.79291.6834.9292.6412.59292.22139.0690.2318.37280.7676.7682.0154.07334.0773.55
995.88.27300.8853.559019.13308.1109.9183.8127.85287.1882.6277.8474.38344.9672.09
1089.216.76278.6247.5487.7721.3282.31127.1884.3334.01280.6476.0773.1498.33381.9471.04
ISG31197.536.1432.61112.594.3512.57448.51340.1688.9731.62400.65119.6687.1750.87537.66186.23
1288.630.97476.69115.886.336.59495.2292.1778.2652.37439.51119.4575.3989.41598.63185.49
1396.249.34474.5611791.6125.49478.52339.2389.3434.61431.87122.5880.0184.39575.44188.55
1497.476.81409.3149.395.0211.34411.77378.7393.4420.49374.48137.4284.4147.69524.43187.46
1597.049.32416.79105.494.0912.58428.97356.4592.2418.86417.14118.6184.0769.77551.71188.66
ISG41691.6129.93568.09237.587.9442.46585.82445.8382.5664.42516.92175.7368.72195.39745.17241.83
1791.5428.07540.26179.188.3139.71576.47422.7884.866.45504.25167.2365.73275.37821.62230.36
1893.3820.78546.13287.487.742.9555.8522.1285.3159.91510.74170.8968.27200.08738.57229.89
1988.9935.31519.37181.286.9744.91532.51513.6984.557.61504.7191.6867.72221.73770.44226.33
2093.3821.16550.23271.291.1929.11583.72506.5987.8747.06512.52168.7269.52227.23771.32227.95
Average93.9713.64363.24103.691.1120.65373.17245.0187.8431.33345.57101.4877.7699.55468.55131.93
Note: Bold shows the optimal results of the three algorithms for the same case; F 1 shows the results for customer satisfaction; F 2 represents the results for delay penalty cost; F 3 indicates the results for riding cost; and CT is the computation time of the algorithms.
Table 6. Optimization routes generated by the proposed solution method.
Table 6. Optimization routes generated by the proposed solution method.
Decision PeriodOrdersCourier IDStarting Locations Traveled RoutesPlanned Routes
(0, 9)1, 2, 3, 4, 5, 6, 7, 81InitialR8*C8*
2InitialR6*R5*C6*C5*
3InitialR7*R1*R2*C2*C1*C7*
4InitialR4*R3*C4*C3*
5Initial
(9, 16)9, 10, 11, 12, 13, 14, 15, 161C8R8→C8R9*C9*R15*C15*
2R5R6R5→R10*→C6→C10*→C5
3R1R7R1→R2→R14*→C2→C1→C7→C14*
4R3R4R3→R13*C13*→C4→C3
5InitialR11*R16*R12*C11*C16*C12*
(16, 28)17, 18, 19, 20, 21, 22, 231R15R9→C9R15→C15→R22*R21*C21*C22*
2C10R5→R10→C6C10→R17*→C5→C17*R23*C23*
3C7R1→R2→R14→C2→C1C7→C14→R19*R20*C19*C20*
4C3R3→R13→C13→C4C3→R18*C18*
5R16R11R16→R12→C11→C16→C12
(28, 30)24, 251R15R15→C15→R22→R21→C21→C22→R24*C24*
2R17C10R17→C5→C17→R23→C23
3C14C7C14→R19→R20→C19→C20
4C3C3→R18→C18→R25*C25*
5C11R16→R12C11→C16→C12
Final delivery routes1R8→C8→R9→C9→R15→C15→R22→R21→C21→C22→R24→C24
2R6→R5→R10→C6→C10→R17→C5→C17→R23→C23
3R7→R1→R2→R14→C2→C1→C7→C14→R19→R20→C19→C20
4R4→R3→R13→C13→C4→C3→R18→C18→R25→C25
5R11→R16→R12→C11→C16→C12
Note: R and C represent the restaurant and customer node corresponding to the same order; R* and C* represent the orders generated in the current decision period; Starting locations indicate the initial locations of couriers in each decision period; Traveled routes represent the completion of the routes generated in the last decision period; Planned routes represent the delivery routes reoptimized in the current decision period.
Table 7. Performance comparison of different strategies.
Table 7. Performance comparison of different strategies.
WSTPWSRH
IDNumber F 1
(%)
F 2
(CNY)
F 3
(CNY)
F 1
(%)
F 2
(CNY)
F 3
(CNY)
F 1
(%)
F 2
(CNY)
F 3
(CNY)
ISG1193.465.19157.1194.444.49163.9793.834.86181.11
294.544.66165.1394.495.36163.7189.3117.81195.39
396.582.68164.9995.543.47159.01925.42169.34
493.75.54176.4390.696.77170.3487.729.82192.32
596.353.17182.7794.814.56175.9194.265.28187.26
ISG2695.896.4289.6494.179.37284.793.2912.65295.42
791.7216.6323.6290.715.73294.2685.8130.3322.4
896.445.79291.6894.389.48275.391.9315.75310.37
995.88.27300.8893.1314.19297.1184.7731.99311.3
1089.216.76278.6288.4522.6304.1984.430.19314.75
ISG31197.536.1432.6196.557.49432.2890.2125.64458.56
1288.630.97476.6988.2831.62493.6782.4152.21538.34
1396.249.34474.5695.4615.3468.6290.6231.06482.47
1497.476.81409.396.537.81421.2294.8415.5454.87
1597.049.32416.7995.5410.6142592.8719.42459.57
ISG41691.6129.93568.0990.535.07578.8186.6848.06606.87
1791.5428.07540.269131589.4589.2336.94577.01
1893.3820.78546.1391.7330.94560.9189.7232.05568.82
1988.9935.31519.3789.0936.93578.7886.8949.49577.66
2093.3821.16550.2392.7621.49580.2889.7532.05577.96
Average93.9713.64363.2492.9116.21370.8889.5225.32389.09
Note: WSTP represents waiting strategy with time-sensitive priority; WS represents the waiting strategy that does not include time-sensitive priority; RH represents the rolling horizon method; bold shows the optimal results of the three strategies for the same case; F 1 shows the results for customer time satisfaction; F 2 represents the results for delay penalty cost; F 3 indicates the results for riding cost.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, W.; Gao, S. Waiting Strategy for the Dynamic Meal Delivery Routing Problem with Time-Sensitive Customers Using a Hybrid Adaptive Genetic Algorithm and Adaptive Large Neighborhood Search Algorithm. Systems 2024, 12, 170. https://doi.org/10.3390/systems12050170

AMA Style

Wang W, Gao S. Waiting Strategy for the Dynamic Meal Delivery Routing Problem with Time-Sensitive Customers Using a Hybrid Adaptive Genetic Algorithm and Adaptive Large Neighborhood Search Algorithm. Systems. 2024; 12(5):170. https://doi.org/10.3390/systems12050170

Chicago/Turabian Style

Wang, Wenjie, and Shen Gao. 2024. "Waiting Strategy for the Dynamic Meal Delivery Routing Problem with Time-Sensitive Customers Using a Hybrid Adaptive Genetic Algorithm and Adaptive Large Neighborhood Search Algorithm" Systems 12, no. 5: 170. https://doi.org/10.3390/systems12050170

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop