1. Introduction
The rapid evolution of e-commerce has propelled the instant retail model into the spotlight, fueled by consumer demand for unprecedented speed and convenience. This trend is powerfully illustrated by the Chinese market. According to a recent report by the Chinese Academy of International Trade and Economic Cooperation, China’s instant retail market surged to 650 billion yuan in 2023, marking a remarkable year-on-year growth of 28.89% [
1]. This expansion is not confined to urban centers; the market in county-level areas alone reached 150 billion yuan, underscoring its deep geographical penetration. The same report forecasts the total market to exceed 2 trillion yuan by 2030, signaling its sustained strategic importance. While these figures highlight the scale of the Chinese market, the logistical challenges are global; major urban centers from New York to London are facing similar bottlenecks where traditional fuel-powered fleets intensify traffic congestion and carbon emissions, hindering the long-term environmental sustainability of urban distribution.
Concurrent with this market explosion, drone-assisted delivery has emerged as a compelling solution, driven by robust governmental tailwinds. China has emerged as an ideal primary testbed for pioneering drone-assisted delivery networks, driven by robust governmental tailwinds. The development of the “low-altitude economy” was first articulated in China’s National Comprehensive Transportation Network Plan [
2]. This high-level vision has since been translated into tangible, industry-specific standards [
3] and elevated to a strategic emerging industry [
4]. This convergence of market-driven demand and robust policy support has created an unprecedentedly fertile ground for new delivery technologies. The strategic importance of the low-altitude economy has been further solidified by recent top-level policy directives. In July 2024, a key decision [
5] mandated the development of the “low-altitude economy.” This was reinforced in the March 2025 Government Work Report, which called for promoting the “safe and healthy development” of emerging industries, specifically naming the low-altitude economy as a component of the nation’s drive to cultivate new quality productive forces [
6].
This proactive domestic environment aligns with a broader international movement toward standardizing drone operations. Regulatory frameworks established by the FAA in the United States and the EASA in the European Union are creating enabling environments for commercial UAS adoption. Furthermore, the rapid pace of technological iteration, recently accelerated by the high-frequency technical cycles observed in modern geopolitical conflicts, has significantly enhanced the accessibility of drone hardware, although their sustainable integration remains contingent on navigating complex regional compliance and pilot-related costs.
In this context, drone delivery has emerged as a particularly compelling solution, especially for the pre-warehouse operational model. The pre-warehouse model, designed for rapid fulfillment, is inherently constrained by a limited radius. The unique characteristics of drones—high-speed, point-to-point transit, and a naturally circular flight range—make them ideally suited to not only serve customers efficiently within this area but also to extend its operational frontier. Moreover, as electric-powered vehicles, drones offer a significant reduction in the carbon footprint per delivery compared to ground vehicles, while their ability to bypass road networks mitigates urban congestion and the associated energy waste from vehicle idling.
However, drones are not a panacea. Their widespread adoption is hindered by inherent limitations such as limited payload capacity and battery endurance, necessitating that a significant portion of deliveries still rely on traditional ground vehicles. This complementary nature of drones and trucks logically points towards a collaborative delivery system. Recognizing this potential, industry giants [
7,
8] are actively experimenting with hybrid truck–drone delivery models, signaling a clear industry trend towards such synergistic solutions. From an economic sustainability perspective, such hybrid systems allow for the optimized allocation of diverse carrier resources, ensuring long-term operational viability by balancing high-cost truck routes with low-cost drone sorties.
The challenge, then, for both academics and practitioners, is to determine the most effective collaborative configuration. The academic literature has formalized these emerging concepts into several distinct operational models, primarily categorized into four approaches: Flying sidekick traveling salesman problem (FSTSP), the parallel mode, the vehicle-supported mode, and the drone-supported vehicle mode [
9,
10]. The suitability of each model is highly dependent on the specific operational context. Previous research suggests that for scenarios where customers are distributed radially around a central depot—a geographical layout that closely mirrors the pre-warehouse model—the parallel mode, as conceptualized in the Parallel Drone Scheduling Vehicle Routing Problem (PDSVRP), often yields superior performance in terms of delivery time and resource utilization [
11].
The evolution of PDSVRP research can be understood as a progression through three distinct, yet overlapping, streams aimed at enhancing model realism and applicability.
Early foundational work focused primarily on the fundamental challenge of scaling the operational fleet. The initial single-truck, single-drone models [
12] were quickly extended to accommodate multiple drones. This included the development of integrated scheduling frameworks to coordinate synchronized operations between heterogeneous fleets [
13]. Subsequent advancements further increased fleet complexity by introducing the collective drone concept, which allows multiple UAVs to synchronize their movements to enhance lifting capabilities and overcome the payload limitations of independent sorties [
14]. While crucial in establishing the problem’s scalability, these models [
13,
14] often operated under simplified assumptions, such as unlimited truck capacity and service range, that overlooked the variable costs and energy consumption rates critical for economic and environmental sustainability.
A second, more pragmatic research stream emerged to address these simplifications by incorporating critical operational constraints. A significant advancement was the integration of customer time windows, which transformed the problem from a simple scheduling task into a time-constrained logistics challenge [
13]. Equally critical has been the shift toward realistic drone endurance modeling. While early studies treated flight range as a constant, the recent literature emphasizes the fundamental tradeoff among battery capacity, payload weight, and flight time. Because heavier parcels non-linearly accelerate battery discharge, ignoring this payload-energy dynamic can lead to physically infeasible routing solutions [
15,
16]. To account for this, researchers have begun incorporating payload-dependent energy consumption models or proposing infrastructural workarounds, such as the strategic placement of drone charging and battery swapping facilities [
17]. Beyond drone-specific constraints, the broader PDSVRP framework has transitioned from a simplified TSP-based structure to a more comprehensive VRP formulation. By explicitly incorporating finite vehicle capacity and maximum tour duration, these models significantly boost the practical relevance of air–ground systems for resource-efficient urban distribution [
18,
19,
20].
Concurrently, a third stream has broadened the problem’s scope and objectives. The traditional single-objective focus on minimizing makespan has been expanded to multi-objective frameworks that also consider total system cost [
21]. The strict separation of parallel and collaborative modes has been challenged by hybrid models [
19], while the introduction of dynamism and lead-time constraints has adapted the problem for same-day delivery contexts [
22]. This line of work, which even explores multimodal fleets including electric and autonomous vehicles [
23], demonstrates the PDSVRP’s evolution into a highly flexible and powerful tool for modeling diverse, modern logistics systems.
The literature on PDSVRP, while technically evolving, has yet to fully address two critical gaps in the highly time-sensitive context of pre-warehouse-based instant retail. First, most existing models treat customers as a homogeneous group, neglecting the strategic requirement for customer prioritization. In sustainable logistics management, failing to safeguard service levels for high-value customers can lead to asset attrition, thereby undermining the provider’s long-term economic stability. Second, current studies on pre-warehouse last-mile delivery remain predominantly centered on traditional ground-based transportation. This leaves significant room for optimizing distribution through hybrid systems, as the potential for drones to bypass urban congestion and guarantee rapid fulfillment for premium orders remains largely under-explored.
To bridge these gaps, this paper introduces the PDSVRP from a Pre-Warehouse with Priorities (PW-PDSVRP-P). This new model extends the classical PDSVRP by incorporating two novel features: (1) A comprehensive objective function rooted in the customer pyramid theory, which dynamically weighs priority-based costs for delayed service against the operational expenses of heterogeneous fleets. Unlike traditional hard time windows, early arrivals incur no penalty, but any delay beyond the promised deadline accumulates a penalty cost scaled by the customer’s priority tier. This mechanism ensures that high-margin or time-critical orders are prioritized via rapid drone dispatch when necessary. (2) A more realistic constraint set capturing both customer time windows and differentiated operational ranges for trucks and drones.
To solve this complex model, we propose a highly effective hybrid LNS algorithm as our primary solution methodology. Its performance is benchmarked against the Gurobi solver 10.0.1 and SSIRs. Through extensive computational experiments, we validate the effectiveness of our model and algorithms, providing insights into how a prioritized truck–drone system fosters sustainable last-mile logistics by balancing operational costs and service-level objectives.
The remainder of this paper is organized as follows:
Section 2 presents the mathematical formulation,
Section 3 details the solution methodologies,
Section 4 discusses the experimental results, and
Section 5 concludes the paper.
3. Algorithm Design
3.1. The LNS Algorithmic Framework
The framework operates on a master–slave architecture where the LNS functions as a high-level allocator, iteratively refining the partition of customers between the truck and drone fleets. The solution methodology is developed as a hybrid metaheuristic utilizing a master–slave architecture. The high-level master framework employs Large Neighborhood Search (LNS), which is fundamentally a metaheuristic characterized by its iterative evolution through successive “Destroy and Recreate” cycles to explore the global customer allocation space and escape local optima. In contrast, the low-level sub-solvers (SolveVRP and SolvePMS) are categorized as classical constructive heuristics. These slave algorithms function as non-iterative, single-pass evaluation engines; they do not involve internal iterative optimization but instead provide the computational speed required to assess the master-level allocation schemes rapidly.
This model is strictly a static optimization framework, serving as a tactical pre-planning tool for pre-warehouse logistics. All routing and assignment decisions are finalized prior to the execution phase. Consequently, the current scope is bounded to a deterministic environment where all delivery tasks are assumed to be successfully completed. Therefore, the algorithm does not support online adaptation or real-time re-optimization during active tasks, and explicit contingency rules for failed sorties or task re-assignments are excluded from this study.
As shown in the initialization phase, the process begins by receiving the initial customer sets and invoking the slave heuristics to establish a baseline performance. This initial state is registered as both the current operating solution () and the global best solution (), providing a rigorous benchmark for the subsequent iterative search.
Upon entering the master loop the metaheuristic begins its exploration by applying significant structural modifications to the solution configuration. This is achieved through the LNSSearch function, which corresponds to the “Destroy and Recreate” block. By deconstructing the current customer allocation and re-inserting nodes into alternate fleets using the greedy insertion logic, the algorithm can transcend the boundaries of local minima that typically trap standard local search methods. A critical aspect of this architecture is the master–slave interaction occurring in lines 8–10 of Algorithm 1. Once the LNS master proposes a candidate allocation (
), the framework delegates the precise routing and cost calculation to the slave solvers. This feedback loop is indispensable because the master level only manages set membership, while the true objective value is contingent upon the optimized schedules produced by the sub-problem solvers, ensuring the search is always guided by accurate data.
| Algorithm 1: LNS Main Framework |
|
|
| 1 | |
| 2 | |
| 3 | |
| 4 | |
| 5 | |
| 6 | do |
| 7 |
|
| 8 |
|
| 9 |
|
| 10 |
|
| 11 |
then |
| 12 |
|
| 13 |
then |
| 14 |
|
| 15 |
End if |
| 16 |
End if |
| 17 | End for |
| 18 | |
| 19 | |
| 20 | |
| 21 | |
The search trajectory is governed by a hierarchical acceptance mechanism, represented by the decision diamonds in the flowchart and the conditional statements. If the candidate solution demonstrates a cost improvement over the current state (), it is accepted as the new current solution to serve as the starting point for the next iteration. Simultaneously, the algorithm compares the new state against the global best; if a superior configuration is identified, is updated to preserve the most efficient configuration encountered during the search. This iterative process concludes when the maximum iteration count () is reached. To ensure the final output attains the highest possible precision, the framework performs a final execution of the SolveVRP and SolvePMS solvers on the best-found allocation. This finalization step yields the optimized vehicle routes (), drone schedules (), and the minimum total cost (), which constitute the definitive output of the PW-PDSVRP-P solution framework.
3.2. LNS Operators and Marginal Cost Evaluation
The efficacy of the LNS depends on a strategic balance between diversification and intensification, a process visually conceptualized in
Figure 2 and formally defined in Algorithm 2. In each iteration, the search executes a structural reconfiguration of the current solution by deconstructing the existing customer allocation and rebuilding it through a cost-driven competitive mechanism.
As illustrated in
Figure 2 and Algorithm 2, the procedure initiates a Destroy phase by identifying the set of shared customers,
, who are eligible for both delivery modes. Based on a predefined destruction rate
, the Random Destroy operator selects a subset of these nodes to be moved into a “removal pool,” effectively stripping them from their current truck routes or drone sorties. This unbiased deconstruction is critical for promoting broad exploration, as it breaks the existing local topology and allows the algorithm to jump to distant, potentially superior regions of the solution space.
| Algorithm 2: LNS Iteration |
| ) |
|
|
| 1 | |
| 2 | |
| 3 | |
| 4 | |
| 5 | |
| 6 | |
| 7 | ) |
| 8 | ) |
| 9 | do |
| 11 |
|
| 12 |
|
| 13 |
then |
| 14 |
|
| 15 |
else |
| 16 |
|
| 17 |
End if |
| 18 | End for |
| 19 | |
| 20 | End function |
In the subsequent Recreate phase, the algorithm reconstructs the solution by re-inserting the customers from the pool. To prevent systematic bias in the reconstruction, the pool is first shuffled to randomize the insertion order. The reconstruction logic is governed by a Competitive Greedy Insertion strategy, where each customer is evaluated for two competing delivery modes. For the truck fleet, the algorithm identifies the optimal feasible insertion point between consecutive nodes in the existing routes, accounting for the additional travel distance and the resulting temporal shift that affects the tardiness penalties of all downstream customers. Simultaneously, the marginal cost of drone assignment is calculated by evaluating whether the customer should be integrated into an existing drone’s scheduled sequence of sorties or requires the deployment of a new virtual drone.
This dual-operator evaluation ensures that the solution is reconstructed with high intensification, focusing on immediate cost minimization. The framework maintains high computational efficiency by using sub-problem solvers to establish a temporary routing baseline () and then applying incremental delta-evaluations to calculate marginal costs rather than re-solving the entire VRP or PMS for every candidate position. As depicted in the transition to the “Repaired Solution”, each customer is finally assigned to the mode and position that yields the minimum aggregate cost increase. This iterative “Destroy and Recreate” cycle repeats until the termination criteria are met, producing a candidate allocation that optimally balances delivery efficiency with the strategic priorities of the customer pyramid.
3.3. Core Subroutines: VRPTW and PMSP Solvers
The efficacy of our LNS frameworks relies on the rapid and accurate evaluation of solutions. To achieve this, we developed specialized heuristic solvers for the two embedded subproblems: a VRPTW solver for the truck fleet and a PMSP solver for the drone fleet. These solvers function as the core “cost-engine” of our algorithms, enabling both the incremental evaluation during the recreate step and the final cost calculation of candidate solutions.
3.3.1. VRPTW Subproblem Solver
To address the CVRPTW for the truck fleet, we employ a classic and computationally efficient two-stage heuristic approach: Giant Tour Construction followed by Route Splitting. This methodology allows for the rapid generation of high-quality, feasible solutions from any given set of truck-assigned customers.
The first stage, Giant Tour Construction, aims to generate a promising preliminary ordering of the customers. For this, we utilize a Greedy Construction Heuristic that implements a nearest-neighbor strategy. The procedure begins with a random customer and iteratively appends the geographically closest unvisited customer to the end of the sequence. The output of this stage is a single, ordered list of all truck customers, representing a “giant tour” that does not yet consider vehicle constraints.
The second stage, Route Splitting & Evaluation, is a deterministic decoding procedure. It takes the giant tour sequence as input and partitions it into a set of feasible vehicle routes. The algorithm processes the customer sequence in the given order, iteratively adding customers to the current truck route as long as doing so violates neither the truck’s capacity
nor the maximum tour duration
. Whenever a customer cannot be added, the current route is finalized by returning to the depot, and a new route is initiated with that customer. During this process, all costs—including fixed vehicle deployment costs, variable travel costs, and any tardiness penalties incurred—are accumulated. The final output is a complete set of feasible truck routes and their total aggregated cost. The entire two-stage process is visually summarized in
Figure 3 and Algorithm 3.
| Algorithm 3: SolveVRP |
|
|
|
| 1 | |
| 2 | |
| 3 | do |
| 4 |
do |
| 5 |
|
| 6 |
|
| 7 |
do |
| 8 |
|
| 9 |
|
| 10 |
then |
| 11 |
|
| 12 |
|
| 13 |
Else |
| 14 |
|
| 15 |
|
| 16 |
|
| 17 |
|
| 18 |
End if |
| 19 |
End for |
| 20 |
|
| 21 |
|
| 22 |
|
| 23 |
End if |
| 24 |
End for |
| 25 | Update particle positions and velocities based on gLine and pBest |
| 26 | End for |
| 27 | |
| 28 | End Function |
3.3.2. IPMS Subproblem Solver
To solve the PMSP subproblem, we developed a fast and effective heuristic that combines customer prioritization with a load-balancing assignment strategy, as illustrated in
Figure 4 and Algorithm 4.
First, all customers assigned to the drone fleet are sorted in ascending order, primarily by their priority class and secondarily by their delivery deadline, ensuring that critical and time-sensitive orders are considered first. The algorithm then estimates the minimum required drone fleet size by dividing the total mission flight time by the maximum endurance per drone.
The core of the heuristic is an intelligent Round-Robin assignment. The algorithm iterates through the prioritized customer list, assigning customers to the estimated drone fleet in a cyclical fashion to evenly distribute the workload. For each assignment, feasibility is checked against the drone’s cumulative flight time. If an assignment is infeasible, a fallback strategy is enacted: the algorithm first searches for any other active drone that can accommodate the mission. If none is found and the fleet limit has not been reached, a new drone is activated. This procedure returns a set of high-quality, well-balanced drone schedules.
| Algorithm 4: SolvePMS |
|
|
| |
| 1 | |
| 2 | |
| 3 | |
| 4 | |
| 5 | do |
| 6 |
|
| 7 |
|
| 8 |
|
| 9 |
then |
| 10 |
|
| 11 |
|
| 12 |
Else |
| 13 |
|
| 14 |
then |
| 15 |
|
| 16 |
then |
| 17 |
|
| 18 |
|
| 19 |
End if |
| 20 |
End if |
| 21 | End for |
| 22 | |
| 23 | |
| 24 | End Function |
5. Conclusions
5.1. Key Findings & Managerial Implications
This study introduces the PW-PDSVRP-P Model and develops a highly effective hybrid Large Neighborhood Search (LNS) algorithm to solve it. By comparing our approach against the exact solver Gurobi and traditional heuristics (e.g., SSIRs), we demonstrate that the proposed hybrid LNS offers a computationally efficient and superior tool for large-scale operational planning. Building upon this methodological foundation, our computational analysis reveals two primary operational findings. First, we identify a clear “economic advantage zone” for drone adoption: the collaborative truck–drone model can reduce total costs by up to 11.3% compared to truck-only systems, though this advantage diminishes as UAV operational costs surpass a critical break-even point. Second, the model exhibits an adaptive assignment strategy, dynamically shifting the role of drones from a high-volume delivery mechanism in low-cost scenarios to a targeted routing optimization tool when operational costs are high. Finally, although the computational experiments in this study were contextualized within specific urban landscapes in China, the proposed PW-PDSVRP-P model and the hybrid LNS algorithm serve as a robust, universally applicable methodology.
Returning to the specific operational insights, the identification of a critical break-even point offers a strategic guide for fleet investment, particularly when viewed through the lens of logistics governance. Methodologically, our PW-PDSVRP-P model abstracts the complexities of drone operations into a specific operational cost parameter. While mathematically straightforward, this abstraction provides a highly practical mechanism to quantify the economic burden of regulatory compliance. As aviation authorities such as the FAA and EASA establish regulatory frameworks for Beyond Visual Line of Sight (BVLOS) operations, compliance becomes a direct operational variable. Requirements like mandatory third-party airspace monitoring, specific pilot-to-aircraft ratios, or geofencing constraints ultimately manifest as elevated fixed or unit costs per drone dispatch. By adjusting this single operational cost parameter within our model, managers can effectively evaluate how varying degrees of regulatory stringency impact the break-even point.
At the operational level, the model’s adaptive assignment logic demonstrates that dispatching decisions should be highly dynamic and cost-dependent. When drone OPEX is relatively low, drones function efficiently as a high-volume delivery mechanism to relieve the truck’s payload. Conversely, as drone operating costs rise, their role should shift to a targeted routing optimization tool. In this scenario, dispatchers can utilize the algorithm to assign drones exclusively to geographically isolated nodes, thereby preventing the primary ground vehicle from making costly and time-consuming detours.
Beyond cost optimization, the integration of the customer pyramid concept provides a tangible framework for service differentiation and social sustainability. In real-world logistics, time windows and vehicle capacities are strictly constrained. The priority-weighted penalty mechanism allows operators to translate these constraints into tiered Service Level Agreements (SLAs). By prioritizing drone dispatches to bypass ground congestion, companies can reliably fulfill urgent, high-margin requests, such as medical supplies or premium e-commerce orders. Even during peak demand periods where some nodes may experience delays, the algorithm inherently protects the service levels of the most valuable customer segments. This ensures resource allocation is both economically viable and aligned with broader social responsibility goals, allowing operators to justify premium pricing models while maintaining critical service standards.
5.2. Limitations and Future Research
This study opens several avenues for future research. While the proposed model provides a foundational strategic framework, it relies on several simplifying assumptions—namely deterministic travel times, constant speeds, homogeneous fleets, and negligible service times. These simplifications, though necessary to ensure computational tractability and consistent with the established PDSVRP literature [
18,
20], may limit the immediate realism of the results. A critical spatial limitation of the current PW-PDSVRP-P model is its reliance on a 2D topographical framework, assuming unobstructed flight paths suitable for flat terrains. Extending this model to integrate 3D spatial modeling and geographic information systems (GIS) will be critical to navigate complex topographies (e.g., mountains) and urban obstacles (e.g., high-rises, no-fly zones).
Furthermore, sustainability is currently evaluated through distance-based cost metrics. Future extensions should integrate high-fidelity, payload-dependent energy consumption models to capture the non-linear impact of battery degradation and payload weight on drone endurance, as well as explicit emissions data for trucks. Finally, successfully scaling this collaborative model requires addressing specific logistics governance constraints.