Next Article in Journal
Advancing Hospital Sustainability: A Multidimensional Index Integrating ESG and Digital Transformation
Previous Article in Journal
Performance Evaluation of a 140 kW Rooftop Grid-Connected Solar PV System in West Virginia
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Coordinating V2V Energy Sharing for Electric Fleets via Multi-Granularity Modeling and Dynamic Spatiotemporal Matching

1
School of Mechanical Engineering, Beijing Institute of Technology, Beijing 100081, China
2
National Key Laboratory of Multi-Perch Vehicle Driving Systems, Beijing Institute of Technology, Beijing 100081, China
*
Authors to whom correspondence should be addressed.
Sustainability 2025, 17(19), 8783; https://doi.org/10.3390/su17198783
Submission received: 31 July 2025 / Revised: 4 September 2025 / Accepted: 15 September 2025 / Published: 30 September 2025
(This article belongs to the Section Sustainable Transportation)

Abstract

The increasing adoption of electric delivery fleets introduces significant challenges related to uneven energy utilization and suboptimal scheduling efficiency. Vehicle-to-Vehicle (V2V) energy sharing presents a promising solution, but its effectiveness critically depends on precise matching and co-optimization within dynamic urban traffic environments. This paper proposes a hierarchical optimization framework to minimize total fleet operational costs, incorporating a comprehensive analysis that includes battery degradation. The core innovation of the framework lies in coupling high-level path planning with low-level real-time speed control. First, a high-fidelity energy consumption surrogate model is constructed through model predictive control simulations, incorporating vehicle dynamics and signal phase and timing information. Second, the spatiotemporal longest common subsequence algorithm is employed to match the spatio-temporal trajectories of energy-provider and energy-consumer vehicles. A battery aging model is integrated to quantify the long-term costs associated with different operational strategies. Finally, a multi-objective particle swarm optimization algorithm, integrated with MPC, co-optimizes the rendezvous paths and speed profiles. In a case study based on a logistics network, simulation results demonstrate that, compared to the conventional station-based charging mode, the proposed V2V framework reduces total fleet operational costs by a net 12.5% and total energy consumption by 17.4% while increasing the energy utilization efficiency of EV-Ps by 21.4%. This net saving is achieved even though the V2V strategy incurs a marginal increase in battery aging costs, which is overwhelmingly offset by substantial savings in logistical efficiency. This study provides an efficient and economical solution for the dynamic energy management of electric fleets under realistic traffic conditions, contributing to a more sustainable and resilient urban logistics ecosystem.

1. Introduction and Literature Review

1.1. Introduction

Driven by global ‘dual-carbon’ goals, the electrification of the transportation sector is advancing at an unprecedented pace. In urban logistics, electric delivery fleets have become a cornerstone of green, low-carbon transport. However, these fleets face unique operational challenges. Due to factors such as task assignments, travel routes, dynamic traffic conditions, and varying initial states of charge, significant energy utilization imbalances often arise within a fleet. Some vehicles, termed electric vehicle-providers (EV-Ps), complete their tasks with substantial surplus energy, while others, electric vehicle-consumers (EV-Cs), risk premature energy depletion and face ‘range anxiety’.
The traditional solution involves directing energy-deficient vehicles to fixed charging stations (CSs). This approach, however, has numerous drawbacks, including suboptimal station distribution, limited charger availability, and long queuing times during peak hours. When a vehicle’s energy level becomes critical far from a suitable station, the resulting detour not only consumes extra energy due to complex road conditions but also causes severe task delays, increasing time-window penalty costs and degrading overall fleet efficiency. Ultimately, the challenges facing electric fleets are deeply rooted in the three pillars of sustainability. Environmentally, the goal is to minimize energy consumption and reduce the carbon footprint per delivery. Economically, fleet operators must reduce operational costs—including energy, time penalties, and long-term asset depreciation like battery aging—to ensure financial viability. Socially and operationally, enhancing the efficiency of logistics reduces traffic congestion and lessens the strain on public charging infrastructure, contributing to a more resilient and functional urban system. To address these challenges, Vehicle-to-Vehicle (V2V) energy sharing has emerged as a novel, distributed energy interaction paradigm [1]. However, achieving efficient V2V energy sharing is non-trivial. It demands a solution to a complex, multi-faceted optimization problem that traditional research has not fully addressed [2].

1.2. Literature Review

In recent years, V2V energy sharing technology has become a research hotspot in intelligent transportation and energy management [3]. Existing studies primarily focus on supply–demand matching mechanisms, path planning and scheduling optimization [4].
The primary step in achieving efficient energy sharing lies in accurately identifying and matching potential supply-and-demand pairs. Hosseini et al. [5] proposed a V2V supply and demand matching strategy based on the Hungarian algorithm, which realizes the dynamic matching of the minimum cost under the distributed fog computing architecture by constructing a bipartite graph model of energy supply vehicles and energy demand vehicles. For the first time, this strategy integrates the cost of electricity, driving, and time waiting, and it provides a low-cost distributed energy trading solution for the Internet of Vehicles. Shurrab et al. [1] proposed a two-layer stable matching framework, the first layer of which uses Gale–Shapley game to minimize the cost of generation and maximize the profit of stable pairing, to solve the problem of individual conflict of interest in traditional matching. The second layer innovatively introduces the user satisfaction model, which screens the pairs with high performance willingness through satisfaction ranking, and increases the transaction success rate by 25–40%, overcoming the industry pain point of high default rate caused by low satisfaction. Zhang et al. [6] proposed a V2V dynamic matching protocol based on bipartite graphs, which optimizes social welfare through the maximum weight algorithm and introduces stable matching to solve individual rational problems. Wang et al. [7] proposed a Spatio-temporal coordinated V2V exchange strategy, which realizes supply and demand matching through Mixed-Integer Nonlinear Programming (MINLP) and Branch Cutting External Approximation Algorithm (BCBOA), and incorporates the waiting time of traffic lights into the energy consumption model for the first time, which solves the problem of charging overload in suburban scenarios. Liu et al. [8] proposed a dynamic V2V pairing algorithm based on path proximity, combined with the Dynamic Time Regularization (DTW) algorithm to quantify the Spatiotemporal similarity of the two vehicle paths. The path fragments are extracted through the sliding window, and the minimum cumulative distance between the EV-C and EV-P path nodes is calculated to ensure the Spatiotemporal alignment of the matching vehicle path.
Once matching pairs are identified, the next critical challenge is to plan optimal rendezvous and subsequent travel paths to minimize the additional costs incurred during the interaction. Yang et al. [9] proposed a time-oriented improved Dijkstra algorithm to minimize the travel time of electric vehicles through the dynamic weight mechanism and the queuing theory model. Sari et al. [10] proposed a hybrid ant colony algorithm, the core feature of which is multi-strategy fusion: by combining the genetic algorithm, particle swarm optimization or artificial potential field method, multi-objective collaborative optimization of path length, number of turns and energy consumption can be realized. Liu et al. [11] developed a JPS-enhanced A* algorithm to significantly improve the computational efficiency. Li et al. [12] developed a Q-learning dynamic path planning framework that combined LSTM traffic prediction with a Kuhn–Munkres (KM) matching algorithm. The innovation lies in the establishment of a longitudinal dynamic energy consumption model to generate the optimal charging path through real-time road condition prediction. In the measured road network in Nanjing, the scheme reduces the charging waiting time by 15% and realizes cooperative navigation with low communication overhead. Liu et al. [8] designed a Spatio-temporal collaborative path planning algorithm based on the dynamic V2V pairing algorithm of path proximity, which solves the problem of additional travel time and energy waste caused by the traditional parking charging mode, significantly shortens the charging time and improves the service efficiency. Yang et al. [13] proposed a crowd perception-driven path planning framework that reduced travel time by 21.7% and charging cost by 14.5% in the Shenzhen case. Wu et al. [14] developed a PHEV path–velocity joint optimization model, which solves the defect of traditional TSP ignoring velocity elasticity through sub-gradient cutting and effective inequality, achieves a maximum of 59% reduction in energy consumption, and promotes the refined scheduling of electric logistics.
Closely related to path planning is scheduling optimization, which involves coordinating the entire fleet or multiple energy-sharing events at a macroscopic level to ensure overall system efficiency and stability. Merkuryeva et al. [15] developed a simulation-based vehicle scheduling algorithm to solve the problem of minimizing the idle time of trucks with time windows, which is characterized by dealing with multiple constraints (such as capacity and time windows) through OptQuest and evolutionary algorithms, and using simulation models to verify the feasibility. Rashmi et al. [16] proposed a local optimal and fair allocation scheduling algorithm to solve the problem of charging cost optimization of electric vehicles (EVs), which is characterized by the use of the PSO algorithm to deal with dynamic EV arrival and the sliding window mechanism to achieve efficient grid load balancing. Gupta et al. [17] designed a hierarchical scheduling algorithm (VAA/VSA) for wind–solar charging stations, which prioritizes the use of renewable energy in synergy with V2V, and only uses the grid as a backup. The contribution is to verify the feasibility of collaborative diSPATch in reducing grid dependence (energy consumption by 94%) and cost (by 68%), and providing practical solutions for green charging infrastructure. Fan et al. [2] proposed a fairness-aware MADDPG scheduling algorithm and introduced Noisy Net to enhance the exploration ability. By designing a reward function with a fairness factor, the scheduling robustness problem when some chargers fail. Abdolmaleki et al. [18] proposed a V2V charging scheduling framework based on dynamic programming, which realizes the real-time scheduling of vehicle formation charging through Spatiotemporal-energy network modeling. It solves the problems of insufficient charging facilities, high battery cost and range anxiety, and reduces the peak pressure of the power grid by 15%. Kabir et al. [19] proposed a mobile charging vehicle scheduling framework, first established an ILP model, but proved to be an NP difficult problem, and then proposed three heuristic algorithms: SWSPF (the strictest time window first), SDSPF (minimum power demand first), EASPF (early arrival first), and in the Montreal case, the SDSPF algorithm achieved a service efficiency of 5.33 EVs/truck (close to the ILP optimal solution of 6.66), which solved the problem of insufficient coverage of fixed charging piles
Despite these significant advancements, a critical review of the literature as shown in Table 1 reveals two persistent gaps. First, the majority of existing V2V routing and scheduling models rely on oversimplified, macroscopic energy consumption models. As shown in our review, these models typically neglect the nonlinear dynamics of the vehicle powertrain and, more importantly, fail to incorporate the influence of real-time traffic signal information (SPAT), which is a key determinant of energy efficiency in urban driving. This leads to inaccurate energy and travel time predictions, undermining the reliability of the entire optimization process. Furthermore, a fundamental limitation of prior work lies in its decoupled approach, which often neglects the nuanced spatiotemporal dynamics of vehicle matching while treating path planning and speed control as separate problems. This fragmentation inherently leads to suboptimal system-wide performance. In contrast, our framework resolves this by holistically co-optimizing these interdependent variables, ensuring that the vehicle pairing, their rendezvous path, and their speed profiles are mutually optimal.
To address these limitations, this paper proposes a hierarchical framework that bridges the gap between microscopic vehicle control and macroscopic fleet coordination. Our primary contributions are threefold: (1) Develop a high-fidelity energy surrogate model using MPC and SPAT data to provide accurate cost estimations. (2) Employ a sophisticated spatiotemporal matching algorithm (STLCSS) to identify the most promising V2V pairs. (3) Introduce an integrated optimization approach that co-optimizes rendezvous paths and speed profiles simultaneously, leading to a more efficient and economical V2V energy-sharing strategy.

2. Research Object and Problem Statement

2.1. Research Object

Figure 1 presents an integrated framework designed to optimize vehicle operations, with a particular emphasis on enabling energy sharing through vehicle-to-vehicle (V2V) interactions. The core of this framework is V2V communication technology, which serves as the fundamental mechanism for inter-vehicle coordination and data exchange. Surrounding this core are three interconnected modules, each providing distinct functionalities that collectively support the system’s operation.
Located in the top-left quadrant, the SUMO Vehicle Modeling module establishes a high-fidelity simulation environment that accurately replicates vehicle dynamics and traffic scenarios. This module’s capability to capture the complexities of real-world traffic is pivotal, as it generates reliable input data essential for subsequent optimization stages. The simulation integrity ensures that vehicle interactions, road network behaviors, and traffic flow patterns are authentically represented, providing a robust foundation for system-wide operational analysis.
Positioned in the top-right quadrant, the Scheduled Speed and Path Optimization module performs dynamic adjustments to predefined routes and velocities. By employing the Frank–Wolfe algorithm for route computation combined with PSO (Particle Swarm Optimization) for multi-objective enhancement, this module ensures vehicles adhere to highly efficient trajectories according to operational schedules. Energy efficiency serves as the primary performance metric, driving the selection of routes and speeds that minimize energy consumption while satisfying all scheduling constraints and operational requirements.
The Supply–Demand-Based Vehicle Matching for Energy Sharing module (bottom), which utilizes the specialized STLCSS algorithm, facilitates energy-sharing partnerships by matching vehicles with surplus energy to those in need within the energy network. This process supports V2V energy transfer, improves overall energy utilization, and offers operational flexibility for vehicles with diverse energy requirements.
These three components constitute the core architecture of the V2V energy reciprocity system, enabling seamless interaction between urban vehicles and cloud-based central control through bidirectional energy information and routing data exchange. This integrated mechanism ensures optimal task execution with enhanced energy efficiency and minimized operational consumption, while facilitating real-time system-level coordination and adaptive decision-making capabilities.

2.2. Technical Route for V2V Energy Sharing

In this paper, a three-level topology (node-section-path arc) of urban road network based on graph theory is constructed. The node layer integrates real-time phase information (SPAT) of traffic lights, and dynamically analyzes timing parameters such as traffic light cycle and green light window through V2I communication. The road section layer integrates the road speed limit and the road length; The path arc layer characterizes the energy characteristics of the complete travel path. The digital map is used to extract the pre-planned path, and the geometric attributes (node information, road length) and signal light time series are modeled by the open-source traffic simulation tool Netedit, so as to form a dynamic road network basic framework with temporal and SPAT constraints.
A longitudinal vehicle dynamics model quantifies the balance between traction, aerodynamic drag, and rolling resistance, capturing discrepancies between commanded and actual acceleration. SPAT-driven target speed range calculations ensure signal-passing within green windows, reducing idling losses. A MPC framework formulates a multi-objective optimization problem: the cost function combines energy consumption (piecewise traction/regenerative braking), traffic light synchronization penalties, and acceleration smoothness, with constraints including speed limits and jerk thresholds. The CasADi-IPOPT solver enables efficient nonlinear optimization. A cubic polynomial surrogate model linking average speed to energy consumption supports path-level evaluation.
Based on the Floyd–Warshall algorithm, the global shortest path is calculated, and a logistics distribution model with time windows and dynamic constraints is established. Multi-objective Particle Swarm Optimization (MOPSO) is used to optimize the path velocity sequence: the particle position encodes the road section speed, the objective function joint minimizes the energy consumption proxy model output and the time window default penalty, and the inertia weight linear attenuation balances the global exploration and local development. At the same time, the SPATiotemporal Longest Common Subsequence Algorithm (STLCSS) is used to achieve supply and demand matching: the SPAT distance threshold (ε, δ) is set, the trajectory point alignment is recursively calculated, and the normalized similarity score is used to screen the optimal EV-P/EV-C pairing, so as to solve the SPAT dislocation problem of traditional matching.
Vehicles are classified as Energy-Providers (EV-Ps, surplus SOC) or Energy-Consumers (EV-Cs, deficit SOC) via non-charging simulation. For matched pairs, candidate rendezvous points (nodes/neighboring stations) along the EV-C’s remaining path are generated. Multi-objective optimization selects rendezvous locations by maximizing STLCSS similarity, minimizing arrival time differences, and satisfying energy conservation (EV-P post-transfer SOC suffices for return). Energy transfer compensates the EV-C’s deficit, with dynamic SOC updates. Post-transfer speed re-optimization and receding horizon control ensure time-window compliance and mission completion.
The framework achieves a breakthrough through three layers of innovation: (1) coupled modeling of road network–signal lights–dynamics improves the accuracy of energy consumption prediction; (2) the MOPSO-STLCSS synergy mechanism reduces the suboptimal risk of path planning and matching; (3) the dynamic mutual aid strategy activates the redundant energy circulation of the fleet. Compared with the traditional charging station mode, it significantly reduces the additional energy consumption and time window default costs caused by bypasses, effectively improves the utilization rate of redundant energy, and achieves system-level optimization of both operating costs and total energy consumption.
The technology roadmap is shown in Figure 2:

3. Modeling of Road Section Efficiency and Dynamic Cost Quantification Under Free Traffic Flow

The energy efficiency optimization of urban transportation systems is a key technological pathway to achieving the ‘dual carbon’ goals. Traditional energy consumption models are mostly based on static road network assumptions, neglecting the dynamic nature of traffic signals, vehicle power characteristics, and the coupling effects of road network topology. As a result, these models fail to meet the needs of real-time energy consumption prediction and optimization in intelligent and connected environments. To address this, this chapter proposes a three-stage modeling framework of ‘multi-granularity road network modeling–energy efficiency agent training–dynamic cost quantification,’ aiming to overcome the limitations of traditional static models and achieve refined energy consumption prediction in complex traffic scenarios. The main notations and variables used throughout this paper are summarized in the Nomenclature table in Appendix A for easy reference.

3.1. Multi-Granularity Road Network Energy Consumption Modeling

3.1.1. Energy Consumption Model Based on Vehicle Dynamics

In a connected environment, various vehicle information such as position and speed can be obtained through V2I technology and GPS systems. Since the speed optimization is performed solely based on V2V and V2I information, lateral and vertical control of the vehicle are neglected, and a vehicle dynamics model is established. The longitudinal dynamics model constructed in this study is as follows:
x = s v T
where s denotes the vehicle position and v represents the vehicle speed. The control variable u represents the acceleration a, and the vehicle dynamics are described by the following system of ordinary differential equations:
d s d t = v ( t ) d v d t = a real ( t )
In the equation, the actual acceleration areal is determined jointly by the commanded acceleration acmd and the resistance acceleration.
a real , k = a cmd , k F resist , k m
Here, Fresist,k represents the total resistance force, and m denotes the vehicle mass. The total resistance comprises aerodynamic resistance Faero and rolling resistance Froll (road gradient is neglected for urban roads). The calculation formulas are as follows:
F resist , k = 1 2 C d A f ρ air v k 2 + C r m g
Here, Cd is the aerodynamic drag coefficient, A is the frontal area of the vehicle, ρ is the air density, Cr is the rolling resistance coefficient, and g is the gravitational acceleration.
Since the system is already linear, linearization is not required. The continuous-state equations for the vehicle’s longitudinal motion are given by
x ˙ = A x + B u
A = 0 1 0 1 τ d , B = 0 T s τ d
Here, x ˙ denotes the system state variables, and u represents the system control variables. The kinematic equations are discretized using an explicit discretization method based on the analytical solution of uniformly accelerated motion. Under the assumption of approximately constant acceleration, this method offers high accuracy and fast computation, making it suitable for real-time Model Predictive Control (MPC).
By integrating the position s, the velocity
v ( t ) = d s d t
is obtained. Assuming the actual acceleration areal remains constant, the displacement update equation is given by
s k + 1 = s k + t k t k + Δ t v k + a real , k ( t t k ) d t = s k + v k Δ t + 1 2 a real , k Δ t 2
Here, sk denotes the vehicle position at step k, vk is the vehicle speed (m/s) at step k, ak represents the actual acceleration at step k, and Δt is the time step duration (seconds).
These constraints ensure the safety and feasibility of the vehicle’s operation in real-world conditions. Next, we consider the velocity update equation. By integrating the velocity v(t) and assuming that the actual acceleration areal remains constant, the velocity update equation can be derived as follows:
v k + 1 = v k + a real , k Δ t
Here, vk+1 represents the velocity at the next time step, vk is the velocity at the current time step, areal,k denotes the actual acceleration at the current time step, and Δt is the time step duration.
Based on the dynamic model, the energy consumption and recovery characteristics are further quantified. The relationship between the traction force Ftraction and the instantaneous power Pk is expressed as
F traction , k = m a actual , k + F aero , k + F roll , k
P k = F traction , k v k
Energy consumption calculations distinguish between driving and braking conditions. When P > 0, the vehicle consumes energy during acceleration or cruising; when P < 0, the vehicle decelerates and recovers energy through regenerative braking.
Δ E consumed = P Δ t ( P > 0 ) Δ E recovered = | P Δ t | η regen ( P < 0 )
Here, Δt denotes the simulation time step, and ηregen represents the regenerative braking efficiency.
During braking, kinetic energy is converted into heat or electrical energy:
Δ K E = 1 2 m v k 2 v k + 1 2
E brake = ( 1 η ) Δ K E
The total energy consumption Etotal is calculated by the integral formula:
E total = k = 0 N max ( P k , 0 ) Δ t η regen min ( P k , 0 ) Δ t
To simplify the model, it is assumed that energy conversion occurs without additional losses (e.g., the motor efficiency is assumed to be constant at 1), focusing primarily on capturing the macroscopic energy consumption trends.

3.1.2. Construction of the Planned Road Network Model

To achieve dynamic optimization of urban road network energy efficiency, this study constructs a three-level topological model of ‘node–edge–arc’ based on graph theory, as shown in Figure 3. i and j: These are “customer points,” marked with blue stars. They represent key locations in the network, such as the starting point and destination of a journey, or specific pickup/delivery locations.
a and b: These are “intermediate nodes,” marked with red circles. They represent road intersections or waypoints along a route that are neither the origin nor the final destination. They are points that must be passed through when traveling between other locations.
Specifically, the road network graph G is decomposed into the following layers:
Node: Represents road intersections or key points (e.g., customer points i, j and intermediate nodes a, b). Traffic signal nodes are defined as special control points, and their states (red/green light phases and remaining time) are updated in real time via V2I (Vehicle-to-Infrastructure) communication.
Edge: Physical road segments that connect adjacent nodes (e.g., ia, ab, jb), integrating road segment speed limits and traffic flow density data.
Arc: A complete path composed of multiple consecutive edges (e.g., the feasible path from customer point i to j), whose energy efficiency characteristics are influenced by the coordination of all traffic signals along the path.
The model dynamically integrates SPAT information to enable online parsing of parameters such as the signal cycle tc, green light window tgreen, and red light window tred. Specifically, the traffic signal uses the red light phase as the cycle start point, and its state switching logic satisfies the following conditions:
t current = mod ( t , t c )   and   K w = t t c
In the equation, Kw is the cycle counter used to synchronize the signal timings of multiple intersections. This design provides the dynamic temporal constraints necessary for subsequent speed planning.
Traditional cruise control (CCC) and car-following models (such as the Gipps model) have the issue of stopping at red lights. To address this, this study proposes a speed planning method based on SPAT information. This method analyzes the status of the upcoming traffic signals (including the remaining green light time tgreen_remain and the red light waiting time tred_wait) to calculate the target speed range [vilb,vihb]. When the speed of the extended-range vehicle falls within this range, it ensures the vehicle can pass through the traffic signal within the current green light time window, thus reducing stop or idling time and alleviating traffic congestion. The calculation of the target speed range is shown in Figure 4.
The formulas for calculating the target speed range based on SPAT information are given by Equations (17)–(21). It should be noted that these formulas must satisfy the constraints of urban road speed limits and the longitudinal dynamics of the vehicle. Additionally, the formulas also provide the calculation method for the traffic signal states during the modeling process. In order to increase the number of extended-range vehicles passing through the traffic signal within the same green light time window, thereby reducing the time for a specific trip, this section sets the target speed of extended-range vehicles as the upper limit of the target speed range. The specific formulas are as follows:
v i t h b ( t ) = d i a ( t ) K w t c t g t Green   light v i max Green   light   and   d i a ( t ) K w t c t v i max d i a ( t ) K w t c + t r t Other   cases   when   green   light
v i l b ( t ) = d i a ( t ) K w t c t Red   light d i a ( t ) K w t c t Green   light   and   d i a ( t ) K w t c t v i max d i a ( t ) ( K w + 1 ) t c t Other   cases   when   green   light
Traffic   light   status = Red   light if   0 mod t t c < t r Green   light if   t r mod t t c t c
t _ c = t _ g + t _ r
K w > k t c
In the formula, vitarget denotes the target speed of the i-th vehicle; vilb and vihb represent the lower and upper bounds of the target speed range, respectively; t is the total travel time of the vehicle; dia is the distance from the i-th vehicle to the upstream traffic signal a; Kw is an integer describing the cycle count of the traffic signal; tg and tr denote the durations of the green and red lights, respectively; tc represents the duration of one full signal cycle; and mod is the modulo function producing the remainder of t divided by tc.
Within the dynamic programming framework, traffic signal coordination is quantified through a time-window penalty term. The estimated time of arrival (ETA) of a vehicle at traffic signal a is defined as
t ETA = d i a v k
If t ETA [ K w t c , K w t c + t green ] , the departure penalty function is defined as
P traffic _ light = μ max ( 0 , K w t c t ETA ) + max ( 0 , t ETA ( K w t c + t green ) )
In the equation, μ represents the penalty weight used to dynamically adjust the vehicle speed in order to synchronize with the traffic signal timing. This mechanism effectively reduces energy waste caused by stopping at red lights, including idling fuel consumption and braking energy losses.
Following the proposed methodology, a set of pre-planned candidate driving routes is first established based on node coordinates and real-world road network data, as visualized in Figure 5. Subsequently, focusing on the longitudinal vehicle dynamics (while temporarily neglecting lateral dynamic effects), each extracted route is meticulously modeled and post-processed using the open-source traffic network simulation tool Netedit. Specifically, for every pair of directly connected nodes in the route topology, a precise road segment model is constructed that reflects actual road geometric features such as the number of lanes, gradient, and curvature. Additionally, timing models of key traffic control devices, such as traffic signals, are integrated. Through this series of modeling steps, the original route planning tasks are ultimately abstracted and transformed into directed arcs within the road network, each possessing specific energy consumption attributes—referred to as ‘energy arcs’—thus laying a solid foundation for subsequent energy consumption optimization or route evaluation.

3.2. Optimal Control Problem Solution for Vehicle Speed Planning Based on MPC

Addressing the vehicle passage problem at continuously signal-controlled urban intersections, the objective function is constructed as a weighted sum of electric energy consumption, travel time, speed fluctuations, and overspeed penalties. Considering constraints such as vehicle longitudinal dynamics, road speed limits, and target speed range, an optimal control problem for economically efficient vehicle speed planning through continuous urban arterial intersections is formulated based on the state transitions of the vehicle speed state variables. The following sections detail the design of the state transition matrix, objective function, and constraint conditions within this optimal control problem.

3.2.1. State Transition Matrix

To accommodate real-time MPC, an explicit discretization method based on the uniform acceleration assumption is employed. We introduce the state vector xk and the control vector uk. The state vector xk consists of position sk and velocity vk, while the control vector uk includes the commanded acceleration ak. The state update equation can be expressed as
x k + 1 = A x k + B u k + D ( v k )
Here, matrices A, B, and D(vk) represent the system dynamics matrix, control input matrix, and resistance influence matrix, respectively. Their specific forms are as follows:
A = 1 Δ t 0 1
B = ( Δ t ) 2 2 Δ t
D ( v k ) = ( Δ t ) 2 2 m F resist ( v k ) Δ t m F resist ( v k )
Finally, based on the analytical solution of uniformly accelerated motion, the discretized state update equation can be expressed as
x k + 1 = f ( x k , u k ) = s k + v k Δ t + 1 2 u k F aero , k + F roll m Δ t 2 v k + u k F aero , k + F roll m Δ t
Here, Faero and Froll represent the aerodynamic resistance and rolling resistance, respectively, and uk denotes the commanded acceleration.

3.2.2. Constraints

The constraints include speed limits, acceleration bounds, and jerk (rate of change of acceleration) limits:
a min a k a max 0 v k v max v [ k ] v legal ( s [ k ] ) s k s k 1 j k j max = a max a min Δ t d safe d min = 5   m s ego [ k ] s lead [ k ] ( d safe + v ego [ k ] t reaction ) if   s [ k ] [ s stop _ line ϵ , s stop _ line + ϵ ] ,   then   v [ k ] = 0 v [ k ] v max + ϵ v , ϵ v 0 , w e ϵ v min
vlegal(s) is the speed limit function for the road segment; dsafe is the static safety distance; sego(k), vego(k) represent the position and speed of the ego vehicle at step k, respectively; slead(k) is the position of the lead vehicle at step k; treactiont is the driver’s reaction time; and ϵ is the relaxation variable, which converts the hard constraint into a soft constraint.

3.2.3. Objective Function

For the research subject, the range-extended vehicle, the primary performance metric for evaluating the vehicle’s driving economy is the consumed electric energy, while also considering the requirement to arrive at the destination on time. Therefore, the optimization objective of the model focuses on two core indicators: energy consumption and travel time, ensuring that the vehicle minimizes energy consumption while reaching the target location within the specified time. To improve driving smoothness, a mechanism is designed to mitigate situations with large speed fluctuations. This is achieved through acceleration smoothing and jerk (rate of change of acceleration) control, reducing abrupt acceleration and deceleration, thereby avoiding unnecessary energy loss and negative impacts on passenger comfort. Additionally, the model incorporates a penalty term for speed points exceeding the target speed range, ensuring the vehicle maintains a reasonable speed range during operation, thus enhancing driving efficiency while balancing economy and safety.
(1) Speed Tracking Cost Term
This cost term aims to optimize driving efficiency and reduce travel time by minimizing the deviation between the vehicle’s speed and the preset maximum allowable speed. Its mathematical expression is the weighted sum of the squared differences in speed at each time step within the prediction horizon:
C speed = w v k = 0 N v max v [ k ] 2
In this control model, vmax represents the speed limit of the road or the maximum allowable speed of the vehicle, while v(k) is the vehicle’s speed at the k-th time step within the prediction horizon.
(2) Acceleration Smoothing Cost Term
To improve driving smoothness and reduce the mechanical system load, this cost term penalizes abrupt changes in acceleration between adjacent time steps, thus constraining the continuity of the control inputs.
C accel _ smooth = w a k = 1 N a [ k + 1 ] a [ k ] 2
Here, a(k) represents the acceleration control input at the k-th time step.
(3) Jerk Penalty Term
Jerk (the time derivative of acceleration) directly affects passenger comfort. To suppress the shocks caused by rapid acceleration or braking, this cost term applies a quadratic penalty to jerk:
C jerk = w j k = 1 N a [ k + 1 ] a [ k ] Δ t 2
In the equation, Δt represents the discrete time step. According to the ISO 2631 standard [20] for human vibration comfort, the jerk threshold is generally set to 0.8 m/s3.
(4) Energy Consumption Cost Term
For energy efficiency optimization in electric vehicles, this cost term differentiates between traction and regenerative braking conditions and constructs a piecewise energy consumption penalty function:
C energy = w e k = 0 N a [ k ] 2 ( a > 0 , driving ) ( η regen a [ k ] ) 2 ( a < 0 , regenerative   braking )
Here, ηregen represents the regenerative braking efficiency coefficient.
(5) Traffic Light Time Window Penalty Term
To reduce intersection delays and improve traffic flow, this cost term guides the vehicle to pass through the traffic light within the predicted green light time window. Its mathematical form is:
C tl = i , j II ( C o n s t r a i n t s ) T arrive , i , j proj g r e e n   l i g h t ( T arrive , i , j ) 2
η i , k = tl _ pos i s k max ( v k , 0.1 )
T a r r i v e , i , j represents the arrival time of the vehicle; t l _ p o s i is the position of the iii-th traffic light; v k , s k are the speed and position of the vehicle at the kkk-th time step, respectively; g r e e n _ s t a r t i and g r e e n _ e n d i are the start and end times of the green light, respectively; η i , k is the predicted arrival time of the vehicle; and Llookahead is the lookahead distance.
(6) Moderate Speed Cost Function
Moderate speed plays a multifaceted positive role in vehicle control strategies, helping to achieve energy savings, safety, efficiency, and comfort in travel. Its mathematical form is:
J mid = w mid k = 0 N 1 v k v mid 2
w m i d is the weight corresponding to the moderate speed, and v m i d is the preset moderate speed value.
(7) Multi-Objective Coordination and Weight Design
C total = k = 0 N w v ( v max v [ k ] ) 2 + w a a [ k + 1 ] a [ k ] 2 I ( k > 0 ) + w j a [ k + 1 ] a [ k ] Δ t 2 I ( k > 0 ) + w ε a [ k ] 2 I ( a [ k ] > 0 ) ( η regen a [ k ] ) 2 I ( a [ k ] < 0 ) + C tl + w mid k = 0 N 1 ( v k v mid ) 2
where δ activate [ k ] is the activation function, which takes the value of 1 when approaching an intersection. The total cost function is achieved by the weighted fusion of multiple objectives, enabling comprehensive optimization of the vehicle’s longitudinal motion control. Its core consists of five cost terms: speed tracking, acceleration smoothing, jerk suppression, energy consumption efficiency, and traffic light coordination, with their respective priorities adjusted through weight coefficients wv, wa, wj, we, wt.
In Model Predictive Control (MPC), the weight coefficients of each cost term are determined through multi-objective optimization theory. The core idea is to unify heterogeneous objectives into a dimensionless weighted sum.
w i = α i σ i 2
where αi is the subjective adjustment factor for the priority of the objective (determined by the scenario requirements), and σi represents the standard deviation of the historical data for the corresponding cost term (used to eliminate dimensional differences).
w v = α v ( v max ) 2 w a = α a ( a max a min ) 2 w j = α j ( j max ) 2 w e = α e P base 2 w t = α t ( Δ t window ) 2 w mid = 1.2 w mid base , v < 0.3 v mid 0.8 w mid base , v > 0.7 v mid w mid base , otherwise
(8) Dynamic Weight Adjustment and Weight Varying with Scenario Changes:
w = w base α ( d it , Δ v )
Traffic Light Distance Adaptation:
α ( d i t ) = 3.75 d i t < 100   m 2.0 100   m d i t < 300   m 1.5 There   exists   a   secondary   traffic   light   ( d i t 2 < 500   m ) 1.0 Other   cases
Speed Error Compensation:
α ( Δ v ) = 1.25 if   | v v des | < 2   m / s 1.0 Other   cases
Dynamic weight adjustment involves real-time regulation of the priorities of different control objectives, enabling the vehicle to make optimal decisions in complex and dynamic traffic scenarios. Its core significance lies in achieving dynamic balance among multiple objectives: when the vehicle approaches a traffic light, the system automatically increases the weight of traffic flow efficiency, prioritizing the calculation of the optimal passing speed or stopping strategy; during cruising, the focus shifts to energy consumption optimization, minimizing unnecessary acceleration fluctuations to improve energy efficiency; when encountering deceleration by the leading vehicle or sudden obstacles, the weight of the safety distance becomes dominant to ensure driving safety.

3.2.4. Solution Method

This study utilizes the combination of CasADi symbolic modeling and the IPOPT solver to address nonlinear optimization problems. CasADi, through symbolic expressions and automatic differentiation techniques, enables intuitive construction of mathematical models for complex systems (such as vehicle dynamics) and efficiently computes gradients and Hessian matrices. Additionally, CasADi supports the generation of efficient code, significantly improving computation speed, making it particularly suitable for systems with high real-time computation demands.
As a nonlinear optimization solver, IPOPT uses the Interior Point Method to handle large-scale, non-convex, and highly constrained optimization problems. It features sparse matrix optimization and adaptive parameter adjustment, ensuring numerical stability and efficient convergence during the solving process. In practice, CasADi symbolizes the optimization problem and passes it to IPOPT, leveraging its powerful solving capabilities and optimization strategies to ensure the efficient solution of complex optimization problems. Additionally, through warm-starting and dynamic parameter updates (such as time-varying traffic light constraints), this combination significantly enhances the real-time performance and dynamic adaptability of the optimization problem solving.
The combination of CasADi and IPOPT is particularly suitable for the rolling optimization problems in Model Predictive Control (MPC), allowing real-time adjustment of control strategies based on the latest system states and constraints. This combination not only benefits from being open-source and free, reducing deployment costs, but also offers excellent cross-platform compatibility, supporting the entire process from prototype development to embedded deployment. Therefore, CasADi and IPOPT have become the preferred tools for multi-objective nonlinear optimization problems in fields such as autonomous driving and robotics, balancing modeling flexibility, solving efficiency, and engineering practicality.

3.2.5. Generation of the Energy Consumption Surrogate Model

Initially, pre-planned candidate driving routes were extracted from the digital map. These routes were then input into the open-source traffic network simulation tool Netedit for detailed modeling, with a primary focus on the vehicle’s longitudinal dynamics (temporarily ignoring the effects of lateral dynamics). Specifically, for each pair of directly connected nodes in the road network, a precise road segment model was constructed to realistically reflect the geometric characteristics of the road, including parameters such as the number of lanes, slope, and curvature.
Building upon this, the model further integrated the timing and scheduling model for traffic signals and other key traffic control facilities to accurately capture the impact of traffic signals on the driving process during the simulation. Through these modeling steps, the initial route planning task was gradually transformed into a physically meaningful directed arc model in the road network. Each directed arc represents a specific road segment in the network and is assigned a particular energy consumption attribute, referred to as the ‘energy arc’. The energy consumption value considers the road’s geometric characteristics, traffic signal timing, and the influence of other traffic control facilities.
This series of modeling and post-processing steps transforms the initial route planning task into a surrogate model that can be used for energy consumption assessment and optimization. To further accurately assess the energy consumption of each ‘energy arc’, a curve fitting of speed versus unit motor shaft energy consumption was performed, which was then used as the surrogate model for energy consumption.
A cubic polynomial least squares fitting method is used to establish a quantitative relationship model between the average speed (v) and unit motor shaft energy consumption (E):
Mathematical model:
E ( v ) = a v 3 + b v 2 + c v + d
Fitting Principle:
min i = 1 n E i a v i 3 + b v i 2 + c v i + d 2
where (vi,Ei) represents the i-th experimental data point, and nnn is the total number of data points. This optimization process is implemented using the polyfit function from the NumPy library. Input the 500 data points from Figure 6 into the function to obtain the coefficients [a, b, c, d]. By minimizing the sum of squared vertical distances between the experimental data points and the fitted curve, the optimal polynomial coefficients are obtained using NumPy’s polyfit function, leading to the final energy-speed equation:
E = 7.61 × 10 5 v 3 + 5.05 × 10 3 v 2 4.06 × 10 2 v + 2.35 × 10 1
This fitting method demonstrates high accuracy within the experimental speed range, with a coefficient of determination R2 of 0.7 and an average relative error of less than 3%. The figure also shows a ±5% fitting error band, providing a visual representation of the model’s prediction confidence interval. The curve clearly reveals the nonlinear relationship between energy consumption and speed: in the low-speed region (<10 m/s), energy consumption increases super-linearly; in the medium-speed region (10–20 m/s), it approaches a linear increase; while in the high-speed region (>20 m/s), a significant cubic growth characteristic is observed, with a notable minimum energy consumption point around 8.5 m/s.

4. Space-Time Coordination and Path Planning for Vehicles

4.1. Coordinated Speed-Route Optimization for Individual Vehicles

4.1.1. Problem Definition and Modeling

In urban logistics distribution systems, route planning must comprehensively consider multiple factors including road network topology, vehicle dynamics constraints, and energy supply–demand balance. The urban road network is defined as a weighted graph G = (V,E), where the node set V contains n delivery points (e.g., universities, warehouses), and the edge set E represents roadway connectivity. Each vehicle departs from the origin and sequentially visits multiple delivery points while satisfying the following constraints:
Time Window: Each delivery point has a strict time window [tearliest,tlatest]; vehicles must arrive within this interval.
Dynamics: v∈[vmin,vmax].
Energy Balance: Vehicle energy consumption must align with charging/supply facility SPATiotemporal distribution.
Objective Function:
Total   Cost = min i = 1 n E i + P e Δ t e a r l y , i + P l Δ t l a t e , i + C C
where n is the fleet size, Pe and Pl are penalties for early/late arrivals, and CC is the charging cost.

4.1.2. Inter-Node Distance Calculation (Haversine Formula)

For geographical coordinates (latitude/longitude) in urban road networks, we calculate spherical distances using the Haversine formula:
a = sin 2 Δ ϕ 2 + cos ϕ 1 cos ϕ 2 sin 2 Δ λ 2
c = 2 atan 2 a , 1 a
d = R c d
where R is the Earth’s radius, ϕ1, ϕ2 are start/end latitudes (radians), and λ1,λ2 are start/end longitudes (radians).
Δ φ = φ 2 φ 1
Δ λ = λ 2 λ 1

4.1.3. Global Shortest Path Calculation (Floyd–Warshall Algorithm)

To obtain the shortest path matrix between all node pairs:
(i) Initialization: Adjacency matrix D(0)(i,j) stores initial edge weights (direct distances)
(ii) Iterative update: For each intermediate node k∈[1,n], update all node pairs (i,j):
D k ( i , j ) = min D k 1 ( i , j ) , D k 1 ( i , k ) + D k 1 ( k , j )
(iii) Termination: When k = n, D(n)(i,j) contains global shortest paths.

4.1.4. Battery Aging Cost Model

To provide a holistic assessment of the V2V energy sharing strategy and address its long-term economic sustainability by accounting for the impact on fleet assets, we incorporated a battery aging (BA) cost model [21]. The total BA cost for the entire fleet, denoted as Caging, is the summation of costs from every operational segment (e.g., driving, charging, V2V transfer) for every vehicle in the fleet. The aging cost for a single vehicle v during a specific operational segment ss, denoted as Cs(v,s), is modeled as follows:
C s ( v , s ) = E c y c l e d ( v , s ) β α s o c ( v , s )
The total fleet aging cost is therefore
C aging = v Fleet s Segments ( v ) C s ( v , s )
Ecycled(v,s) is the absolute amount of energy (in kWh) that is either charged into or discharged from the battery of vehicle v during segment s.
Β is the base degradation cost factor, a constant representing the baseline monetary cost per kWh of energy cycled under normal conditions (e.g., β = 0.1 ¥/kWh).
αsoc(v,s) is the dimensionless SOC Stress Factor, which applies a penalty for operating at non-ideal states of charge. It is defined as a piecewise function of the average SOC during the segment, SOCavg(v,s):
α soc ( v , s ) = α high if   S O C avg ( v , s ) < 20 %   or   S O C avg ( v , s ) > 80 % α low if   20 % S O C avg ( v , s ) 80 %
In our simulation, we set high = 1.5 and αlow = 1.0 to reflect the accelerated aging that occurs at the extremes of the SOC range. This comprehensive model ensures that every process affecting battery life—driving, conventional charging, and V2V energy sharing—is fairly quantified and contributes to the total aging cost.

4.1.5. Cost Calculation

This section calculates the total operating costs under both charging station and mutual energy supply modes through weighted summation of multi-dimensional cost components to achieve comprehensive cost evaluation.
Energy Costs:
Charging Station Mode:
C = k = 1 num _ vehicles ( E I , k + E C , k γ E F , k )
Here, EI,k is the initial energy of the k-th vehicle (kW·h), E C , k is the energy replenished at charging stations (kW·h), E F , k is the final remaining energy (kW·h), and γ is the electricity price (¥/kW·h).
Mutual Supply Mode:
C = k = 1 num _ vehicles ( E I , k + E C , k i E F , k ) i = ± 1
In this mode, i is a weighting coefficient: i = −1 for energy-supplying vehicles and i = 1 for energy-demanding vehicles.
Time Window Penalty Costs:
C P = k = 1 num _ vehicles C P , k
where CP represents the time window penalty cost (¥) for all vehicle, arising from early or late arrivals. CP,k represents the time window penalty cost (¥) for the k-th vehicle, arising from early or late arrivals.
Time Consumes Costs:
C t = k = 1 num _ vehicles ρ T k
where Ct represents the cost of time (¥) incurred by all vehicles to drive, recharge, and perform delivery tasks. Tk denotes the total time (¥) consumed by the kth car.

4.1.6. Coordinated Route-Speed Optimization

This section uses MOPSO method to optimize vehicle speed along the shortest path derived by the Floyd–Warshall algorithm. The particle swarm optimization algorithm treats the solution of the optimization problem as a ‘particle’ in a multidimensional search space. Each particle has a position (representing the candidate solution) and velocity (representing the search direction and step size). For urban logistics scenarios, we focus on minimizing energy consumption and time window violations. Given the path node sequence R = [r1, r2, …, rm] (m nodes), the adjacent node spacing di = distance (ri, ri 1). The velocity vi∈[vmin, vmax] needs to be assigned to each segment of the path i∈[1, m − 1], and the objective function consists of two parts:
Energy Consumption Items:
E ( v i , d i ) = [ a ( v i 3.6 ) 3 + b ( v i 3.6 ) 2 + c ( v i 3.6 ) + d ] d i / 100
The coefficients a = −7.61 × 10−5, b = 0.00505, c = −0.0406, d = 0.235 were obtained by fitting the measured data.
Time Window Penalties:
P i = P e × max ( t e t a , 0 ) P l × max ( t a t l , 0 )
where [ej,lj] is the time window for node rj and tiarr is the arrival time.
ej represents the earliest allowed arrival time at task node j;
lj represents the latest allowed arrival time at task node j.
Objective function:
f ( v ) = i = 1 m 1 [ E ( v i , d i ) + P i ]
Constraints:
SOC k = SOC 0 j = 1 k E ( v j , d j ) SOC threshold k [ 1 , m 1 ]
Solutions that violate constraints are marked as infeasible (fitness is set to ∞).
The particle position represents the velocity sequence:
x = v 1 , v 2 , , v m 1 m 1
Speed Update Equation:
v t + 1   =   ω ( t ) v ( t ) +   c 1 r 1 p b e s t , t   x ( t )   + c 2 r 2 g b e s t   x ( t )
Position Update Equation:
x t + 1 = x t + v t + 1
where i is the number of the particle, v(t) and x(t) represent the velocity and position of the particle at time t, pbest,t represent the historical optimal position of particle i, and gbest represents the historical optimal position of the population. ω denotes the inertia weight; c1 and c2 denote the learning factor, adjusting the weight of individual and group experiences, respectively; usually c1 = c2∈[1,2]; r1, r2 denotes the random number of [0,1]; and the search randomness is introduced to avoid local optimality.
Inertia weights decrease linearly:
ω ( t ) = ω m a x   ω m a x   ω min t     T m a x  
where ω(t) (the current inertia weight) represents the dynamic weight value at iteration step t, which directly controls the strength of the inertia of the particle motion; ωmax (maximum inertia weight) is the weight benchmark value at the initial iteration (t = 0), usually in the range of [0.8, 0.9]. ωmin (minimum inertia weight) is the lower limit of the weight when the algorithm is terminated (t = Tmax), and the typical value range is [0.4, 0.6]. t (the current number of iterations) represents the execution progress of the algorithm, and its value ranges from 0 to Tmax. Tmax (maximum number of iterations) defines the overall threshold for the number of steps executed by the algorithm, and the optimization process is terminated when t reaches Tmax. The linear decreasing design smoothly attenuates the inertia weight from the initial ωmax to ωmin, maintaining a large inertia in the early stage of optimization to enhance the global search ability, and reducing the inertia weight in the later stage to improve the efficiency of local fine development.

4.2. SPATIotemporal Energy Matching and Dynamic Route Planning

4.2.1. D Energy Situation Matrix

The energy situation matrix M(x,y,t,q) is defined as a four-dimensional representation, where x,y is geographic coordinates (SPATial dimensions), t is timestamp (temporal dimension), q is energy supply/demand quantity (positive values indicate demand, negative values indicate supply). Heuristic algorithms populate this matrix to quantify energy interaction requirements across SPATiotemporal units, providing a foundation for path merging decisions.

4.2.2. SPATiotemporal Longest Common Subsequence (STLCSS)

In multi-vehicle route optimization, evaluating similarity between vehicle trajectories is critical for optimizing scheduling and resource allocation. The SPATiotemporal Longest Common Subsequence (STLCSS) method, an enhanced version of the classic LCSS algorithm, integrates SPATial and temporal information to measure trajectory similarity. By defining SPATial and temporal thresholds, STLCSS identifies the longest subsequence where corresponding points in two trajectories meet both SPATial and temporal alignment criteria.
Given two trajectories:
T 1 = x 1 i , y 1 i , t 1 i T 2 = x 2 j , y 2 j , t 2 j
Matching Conditions:
SPATial Match: The Euclidean distance between two points must satisfy:
d s i , j = s q r t ( x 1 i x 2 j ) 2 + ( y 1 i y 2 j ) 2 ϵ
where ϵ is the SPATial distance threshold.
Temporal Match: The time difference between two points must satisfy:
d t i , j = t 1 i t 2 j δ
where δ is the temporal difference threshold.
Recurrence Relation:
The STLCSS length L(i,j) is computed as
L i , j = 0 , i = 0   o r   j = 0 L i 1 , j 1 + 1 , d s i , j     ϵ   a n d   d t i , j   δ d s i , j   ϵ   a n d   d t i , j     δ , o t h e r w i s e
L(i − 1, j − 1) + 1: Increments the subsequence length when both SPATial and temporal matches occur.
max(L(i − 1, j), L(i, j − 1)): Skips a point to continue searching for the longest subsequence.
Similarity Calculation:
The normalized similarity score is
S i m i l a r i t y T 1 , T 2 = L n , m / m i n n , m
where L(n,m) is the final STLCSS length, and n,m are the number of points in T1 and T2, respectively.

4.2.3. Dynamic Route Planning

Paths are merged and re-optimized based on similarity thresholds. A mixed-integer program minimizes total energy consumption:
min i = 1 num _ vehicles k = 1 K E i , k x i , k
Subject to vehicle capacity, time windows, path continuity, and energy balance.
Here, xi,k∈{0,1} is a Binary decision variable (vehicle I selects path k or not), and Ei,k is an Energy consumption coefficient for vehicle i on path k, (kW·h).

4.3. Energy Mutual Assistance

4.3.1. SPaTiotemporal Matching Based on Energy State Classification

To facilitate V2V energy sharing, vehicles are first classified into energy-providing vehicles (capable of providing energy) and energy-demanding vehicles (requiring energy) based on their final State of Charge (SOC) after completing their assigned delivery routes under a non-charging simulation scenario. This classification process is as follows: Non-charging Simulation:
Energy State Classification: The first step is to determine each vehicle’s energy status by simulating its entire delivery mission without any charging events. This involves using the multi-objective particle swarm optimization to determine the optimal speed profile for each route segment, thereby minimizing a combined cost of energy consumption and time window penalties.
Final SOC Calculation: To classify each vehicle’s energy state, we first calculate its final State of Charge (SOCF) after completing the simulated delivery route without any charging events. This calculation is based on the vehicle’s initial State of Charge (SOCI), its total battery capacity, and the total energy consumed on its assigned path.
First, the total energy consumed, E_consumed (in kW·h), is calculated by summing the energy consumption over all m segments of the vehicle’s route, using our high-fidelity energy surrogate model (as detailed in Equation (57)):
E c o n s u m e d = k = 1 m 7.61 × 10 5 v k 3 + 5.05 × 10 3 v k 2 4.06 × 10 2 v k + 2.35 × 10 1 d k
S O C F = S O C I E consumed C total
  • SOCI: The initial State of Charge of the vehicle (dimensionless, e.g., 0.9 for 90%).
  • SOCF: The final State of Charge of the vehicle after completing the tasks (dimensionless).
  • Econsumed: The total energy consumed throughout the journey, as calculated by Equation (71) (in kW·h).
  • Ctotal: The total rated capacity of the vehicle’s battery (in kW·h).
  • vk: The optimized speed on the k-th route segment (in m/s).
  • dk: The distance of the k-th segment (in meters).
S P = v i v | S O C F ( v i ) > 0 S C = v i v | S O C F ( v i ) 0
V = { v 1 , v 2 , , v N }  is the set of all vehicles in the fleet.
Vehicles in S p are classified as EV-P. The condition S O C F v i > 0  indicates that the vehicle possesses remaining charge that can be utilized for sharing.
Vehicles in S c are classified as EV-C. The condition S O C F v i 0 signifies a depleted or critically low energy state, requiring external energy replenishment.
Following the classification of vehicles into Energy-Providers (EV-Ps) and Energy-Consumers (EV-Cs), the subsequent challenge is to identify the most compatible pairs for energy exchange. The classification serves as a critical prerequisite, transforming the fleet-wide coordination problem into a targeted bipartite matching problem between a ‘supply pool’ (EV-Ps) and a ‘demand pool’ (EV-Cs). The STLCSS algorithm is then employed as the core matching mechanism to quantify the spatiotemporal compatibility between potential provider-consumer pairs, ensuring that only feasible and efficient matches are considered.
Given two trajectories:
T1 = {(x1i,y1i,t1i)} (e.g., a EV-P’s trajectory)
T2 = {(x2j,y2j,t2j)} (e.g., a EV-C’s trajectory)
Matching Conditions:
(1)
PATial Match:
d s i , j = s q r t ( x 1 i x 2 j ) 2 + ( y 1 i y 2 j ) 2 ϵ
The Euclidean distance between corresponding points must not exceed a SPATial threshold ϵ.
(2)
Temporal Match:
d t i , j = t 1 i t 2 j δ
The time difference between corresponding points must not exceed a temporal threshold δ.
(3)
Recurrence Formula:
L i , j = 0 , i = 0   o r   j = 0 L i 1 , j 1 + 1 , d s i , j   ϵ   a n d   d t i , j   δ d s i , j   ϵ   a n d   d t i , j   δ , o t h e r w i s e
The algorithm recursively calculates the length of the longest SPATiotemporally aligned subsequence. If a pair of points meets both SPATial and temporal thresholds, the subsequence length increments; otherwise, it skips mismatched points to continue searching.
Similarity Calculation:
S i m i l a r i t y T 1 , T 2 = L n , m / m i n n , m
The similarity score is normalized by dividing the STLCSS length by the minimum trajectory length, ensuring a value between 0 (no similarity) and 1 (perfect alignment)
  0 L n , m 1
Optimal Supplier Assignment:
For each EV-C CjD:
Compute STLCSS similarity scores with all EV-Ps PiS, forming a score matrixM|S|×|D|.
Assign the EV-P with the highest score to Cj:
s i * = arg max s i S M i , j
where S = {s1, s2, …, s|S|} is a set of EV-Ps, and M(i,j) is the STLCSS score between EV-P Pi and EV-C Cj. This process ensures energy-efficient pairing based on SPATiotemporal trajectory alignment.
To provide a clear and reproducible implementation of this logic, the STLCSS algorithm is detailed in Algorithm 1.
Algorithm 1: Spatio-Temporal Longest Common Subsequence (STLCSS)
1: FUNCTION STLCSS(TrajA, TrajB, delta_d, delta_t):
2: // 1. Initialization
3: m = length(TrajA)
4: n = length(TrajB)
5: // Create an (m + 1) × (n + 1) Dynamic Programming (DP) table.
6: CREATE a 2D array dp of size (m + 1) × (n + 1)
7: INITIALIZE all elements of dp to 0
8: // 2. Fill the DP table
9: FOR i FROM 1 TO m:
10: FOR j FROM 1 TO n:
11: // 2.1 Check for a spatio-temporal match
12: spatial_distance = HaversineDistance(TrajA[i].coordinates, TrajB[j].coordinates)
13: temporal_difference = abs(TrajA[i].timestamp − TrajB[j].timestamp)
14: // 2.2 Apply the recurrence relation
15: IF spatial_distance ≤ delta_d AND temporal_difference ≤ delta_t THEN
16: dp[i][j] = 1 + dp[i − 1][j − 1]
17: ELSE
18: dp[i][j] = max(dp[I − 1][j], dp[i][j − 1])
19: END IF
20: END FOR
21: END FOR
22: // 3. Return Result
23: RETURN dp[m][n]
24: END FUNCTION
Explanation of the Algorithm:
The algorithm utilizes a dynamic programming (DP) approach to efficiently compute the length of the STLCSS between two trajectories, TrajA and TrajB.
  • Initialization: The algorithm begins by initializing an (m + 1) × (n + 1) DP table, named dp, where m and n are the number of points in each trajectory. Each cell dp[i][j] is designed to store the STLCSS length for the subtrajectories TrajA [1..i] and TrajB[1..j]. All values are initialized to zero.
  • Dynamic Programming Recurrence: It then iterates through each pair of points from the two trajectories. For each pair (TrajA[i], TrajB[j]), it checks if they constitute a spatio-temporal match by comparing their spatial distance and temporal difference against predefined thresholds (delta_d and delta_t).
If the points match: They form a common point in the subsequence. The length of the STLCSS is thus incremented by one based on the length of the STLCSS of the preceding sub-trajectories (1 + dp[i − 1][j − 1]).
If the points do not match: No new common point is found. The length of the STLCSS is carried over from the longest subsequence found in the preceding comparisons, which is the maximum of dp[i − 1][j] and dp[i][j − 1].
3.
Final Result: After the table is fully populated, the value in the final cell, dp[m][n], represents the length of the STLCSS for the complete trajectories.

4.3.2. Mutual Assistance Rendezvous Point Optimization

The objective of rendezvous point optimization is to identify an optimal meeting point s* for supply and EV-Cs, enabling efficient SPATiotemporal coordination for energy mutual assistance. This process must satisfy the following criteria:
SPATiotemporal Synchronicity: Minimize the arrival time difference between vehicles at s*.
Energy Transfer Efficiency: Minimize energy loss during transfer while fulfilling the EV-C’s energy deficit.
Route Feasibility: Ensure paths to s* comply with topological constraints (e.g., road network connectivity).
Optimization Objectives:
s * = arg max s C [ α M ( s ) β | t P ( s ) t C ( s ) | γ E t ( s ) ]
M ( s ) = STLCSS ( Traj P , Traj C ) max ( L P , L C )
E t ( s ) = max 0 , E n ( C ) E a ( P )
Maximize the SPATiotemporal matching score (M(s)), while minimizing both arrival time difference (|ts(s) − td(s)|) and energy transfer cost (Etransfer), weighted by coefficients α,β,γ.
Here, LP and LC are the trajectory lengths of EV-Ps and EV-Cs, En is the Energy deficit of the EV-C (kW·h), and Ea is the transferable energy from the EV-P at s* (kW·h).
Optimization Process:
(1)
Candidate Set Construction:
Generate candidate points from Nodes along the EV-C’s remaining route.
C = { s | s Path C } { s | dist ( s , Path C ) δ s }
Topology-based neighboring nodes (e.g., charging stations, intersections) within a SPATial radius δs (e.g., 1 km).
(2)
Trajectory Extraction & Similarity Calculation:
Extract trajectory segments TrajP (EV-P) and TrajC (EV-C) before reaching candidate point s. Compute similarity score M(s) using Equation (3.40).
(3)
Speed Coordination:
Optimize speeds vs (EV-P) and vd (EV-C) to minimize arrival time difference:
min v P , v C d P v C d P v C s . t . v min v P , v C v max
where vP is the speed of the EC-P, vC is the speed of the EV-C, dP is the distance of the EC-P, dC is the distance of the EC-C.
(4)
Energy Conservation Constraint:
Ensure the EV-P’s post-transfer energy suffices to return to the distribution center:
E max ( s ) = SOC P ( s ) Capacity E r ( s )
where SOCP (s*) is State of Charge (energy reserve) of the supply vehicle at meeting point s*, Er(s*) is energy consumed during the return trip from s*, Emax(s*) is Maximum transferable energy from the EV-P (kW·h).
This framework balances energy efficiency, temporal coordination, and route feasibility for robust mutual assistance.

4.3.3. Energy Transfer

Energy Transfer Calculation Model
The energy transfer calculation aims to determine the optimal energy transfer amount Etransfer at the rendezvous point s*, balancing the EV-C’s energy deficit while preventing excessive discharge from the EV-P.
Core Formulas:
(1)
Energy Transfer Amount:
E t = min E n , c , E a ( s * )
where En,c is Energy deficit of the EV-C (kW·h).
(2)
Energy Deficit Calculation:
E n , c = Capacity 1 S O C C ( s * )
where SOCC(s*) is EV-C’s SOC at s*.
(3)
Available Energy Calculation:
E a ( s * ) = Capacity S O C P ( s * ) S O C min
where Capacity is battery capacity (kW·h), SOCC(s*) is remaining charge percentage of the EV-C upon reaching s*, SOCP(s*) is Remaining charge percentage of the EV-P upon reaching s*, SOCmin is minimum safe discharge threshold for the EV-P.
Constraints:
(1)
Energy Conservation:
S O C C new = S O C C ( s * ) + E t Capacity
The energy transfer must comply with energy conservation principles.
(2)
Post-Transfer SOC Requirement:
S O C C new S O C r
Transfer Time and Distance:
(1)
Transfer Time:
T t = E t P e
where Pe is Mutual compensation power (kW).
Calculated based on energy transfer rate.
(2)
Target Distance:
D t = v t T t
Determined by transfer time and vehicle speed.
Energy Consumption Model for Mobility:
This model calculates energy consumption (Econsumed) for a vehicle traversing a route segment, linking speed (v), distance (d), and energy usage.
Core Formula:
E c = E 100 d
Energy consumption is proportional to distance traveled.
Post-Segment SOC Update:
S O C F = S O C I E c Capacity 100 %
The remaining SOC after completing a route segment is dynamically updated based on distance and speed.

4.3.4. Post-Transfer Speed-Route Optimization

Remaining Route Planning
After energy transfer, the EV-C must complete its remaining delivery tasks. This requires:
Calculating the optimal route from the rendezvous point s* to the destination.
Ensuring the updated SOC (SOCF) supports the remaining distance.
Satisfying time window constraints at all delivery points.
Optimization Objective:
Path remaining = arg min Path k = 1 K E c , k + k = 1 K P p , k
Minimize total costs, including energy consumption (Ec,k) and time window penalties (Pp,k) for each route segment k.
Dynamic SOC Update Model
The SOC is dynamically updated for each segment of the remaining route based on distance, speed, and energy consumption:
Update Rule:
SOC j = SOC i E 100 d i j Capacity 100 %
The SOC at the next node (SOCj) depends on the current SOC (SOCi), speed (vij), and distance (dij) between nodes i and j.
Time Window Constraints and Penalty Handling
SOC Lower Bound Constraint:
SOC j SOC r , j j Path
Ensure SOCSOCrequired throughout the remaining route. If violated, adjust routes or speeds.
Time Window Penalty:
Penalty j = P e max ( 0 , t e , j t a , j ) + P l max ( 0 , t a , j t l , j )
Penalize deviations from delivery time windows using a multi-objective ant colony algorithm to optimize speeds.

5. Simulation Analysis

To validate the effectiveness and economic viability of the proposed energy-efficient V2V sharing framework for electric delivery vehicles, incorporating SPATio-temporal matching and path-speed co-optimization, a series of simulation experiments were designed. The experimental environment is based on an actual delivery network of a logistics company, employing light-duty electric vans as the subject vehicles. This section first details the experimental parameter settings, including network topology, node information, vehicle specifications, and cost components. Subsequently, the energy predicament under the original diSPATch plan is presented. Then, the superiority of multi-objective speed planning is analyzed. Finally, a comparative analysis is conducted, focusing on the operational costs, energy efficiency, and time-window management of the proposed V2V mutual-aid mode versus the conventional station-based charging mode.

5.1. Case Study of V2V Energy Sharing

The simulation scenario for this study is set within the delivery area of a logistics company. A total of 26 critical nodes are established, comprising 1 distribution center (Node 26, serving as the origin and destination for all vehicles), 20 customer nodes (Nodes 1–20; assuming task point 21 from original Table 2 is re-indexed or the count is adjusted to 20 distinct task points for consistency), and 5 charging stations (CS, Nodes 22–25) incorporated to facilitate comparison between different replenishment strategies. Detailed coordinates and designated delivery time windows for task points are provided in Table 2, while coordinates and charging tariffs for stations and the depot are listed in Table 3.
Acknowledging the complexity of urban road networks, a sparse network model was constructed where each node is connected to its four nearest neighbors.
The vehicle model is based on light-duty electric vans, with key parameters (e.g., gross vehicle mass, battery capacity, maximum/minimum speed, charging/discharging power) itemized in Table 4. Operational cost parameters, including daily vehicle usage cost, time consumption cost coefficient, and penalties for early/late arrivals at time windows, are summarized in Table 4. The critical vehicle model is based on light-duty electric vans, each with a battery capacity of 80 kWh. Key parameters, including gross vehicle mass, initial SOC distribution, operational thresholds, and cost components, are itemized in Table 4. For this study, our primary focus was on the logistical optimization of matching and routing. Therefore, to simplify the energy transfer model, the efficiency of both station-based charging and V2V energy transfer was assumed to be 97%. The critical parameters for the Multi-Objective Particle Swarm Optimization (MOPSO) algorithm employed in this study are detailed in Table 5
Detailed Parameter Description:
EV Battery Capacity: Each electric van in the fleet is modeled with a battery capacity of 80 kWh, which is representative of common light-duty electric logistics vehicles.
Delivery Demand Distribution: The demand is modeled as a fixed set of predefined mandatory customer nodes for each of the 9 vehicles, creating a deterministic and repeatable daily task assignment. The specific route assignments for each vehicle are detailed in Table 6 below. This setup ensures a realistic scenario where different vehicles have varying route lengths and energy requirements.

5.2. The Advantages of Multi-Objective Speed Planning

Figure 7 illustrates the baseline operational route for a fleet of electric vans, which was establishd prior to the implementation of V2V (vehicle-to-vehicle) energy sharing. To demonstrate the enhanced performance of multi-objective speed planning, this section provides a case study focused on vehicle 1, which was assigned to the delivery tasks at points 13 and 21. A comparative analysis was carried out to compare the energy consumption, time window penalty cost, and time consumption cost generated by two different algorithm methods: (i) the single-objective particle swarm (PSO) algorithm, which specifically prioritizes time window compliance, and (ii) the proposed multi-objective PSO, which optimizes both time window compliance and energy saving. The optimized travel path and corresponding speed profile for Vehicle 1, derived from the multi-objective strategy, are depicted in Figure 8. This visualization demonstrates that the multi-objective planning, by strategically modulating the vehicle’s speed, effectively achieves a balance between minimizing time window associated costs and reducing energy expenditure.
Further extending this analysis to the fleet level, the application of multi-objective speed planning yielded remarkable results. As shown in Figure 9, the speed optimization of the entire fleet has little impact on the penalty cost of the time window, and the time consumption cost is increased by ¥30.9. However, this was largely offset by a significant reduction in energy costs, which cost ¥79.7 per trip. As a result, the integrated approach resulted in a net reduction of ¥37.3 in overall operating costs per trip per diSPATch cycle, translating into an overall optimization efficiency of 7.7%.

5.3. Comparative Analysis of Energy Replenishment Strategies

This section compares the performance of two energy replenishment modes (charging station-based replenishment and V2V energy sharing) through simulation experiments, focusing on three dimensions: cost efficiency, time window compliance rate, and redundant energy utilization.

5.3.1. SPATio-Temporal Matching for V2V

Following the energy state classification (as per Section 4.1), vehicles were designated as either energy-providing (EV-P) or energy-consuming (EV-C). Specifically, vehicles #2, #6, #8, and #9 were identified as EV-Cs, with the remainder classified as EV-Ps. The SPATio-Temporal Trajectory Local Similarity Search (STLCSS) algorithm was employed to identify potential V2V exchange opportunities by analyzing the SPATio-temporal trajectories of EV-Ps and EV-Cs. The resulting STLCSS matching score matrix is presented in Table 7. The similarity scores predominantly ranged between 30 and 50, indicating the algorithm’s efficacy in capturing latent SPATio-temporal interaction opportunities between potential donor and recipient vehicles.

5.3.2. Scenario Illustration: V2CS vs. V2V Replenishment

In the V2CS mode, energy-deficient EVs are diSPATched to the nearest feasible charging station. For instance, consider the pair EV-P7 and EV-C6 (where EV-C6 requires charging). The operational trajectory for EV-C6, including a detour to charging station CS24 for replenishment, is depicted in Figure 10. Although EV-C6 completed its deliveries post-recharge, the time incurred for accessing and utilizing the charging station led to significant deviations from its designated service time windows.
Conversely, the dynamic V2V energy exchange mechanism, illustrated with the donor–recipient pair (EV-P7 → EV-C6) in Figure 11, facilitates direct energy transfer at optimized rendezvous points along their routes (e.g., between nodes S8 and S2). In this instance, EV-P1 transferred 19.76 kW·h to EV-C6, enabling EV-C6 to continue its mission without detouring to a charging station. The comprehensive V2V fleet-wide route plan, integrating such dynamic exchanges, is presented in Figure 12.

5.3.3. Performance Evaluation and Discussion

The V2V energy exchange strategy demonstrated marked superiority over the V2CS mode in terms of cost control and efficient energy utilization. To provide a comprehensive and fair comparison, our analysis was updated to include battery aging (BA) costs, as detailed in Section 4.1.4. Figure 13 illustrates the changes in individual vehicle operational costs, while the revised Figure 14 provides a comparative summary of total fleet operational costs, now including the BA component, under both replenishment paradigms.
Our analysis reveals a key trade-off. As shown in Figure 14, the V2V strategy incurs a slightly higher total battery aging cost (25.8 ¥) compared to the V2CS strategy (23.2 ¥). This result is expected and validates our model. The increase is primarily due to the “extra cycle” inherent in V2V sharing: energy is first discharged from the provider vehicle (EV-P) and then charged into the consumer vehicle (EV-C). This double-handling of energy naturally leads to slightly higher cumulative battery wear across the fleet.
However, and most importantly, this modest increase in battery aging cost (2.6 ¥) is overwhelmingly offset by the substantial savings in other operational areas. The V2V strategy achieves significant reductions in:
Time Window Penalty (TWP) costs: A reduction of 12.5% (from 142.9 ¥ to 125.0 ¥).
Energy Consumption (EC) costs: A reduction of 17.4% (from 141.7 ¥ to 117.0 ¥).
Time Costs (TC): A reduction of 11.6% (from 165.1 ¥ to 145.9 ¥).
When all cost components are aggregated, the net effect is a 12.5% reduction in overall operating costs compared to the V2CS model. These findings highlight the robust economic benefits of internalizing the remaining energy cycle within the fleet, demonstrating that the logistical efficiencies gained far outweigh the marginal increase in long-term battery degradation.
The V2V exchange model excels at utilizing the fleet’s remaining energy capacity. Figure 15 depicts the energy transfer relationship established in this mode. In stark contrast, the V2CS approach relies entirely on external grid power, leaving a large portion (typically 50–70%) of the remaining SOC that is not utilized in the vehicles that provide energy (e.g., EV-P1, EV-P4, EV-P5, EV-P7). The energy transmission network further shows that 66.2 kW·h of redundant energy is used. As shown in Figure 16, the V2V energy sharing model achieves a 24.7 kW·h reduction in total energy consumption per fleet mission (17.4 creases) and a 21.4% increase in energy efficiency (EU) for EV-PS, demonstrating that V2V not only improves the energy efficiency of the entire fleet, but also significantly reduces operational energy consumption.

5.4. Sensitivity Analysis

To address the potential impact of real-world uncertainties and enhance the confidence in our findings, we conducted a sensitivity analysis to evaluate the robustness of our proposed V2V framework. The original results were point estimates based on our high-fidelity energy consumption surrogate model. However, the accuracy of this model can be influenced by unmodeled factors such as payload variations, changing road conditions, or tire pressure. This analysis, therefore, assesses the stability of our framework’s performance against systematic perturbations in the core energy model.
Methodology:
We performed four sets of simulation runs, each comparing our proposed V2V sharing mode against the conventional charging station (V2CS) mode:
Baseline: Using the original energy consumption model.
Perturbation 1: Increasing the energy consumption of all vehicles by 1%.
Perturbation 2: Increasing the energy consumption of all vehicles by 2%.
Perturbation 3: Increasing the energy consumption of all vehicles by 3%.
For each run, we calculated the total operational costs for both modes and the resulting percentage cost reduction achieved by our V2V framework. The results of this analysis are summarized in Table 8.
The analysis clearly demonstrates the robustness of our proposed framework. Despite introducing up to a 3% uncertainty in the energy consumption model, the performance advantage of the V2V sharing strategy remains highly consistent. The percentage reduction in total operational cost stayed within a very narrow range (11.76% to 11.99%), with a mean reduction of 11.86%. The extremely low variance across all scenarios (e.g., 0.13 (¥2) for cost reduction and 0.009 (%)2 for the percentage) quantitatively confirms this stability. This indicates that the benefits of our framework—such as avoiding long detours, minimizing time window penalties, and utilizing surplus fleet energy—are fundamental structural advantages and are not overly sensitive to minor inaccuracies in model parameters. This analysis provides a stronger, statistically supported validation of our conclusions.

5.5. Practical Implications and Scalability

Our proposed framework has significant practical implications for various stakeholders.
For Logistics Providers: The primary benefit is direct operational cost reduction, as demonstrated by the 13.8% decrease in our case study. This stems from reduced energy expenses and lower time-window penalties. Furthermore, by internalizing energy replenishment within the fleet, companies can reduce their reliance on public charging infrastructure, which minimizes vehicle downtime spent queuing at stations. This increases asset utilization and enhances the fleet’s operational resilience, especially in areas with sparse charging coverage. Fleet managers can integrate this framework into their existing dispatch systems as an intelligent energy management module.
For City Planners and Grid Operators: The V2V energy sharing model promotes a decentralized energy ecosystem. By reducing the fleet’s collective demand on fixed charging stations during peak hours, it helps alleviate strain on the local power grid. This can defer the need for costly grid infrastructure upgrades. Moreover, the integration of SPAT information for speed optimization contributes to smoother traffic flow and reduced congestion at intersections, aligning with smart city initiatives for efficient and green urban mobility.
Scalability: The scalability of our hierarchical approach is a key consideration. The low-level MPC-based speed control is inherently scalable as it operates on individual vehicles. The high-level optimization, involving MOPSO and STLCSS, is more computationally intensive. For small-to-medium fleets (e.g., up to 50–100 vehicles), the proposed centralized optimization is feasible with modern computing power. For larger fleets, a decentralized or clustered approach would be necessary to manage computational complexity. Future work could explore dividing the fleet into regional clusters, with intra-cluster V2V optimization and inter-cluster coordination, or employing distributed machine learning algorithms like Multi-Agent Reinforcement Learning (MARL) for fully decentralized decision-making.

6. Conclusions

This paper successfully developed and validated a hierarchical optimization framework for V2V energy sharing that addresses critical sustainability challenges in electric delivery fleets. By tightly coupling macroscopic path planning with microscopic speed control, our approach delivers quantifiable benefits across the environmental, economic, and operational pillars of sustainability. The key innovation lies in the tight coupling of macroscopic path planning with microscopic speed control. This was achieved through a multi-stage process: constructing a high-fidelity energy surrogate model via MPC and SPAT data, employing the STLCSS algorithm for precise SPATiotemporal matching of donor and recipient vehicles, and utilizing a MOPSO algorithm for the integrated optimization of rendezvous routes and speed profiles.
Simulation results, based on a realistic logistics network, quantitatively confirmed the framework’s superiority. Compared to traditional station-based charging, our V2V approach demonstrated robust performance, reducing total operational costs by an average of 11.9% (with low variance against model perturbations) and energy consumption by 17.4%. Furthermore, a sensitivity analysis confirmed that these benefits are stable and not overly dependent on the precise accuracy of the energy model, highlighting the structural advantages of the proposed framework. Crucially, the framework unlocked the fleet’s latent energy potential, and the utilization of surplus energy from provider vehicles increased by 21.4%. This not only offers a direct economic benefit but also contributes to a more resilient and sustainable logistics ecosystem. By bridging the gap between detailed vehicle dynamics and fleet-level coordination, this work presents an effective and economical strategy for the advanced energy management of modern electric fleets.
For future work, several avenues warrant exploration to facilitate practical deployment. First, conducting a real-world pilot study with a logistics partner would be essential to validate the model’s performance under unpredictable real-world conditions, including communication delays and unexpected traffic events. Second, the framework relies heavily on reliable Vehicle-to-Infrastructure (V2I) and Vehicle-to-Vehicle (V2V) communication. Future work should focus on developing secure, low-latency communication protocols tailored for V2V energy-sharing transactions, ensuring data integrity and protecting against cyber threats. Finally, developing scalable, decentralized optimization algorithms, such as those based on multi-agent reinforcement learning, will be critical for applying this framework to very large-scale fleets, ensuring real-time performance without prohibitive computational overhead.

Author Contributions

Conceptualization, Z.Y. and K.H.; software, Z.Y., Q.H. and H.Y.; software, Q.H.; formal analysis, Y.W.; writing—original draft preparation, Z.Y.; writing—review and editing, J.D. and C.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by [National Defense Science and Technology Key Laboratory Fund] grant number [2024-CXPT-GF-JJ-091-03-01].

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author, [Y.W.], upon reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Nomenclature

Table A1. Nomenclature.
Table A1. Nomenclature.
AFrontal area of the vehicle (m2)taArrival time (h)
akCommanded accelerationTtTransfer Time (h)
a(k) Acceleration control input at the k-th time stepuSystem control variables
acmdCommanded accelerationUThe terminal voltage of the battery (V)
arealActual accelerationU1The voltage across the polarization resistance R1 (V)
akActual acceleration at step kUmThe motor input voltage (V)
CThe rated capacity of the battery (Ah)UbRated voltage (V)
CdAerodynamic drag coefficientUocPower supply voltage (V)
CCCharging cost (¥)ukControl vector
CPTime window penalty cost (¥)vkSpeed on the k-th route segment (km/h)
CjSTLCSS of EV-CvThe vehicle speed (km/h)
CrRolling resistance coefficientvPSpeed of the EC-P (km/h)
CP,kTime window penalty cost (¥) for the k-th vehiclevCSpeed of the EV-C (km/h)
CtThe cost of time (¥) incurred by all vehicles to drive, recharge, and perform delivery tasks.VmaxMaximum speed of electric vans (km/h)
c1Learning factorVminMinimum speed for electric vans (km/h)
c2Learning factorvitargetTarget speed of the i-th vehicle
CagingThe summation of costs from every operational segmentvihbUpper bounds of the target speed range
Cs(v,s) The aging cost for a single vehicle vv during a specific operational segment ssvlegal(s)Speed limit function for the road segment
DtMutual aid distance (km)vmaxSpeed limit of the road or the maximum allowable speed of the vehicle
dPDistance of the EC-P (km)vego(k)Speed of the ego vehicle at step k
dCDistance of the EC-C (km)v(k)Vehicle’s speed at the k-th time step within the prediction horizon
dkDistance of the k-th segment (km)v(t)Velocity of the particle at time t
dsafeStatic safety distancevilbLower bounds of the target speed range
diaDistance from the i-th vehicle to the upstream traffic signal a v mid Preset moderate speed value
EEnergy consumption proxy model w mid Weight corresponding to the moderate speed
EI,kInitial energy of the k-th vehicle (kW·h)xkState vector
EC,kEnergy replenished at CS (kW·h) x · System state variables
EF,kFinal remaining energy (kW·h)xi,kBinary decision variable (vehicle i selects path k or not)
Ei,kEnergy consumption coefficient for vehicle i on path k (kW·h)x(t)Position of the particle at time t
En,cEnergy deficit of the EV-C (kW·h)ACOAnt colony optimization
Emax(s*)Maximum transferable energy from the EV-P (kW·h)BMSBattery Management System
Ea(s*)Available energy transfer for EV-P (kW·h)BCBOABranch Cutting External Approximation Algorithm
EtEnergy transfer (kW·h)BEVBattery Electric Vehicles
Er(s*)Energy consumed during the return trip from s* (kW·h)BABattery aging
Ec,kEnergy consumption for each route segment k (kW·h)CSCharging Stations
EcEnergy consumption for each vehicle (kW·h)CHTC-LTChina Heavy-duty vehicle test Cycle-Light trucks
E t o t a l Total energy consumptionDTWDynamic time warping
EcycledThe absolute amount of energy (kW·h)ETAEstimated time of arrival
E c , k Energy replenished at charging stations (kW·h)EVElectric vehicles
Fresist,kTotal resistance forceEV-PElectric vehicle-provider
FaeroAerodynamic resistanceEV-CElectric vehicle-consumer
FrollRolling resistanceECEnergy consumption
FtractionTraction forceHTWHard time window
gGravitational accelerationILPInteger Linear Programming
gbestHistorical optimal position of the populationLSTMLong Short-Term Memory
green _ start i Start times of the green lightLDMLongitudinal dynamics model
green _ end i End times of the green lightMOACOMulti-objective ant colony optimization
ImThe motor working current (A)MINLPMixed-Integer Nonlinear Programming
IThe battery output current (A)MADDPGMulti-Agent Deep Deterministic Policy Gradient
iter_maxMaximum number of iterationsMOPSOMulti-objective Particle Swarm Optimization
iWeighting coefficientMPCModel Predictive Control
iNumber of the particleOCVOpen-circuit voltage
KCycle counter used to synchronize the signal timings of multiple intersectionsPSOParticle Swarm Optimization
KwAn integer describing the cycle count of the traffic signalPoRProof of Reputation
L(i,j)STLCSS length of the point (i,j)PMSMPermanent magnet synchronous motor
L(n,m) Final STLCSS lengthREEVRange-Extended Electric Vehicles
LPTrajectory lengths of EV-PsSPATSignal Phase and Timing
LCTrajectory lengths of EV-CsSOCState of charge
L l o o k a h e a d Lookahead distanceSTWSoft time window
LkTotal cost (energy + penalty) of ant k’s path (kW·h)STLCSSSpatiotemporal Longest Common Subsequence
m1Gross vehicle weight (kg)SISignal Info
maNumber of AntsTWPTime window penalties
mVehicle massTWTime window
nMotor speed corresponding to the actual vehicle speed (r/min)V2IVehicle-to-Infrastructure
nmActual rotational speed of the motor (r/min)VAA/VSAHierarchical scheduling algorithm
PmTotal power demand of the vehicleVCUVehicle Control Unit
PrThe power requestV2CSVehicle-to-Charging Station
PePenalties for early arrivals (kW·h)V2VVehicle-to-Vehicle
PeMutual compensation power (kW)VRPVehicle routing problem
PlPenalties for late arrivals (kW·h)VVRV2V recharge
PiSTLCSS score of EV-PGT-SuiteAssumed to be a known simulation software
Pp,kTime window penalties for each route segment kfThe rolling resistance coefficient
PcCharging station charging power (kW)ηmThe motor efficiency
pbest,tHistorical optimal position of particle iηTTransmission system efficiency (%)
PkInstantaneous powerαRoad gradient (slope angle)
qEnergy supply/demand quantityϕ1Start latitudes (radians)
QConstant scaling factorϕ2End latitudes (radians)
R0The ohmic internal resistance of the battery (Ω)λ1Start longitudes (radians)
R1Resistance of component 1 (Ω)λ2End longitudes (radians)
REarth’s radiusτij(v)Pheromone concentration for speed v on path segment (i,j).
r1Random number of [0, 1]ηij(v):Heuristic factor inversely related to the optimization objective.
r2Random number of [0, 1]α1Hyperparameters controlling the weights of pheromones and heuristic factors
sego(k)Position of the ego vehicle at step kαsocThe dimensionless SOC Stress Factor
slead(k)Position of the lead vehicle at step kβThe base degradation cost factor
SOCP(s*)EV-P’s SOC at meeting point s* (%)β1Hyperparameters controlling the weights of pheromones and heuristic factors
sVehicle positionρEvaporation coefficient controlling pheromone decay
SOCIInitial SOC (%)ϵThe SPATial distance threshold
SOCFFinal SOC (%)δThe temporal difference threshold
SOCminMinimum safe discharge threshold for the EV-P (%)δsSPATial radius (km)
SOCrPost-Transfer SOC Requirement (%)ξThe unit time cost coefficient (¥/h)
s*Optimal meeting pointρAir density
tl _ pos i Position of the iii-th traffic lightΔtTime step duration (seconds)
tcurrentCurrent time of the vehicleηregenRegenerative braking efficiency
TkTotal time (¥) consumed by the kth carμPenalty weight used to dynamically adjust the vehicle speed in order to synchronize with the traffic signal timing
tExecution progress of the algorithmϵRelaxation variable
tTimestamp η i ,   k Predicted arrival time of the vehicle
TmaxOverall threshold for the number of steps executed by the algorithmαiSubjective adjustment factor for the priority of the objective
tcDuration of one full signal cycleσiStandard deviation of the historical data for the corresponding cost term
tgDurations of the green lightsϕ1Start latitudes
trDurations of the red lightsλ1Start longitudes
tTotal travel time of the vehicleϕ2End latitudes
tgreenGreen light windowλ2End longitudes
tred_waitRed light waiting timeωmaxWeight benchmark value at the initial iteration (t = 0)
tredRed light windowω(t)Dynamic weight value at the iteration step t
tcSignal cycleγElectricity price (¥/kW·h)
tgreen_remainRemaining green light timeωInertia weight
treactiontDriver’s reaction timeωminLower limit of the weight when the algorithm is terminated (t = Tmax)
TmOutput torque of the drive motor (N·m) ϵ s SPATial distance threshold
tlLatest allowed arrival time (h)δTemporal difference threshold
teEarliest allowed arrival time (h)

References

  1. Shurrab, M.; Singh, S.; Otrok, H.; Mizouni, R.; Khadkikar, V.; Zeineldin, H. A Stable Matching Game for V2V Energy Sharing–A User Satisfaction Framework. IEEE Trans. Intell. Transp. Syst. 2022, 23, 7601–7613. [Google Scholar] [CrossRef]
  2. Fan, J.; Wang, H.; Liebman, A. MARL for Decentralized Electric Vehicle Charging Coordination with V2V Energy Exchange. In Proceedings of the 49th Annual Conference of the IEEE Industrial Electronics Society (IECON 2023), Singapore, 16–19 October 2023; pp. 1–6. [Google Scholar]
  3. Alinejad, M.; Rezaei, O.; Habibifar, R.; Azimian, M. A Charge/Discharge Plan for Electric Vehicles in an Intelligent Parking Lot Considering Destructive Random Decisions, and V2G and V2V Energy Transfer Modes. Sustainability 2022, 14, 12816. [Google Scholar] [CrossRef]
  4. Xu, Y.; Peralta, A.A.; Balta-Ozkan, N. Vehicle-to-Vehicle Energy Trading Framework: A Systematic Literature Review. Sustainability 2024, 16, 5020. [Google Scholar] [CrossRef]
  5. Hosseini, S.; Yassine, A. A Novel V2V Charging Scheme to Optimize Cost and Alleviate Range Anxiety. In Proceedings of the 2022 IEEE Electrical Power and Energy Conference (EPEC), Online, 5–7 December 2022; pp. 354–359. [Google Scholar]
  6. Zhang, R.; Cheng, X.; Yang, L. Flexible Energy Management Protocol for Cooperative EV-to-EV Charging. IEEE Trans. Intell. Transp. Syst. 2019, 20, 172–184. [Google Scholar] [CrossRef]
  7. Wang, M.; Ismail, M.; Zhang, R.; Shen, X.; Serpedin, E.; Qaraqe, K. Spatio-Temporal Coordinated V2V Energy Swapping Strategy for Mobile PEVs. IEEE Trans. Smart Grid 2018, 9, 1566–1579. [Google Scholar] [CrossRef]
  8. Liu, S.; Zhang, W.; Cao, Y.; Ni, Q.; Maple, C.; Lin, H. Time-Efficient EV Energy Management Through In-Motion V2V Charging. In Proceedings of the 2024 IEEE 99th Vehicular Technology Conference (VTC2024-Spring), Singapore, 24–27 June 2024; pp. 1–5. [Google Scholar]
  9. Yang, J.; Cai, B.; Li, X.; Ge, R. Optimal Path Planning for Electric Vehicle Travel Time Based on Dijkstra. In Proceedings of the 35th Chinese Control and Decision Conference (CCDC 2023), Yichang, Chian, 20–22 May 2023; pp. 721–726. [Google Scholar]
  10. Sari, D.W.; Dwijayanti, S.; Suprapto, B.Y. Path Planning for an Autonomous Vehicle based on the Ant Colony Algorithm: A Review. In Proceedings of the 2023 International Workshop on Artificial Intelligence and Image Processing (IWAIIP 2023), Yogyakarta, Indonesia, 1–2 December 2023; pp. 57–62. [Google Scholar]
  11. Liu, H.; Chen, P.; Chen, J.; Tan, D. Research on Global Path Planning for Driverless Vehicles Based on Improved A* Algorithm. In Proceedings of the 2023 IEEE 5th International Conference on Civil Aviation Safety and Information Technology (ICCASIT), Dali, China, 11–13 October 2023; pp. 1444–1449. [Google Scholar]
  12. Li, G.; Sun, Q.; Boukhatem, L.; Wu, J.; Yang, J. Intelligent Vehicle-to-Vehicle Charging Navigation for Mobile Electric Vehicles via VANET-Based Communication. IEEE Access 2019, 7, 170888–170906. [Google Scholar] [CrossRef]
  13. Yang, H.; Deng, Y.; Qiu, J.; Li, M.; Lai, M.; Dong, Z. Electric Vehicle Route Selection and Charging Navigation Strategy Based on Crowd Sensing. IEEE Trans. Ind. Inform. 2017, 13, 2214–2226. [Google Scholar] [CrossRef]
  14. Wu, F.; Adulyasak, Y.; Cordeau, J.F. Modeling and Solving the Traveling Salesman Problem with Speed Optimization for a Plug-In Hybrid Electric Vehicle. Transp. Sci. 2024, 58, 562–577. [Google Scholar] [CrossRef]
  15. Merkuryeva, G.; Bolshakovs, V. Simulation-Based Vehicle Scheduling with Time Windows. In Proceedings of the 2010 International Conference on Intelligent Systems, Modelling and Simulation, Liverpool, UK, 27–29 January 2010; pp. 134–139. [Google Scholar]
  16. Rashmi, D.; Sivasubramani, S. An Efficient Scheduling Scheme for Plug-In Electric Vehicles. In Proceedings of the 2023 IEEE IAS Global Conference on Renewable Energy and Hydrogen Technologies (GlobConHT 2023), Male, Maldive, 11–12 March 2023; pp. 1–6. [Google Scholar]
  17. Gupta, H.K.; Shah, M.K.; Chopra, K. Coordinate Scheduling of Plug-in Electric Vehicles at Charging Station Fed by Renewable Energy Sources. In Proceedings of the 2023 5th International Conference on Energy, Power and Environment: Towards Flexible Green Energy Technologies (ICEPE 2023), Shillong, India, 15–17 June 2023; pp. 1–6. [Google Scholar]
  18. Abdolmaleki, M.; Masoud, N.; Yin, Y. Vehicle-to-vehicle wireless power transfer: Paving the way toward an electrified transportation system. Transp. Res. Part C Emerg. Technol. 2019, 103, 261–280. [Google Scholar] [CrossRef]
  19. Kabir, M.E.; Sorkhoh, I.; Moussa, B.; Assi, C. Routing and Scheduling of Mobile EV Chargers for Vehicle to Vehicle (V2V) Energy Transfer. In Proceedings of the 2020 IEEE Power & Energy Society General Meeting (PESGM 2020), Montreal, QC, Canada, 2–6 August 2020; pp. 1–5. [Google Scholar]
  20. ISO 2631-1:1997; Mechanical Vibration and Shock—Evaluation of Human Exposure to Whole Body vi-Bration—Part 1: General Requirements. International Organization for Standardization: Geneva, Switzerland, 1997.
  21. Schmalstieg, J.; Käbitz, S.; Ecker, M.; Sauer, D.U. A holistic aging model for Li(NiMnCo)O2 based 18650 lithium-ion batteries. J. Power Sources 2014, 257, 325–334. [Google Scholar] [CrossRef]
Figure 1. Hierarchical optimization framework for V2V energy sharing in delivery fleets.
Figure 1. Hierarchical optimization framework for V2V energy sharing in delivery fleets.
Sustainability 17 08783 g001
Figure 2. The overall logical structure of this paper.
Figure 2. The overall logical structure of this paper.
Sustainability 17 08783 g002
Figure 3. Illustration of Node, Edge, and Arc.
Figure 3. Illustration of Node, Edge, and Arc.
Sustainability 17 08783 g003
Figure 4. Schematic of Target Speed Calculation.
Figure 4. Schematic of Target Speed Calculation.
Sustainability 17 08783 g004
Figure 5. Real-world Digital Road Network Map.
Figure 5. Real-world Digital Road Network Map.
Sustainability 17 08783 g005
Figure 6. Average Speed vs. Energy Consumption per 100 km Curve.
Figure 6. Average Speed vs. Energy Consumption per 100 km Curve.
Sustainability 17 08783 g006
Figure 7. Baseline routes before V2V.
Figure 7. Baseline routes before V2V.
Sustainability 17 08783 g007
Figure 8. Comparison of Speed and Time Window for Vehicle 1.
Figure 8. Comparison of Speed and Time Window for Vehicle 1.
Sustainability 17 08783 g008
Figure 9. Comparison of Costs Before and After Multi-Objective Optimization without BA totail costs.
Figure 9. Comparison of Costs Before and After Multi-Objective Optimization without BA totail costs.
Sustainability 17 08783 g009
Figure 10. Caption: Path planning for a paired vehicle (EV-C2) under the V2CS replenishment mode, including a detour to CS23.
Figure 10. Caption: Path planning for a paired vehicle (EV-C2) under the V2CS replenishment mode, including a detour to CS23.
Sustainability 17 08783 g010
Figure 11. Path planning for the paired vehicles (EV-P7 and EV-C6) under the V2V energy exchange mode.
Figure 11. Path planning for the paired vehicles (EV-P7 and EV-C6) under the V2V energy exchange mode.
Sustainability 17 08783 g011
Figure 12. Overall V2V fleet route planning incorporating dynamic energy exchanges.
Figure 12. Overall V2V fleet route planning incorporating dynamic energy exchanges.
Sustainability 17 08783 g012
Figure 13. Variation in individual vehicle operational costs under different replenishment modes.
Figure 13. Variation in individual vehicle operational costs under different replenishment modes.
Sustainability 17 08783 g013
Figure 14. Comparison of Costs Before and After Multi-Objective Optimization.
Figure 14. Comparison of Costs Before and After Multi-Objective Optimization.
Sustainability 17 08783 g014
Figure 15. Energy transfer relationships among vehicles in the V2V energy exchange mode.
Figure 15. Energy transfer relationships among vehicles in the V2V energy exchange mode.
Sustainability 17 08783 g015
Figure 16. Comparison of Total EC and EU under V2CS vs. V2V Modes.
Figure 16. Comparison of Total EC and EU under V2CS vs. V2V Modes.
Sustainability 17 08783 g016
Table 1. Summary of this study in comparison with related works.
Table 1. Summary of this study in comparison with related works.
Ref.ScenarioTime Window (TW)Optimization AlgorithmMulti-ObjectiveEnergy ModelSignal Info (SI)Spatio-Temporal MatchingLimitations
[1]Areas without charging stations×Gale-Shapley Stable MatchCost, profit, user satisfactionEconomic model×TW and lDM are not taken into account
[2]Charging station energy exchange×MADDPGCost/equity×××TW and lDM are not taken into account
[5]Highway/city charging stations×Hungarian AlgorithmCostLinear model××TW and lDM are not taken into account
[6]EV systems in IoE environment×Max-Weight Matching, Stable MatchingSocial welfare, grid loadLinear model××TW and lDM are not taken into account
[7]Suburban aggregation stationMINP, BCBOACost/load balancing×lDM are not taken into account
[8]Wireless V2V charging in traffic×DTW, Spatio-temporal path planningCharging efficiencyLinear model×TW and lDM are not taken into account
[13]Urban dynamic road network×Branch and CutTravel time, costLinear model××TW and lDM are not taken into account
[14]Urban logistics distributionBranch-and-CutEnergy costLongitudinal dynamics model (LDM)××SI are not taken into account
[15]Truck distribution scheduling in the logistics centerOptQuestThe total idle time of the truck×××lDM are not taken into account
[17]Wind–solar charging stationsVehicle Admission/Scheduling AlgorithmGrid dependency×××lDM are not taken into account
This paperUrban logistics energy mutual assistanceMOPSO, STLCSSTime window, energy consumptionLongitudinal dynamics modelConsider TW and SI and LMD with spatiotemporal matching
Table 2. Task Point Coordinates and Related Information.
Table 2. Task Point Coordinates and Related Information.
Node TypeNode Coordinates (°)Time Window (h)
Task point116.321, 39.9598.0, 8.5
116.365, 39.9617.0, 7.5
116.357, 39.9818.0, 8.5
116.328, 40.0029.0, 9.5
116.223, 39.9919.0, 9.5
116.461, 39.9097.0, 8.0
116.447, 39.9377.5, 8.0
116.403, 39.9347.0, 7.5
116.519, 39.9248.0, 8.5
116.548, 39.9629.0, 9.5
116.385, 39.9077.5, 8.0
116.331, 39.9038.0, 8.5
116.423, 39.9167.0, 7.5
116.422, 39.8748.0, 8.5
116.408, 39.9047.0, 7.5
116.495, 39.8798.0, 9.5
116.417, 39.8928.5, 9.0
116.354, 39.8707.0, 7.5
116.302, 39.8959.0, 9.5
116.374, 39.8998.0, 8.5
116.278, 39.9049.0, 9.5
Table 3. Charging Station and Depot Coordinates and Related Information.
Table 3. Charging Station and Depot Coordinates and Related Information.
Node TypeNode Sequence NumberNode Coordinates (°)Charging Price (¥)
CS22116.270, 39.9601.5
23116.350, 39.9201.5
24116.450, 39.9201.5
25116.550, 39.9001.5
Distribution center26116.472, 39.992
Table 4. Vehicle-related Parameters and Cost Parameters.
Table 4. Vehicle-related Parameters and Cost Parameters.
Parameter SymbolParameterDataUnit
×Fleet Size9Vehicles
m1Gross vehicle weight4495kg
CRated capacity of the battery200Ah
UbatteryRated voltage400V
×EV Battery Capacity80kWh
×Initial SOC Distribution[80.8, 19.6, 92.1, 91.0, 75.1, 14.4, 90.0, 18.3, 12.8]%
VmaxMaximum speed of electric vans60km/h
VminMinimum speed for electric vans10km/h
PchargeCharging station charging power50kW
PexchangeV2V transfer power30kW
×Charging/Transfer Efficiency97%
×SOC Threshold (for CS decision)10.5%
×SOC Threshold (for V2V decision)10.0%
ξThe unit time cost coefficient5¥/h
PePenalties for early arrivals5kW·h/h
PlPenalties for late arrivals10kW·h/h
Table 5. Particle Swarm Optimization Parameters.
Table 5. Particle Swarm Optimization Parameters.
Parameter SymbolParameterData
ωmaxMaximum inertia weight0.9
ωminMinimum inertia weight0.4
TmaxMaximum number of iterations800
c1The learning factor1.5
c2The learning factor1.5
r1,r2The random number(0,1)
NNumber of particles80
Table 6. Vehicle Route Assignments.
Table 6. Vehicle Route Assignments.
Vehicle IDAssigned Route (Node Sequence, Depot = 26)
1[26-13-21-26]
2[26-11-12-4-26]
3[26-15-3-26]
4[26-9-19-26]
5[26-2-18-7-26]
6[26-5-22-26]
7[26-16-1-20-26]
8[26-8-14-26]
9[26-10-6-17-26]
Table 7. STLCSS Matching Score.
Table 7. STLCSS Matching Score.
EV-C2EV-C6EV-C8EV-C9
EV-P150.000025.000012.500033.3333
EV-P314.285728.571414.285716.6667
EV-P440.000011.111125.000033.3333
EV-P511.111122.222225.000033.3333
EV-P716.666733.333316.666716.6667
Table 8. Sensitivity Analysis of Total Operational Costs (¥) under Energy Model Perturbations.
Table 8. Sensitivity Analysis of Total Operational Costs (¥) under Energy Model Perturbations.
LevelV2V Sharing Cost (¥)Charging Station Cost (¥)Cost Reduction (¥)Cost Reduction (%)
0% (Baseline)416.11472.8156.7011.99%
+1%417.02473.1656.1411.86%
+2%418.79474.6255.8311.76%
+3%420.56477.0456.4811.84%
Mean418.12474.4156.2911.86%
Variance3.72 (¥2)3.35 (¥2)0.13 (¥2)0.009 (%)2
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ye, Z.; Han, Q.; Han, K.; Wang, Y.; Zhao, C.; Yang, H.; Du, J. Coordinating V2V Energy Sharing for Electric Fleets via Multi-Granularity Modeling and Dynamic Spatiotemporal Matching. Sustainability 2025, 17, 8783. https://doi.org/10.3390/su17198783

AMA Style

Ye Z, Han Q, Han K, Wang Y, Zhao C, Yang H, Du J. Coordinating V2V Energy Sharing for Electric Fleets via Multi-Granularity Modeling and Dynamic Spatiotemporal Matching. Sustainability. 2025; 17(19):8783. https://doi.org/10.3390/su17198783

Chicago/Turabian Style

Ye, Zhaonian, Qike Han, Kai Han, Yongzhen Wang, Changlu Zhao, Haoran Yang, and Jun Du. 2025. "Coordinating V2V Energy Sharing for Electric Fleets via Multi-Granularity Modeling and Dynamic Spatiotemporal Matching" Sustainability 17, no. 19: 8783. https://doi.org/10.3390/su17198783

APA Style

Ye, Z., Han, Q., Han, K., Wang, Y., Zhao, C., Yang, H., & Du, J. (2025). Coordinating V2V Energy Sharing for Electric Fleets via Multi-Granularity Modeling and Dynamic Spatiotemporal Matching. Sustainability, 17(19), 8783. https://doi.org/10.3390/su17198783

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop