Next Article in Journal
Sediment Distribution, Depositional Trends, and Their Impact on the Operational Longevity of the Jiahezi Reservoir
Previous Article in Journal
From Pixels to Plates: Exploring AI Stimuli and Digital Engagement in Reducing Food Waste Behavior in Lithuania Among Generation Z and Y
error_outline You can access the new MDPI.com website here. Explore and share your feedback with us.
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Hierarchical Spatio-Temporal Framework for Sustainable and Equitable EV Charging Station Location Optimization: A Case Study of Wuhan

1
School of Civil Engineering, Architecture and Environment, Hubei University of Technology, Wuhan 430068, China
2
School of Maritime Economics and Management, Dalian Maritime University, Dalian 116026, China
*
Author to whom correspondence should be addressed.
Sustainability 2026, 18(1), 497; https://doi.org/10.3390/su18010497
Submission received: 25 November 2025 / Revised: 26 December 2025 / Accepted: 30 December 2025 / Published: 4 January 2026

Abstract

Deploying public EV charging infrastructure while balancing efficiency, equity, and implementation feasibility remains a key challenge for sustainable urban mobility. This study develops an integrated, grid-based planning framework for Wuhan that combines attention-enhanced ConvLSTM demand forecasting with a trajectory-derived, rank-based accessibility index to support equitable network expansion. Using large-scale charging-platform status observations and citywide ride-hailing mobility traces, we generate grid-level demand surfaces and an accessibility layer that helps reveal structurally connected yet underserved areas, including demand-sparse zones that may be overlooked by utilization-only planning. We screen feasible grid cells to construct a new-station candidate set and formulate expansion as a constrained three-objective optimization problem solved by NSGA-II: maximizing demand-weighted neighborhood service coverage, minimizing the Group Parity Gap between low-accessibility populations and the citywide population, and minimizing grid-connection friction proxied by road-network distance to the nearest power substation. Practical deployment plans for 15 and 30 sites are selected from the Pareto set using TOPSIS under an explicit weighting scheme. Benchmarking against random selection and single-objective greedy baselines under identical candidate pools, constraints, and evaluation metrics demonstrates a persistent coverage–equity–cost tension: coverage-driven heuristics improve demand capture but worsen parity, whereas equity-prioritizing strategies reduce gaps at the expense of coverage and feasibility.

1. Introduction

With the global energy transition and carbon-neutrality targets, the electric car market has expanded rapidly. In 2024, global electric car sales surpassed 17 million units, raising the worldwide share of electric cars in new-car sales to over 20% [1]. China remained the largest market, with over 11 million electric cars sold in 2024; in the domestic market, electric cars accounted for almost half of all new car sales in China that year [1]. The International Energy Agency projects that global electric car sales will exceed 20 million units in 2025, representing more than one-quarter of cars sold worldwide [2]. However, the deployment of public charging infrastructure has not kept pace with this rapid adoption. According to [2], by the end of 2024, more than 75% of the European highway network had chargers at most 50 km apart, whereas only 35% of the U.S. interstate highway system achieved the same level of coverage. In the United States, total non-home charging deployment increased from about 151,000 in mid-2023 to 204,000 in 2024 [3].
In this context, the scientific accuracy and precision of charging station location optimization decisions have a direct impact on the development of the electric vehicle industry and the effectiveness of energy system transformation. Research demonstrates that optimized siting can reduce total system costs by 30–40%, increase charging station utilization rates by 60–80% compared to suboptimal locations, and reduce transmission losses by 6.4% [4]. Charging station location optimization has become a critical technical issue for promoting the sustainable development of the electric vehicle industry.
Early studies typically formulated charging station placement using facility location models and heuristic algorithms. For example, p-median and related formulations have been used to minimize travel distance and improve accessibility, but they often rely on static or simplified demand representations and thus struggle to capture short-term spatio-temporal dynamics [5]. Hybrid approaches combining location optimization with simulation can better reflect behavioral responses but may become computationally expensive for city-scale deployment [6]. Metaheuristics such as genetic algorithms, simulated annealing hybrids, and particle-swarm variants have also been widely adopted to handle non-convex search spaces and complex constraints [7,8,9,10]. However, many of these approaches focus on a single objective (or collapse multiple objectives into a single score), which can obscure the tensions among competing planning goals.
Multi-objective evolutionary algorithms have consequently gained popularity in charging infrastructure planning. NSGA-II and related methods have been used to jointly optimize coverage, cost, and other performance metrics, and comparative studies frequently report NSGA-II as a strong baseline for approximating Pareto-optimal trade-offs [11,12,13]. Beyond NSGA-II, multi-objective particle swarm optimization and other evolutionary designs have been applied to bi- or tri-objective charging station problems that consider electricity costs, emissions, or grid-oriented performance measures [14,15]. Despite these advances, two limitations remain common. First, many multi-objective siting studies still rely on coarse demand proxies (e.g., static POIs, aggregate travel indicators, or historical averages) rather than planning-ready spatio-temporal forecasts, which can lead to hotspot misallocation under demand sparsity. Second, the objectives and feasibility terms are often parameterized by assumptions that are difficult to verify at a city scale, which can weaken interpretability and transferability for practical planning.
Recent work has begun to incorporate richer mobility evidence and data-driven prediction to better match real-world charging dynamics. Activity-based and mobility-informed approaches can better reflect where charging opportunities are structurally needed beyond observed utilization alone [16,17,18]. Complementing these mobility-informed approaches, GIS-enabled multi-criteria screening frameworks have been used to synthesize heterogeneous spatial layers (e.g., land-use constraints, walk/cycle catchments, population exposure, and proximity to grid assets) into interpretable suitability surfaces or ranked zones, providing a transparent pre-planning step for narrowing candidate areas before detailed optimization [19,20]. In parallel, spatio-temporal deep learning has emerged as a promising tool for forecasting charging demand, including graph-based models and recurrent variants that better capture temporal dependence and spatial spillovers [21,22,23]. However, these components are often studied in isolation: forecasting is evaluated primarily by generic error metrics rather than planning-relevant hotspot fidelity, and mobility representations are rarely integrated into an end-to-end decision framework that explicitly balances efficiency, equity, and feasibility through constrained multi-objective optimization. As a result, planners still lack a unified workflow that (1) produces robust demand signals under extreme sparsity, (2) complements realized charging demand with citywide mobility structure to avoid overlooking demand-sparse but strategically important areas, and (3) yields transparent, constraint-feasible trade-off portfolios rather than a single opaque “optimal” solution.
To address these gaps, this study develops a hierarchical spatio-temporal planning framework that integrates demand forecasting, mobility representation learning, and constrained multi-objective siting within a unified grid-based decision system. First, a ConvLSTM-based forecasting module with attention is trained to produce short-term grid-level demand surfaces, and performance is evaluated in a planning-relevant manner under demand sparsity by reporting hotspot-oriented and system-level indicators. Second, to complement realized charging demand, a trajectory-based representation module (a denoising autoencoder with temporal self-attention) learns citywide mobility patterns and derives accessibility-related signals that remain available even in demand-sparse areas. Third, we formulate a constrained multi-objective siting problem that jointly considers demand-oriented service effectiveness, accessibility, and equity-oriented objectives, as well as infrastructure feasibility, which is proxied by network connection frictions. We solve it using NSGA-II to obtain Pareto-optimal portfolios. Finally, TOPSIS is used to rank Pareto solutions and recommend implementable deployment plans under different budget levels, enabling transparent comparison of efficiency–equity–cost trade-offs and supporting phased planning.
The remainder of the paper is organized as follows. Section 2 introduces the study area and datasets. Section 3 describes the hierarchical modeling and optimization framework. Section 4 reports forecasting performance, mobility-layer characteristics, and multi-objective siting results, including recommended plans. Section 5 and Section 6 discuss implications, limitations, and conclusions.

2. Study Area and Data

2.1. Overview of Experimental Area

This study was conducted in Wuhan, a major megacity in central China, located approximately. The city covers a total area of 8569.2 km2 and has a resident population of approximately 11.21 million, providing a large and diverse urban context for examining public EV charging demand. Administratively, Wuhan administers 13 districts (including Jiang’an, Jianghan, Qiaokou, Hanyang, Wuchang, Qingshan, Hongshan, Dongxihu, Caidian, Jiangxia, Huangpi, Xinzhou, and Hannan). These are commonly described as consisting of seven central urban districts and six suburban districts, reflecting a distinct core–periphery structure in urban functions and mobility. Wuhan has experienced rapid urban development and maintains a high level of urbanization; for instance, official statistics reported an urbanization level of 80.29% in 2018. Against this backdrop, the city has witnessed fast growth in EV adoption. Public reporting indicates that by 2023, Wuhan’s EV ownership had risen to around 360,000 vehicles, implying increasing pressure on public charging infrastructure to serve both routine and intensive users. In terms of urban structure, Wuhan has historically been shaped by the “three towns” (Hankou, Hanyang, and Wuchang), separated by the Yangtze and Han rivers, and has evolved into a polycentric metropolis. Such geography and functional heterogeneity tend to produce spatially concentrated activity hotspots and corridor effects, while also creating frictions (e.g., river-crossing constraints and uneven land-use intensity) that can lead to localized service gaps. These characteristics make Wuhan a suitable empirical setting for studying the spatial concentration of public charging demand and for evaluating planning strategies under realistic metropolitan conditions.

2.2. Charging Dataset Description

This study utilizes EV charging demand data collected from the Charging Bar mobile application. Charging Bar is an online platform that helps EV users locate nearby public charging stations and provides real-time availability at the connector level. A distinct advantage of this data source is its cross-platform aggregation: the dataset covers public charging stations operated by 22 charging service providers, which mitigates single-operator platform bias and offers a more comprehensive view of citywide public charging activity.
Within Wuhan, the platform lists 1701 public charging stations and 24,763 charging connectors. To capture realistic short-term demand dynamics, we conducted high-frequency monitoring from 00:00 on 14 February 2025 to 00:00 on 20 February 2025 (see Table 1). Data acquisition was performed using a time-series crawler that periodically queried the platform’s real-time status feed. At each snapshot, every connector is labeled as either “occupied” or “idle”. Because the platform API occasionally returns incomplete station lists and duplicated entries, we performed validity filtering and retained 5,859,191 connector-status records as the cleaned snapshot dataset.
Because a single snapshot does not represent a complete charging session, we optionally derive a session-like indicator from status transitions in each connector’s time series. Specifically, for a given connector, if its status is observed as occupied at time t and idle at the subsequent snapshot t′, we record an occupied-to-idle transition, indicating that a charging session likely ended within [t, t′]. Aggregating such transitions across all connectors during the observation window yields 1,823,781 transition-based events, which we use as an auxiliary behavioral statistic rather than the primary forecasting target.
Finally, all stations are mapped to a regular 1 km grid system, and the connector-level snapshots are aggregated into an hourly grid-level time series. For each grid cell i and hour t, the demand variable D i ( t ) is defined as the hourly count of active connectors (i.e., connectors in an occupied/charging state) within the cell. This connector-occupancy demand proxy serves as the primary input for both spatiotemporal forecasting and downstream location optimization. The transition-based events provide additional supporting evidence on usage intensity and charging dynamics, complementing the main connector-level demand representation.

2.3. Trajectory Dataset Description

GPS trajectory data enable the characterization of urban mobility patterns and the identification of potential public-charging hotspots. Unlike static built-environment variables, trajectory records capture dynamic activity intensity and spatial mobility flows, which serve as a strong behavioral precursor of where and when public charging demand is likely to occur. In this study, we use ride-hailing vehicle trajectories as a city-scale mobility proxy (see Table 2).
Ride-hailing trajectories are adopted for three reasons. First, ride-hailing vehicles are commercial vehicles with substantially higher daily mileage and operating hours than private cars, and thus tend to rely more heavily on public charging facilities, making them an informative proxy for intensive public-charging demand. Second, ride-hailing trips are strongly associated with significant activity centers such as commercial districts, transport hubs, and office clusters, which helps identify latent charging demand hotspots. Third, ride-hailing vehicles serve as a transport carrier and do not alter passengers’ underlying travel purposes, allowing them to represent urban residents’ macro-level mobility intentions with relatively low behavioral bias.
The GPS data are obtained from the Gaode Open Platform. It is important to note that unlike experimental datasets with fixed-frequency continuous sampling, this commercial dataset utilizes an event-triggered and non-uniform sampling mechanism to optimize data transmission efficiency. Specifically, the recording frequency adapts to vehicle status: our statistical analysis reveals a median sampling interval of 10.0 s during active movements, ensuring high-resolution capture of mobility flows, whereas intervals extend significantly during idling or congestion to avoid redundant data:
(1)
Spatial filtering: removing records with coordinates outside the Wuhan administrative boundary;
(2)
Outlier removal: filtering implausible jump points caused by GPS drift (e.g., abnormal displacement within a short time interval that implies unrealistic speed);
(3)
Invalid record cleaning: removing duplicates, missing values, and logically inconsistent trips (e.g., end time earlier than start time).
(4)
Activity filtering: To focus on effective mobility demand, we specifically excluded records corresponding to long-duration idling, parking, and inactive periods (e.g., driver rest breaks or offline shifts).
After strictly applying these cleaning criteria, we constructed a high-quality ride-hailing trajectory database containing 29,143 vehicles operating within Wuhan from 17 to 19 October 2019. The final dataset consists of 16,791,722 valid GPS waypoints. It is important to clarify that the vehicles were not sampled on a continuous 24 h basis. Instead, the dataset captures the discontinuous nature of ride-hailing operations (typically 8–12 h shifts), recording only active service periods while naturally excluding offline non-service time. Consequently, the retained data effectively represent the spatiotemporal distribution of urban charging demand without redundancy.
To quantify mobility intensity at a macro level, we map trajectory records to the same grid system used in the charging-demand analysis and discretize time at an hourly resolution. Within each grid cell and each hour, we compute grid-level summaries that reflect urban activity intensity (e.g., trajectory-point counts and/or vehicle-passage intensity). We then apply smoothing to reduce the influence of extreme values and differences in reporting frequency and obtain a stable grid-based mobility intensity layer. This mobility layer captures the spatial concentration of activity hotspots and serves as an input to downstream accessibility/activity characterization, as well as location-planning modules.

2.4. Data Preprocessing and Gridding

We discretize Wuhan into a regular 1 km × 1 km grid to construct a unified spatial analysis framework. The grid resolution is selected empirically as a practical trade-off: coarser grids can obscure localized demand hotspots and residential exposure, whereas finer grids rapidly expand the decision space and increase the computational cost of repeated coverage and distance evaluations in NSGA-II. Consistent with prior city-scale studies adopting kilometer-level gridding to balance spatial representativeness and computational feasibility [24,25], we select a 1 km grid. This produces 9023 valid cells within the Wuhan administrative boundary, providing adequate granularity for planning without incurring prohibitive computational burden.
In addition to the charging-demand and trajectory-based mobility layers, the multi-objective station location model requires auxiliary spatial datasets to represent equity, connection frictions, and feasibility constraints. All auxiliary datasets are harmonized to the Wuhan municipal boundary and aggregated to the same 1 km grid to ensure spatial consistency across layers and objectives. To operationalize equity in service provision, we use gridded population data derived from China’s Seventh National Population Census. The population surface is clipped to the Wuhan boundary and aggregated to the analysis grid via zonal summation, yielding a grid-level population indicator. This layer captures the spatial distribution of residents and supports equity-aware assessment of alternative station deployment plans [26] (see Figure 1a).
To represent power-infrastructure availability, we extract substation locations from OpenStreetMap (OSM). Substation point features are filtered to the Wuhan boundary and treated as proxy supply nodes for grid connection. For each candidate grid cell, we quantify connection friction by its proximity to the nearest substation and incorporate this indicator as a cost proxy in the optimization, discouraging solutions that systematically place new facilities far from existing power nodes. Because Euclidean distance can underestimate real-world connection frictions, distances between candidate locations and substations are computed as shortest-path distances on the drivable road network. Specifically, we construct a routable road graph (drivable links only) and calculate the shortest path length from each candidate cell centroid to its nearest substation, yielding a more realistic approximation of connection difficulty (see Figure 1b).

3. Methodology

3.1. Spatio-Temporal Charging Demand Forecasting

Public charging demand is highly sparse and exhibits strong spatio-temporal clustering. We therefore formulate short-term public-charging demand forecasting on a regular 1 KM grid as a planning-oriented prediction task, where the goal is to provide a reliable demand surface for subsequent siting decisions.
We discretize the study area into a regular lattice of valid grid cells indexed by i . For each hour t , we aggregate platform status observations within each cell to obtain a grid-level demand proxy y i , t , defined as the number of charging connectors that are actively charging during hour t (utilization intensity). Separately, we derive a capacity proxy cap i as the total number of connectors located in cell i (supply scale).
For each time step t , we construct a 6-channel input tensor X t H × W × 6 consisting of:
Demand channel: log ( 1 + y i , t ) to stabilize the heavy-tailed distribution of observed utilization.
Capacity channel: cap i (broadcast over time), representing grid-level supply intensity.
Time encodings (four channels): deterministic cyclic features capturing hour-of-day and day-of-week seasonality:
sin ( 2 π h t / 24 ) , cos ( 2 π h t / 24 ) , sin ( 2 π d t / 7 ) , cos ( 2 π d t / 7 )
These time-encoding channels are broadcast to all grid cells for the corresponding time step. Given an input window length L ( L = 12 h), the model predicts the next-hour demand map:
y ^ t + 1 = f θ ( X t L + 1 , , X t )
Training samples are generated by sliding the window across the full time series. We split sequences chronologically into 80 % training, 10 % validation, and 10 % test. Demand and capacity channels are standardized using training-set statistics and then denormalized for evaluation on the original scale (inverse transform of log ( 1 + y ) ).
We employ a multi-layer ConvLSTM backbone with attention to capture local spatio-temporal dynamics. Specifically, the network consists of three stacked ConvLSTM layers (16.8 and 4 filters, 3 × 3 kernels) with batch normalization and dropout (0.2) after the first two layers, followed by a Convolutional Block Attention Module (CBAM) to refine features via channel and spatial attention. A 1 × 1 convolution with ReLU activation produces the non-negative prediction in the log ( 1 + y ) space.
To address extreme sparsity, we optimize a composite loss that emphasizes active regions while maintaining global fit and spatial coherence:
L = L sparse + λ g L grad
The sparse-aware regression term combines a masked MSE on active cells with an unmasked global MSE:
L sparse = α MSE masked ( y ^ , y ) + ( 1 α ) MSE global ( y ^ , y )
where the binary mask is defined as m i , t = 1   if   y i , t > 0 ,   and   0   otherwise (computed on the original-scale y, applied in the transformed space). The gradient term penalizes discrepancies in first-order spatial gradients to preserve hotspot structure and avoid spurious speckle:
L grad = x y ^ x y 1 + y y ^ y y 1
The model is trained with Adam, early stopping (patience 12), and ReduceLROnPlateau scheduling (factor 0.5, patience 6), with mixed precision enabled for efficiency.
For benchmarking, we evaluate two standard baselines on the original scale: (i) Persistence, which predicts y t + 1 = y t ; and (ii) Historical Average (HistAvg), which uses the mean demand for the same (day-of-week, hour) computed from the training period, with hierarchical backoff (same hour; same day-of-week; global mean) if a specific slot is missing. Given the sparsity of grid demand, we report two complementary evaluation sets computed after inverse transformation: (i) active-only metrics (MAE, RMSE, WMAPE) computed on cells with y i , t > 0 , and (ii) a global WMAPE computed over all grid cells and time steps (including zeros), reflecting system-level aggregate error.

3.2. Trajectory-Driven Denoising Autoencoder with Temporal Self-Attention

Public charging demand is shaped not only by realized utilization (which is supply-constrained) but also by the broader mobility structure of the city. We therefore learn a mobility-informed representation from ride-hailing trajectories and derive a unitless, rank-based accessibility index ( access rel ) that captures relative structural connectivity rather than an absolute travel-time measure.
Let g i , t denote the hourly ride-hailing trajectory intensity aggregated to grid cell i at hour t (e.g., the number of GPS waypoints or trajectory counts falling within the cell). We apply a log transform and min-max scaling to obtain x i , t = MinMax ( log ( 1 + g i , t ) ) , where the scaling parameters are estimated from the training period and then applied consistently to all splits to avoid leakage.
Because mobility signals are dominated by persistent level differences (core vs. periphery), we construct a de-seasonalized residual series to emphasize local temporal deviations:
r i , t = x i , t μ h ( t ) , d ( t ) ,
where μ h , d is the mean of x i , t over the training period at the same hour-of-day h and day-of-week d . This revisualization suppresses stationary intensity levels and highlights temporal signatures linked to commuting and activity rhythms.
We then extract fixed-length windows r i ( k ) L m for each grid cell i with window length L m = 48 h and stride 3 h (Table S1). To improve robustness against missing records and measurement noise, we train a denoising sequence autoencoder. Each input window is corrupted by (i) random element masking (mask ratio ρ = 0.20 ) and (ii) additive Gaussian noise ε ~ N ( 0 , σ 2 ) with σ = 0.05 on the scaled residual series:
r ˜ = b r + ε
where b is a Bernoulli mask. The model reconstructs the clean residual sequence by minimizing mean squared reconstruction error.
The encoder uses stacked LSTMs (64 and 32 units) to obtain a latent sequence representation, followed by temporal self-attention to focus on critical time steps. Given hidden states H = [ h 1 , , h L m ] , we compute:
Q = H W Q , K = H W K , V = H W V ;
A = softmax ( Q K T / d ) ; H = A V .
The attended sequence H′ is then pooled (global average pooling) to yield a fixed-length embedding z i ( k ) d z with d z = 16 (Table S1). The decoder (32 and 64 units) reconstructs the residual sequence from z i ( k ) , enforcing information preservation through the bottleneck.
For each grid cell i , we aggregate embeddings across all windows by temporal averaging, z i = mean k z i ( k ) . To obtain a comparable scalar accessibility score, we project the embeddings onto a one-dimensional structural axis via principal-component projection (first PC), yielding s i . Finally, we convert s i into a unitless, rank-based index:
access rel ( i ) = rank ( s i ) N 1
where N is the number of valid cells (ties broken by average rank). By construction, access rel reflects the relative ordering of structural connectivity (higher values indicate more centrally connected mobility signatures) and is therefore suitable for equity stratification without claiming absolute accessibility levels.
Model training uses Adam with a batch size of 1024 for up to 60 epochs with an x 80 / 20 window split into training/validation (Table S1). The learned access rel surface is exported as a planning layer for downstream optimization.

3.3. Multi-Objective Optimization and Spatial Economics-Based EV Charging Station Location Planning

We formulate EV charging station expansion as a discrete, grid-based location planning problem that integrates (i) demand potential, (ii) mobility-derived relative accessibility, (iii) population exposure, and (iv) implementation feasibility.
For each feasible buildable grid cell i , we construct four planning attributes: (1) a demand score D i from the ConvLSTM demand surface (mean predicted daily peak intensity); (2) the mobility-derived relative accessibility index access rel ( i ) ; (3) population P i aggregated to the grid from a population raster; and (4) an implementation feasibility proxy C i defined as the road-network shortest-path distance (km) from cell i to its nearest power substation (connection friction).
We screen feasible cells to form a tractable candidate pool I (maximum size 600 in the main configuration). Let x i { 0 , 1 } indicate whether a candidate cell i is selected, with the deployment scale fixed to i I x i = K ( K { 15 , 30 } ).
Neighborhood service rule (served indicator). Because service is evaluated on a discrete grid and planners often interpret coverage in neighborhood terms, we define a binary served indicator for each demand cell j based on the 3\times 3 Moore neighborhood N ( j ) centered at j :
served ( j ) = 1   if   i N ( j )   such   that   x i = 1 ;   otherwise   0
Under a 1   km grid, the farthest neighbor in N ( j ) is at distance 2   km ( 1.41   km ), which is consistent with the neighborhood-scale service radius used in mapped interpretation.
Smooth distance-decayed coverage (kernel). To provide a smooth optimization signal and reflect diminishing marginal returns with distance, we define the effective service received by cell j from the selected set S = { i I x i = 1 } as the best (closest) exponential influence:
cov ( j ) = max i S exp ( d i j / λ )
where d i j is the centroid-to-centroid Euclidean distance (km) between grid cells i and j , and λ is the decay scale ( λ = 2.5   km in the main configuration; Table S2).
Objective 1: Maximize demand-weighted service effectiveness (kernel-based coverage). We maximize:
F 1 ( x ) = j D j cov ( j )
This objective prioritizes locating new sites near high-demand areas while allowing benefits to attenuate smoothly with distance and preventing double counting via the max operator.
Objective 2: Minimize equity gap (Group Parity Gap, GPG) under the served indicator. We define low-accessibility zones as the bottom quantile of access rel (q = 0.25 in the main configuration), and compute population-weighted served shares:
S ( G ) = j G P j served ( j ) j G P j .
The equity objective is then the absolute parity gap between the low-access group G L and the overall population G all :
F 2 ( x ) = | S ( G L ) S ( G all ) | .
To ensure a minimum level of investment in structurally disadvantaged areas, we impose a hard quota constraint on selected sites:
i I G L x i m low ,
where m low is set in the main configuration as reported in Table S2 (e.g., m low = 3 ).
Objective 3: Minimize implementation feasibility friction. We minimize the mean substation-distance proxy of selected sites:
F 3 ( x ) = 1 K i I x i C i .
Spatial dispersion constraint. To avoid redundant clustering and reflect practical spacing, we enforce a minimum separation distance d min between any two selected sites ( d min = 1.0   km in the main configuration; Table S2), computed on centroid Euclidean distance:
d ( i , k ) d min   for   all   i k   with   x i = x k = 1 .
We solve the constrained three-objective problem using NSGA-II and run three independent trials per deployment scale to address algorithmic stochasticity.
Because the Pareto set contains multiple efficient trade-offs, we select an implementable compromise plan using TOPSIS in the normalized objective space with explicit weights ( w cov / w eq / w cost = 0.45 / 0.35 / 0.20 ; Table S2).

4. Results

4.1. ConvLSTM Demand Forecasting Results

As public charging activity is highly uneven across space and time, the hourly grid tensor is dominated by zero-demand cells, with only 7.75% of grid–time observations being active. Under such sparsity, overall error metrics can be misleading because a model may appear accurate simply by predicting near-zero values, while still failing in operationally relevant hotspots. We therefore report two complementary sets of metrics (see Table 3): (i) active-only errors computed on non-zero grid–time cells to assess hotspot dynamics, and (ii) citywide aggregate error (WMAPE_total) computed on the sum over grids to evaluate whether the model preserves the overall demand trajectory. This dual reporting provides a balanced assessment of both local fidelity and system-level consistency, which is particularly important when forecasts are later used to parameterize planning objectives and spatial prioritization. Table 3 shows that ConvLSTM achieves the strongest performance across all metrics. On active cells, it reduces MAE_active to 2.586 and RMSE_active to 3.941, compared with 4.422/8.449 for Persistence and 3.315/6.058 for Historical Average. In relative terms, this corresponds to a 41.5% reduction in MAE_active compared to Persistence and 22.0% compared to HistAvg; for RMSE_active, the reductions are 53.4% and 34.9%, respectively. The same advantage is reflected in WMAPE_active, where ConvLSTM achieves 19.90%, substantially lower than Persistence (34.02%) and HistAvg (25.50%). Crucially, the improvement is not achieved by merely fitting sparse zeros: at the citywide level, ConvLSTM also attains the lowest WMAPE_total (21.04%), representing an approximately 39.3% reduction relative to both Persistence (34.64%) and HistAvg (34.67%). This indicates that the proposed model better captures not only the magnitude of demand in active locations but also the aggregate evolution of urban charging load, thereby reducing systematic underestimation or overestimation in total demand.
To interpret these improvements, Figure 2 shows the citywide total demand time series (sum over all grids) for observed versus predicted values across the test horizon. Persistence typically reproduces short-term inertia but struggles with turning points, leading to lagged responses around demand ramps and peak hours. HistAvg provides a smoother baseline but tends to regress toward an “average day profile,” dampening peak amplitude under non-stationary conditions. By contrast, ConvLSTM tracks both timing and amplitude more closely, showing reduced peak underestimation and fewer phase shifts at high-demand hours. This behavior is consistent with the lower WMAPE_total, suggesting that the model learns transferable spatio-temporal dependencies beyond simple carry-forward or average seasonal patterns. Importantly for planning, improved peak tracking means hotspot prioritization is driven by meaningful temporal variation rather than being biased toward the “average” condition. Beyond temporal alignment, the model’s value for spatial planning depends on whether it preserves the geography of demand hotspots.
Figure 3 compares the observed and predicted peak demand surfaces, visualizing the absolute error. ConvLSTM reproduces the dominant hotspot structure (core clusters and corridor-like patterns) while controlling spurious diffusion into low-demand areas. The error map further shows that residuals are concentrated in a limited subset of high-intensity cells rather than being spatially widespread, indicating that the model’s remaining mistakes are primarily associated with the most volatile hotspots (where short-term fluctuations and local constraints are most substantial). This spatial behavior supports using the predicted demand layer as a planning-relevant surface: it maintains the relative ranking of high-priority grids while avoiding systematic false positives in peripheral cells. A common pitfall in short observation windows is that a weekday–hour historical average can degenerate into near-constant predictions when the training period does not cover the same day-of-week patterns as the test period. In our aligned implementation, the HistAvg baseline is corrected using a backoff strategy, ensuring it remains informative and preventing the creation of an artificially weak comparator. Under this fair setting, ConvLSTM still yields substantial gains, particularly on active grids and aggregate demand, demonstrating that the improvement is not a result of “win by baseline collapse” but reflects genuinely stronger predictive skill. From a planning perspective, the improved accuracy at both active-grid and total-demand levels reduces the risk of misallocating resources due to distorted hotspot intensity or biased citywide load estimates, thereby providing a more reliable demand layer for subsequent multi-objective siting.

4.2. Mobility-Derived Spatial Layers from Ride-Hailing Trajectories

This section reports the mobility-derived grid surfaces constructed from the ride-hailing GPS dataset using the trajectory encoder (TDAE). The purpose is not to forecast mobility itself, but to extract planning-relevant spatial representations of (i) mobility exposure and (ii) relative accessibility at the grid level. These layers are subsequently used to parameterize the accessibility-related objective and spatial constraints in the multi-objective siting model. Accordingly, our evaluation focuses on whether the learned surfaces are spatially interpretable, stable under rank-based comparison, and consistent with the observed charging-demand pattern (Figure 4).
Figure 4a visualizes the grid-level mobility intensity surface, which reflects the concentration of vehicle activity captured by the cleaned ride-hailing trajectories. The resulting field exhibits a pronounced core–corridor structure, with high-intensity clusters centered around the central urban area and elongated corridors aligned with principal transportation axes, while the peripheral grids are markedly weaker. This spatial organization is consistent with the expected activity pattern of a polycentric metropolis, indicating that the mobility signal is not dominated by random sampling noise. This concentration implies that mobility exposure is highly clustered and that siting decisions based on uniform spatial allocation would be inefficient in terms of serving high-activity locations.
Figure 4b reports the derived relative accessibility surface. Importantly, we interpret relative accessibility as a relative ranking/index of grid accessibility or attractiveness within the citywide mobility system, rather than an absolute travel-time measure. This framing is essential because the mobility trajectories (October 2019) and charging observations (February 2025) are not temporally aligned. By focusing on a unitless, rank-based accessibility index, we avoid over-claiming year-specific traffic conditions while still capturing the more persistent spatial structure of activity centers and corridor connectivity. The access_rel surface displays a coherent pattern in which central hubs and well-connected corridors consistently achieve higher relative accessibility, providing a complementary planning layer that reflects structural connectivity rather than demand realization alone.
To verify the stability of the accessibility pattern over time, we conducted a perturbation-based sensitivity analysis (see Figure 5). We compared the 48 h baseline against 24 h and 72 h windows and also tested a scenario random noise to simulate potential future changes (20% masking + Gaussian noise std 0.05). The results show that the core hotspots remain highly stable. The top 10% hotspots achieved an overlap of 94.6–95.7% across time windows and maintained 89.6% even under random noise. Furthermore, the global rank correlation remained near-perfect ( ρ > 0.99 ). These findings confirm that permanent urban infrastructure (e.g., road networks and commercial centers) drives these hotspots rather than short-term fluctuations. This justifies using the 2019 data as a robust proxy for the long-term relative spatial structure.
To examine whether mobility-derived signals are descriptively aligned with charging behaviors, Figure 6 compares hotspots from the mobility exposure layer against hotspots from the charging-demand surface. We define the analytical universe as the complete grid set (N = 9023). A key finding is the extreme sparsity of realized charging demand: non-zero charging observations occur in only 679 grids (accounting for just 7.53% of the map). In contrast, mobility signals derived from the TDAE layer provide a ubiquitous measure of potential exposure across the city. Comparing the spatial peaks, the hotspot sets (defined as the top 10% of grids in the valid intersection universe) show a moderate spatial overlap, with a Jaccard index of 0.39. This suggests that while high-mobility corridors often exhibit high charging demand, the two are distinct: mobility represents potential, while demand represents actual usage. Within the subset of locations that actually capture charging demand (active grids, N = 679), the distribution is moderately concentrated, with the top 10% of busiest grids accounting for 35.05% of the total realized load. This suggests a hierarchical service structure where a minority of core stations serve a significant portion of demand, but not to the extent of total monopolization. These results fundamentally justify the multi-objective integration of ConvLSTM and TDAE. Optimization driven solely by predicted demand would be blinded by the 7.53% sparsity, likely reinforcing existing cores while ignoring the vast majority of the city where latent demand exists but historical data is zero. By incorporating TDAE, which provides a continuous accessibility surface even in demand-blind zones, the framework can identify and serve structurally well-connected locations that are currently underserved, balancing the exploitation of known demand hotspots with the exploration of high-mobility potentials.

4.3. Multi-Objective Optimization Results and Trade-Offs

We solve the charging-station siting problem as a constrained multi-objective optimization problem. To address the stochastic nature of the NSGA-II algorithm and ensure the statistical reliability of the results, we executed three independent runs for each deployment scale (K = 15 and K = 30). The detailed hyperparameter settings for the optimization (e.g., population size, crossover/mutation rates) are provided in Table S2.
Across all independent runs, the distributions of objective values remained highly stable, confirming robust search behavior and reproducible trade-off patterns. Notably, the “allow existing” and “new-only” settings yielded identical objective summaries in our experiments, indicating that the existing-site option is not binding under the current candidate pool and constraints; thus, we report the aggregated results below.
Trade-offs for K = 15 (Figure 7a): The solution set exhibits an apparent tension among efficiency, equity, and cost. The coverage-oriented extreme achieves a coverage of 1504.5–1528.9, but with higher equity gaps (0.0853–0.0995) and moderate costs (1.866–2.491 km). In contrast, the equity-oriented extreme reduces the equity gap to a near-zero level (≈0),) at the expense of lower coverage (739.7–1027.2) and higher cost (1.529–2.938 km). Cost minimization alone yields very low-cost solutions (0.655–0.831 km), but these come with inferior service effectiveness (coverage 457.1–866.2) and non-negligible equity gaps (0.0152–0.0409). To obtain a balanced plan, we select a best-compromise solution from the Pareto set using TOPSIS. The compromise solutions are highly stable across the three independent runs: Coverage = 1009.0 ± 54.6, Equity Gap = 0.0078 ± 0.0045, and Cost = 1.550 ± 0.114 km (Mean ± SD). For the representative replicate shown in Figure 7a (rep_03; red star), the selected plan achieves Coverage = 1070.2, Equity Gap = 0.0129, and Cost = 1.462 km.
Trade-offs for K = 30 (Figure 7b): Increasing the deployment scale shifts the feasible trade-off surface upward in coverage while preserving the equity–cost tension. The coverage-oriented extreme ranges from 2530.5 to 2574.6, but with substantially larger equity gaps (0.1885–0.2264) and costs of 1.795–2.106 km. The equity-oriented extreme again achieves a near-zero equity gap (≈0) but requires higher cost (1.973–5.967 km) and sacrifices coverage (556.0–1176.9). The cost-oriented extreme attains relatively low costs (1.042–1.176 km) while maintaining moderate coverage (859.5–1561.1), albeit with higher equity gaps (0.0181–0.0825).The TOPSIS compromise solutions for K = 30 are likewise stable across runs: Coverage = 1728.9 ± 124.8, Equity Gap = 0.0179 ± 0.0113, and Cost = 2.535 ± 0.362 km (Mean ± SD). For the representative replicate shown in Figure 7b (rep_03; red star), the selected plan achieves Coverage = 1872.7, Equity Gap = 0.0229, and Cost = 2.704 km. Overall, the results highlight (i) diminishing coverage returns relative to cost, and (ii) a persistent equity–cost tension, where near-zero equity gaps are achievable only by accepting higher infrastructure costs and moderate coverage reductions.

4.4. Recommended Siting Plan and Spatial Characteristics of Selected Sites

Figure 8 maps the selected new-build stations under the Strictly New + Neighborhood setting, overlaid on the feasible buildable grid (dark gray) and existing stations (light gray). The K = 15 solution (Figure 8a) exhibits a clearly core-oriented but non-redundant spatial pattern. Most selected cells cluster in the central urban corridor, where existing stations are already dense. However, the new sites are placed in adjacent buildable cells rather than on existing-station cells, indicating a gap-filling strategy around saturated hubs. Beyond this central cluster, only a small number of sites are allocated to outer gaps—including a few cells in the north and south—which act as early “anchors” to reduce uncovered pockets under a 1.5 km neighborhood service radius.
When the budget increases to K = 30 (Figure 8b), the solution retains the K = 15 core (red) and adds an expansion set (yellow) that follows two dominant spatial logics. First, it densifies the central corridor by filling remaining adjacent buildable cells around the main activity cluster, strengthening local redundancy resistance while improving neighborhood coverage continuity. Second, the expansion sites extend along a north–northeast belt, forming a more elongated chain of new-build cells that pushes service toward peripheral and secondary-demand areas. This produces a more citywide footprint without uniformly dispersing sites, suggesting that the optimizer prioritizes connecting and extending existing service basins rather than scattering stations across low-demand areas. n addition to the layer-level stability test, we conducted a decision-level robustness check by perturbing the learned accessibility index access_rel with a ±20% noise scenario and re-running the full NSGA-II siting procedure under the same configuration (three independent runs). The TOPSIS-selected compromise plans remain broadly consistent with the baseline recommendations: for K = 15, the overlap is 11/15, 12/15, and 11/15 across runs (overlap ratio 0.733–0.800), and for K = 30, the overlap is 23/30, 25/30, and 22/30 (overlap ratio 0.733–0.833). Differences are limited to a small number of marginal substitutions near selection thresholds, while the overall spatial targeting and the two dominant logics identified above (central infill around hubs without co-location, and targeted peripheral extensions) remain consistent.
Overall, the mapped results confirm that the “Strictly New” requirement does not force unrealistic relocation away from demand. Instead, it yields a structured pattern of (i) central infill around existing hubs (without co-location) and (ii) targeted peripheral extensions that exploit neighborhood spillovers while directing part of the additional capacity to underserved edges.
To improve interpretability of the TOPSIS-recommended plan beyond Pareto summaries, we provide a compact descriptive table of representative selected cells and their key grid-level attributes (see Table 4). The table reports neighborhood-level served-demand indicators—computed over the 3 × 3 neighborhood centered at each selected cell, consistent with the neighborhood service rule—together with density proxy, baseline accessibility (trajectory-derived), substation-distance feasibility proxy, and population. These examples illustrate the typical placement logic under the joint coverage–equity–feasibility objectives: several selected cells are located adjacent to strong demand neighbors and therefore exhibit high neighborhood demand (e.g., neighborhood-demand sums of 161.133 at (57,57) and 143.733 at (71,53)), while still spanning heterogeneous urban contexts (density proxy 1.097–3.500; access_rel 0.335–0.421) and moderate substation distances (1.251–2.985 km). Importantly, because service is evaluated at the neighborhood level, a station does not need to coincide with the demand peak cell to deliver substantial coverage, and the neighborhood-demand columns provide a decision-relevant summary of the effective demand served around each selected location. For completeness, the full list of selected cells for the TOPSIS plan is provided in the exported results.
Finally, Figure 9 reports a like-for-like comparison of siting strategies under the same candidate pool (the exported 600-cell pool), the same constraints (fixed K , the 3 × 3 neighborhood service rule, d min , and K low ), and the same metric definitions for coverage, GPG, and cost. All values are normalized to the selected MOO compromise solution (MOO = 1.0; values > 1 indicate better performance after orienting cost-type criteria).Across both K = 15 and K = 30 , the greedy MaxCov baseline achieves higher coverage than MOO (=1.36 and =1.30), but its equity performance drops sharply (GPG ratios 0.10 in both scenarios) and its cost performance is also weaker (0.61–0.71), indicating that coverage maximization alone can severely compromise parity and feasibility. Conversely, MaxEq improves parity relative to MOO (1.25–1.11) but delivers substantially lower coverage (0.36–0.40) and poorer cost performance (0.40–0.39), reflecting efficiency losses when equity is pursued in isolation. Overall, the MOO framework avoids these one-sided extremes and provides a stable compromise solution that jointly manages the coverage–equity–cost tension under realistic constraints.

5. Discussion

This research developed a comprehensive hierarchical framework for optimizing EV charging station locations, with results demonstrating significant improvements in efficiency, cost-effectiveness, and equity. The findings have direct implications for sustainable urban development.

5.1. Principal Findings and Interpretation

This study yields three substantive insights that are relevant to charging-infrastructure planning. First, forecasting must be formulated and evaluated as a planning signal under sparsity, not as a purely predictive exercise. At the 1 km grid–hour resolution, the demand tensor is dominated by zero values (only 7.75% of grid–time observations are active), so “overall” metrics can reward models that fit zeros while remaining uninformative for operational hotspots. The dual evaluation protocol—hotspot-oriented errors on active cells plus system-level WMAPE on the citywide aggregate—therefore reflects a planning logic: siting decisions are driven by where peak stress emerges and whether the total demand trajectory is preserved.
Second, the analysis clarifies that mobility exposure and realized charging demand capture different constructs. The trajectory-derived layers provide ubiquitous citywide signals and exhibit a stable core–corridor structure, while realized charging demand is highly spatially sparse and constrained by the existing network. The moderate hotspot overlap (Jaccard = 0.39) indicates complementarity rather than redundancy: mobility represents potential exposure/structural connectivity, whereas demand reflects realized utilization shaped by supply and behavioral adaptation. This distinction explains why integrating both layers can mitigate “demand-blind” zones and supports an exploration–exploitation balance in siting.
Third, the optimization results show that equity gains are not a by-product of coverage expansion. Across deployment scales, the Pareto structure reveals a persistent tension between equity and cost: approaching near-zero equity gaps typically requires accepting higher connection frictions and/or moderate coverage reductions. Importantly, the TOPSIS-selected compromise solutions remain stable across independent NSGA-II runs, providing an implementable recommendation while keeping the underlying trade-offs transparent.

5.2. Planning Implications for Phased Deployment and Spatial Strategy

The framework provides several practical implications for planners. First, phased deployment is not merely an operational convenience but a rational response to structural trade-offs. Increasing the deployment budget (e.g., K = 15 to K = 30) expands the achievable service frontier and improves citywide coverage, yet it does not eliminate the equity–cost tension; rather, it reveals what additional coverage requires in terms of feasibility frictions (proxied here by substation-access distance) and distributional balance. This has two planning implications. On the one hand, early-stage investments should prioritize high-leverage “gap-filling” sites—locations that produce large neighborhood-level coverage gains while avoiding redundancy with already-served cores—because marginal returns are typically higher when service basins are fragmented. On the other hand, later-stage expansion can shift toward network consolidation and corridor extension, where the objective is not only to add coverage but to improve continuity of neighborhood service and to reduce persistent underserved pockets. From a management standpoint, this staged interpretation suggests a practical workflow: use smaller-K compromise solutions as “Phase-I candidates” for rapid deployment and pilot evaluation, then update the weight scenario (or the feasible candidate set) to generate Phase-II plans once utilization feedback and implementation constraints become clearer.
Second, the recommended layouts suggest a spatial logic that is more realistic than “uniform dispersion” interpretations of equity. Under the strictly new-build requirement and the minimum-separation constraint, the optimizer is discouraged from selecting already saturated cells and from co-locating with existing stations. Instead, it tends to place new sites in adjacent buildable cells around demand hubs—core-oriented but non-redundant infill—combined with a limited number of peripheral or secondary-center anchors that reduce uncovered pockets under the neighborhood service rule. This pattern carries a concrete equity message: equity-oriented planning does not necessarily mean reallocating investment away from the core; rather, it can be operationalized as (i) reducing local access gaps by targeting cells adjacent to saturated hubs, where marginal additions still improve neighborhood-level availability and reduce queuing pressure, while (ii) strategically extending service to edges and secondary belts where residents are otherwise persistently outside the service radius. In other words, the framework supports an interpretable “gap-filling + anchor” strategy that aligns with how networks are actually expanded: strengthening coverage in high-use basins without redundant co-location, while preventing peripheral exclusion via targeted expansions instead of diffuse scatter.
Finally, because TOPSIS is applied with an explicit weighting scheme (baseline weights: 0.45/0.35/0.20 for coverage/equity gap/cost; Supplementary Materials, Table S2), the framework naturally supports transparent scenario analysis. Rather than presenting a single “optimal” plan, the Pareto set provides the menu of feasible trade-offs, and TOPSIS offers a reproducible rule to select an implementable compromise under a stated policy preference. This enables planners to translate policy priorities into operational choices: for example, a reliability- or congestion-relief scenario may emphasize coverage (higher weight on coverage), a social-inclusion scenario may prioritize reducing disparity (higher weight on equity), and a grid-coordination scenario may emphasize feasibility (higher weight on cost proxy). Importantly, reweighting does not change the underlying optimization engine or the constraint environment; it changes only the decision rule for selecting a plan from the same trade-off surface, which keeps the planning process auditable and easy to communicate to stakeholders. We therefore view the weighting scheme not as a fixed “expert truth” but as a policy lever that can be stress-tested, documented, and updated as implementation feedback accumulates.

5.3. Limitations and Directions for Future Research

Several limitations define the boundary of interpretation and point to extensions:
(1)
Temporal representativeness: the demand layer is derived from a limited observation window; more extended temporal coverage would better capture seasonal and event-driven variability, thereby improving the stability of hotspot definitions.
(2)
Temporal mismatch between mobility and charging datasets: mobility trajectories and charging observations are not temporally aligned. We therefore interpret relative accessibility as a unitless, rank-based index that captures persistent structural connectivity, rather than an absolute measure of travel time. Future work should incorporate temporally matched trajectory data to strengthen contemporaneous access inference.
We also note that both city-scale ride-hailing GPS trajectories and real-world public-charging usage records are valuable yet inherently scarce datasets and fully synchronized multi-source data at this scale are rarely available in practice. We therefore aim to make a transparent and defensible use of the best-available data, while quantifying the associated uncertainty via sensitivity/robustness diagnostics wherever possible.
(3)
Engineering realism of the cost proxy: the cost term is modeled as road-network shortest-path distance to the nearest substation, capturing connection friction but not feeder capacity, transformer constraints, or permitting feasibility. Incorporating grid-capacity constraints and station-type heterogeneity would improve implementation realism and may reshape the equity–cost frontier.

6. Conclusions

This study develops a hierarchical spatio-temporal decision framework for public EV charging station expansion that integrates demand forecasting, mobility representation learning, and constrained multi-objective siting into a unified grid-based planning pipeline. The framework is designed to support sustainability-oriented planning decisions in the face of two persistent realities of urban charging systems: extreme demand sparsity and multi-objective trade-offs among service effectiveness, equity, and infrastructure feasibility.
Three conclusions can be drawn. First, planning-relevant demand forecasting must be hotspot-aware in the presence of sparse demand tensors. Because only a small share of grid–time observations are active, overall metrics alone can overstate performance for siting purposes. Using a dual evaluation protocol, the ConvLSTM with attention yields the most reliable planning signal, resulting in improved hotspot-oriented accuracy.
Second, mobility exposure and realized charging demand should be treated as complementary rather than interchangeable. Trajectory-derived layers supply ubiquitous citywide information about structural activity and accessibility patterns that remain visible even in demand-sparse zones. The moderate hotspot overlap between mobility and demand suggests that optimizing solely on demand risks reinforcing existing utilization patterns. In contrast, mobility-informed objectives can better identify structurally connected yet underserved locations.
Third, equity improvements are not a by-product of coverage expansion, and transparent trade-off management is therefore essential. The NSGA-II Pareto portfolios show persistent equity–cost tension under realistic constraints. TOPSIS provides an interpretable mechanism to select implementable compromise solutions from the Pareto set under an explicit weighting scheme, enabling scenario-based decision support and phased deployment comparisons.
Several limitations motivate future research. Demand is reconstructed from a limited temporal window, and longer observation horizons would strengthen generalizability. Mobility trajectories and charging observations are not temporally aligned; future work should incorporate temporally matched mobility data to support stronger contemporaneous accessibility inference. Finally, the infrastructure feasibility term is represented by a proxy based on road-network distance to substations; incorporating grid-capacity constraints, land feasibility, and station-type heterogeneity would improve implementation realism and may reshape the equity–cost frontier.
Overall, the proposed framework provides a reproducible pathway to integrate spatio-temporal prediction, mobility opportunity structures, and multi-objective optimization for equitable and operationally effective charging network expansion.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/su18010497/s1, Table S1: Hyperparameter settings for the TDAE model; Table S2: Detailed hyperparameter settings and constraint configurations for the multi-objective optimization model (NSGA-II).

Author Contributions

Conceptualization, Z.L.; methodology, Y.H. and H.R.; software, H.R.; validation, Y.H., H.R. and Z.L.; formal analysis, Y.H. and Z.L.; investigation, Y.H. and H.R.; resources, D.C.; data curation, H.R. and Z.L.; writing—original draft preparation, Y.H.; writing—review and editing, Y.H., H.R., Z.L. and D.C.; visualization, H.R.; supervision, D.C.; project administration, Y.H.; funding acquisition, Y.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Ministry of Science and Technology of China, National Foreign Experts Project, “Research on the Planning and Layout of Electric Vehicle Charging Facilities” (Grant No. G2023027008L); the Doctoral Research Start-up Fund of Hubei University of Technology, “Coupling Mechanisms between Low-Altitude Economy Infrastructure and Urban Spatial Structure and Planning Responses” (Grant No. XJ2025001502); and the Xiangyang Industrial Research Institute of Hubei University of Technology, “Intelligent Design and Renovation of Industrial Buildings” (Grant No. XYYJ2023A07).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data underlying this article are available in the article.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Nomenclature

SymbolDefinition
Sets and Indices
i, jIndices for candidate sites and demand grid cells, respectively
G Set of all grid cells in the study area
S Set of selected charging station locations
K Target number of new charging stations to be deployed ( K = 15 , 30 )
t Time step index for spatiotemporal series
Input Data
D i Charging demand score for grid i (Derived from ConvLSTM)
A i Relative accessibility index for grid i (Derived from ME-TDAE)
P o p i Population count within grid i
d i j Euclidean or network distance between location i and j
C i Infrastructure cost proxy for grid i (Distance to power/roads)
Optimization Model
x i Binary decision variable: x i = 1 if grid i is selected, 0 otherwise
f c o v ( x ) Objective 1: Maximize Service Coverage
f e q ( x ) Objective 2: Minimize Equity Gap (Service Parity)
f c o s t ( x ) Objective 3: Minimize Infrastructure Cost Proxy
β Distance decay coefficient for coverage calculation ( β = 2.5 )
d m i n Minimum separation distance constraint ( d m i n = 1.0 km)
Algorithms
N p o p Population size in NSGA-II
N g e n Maximum number of generations in NSGA-II
ρ Spearman’s rank correlation coefficient

References

  1. International Energy Agency. More Than 1 in 4 Cars Sold Worldwide This Year Is Set to Be Electric as EV Sales Continue to Grow; IEA: Paris, France, 2025; Available online: https://www.iea.org/news/more-than-1-in-4-cars-sold-worldwide-this-year-is-set-to-be-electric-as-ev-sales-continue-to-grow (accessed on 14 November 2025).
  2. International Energy Agency. Electric Vehicle Charging—Global EV Outlook 2025; IEA: Paris, France, 2025; Available online: https://www.iea.org/reports/global-ev-outlook-2025/electric-vehicle-charging (accessed on 14 November 2025).
  3. Pierce, L.U.S. Charging Infrastructure Deployment through 2024 (Market Spotlight); International Council on Clean Transportation (ICCT): Washington, DC, USA, 2025; Available online: https://theicct.org/publication/us-charging-infrastructure-deployment-through-2024-apr25/ (accessed on 14 November 2025).
  4. Yenchamchalit, K.; Kongjeen, Y.; Bhumkittipich, K.; Mithulananthan, N. Optimal Sizing and Location of the Charging Station for Plug-in Electric Vehicles Using the Particle Swarm Optimization Technique. In Proceedings of the 2018 International Electrical Engineering Congress (iEECON), Krabi, Thailand, 7–9 March 2018; pp. 1–4. [Google Scholar] [CrossRef]
  5. Cintrano, C.; Toutouh, J.; Alba, E. Citizen Centric Optimal Electric Vehicle Charging Stations Locations in a Full City: Case of Malaga. In Advances in Artificial Intelligence—19th Conference of the Spanish Association for Artificial Intelligence, CAEPIA 2020/2021; Alba, E., Luque, G., Chicano, F., Cotta, C., Camacho, D., Ojeda-Aciego, M., Montes, S., Troncoso, A., Riquelme, J., Gil-Merino, R., Eds.; Lecture Notes in Computer Scienc; Springer: Cham, Switzerland, 2021; Volume 12882, pp. 247–257. [Google Scholar] [CrossRef]
  6. Hosseini, M.; Rahmani, A.; Hooshmand, F. A robust model for recharging station location problem. Oper. Res. 2022, 22, 4397–4440. [Google Scholar] [CrossRef]
  7. Chen, T.D.; Kockelman, K.M.; Khan, M. Locating electric vehicle charging stations: Parking-based assignment method for Seattle, Washington. Transp. Res. Rec. 2013, 2385, 28–36. [Google Scholar] [CrossRef]
  8. Kumar, B.A.; Jyothi, B.; Singh, A.R.; Bajaj, M.; Rathore, R.S.; Tuka, M.B. Hybrid genetic algorithm–simulated annealing based electric vehicle charging station placement for optimizing distribution network resilience. Sci. Rep. 2024, 14, 7637. [Google Scholar] [CrossRef] [PubMed]
  9. Altaf, M.; Hussain, S.; Khan, M. A PSO-based optimal placement of electric vehicle charging stations and distributed generation in radial distribution networks. IET Renew. Power Gener. 2024, 18, 1124–1136. [Google Scholar] [CrossRef]
  10. Pang, X.; Fang, X.; Yu, Y.; Zheng, Z.; Li, H. Optimal scheduling method for electric vehicle charging and discharging via Q-learning-based particle swarm optimization. Energy 2025, 316, 134611. [Google Scholar] [CrossRef]
  11. Zhang, W.; Shi, C. A multi-objective site selection of electric vehicle charging station based on NSGA-II. Int. J. Ind. Eng. Comput. 2024, 15, 271–284. [Google Scholar] [CrossRef]
  12. Cintrano, C.; Toutouh, J. Optimizing electric vehicle charging station placement integrating daily mobility patterns and residential locations. In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO ’24), Melbourne, Australia, 14–18 July 2024; ACM: New York, NY, USA, 2024; pp. 1456–1464. [Google Scholar] [CrossRef]
  13. Zapotecas-Martínez, S.; García-Nájera, A.; López-Jaimes, A. A multi-objective evolutionary approach for the electric vehicle charging stations problem. Expert Syst. Appl. 2024, 240, 122514. [Google Scholar] [CrossRef]
  14. Liu, F.; Xu, H.; Zhang, R. A multi-objective optimization model for fast electric vehicle charging stations with wind, PV power and energy storage. J. Clean. Prod. 2021, 288, 125564. [Google Scholar] [CrossRef]
  15. Mohammed, A.; Saif, O.; Abo, M.A.; Elazab, R. Multiobjective optimization for sizing and placing electric vehicle charging stations considering comprehensive uncertainties. Energy Inform. 2024, 7, 45. [Google Scholar] [CrossRef]
  16. Baouche, F.; Billot, R.; Trigui, R.; El Faouzi, N.E. Electric vehicle charging station location analysis for urban areas. Transp. Res. Part C Emerg. Technol. 2014, 45, 187–201. [Google Scholar] [CrossRef]
  17. Yi, Z.; Liu, X.H.; Wei, R. Electric vehicle demand estimation and charging station allocation using urban informatics. Transp. Res. Part D Transp. Environ. 2022, 106, 103264. [Google Scholar] [CrossRef]
  18. Dong, J.; Liu, C.; Lin, Z. Charging infrastructure planning for promoting battery electric vehicles: An activity-based approach using multiday travel data. Transp. Res. Part C Emerg. Technol. 2014, 38, 44–55. [Google Scholar] [CrossRef]
  19. Rane, N.L.; Achari, A.; Choudhary, S.P. An Integrated GIS, MIF, and TOPSIS Approach for Appraising Electric Vehicle Charging Station Suitability Zones in Mumbai, India. Sustain. Cities Soc. 2023, 97, 104717. [Google Scholar] [CrossRef]
  20. Charly, A.; Kumar, S.S. Identifying Optimal Locations for Community Electric Vehicle Charging. Sustain. Cities Soc. 2023, 94, 104573. [Google Scholar] [CrossRef]
  21. Wang, S.; Guo, J.; Qin, Y.; Wang, Y.; Zhang, Z. A spatio-temporal distribution prediction model for electric vehicles charging load in transportation-power coupled network. Sci. Rep. 2025, 15, 4022. [Google Scholar] [CrossRef]
  22. Al Momin, K.; Saikia, L.C. Application of multi-criteria decision analysis for optimal electric vehicle charging station site selection: A data-driven framework. Energy Rep. 2025, 11, 100345. [Google Scholar] [CrossRef]
  23. Abdelaziz, M.M.; El-Shahat, A. Multistage prediction approach of EVs charging load: Integration of deep neural network and feature-based models. Sci. Rep. 2025, 15, 21625. [Google Scholar] [CrossRef]
  24. Mutua, A.M.; de Fréin, R. Sustainable Mobility: Machine Learning-Driven Deployment of EV Charging Points in Dublin. Sustainability 2024, 16, 9950. [Google Scholar] [CrossRef]
  25. Suanpang, P.; Jamjuntr, P.; Kaewyong, P.; Niamsorn, C.; Jermsittiparsert, K. An Intelligent Recommendation for Intelligently Accessible Charging Stations: Electronic Vehicle Charging to Support a Sustainable Smart Tourism City. Sustainability 2023, 15, 455. [Google Scholar] [CrossRef]
  26. Huang, Y.; Ren, H.; Jia, X.; Yu, X.; Xie, D.; Zou, Y.; Chen, D.; Yang, Y. A Spatially Aware Machine Learning Method for Locating Electric Vehicle Charging Stations. World Electr. Veh. J. 2025, 16, 445. [Google Scholar] [CrossRef]
Figure 1. Spatial distribution of population and substations in Wuhan. (a) Population Distribution (b) Substations Distribution.
Figure 1. Spatial distribution of population and substations in Wuhan. (a) Population Distribution (b) Substations Distribution.
Sustainability 18 00497 g001
Figure 2. Citywide total demand time series: observed vs. model predictions.
Figure 2. Citywide total demand time series: observed vs. model predictions.
Sustainability 18 00497 g002
Figure 3. Spatial accuracy on daily peak demand: ground truth and prediction error. (a) Ground truth daily peak; (b) Prediction error (Pred -True) on daily peak.
Figure 3. Spatial accuracy on daily peak demand: ground truth and prediction error. (a) Ground truth daily peak; (b) Prediction error (Pred -True) on daily peak.
Sustainability 18 00497 g003
Figure 4. Trajectory-derived mobility indicators and relative accessibility. (a) Mobility intensity; (b) Relative accessibility.
Figure 4. Trajectory-derived mobility indicators and relative accessibility. (a) Mobility intensity; (b) Relative accessibility.
Sustainability 18 00497 g004
Figure 5. Sensitivity analysis of spatial hotspot identification. (a) Rank Correlation Matrix; (b) Top 10% Hotspot Consistency.
Figure 5. Sensitivity analysis of spatial hotspot identification. (a) Rank Correlation Matrix; (b) Top 10% Hotspot Consistency.
Sustainability 18 00497 g005
Figure 6. Spatial alignment between mobility exposure and charging demand hotspots.
Figure 6. Spatial alignment between mobility exposure and charging demand hotspots.
Sustainability 18 00497 g006
Figure 7. Pareto trade-offs and knee solutions under two deployment scales. (a) K = 15 (Strictly New + Neighborhood); (b) K = 30 (Strictly New + Neighborhood). The red star indicates the TOPSIS-selected best-compromise solution from the Pareto set.
Figure 7. Pareto trade-offs and knee solutions under two deployment scales. (a) K = 15 (Strictly New + Neighborhood); (b) K = 30 (Strictly New + Neighborhood). The red star indicates the TOPSIS-selected best-compromise solution from the Pareto set.
Sustainability 18 00497 g007
Figure 8. Recommended new-build charging station locations selected from the Pareto set. (a) K = 15 New-build Siting; (b) K = 30 New-build Siting.
Figure 8. Recommended new-build charging station locations selected from the Pareto set. (a) K = 15 New-build Siting; (b) K = 30 New-build Siting.
Sustainability 18 00497 g008
Figure 9. Performance comparison between the proposed MOO siting plan and baseline strategies.
Figure 9. Performance comparison between the proposed MOO siting plan and baseline strategies.
Sustainability 18 00497 g009
Table 1. Charging Pile Dataset Field Structure.
Table 1. Charging Pile Dataset Field Structure.
Field NameData TypeDescriptionValue Range
stationLatGcj02FloatCharging station latitude coordinate (GCJ02 coordinate system)30.35–31.37° N
stationLngGcj02FloatCharging station longitude coordinate (GCJ02 coordinate system)113.68–115.05° E
standard_timeDateTimeStandard timestamp14 February 2025 00:00 to 20 February 2025 00:00
powerFloatReal-time charging power (kW)0–200
priceFloatCumulative charging cost (Yuan)0–500
stationIdStringCharging station unique identifier-
portIdStringCharging pile unique identifier-
connectorStatusIntegerConnector occupancy status0: Idle (Available)
1: Charging (Occupied)
Table 2. Traffic Trajectory Dataset Field Structure.
Table 2. Traffic Trajectory Dataset Field Structure.
Field NameData TypeDescriptionValue Range
Vehicle_IDStringRide-hailing vehicle unique identifier
LongitudeFloatGPS longitude coordinate113.68–115.05° E
LatitudeFloatGPS latitude coordinate30.35–31.37° N
Data_Send_TimeDateTimeGPS positioning timestamp17 October 2019 00:00 to 19 October 2019 23:59
SpeedFloatInstantaneous speed (km/h)0–120
Direction_AngleFloatDriving direction angle (degrees)0–360
Passenger_StatusInteger0-Empty, 1-Occupied{0, 1}
Operation_StatusInteger0-Off-duty, 1-Operating{0, 1}
Table 3. Forecasting performance of demand prediction models.
Table 3. Forecasting performance of demand prediction models.
ModelMAE_ActiveRMSE_ActiveWMAPE_Active% WMAPE_Total_%
ConvLSTM2.5863.94119.8921.03
Persistence4.4228.44934.0234.64
HistAvg3.3156.05825.0334.67
Table 4. Representative selected cells from the TOPSIS compromise plan and their neighborhood-level attributes (K = 30).
Table 4. Representative selected cells from the TOPSIS compromise plan and their neighborhood-level attributes (K = 30).
Location
(gx, gy)
Neighborhood Demand (Sum)Neighborhood Demand (Max)Density ProxyAccess RelSubstation Dist (km)Population
(57, 57)161.13381.0671.7360.3912.98510.7k
(68, 55)140.40062.2673.3330.4201.42310.3k
(72, 51)122.60054.6671.0970.3351.53110.3k
(61, 57)113.33334.7332.2780.4081.25119.8k
(71, 53)143.73354.6672.0830.4041.71111.5k
(65, 73)131.73334.2002.0140.4022.11127.0k
(62, 68)120.66742.2003.5000.4212.97827.4k
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Huang, Y.; Ren, H.; Liu, Z.; Chen, D. A Hierarchical Spatio-Temporal Framework for Sustainable and Equitable EV Charging Station Location Optimization: A Case Study of Wuhan. Sustainability 2026, 18, 497. https://doi.org/10.3390/su18010497

AMA Style

Huang Y, Ren H, Liu Z, Chen D. A Hierarchical Spatio-Temporal Framework for Sustainable and Equitable EV Charging Station Location Optimization: A Case Study of Wuhan. Sustainability. 2026; 18(1):497. https://doi.org/10.3390/su18010497

Chicago/Turabian Style

Huang, Yanyan, Hangyi Ren, Zehua Liu, and Daoyuan Chen. 2026. "A Hierarchical Spatio-Temporal Framework for Sustainable and Equitable EV Charging Station Location Optimization: A Case Study of Wuhan" Sustainability 18, no. 1: 497. https://doi.org/10.3390/su18010497

APA Style

Huang, Y., Ren, H., Liu, Z., & Chen, D. (2026). A Hierarchical Spatio-Temporal Framework for Sustainable and Equitable EV Charging Station Location Optimization: A Case Study of Wuhan. Sustainability, 18(1), 497. https://doi.org/10.3390/su18010497

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop