1. Introduction
With the advancement of battery technology and the implementation of the “dual carbon” policy, electric vehicles (EVs) have been widely adopted in industrial, commercial, and civil sectors, while charging infrastructure in highway service areas across China has also experienced rapid development. However, EV charging load in highway scenarios exhibits strong randomness and uncertainty, as it is simultaneously influenced by traffic flow, vehicle state of charge (SOC), and user charging behavior. These complex dynamics make it challenging to achieve accurate predictions using traditional mathematical models, yet precise forecasting is critical for power system planning and real-time operation.
Recent studies have increasingly emphasized the necessity of incorporating spatial, temporal, and contextual factors into EV load forecasting. Yang et al. [
1] developed a spatiotemporal distribution model within a transportation–power coupled network, highlighting the interdependence between traffic flow and grid demand. Tang et al. [
2] proposed a weighted measurement fusion UKF algorithm to improve prediction accuracy across heterogeneous functional areas, while Feng et al. [
3] integrated traffic conditions and temperature, demonstrating the importance of external dynamic factors. Together, these works indicate that multi-source information fusion is essential for accurate load forecasting.
Complementary efforts have explored forecasting from diverse methodological perspectives. Arias et al. [
4] examined charging demand under realistic urban traffic networks, underscoring the influence of mobility patterns. Rassaei et al. [
5] employed statistical approaches to analyze residential charging demand in smart grids, while Iversen et al. [
6] applied a Markov decision process to optimize charging strategies, showing the potential of stochastic control in reducing system costs. Tang and Wang [
7] further introduced a probabilistic nodal demand model based on spatiotemporal EV dynamics, providing flexible representations of uncertainty. Collectively, these studies laid essential methodological foundations for probabilistic and data-driven approaches.
Building upon these foundations, more recent research has incorporated user heterogeneity, real-time dynamics, and infrastructure considerations. Bian et al. [
8] proposed a forecasting approach integrating user portrait information with traffic flow to capture evolving charging behaviors. Guo et al. [
9] developed a multithreaded acceleration framework to enhance computational efficiency in large-scale functional area prediction. Li et al. [
10] emphasized user behavior in spatiotemporal modeling, while Ma et al. [
11] investigated battery swap station location planning, highlighting the interplay between infrastructure layout and load forecasting. These works illustrate a clear trend toward integrating behavioral, spatial, and infrastructural dimensions.
At the same time, methodological advances have significantly enriched the modeling toolkit. Sun and Che [
12] applied a Monte Carlo-based framework to couple multiple information sources, improving probabilistic accuracy. Wang et al. [
13] enhanced SOC estimation and peak power prediction via unscented Kalman filtering, offering insights for load allocation. Li et al. [
14] proposed DiffPLF, a conditional diffusion model for probabilistic forecasting, while Pan et al. [
15] utilized a variational auto-encoder to generate data-driven EV load profiles. On a broader scale, Farhadi et al. [
16] provided a comprehensive review of stochastic modeling strategies, and Faridimehr et al. [
17] developed a stochastic programming approach linking forecasting with network design. At the system level, Boudina et al. [
18] evaluated the grid impacts of large-scale EV integration, emphasizing the need for accurate forecasting to ensure stability. Taken together, these works underscore the growing role of stochastic, probabilistic, and machine learning techniques in capturing the multifaceted dynamics of EV charging demand.
Nevertheless, traditional statistical methods, such as stochastic programming, remain limited in capturing dynamic user behavior and complex spatiotemporal dependencies. Although recent deep learning and probabilistic forecasting models, such as DiffPLF [
14], have advanced predictive accuracy and uncertainty quantification, they often lack the ability to explicitly represent hierarchical dynamics across multiple levels of demand formation, particularly in highway scenarios characterized by strong randomness.
To address these challenges, this paper proposes a hierarchical Markov chain Monte Carlo (H-MMC) framework for spatiotemporal EV charging load forecasting. The framework integrates Markov processes and Monte Carlo simulations with hierarchical modeling techniques to capture multi-level dynamics ranging from individual charging behavior to macro-level spatiotemporal patterns. By constructing a Markov chain converging to the target distribution and designing an inter-layer transfer mechanism, the framework progressively derives the load evolution process while reducing interference across dimensions. Comparative experiments against ARIMA, BP neural networks, random forest, and LSTM demonstrate that the proposed H-MMC achieves higher predictive accuracy in highway scenarios, significantly reduces forecasting errors, and improves model stability and interpretability.
The main contributions of this paper are summarized as follows:
Spatiotemporal coupling in hierarchical modeling—The HMMC framework explicitly decouples macro–meso–micro-factors, enabling finer-grained load prediction and reducing the confounding effects that plague monolithic models.
Markov-based probabilistic simulation—By integrating Monte Carlo sampling into a Markov chain framework, the method achieves stable convergence to the target distributions of individual features while preserving stochasticity, thereby better reflecting real-world uncertainties.
Scenario-based adaptability—The model incorporates heterogeneous operational scenarios, including seasonal traffic patterns, toll-free holiday surges, and extreme weather conditions, ensuring robustness and generalizability across diverse highway environments.
Comparative experiments with state-of-the-art methods—including ARIMA, BP neural networks, random forests, and LSTM networks—demonstrate that the proposed HMMC approach achieves higher predictive accuracy, reduces error variance, and enhances interpretability. These results suggest that the HMMC framework provides a viable and scalable solution for EV charging load forecasting in highway multi-energy systems, supporting more efficient charging station deployment, energy scheduling, and grid stability management.
3. Case Study
3.1. Parameter Settings
This study considers a combined section composed of four highway multi-energy systems, each containing one service area charging station and several toll station nodes, resulting in a total of sixteen stations along a west–east corridor (
Figure 1). All four service areas are equipped with fast-charging facilities, assumed to consist of 80 kW chargers. For instance, the K + 43 service area represents Charging Station No. 1, located 43 km from the starting point, with Stations No. 2, No. 3, and No. 4 distributed sequentially eastward along the main highway. Different vehicle types are assigned distinct charging powers, as summarized in
Table 2, and the EV penetration rate on the highway is assumed to be 18%, with a vehicle-type ratio of 6:3:1 (small, medium, and large), derived from Ministry of Transport of the People’s Republic of China statistics indicating that EVs account for about 20% of total highway traffic during peak travel periods [
27]. This value is conservatively adjusted downward to better represent average, non-peak conditions.
To capture realistic load characteristics for this section, the dataset was first constructed from 30 consecutive days of measured traffic and charging load data at a representative highway charging station. This base dataset was then expanded to incorporate diverse operating conditions. Specifically, four representative daily patterns—workdays, weekends, holidays, and extreme weather days—were extracted from empirical traffic flow images, normalized, and scaled to the observed peak values. Each day in the 30-day period was assigned a type consistent with the actual calendar, and the corresponding flow and load profiles were generated using Gaussian perturbations (≤3% of the mean flow) and Monte Carlo simulations (100 iterations per scenario). Vehicle composition, state-of-charge distributions, and user charging behaviors were modeled according to empirical ratios and behavioral patterns, ensuring that the generated data preserved the statistical properties of the measured dataset while extending its variability across traffic, weather, and vehicle scenarios. Based on these data, the charging loads of the four service area stations within the combined section were predicted, providing the foundation for comparative analysis in subsequent sections.
3.2. Results Analysis
According to hierarchical theory, the traffic flow characteristics of the macroscopic layer serve as conditional variables for the state transition matrices of the other two layers. Based on the frequency of occurrence in historical data, four representative scenarios—workdays, weekends, holidays, and extreme weather days—are selected in each simulation. These scenarios exhibit distinct trends and amplitudes in the charging load curves. The forecasting model is applied to predict the total charging load across the four highway service areas within this section, and the scenario-specific load profiles are illustrated in
Figure 2.
According to the predicted load results, it can be seen that the charging load changes in different scenarios are different. The charging load curves for holidays and rest days are similar, showing a single-peak distribution within the day and generally after 15:00 in the afternoon, when the charging load of the highway multi-energy system reaches its maximum value. However, due to the policy of toll exemption on highways during holidays, the traffic flow increases, so the charging load increases accordingly. The probability of occurrence in system operation on weekdays is the highest, and its load distribution presents a bimodal distribution, with peaks mainly at around 8 am and 17:00 pm. When extreme weather occurs, there is no obvious peak change in the load distribution, but its value is generally low. Given that the example in this article takes the Shandong Peninsula as an example, the frequency of disastrous meteorological outbreaks is not high, so the extreme weather is mainly heavy rain and snow weather, so the charging load still fluctuates. If more destructive weather conditions such as typhoons occur, the load in this scenario will be lower.
To prove the effect of charging load prediction, the historical data of the charging load of this road section is used for a comparison. The loads of four scenarios are classified, and typical curves are extracted through data analysis. Taking the charging station of service area No. 1 as an example, the load prediction curves and actual loads under four typical scenarios are compared to compare the accuracy of the prediction. The comparison between the predicted values and actual values of the charging load under four typical scenarios is shown in
Figure 3.
By comparing the four common prediction curves, it can be seen that they are very close to the actual load value in terms of trend and amplitude. In order to further prove the optimization effect of the hierarchical Markov chain Monte Carlo simulation method, this paper selected typical scenarios on weekdays and used ARIMA, the random forest method, a BP neural network, and an LSTM neural network to compare the prediction results, using evaluation indicators such as the MAPE and R
2. The ARIMA model was configured with orders (
p,
d,
q) and seasonal components
, where the optimal parameters were selected using AIC/BIC and estimated by maximum likelihood. The random forest baseline consisted of 500 regression trees with a maximum depth of 12, a minimum leaf size of 5, the mean squared error as the split criterion,
features per split, and bootstrap sampling with out-of-bag evaluation. The BP neural network was implemented as a multilayer perceptron with three hidden layers (with 128-64-32 neurons and ReLU activation), a dropout of 0.2, and L2 regularization
and trained using Adam (with a learning rate of 0.001 and a batch size of 128) for up to 200 epochs with early stopping. The LSTM model consisted of two stacked layers with 128 hidden units, a sequence length of 24, and a dropout of 0.2 and was trained with Adam (with a learning rate of 0.001, weight decay of
, gradient clipping of 1.0, and batch size of 64) for 150 epochs, using cosine learning rate decay and early stopping. The specific comparison is shown in
Table 3 and
Figure 4.
By comparing the results in the charts, it can be seen that, when dealing with the charging load of electric vehicles in highway scenarios under the influence of complex characteristics, the hierarchical Markov chain Monte Carlo simulation method can better take into account the time series characteristics and the coordination relationship of various parameters. Compared with methods such as auto-regressive models, random forests, and BP neural networks, the load data predicted by Markov Monte Carlo is closer to the actual load situation in terms of change trend. This is mainly because the model describes the distribution of each layer of features and sub-features, and this can better reflect the accuracy of its prediction in subtle time periods. From the evaluation index, this paper selects two indicators, the mean absolute percentage error and the coefficient of determination. It can be seen that the Markov Monte Carlo method has the lowest prediction data error (MAPE) and the highest coefficient of determination (R2). This result shows that the Markov Monte Carlo prediction method has a better fitting effect and higher prediction accuracy.
It can be observed in
Figure 5 that the charging load distributions of the four stations are generally consistent with the overall load curve predictions while also exhibiting scenario-specific differences. On workdays, the charging load demonstrates a bimodal distribution, with peaks concentrated around 8:00 and 15:00, reflecting typical commuting patterns. The loads across the four stations remain relatively balanced. On weekends, however, the distribution shifts to a unimodal pattern with a higher overall peak, and Station No. 2 records a significantly greater load than the others, suggesting a higher traffic density and concentrated travel demand along its corresponding OD path.
During holidays, the charging demand across all four stations increases, and the charging times become more concentrated. Notably, Stations No. 3 and No. 4 exhibit larger loads, indicating heavier traffic volumes along their respective road sections, likely linked to large-scale vehicle transfers and directional holiday travel patterns. Under extreme weather conditions, the charging loads of all stations decrease markedly, with reduced variability both across stations and throughout the day. Although the absolute demand is lower in this case, the temporal dynamics of load change still reveal the influence of shifting user decisions and traffic flow characteristics.
3.3. Feature Analysis
According to the theory of hierarchical Markov chain Monte Carlo simulation, the distribution of charging load corresponds to the probability distribution of traffic flow and affects the SOC distribution of individual vehicles under different traffic flow conditions and the charging behavior decision-making state of individual users. Taking the traffic flow density distribution as an example, five traffic flow states can be obtained through the state transition probability matrix of the Markov chain, namely, very low, low, medium, high, and very high. The transfer rule is that the car flow density cannot change abruptly and should be continuous in the peak stage. Therefore, the problem can be explained by drawing the Markov chain state transition matrix of the traffic flow state:
As can be seen in
Figure 6, the transfer of the traffic flow density from the initial state to the target state shows regularity, and the probability of transferring from an extremely low state to a high state is very small, which avoids the possibility of sudden changes in traffic flow during the valley period. This ensures that the load peak stage can be maintained for a certain period of time, thereby ensuring a gradual change from the peak value to the flat value. The introduction of the state transition probability matrix can effectively solve the actual characteristics of ignoring variables in time series prediction. By predicting the size of the state change probability, instead of predicting by regression fitting, it can avoid the prediction deviation caused by the prediction result being only related to the time series and enhance the authenticity of the prediction result. The Markov transition probability matrix of traffic flow shows the non-instantaneous nature of traffic flow changes, and its changes are stable. As a direct influencing feature of charging load, the stability of traffic flow can ensure the smooth change of prediction results.
The state transition probability matrix not only directly affects the traffic flow state at the macro-level but also indirectly affects the SOC of the vehicle. The charging load of each vehicle in any path is affected by its own SOC, and this state is different under different traffic flow conditions. For example, when the traffic density is too high, vehicles will travel off-peak, charge in advance, and change routes. For the same type of vehicle, assuming that the travel time is the same, the changes in its initial SOC and path distance will affect the SOC value of the individual vehicle passing through the charging station. This state can directly affect the choice of charging probability, and the overall impact is shown in
Figure 7.
Under the same driving distance, the higher the initial SOC of the electric vehicle, the greater the user’s tolerance for the remaining power, that is, the lower the threshold of the anxiety power and the lower the probability of charging; if the vehicle travels a shorter distance under the same initial SOC, the lower the threshold of the anxiety power, the lower the probability of charging, and vice versa.
The anxiety power can be valued through prospect theory, and, combined with the waiting time, it can be substituted into the logistics fitting function to calculate the specific probability of charging selection. However, the specific impact of the anxiety power threshold cannot be directly reflected in the formula. Therefore, by comparing the charging ratio and charging time of electric vehicles under different anxiety power values, the impact of the anxiety power coefficient can be intuitively reflected, as shown in
Figure 8.
It can be seen in the figure that, when the anxiety power increases, the charging ratio increases, and the average charging time decreases. This is mainly because the anxiety power reflects the user’s sensitivity to the minimum power. The threshold reflects the critical value for the user to choose to charge. If the value is too high, the user will choose to charge when the remaining power is high. However, since charging is chosen when the current remaining power is high, the charge state itself is high, so the charging time will be shortened.
4. Discussion
The proposed hierarchical Markov chain Monte Carlo (HMMC) method demonstrates significant advantages in EV charging load forecasting under complex highway scenarios. By explicitly integrating macro-level traffic flow states, vehicle SOC dynamics, and user charging decisions through layered state transition matrices, the model captures temporal dependencies while preserving cross-level interactions often neglected in traditional approaches. This hierarchical structure, combined with scenario-based simulations (weekdays, weekends, holidays, and extreme weather days), enables the model to reproduce realistic load patterns across diverse operating conditions. Comparative experiments further confirm its superiority, with HMMC achieving the lowest MAPE and highest R2 among benchmark models such as ARIMA, RF, BPNN, and LSTM. Moreover, the introduction of “range anxiety thresholds” and logistic-based behavioral modeling adds interpretability, linking user decision-making with observed charging demand.
Compared with other baseline models, the HMMC framework maintains a relatively modest computational burden. In the training phase, the workload grows linearly with the size of the historical dataset while also requiring additional processing for route decomposition and charging station allocation. During forecasting, the macroscopic level involves state updating and path allocation for each time step, whereas the microscopic level, if conducted at the vehicle scale, grows in proportion to the number of simulated vehicles. Despite these requirements, runtime efficiency is effectively ensured through vectorization and Monte Carlo parallelization. By contrast, LSTM models are computationally more expensive, as their training depends heavily on the number of iterations, network depth, hidden unit size, and time sequence length, often requiring hours of computation on standard hardware. Tree-based methods, such as CatBoost, also incur significant cost, as their complexity grows with dataset size, tree depth, and the number of trees. Taken together, these comparisons demonstrate that the HMMC framework achieves a favorable balance of computational efficiency, interpretability, and predictive accuracy, making it particularly suitable for large-scale highway charging demand applications.
Beyond transport systems, recent studies on enterprise energy consumption forecasting have shown the effectiveness of hybrid neural models [
28] and gradient boosting approaches, such as CatBoost, achieving up to 92% accuracy [
29], offering a complementary benchmark to our probabilistic HMMC framework. Despite these strengths, several limitations remain. First, the method relies heavily on detailed historical traffic, weather, and charging data, which may not be consistently available across regions, restricting large-scale applicability. Second, extreme weather scenarios in this study primarily account for heavy rain and snow; more disruptive events such as typhoons or earthquakes could induce discontinuous transitions not well represented by the current framework. Third, while range anxiety is incorporated, broader socioeconomic and psychological factors influencing user behavior remain simplified, limiting generalizability. Finally, the layered Monte Carlo simulation requires substantial computation, posing challenges for real-time applications or large-scale deployment.
In summary, the HMMC approach offers clear improvements in accuracy, adaptability, and interpretability, particularly in capturing peak load dynamics. However, its data dependence, simplified behavior modeling, limited extreme-event coverage, and computational cost highlight areas for further refinement. Future work should explore multi-source data fusion, richer behavioral representations, and efficient parallel computation to enhance the method’s scalability and practical impact.
5. Conclusions
This paper presents a systematic analysis of EV charging load characteristics in multi-energy systems and proposes a prediction model based on the HMMC method. By incorporating a hierarchical structure, the model effectively mitigates cross-dimensional interference among traffic flow, SOC, and user charging behavior. A spatiotemporal load prediction framework is established by deriving the traffic flow distribution via Markov chains and further modeling the joint distribution of the SOC and user behavior through inter-layer state transitions. The proposed model improves both the temporal stability and interpretability of charging load forecasting compared with conventional approaches. Benchmark studies against ARIMA, BPNN, RF, and LSTM confirm that the HMMC model achieves lower relative errors and higher accuracy in highway scenarios. These results demonstrate the effectiveness and reliability of the proposed method, highlighting its potential for practical deployment in multi-energy system planning and operation.
Future work will focus on extending the model’s applicability and adaptability. First, incorporating real-time traffic and weather data is expected to enhance prediction accuracy under dynamic conditions. Second, applying the model to urban distribution networks and integrating renewable energy uncertainty will broaden its practical relevance. Third, the scalability of HMMC will be investigated for large-scale EV penetration scenarios. Finally, coupling HMMC with advanced deep learning techniques may further improve its capability in capturing complex spatiotemporal patterns.