1. Introduction
With the increasing penetration of renewable energy and the growing need for low-carbon operation, modern energy systems are becoming more integrated and flexible. IESs (shown in
Figure 1) coordinate electricity, heating, cooling, and other energy carriers to improve energy efficiency and operational resilience [
1,
2]. In practice, multi-energy loads exhibit significant coupling relationships and complex dynamic patterns, which are influenced by meteorological and temporal factors. Therefore, multi-energy load forecasting in IESs is more difficult than single-energy forecasting. Accurate load forecasting is important for the secure, stable, and coordinated operation of IES [
3].
Early load forecasting was largely based on statistical models. In univariate forecasting tasks, traditional machine learning models such as ARIMA and SARIMA perform predictions by modeling statistical characteristics of time series [
4]. Meanwhile, Support Vector Machines and Random Forests incorporate exogenous variables to improve model expressiveness and forecasting accuracy [
5,
6]. However, these models depend on manual feature engineering struggling to capture nonlinear coupling relationships among multiple variables. Furthermore, they are prone to error accumulation in multi-step forecasting, limiting their applicability in complex load scenarios.
As deep learning has advanced, neural network methods have been broadly adopted for multi-energy load forecasting [
7,
8]. Early research focused on univariate forecasting, using models such as RNN and LSTM to capture temporal dependencies [
9,
10]. Subsequently, the research expanded to single-step and short-term forecasting tasks, improving accuracy through architectural optimization [
11,
12]. Furthermore, multivariate multi-step forecasting methods were proposed, enabling joint modeling of multi-energy loads by incorporating meteorological and temporal features [
13,
14,
15,
16]. Although deep learning improves forecasting performance, it still has limitations in capturing long-term dependencies and complex dynamic patterns [
17].
In recent years, Transformer-based models have been widely used in time series forecasting [
18,
19,
20]. Zhou et al. proposed Informer, which reduces computational complexity via the ProbSparse attention mechanism and enables efficient long-sequence modeling [
21]. Wu et al. introduced Autoformer, which enhances modeling of trend and periodic components via a series decomposition architecture [
22]. Zhou et al. further proposed FEDformer, which incorporates frequency-domain modeling into the Transformer framework to extract critical spectral features [
23]. These methods achieve high accuracy in multivariate multi-step forecasting tasks. However, these Transformer-based models rely on complex attention mechanisms, leading to large parameter scales and high computational costs. They are also prone to overfitting in noisy or non-stationary environments.
In contrast, RNN-based models are effective in modeling local temporal dynamics in time series. Cho et al. developed forecasting models based on the GRU structure, improving computational efficiency through simplified gating mechanisms [
10]; Zeng et al. proposed SegRNN, which enhances local pattern modeling capabilities via sequence segmentation [
24]. In addition, some studies adopt multi-layer architectures or multi-task learning to improve the representational capacity of multivariate sequences [
13]. However, although RNN-based methods perform well in short-term forecasting, their recursive structures suffer from information decay in long-sequence modeling, and their sequential computation limits training efficiency.
Moreover, various hybrid models and probabilistic modeling methods have been proposed to further improve forecasting performance. Lai et al. proposed LSTNet, which combines convolutional with recurrent neural networks to capture both local and long-term dependencies [
13]. Lim et al. introduced the Temporal Fusion Transformer (TFT), achieving multi-source information fusion via a variable selection mechanism [
14]. Salinas et al. developed DeepAR to model time series uncertainty from a probabilistic perspective [
25]. Song et al. used a hierarchical multi-task learning model for multi-energy load forecasting [
26]. Although these methods excel in multi-source information integration, their structures are complex and sensitive to parameter tuning. Consequently, they remain limited in capturing complex multi-energy coupling relationships.
Overall, existing methods still have three limitations. First, time-decomposition methods mainly focus on trend and seasonal components, but they do not fully exploit frequency-domain periodic information. Second, frequency-enhanced methods can extract periodic patterns, but they often lack explicit modeling of local fluctuations and trend evolution. Third, some high-performing models rely on complex structures, leading to higher computational costs. These limitations leave a clear problem: how to jointly model trends, fluctuations, and periodic patterns in a lightweight framework. To address this issue, this study proposes FTimeDD, a time–frequency collaborative model for multi-energy load forecasting. It integrates time-domain trend–fluctuation disentanglement with frequency-domain periodic enhancement to capture multi-scale dynamics in multi-energy loads. The study’s primary contributions are as follows:
- (1)
A multi-energy load forecasting model, FTimeDD, is developed to handle multivariate inputs and multi-horizon outputs, enabling unified modeling of multi-energy loads.
- (2)
A dual-path decoupling mechanism is designed to achieve time–frequency collaboration. The time-domain path captures trend evolution and local fluctuations. The frequency-domain path enhances dominant periodic patterns.
- (3)
A compact multi-scale fusion module is developed to integrate time- and frequency-domain representations. This module learns cross-load interactions while controlling the parameter scale.
- (4)
Experiments on the ASU Integrated Energy System dataset verify the effectiveness of FTimeDD. The results show lower forecasting errors than several baseline models across different horizons, with competitive computational efficiency.
2. Problem Definition and Feature Analysis
2.1. Problem Definition
In IES, multi-energy load forecasting aims to characterize deep coupling relationships among multiple loads based on historical observations, and thereby predict future load variations. This task can be formulated as a multivariate multi-step time series forecasting problem [
27].
Let the input historical multivariate sequence be
, where
B denotes the batch size,
denotes the look-back window length, and
represents the input feature dimension. The objective is to predict the multi-energy load states
over a forecasting horizon of
time steps by learning a mapping function
f. Here,
corresponds to the three target loads: electricity, cooling, and heating. The forecasting process is formulated as:
In the training process, we minimize the error between the predicted sequence
and the ground truth
by
, a set of parameters to be learned. In this study, Mean Absolute Error (MAE) is adopted as the optimization objective:
By minimising this loss function, the model parameters are iteratively updated to achieve accurate prediction of multi-energy load sequences.
2.2. Feature Analysis
This study utilizes the public Integrated Energy System dataset from Arizona State University (ASU) in Tempe. The dataset spans from 2021 to 2023 with a 1 h sampling interval. Electricity, cooling, and heating loads are used as target variables. Meteorological and temporal variables, including ambient temperature, sea-level pressure, wet-bulb temperature, day of week, and greenhouse gas emission index, are used as auxiliary variables. These features cover the primary external factors affecting load variations, enabling the model to better describe complicated energy demand patterns while providing a reliable data foundation for modeling long-term dependencies.
To investigate the dynamic relationship between multi-energy loads and exogenous variables, a time series analysis is conducted, as shown in
Figure 2.
Overall, all three load types exhibit distinct periodic patterns superimposed with non-stationary fluctuations. Cooling and heating loads show strong temporal consistency with temperature variations, exhibiting synchronization during high- and low-temperature periods. This indicates that temperature and temporal features are key driving factors for these loads. In contrast, electricity load exhibits more complex dynamic characteristics, with significant random fluctuations superimposed on its periodic patterns.
The above analysis suggests that multi-energy load sequences contain long-term trends as well as short-term variations, with varying impacts from external factors across different loads. To further quantify and validate variable correlations, Grey Relational Analysis (GRA) and Spearman rank correlation are employed to evaluate the relationships between multi-energy loads and auxiliary features, as shown in
Figure 3a and
Figure 3b, respectively.
GRA characterizes the strength of association by measuring the trend consistency across sequences. It is adapted for nonlinear and non-stationary time series in this study. The grey relational coefficient is defined as:
where
is the reference sequence,
is the comparison sequence, and
is the resolution coefficient (typically 0.5). The grey relational grade is obtained by averaging the coefficients across time steps; a higher value represents a stronger correlation.
The upper triangle in
Figure 3a displays the relational grades between auxiliary features and target loads. All values are above 0.6, indicating that incorporating meteorological and time characteristics into the model can significantly improve its representational ability. Specifically, temperature shows strong correlation with cooling and heating loads, while the day-of-week feature significantly influences electricity load. These variations reflect that load variations are driven by multiple factors. Thus, multi-dimensional feature fusion is essential to improve model expressiveness [
28]. The fitting results in the lower triangle reveal complex nonlinear coupling relationships among variables. This further suggests that traditional linear methods struggle to adequately capture such relationships, necessitating the introduction of models capable of nonlinear representation. The histograms on the diagonal illustrate the statistical distributions of features, highlighting distinct differences in value ranges and shapes. This indicates that auxiliary features are complementary rather than redundant in describing load variations.
To further support the GRA-based analysis, Spearman rank correlation was used as a complementary method. GRA measures the similarity of variation trends, while Spearman correlation evaluates monotonic relationships among variables. The Spearman correlation heatmap is shown in
Figure 3b. The results show that electricity and cooling loads show a strong positive correlation, with a coefficient of 0.82, while heating load is negatively correlated with electricity and cooling loads, with coefficients of −0.75 and −0.81, respectively. Temperature-related variables are positively correlated with electricity and cooling loads but negatively correlated with heating load, which is consistent with the opposite seasonal patterns of cooling and heating demands. In addition, DOW and GHG also show noticeable correlations with the loads, indicating that temporal and exogenous variables contain useful forecasting information. These results are consistent with the GRA analysis and further support the use of meteorological and temporal variables as auxiliary inputs.
Overall, the GRA and Spearman analyses provide consistent evidence for the complex relationships in multi-energy loads. Temperature variations are closely related to cooling and heating loads, while electricity load is also affected by temporal and exogenous factors. This reflects coupling relationships and complex dynamic patterns between different loads. Therefore, it is necessary to extract key information from multi-dimensional features and perform decomposition-based modeling to enhance the representation of complex load patterns.
3. Method
This paper proposes FTimeDD, a time–frequency collaborative modeling framework for multi-energy load forecasting. It captures multi-scale features by jointly modeling sequences from the time and frequency domains. It employs a dual-path parallel architecture, consisting of three core modules: the Frequency Filtering Module (FFM), the Dual-stream Disentanglement Module (DSDM), and the Multi-scale Feature Fusion Module (MSFFM). The framework consists of five stages: input preprocessing, time-domain feature extraction, frequency-domain modeling, dual-path feature fusion, and forecast generation. Specifically, the historical multivariate sequence is first processed by the FFM to reduce noise and alleviate non-stationarity. The denoised sequence is then sent to two parallel paths. The time-domain path extracts trend and fluctuation features, while the frequency-domain path captures dominant periodic information. The MSFFM fuses the two types of features and generates the final forecasts for electricity, cooling, and heating loads, as illustrated in
Figure 4.
3.1. Frequency Filtering Module (FFM)
To address high-frequency noise and non-stationary distributions in raw load data, the Frequency Filtering Module (FFM) is employed for frequency-domain preprocessing. The module integrates low-pass filtering (LPF) and complex-valued linear layers, utilizing frequency-domain interpolation to suppress noise and enhancing periodic features, providing high-quality inputs for subsequent forecasting.
First, the input data undergoes reversible instance normalization (RIN) [
29]. By computing the mean
and standard deviation
, the input sequence is converted into a normalized distribution
Z with a zero-mean and unit-variance distribution, improving the modeling of non-stationary sequences.
The normalized signal is mapped to the complex frequency domain via real Fast Fourier Transform (rFFT) to obtain complex frequency-domain features
:
where
is the total number of frequency points.
The model performs hard-threshold low-pass filtering (LPF) on the spectrum based on dominant frequencies
, thereby filtering out irregular dynamic interference. In this study,
is determined based on the look-back window and the daily periodicity of the load sequences. It is calculated as:
where
denotes the look-back window length,
denotes the number of time steps in one daily period,
H denotes the retained harmonic order, and
is an additional low-frequency margin. Since the dataset is sampled hourly,
. With
,
, and
, the cutoff is
. This cutoff is used to retain the dominant low-frequency components and daily-period-related harmonic information in the frequency filtering process. The LPF process is formulated as:
where
is a mask vector, with
if the frequency index
, and 0 otherwise. This process suppresses high-frequency noise and enhances the primary periodic structure, reducing noise interference and computational redundancy.
The core of the FFM is a complex-valued linear projection layer to perform frequency-domain interpolation and extrapolation. Unlike traditional zero-padding, this layer utilizes learnable complex weight matrices , which explicitly models amplitude and phase variations. It maps truncated low-frequency components to a spectral space aligned with the target forecasting horizon, deriving the required spectral features for future sequences.
Here,
and
represent the input and output dimensions of the complex-valued linear layer respectively, and
represents the complex bias vector. The relevant parameters and formulae are as follows:
Finally, the interpolation-enhanced frequency representation is converted to the time domain through inverse real FFT (irFFT), and subsequently rescaled to the original data scale using interpolation rate and inverse RIN (IRIN). The stationarized signal produced by FFM provides a robust representational foundation for subsequent time–frequency dual-stream modeling and improves stability in long-horizon forecasting.
The computational cost of FFM mainly comes from FFT and the complex-valued linear projection. The FFT operations have a complexity of . The projection layer is controlled by the retained dominant frequencies and the projected frequency dimension , with a complexity of . Since is much smaller than the full sequence length, the FFM remains compact while enhancing useful periodic information.
3.2. Dual-Stream Decoupling Module
The core architecture employs a dual-path parallel mechanism in both time and frequency domains, enabling multi-dimensional disentanglement and complementary feature extraction. The time-domain path (TDP) explicitly decomposes sequences into trend and seasonal components via moving average decomposition, capturing long-term evolution and short-term fluctuations, respectively. The frequency-domain path (FDP) adaptively weights the spectrum via a learnable filter to enhance dominant periodic components and suppress noise. This time–frequency joint modeling mechanism enhances the model’s ability to capture transient patterns and improves forecasting robustness in multi-energy coupling scenarios.
3.2.1. Time Domain Path
To disentangle long-term trends from short-term variations, our model adopts an additive decomposition strategy in the time domain to reduce modeling complexity. A Moving Average (MA) operator performs 1D average pooling on the denoised sequence to extract the trend component
, followed by a residual connection to obtain the seasonal component
.
reflects long-term evolution patterns, while
represents temporary fluctuations, thereby reducing the complexity of subsequent modeling. The formula is as follows:
The denotes the padding operation. The window size k is specifically configured for electricity, cooling, and heating loads to mitigate feature blurring in heterogeneous loads under a unified temporal resolution.
The disentangled components are fed into two identical Multi-Layer Perceptrons (MLPs) for independent evolution forecasting. Each MLP comprises two fully connected layers with LeakyReLU activation. It maps the input sequence onto future time steps
via linear projection. Finally, the predicted trend and fluctuation components are aggregated to obtain the global feature representation
for the TDP, where C is the hidden dimension. The specific process is as follows:
3.2.2. Frequency-Domain Path
The FDP utilizes spectral modeling to separate amplitude and phase information, achieving high-precision coupling feature extraction. This module uses the denoised sequence input and transforms it into frequency domain via rFFT to obtain complex-valued features
. To achieve disentanglement of steady-state and dynamic components, a learnable spectral filter
is introduced to adaptively redistribute spectral energy via spectral convolution:
The filter assigns higher weights to periodic low-frequency steady-state components to strengthen static coupling patterns and ensure consistency in long-horizon forecasting. Conversely, for high-frequency dynamic components caused by noise and transient interference, adaptive filtering is achieved via weight attenuation mechanisms. This mechanism mitigates overfitting issues commonly observed in traditional time-domain models. Element-wise spectral convolution in the frequency domain is equivalent to full-length circular convolution in the time domain, where the element-wise spectral filtering has a linear complexity with respect to the number of frequency point. Additionally, the FFT operations introduce a complexity of . Consequently, frequency-domain operations can capture global steady-state features more effectively. Its computational overhead is significantly lower than that of attention mechanisms in Transformer-based models . This design helps balance lightweight design and long-term modeling capabilities.
The filtered frequency-domain features
are restored to the time domain via irFFT, yielding refined temporal features. To further enhance the representational capacity of frequency-domain features and capture nonlinear phase evolution over time, a nonlinear evolution layer (self.model) is introduced. This module is composed of stacked linear layers and ReLU activations, formulated as follows:
where
is the final forecasting output of the FDP. It captures complex nonlinear phase-shift features by learning the phase evolution of frequency features over time. This compensates for the limited expressiveness of purely linear frequency-domain modeling while minimizing computations.
3.3. Multi-Scale Feature Fusion Module
This module performs deep integration of time-domain and frequency-domain features, combining temporal dependencies with periodic patterns. First, the model concatenates forecasting feature
from the time domain and
from the frequency domain. This integrates temporal dependencies with periodic patterns to provide a comprehensive feature space for global coupling learning.
After concatenating multi-dimensional features, a final linear mapping layer achieves feature fusion and forecasting output. This fully connected layer utilizes a high-dimensional linear projection matrix
and a bias term
to enable cross-channel interaction and automatically learn multi-energy flow coupling relationships. Upon inputting all forecasting features, each weight in the
matrix represents the influence of input components on the target load, automatically learning the interdependencies among all energy variables. The formulation is as follows:
The final output consists of three core channels—electricity, cooling and heating (); however, information from all channels participates in joint weight updates during training. This enables the model to leverage implicit coupling features from auxiliary variables to refine predicted trajectories of core loads in real time, achieving implicit auxiliary learning. Furthermore, the module maintains low computational complexity (), facilitating effective multivariate information interaction within a compact architecture.
4. Experiments
This study is conducted using the Integrated Energy System (IES) dataset published by Arizona State University (ASU) Tempe campus. The dataset is chronologically divided into training, validation, and test sets in an 8:1:1 ratio. This split preserves the chronological order of the forecasting task. The present validation is based on the ASU IES dataset, and separate tests on other datasets or specific seasonal subsets are not included. The look-back window is set to 168 h (7 days), and the forecasting horizons are 24, 48, 72, and 96 h. These settings are used to evaluate the model under different prediction lengths.
To avoid information leakage, all preprocessing statistics are estimated from the training set only and then applied to the validation and test sets. Outliers are identified using a box-plot-based quartile method and corrected via exponential smoothing to mitigate the impact of abnormal fluctuations on model training.
Missing data are imputed using linear interpolation. Additionally, to eliminate scale differences, all input variables undergo Min–Max normalization, thereby improving training stability and convergence efficiency.
During forecasting, exogenous variables are used only within the historical look-back window. The model does not use future meteorological observations or future target load values over the prediction horizon as inputs.
To evaluate performance, several typical time series forecasting models are selected as baselines, including ARIMA [
5], LSTM [
12], DLinear [
24], iTransformer [
30], TimesNet [
15], TimeXer [
31], TimeMixer [
32], SegRNN [
33], and Amplifier [
34]. All these models are retrained using the same training, validation, and test splits. The recommended configurations from official implementations are used as initial settings. For models with tunable settings, we conducted basic hyperparameter tuning on the validation set. The evaluation metrics comprise Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Mean Absolute Percentage Error (MAPE), and Average Correlation Coefficient (ACCR). These metrics assess model performance from the perspectives of error magnitude, relative error, and sequence correlation respectively.
The experiments are implemented using Python 3.10 and the PyTorch 2.5.1 framework. Parameter updates are performed utilizing the Adam optimizer. The key hyperparameters are set as follows: the dominant frequency cutoff = 68, the moving average window size k = 13, the hidden dimension C = 64, the learning rate , the maximum number of training epochs 100, and the random seed 2026. The same fixed random seed is used for neural-network models in the experiments to maintain a consistent comparison protocol. Early stopping is applied with a patience of 10 epochs based on the validation MAE. A learning-rate decay strategy reduces the learning rate by a factor of 0.5 if the validation loss does not improve for 3 consecutive epochs.
Runtime was measured under the same hardware environment for all models, and FLOPs were calculated using the same input setting and measurement protocol. The experiments were conducted on a workstation equipped with an Intel i7-13700KF CPU, Intel, USA and an NVIDIA GeForce RTX 4070 Ti SUPER GPU, NVIDIA, USA.
4.1. Comparative Experiments
4.1.1. Comparison of Fitting Curves
To intuitively evaluate the model’s fitting capability,
Figure 5 compares predictions for electricity, cooling, and heating loads with ground truth on the test set.
Overall, predictions (red line) closely match the ground truth (blue line), maintaining consistency in both periodic patterns and local fluctuations. For electricity load with complex fluctuations, the model maintains stable tracking during rapid changes, with no significant deviations. For cooling and heating loads, the model captures peaks and valleys accurately without noticeable lag.
In summary, the proposed method effectively captures dynamic patterns in the load sequence while maintaining the smoothness of the forecast results, demonstrating strong fitting capability and adaptability.
4.1.2. Comparison of Error Metrics
Quantitative analysis evaluates the reduction in MAE and RMSE achieved by FTimeDD relative to representative baselines, as shown in
Figure 6. Across the tested forecasting horizons, FTimeDD generally obtains lower errors than the compared baselines. This indicates that the proposed time–frequency collaborative structure is effective for reducing forecasting errors in multi-energy load prediction.
Figure 7 further compares multi-metric performance across electricity, cooling, and heating loads. FTimeDD achieves competitive or near-optimal results in most metrics, particularly for cooling and heating loads. This suggests that the method effectively captures periodic load characteristics. For an electricity load with more irregular fluctuations, the model still maintains competitive performance, indicating its ability to capture complex load variations.
Table 1 lists overall model performance and computational cost across different forecasting horizons. As the forecasting horizon increases, the prediction errors generally increase for most models, indicating the higher difficulty of long-horizon multi-energy load forecasting. FTimeDD shows the lowest overall MAE and MAPE at all forecasting horizons. For RMSE, FTimeDD obtains the best results at the 24, 72, and 96 h horizons, while Amplifier shows a slightly lower RMSE at the 48 h horizon. In addition, FTimeDD maintains a compact parameter scale and relatively low FLOPs. These results indicate that FTimeDD achieves a balance between forecasting accuracy and computational cost.
Table 2 presents the load-wise forecasting results for electricity, cooling, and heating loads. The results show that the model performance varies across load types. For electricity load, FTimeDD achieves lower MAE and MAPE at the 24, 48, and 72 h horizons, while Amplifier is more competitive at the 96 h horizon. For cooling load, several baselines remain competitive at longer horizons, which suggests that the cooling sequence contains strong seasonal regularity. FTimeDD maintains high ACCR values at the 72 and 96 h horizons, indicating that it can still preserve the temporal correlation of the cooling load. For heating load, FTimeDD achieves the lowest MAE at all horizons and obtains strong RMSE and MAPE results in most cases. Overall, FTimeDD demonstrates relatively balanced forecasting performance across different energy loads, rather than relying on improvements for a specific load type.
4.2. Ablation Study
To evaluate the contribution of each component, multiple ablation settings are constructed, including removing the frequency-domain filtering module (w/o FFM), removing the time-domain path (w/o Time domain Decoupling), removing the frequency-domain path (w/o Frequency domain Decoupling), removing the dual-stream decoupling module (w/o Dual-stream), removing exogenous variables (w/o Exogenous Variables), and removing reversible instance normalization (w/o RIN).
4.2.1. Module Necessity Analysis
Table 3 lists the ablation results for different model variants across horizons. The results show that the Full model achieves the lowest MAE and RMSE at all horizons, outperforming all variants. This indicates that the Full model provides the most stable error control. Each module contributes positively to performance and exhibits effective synergy.
As we can see, removing the time-domain path or dual-stream module leads to noticeable error increases, especially in long-horizon forecasting. This is because the time-domain path extracts trends, while dual-stream decoupling enhances multi-scale features expression via time–frequency collaborative modeling. Once these modules are removed, capturing long-term dependencies becomes difficult, which leads to model performance degradation. By contrast, removing the frequency-domain path mainly affects short-term forecasting, highlighting its role in capturing periodic features.
Further analysis shows that removing RIN leads to a clear degradation in prediction accuracy, especially at the 48 h and 96 h horizons. This indicates that normalization is important for alleviating the non-stationarity of multi-energy load sequences. Additionally, the w/o Exogenous Variables variant performs worse than the full model. This suggests that meteorological and temporal variables provide useful auxiliary information for multi-energy load forecasting.
4.2.2. Analysis of Parameter and Performance Relationship
The correlation of parameter scale and performance is analyzed in
Figure 8.
Figure 8a compares parameter scales across different forecasting horizons. While removing modules reduces the number of parameters, the overall change is only within several dozen K. This indicates that the parameter scale for each module is well-controlled and does not significantly increase model complexity.
Figure 8b shows error growth curves for different structural variants as the forecasting horizon expands. While MAE increases for all variants, the full model maintains the lowest level and flattest growth. This suggests that the main structural modules enhance the system’s robustness, while removing any module leads to faster error accumulation.
Figure 8c reveals differences in the relationship between parameter size and performance across different models: top-left models have fewer parameters but higher errors; top-right models have high parameters and high errors. However, FTimeDD (full model) occupies the optimal bottom-left region, indicating better efficiency-to-performance ratios. This suggests that improvements in our model stem from structural optimization rather than parameter scaling alone.
4.2.3. Module Contribution Analysis
Figure 9 illustrates the contributions of the main modules across different forecasting horizons. Overall, the contribution of each module varies with the prediction length. This is mainly because short- and long-horizon forecasting emphasize different temporal patterns.
At the 24 h horizon, removing the frequency-domain decoupling path causes the largest MAE increase, from 624.77 to 695.74. This indicates that short-term forecasting relies more on periodic and high-frequency fluctuation information, which is captured by frequency-domain modeling.
As the horizon becomes longer, trend evolution and accumulated temporal dependency become more important. At the 96 h horizon, removing the time-domain decoupling path increases MAE from 1038.73 to 1162.98, highlighting its role in modeling long-term trends. In contrast, the dual-stream decoupling module maintains a stable contribution in different forecasting horizons, because it can integrate trend-related information from the time domain with periodic information from the frequency domain. Although FFM shows smaller error variations, removing it consistently increases MAE and RMSE. This indicates that LPF-based frequency filtering and denoising provide more stable inputs for subsequent time–frequency modeling.
5. Conclusions
To address the challenges of complex multivariate coupling, strong non-stationarity, and difficulty in capturing long-term dependencies in integrated energy load forecasting, this paper proposes a time–frequency collaborative modeling method, termed FTimeDD. The method constructs a dual-path parallel architecture combining time-and frequency-domain modeling. It performs trend–fluctuation decoupling modeling in the time domain, while extracting dominant periodic features in the frequency domain, enabling effective representation of multi-scale dynamic information. This architecture improves the model’s representational capacity while maintaining computational efficiency, providing a lightweight solution for joint multi-energy load forecasting.
Experimental results indicate that the proposed method performs better than several representative baseline methods across different forecasting horizons. It achieves lower overall errors in most cases as the forecasting horizon increases. Ablation results further validate the necessity for each module. The time-domain path plays a key role in capturing long-term trends, the frequency-domain path facilitates extraction of periodic structures, and the dual-stream collaborative mechanism helps the model better represent complicated dynamic variations. In addition, RIN helps alleviate non-stationarity, while exogenous variables provide useful auxiliary information for multi-energy load forecasting.
The present study still has several limitations. The experiments are conducted under a fixed random seed, and multi-seed statistical evaluation is not included. The validation is also based on the ASU IES dataset with a chronological split, while tests on other datasets or specific seasonal periods are left for future work. Moreover, the dominant frequency cutoff is determined from the look-back window and daily-period setting, and the effect of other cutoff values has not been further examined. Future work will extend the evaluation from these aspects and further consider uncertainty estimation, peak-demand errors, and dispatch-oriented assessment.
Overall, the proposed model provides an effective and compact forecasting framework for complex multivariate load forecasting tasks. It shows potential to provide more accurate load forecasting information for integrated energy system operation and to support subsequent operation-related analysis.