A Complex-Valued Neural Network Approach to Time Series Forecasting in Smart Grid Energy Systems

Aizenberg, Igor; Becchi, Lorenzo; Bindi, Marco; Intravaia, Matteo; Luchetta, Antonio

doi:10.3390/en19092247

Open AccessArticle

A Complex-Valued Neural Network Approach to Time Series Forecasting in Smart Grid Energy Systems

by

Igor Aizenberg

¹

,

Lorenzo Becchi

^2,*,

Marco Bindi

²

,

Matteo Intravaia

²

and

Antonio Luchetta

²

¹

Department of Computer Science, Manhattan University, New York, NY 10471, USA

²

Department of Information Engineering, Università degli Studi di Firenze, Via di Santa Marta 3, 50139 Firenze, Italy

^*

Author to whom correspondence should be addressed.

Energies 2026, 19(9), 2247; https://doi.org/10.3390/en19092247

Submission received: 4 April 2026 / Revised: 29 April 2026 / Accepted: 30 April 2026 / Published: 6 May 2026

(This article belongs to the Special Issue Artificial Intelligence in Modern Power and Energy Systems)

Download

Browse Figures

Versions Notes

Abstract

This work is devoted to the application of complex-valued neural networks based on the multilayer neural network with multi-valued neurons (MLMVN) for short-term electrical load forecasting in smart grid energy systems. Accurate forecasting is a critical component of energy management systems, as it directly impacts the efficiency of control and optimization strategies in increasingly distributed and stochastic environments. The proposed approach leverages the intrinsic properties of complex numbers to model periodicity and nonlinear relationships typical of load time series. A compact feedforward architecture with two hidden layers is adopted and combined with multiple preprocessing strategies, including unit circle encoding, Fourier transform representations, and hybrid feature mappings incorporating temporal information such as the day of the week. The performance of the proposed models is evaluated on real-world prosumer data and compared against two benchmarks: a seasonal persistence model and a Long Short-Term Memory network. Results show that MLMVN-based approaches achieve comparable or improved performance in terms of RMSE and error reduction capability, despite their lower architectural complexity. Fourier-based preprocessing methods demonstrate strong effectiveness in capturing underlying temporal patterns. These findings suggest that complex-valued representations provide a promising alternative to traditional deep learning approaches, offering a favorable balance between accuracy, interpretability, and computational efficiency in Smart Grid forecasting applications.

Keywords:

time series forecasting; smart grid energy systems; MLMVN; complex-valued neural networks

1. Introduction

The rapid changes in climate and geopolitics are increasingly affecting everyday life. These global transformations impact multiple layers of society, and among the most critical sectors involved is the energy system [1]. Traditional supply chains are under pressure due to international political instability, while conventional energy sources are becoming progressively unsustainable. Consequently, the need to increase efficiency in energy production, both in terms of cost and emissions, is becoming urgent. The green transition toward a system that relies more on renewable energy sources (RESs) represents a key strategy to mitigate these challenges [2].

However, changing the production mix alone is not sufficient. Energy demand is forecasted to increase in the coming years, further stressing the system. Moreover, renewable technologies such as photovoltaic and wind power are characterized by a high degree of uncontrollability. At the same time, the system is shifting from a centralized model with few large production plants to a configuration that is going to heavily rely on distributed energy resources (DERs). If not properly managed, this variability and distribution of resources may introduce additional challenges for grid stability and availability [3,4,5,6].

For this reason, improving efficiency and coordination in energy usage becomes essential [7]. The deployment of smart metering and smart control systems enables the coordinated management of distributed resources, including generation plants, storage systems, and loads, leading to the development of smart grids (SGs) [8].

Also in response to these challenges, recent European legislation, alongside a full package of regulations [9,10], introduced renewable energy communities (RECs) as a regulatory and organizational framework to promote local energy sharing and active participation of consumers and prosumers.

To achieve this objective, modern grid frameworks integrate hierarchical control systems such as power management systems (PMSs) and energy management systems (EMSs) [10]. The PMS operates in real time, monitoring and maintaining the network’s operational status. The EMS acts as a supervisory planner, scheduling and optimizing the use of available resources according to economic or energy-related objectives [11].

The primary goal of an EMS is to optimize the usage of flexible resources of a user, such as controllable loads and battery energy storage systems, according to a specified objective. These optimization processes rely on predictive information about future consumption and generation [12,13].

Such systems are typically implemented using a rolling-horizon approach forecasting and optimization are executed repeatedly over time. At each step, predicted trajectories are updated with the most recent measurements and the optimization problem is solved again [14,15]. Consequently, control performance is tightly linked to forecasting accuracy. Inaccurate forecasts lead to suboptimal decisions and reduce economic benefits. More precise predictions extend the effective foresight of the EMS and improve its ability to coordinate energy flows efficiently [16,17].

Climate change and the increasing occurrence of extreme events are making the EMS an essential asset for guarantee the correct function of the grid under such extreme condition by strengthening the resilience of the electric system [18].

Forecasting, referred to as predicting the future trend of a profile based on its historical trend and recent observations, is therefore a fundamental component of such systems [19]. Starting from historical measurements, it aims to predict the future evolution of a time series over a selected horizon. In electrical systems, predictions are typically performed with a sampling granularity between 15 min and one hour [20,21], depending on control requirements. Accurate short-term forecasting is essential because control effectiveness depends on the ability to anticipate future operating conditions.

Time series forecasting has been extensively studied and a common baseline for evaluating performance is the persistence model, which assumes that the next value equals the most recent measurement. In multi-step forecasting, the seasonal persistence model predicts each future point by repeating the value observed at the same hour of the previous day or week. Although simple, these baselines are often competitive, and any advanced method must demonstrate consistent improvements over them [22].

Energy consumption and production forecasting is a long-standing problem in power systems, yet it remains challenging, especially at fine temporal and spatial resolutions [23]. While forecasting renewable production, particularly for photovoltaic systems, largely depends on meteorological predictions, electrical load forecasting presents a different challenge. Load time series exhibit a strong stochastic component driven by human behavior, lifestyle variability, and external factors that are difficult to model explicitly [24]. As a result, load forecasting is regarded as a more complex and uncertain task, especially at the single-household level [25].

Classical statistical models such as ARMA, ARIMA, and ARIMAX have historically shown solid performance for time series with stable autocorrelation structures and identifiable seasonal patterns [26,27]. These models are computationally efficient and interpretable, making them suitable for environments with limited computational resources. However, their linear structure limits their ability to capture nonlinear dependencies and abrupt changes typical of prosumer electrical loads. Within the study of stochastic properties of load time series, an alternative approach consists in directly extracting statistical dependencies from historical data and approximating the autocorrelation structure of the process, leading to lightweight and easily implementable methodologies [28].

With the increase in computational power, artificial intelligence and deep learning approaches have become central in short-term load forecasting research. Techniques include feed-forward neural networks [29,30,31], recurrent neural networks [32], long short-term memory networks [33,34], hybrid models [35,36,37], and Bayesian [38] and transformer architectures [39,40]. Among these, LSTM networks are often considered state of the art due to their ability to model long-range temporal dependencies and nonlinear relationships.

Nevertheless, increased architectural complexity does not automatically translate into superior forecasting performance. This is especially true in low-aggregation settings such as buildings, households and small energy communities, where historical records are often limited and load profiles are more volatile [41]. Under these conditions, deep sequence models may generalize poorly or yield only marginal gains unless they are supported by global training, transfer learning or architecture-aware preprocessing [42]. Recent studies show that transformer and LSTM-based performance is highly sensitive to the training regime and data representation, rather than being uniformly superior across settings.

Moreover, the trade-off between predictive accuracy and operational efficiency remains central in applied energy forecasting: tree-based models such as LightGBM have recently outperformed several deep architectures in both accuracy and robustness [43], while lower-complexity models remain attractive for EMS integration because they enable faster retraining, lower latency and easier real-time deployment [44]. At the same time, this should not be interpreted as a blanket limitation of deep models, since efficient sparse-attention Transformer variants can achieve competitive accuracy with significantly faster inference [45]. These findings motivate the exploration of forecasting approaches that jointly optimize accuracy, robustness and computational efficiency rather than maximizing architectural complexity alone [46,47].

In this work, the authors investigate the application of multilayer neural network with multi-valued neurons to the electrical load forecasting problem. Unlike traditional real-valued neural networks, MLMVNs operate entirely in the complex domain, with both weights and activation functions being complex-valued. This formulation introduces additional expressiveness, enabling the model to capture intricate relationships even with relatively simple network architectures, whereas real-valued networks may require significantly more complex structures to achieve comparable performance. A detailed explanation of this type of network is provided in Section 2.1.

The objective of the authors is to evaluate the performance of this neural network in the context of load forecasting by designing a predictor capable of estimating the behavior of a time series over the next three days based on the previous three days. Section 2.2 describes the different preprocessing methodologies adopted to prepare the data before feeding it to the network. Section 3.1 presents the training procedure together with the hyperparameter tuning process. Finally, Section 3.2 reports and discusses the obtained results.

2. Materials and Methods

2.1. MLMVN and Proposed Architecture

The neural network tool used for solving a time series forecasting problem considered here is a multilayer feedforward neural network with multi-valued neurons (MLMVN) whose continuous formulation was given in [48]. The mapping from n inputs of a multi-valued neuron (MVN) to a single output is defined by n variables in the function

f (x_{1}, \dots, x_{n}) : O^{n} \to O

, where

O

denotes the set of points

e^{i φ}

located on the unit circle of the complex plane, or in the more general case by the function

f (x_{1}, \dots, x_{n}) : C^{n} \to O

, where C is a field of complex numbers. Thus, while an MVN input can be either a complex number located on the unit circle or an arbitrary complex-valued number, its output is located on the unit circle if the continuous MVN activation function is used. It is defined as:

P (z) = e^{j A r g (z)} = \frac{z}{|z|}

(1)

where

z = w_{0} + w_{1} x_{1} + \dots w_{n} x_{n}

represents the weighted sum (a complex number), and

A r g (z)

denotes the phase (argument) of

z

. Thus, the neuron output, in the continuous version (regression output) corresponds to the projection of the weighted sum onto the unit circle, as determined by the activation function. In neural networks based on MVNs, the learning process relies on the generalization of the error-correction rule rather than on an iterative minimization procedure. In general, the weight update rule is expressed as:

W_{r + 1} = W_{r} + \frac{C_{r}}{(n + 1) |z_{r}|} δ \bar{X}

(2)

where

δ

denotes the error term,

\bar{X}

is the component-wise reciprocal of the input vector, r indicates the learning step index,

n

is the number of neuron inputs,

W_{r}

and

W_{r + 1}

are the weight vectors before and after correction, respectively,

z_{r}

is a weighted sum on the step r, and

C_{r}

represents the learning rate (complex-valued in general, but in all known applications

C_{r} = 1

is used).

MLMVN is a multilayer neural network employing this type of neuron and has a standard feedforward architecture: neurons are arranged in layers, and each neuron’s output in a given layer feeds the corresponding inputs of neurons in the subsequent layer.

The adoption of MVNs as fundamental processing units introduces significant differences and advantages compared to the classical multilayer perceptron (MLP), as detailed in [48]. The canonical training algorithm for MLMVNs extends the same error-correction principle used for a single MVN, propagating the error backward to the hidden layers. For example, in a network with one output neuron and

M

hidden neurons in a single hidden layer, the global output error is defined as:

δ_{k}^{*} = D_{k} - Y_{k}

(3)

where

D_{k}

and

Y_{k}

denote the desired and actual network outputs, respectively.

For training neurons in the input layer, a local error term is employed:

δ_{k} = \frac{1}{M + 1} δ_{k}^{*}

(4)

The weights are corrected by adding an adjustment:

{\tilde{W}}_{i}^{k} = W_{i}^{k} + \frac{C_{r}}{(N_{i n} + 1) |z_{k}|} δ_{k} \bar{\tilde{Y_{i}}} i = 1, \dots, N_{i n}

(5)

where the tilde indicates a corrected weight,

N_{i n}

is the number of a respective neuron inputs,

C_{r}

is the learning rate,

Y_{i}

represents the actual output of the i-th hidden layer neuron (with the tilde indicating correction and the overbar indicating complex conjugation), and

|z_{k}|

is an absolute value of the current weighted sum of a respective neuron. The convergence of the process based on the previous learning rule is formally proven in [49].

In the case of a two-layer network with a single output, operating with real-valued inputs and output in the real domain, the real inputs

X_{R}

are converted into complex inputs

X

using:

X = e^{j n o r m (X_{R})}

(6)

where

n o r m (\cdot)

is a suitable function mapping the input data into the interval [0, 2π). The complex output of the final neuron is then converted back into a real value through the

a r g (\cdot)

function.

A major limitation of MLMVNs is their relatively slower training time compared to real-valued neural networks [49]. However, this drawback has been significantly alleviated by an improved training algorithm explained in [50], which introduces a batch algorithm based on QR decomposition of errors calculated over the whole dataset or a large batch of elements of a big learning set. In the same paper, the high effectiveness of MLMVNs for continuous function approximation is also demonstrated.

In our application the objective of the model is to forecast active power consumption for the next three days, using the consumption measured during the previous three days as an input, with an hourly sampling rate. So, the number of inputs

N_{I}

and the number of outputs

N_{O}

are taken equally to be 72.

With respect to the single layer format described above, if we have

N_{O}

outputs the change is simply in the fact that the error term is calculated as the sum

δ_{k}^{*} = \sum_{i = 1}^{N_{O}} δ_{i k}^{*}

over all output neurons, so the hidden-layer error aggregates contributions from every output error through the inverse outgoing weights.

The architectural choice shown in Figure 1 results in a compact yet expressive structure suitable for experimentation and follows an intentional design strategy:

The first hidden layer performs a compression of the input data, projecting the high-dimensional time series into a compact complex subspace; the second hidden layer expands this representation to reconstruct the temporal structure required for multi-day forecasting. This compression–expansion mechanism proved effective in extracting relevant patterns while keeping the overall complexity manageable.

2.2. Evaluated Data Processing

During the experimentation phase, several preprocessing strategies were assessed to determine their influence on the performance and stability of the MLMVN. Domestic load data are inherently noisy, highly irregular, and strongly influenced by daily and weekly periodicities. For this reason, preprocessing plays a critical role in shaping the input representation before it is passed to MLMVN.

The following subsections describe the different strategies explored, each designed to enhance the compatibility of the data with complex-domain operations and to facilitate the learning dynamics of the network. A notable advantage of a custom training code is the ability to visualize how information propagates through the layers of the MLMVN. Because the network operates in the complex domain, the transformations applied to the samples can be observed as rotations, distortions and twists within the complex plane.

This offers valuable insight into the internal behavior of the model, an uncommon but powerful feature in deep learning workflows. For every normalization approach considered, the evolution of the training sequences across the network layers will be shown. In the resulting plots, each “plane” corresponds to one sequence of the training set, enabling a direct visualization of how the MLMVN manipulates the data in the complex domain.

All the proposed methodologies implement a logarithmic normalization step. In this way we achieve a reduction in dynamics, given that electrical loads often have highly skewed distributions (daily/seasonal peaks vs. low nighttime values) and the logarithmic transform compresses large values and expands small ones. This prevents both the peaks from dominating the training and gradients from exploding or concentrating in a few regions; therefore, the problem becomes better conditioned for optimization.

Moreover, with the log transform, the network becomes more sensitive to relative (percentage) variations rather than absolute ones, linearizing the distribution.

2.2.1. Unit Circle Normalization

The first normalization strategy consists of encoding active power information into the phase of complex numbers. In Figure 2, a graphical representation of this approach is shown.

The preprocessing pipeline involves:

Applying a logarithmic compression to the load values, to reduce the dynamic range of the signal. Given an input sequence $S_{I n}$ the corresponding normalized sequence $S_{l o g}$ is defined as described in Equation (7).

S_{l o g} = \log (S_{I n} + 1)

(7)

Mapping the resulting values onto a sector of the unit circle, specifically within the interval $[0.35 π, 0.6 π]$ radians. This angular interval was selected empirically to ensure a good balance between resolution and phase separability. Given $θ^{M a x}$ and $θ^{m i n}$ are respectively the maximum and minimum boundaries of the considered angular sector, and $S_{l o g}^{M a x}$ and $S_{l o g}^{m i n}$ are respectively the maximum and minimum values observable in the training set post-log normalization, the corresponding “unit circle” sequence is defined as shown in Equation (8).

S_{U C} = S_{l o g} * \frac{θ^{M a x} - θ^{m i n}}{S_{l o g}^{M a x} - S_{l o g}^{m i n}} + θ^{m i n}

(8)

Figure 3 shows how the data are transformed by the network. In this case, the transformations experienced by the data as they propagate through the network layers are relatively simple. At a macroscopic level, the samples preserve their angular-sector shape after each layer and continue rotating along the unit circle.

Since in this normalization all information is encoded exclusively in the phase, the geometric evolution remains constrained to that sector. The output of the network eventually settles in the same angular region as the input, confirming that the MLMVN preserves and reshapes the phase structure.

2.2.2. Normalization via Fourier Transform

A second preprocessing approach consists of transforming the input time series using the fast Fourier transform (FFT). Working in the frequency domain naturally aligns with the mathematical structure of MLMVN, given its strong affinity with complex arithmetic and harmonic components.

The methodology is simple:

Apply the logarithmic compression;
Compute the FFT of the load profile;
Normalize the Fourier spectrum by its maximum magnitude;
Pass the resulting complex vector directly to the MLMVN.

As the information is already expressed in amplitude and phase, the format is inherently compatible with complex-valued operations. In Figure 4 an example of the effect of the use of such transformation on a time series sequence is shown.

Figure 5 shows how, in this configuration, the geometric transformation applied by the network becomes more significant: the FFT spectrum initially appears as a “pillar” in the complex plane, then the activations collapse into a compact angular sector on the unit circle after both hidden layers, before expanding again toward the final output. This behavior reflects how the network contracts and reorganizes frequency components during processing.

2.2.3. Simplified FFT Normalization

A variation of the previous method exploits two well-known properties of the Fourier transform:

The DC component (“zero” frequency coefficient) can be removed and later reintroduced after prediction;
On real-valued time-domain signals, the Fourier spectrum exhibits conjugate symmetry, meaning that only half of the coefficients carry unique information.

Following these principles, this preprocessing approach follows these steps:

Logarithmic compression;
FFT computation;
Removal of the DC component from the input;
Retention of only half of the spectrum;
Re-mirroring and reinsertion of the DC term in post-processing to reconstruct the full signal.

This reduces the input dimensionality and computational burden while preserving all relevant frequency information. Figure 6 shows an example of the use of this third method of preprocessing on the input data.

Geometrically, Figure 7 shows how the transformations across the MLMVN layers resemble those obtained in the full FFT method, though the points are less tightly concentrated around the unit circle due to the reduced spectral content.

2.2.4. Encoding the Day of the Week

To enrich the input and strengthen the representation of weekly seasonality, the day of the week was also integrated into the complex-valued input.

Complex numbers are naturally suited to encode two-dimensional information: one variable in the magnitude, the other in the phase.

The procedure is as follows:

The active power is mapped to the modulus of the complex number, using logarithmic compression to linearize its distribution;
The day of the week, encoded as an integer from 1 to 7, is mapped to the phase.

This angular encoding is particularly effective since the phase is a circular variable: the mapping covers the full

2 π

range, ensuring smooth transitions between adjacent days (e.g., from Sunday to Monday). In Figure 8 an example of the form of the magnitude and phase of the complex-valued input sequence obtained by using this methodology is shown.

Figure 9 shows how, in this configuration, the input dataset takes a seven-point star shape when represented in three dimensions. After passing through the first and second layers, this structure is transformed into a spiral, reflecting how the MLMVN manipulates the phase–magnitude coordinates. At the output, the seven-point star re-emerges, demonstrating the network’s ability to reconstruct the encoded weekly periodicity.

2.3. Benchmark Definition

To properly evaluate the performance of the developed MLMVN architectures, two benchmark models were identified. The first model is a seasonal persistence predictor, selected as the baseline to surpass. The second model is a long short-term memory (LSTM) network, used as the gold standard to match or approach.

These two benchmarks were deliberately chosen to bracket the expected performance range for short-term load forecasting. The seasonal persistence predictor represents the minimum acceptable threshold: any viable forecasting method must outperform a naive, training-free reference. The LSTM, on the other hand, represents the current gold standard for sequence modeling, offering a well-established and competitive upper reference. Together, they provide a meaningful evaluation framework, allowing the proposed architectures to be assessed in terms of both practical utility and competitiveness with state-of-the-art approaches.

The seasonal persistence baseline was defined on a weekly horizon: the predictor assumes that the future load value at each time step is equal to the value observed exactly one week earlier. This choice reflects the strong weekly periodicity often present in residential consumption patterns and provides a simple, yet surprisingly competitive, reference point.

The LSTM network was designed to exploit both active power measurements and the day-of-week index, allowing the model to internalize weekly seasonality. The day-of-week variable was transformed using its sine and cosine components, producing a smooth circular encoding and avoiding discontinuities between consecutive days, for example from Sunday to Monday. The active power was logarithmically normalized, as done in the MLMVN configurations, to compress the dynamic range.

Figure 10 shows the four-layer LSTM architecture, implemented in MATLAB, used in this study. The number of hidden units in the LSTM layer was set to 40, following empirical tuning aimed at balancing representational capacity and training stability.

2.4. Comparison Metrics

Defining as

{\hat{y}}_{h}

the response of the network at an input sequence

x_{h}

and

y_{h}

as the expected ground truth, the performance of all predictors, the MLMVN variants, the LSTM model and the seasonal persistence, was evaluated on the test set of

N_{T e s t}

samples using two complementary metrics:

RMSE, which quantifies the magnitude of prediction error across the forecast horizon. The metric was evaluated for each step of the forecasting horizon, showing how the error varies while the predicted points get closer to the present. To do so, the following formulation was applied for each step of the sequence:

R M S E_{h} = \sqrt{\frac{1}{N_{T e s t}} \sum_{i = 1}^{N_{T e s t}} {(y_{h, i} - {\hat{y}}_{h, i})}^{2}}

(9)

ERR (error reduction ratio), which evaluates the ability of the predictor to adapt its forecast trajectory as new data become available. $T$ denotes the number of steps between the moment the forecast was generated and the present time. To quantify how effectively the predictor reduces the absolute prediction error as the target samples in the sequence $y_{h}$ , forecast at a past step, become closer to the present, the following formulation is adopted to define the error reduction $y_{h}^{E R_{T}}$ in (10) and then to define the metric in (11):

y_{h}^{E R_{T}} = | y_{h} - {\hat{y}}_{h, T} |

(10)

E R R_{T} = \frac{1}{N_{S a m p l e s}} \sum_{k = 1}^{N_{S a m p l e s}} (y_{h, k}^{E R_{1}} - y_{h, k}^{E R_{T}})

(11)

2.5. Dataset

The methodologies were evaluated on the historical load data of the Ausgrid dataset [51]. This dataset collects load and production profiles of Australian prosumers with a time resolution of half an hour. From the available data, one full year was extracted and 127 profiles, those with the fewest missing values, were selected and then converted to an hourly resolution.

For each user, the time series was divided into monthly subsets of 30 days, corresponding to 720 h. The neural networks were trained using three adjacent months as the training set and the subsequent month as the test set. With the aim of exploiting information about the behavior of the previous three days to forecast the following 72 h, the dataset was organized into pairs of “past sequences” and corresponding “real future sequences”, each with a length of 72 time steps.

This entire procedure was repeated five times on different months for each user, resulting in a total of

N_{s} = 5 \times 127

different evaluated scenarios, each composed of 720 distinct input–output sequence pairs.

3. Results

3.1. Training Description and Tuning of the Hyperparameters

The software simulator is designed in MATLAB (2025b). The folder with all the material used is available on GitHub to allow the reproducibility of the experiments [52]. Each layer employs a complex logarithmic-normalization activation function, applied to the modulus of the complex-valued signals flowing through the network. This choice stabilizes the magnitude of activations, mitigates gradient explosion, and preserves the phase information that is essential when operating in the complex domain. Such activation functions are commonly used in MLMVN because they balance non-linearity with the need to maintain meaningful geometric transformations in the complex plane.

Training is performed using supervised learning with early stopping based on a validation set. A fixed 5% portion of the training dataset is always held out and used as a validation set to determine convergence and prevent overfitting. Training is halted when the validation loss fails to improve for a predefined number of epochs, ensuring that the model does not over-adapt to noise.

This architecture, although relatively simple in terms of depth and parameter count, enables the exploration of the unique representational capabilities of complex-valued neural networks and sets the foundation for the more advanced experiments described in the following sections.

To tune the number of neurons in the two layers of the architecture, a small subset of the dataset was used to train and preliminarily evaluate the RMSE of the first step of the forecast horizon. Such a tuning map, shown in Figure 11, has been drawn by repeating.

The training and the metric evaluation in multiple configurations, changing each time the number of neurons in the layers to cover as many cases as possible.

Note that each configuration of the network was tested for all the preprocessing methodologies, and the result is an average of all cases. The map presents a minimum, albeit not heavily defined, when the first hidden layer has 75 neurons and the second has 100 neurons, and this configuration was used for the rest of the tests.

3.2. Evaluation of the Metrics

To evaluate the performance of the proposed methods, all the methodologies previously discussed were implemented using the dataset sequences. To compare the performance of all the proposed MLMVN configurations against the two benchmark models, the metrics were evaluated over the entire length of the forecasted sequence for each of the

N_{s}

different scenarios.

Some of the following figures, show a comparison of the metrics across the different methodologies. In these cases, the different approaches are referred to as: Normalization on the Unit Circle (UCircle), Normalization via Fourier Transform (FFT), Simplified FFT Normalization (FFT2), Day of the Week Encoding (WeekD), Persistency (Pers) and LSTM.

In Figure 12 and Figure 13, the metric results are presented as the median values computed over the full output sequence across all scenarios. The choice of reporting median errors rather than mean values is intended to reduce the influence of heavy-tailed error distributions and to highlight the typical model performance.

In Table 1 and Table 2, the same results are expressed in terms of distributions. The chi-square distribution was then used to estimate the boundaries of the confidence intervals. Here,

x_{i}

represents the observed value,

\overset{ˉ}{x}

is the mean,

n

is the number of observations, and

d f = n - 1

denotes the degrees of freedom of the chi-square distribution. To remove outliers, the first and ninety-ninth percentiles of the resulting datasets were excluded for all methodologies and for all steps of the forecast horizon.

s = \sqrt{\frac{1}{n - 1} \sum_{i = 1}^{n} (x_{i} - \bar{x})}

(12)

σ_{1 - α} = [\sqrt{\frac{d f s^{2}}{χ_{1 - α / 2, d f}^{2}}}, \sqrt{\frac{d f s^{2}}{χ_{α / 2, d f}^{2}}}]

(13)

In this study,

α

is chosen equal to 0.05, corresponding to a confidence interval of 95%.

The comparison based on absolute values is useful for assessing the actual magnitude of the metrics; however, when used alone, it may be misleading when comparing different methodologies. To ensure a fairer comparison with the benchmark, the results were normalized using the persistence model as a reference. For each methodology, the RMSE values were processed independently for each hour of the forecasting horizon according to (14):

{R M S E}_{M e t h o d}^{r e l} (h) = \frac{{R M S E}_{M e t h o d}^{a b s} (h) - {R M S E}_{P e r s i s t e n c e}^{a b s} (h)}{{R M S E}_{P e r s i s t e n c e}^{a b s} (h)}

(14)

Such relative results are expressed as percentage in Figure 14 as median values, and in Table 3 as distributions with a confidence interval of 95%.

In Figure 15, the RMSE of the first step of the prediction error committed by each method is represented as a boxplot, showing it both in absolute and relative form.

Regarding the ERR, persistency is affected by a full incapacity for self-correcting, making it unsuitable as a reference for showing the results in relative form. For this reason, the LSTM has been used as the reference.

E R R_{M e t h o d}^{r e l} (h) = \frac{E R R_{M e t h o d}^{a b s} (h) - E R R_{L S T M}^{a b s} (h)}{E R R_{L S T M}^{a b s} (h)}

(15)

The results are presented once again as percentages, with the median values of the full horizon shown in Figure 16 and the distribution shown in Table 4.

4. Discussion

The overall results show that the methodologies based on the MLMVN achieve, for the considered case study, performance comparable to or even better than that of the LSTM, both in terms of RMSE and ERR, while relying on significantly simpler architectures. A lower RMSE indicates a smaller magnitude of the prediction error, whereas a higher ERR reflects a stronger ability of the network to correct its forecast trajectory as new data become available. These findings suggest that complex-valued representations may offer expressive modeling capabilities without the computational overhead typically associated with recurrent networks.

In particular, the results consistently highlight the promise of approaches based on the Fourier transform, which in some forecasting steps exhibit even lower RMSE values than the LSTM and a noticeably higher ERR. This indicates that such forecasting networks are expected not only to produce smaller errors overall, but also to show greater reactivity in correcting prediction mistakes.

While these results are encouraging, the proposed approach also presents limitations and open directions for future research. Further investigation is required to assess whether this behavior leads to tangible improvements in the complete control chain, and to validate the robustness and generality of these findings across different contexts.

Future work may proceed along three main directions.

First, a more detailed analysis of computational complexity and training time could provide additional insights into the scalability of the method, particularly in scenarios involving larger datasets or more complex architectures. Although training cost is not a critical factor in typical energy management workflows, where the time scale allows the models to be trained and updated almost offline, its systematic evaluation may become relevant in large-scale or adaptive frameworks.

Second, future research may explore the integration and direct comparison with more recent architectures, such as transformer-based and temporal convolutional network (TCN) models. However, such investigation should carefully account for deployment constraints, as these architectures often require higher computational and memory resources, which may limit their applicability in resource-constrained or edge environments.

Third, and most importantly, a key objective will be the integration of the proposed methods into a full closed-loop EMS control framework to evaluate their operative performance.

In conclusion, the proposed approach demonstrates promising performance within the considered application context, justifying further developments along the directions outlined above.

Author Contributions

Conceptualization, I.A. and L.B.; methodology, I.A.; software, I.A. and L.B.; validation, L.B., M.B. and M.I.; formal analysis, L.B. and A.L.; investigation, M.I.; resources, A.L.; data curation, M.B.; writing—original draft preparation, L.B.; writing—review and editing, A.L.; visualization, L.B.; supervision, I.A. and A.L.; project administration, A.L. All authors have read and agreed to the published version of the manuscript.

Funding

This project was funded by the European Union, Next Generation EU, Mission 4, Component 1, CUP J53D23000760006. The authors also acknowledge the financial support of the National Research Center in High-Performance Computing, Big Data and Quantum Computing (https://www.supercomputing-icsc.it/en/icsc-home/, accessed on 1 October 2024) foreseen within Mission 4 (Education and Research) of the “National Recovery and Resilience Plan” (NRRP) that is part of the Next Generation EU (NGEU) program (https://www.italiadomani.gov.it/en/, accessed on 1 October 2024).

Data Availability Statement

The data presented in this study were derived from the following resources available in the public domain: Ausgrid dataset, https://www.ausgrid.com.au/ (accessed on 1 October 2024); Italian energy market trends, https://www.gse.it/ (accessed on 1 October 2024); and solar irradiation data, https://joint-research-centre.ec.europa.eu/photovoltaic-geographical-information-system-pvgis_en (accessed on 1 October 2024).

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

RES	Renewable Energy Sources
DER	Distributed Energy Resources
SG	Smart Grids
REC	Renewable Energy Communities
PMS	Power Management System
EMS	Energy Management System
MLMVN	Multi-Level Multi-Valued Neuron
MVN	Multi-Valued Neuron
MLP	Multilayer Perceptron
FFT	Fast Fourier Transform
LSTM	Long Short-Term Memory
ARMA	AutoRegressive Moving Average
ARIMA	AutoRegressive Integrated Moving Average
ARIMAX	AutoRegressive Integrated Moving Average with eXogenous inputs
RMSE	Root Mean Square Error
ERR	Error Reduction Ratio

References

McNutt, M. Climate change impacts. Science 2013, 341, 6145. [Google Scholar]
Skea, J.; Shukla, P.; Reisinger, A.; Slade, R.; Pathak, M.; Al Khourdajie, A.; van Diemen, R.; Abdulla, A.; Akimoto, K.; Babiker, M.; et al. Mitigating Climate Change; The Intergovernmental Panel on Climate Change (IPCC): Geneva, Switzerland, 2022. [Google Scholar]
Muruganantham, B.; Gnanadass, R.; Padhy, N.P. Challenges with renewable energy sources and storage in practical distribution systems. Renew. Sustain. Energy Rev. 2017, 73, 125–134. [Google Scholar] [CrossRef]
Alsayegh, O.; Alhajraf, S.; Albusairi, H. Grid-connected renewable energy source systems. Challenges and proposed management schemes. Energy Convers. Manag. 2010, 51, 1690–1693. [Google Scholar] [CrossRef]
Grid Incident in Spain and Portugal on 28 April 2025, ICS Investigation Expert Panel, Factual Report. Available online: https://eepublicdownloads.blob.core.windows.net/public-cdn-container/clean-documents/Publications/2025/iberian-blackout/entso-e_incident_report_ES-PT_April_2025_06.pdf (accessed on 14 November 2025).
Fathollahi, A. Machine Learning and Artificial Intelligence Techniques in Smart Grids Stability Analysis: A Review. Energies 2025, 18, 3431. [Google Scholar] [CrossRef]
International Energy Agency. Executive Summary—Unlocking the Potential of Distributed Energy Sources. 2022. Available online: https://www.iea.org/reports/unlocking-the-potential-of-distributed-energy-resources/executive-summary (accessed on 1 April 2026).
Li, Y.R.; Nejabatkhah, F.; Tian, H. Smart Hybrid AC/DC Microgrids; John Wiley & Sons: Hoboken, NJ, USA, 2018. [Google Scholar]
European Commission. Renewable Energy Targets—Revised Directive (EU/2023/2413). 2023. Available online: https://energy.ec.europa.eu/topics/renewable-energy/renewable-energy-directive-targets-and-rules/renewable-energy-targets_en (accessed on 14 November 2025).
Fagetan, A.M. Transposition and Implementation of European Union Renewable Energy Legislation in France, Italy, and Germany: A Regulatory Perspective and a Comprehensive Analysis of Opportunities and Challenges. Laws 2026, 15, 3. [Google Scholar]
Marzband, M.; Sumper, A.; Ruiz-Álvarez, A.; Domínguez-García, J.L.; Tomoiagă, B. Experimental evaluation of a real time energy management system for stand-alone microgrids in day-ahead markets. Appl. Energy 2013, 106, 365–376. [Google Scholar]
Vinothine, S.; Widanagama Arachchige, L.N.; Rajapakse, A.D.; Kaluthanthrige, R. Microgrid Energy Management and Methods for Managing Forecast Uncertainties. Energies 2022, 15, 8525. [Google Scholar] [CrossRef]
Becchi, L.; Belloni, E.; Bindi, M.; Intravaia, M.; Grasso, F.; Lozito, G.M.; Piccirilli, M.C. A Computationally Efficient Rule-Based Scheduling Algorithm for Battery Energy Storage Systems. Sustainability 2024, 16, 10313. [Google Scholar] [CrossRef]
Mattingley, J.; Wang, Y.; Boyd, S. Code generation for receding horizon control. In Proceedings of the IEEE International Symposium on Computer-Aided Control System Design 2010, Yokohama, Japan, 8–10 September 2010; pp. 985–992. [Google Scholar]
Becchi, L.; Bindi, M.; Grasso, F.; Intravaia, M.; Lozito, G.M.; Luchetta, A. Distributed EMS Coordination via Price-Signal Control for Renewable Energy Communities. Energies 2025, 18, 6072. [Google Scholar] [CrossRef]
La Tona, G.; Di Piazza, M.C.; Luna, M. Effect of Daily Forecasting Frequency on Rolling-Horizon-Based EMS Reducing Electrical Demand Uncertainty in Microgrids. Energies 2021, 14, 1598. [Google Scholar]
Becchi, L.; Bindi, M.; Intravaia, M.; Grasso, F.; Pasqui, M.; Carcasci, C. Application of Control Algorithms for Battery Scheduling in Grid-Connected Energy Prosumers. In Proceedings of the 2024 IEEE 22nd Mediterranean Electrotechnical Conference (MELECON), Porto, Portugal, 25–27 June 2024; pp. 932–937. [Google Scholar]
Castañeda-Arias, N.; Díaz-Aldana, N.L.; Luna Hernandez, A.; Jutinico, A.L. Energy Management in Microgrid Systems: A Comprehensive Review Toward Bio-Inspired Approaches for Enhancing Resilience and Sustainability. Electricity 2025, 6, 73. [Google Scholar] [CrossRef]
Misiurek, K.; Olkuski, T.; Zyśk, J. Review of Methods and Models for Forecasting Electricity Consumption. Energies 2025, 18, 4032. [Google Scholar] [CrossRef]
Rehman, S.; Iqbal, N. Electricity consumption prediction in smart buildings using deep learning approaches. Energy Inform. 2025, 8, 131. [Google Scholar]
Sheng, W.; Liu, K.; Jia, D.; Chen, S.; Lin, R. Short-Term Load Forecasting Algorithm Based on LST-TCN in Power Distribution Network. Energies 2022, 15, 5584. [Google Scholar]
Bot, K.; Ruano, A.; Ruano, M.d.G. Short-Term Forecasting Photovoltaic Solar Power for Home Energy Management Systems. Inventions 2021, 6, 12. [Google Scholar] [CrossRef]
Qureshi, M.; Arbab, M.A.; Rehman, S. Deep learning-based forecasting of electricity consumption. Sci. Rep. 2024, 14, 6489. [Google Scholar] [CrossRef]
Abd’Azeez, T.A.; Olatomiwa, L. A machine learning-powered energy consumption prediction system with API. J. Electr. Syst. Inf. Technol. 2025, 12, 50. [Google Scholar] [CrossRef]
Gasparin, A.; Lukovic, S.; Alippi, C. Deep learning for time series forecasting: The electric load case. CAAI Trans. Intell. Technol. 2022, 7, 1–25. [Google Scholar]
Chen, J.F.; Wang, W.M.; Huang, C.M. Analysis of an adaptive time-series autoregressive moving-average (arma) model for short-term load forecasting. Electr. Power Syst. Res. 1995, 34, 187–196. [Google Scholar]
Huang, C.M.; Huang, C.J.; Wang, M.L. A particle swarm optimization to identifying the armax model for short-term load forecasting. IEEE Trans. Power Syst. 2005, 20, 1126–1133. [Google Scholar] [CrossRef]
Becchi, L.; Camerota, C.; Intravaia, M.; Bindi, M.; Luchetta, A.; Pecorella, T. Load Profile Generation for Robust Optimization: A Stochastic Approach Based on Conditional Probability Approximation. In Proceedings of the 2025 IEEE International Conference on Environment and Electrical Engineering and 2025 IEEE Industrial and Commercial Power Systems Europe (EEEIC/I&CPS Europe), Chania, Greece, 15–18 July 2025; pp. 1–6. [Google Scholar] [CrossRef]
Lee, K.Y.; Cha, Y.T.; Park, J.H. Short-term load forecasting using an artificial neural network. IEEE Trans. Power Syst. 1992, 7, 124–132. [Google Scholar] [CrossRef]
Park, D.C.; El-Sharkawi, M.A.; Marks, R.J.; Atlas, L.E.; Damborg, M.J. Electric load forecasting using an artificial neural network. IEEE Trans. Power Syst. 1991, 6, 442–449. [Google Scholar] [CrossRef]
Belhaiza, S.; Al-Abdallah, S. A Neural Network Forecasting Approach for the Smart Grid Demand Response Management Problem. Energies 2024, 17, 2329. [Google Scholar] [CrossRef]
Yelamali, S.N.; Byahatti, K. Electricity short term load forecasting using elman recurrent neural network. In Proceedings of the International Conference on Advances in Recent Technologies in Communication and Computing 2010, Washington, DC, USA, 16–17 October 2010; pp. 351–354. [Google Scholar]
Kong, W.; Dong, Z.Y.; Jia, Y.; Hill, D.J.; Xu, Y.; Zhang, Y. Short-term residential load forecasting based on lstm recurrent neural network. IEEE Trans. Smart Grid 2019, 10, 841–851. [Google Scholar] [CrossRef]
Dakheel, F.; Çevik, M. Optimizing Smart Grid Load Forecasting via a Hybrid Long Short-Term Memory-XGBoost Framework: Enhancing Accuracy, Robustness, and Energy Management. Energies 2025, 18, 2842. [Google Scholar] [CrossRef]
Wang, J.; Xue, S.; Lin, L.; Tan, B.; Huang, H. An Attention-Driven Hybrid Deep Network for Short-Term Electricity Load Forecasting in Smart Grid. Mathematics 2025, 13, 3091. [Google Scholar] [CrossRef]
Ali, S.; Bogarra, S.; Riaz, M.N.; Phyo, P.P.; Flynn, D.; Taha, A. From Time-Series to Hybrid Models: Advancements in Short-Term Load Forecasting Embracing Smart Grid Paradigm. Appl. Sci. 2024, 14, 4442. [Google Scholar] [CrossRef]
Guo, W.; Liu, S.; Weng, L.; Liang, X. Power Grid Load Forecasting Using a CNN-LSTM Network Based on a Multi-Modal Attention Mechanism. Appl. Sci. 2025, 15, 2435. [Google Scholar] [CrossRef]
Yang, Y.; Li, W.; Gulliver, T.A.; Li, S. Bayesian Deep Learning-Based Probabilistic Load Forecasting in Smart Grids. IEEE Trans. Ind. Inform. 2020, 16, 4703–4713. [Google Scholar] [CrossRef]
Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30, 6000–6010. [Google Scholar]
Zhao, H.; Wu, Y.; Ma, L.; Pan, S. Spatial and temporal attention-enabled transformer network for multivariate short-term residential load forecasting. IEEE Trans. Instrum. Meas. 2023, 72, 2524611. [Google Scholar] [CrossRef]
Zakynthinos, A.; Michalakopoulos, V.; Sarmas, E.; Marinakis, V. Transfer learning techniques on temporal fusion transformers for short-term building load forecasting under limited data conditions. Energy Build. 2026, 354, 116935. [Google Scholar] [CrossRef]
Hertel, M.; Beichter, M.; Heidrich, B.; Neumann, O.; Schäfer, B.; Mikut, R.; Hagenmeyer, V. Transformer training strategies for forecasting multiple load time series. Energy Inform. 2023, 6, 20. [Google Scholar]
Shiblee, M.F.H.; Koukaras, P. Short-Term Load Forecasting in the Greek Power Distribution System: A Comparative Study of Gradient Boosting and Deep Learning Models. Energies 2025, 18, 5060. [Google Scholar] [CrossRef]
Hernández, J.J.; Zapirain, I.; Camblong, H.; Barroso, N.; Curea, O. Real Implementation and Testing of Short-Term Building Load Forecasting: A Comparison of SVR and NARX. Energies 2025, 18, 1775. [Google Scholar] [CrossRef]
Chan, J.W.; Yeo, C.K. A Transformer based approach to electricity load forecasting. Electr. J. 2024, 37, 107370. [Google Scholar] [CrossRef]
Wood, M.; Matrone, S.; Ogliari, E.; Leva, S. Comparing peak electricity load forecasting models for an industrial and a residential building. Math. Comput. Simul. 2026, 240, 303–316. [Google Scholar] [CrossRef]
Musbah, H.; Elsaraiti, M. Forecasting load consumption: A comprehensive evaluation of deep learning and machine learning techniques. Electr. Power Syst. Res. 2025, 247, 111834. [Google Scholar] [CrossRef]
Aizenberg, I. Complex-Valued Neural Networks with Multi-Valued Neurons; Springer: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
Aizenberg, I.; Paliy, D.; Zurada, J.; Astola, J. Blur identification by multilayer neural network based on multivalued neurons. IEEE Trans. Neural Netw. 2008, 19, 883–898. [Google Scholar] [CrossRef]
Aizenberg, I.; Luchetta, A.; Manetti, S. A modified learning algorithm for the multilayer neural network with multi-valued neurons based on the complex QR decomposition. Soft Comput. 2012, 16, 563–575. [Google Scholar] [CrossRef]
Ausgrid Dataset. Available online: https://www.ausgrid.com.au (accessed on 14 November 2025).
GitHub Repository of the Code. Available online: https://github.com/TheBeccaccino/A-Complex-Valued-Neural-Network-Approach-to-Time-Series-Forecasting-in-Smart-Grid-Energy-Systems (accessed on 14 November 2025).

Figure 1. MLMVN proposed architecture.

Figure 2. Representation of unit circle normalization.

Figure 3. Unit circle preprocessing training data transformation: (a) 3D representation; (b) planar representation.

Figure 4. Example of the Fourier preprocessing.

Figure 5. FFT preprocessing training data transformation: (a) 3D representation; (b) planar representation.

Figure 6. Example of the simplified Fourier preprocessing.

Figure 7. Simplified FFT preprocessing training data transformation: (a) 3D representation; (b) planar representation.

Figure 8. Example of the processing with the day feature.

Figure 9. Day feature preprocessing training data transformation: (a) 3D representation; (b) planar representation.

Figure 10. Implemented LSTM architecture.

Figure 11. RMSE tuning map for the choice of neurons of the hidden layers.

Figure 12. Comparison of the resulting median RMSE in the absolute value of the different methodologies.

Figure 13. Comparison of the resulting median ERR in the absolute value of the different methodologies.

Figure 14. Comparison of the median of the RMSE in the relative value of the different methodologies.

Figure 15. Boxplot of some RMSE values expressed in absolute (a) and relative (b) form related to the first step of the forecasting horizon of the studied methods.

Figure 16. Comparison of the median of the ERR in the relative value of the different methodologies.

Table 1. Distribution of the RMSE along the forecasted horizon expressed in W.

$R M S E_{s t e p}$	$1$	$2$	$3$	$24$	$48$	$72$
$U C i r c l e$	203.3 $\pm_{157.7}^{141.1}$	226.5 $\pm_{168}^{150.3}$	219.1 $\pm_{157.7}^{141.1}$	215.5 $\pm_{151.6}^{135.7}$	217.5 $\pm_{154.2}^{138.0}$	220.3 $\pm_{153.2}^{137.0}$
$F F T$	163.2 $\pm_{79.0}^{70.7}$	185.1 $\pm_{97.3}^{67.1}$	191.2 $\pm_{103.0}^{92.2}$	193.0 $\pm_{108.1}^{96.7}$	195.0 $\pm_{108.8}^{97.3}$	197.3 $\pm_{110.9}^{99.2}$
$F F T 2$	163.8 $\pm_{81.1}^{72.6}$	184.5 $\pm_{97.9}^{87.6}$	190.7 $\pm_{104.5}^{93.5}$	192.7 $\pm_{109.6}^{98.1}$	194.8 $\pm_{109.3}^{97.8}$	197.7 $\pm_{112.9}^{101.0}$
$W e e k D$	179.2 $\pm_{96.0}^{85.9}$	192.5 $\pm_{105.0}^{93.9}$	196.1 $\pm_{109.2}^{97.7}$	195.6 $\pm_{109.6}^{98.0}$	196.4 $\pm_{109.4}^{97.9}$	199.3 $\pm_{112.7}^{100.9}$
$P e r s$	251.6 $\pm_{139.9}^{125.2}$	251.6 $\pm_{139.9}^{125.2}$	251.6 $\pm_{139.9}^{125.2}$	247.5 $\pm_{134.8}^{120.6}$	246.2 $\pm_{132.8}^{118.8}$	246.1 $\pm_{133.8}^{119.7}$
$L S T M$	175.6 $\pm_{91.6}^{81.9}$	185.6 $\pm_{98.1}^{87.8}$	190.2 $\pm_{102.5}^{91.7}$	191.8 $\pm_{105.9}^{94.7}$	193.7 $\pm_{106.8}^{95.6}$	199.4 $\pm_{110.9}^{99.2}$

Table 2. Distribution of the ERR along the forecasted horizon expressed in W.

$R M S E_{s t e p}$	$1$	$2$	$3$	$24$	$48$	$71$
$U C i r c l e$	13.7 $\pm_{13.4}^{14.9}$	11.9 $\pm_{11.1}^{18.6}$	13.0 $\pm_{12.8}^{21.7}$	14.8 $\pm_{14.1}^{22.9}$	17.9 $\pm_{16.3}^{28.9}$	21.3 $\pm_{20}^{29.4}$
$F F T$	13.2 $\pm_{11.1}^{12.4}$	16.9 $\pm_{14.8}^{16.6}$	18.6 $\pm_{16.7}^{18.7}$	20.8 $\pm_{20.2}^{22.5}$	22.3 $\pm_{21.1}^{23.6}$	23.9 $\pm_{22.6}^{25.3}$
$F F T 2$	12.6 $\pm_{10.8}^{12.1}$	16.4 $\pm_{14.5}^{16.2}$	18.1 $\pm_{16.9}^{18.9}$	20.7 $\pm_{19.0}^{21.2}$	22.3 $\pm_{20.2}^{22.6}$	24.9 $\pm_{22.5}^{25.2}$
$W e e k D$	8.0 $\pm_{6.1}^{6.8}$	10.4 $\pm_{8.2}^{9.2}$	${11.5 \pm}_{9.4}^{10.5}$	13.5 $\pm_{11.9}^{13.3}$	14.9 $\pm_{12.5}^{14.0}$	15.8 $\pm_{14.2}^{15.9}$
$P e r s$	0	0	0	0	0	0
$L S T M$	6.0 $\pm_{5.8}^{6.5}$	${9.0 \pm}_{8.4}^{9.4}$	10.4 $\pm_{10.1}^{11.3}$	12.7 $\pm_{13.1}^{14.6}$	15.5 $\pm_{15.4}^{17.6}$	18.3 $\pm_{17.0}^{19.0}$

Table 3. Distribution of the relative RMSE along the forecasted horizon expressed in percentage.

$R M S E_{s t e p}$	$1$	$2$	$3$	$24$	$48$	$72$
$U C i r c l e$	$- 14.4 \pm_{25.4}^{22.7}$	$- 7.2 \pm_{26.4}^{23.6}$	$- 11.3 \pm_{23.6}^{21.1}$	$- 11.7 \pm_{20.6}^{18.4}$	$- 10.8 \pm_{23.9}^{21.4}$	$- 8.8 \pm_{21.4}^{19.1}$
$F F T$	$- 32.2 \pm_{10.8}^{9.7}$	$- 24.9 \pm_{9.0}^{8.1}$	$- 22.9 \pm_{8.8}^{7.9}$	$- 21.5 \pm_{9.7}^{8.7}$	$- 20.4 \pm_{9.8}^{8.8}$	$- 19.2 \pm_{9.9}^{8.9}$
$F F T 2$	$- 32.4 \pm_{10.0}^{8.9}$	$- 25.4 \pm_{8.5}^{7.6}$	$- 23.5 \pm_{8.3}^{7.4}$	$- 22.3 \pm_{8.7}^{7.8}$	$- 21.2 \pm_{8.6}^{7.7}$	$- 19.9 \pm_{8.1}^{7.3}$
$W e e k D$	$- 26.0 \pm_{13.0}^{11.6}$	$- 21.4 \pm_{12.3}^{11.0}$	$- 20.0 \pm_{12.0}^{10.7}$	$- 19.2 \pm_{13.3}^{11.9}$	$- 18.5 \pm_{13.4}^{12.0}$	$- 17.4 \pm_{13.7}^{12.3}$
$L S T M$	$- 28.3 \pm_{9.7}^{8.7}$	$- 24.6 \pm_{9.6}^{8.6}$	$- 23.2 \pm_{9.1}^{8.1}$	$- 21.7 \pm_{9.4}^{8.4}$	$- 20.4 \pm_{9.7}^{8.7}$	$- 17.7 \pm_{10.0}^{9.0}$

Table 4. Distribution of the relative ERR along the forecasted horizon expressed in percentage.

${E R R}_{s t e p}$	$1$	$2$	$3$	$24$	$48$	$72$
$U C i r c l e$	$43.8 \pm_{25.4}^{22.7}$	$34.7 \pm_{26.4}^{23.6}$	$24.9 \pm_{23.6}^{21.1}$	$23.8 \pm_{20.6}^{18.4}$	$18.0 \pm_{23.9}^{21.4}$	$6.9 \pm_{21.4}^{19.1}$
$F F T$	$64.0 \pm_{10.8}^{9.7}$	$52.7 \pm_{9.0}^{8.1}$	$58.4 \pm_{8.8}^{7.9}$	$41.2 \pm_{9.7}^{8.7}$	$25.1 \pm_{9.8}^{8.8}$	$18.3 \pm_{9.9}^{8.9}$
$F F T 2$	$64.6 \pm_{10.0}^{8.9}$	$54.7 \pm_{8.5}^{7.6}$	$49.5 \pm_{8.3}^{7.4}$	$45.0 \pm_{8.7}^{7.8}$	$36.6 \pm_{8.6}^{7.7}$	$32.4 \pm_{8.1}^{7.3}$
$W e e k D$	$17.4 \pm_{13.0}^{11.6}$	$5.1 \pm_{12.3}^{11.0}$	$1.5 \pm_{12.0}^{10.7}$	$- 1.9 \pm_{13.3}^{11.9}$	$- 9.3 \pm_{13.4}^{12.0}$	$- 18.9 \pm_{13.7}^{12.3}$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Aizenberg, I.; Becchi, L.; Bindi, M.; Intravaia, M.; Luchetta, A. A Complex-Valued Neural Network Approach to Time Series Forecasting in Smart Grid Energy Systems. Energies 2026, 19, 2247. https://doi.org/10.3390/en19092247

AMA Style

Aizenberg I, Becchi L, Bindi M, Intravaia M, Luchetta A. A Complex-Valued Neural Network Approach to Time Series Forecasting in Smart Grid Energy Systems. Energies. 2026; 19(9):2247. https://doi.org/10.3390/en19092247

Chicago/Turabian Style

Aizenberg, Igor, Lorenzo Becchi, Marco Bindi, Matteo Intravaia, and Antonio Luchetta. 2026. "A Complex-Valued Neural Network Approach to Time Series Forecasting in Smart Grid Energy Systems" Energies 19, no. 9: 2247. https://doi.org/10.3390/en19092247

APA Style

Aizenberg, I., Becchi, L., Bindi, M., Intravaia, M., & Luchetta, A. (2026). A Complex-Valued Neural Network Approach to Time Series Forecasting in Smart Grid Energy Systems. Energies, 19(9), 2247. https://doi.org/10.3390/en19092247

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Complex-Valued Neural Network Approach to Time Series Forecasting in Smart Grid Energy Systems

Abstract

1. Introduction

2. Materials and Methods

2.1. MLMVN and Proposed Architecture

2.2. Evaluated Data Processing

2.2.1. Unit Circle Normalization

2.2.2. Normalization via Fourier Transform

2.2.3. Simplified FFT Normalization

2.2.4. Encoding the Day of the Week

2.3. Benchmark Definition

2.4. Comparison Metrics

2.5. Dataset

3. Results

3.1. Training Description and Tuning of the Hyperparameters

3.2. Evaluation of the Metrics

4. Discussion

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI