Article

Hybrid Hourly Solar Energy Forecasting Using BiLSTM Networks with Attention Mechanism, General Type-2 Fuzzy Logic Approach: A Comparative Study of Seasonal Variability in Lithuania

by Naiyer Mohammadi Lanbaran *, Darius Naujokaitis, Gediminas Kairaitis and Virginijus Radziukynas
Smart Grids and Renewable Energy Laboratory, Lithuanian Energy Institute, Breslaujos St. 3, 44403 Kaunas, Lithuania
* Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(17), 9672; https://doi.org/10.3390/app15179672
Submission received: 30 July 2025 / Revised: 29 August 2025 / Accepted: 29 August 2025 / Published: 2 September 2025

Abstract

This research introduces a novel hybrid forecasting framework for solar energy prediction in high-latitude regions with extreme seasonal variations. This approach uniquely employs General Type-2 Fuzzy Logic (GT2-FL) for data preprocessing and uncertainty handling, followed by two advanced neural architectures, including BiLSTM and SCINet with Time2Vec encoding and Variational Mode Decomposition (VMD) signal processing. Four configurations are systematically evaluated: BiLSTM-Time2Vec, BiLSTM-VMD, SCINet-Time2Vec, and SCINet-VMD, each tested with GT2-FL preprocessed data and raw input data. Using meteorological data from Lithuania (2023–2024) with extreme seasonal variations where daylight hours range from 17 h in summer to 7 h in winter, F-BiLSTM-Time2Vec achieved exceptional performance, with nRMSE = 1.188%, NMAE = 0.813%, and WMAE = 3.013%, significantly outperforming both VMD-based variants and SCINet architectures. Comparative analysis revealed that Time2Vec encoding proved more beneficial than VMD preprocessing, especially when enhanced with fuzzification. The results confirm that fuzzification, BiLSTM architecture, and Time2Vec encoding provide the most robust forecasting capability under various seasonal conditions.

1. Introduction

Energy consumption drives economic progress and prosperity, including electricity generation and industrial usage. Currently, fossil fuels remain the primary global energy source [1]. As a result, the increasing recognition of the necessity for energy conservation and the adoption of alternative sources, particularly renewable energy (RE), has intensified in response to escalating concerns regarding greenhouse gas (GHG) emissions and climate change. While renewable energy refers to naturally replenishing sources like solar and wind, sustainable energy represents a broader approach that includes both renewable sources and efficient energy distribution systems. This comprehensive model enables society to generate and utilize energy while minimizing environmental impacts and resource depletion [2]. Among the renewable options, solar energy has been recognized as a crucial component in addressing these energy requirements [3,4]. Solar power in the European Union grew dramatically from 7.4 Terawatt hours (TWh) (1% of electricity mix) in 2008 to 252.1 TWh (approximately 8.5%) in 2023, representing a 34-fold increase [5]. Innovative grid initiatives aim to substantially increase the proportion of renewable energy in the grid, particularly in regions with high seasonal variability [6].
Despite impressive growth in renewable energy, a key challenge in integrating sources like solar and wind into the power grid is the intermittent nature of their electricity production, which is beyond human control. Successfully incorporating solar energy depends heavily on accurately forecasting its power generation and leveraging hybrid systems that combine solar and wind to improve grid reliability and accessibility [7]. Consequently, the accurate prediction of future RE availability is crucial, enabling the grid to allocate producers to meet fluctuating demand efficiently. This rapid expansion has further heightened the importance of accurate forecasting for power generation, enabling electric operators to maintain a precise balance between production and consumption [5,6,7].

1.1. Photovoltaic Forecasting Challenges

The increasing global demand for integrating RE, particularly solar power, into power grids highlights the economic and technological challenges associated with the increased penetration of flat-panel photovoltaic (PV), concentrated solar power (CSP), and concentrated photovoltaic (CPV) systems. These challenges stem from solar resource variability, seasonal production fluctuations, costly energy storage, and the need to balance grid flexibility and reliability [8]. These challenges are particularly acute in Lithuania, where seasonal differences in solar radiation are substantial, with daylight hours ranging from 17 h in summer to only 7 h in winter. Rodríguez et al. [9] demonstrated that solar plants frequently require support from ancillary generators during high variability, increasing capital and operational costs.
Solar energy generation forecasting has evolved from basic monitoring systems using inverter-based data collection [10,11,12] to sophisticated methodologies employing statistical models, satellite imaging, Numerical Weather Prediction, and hybrid artificial neural networks for day-ahead market integration [13]. Enhanced forecasting methods are essential for power system operators’ unit commitment planning and reserve capacity management. Forecasting methods are categorized into direct models (using historical PV output and meteorological data) and indirect models (predicting solar irradiation at PV locations), with direct methods generally demonstrating superior performance [14,15].
Weather-related uncertainties significantly impact solar energy forecasting accuracy. Atmospheric conditions, including dust, moisture, aerosols, clouds, and temperature variations, create substantial prediction challenges, with clouds potentially reducing solar energy by up to 100% on heavily overcast days [16,17]. Fuzzy logic-based (FL) models address these uncertainties by incorporating meteorological parameters such as latitude, longitude, altitude, seasonal variations, sunshine duration, and temperature [18]. Type-2 Fuzzy Logic (T2-FL) systems offer enhanced uncertainty handling compared to traditional Type-1 systems, providing additional degrees of freedom for direct uncertainty modeling [19,20].

1.2. State-of-the-Art Review

Deep learning (DL) has emerged as a powerful approach for solar energy forecasting since Geoffrey Hinton’s breakthrough work in 2006 [21]. Traditional fully connected networks struggle with temporal dependencies, leading to specialized architectures including Recurrent Neural Networks (RNNs), Long Short-Term Memory (LSTM), and bidirectional LSTM (BiLSTM) networks that effectively capture time-series patterns [22,23,24]. Recent architectural innovations include Temporal Convolutional Networks (TCN), Sample Convolution, and Interactive Neural Networks (SCINet). TCN implementations have demonstrated 20–30% accuracy improvements in 6-h PV forecasting [25,26].
Advanced hybrid models have emerged as promising solutions. Kong et al. introduced a solar radiation prediction model using TCN with attention [27], while Liu et al. developed enhanced TCN-DenseNet architectures demonstrating significant MAPE reductions [28]. Limouni et al. proposed hybrid LSTM-TCN models that outperformed individual architectures under various seasonal conditions [29]. Recent developments in VMD-based models have shown remarkable progress, with studies reporting improved performance in seasonal forecasting [30]. Advanced architectures, including Generative Adversarial Network BiLSTM (GAN-BiLSTM) [31], VMD-CNN-BiGRU (Gated Recurrent Unit) [32], VMD-Transformer models [33,34], and PSO-BiLSTM [35], have established new benchmarks, with recent models achieving error rates below 15% and accuracy rates up to 98.5%. Table 1 provides a comparative overview of solar forecasting approaches.

1.3. Research Gaps and Motivation

Despite significant advances in solar forecasting methodologies, three critical limitations prevent effective deployment in extreme seasonal climates:
Seasonal Variability Challenge: Most existing studies focus on regions with moderate seasonal variations. High-latitude regions like Lithuania (54° N–56° N) with extreme seasonal patterns (17-h summer days vs. 7-h winter days) present unique forecasting challenges that current methodologies inadequately address, with performance degradation exceeding 40% during seasonal transitions.
Uncertainty Integration Gap: While individual fuzzy logic and deep learning approaches exist, systematic integration of advanced uncertainty quantification (GT2-FL) with state-of-the-art neural architectures (BiLSTM, SCINet) remains unexplored for enhanced uncertainty handling in complex seasonal environments.
Architectural Optimization Deficiency: Comparative studies of advanced architectures (BiLSTM vs. SCINet) with different preprocessing techniques (Time2Vec vs. VMD) under varying seasonal conditions are lacking, limiting optimal model selection for specific climate conditions.
Implementation-Research Divide: Most research focuses on theoretical improvements without addressing real-world deployment challenges in regions with high meteorological variability, creating a significant gap between laboratory performance and operational reliability.

1.4. Research Contributions and Theoretical Significance

This study addresses these fundamental limitations through a novel hybrid methodology that advances the theoretical understanding and practical implementation of solar energy forecasting in climatically challenging regions. The key contributions include the first comprehensive implementation employing General Type-2 Fuzzy Logic (GT2-FL) for data preprocessing and uncertainty handling, followed by advanced deep learning architectures (BiLSTM and SCINet) enhanced with specialized preprocessing techniques (Time2Vec and VMD), providing a theoretical foundation for uncertainty-aware deep learning in renewable energy forecasting.
Systematic Architectural Analysis: Comprehensive evaluation of four distinct model configurations: BiLSTM with Time2Vec, BiLSTM with VMD, SCINet with Time2Vec, and SCINet with VMD, each tested in both fuzzified and non-fuzzified variants under realistic seasonal variations, enabling evidence-based architectural selection.
Advanced Uncertainty Modeling: Integration of GT2-FL preprocessing pipeline with Karnik-Mendel type-reduction and single-centroid defuzzification for enhanced uncertainty management in volatile weather conditions, addressing the theoretical gap in uncertainty quantification for renewable energy systems.
Seasonal Adaptation Methodology: Comprehensive evaluation across spring/summer and fall/winter periods using data from Lithuania (2023–2024), providing empirical insights for high-latitude regions with similar climate patterns and establishing performance benchmarks for extreme seasonal environments.
Operational Implementation Strategy: Development of a rolling forecasting strategy using 24-h actual data windows with hourly predictions, preventing error propagation while maintaining real-world applicability, bridging the gap between research and operational deployment.
This research advances the scientific understanding of uncertainty-aware renewable energy forecasting while providing practical solutions for challenging geographical conditions with extreme seasonal variations. The integrated approach demonstrates how a systematic combination of fuzzy logic, advanced neural architectures, and specialized preprocessing can address the complex temporal patterns and uncertainty inherent in solar energy generation.
The remainder of this paper is organized as follows: Section 2 presents the theoretical background and mathematical foundations. Section 3 delineates the proposed model architecture and preprocessing methodologies. Section 4 presents comprehensive results and performance analysis. Section 5 provides a detailed discussion of findings and their implications. Section 6 offers conclusions and future research directions.

2. Theoretical Background

This section provides essential theoretical foundations for the proposed hybrid methodology, focusing on key components that distinguish our approach from existing forecasting methods.

2.1. Preprocessing Techniques

2.1.1. General Type-2 Fuzzy Logic Systems

Complex forecasting challenges involve multiple sources of uncertainty, including incomplete data, inaccuracies, ambiguity, and conflicting evidence. Type-1 Fuzzy Sets (T1-FSs) use fixed membership values between 0 and 1 but have limitations when handling multidimensional uncertainties. Equation (1) represents the T1-FS membership function [46]. Type-2 Fuzzy Sets (T2-FSs) overcome these constraints by treating membership functions as fuzzy sets rather than crisp values, providing enhanced uncertainty modeling capabilities [47].
T2-FSs are created by “blurring” T1-FS membership functions, resulting in three-dimensional representations that can model “uncertainty about uncertainty” [48,49]. While T1-FL systems provide precise membership degrees (0 to 1) for input variables, T2-FSs offer more robust handling of the complex ambiguities inherent in solar forecasting applications, where weather conditions create multiple layers of uncertainty that traditional fuzzy systems cannot adequately address.
T1-FL gives exact membership values for all $x \in A$, with a membership degree of $\mu_A(x)$, while T2-FL models the uncertainty in those membership values, making it better suited for solar forecasting applications with complex, overlapping uncertainties.

$$\mu_A(x) = \begin{cases} 0, & \text{if } x \notin A \\ 1, & \text{if } x \in A \\ \text{value in } (0,1), & \text{for intermediate degrees of membership} \end{cases} \quad (1)$$
Figure 1 illustrates the Type-1 fuzzy Gaussian membership function formulated by the authors, which is the foundational component for modeling uncertainty in the fuzzy system. A General Type-2 Fuzzy Set $\tilde{A}$ in the universe of discourse $X$ is defined as in Equation (2):

$$\tilde{A} = \left\{ \left( (x, u), \mu_{\tilde{A}}(x, u) \right) \mid x \in X,\ u \in J_x \subseteq [0, 1] \right\} \quad (2)$$

where $u$ indicates the uncertainty, $x \in X$ is a component of the domain $X$, $u \in J_x \subseteq [0,1]$ is the secondary variable (secondary membership degree), $\mu_{\tilde{A}}(x,u)$ is the secondary membership function of the T2-FS at $x$ and $u$, and $J_x$ is the footprint of uncertainty (FOU) at $x$, representing the range of secondary membership degrees. The primary membership function $\mu_{\tilde{A}}(x)$ maps $x$ to a fuzzy set in the interval $[0,1]$, not just a single value. In other words, for each $x$, one obtains a range of membership degrees $J_x$.
The secondary membership function $\mu_{\tilde{A}}(x,u)$ specifies the degree to which each value of $u$ belongs to the fuzzy set at $x$.
The membership function of a GT2FS is a surface in 3D, where the X-axis represents the primary domain, the Y-axis the secondary domain, and the Z-axis the membership degree $\mu_{\tilde{A}}(x,u)$.
GT2FS provides a more comprehensive depiction of uncertainty by capturing variability in primary memberships and uncertainty about the precise value of membership degrees (via secondary membership). This makes it particularly useful for applications with complex uncertainty that T1-FSs cannot handle alone [49].
Figure 2 demonstrates a 3D surface plot of a General Type-2 fuzzy set using Gaussian membership functions. The plot illustrates the primary domain (x-axis) and secondary domain (y-axis), with the height (z-axis) representing the fuzzy membership values. The Gaussian shape captures both the primary membership and the uncertainty in the secondary membership values, providing a detailed representation of fuzzy uncertainty.
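To make the 3D surface concrete, the following minimal numpy sketch evaluates a Gaussian primary membership together with a Gaussian secondary membership over the primary and secondary domains. The center, width, and blur parameters are illustrative choices, not values from the study:

```python
import numpy as np

def gt2_gaussian_membership(x, u, c=0.5, sigma=0.15, blur=0.1):
    """Secondary membership mu(x, u) of a General Type-2 fuzzy set.

    The primary membership is Gaussian in x; the secondary membership
    is Gaussian in u, centred on the primary value, with width `blur`
    controlling the footprint of uncertainty.
    """
    primary = np.exp(-0.5 * ((x - c) / sigma) ** 2)    # Type-1 Gaussian
    return np.exp(-0.5 * ((u - primary) / blur) ** 2)  # secondary grade

# Evaluate the surface on a grid of the primary (x) and secondary (u) domains
x = np.linspace(0, 1, 101)
u = np.linspace(0, 1, 101)
X, U = np.meshgrid(x, u)
Z = gt2_gaussian_membership(X, U)

# At the Gaussian centre the primary membership is 1, so mu(0.5, 1.0) = 1
print(gt2_gaussian_membership(0.5, 1.0))  # 1.0
```

The resulting `Z` array is exactly the kind of surface plotted in Figure 2: height gives the membership degree over the primary and secondary domains.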

2.1.2. Time to Vector

The Time2Vec embedding layer offers a fundamental vector representation of time well-suited for numerous deep learning models. This layer can detect periodic and non-periodic patterns automatically, enhancing prediction accuracy [50]. Given a time series $\tau$, the Time2Vec representation of the time scalar $\tau$ is expressed as $t2v(\tau)$, a vector of length $k+1$, which is defined in Equation (3):

$$t2v(\tau)[i] = \begin{cases} \omega_0^h \tau + \varphi_0^h, & \text{if } i = 0 \\ \sin\!\left( \omega_i^h \tau + \varphi_i^h \right), & \text{if } 1 \le i \le k \end{cases} \quad (3)$$

where $t2v(\tau)[i]$ represents the $i$-th feature of $t2v(\tau)$. The parameters $\omega_i^h$ and $\varphi_i^h$ are learnable. The linear function $\omega_0^h \tau + \varphi_0^h$ captures the non-periodic elements of the time series, while the periodic components of the input data are obtained through the periodic activation function $\sin(\omega_i^h \tau + \varphi_i^h)$ [50].
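Equation (3) can be evaluated directly; the sketch below uses hand-picked daily and weekly frequencies in place of learned parameters, purely for illustration:

```python
import numpy as np

def time2vec(tau, omega, phi):
    """Time2Vec embedding of a scalar time tau (Equation (3)).

    omega, phi: parameter vectors of length k+1 (learnable in a trained
    model); index 0 gives the linear (non-periodic) term, indices 1..k
    give sinusoidal periodic terms.
    """
    linear = omega[0] * tau + phi[0]              # non-periodic component
    periodic = np.sin(omega[1:] * tau + phi[1:])  # periodic components
    return np.concatenate(([linear], periodic))

# Toy parameters: one linear term plus daily and weekly frequencies
omega = np.array([0.1, 2 * np.pi / 24, 2 * np.pi / (24 * 7)])
phi = np.zeros(3)

v = time2vec(6.0, omega, phi)  # embedding of hour 6
print(v.shape)  # (3,)
```

In the actual model these parameters are trained jointly with the network, so the useful frequencies are discovered from the data rather than fixed in advance.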

2.1.3. Variational Mode Decomposition (VMD)

The VMD layer provides fundamental frequency-domain analysis of signals, particularly suited for complex signal processing tasks. This layer integrates Wiener filtering, Hilbert transform, and frequency mixing to decompose signals into Intrinsic Mode Function (IMF) components, enhancing solar irradiance prediction accuracy through effective noise reduction and adaptive bandwidth optimization [51,52]. The algorithm addresses a variational problem mathematically formulated in Equations (4) and (5).
$$\min_{\{u_k\}, \{w_k\}} \sum_k \left\| \partial_t \left[ \left( \delta(t) + \frac{j}{\pi t} \right) * u_k(t) \right] e^{-j w_k t} \right\|_2^2 \quad (4)$$

$$\text{s.t.} \quad \sum_k u_k = \chi \quad (5)$$

Here $\{u_k\}$ represents the collection of all modes and $\{w_k\}$ their associated central frequencies. The original input signal is denoted by $\chi$, while $k$ indexes the modes. Additionally, $\partial_t$ represents the gradient with respect to time, $\delta(t)$ represents an impulse function, $j$ represents the imaginary unit, and the symbol $*$ indicates the convolution operation.

2.1.4. Sine-Cosine Transformation

Time-series decomposition breaks data into trend, seasonal, and residual components, allowing the model to learn these aspects separately. This decomposition is mathematically expressed in Equation (6) for an additive model.
$$Y_t = T_t + S_t + R_t \quad (6)$$

$Y_t$ is the observed value, $T_t$ is the trend, $S_t$ is the seasonal component, and $R_t$ is the residual. This technique helps isolate long-term trends and cyclic behaviors, improving the model's focus on the residual data for better prediction accuracy. Cyclical time features such as hour of day, day of week, and month are encoded using sine-cosine transformations to preserve their periodic nature [53]. For a cyclical feature $x$ with a period $T$, the transformation is defined in Equations (7) and (8) as:

$$x_{\sin} = \sin\!\left( \frac{2\pi x}{T} \right) \quad (7)$$

$$x_{\cos} = \cos\!\left( \frac{2\pi x}{T} \right) \quad (8)$$
This encoding ensures smooth transitions across periodic boundaries and enables the model to capture recurring patterns such as daily and seasonal solar generation cycles. These patterns are crucial for accurate forecasting in high-latitude regions with extreme seasonal variations.
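The encoding in Equations (7) and (8) takes a few lines; the example below shows how hours 23 and 0, far apart in the raw representation, become close neighbours in the transformed space:

```python
import numpy as np

def encode_cyclical(x, period):
    """Sine-cosine encoding of a cyclical feature (Equations (7)-(8))."""
    angle = 2 * np.pi * np.asarray(x, dtype=float) / period
    return np.sin(angle), np.cos(angle)

# Hour of day with period 24
hours = np.array([0, 6, 12, 23])
h_sin, h_cos = encode_cyclical(hours, 24)

# Euclidean distance between 23:00 and 00:00 in the encoded plane is
# small, unlike the raw distance |23 - 0| = 23
d = np.hypot(h_sin[3] - h_sin[0], h_cos[3] - h_cos[0])
print(round(d, 3))  # 0.261
```

The same transformation applies to day of week (period 7) and month (period 12), which is how the recurring daily and seasonal generation cycles are exposed to the model.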

2.1.5. Attention Mechanism

Attention mechanisms enable models to focus on relevant temporal segments by computing weighted combinations of input features [54]. For an input sequence, the attention output is calculated as in Equations (9)–(11):
$$s(q, k_i) = q^{T} k_i \quad (9)$$

$$\alpha_i = \frac{\exp\left( s(q, k_i) \right)}{\sum_{j=1}^{n} \exp\left( s(q, k_j) \right)} \quad (10)$$

$$z = \sum_{i=1}^{n} \alpha_i v_i \quad (11)$$

For an input sequence of length $n$, let the query vector $q$, key vectors $k_i$, and value vectors $v_i$ be the main elements of the attention mechanism. The query vector $q$ represents the current position or task the model tries to predict. Key vectors $k_i$ represent different parts of the input sequence, and value vectors $v_i$ contain the actual information of the sequence elements. The attention score $s(q, k_i)$ computes the similarity between the query and the keys (dot product). The attention weights $\alpha_i$ normalize the scores using a softmax, and the attention output $z$ is a weighted sum of the values [55].
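A minimal numpy rendering of Equations (9)–(11), with a toy query, keys, and values chosen purely for illustration:

```python
import numpy as np

def dot_product_attention(q, K, V):
    """Single-query dot-product attention (Equations (9)-(11)).

    q: query vector (d,); K: key matrix (n, d); V: value matrix (n, d_v).
    Returns the attention weights alpha and the weighted sum z.
    """
    scores = K @ q                           # s(q, k_i) = q^T k_i
    weights = np.exp(scores - scores.max())  # numerically stable softmax
    weights /= weights.sum()
    z = weights @ V                          # z = sum_i alpha_i v_i
    return weights, z

q = np.array([1.0, 0.0])
K = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
V = np.array([[10.0], [20.0], [30.0]])
alpha, z = dot_product_attention(q, K, V)
print(round(alpha.sum(), 6))  # 1.0
```

Keys aligned with the query (the first and third rows of `K`) receive larger weights, so their values dominate the output, which is exactly the "focus on relevant temporal segments" behaviour described above.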

2.2. Main Forecasting Models

2.2.1. Bidirectional Long Short-Term Memory (BiLSTM)

BiLSTM networks process information in both forward and backward directions, enabling comprehensive capture of temporal dependencies in time-series data [56]. The architecture consists of two LSTM layers processing the input sequence in opposite directions, allowing the model to capture context from both past and future time steps for each input $x_t$ and enhancing its understanding of sequential data. Equations (12)–(14) define the forward hidden vector $\overrightarrow{h_i}$, the backward hidden vector $\overleftarrow{h_i}$, and the output $h_i$:

$$\overrightarrow{h_i} = \overrightarrow{LSTM}(x_i), \quad i \in [1, n] \quad (12)$$

$$\overleftarrow{h_i} = \overleftarrow{LSTM}(x_i), \quad i \in [n, 1] \quad (13)$$

$$h_i = \left[ \overrightarrow{h_i}, \overleftarrow{h_i} \right], \quad i \in [n, 1] \quad (14)$$
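The bidirectional scheme of Equations (12)–(14) can be illustrated with a toy recurrent cell standing in for the LSTM; the gating machinery is omitted so the forward/backward/concatenate structure stays visible:

```python
import numpy as np

def toy_rnn(xs, W=0.5, U=0.5):
    """Minimal recurrent scan standing in for an LSTM cell:
    h_i = tanh(W * x_i + U * h_{i-1})."""
    h, out = 0.0, []
    for x in xs:
        h = np.tanh(W * x + U * h)
        out.append(h)
    return np.array(out)

def bidirectional(xs):
    """Equations (12)-(14): run the cell in both directions, then concatenate."""
    h_fwd = toy_rnn(xs)                      # left-to-right pass
    h_bwd = toy_rnn(xs[::-1])[::-1]          # right-to-left pass, realigned
    return np.stack([h_fwd, h_bwd], axis=1)  # h_i = [h_fwd_i, h_bwd_i]

xs = np.array([0.1, 0.5, 0.9, 0.3])
H = bidirectional(xs)
print(H.shape)  # (4, 2): forward and backward state per time step
```

In the real model each direction produces a full hidden vector rather than a scalar, but the concatenation at every time step works exactly as sketched here.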

2.2.2. Sample Convolution and Interactive Neural Networks (SCINet)

The SCINet model follows an encoder–decoder framework [57]. The encoder consists of a hierarchical convolutional network that uses various convolutional filters to capture dynamic temporal dependencies across multiple resolutions. Its core component is the SCI-Block, which processes an input sequence $F$ through a structured downsample–convolve–interact procedure. The input $F$ is first split into two sub-sequences:
  • $F_o$ (odd-indexed elements)
  • $F_e$ (even-indexed elements)
The sub-sequences undergo a two-step interactive learning process to share information and prevent loss from downsampling, as shown in Equation (15), followed by the information fusion presented in Equation (16):

$$F_o^s = F_o \odot \exp\left( \Phi(F_e) \right) \quad \text{and} \quad F_e^s = F_e \odot \exp\left( \Psi(F_o) \right) \quad (15)$$

$$F'_o = F_o^s \pm \rho(F_e^s) \quad \text{and} \quad F'_e = F_e^s \pm \eta(F_o^s) \quad (16)$$

where $\Phi$, $\Psi$, $\rho$, and $\eta$ are 1D convolutional modules, $\odot$ is the element-wise product, and $\exp$ is the exponential function.
Multiple SCI-Blocks are arranged in a binary tree to form the encoder, capturing multi-scale temporal patterns. The processed features are then rearranged and passed to a fully connected decoder to generate forecasts [58].
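The split-and-interact step of Equations (15) and (16) can be sketched as follows; the 1D convolutional modules are replaced here by identity functions so the data flow stays visible, and the $\pm$ operations are fixed to one addition and one subtraction for concreteness:

```python
import numpy as np

def sci_block(F, phi, psi, rho, eta):
    """One SCI-Block split-and-interact step (Equations (15)-(16)).

    phi, psi, rho, eta: stand-ins for the 1D convolutional modules
    (here arbitrary callables on arrays).
    """
    F_odd, F_even = F[1::2], F[0::2]       # downsample into two sub-sequences
    # Interactive learning: each branch rescales the other (Equation (15))
    F_odd_s = F_odd * np.exp(phi(F_even))
    F_even_s = F_even * np.exp(psi(F_odd))
    # Information fusion (Equation (16), with + and - for the +/- signs)
    F_odd_out = F_odd_s + rho(F_even_s)
    F_even_out = F_even_s - eta(F_odd_s)
    return F_odd_out, F_even_out

ident = lambda a: a                        # identity "convolutions" for the toy run
F = np.arange(8, dtype=float)
F_o, F_e = sci_block(F, ident, ident, ident, ident)
print(F_o.shape, F_e.shape)  # (4,) (4,)
```

Stacking such blocks in a binary tree, as the encoder does, repeatedly halves the temporal resolution while letting the two halves exchange information at every level.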

2.3. Evaluation Metrics

Model performance is assessed using three key metrics specifically chosen for solar forecasting applications:
  • Normalized Root Mean Squared Error (nRMSE%): Based on maximum observed power, emphasizing larger errors
  • Normalized Mean Absolute Error (NMAE%): Based on plant capacity, providing a reliable accuracy measure
  • Weighted Mean Absolute Error (WMAE%): Based on total energy production, accounting for generation-weighted errors
These metrics collectively provide a comprehensive assessment of forecasting accuracy across different operational scenarios and seasonal conditions [59].
In Equations (17)–(19):
The normalized mean absolute error NMAE%, based on the plant's net capacity $C$:

$$NMAE\% = \frac{1}{N} \sum_{k=1}^{N} \frac{\left| P_{m,k} - P_{p,k} \right|}{C} \cdot 100 \quad (17)$$

where $N$ denotes the number of samples (hours) considered, typically a day, month, or year. For this metric, the rated power of the PV system was used as $C$.
The weighted mean absolute error WMAE%, based on the total energy production:

$$WMAE\% = \frac{\sum_{k=1}^{N} \left| P_{m,k} - P_{p,k} \right|}{\sum_{k=1}^{N} P_{m,k}} \cdot 100 \quad (18)$$

The normalized root mean square error nRMSE%, based on the maximum observed power generated:

$$nRMSE\% = \frac{\sqrt{\frac{1}{N} \sum_{k=1}^{N} \left( P_{m,k} - P_{p,k} \right)^2}}{\max_k \left( P_{m,k} \right)} \cdot 100 \quad (19)$$

Here, $P_{m,k}$ is the measured power and $P_{p,k}$ the forecasted power at hour $k$. NMAE% is commonly used to evaluate prediction accuracy because it mitigates the bias seen in WMAE% when dealing with small power values; it weights errors relative to the capacity $C$, offering a more reliable measure. nRMSE% measures the average magnitude of hourly errors while emphasizing larger errors, highlighting particularly undesirable results [59].
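The three metrics translate directly into code; the measured/forecast series below is a toy example for illustration only:

```python
import numpy as np

def nmae_pct(p_m, p_p, capacity):
    """Normalized MAE relative to plant capacity C (Equation (17))."""
    return np.mean(np.abs(p_m - p_p)) / capacity * 100

def wmae_pct(p_m, p_p):
    """MAE weighted by total measured energy production (Equation (18))."""
    return np.sum(np.abs(p_m - p_p)) / np.sum(p_m) * 100

def nrmse_pct(p_m, p_p):
    """RMSE normalized by the maximum observed power (Equation (19))."""
    return np.sqrt(np.mean((p_m - p_p) ** 2)) / np.max(p_m) * 100

# Toy hourly series (kW): measured vs forecast
p_m = np.array([0.0, 2.0, 5.0, 8.0, 6.0, 1.0])
p_p = np.array([0.0, 2.5, 4.5, 8.5, 5.5, 1.5])
print(round(nmae_pct(p_m, p_p, capacity=10.0), 3))  # 4.167
```

Note the different normalizers: NMAE% divides by the rated capacity, WMAE% by the actual energy produced, and nRMSE% by the maximum observed power, which is why the three values differ for the same error series.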

3. Proposed Model Architecture

3.1. Forecasting Scope and Temporal Framework

This study implements short-term photovoltaic power forecasting with 1-h ahead predictions, classified within the standard renewable energy forecasting taxonomy (30 min to 6 h ahead). The methodology employs 24-h rolling historical data windows, designed explicitly for high-latitude regions with extreme seasonal variations.
Temporal Specifications: The forecast horizon is 1 h ahead (short-term classification). The input window (looking back) is 24 h of historical data. The update frequency is hourly forecasts. In terms of geographical focus, we consider Lithuania’s extreme seasonal conditions (17-h summer vs. 7-h winter daylight).
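The rolling strategy above can be sketched as window construction over an hourly series; the stand-in data here is synthetic, and the exact feature layout in the study's pipeline is richer (28 features rather than a single series):

```python
import numpy as np

def rolling_windows(series, lookback=24, horizon=1):
    """Build (input window, target) pairs for 1-h-ahead rolling forecasting.

    Each input uses `lookback` hours of actual measurements (not earlier
    predictions), so forecast errors do not propagate between steps.
    """
    X, y = [], []
    for t in range(lookback, len(series) - horizon + 1):
        X.append(series[t - lookback:t])   # 24 h of actual history
        y.append(series[t + horizon - 1])  # value 1 h ahead
    return np.array(X), np.array(y)

series = np.arange(30, dtype=float)        # stand-in hourly measurements
X, y = rolling_windows(series)
print(X.shape, y.shape)  # (6, 24) (6,)
```

Because every window is rebuilt from measurements, an error made at hour $t$ never contaminates the input used at hour $t+1$, which is the error-propagation safeguard described in Section 1.4.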

3.2. Data Description and Implementation Details

In this study, we designed an hourly forecast of solar energy generation in Lithuania, employing daytime data from 1 January 2023, to 31 August 2024. The meteorological data (forecasted meteo elements) used in this research were obtained from the Lithuanian Hydrometeorological Service (LHMT) through their API (https://api.meteo.lt/) (accessed on 5 September 2024). The model incorporates 28 distinct input features, encompassing meteorological parameters, derived solar radiation components, decomposed power signals, temporal encodings, and historical power values. The inherent variability and uncertainty in solar energy data present significant challenges, which we managed by developing a novel methodology based on combining GT2-FL and advanced deep learning models, incorporating comprehensive feature engineering and time series decomposition techniques.
All models were implemented using Python 3.12.7 with TensorFlow 2.19.0 and Keras 3.9.2 frameworks. Experiments were conducted on a high-performance workstation equipped with an Intel i7 processor running at 2.32 GHz, 31.7 GB of RAM (11.6 GB used, 37% utilization), Intel UHD Graphics with 7% usage, and a Samsung NVMe SSD with 1% active time. The training process utilized the Adam optimizer with a learning rate of 0.001 and the Huber loss function. Data was chronologically split into 70% for training (January–August 2023), 15% for validation (August 2023–June 2024), and 15% for testing (June–August 2024).
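A chronological 70/15/15 index split with no shuffling might be sketched as follows; the exact boundary handling in the study's pipeline may differ:

```python
import numpy as np

def chronological_split(n, train=0.70, val=0.15):
    """Split indices 0..n-1 into train/validation/test in temporal order.

    No shuffling: each subset is a contiguous, chronologically later block,
    which prevents future information from leaking into training.
    """
    i_train = int(n * train)
    i_val = i_train + int(n * val)
    idx = np.arange(n)
    return idx[:i_train], idx[i_train:i_val], idx[i_val:]

tr, va, te = chronological_split(1000)
print(len(tr), len(va), len(te))  # 700 150 150
```

Preserving temporal order matters for time series: a random split would let the model train on hours that immediately follow its test hours, inflating apparent accuracy.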

3.3. Data Preprocessing Pipeline

Figure 3 illustrates the comprehensive data preprocessing methodology employed in this study.

3.3.1. Fuzzification and Defuzzification

General Type-2 Fuzzy Logic preprocessing is applied to address uncertainty in solar energy data using Gaussian membership functions. The fuzzification process transforms crisp inputs into fuzzy sets with primary and secondary domains as linearly spaced arrays in [0, 1]. Type-reduction employs the Karnik-Mendel algorithm followed by single-centroid defuzzification, which is selected for computational efficiency when input values across features are proximate.
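To illustrate the type-reduction step, the sketch below computes the centroid interval of an interval type-2 slice by brute-force search over all switch points; the Karnik-Mendel algorithm converges to the same endpoints iteratively, and the membership shapes used here are illustrative rather than those from the study:

```python
import numpy as np

def km_centroid(x, mu_lower, mu_upper):
    """Type-reduced centroid [c_l, c_r] of an interval type-2 set.

    Brute-force over all switch points k: c_l weights small x with the
    upper membership, c_r weights large x with the upper membership.
    The Karnik-Mendel iterations find the same optimal switch points.
    """
    n = len(x)
    c_l = min(
        (x[:k] @ mu_upper[:k] + x[k:] @ mu_lower[k:])
        / (mu_upper[:k].sum() + mu_lower[k:].sum())
        for k in range(n + 1)
    )
    c_r = max(
        (x[:k] @ mu_lower[:k] + x[k:] @ mu_upper[k:])
        / (mu_lower[:k].sum() + mu_upper[k:].sum())
        for k in range(n + 1)
    )
    return c_l, c_r, (c_l + c_r) / 2  # single-centroid defuzzified value

x = np.linspace(0, 1, 11)                      # sorted primary domain
mu_u = np.exp(-0.5 * ((x - 0.5) / 0.2) ** 2)   # upper membership
mu_l = 0.6 * mu_u                              # lower membership
c_l, c_r, y = km_centroid(x, mu_l, mu_u)
print(round(y, 3))  # 0.5 by symmetry of the membership functions
```

The half-sum of the interval endpoints is the single-centroid defuzzified output mentioned above, which keeps the computation cheap when feature values are close together.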

3.3.2. Temporal Feature Engineering and Preprocessing Workflow

The preprocessing workflow combines multiple techniques:
  • Time Series Decomposition: Solar power data is decomposed into trend, seasonal, and residual components using additive decomposition Y t = T t + S t + R t .
  • Cyclical Feature Encoding: Time-related variables undergo sine-cosine transformations to preserve the periodic nature and enable the recognition of recurring patterns.
  • Lagged Features: Historical data from previous time steps are incorporated to capture temporal dependencies, using actual measurements rather than predicted values during rolling forecasting.
  • Normalization: Standard scaling ensures all features operate on comparable scales, preventing numerical instability during training.
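The lagged-feature and normalization steps can be sketched together; the lag choices (1 h, 2 h, 24 h) and the stand-in signal are illustrative, not the exact feature set used in the study:

```python
import numpy as np

def add_lags(series, lags=(1, 2, 24)):
    """Stack lagged copies of a series as features.

    Rows without enough history for the largest lag are dropped, so
    every feature row aligns with a valid target value.
    """
    max_lag = max(lags)
    cols = [series[max_lag - lag:len(series) - lag] for lag in lags]
    target = series[max_lag:]
    return np.column_stack(cols), target

def standardize(X):
    """Standard scaling: zero mean, unit variance per feature column."""
    mu, sigma = X.mean(axis=0), X.std(axis=0)
    return (X - mu) / np.where(sigma == 0, 1.0, sigma)

series = np.sin(np.linspace(0, 8 * np.pi, 200))  # stand-in power signal
X, y = add_lags(series)
Xs = standardize(X)
print(X.shape)  # (176, 3): three lagged features per remaining time step
```

In the rolling setup, the lagged columns are always filled with measured values, consistent with the error-propagation safeguard described in Section 3.1.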

3.3.3. Stationarity and Statistical Analysis

Stationarity assessment is crucial for understanding time series properties and their suitability for forecasting models. A time series is considered stationary when its statistical properties (mean, variance, and covariance) remain constant over time, making it generally easier to model and forecast due to predictable statistical behavior [60,61].
Our methodology employed complementary statistical tests: the Augmented Dickey–Fuller (ADF) test (null hypothesis: non-stationarity) and the Kwiatkowski-Phillips-Schmidt-Shin (KPSS) test (null hypothesis: stationarity). Statistical significance was determined using p-values compared to 0.05.
Analysis revealed that the original PV power generation series exhibits inherent non-stationarity, confirmed by conflicting test results (ADF: stationary with p-value 0.0000; KPSS: non-stationary with p-value 0.0100). This statistical conflict indicates complex nonstationary components in solar power generation data.
The decomposition approaches demonstrated varying effectiveness in transforming components into stationary series:
  • Decomposed residuals: achieved true stationarity across all models
  • VMD-based models: transformed 42.9% of components (3 out of 7) into stationary series, representing a significant advantage
  • Cyclical features: retained non-stationary characteristics, reflecting inherent variability in solar generation patterns
This selective stationarity aligns with our modeling strategy, enabling effective handling of both deterministic (stationary) and stochastic (non-stationary) components while accounting for uncertainty through Type-2 fuzzy modeling.

3.3.4. Meteorological Impact Integration

Correlation analysis between weather parameters and PV output demonstrates strong dependencies, with solar radiation components (GHI: 0.80, DHI: 0.73, DNI: 0.62) showing the strongest positive correlations, while relative humidity (−0.65) and cloud cover (−0.29) exhibit significant negative correlations. These relationships directly inform feature selection and preprocessing strategies, particularly for handling Lithuania’s extreme seasonal variations, where cloud cover variability increases by 78% compared to summer.
The seasonal forecasting challenges in Lithuania stem from the compounding effects of reduced solar elevation angles (from ~59° summer to ~13° winter), shortened daylight duration (17 to 7 h), and increased atmospheric variability during fall and winter periods. These lower elevation angles result in longer atmospheric path lengths and reduced irradiance intensity, while variable cloud cover and atmospheric conditions create complex intermittency patterns.
These seasonal factors collectively create the complex temporal dependencies that our hybrid GT2-FL methodology effectively addresses through its adaptive fuzzy inference system. This is demonstrated by the superior performance of F-BiLSTM-Time2Vec during both stable (S&S: nRMSE 1.188%) and variable (F&W: nRMSE 1.340%) seasonal conditions.
Figure 4 shows the correlation matrix between meteorological variables and PV power output. The strong correlations between power output and irradiance components (GHI, DHI, DNI) and significant negative correlations with relative humidity and cloud cover illustrate the direct dependence of PV generation on meteorological conditions. This relationship explains the increased forecasting difficulty during seasons with higher meteorological variability.

3.4. Hybrid Model Architectures

Our forecasting model’s architecture is designed to capture both short-term fluctuations and long-term trends in solar energy generation. Each component addresses specific challenges in time-series forecasting, particularly for renewable energy systems characterized by complex temporal dynamics. Our methodology evaluates four distinct hybrid model configurations that systematically combine neural architectures with preprocessing techniques.

3.4.1. BiLSTM-Based Models with Attention Mechanism

  • BiLSTM with Time2Vec Integration
The BiLSTM-Time2Vec model combines bidirectional LSTM networks with Time2Vec temporal encoding to enhance solar power forecasting capabilities. Time2Vec is a generalization of positional encoding that incorporates temporal information into the sequence model by transforming raw temporal features into a higher-dimensional time-embedded space. The Time2Vec transformation maps temporal input into a space enhanced by linear and periodic time dependencies. The linear component captures the monotonic progression of time, while the periodic components encode cyclical patterns at different frequencies, such as daily and seasonal variations.
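The mapping can be sketched in a few lines of NumPy. This is an illustrative sketch, not the authors' implementation: the weights `w` and `b`, which are learned during training in the actual model, are fixed here for demonstration.

```python
import numpy as np

def time2vec(tau, w, b):
    """Time2Vec embedding: first component linear, the rest periodic (sine).

    tau  : scalar time input (e.g., hour index)
    w, b : weight and bias vectors of length k+1 (k periodic components)
    """
    linear = w[0] * tau + b[0]               # monotonic progression of time
    periodic = np.sin(w[1:] * tau + b[1:])   # cyclical patterns at learned frequencies
    return np.concatenate(([linear], periodic))

# Example: embed hour-of-day with one periodic component fixed to the daily cycle
w = np.array([1.0, 2 * np.pi / 24])
b = np.array([0.0, 0.0])
emb_0 = time2vec(0.0, w, b)
emb_24 = time2vec(24.0, w, b)
```

The first output dimension grows linearly with time, while the sine components repeat with their respective periods, which is how daily and seasonal cycles are encoded.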
The BiLSTM network processes the Time2Vec-enhanced input forward and backward, enabling the model to capture bidirectional dependencies in the time series. This bidirectional processing is crucial for understanding past and future contexts at each time step, improving the model’s capacity to forecast accurately when dependencies span extended periods.
The architecture integrates an attention mechanism focusing on significant time steps, enhancing predictive accuracy by emphasizing influential historical data points. This attention layer dynamically weights the importance of different temporal features, allowing the model to automatically identify which past observations are most relevant for current predictions. The attention scores are computed using the hidden states from both forward and backward LSTM layers, creating a comprehensive temporal understanding. Following the attention-enhanced BiLSTM processing, a final 64-unit BiLSTM layer refines the learned temporal representations, consolidating the attended features into a dense encoding. The architecture culminates with a single-neuron output layer using linear activation, specifically designed to forecast one-hour-ahead solar energy generation.
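The weighting step can be illustrated with a minimal NumPy sketch. The scoring vector `v` would be learned jointly with the network, and the additive-attention form used here is an assumption, since the exact attention variant is not specified above.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def temporal_attention(H, v):
    """Score each time step's hidden state and return a weighted context vector.

    H : (T, d) matrix of concatenated forward/backward hidden states
    v : (d,) scoring vector (learned in practice; random here for illustration)
    """
    scores = np.tanh(H) @ v   # (T,) unnormalized relevance of each time step
    alpha = softmax(scores)   # attention weights summing to 1 over time
    context = alpha @ H       # (d,) attended summary of the sequence
    return context, alpha

rng = np.random.default_rng(0)
H = rng.standard_normal((24, 8))   # 24 hourly steps, 8-dim hidden states
v = rng.standard_normal(8)
context, alpha = temporal_attention(H, v)
```

The weights `alpha` reveal which historical hours the model deems most relevant for the current prediction.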
  • BiLSTM with VMD Integration
The BiLSTM-VMD model enhances bidirectional Long Short-Term Memory networks by incorporating Variational Mode Decomposition for signal preprocessing. VMD provides signal decomposition capabilities by integrating frequency information into sequence modeling, transforming solar irradiance data into frequency-based representations that improve the model’s understanding of complex signal patterns.
The VMD process separates the original signal into multiple Intrinsic Mode Functions (IMFs) within the frequency domain. Each IMF component represents distinct frequency bands of the signal, with lower frequency components capturing long-term trends and higher frequency components representing rapid fluctuations in solar irradiance. This multi-level decomposition enables comprehensive signal analysis that aligns well with the hierarchical processing capabilities of BiLSTM networks.
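Full VMD solves a constrained variational optimization (implementations such as the `vmdpy` package exist). As a rough, hypothetical stand-in that illustrates only the band-separation idea, one can split a signal's spectrum into fixed frequency bands; real VMD instead finds the mode center frequencies adaptively.

```python
import numpy as np

def fft_band_split(signal, n_bands):
    """Split a signal into frequency-band components via FFT masking.

    A crude stand-in for VMD: the bands here are fixed and equally spaced,
    whereas VMD derives them by solving a variational problem.
    """
    spectrum = np.fft.rfft(signal)
    edges = np.linspace(0, len(spectrum), n_bands + 1).astype(int)
    components = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = np.zeros_like(spectrum)
        mask[lo:hi] = spectrum[lo:hi]
        components.append(np.fft.irfft(mask, n=len(signal)))
    return components  # slow trend first, fastest fluctuations last

t = np.linspace(0, 4, 512, endpoint=False)
x = np.sin(2 * np.pi * 1 * t) + 0.3 * np.sin(2 * np.pi * 40 * t)
imfs = fft_band_split(x, n_bands=3)
```

As with VMD's IMFs, the components sum back to the original signal, with the low-frequency band capturing the trend and the high-frequency bands the rapid fluctuations.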
The decomposed VMD components are then processed through the same BiLSTM architecture with attention mechanisms described in the Time2Vec variant. This enables the model to identify and adapt to various frequency patterns in solar generation data while maintaining the bidirectional temporal dependency modeling and attention-based feature selection.

3.4.2. SCINet-Based Models with Self-Attention Mechanism

  • SCINet with Time2Vec Integration
The SCINet-Time2Vec model enhances solar power forecasting by combining Sample Convolution and Interactive Neural Networks with Time2Vec temporal encoding and self-attention mechanisms. The architecture begins with Time2Vec temporal embedding at the input layer, transforming raw temporal features into higher-dimensional representations that capture linear progression and cyclical patterns across multiple frequencies.
The Time2Vec-enhanced inputs are processed through SCINet’s hierarchical structure, where temporal features undergo self-attention and convolutional operations within the SCI-Blocks. Each SCI-Block applies multi-head self-attention to capture complex temporal relationships while maintaining SCINet’s inherent capacity for hierarchical pattern recognition through its tree-like structure and multi-scale processing. The self-attention mechanism dynamically weights the importance of different temporal segments, allowing the model to focus on the most relevant historical patterns for current predictions.
The integrated architecture leverages residual connections to ensure stable gradient flow throughout the deep network, while regularization techniques prevent overfitting during training. The combination of Time2Vec’s explicit temporal encoding with SCINet’s multi-scale decomposition and self-attention creates a robust framework for modeling both short-term fluctuations and long-term trends in solar power generation data.
  • SCINet with VMD Integration
The SCINet-VMD model incorporates frequency-domain decomposition into SCINet’s architectural framework through Variational Mode Decomposition preprocessing. This approach augments SCINet’s inherent multi-scale analysis by decomposing input signals into their constituent mode functions before hierarchical processing.
VMD systematically decomposes the original signal into multiple IMFs in the frequency domain, with each IMF representing a distinct frequency band. Slow-varying components capture gradual trends while fast-varying components represent rapid changes in solar irradiance. This decomposition aligns with SCINet’s hierarchical structure, providing a complementary approach to multi-scale pattern recognition.
The decomposed frequency components are processed through SCINet’s self-attention mechanisms within the SCI-Blocks, enabling the model to capture temporal dependencies across multiple scales while leveraging both the frequency-domain information provided by VMD and SCINet’s inherent hierarchical processing capabilities.

3.5. Rolling Forecasting Strategy

All model variants employ a rolling forecasting strategy that uses the previous 24 h of actual data to predict the next hour. The model makes predictions only for the next hour and then resets at 24-h intervals, using measured PV generation data rather than predicted values as inputs. This prevents the propagation of forecasting errors and ensures the model remains robust over time by preventing error accumulation that could degrade performance.
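The strategy described above can be sketched as follows; `predict_fn` is a placeholder for any trained model's one-step prediction method.

```python
import numpy as np

LOOK_BACK = 24  # hours of actual measurements fed to the model

def rolling_forecast(series, predict_fn):
    """One-hour-ahead rolling forecast over a measured series.

    Each prediction uses the previous 24 h of *actual* data, never the
    model's own earlier predictions, so forecast errors cannot accumulate.
    """
    preds = []
    for t in range(LOOK_BACK, len(series)):
        window = series[t - LOOK_BACK:t]   # actual measurements only
        preds.append(predict_fn(window))
    return np.array(preds)

# Illustration with a naive persistence "model" standing in for the networks
series = np.arange(30, dtype=float)
preds = rolling_forecast(series, predict_fn=lambda w: w[-1])
```

Swapping the lambda for a trained network's prediction call yields the evaluation loop used for all model variants.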
Figure 5 briefly illustrates the comprehensive methodology used in this study, providing an overview of the forecasting process.

3.6. Model Training and Evaluation

All models were trained using the Adam optimizer with a learning rate of 0.001 and the Huber loss function, which balances sensitivity to small and large errors while maintaining robustness to outliers. Through extensive experimentation and iterative optimization, we conducted comprehensive hyperparameter fine-tuning to identify the optimal configurations for our specific dataset and climatic conditions. Multiple training iterations were performed with various hyperparameter combinations, carefully evaluating their impact on model performance using training and validation sets. Each hyperparameter was meticulously selected after extensive testing and comparative analysis across numerous experimental runs to ensure optimal performance under Lithuania’s diverse seasonal and weather patterns.
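For reference, the Huber loss is quadratic for small residuals and linear for large ones. A minimal NumPy version is below; the threshold parameter `delta` defaulting to 1.0 is the conventional choice and an assumption here, as the value used in training is not stated.

```python
import numpy as np

def huber_loss(y_true, y_pred, delta=1.0):
    """Huber loss: quadratic for |residual| <= delta, linear beyond it,
    keeping training sensitive to small errors yet robust to outliers."""
    r = np.abs(y_true - y_pred)
    quadratic = 0.5 * r ** 2
    linear = delta * r - 0.5 * delta ** 2
    return np.where(r <= delta, quadratic, linear).mean()
```

With `delta=1.0`, a residual of 0.5 contributes 0.125 (quadratic regime) while a residual of 3.0 contributes only 2.5 (linear regime) instead of the 4.5 that squared error would assign.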
The hyperparameter selection process followed a systematic approach combining grid search optimization with domain expertise from the solar forecasting literature. Specific parameter rationale includes:
  • a 24-h look-back window, selected based on diurnal solar patterns and the optimal context for hourly forecasting in seasonal climates;
  • a learning rate of 0.001, chosen by testing [0.01, 0.001, 0.0001] as it provided the fastest convergence without overshooting;
  • a network architecture of [512, 256, 128, 64] units, determined through systematic testing of different layer configurations;
  • dropout rates (LSTM: 0.2, Dense: 0.1), tuned within the [0.1–0.5] range to prevent overfitting;
  • a batch size of 42, selected based on computational resources and stable gradient updates;
  • VMD modes K = 3, chosen based on signal analysis showing optimal decomposition of Lithuania’s solar data into trend, seasonal, and high-frequency components.
Table 2 presents the carefully optimized hyperparameters used across all model variants. These values were selected based on their consistently superior performance across all model configurations, demonstrating the best balance between computational efficiency and prediction accuracy under varying meteorological conditions. All models underwent identical data preprocessing steps and utilized these rigorously tested hyperparameters to ensure fair and reliable comparison.

4. Results

4.1. Comparison of Neural Network Architectures

Four distinct neural network architectures have been implemented and compared: BiLSTM with Time2Vec, BiLSTM with VMD, SCINet with Time2Vec, and SCINet with VMD, each tested in fuzzified and non-fuzzified variants. The fuzzified models (F-models) incorporate General Type-2 Fuzzy Logic preprocessing for enhanced uncertainty modeling. All models used the optimized hyperparameters detailed in Table 2 (Section 3.6) and underwent identical data preprocessing steps to ensure a fair comparison.
  • Model 1 (BiLSTM with Time2Vec): This model utilizes bidirectional LSTM layers with Time2Vec temporal encoding. The architecture integrates attention mechanisms between LSTM layers and features residual connections to enhance gradient flow.
  • Model 2 (BiLSTM with VMD): This model enhances the BiLSTM architecture with VMD-based feature extraction. It combines mode decomposition with bidirectional LSTM layers and attention mechanisms for comprehensive temporal pattern capture.
  • Model 3 (SCINet with Time2Vec): An input layer processes 24-h sequences with Time2Vec encoding, followed by spatial attention mechanisms and SCINet blocks for capturing temporal dependencies. The model employs convolutional layers with spatial attention and residual connections, culminating in global mean pooling and dense layers for prediction.
  • Model 4 (SCINet with VMD): This architecture builds upon SCINet by incorporating VMD for signal processing. It maintains self-attention and SCINet blocks while leveraging decomposed signal components for enhanced feature extraction.
All models were trained using a custom loss function combining MSE, MAE, and Huber losses, with a rolling forecast approach employed for prediction generation. The evaluation metrics include nRMSE, NMAE, and WMAE. The results demonstrate varying efficacy across models in capturing temporal patterns and providing accurate solar power forecasts. Model 1 (F-BiLSTM with Time2Vec) consistently outperforms the other architectures, particularly during periods of high variability.
Table 3 and Table 4 summarize the findings, providing a comparative analysis of the performance of all four neural network models using three error metrics (nRMSE, NMAE, and WMAE) across multiple months from 2023 to 2024. The data is categorized into S&S (spring and summer) and F&W (fall and winter) segments, with averages for each segment and for the entire dataset. All metric results are expressed as percentages. The F-BiLSTM with Time2Vec model demonstrates the lowest error rates across most metrics, while the basic BiLSTM-VMD exhibits higher error values, particularly in the WMAE metric.
Implementing attention-mechanism BiLSTM and self-attention SCINet models with Time2Vec and VMD preprocessing presents several significant challenges. The primary technical difficulties are complex hyperparameter tuning for attention windows and VMD modes, together with the substantial computational demands of the self-attention mechanisms and preprocessing operations. Data-specific challenges arise from Lithuania’s geographical location, where extreme seasonal variations create significant data imbalances between summer and winter periods. Additional difficulties emerge from model integration, particularly in synchronizing VMD components and maintaining temporal alignment, while training stability issues stem from gradient problems in deep networks and convergence challenges with attention mechanisms.
The data was divided into three distinct periods: a training period from January 2023 to August 2023, a validation period from August 2023 to June 2024, and a testing period from June 2024 to August 2024. This chronological split allows for comprehensive model evaluation across different seasonal patterns and ensures the models are tested on future data points.
The extensive validation period spanning nearly a year helps assess the models’ ability to generalize across different seasons, while the test period focuses on summer months to evaluate final performance.

4.2. Performance Evaluation Metrics

F-BiLSTM-Time2Vec demonstrates the best overall performance among all models, with the lowest error metrics (nRMSE: 1.256, NMAE: 0.810, WMAE: 11.624). This performance is particularly notable in the spring and summer (S&S) period, when it achieves significantly lower error rates than other models (nRMSE: 1.188, NMAE: 0.813, WMAE: 3.013), outperforming all alternatives by a substantial margin. The model’s ability to maintain this level of accuracy across different evaluation metrics highlights the effectiveness of combining fuzzy logic with BiLSTM architecture and Time2Vec encoding, creating a synergistic effect that enhances forecasting precision beyond what each component could achieve individually.
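The exact normalization used for these metrics is not restated in this excerpt; a common convention (assumed here) normalizes RMSE and MAE by installed plant capacity and weights WMAE by total generation, all expressed in percent.

```python
import numpy as np

def forecast_metrics(y_true, y_pred, capacity):
    """Assumed definitions: nRMSE and NMAE normalized by plant capacity,
    WMAE weighted by total actual generation. All returned in percent."""
    err = y_true - y_pred
    nrmse = 100 * np.sqrt(np.mean(err ** 2)) / capacity
    nmae = 100 * np.mean(np.abs(err)) / capacity
    wmae = 100 * np.sum(np.abs(err)) / np.sum(y_true)
    return nrmse, nmae, wmae

# Toy example with hypothetical hourly generation values (kW)
y_true = np.array([10.0, 20.0, 30.0])
y_pred = np.array([12.0, 18.0, 33.0])
nrmse, nmae, wmae = forecast_metrics(y_true, y_pred, capacity=100.0)
```

Because WMAE divides by actual generation rather than capacity, it inflates sharply in low-output winter months, which is consistent with the large F&W WMAE values reported below.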

4.3. Comparing Fuzzified and Non-Fuzzified Versions of Each Architecture

The BiLSTM-Time2Vec pair shows an interesting pattern where the fuzzified version (F-BiLSTM-Time2Vec) outperforms its non-fuzzified counterpart (BiLSTM-Time2Vec) across all metrics, which suggests that fuzzy transformation particularly benefits this architecture. For BiLSTM-VMD, the fuzzified version performs better in the S&S period but shows slightly higher errors in the F&W period. However, both versions maintain relatively stable performance compared to other models. SCINet-based models (both Time2Vec and VMD variants) show consistently higher error rates in both fuzzified and non-fuzzified versions, particularly in handling seasonal variations.

4.4. Seasonal Performance

During the spring and summer (S&S) period, F-BiLSTM-Time2Vec exhibits exceptional performance (nRMSE: 1.188, NMAE: 0.813, WMAE: 3.013), significantly outperforming other models during these stable seasonal periods. However, the non-fuzzified BiLSTM-Time2Vec shows slightly degraded performance (nRMSE: 1.572, NMAE: 1.092, WMAE: 4.063), indicating that fuzzy transformation particularly benefits prediction during stable seasons. In contrast, during the F&W period, all models experienced performance degradation, but to varying degrees:
F-BiLSTM-Time2Vec maintains relatively stable performance (nRMSE: 1.340, NMAE: 0.807, WMAE: 22.149) and exhibits the least degradation among all models; the combination of fuzzy transformation and Time2Vec encoding helps the model handle seasonal volatility. BiLSTM-VMD models (both fuzzy and non-fuzzy) show moderate degradation in the F&W period, with error rates approximately doubling compared to their S&S performance, indicating that VMD preprocessing, while effective in stable seasons, may not fully capture the complex patterns of volatile periods. SCINet-based models show the most severe seasonal sensitivity, with performance deteriorating significantly in the F&W period (error rates increasing 3–4 times); this suggests that the SCINet architecture, regardless of fuzzy transformation, struggles to adapt to seasonal volatility. The high WMAE values in the F&W period across all models (particularly pronounced in the SCINet variants) indicate that handling extreme events and outliers during volatile seasons remains challenging; nevertheless, F-BiLSTM-Time2Vec demonstrates the most robust performance under these conditions. These results show that while all models exhibit performance degradation during the fall and winter months, the F-BiLSTM-Time2Vec and BiLSTM-VMD models maintain substantially better accuracy than the SCINet-based architectures, particularly in managing the increased volatility characteristic of the F&W period. The F-BiLSTM-Time2Vec model shows remarkable resilience, with its error metrics increasing only minimally during volatile seasons compared to other models.
Furthermore, the superior performance of fuzzified variants, especially evident in the F-BiLSTM-Time2Vec’s consistently lower error rates across all seasons (nRMSE: 1.256 compared to non-fuzzified 1.733), emphasizes the significant advantage of incorporating fuzzy transformation in time series forecasting models. This pattern highlights the effectiveness of combining fuzzy logic with deep learning architectures to enhance prediction accuracy and seasonal adaptability, mainly when dealing with complex temporal patterns and seasonal variations.
The results from the test dataset timeframe (16 June 2024 to 31 August 2024) partially corroborated our findings and revealed significant nuances. The F-BiLSTM-Time2Vec model maintained its superiority in the NMAE and WMAE metrics during the test period, outperforming the BiLSTM-VMD model. However, the BiLSTM-VMD model demonstrated notable robustness in the test period for nRMSE, nearly matching the performance of F-BiLSTM-Time2Vec and exhibiting significantly less performance degradation from the main evaluation period.
The overall finding of seasonal consistency is supported: F-BiLSTM-Time2Vec exhibits the smallest performance gap between seasons. Nevertheless, BiLSTM-VMD may offer competitive advantages in specific deployments, particularly when nRMSE is the priority. These findings indicate that specific performance requirements and seasonal conditions should guide model selection, although F-BiLSTM-Time2Vec remained the most balanced performer across all evaluation criteria.

4.5. Structural Advantages of the Hybrid Approach

The proposed hybrid methodology demonstrates distinct structural advantages by systematically integrating multiple specialized components within a unified framework.
Comprehensive Preprocessing Pipeline: Our approach combines GT2-FL uncertainty modeling, sine-cosine transformations for cyclical features, time series decomposition, and power lag incorporation in a coherent preprocessing workflow. Unlike conventional methods that apply these techniques in isolation, our integrated pipeline ensures synergistic operation across all preprocessing stages.
Dual-Pathway Architecture: The methodology systematically evaluates four configurations combining two neural architectures (BiLSTM with bidirectional processing vs. SCINet with hierarchical multi-scale analysis) and two preprocessing approaches (Time2Vec temporal encoding vs. VMD frequency-domain decomposition). Each configuration is tested in fuzzified and non-fuzzified variants, enabling direct assessment of the benefits of uncertainty modeling.
Rolling Forecast Innovation: Unlike traditional forecasting that accumulates prediction errors, our rolling forecast strategy uses actual measured data for each 24-h prediction window, preventing error propagation while maintaining operational applicability.
Integrated Uncertainty Management: The GT2-FL preprocessing serves as intelligent uncertainty quantification that enhances rather than replaces deep learning capabilities, creating a comprehensive uncertainty-aware framework that addresses meteorological variability and temporal pattern complexity inherent in high-latitude solar generation. This systematic integration represents a novel paradigm where fuzzy logic, advanced temporal encoding, signal decomposition, and specialized neural architecture collectively address the complex forecasting challenges of extreme seasonal environments.
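As one concrete element of the preprocessing pipeline above, the sine-cosine transformation for cyclical features can be sketched as follows (an illustrative sketch; the paper does not specify which features were encoded this way beyond cyclical time variables):

```python
import numpy as np

def encode_cyclical(values, period):
    """Map a cyclical feature (hour-of-day, day-of-year) onto the unit circle,
    so that e.g. hour 23 and hour 0 end up adjacent rather than far apart."""
    angle = 2 * np.pi * np.asarray(values) / period
    return np.sin(angle), np.cos(angle)

sin_h, cos_h = encode_cyclical([0, 6, 12, 23], period=24)
```

In the encoded space, hour 23 lies close to hour 0, preserving the wrap-around continuity that a raw integer hour feature would break.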

4.6. Statistical Significance Analysis

To establish the statistical validity of our findings, we conducted a comprehensive significance analysis using 20 monthly observations from January 2023 to August 2024. This analysis employed parametric and non-parametric approaches to validate performance differences between the hybrid models.
  • Descriptive Statistics: F-BiLSTM-Time2Vec demonstrated the most consistent performance with the lowest mean nRMSE of 1.256% ± 0.464%, followed by F-BiLSTM-VMD (1.571% ± 0.660%) and BiLSTM-Time2Vec (1.733% ± 0.521%). SCINet-based architectures exhibited significantly higher error rates with greater variability (F-SCINet-Time2Vec: 4.393% ± 2.783%).
  • Statistical Testing: Using F-BiLSTM-Time2Vec as a reference, paired t-tests revealed statistically significant differences across all competing models (p < 0.001). Cohen’s d calculations showed large effect sizes: BiLSTM-Time2Vec (d = 1.663), F-BiLSTM-VMD (d = 0.994), and BiLSTM-VMD (d = 2.582), indicating practically significant improvements beyond statistical significance.
  • Non-parametric Validation: Shapiro–Wilk tests revealed non-normal distributions across models, necessitating Wilcoxon signed-rank tests for validation. These confirmed the parametric results (p < 0.001 for all comparisons), strengthening confidence in our conclusions despite non-normal data distributions.
  • Time Series Considerations: Lag-1 autocorrelation analysis showed no significant temporal dependency (r = 0.374, p = 0.115), validating the independence assumption. One-way Analysis of Variance (ANOVA) confirmed substantial model differences (F = 11.85, p < 0.001).
The statistical analysis conclusively establishes F-BiLSTM-Time2Vec’s superiority with both statistical significance (p < 0.001) and practical importance (large effect sizes). The consistency between parametric and non-parametric tests, combined with substantial effect sizes, provides robust evidence for the optimal performance of fuzzy logic integration with BiLSTM architecture and Time2Vec encoding.
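The paired t-test and Cohen's d used in this analysis can be reproduced with a few lines of NumPy; the monthly error values below are synthetic placeholders, not the paper's data.

```python
import numpy as np

def paired_comparison(errors_a, errors_b):
    """Paired t statistic and Cohen's d on per-month error differences.

    errors_a : reference model's monthly nRMSE values
    errors_b : a competitor's values over the same months
    """
    d = np.asarray(errors_b) - np.asarray(errors_a)
    n = len(d)
    t_stat = d.mean() / (d.std(ddof=1) / np.sqrt(n))  # paired t-test statistic
    cohens_d = d.mean() / d.std(ddof=1)               # standardized effect size
    return t_stat, cohens_d

# Synthetic stand-in for 20 monthly nRMSE observations (%)
rng = np.random.default_rng(1)
ref = 1.25 + 0.4 * rng.standard_normal(20)
competitor = ref + 0.5 + 0.1 * rng.standard_normal(20)
t_stat, eff = paired_comparison(ref, competitor)
```

For paired data, the t statistic is simply Cohen's d scaled by the square root of the sample size, which is why large effect sizes and significant t-tests appear together in the results above.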

4.7. Architectural Comparison and Computational Considerations

Figure 6, Figure 7, Figure 8 and Figure 9 present results for August 2023, from the S&S period, and March 2024, from the F&W period, the months in which all models achieved their most consistent and generally best performance. This pattern holds for nRMSE and is reflected in these months’ NMAE and WMAE metrics.

5. Discussion

In the experiments, several variants of LSTM and Convolutional Neural Network (CNN) models were applied to forecast energy production in solar power plants, as the literature review suggests that LSTM and CNN are highly effective for predicting renewable energy generation. This study highlights the effectiveness of combining attention-enhanced deep learning models with specialized preprocessing techniques for solar power forecasting in Lithuania’s unique geographical context. By comparing BiLSTM with attention mechanisms and SCINet with self-attention mechanisms, each integrated with either Time2Vec or VMD preprocessing, we provide valuable insights into solar forecasting in high-latitude regions. The four models, BiLSTM-Time2Vec with attention, BiLSTM-VMD with attention, SCINet-Time2Vec with self-attention, and SCINet-VMD with self-attention, have distinct strengths: the BiLSTM variants with attention mechanisms excel at capturing sequential patterns, while the SCINet models with self-attention are more effective at modeling long-range dependencies. Including Time2Vec or VMD preprocessing further enhances the models’ ability to capture periodic patterns in the data.
Lithuania’s geographical location (54° N–56° N) presents specific challenges for solar forecasting. Unlike regions with consistent solar radiation or mild seasonal variations, Lithuania experiences significant fluctuations in daylight hours, ranging from 17 h in summer to only 7 h in winter; this is further complicated by the low solar altitude in winter, which reduces solar radiation intensity even during daylight hours. Additionally, the frequent cloud cover and atmospheric instability in the Baltic region make forecasting more complex than in areas with stable, sunny climates. Our proposed models performed well under these challenging conditions, demonstrating that attention mechanisms combined with Time2Vec or VMD preprocessing effectively address both seasonal patterns and irregular weather variations. By separating the two preprocessing methods, we could assess their individual contributions to forecasting accuracy. The findings suggest that this approach could improve grid stability and resource allocation. Moreover, the successful application of these models in Lithuania suggests they could also be applied to other high-latitude regions with similar weather patterns. The comparative analysis of the preprocessing techniques (Time2Vec and VMD) offers valuable directions for future research. Combining attention mechanisms, temporal encoding, and signal decomposition is a promising area for advancing solar power forecasting methods. As the field develops, these techniques can contribute to more accurate and reliable solar power predictions, supporting the integration of renewable energy and enhancing sustainability. This study focuses on seasonal patterns in Lithuania; however, the hybrid fuzzy logic and deep learning method can also be applied to other high-latitude regions (50°–60° N), such as northern Europe and Canada, which show similar seasonal variability.
Regions with milder seasonal changes may still benefit from the model’s ability to handle uncertainty, although seasonal adaptability is less important in those areas. Initial testing with data from the Baltic states shows prediction accuracy within 5–10% of Lithuania’s results, suggesting the method is transferable. However, applying the model to very different climates, such as equatorial or desert regions, would require adjustments. A significant limitation of this study is the limited access to photovoltaic (PV) data from other regions. Expanding validation to a broader range of climates is an important direction for future research.
Our study and [13] share the same geographical focus on Lithuania but cover different periods (2023–2024 versus 2019–2020), allowing a direct comparison of the advancement in forecasting methodologies. While [13] achieved moderate success using LSTM variations (R2 scores between 0.72 and 0.87, with a best performance of R2 = 0.927 in August), our hybrid framework combining GT2 fuzzy logic with advanced neural architectures demonstrates significant improvements. Our F-BiLSTM-Time2Vec configuration not only achieved higher accuracy (approximately 99% in favorable seasons) but also exhibited remarkable stability across seasons (spring/summer: nRMSE = 1.188%, NMAE = 0.813%, WMAE = 3.013%; fall/winter: nRMSE = 1.340%, NMAE = 0.807%, WMAE = 22.149%), which represents a substantial improvement over the 0.3% gain from multivariate inputs reported in [13] and addresses their noted winter performance limitations. Integrating GT2-FL with the BiLSTM architecture and Time2Vec encoding has effectively addressed the complex seasonal patterns of Lithuania’s maritime-continental climate, suggesting a significant advancement in renewable energy forecasting methodology. These improvements offer promising implications for grid management and energy planning applications while opening new avenues for future regional adaptation and system integration research.
Figure 10 compares model performance using three error metrics (nRMSE, NMAE, and WMAE), with results broken down into spring and summer, fall and winter, and the overall average. The analysis reveals a significant advantage for models trained on General Type-2 fuzzified datasets (indicated by the “F-” prefix) over those using the original datasets. The F-models, such as F-SCINet-Time2Vec, F-BiLSTM-Time2Vec, F-SCINet-VMD, and F-BiLSTM-VMD, consistently achieve lower error rates across all metrics, demonstrating the effectiveness of fuzzy data preprocessing. This improvement through General Type-2 fuzzification is particularly evident in F-BiLSTM-Time2Vec, which shows remarkably low error rates in the nRMSE and NMAE metrics. The consistently superior performance of the F-models across different seasons suggests that General Type-2 fuzzification provides a more robust approach to handling data uncertainties and variations than the original, non-fuzzified datasets. This pattern indicates that incorporating fuzzy logic in dataset preprocessing can significantly enhance model prediction accuracy.

6. Conclusions

Our study introduces significant advancements in time series forecasting by developing and evaluating hybrid frameworks that combine General Type-2 Fuzzy logic systems with advanced neural architectures. We tested four distinct hybrid frameworks using data from Lithuania (2023–2024). We demonstrated the effectiveness of different architectural combinations in handling complex temporal patterns and seasonal variations. The fuzzified BiLSTM with Time2Vec (F-BiLSTM-Time2Vec) emerged as the most effective configuration, achieving exceptional accuracy with notably low error rates during spring and summer (nRMSE = 1.188%, NMAE = 0.813%, WMAE = 3.013%). Distinctively, the model maintained stable performance even during the challenging fall and winter periods (nRMSE = 1.340%, NMAE = 0.807%, WMAE = 22.149%). This hybrid approach, combining fuzzy logic with BiLSTM architecture and Time2Vec encoding, demonstrates consistently high accuracy levels of around 99% during favorable seasons, providing stability crucial for practical applications in grid management and energy planning. Our analysis reveals several key insights about the relative effectiveness of different components. The BiLSTM-based architectures consistently outperformed SCINet models in handling the complex seasonal patterns of Lithuania’s maritime–continental climate.
While both preprocessing techniques showed promise, Time2Vec encoding proved particularly effective when enhanced with fuzzification, surpassing the moderate effectiveness of VMD preprocessing. Integrating General Type-2 Fuzzy logic significantly improved the model’s ability to handle uncertainties and complex patterns inherent in seasonal data. Our hybrid methodology’s robust performance has important implications for renewable energy forecasting and grid integration. The ability to maintain high accuracy levels, particularly during challenging seasonal transitions, suggests that this approach could significantly improve the reliability of forecasting systems across various applications. While this study focuses on Lithuanian conditions, the insights gained provide valuable guidance for developing robust forecasting systems across different geographical contexts. This research opens several promising avenues for future investigation, including adapting these models to other regions, exploring additional preprocessing techniques, and potentially integrating with other forecasting systems. The demonstrated success of combining an appropriate architecture choice with temporal encoding and fuzzification suggests that this approach could significantly enhance forecasting accuracy across various domains where reliable prediction is crucial. These findings represent a meaningful step toward developing more accurate and dependable forecasting systems with potential applications that extend well beyond the specific context of this study. In the test set evaluation, F-BiLSTM-Time2Vec maintained superior NMAE (0.834%) and WMAE (3.099%) metrics while showing competitive nRMSE (1.532%) performance. The consistency of F-BiLSTM-Time2Vec across seasons makes it valuable for grid operators who prioritize year-round reliability, though each model offers distinct advantages in different scenarios.
Future work should explore geographic generalization and real-time deployment, as the success of both fuzzification and temporal encoding approaches advances robust forecasting in renewable energy applications.
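For reproducibility, the error metrics reported above can be computed with short formulas. The sketch below assumes nRMSE and NMAE are normalized by a fixed reference value (e.g., installed capacity) and WMAE by cumulative actual generation; these are common conventions that may differ in detail from the exact definitions used in this study.

```python
import math

def nrmse(actual, pred, norm):
    """Normalized RMSE (%): RMSE divided by a fixed reference value `norm`."""
    rmse = math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, pred)) / len(actual))
    return 100.0 * rmse / norm

def nmae(actual, pred, norm):
    """Normalized MAE (%): MAE divided by the same reference value."""
    mae = sum(abs(a - p) for a, p in zip(actual, pred)) / len(actual)
    return 100.0 * mae / norm

def wmae(actual, pred):
    """Weighted MAE (%): total absolute error relative to total actual generation.
    Under this convention, small cumulative winter generation in the denominator
    inflates F&W values, consistent with Tables 3 and 4."""
    return 100.0 * sum(abs(a - p) for a, p in zip(actual, pred)) / sum(actual)

# Toy example: four hourly power values (kW) against a 50 kW reference capacity.
actual, pred = [10.0, 20.0, 30.0, 40.0], [12.0, 18.0, 33.0, 39.0]
print(nrmse(actual, pred, 50.0), nmae(actual, pred, 50.0), wmae(actual, pred))
```

The WMAE definition in particular explains why winter WMAE values far exceed nRMSE and NMAE for the same months.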

Author Contributions

Conceptualization, N.M.L. and D.N.; methodology, N.M.L. and D.N.; software, N.M.L.; validation, N.M.L., D.N., and G.K.; formal analysis, G.K.; investigation, N.M.L.; resources, D.N. and V.R.; data curation, D.N. and V.R.; writing—original draft preparation, N.M.L.; writing—review and editing, N.M.L., D.N., G.K., and V.R.; visualization, N.M.L. and G.K.; supervision, D.N.; project administration, V.R.; funding acquisition, D.N. and V.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by a grant (ATE-No. P-LU-24-110) from the Research Council of Lithuania.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The meteorological data used in this study were obtained from the Lithuanian Hydrometeorological Service (LHMT) through their API (https://api.meteo.lt/) (accessed on 5 September 2024).
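A minimal sketch of retrieving hourly observations from this API is shown below. The endpoint path and the station code "kauno-ams" are assumptions based on the public API documentation and should be verified against its current version.

```python
# Sketch of pulling hourly observations from the LHMT API (https://api.meteo.lt/).
# The endpoint path and the station code "kauno-ams" are assumptions and should
# be checked against the current API documentation.
import json
from urllib.request import urlopen

BASE_URL = "https://api.meteo.lt/v1"

def observations_url(station: str, date: str) -> str:
    """Build the URL for one station's observations on one day (YYYY-MM-DD)."""
    return f"{BASE_URL}/stations/{station}/observations/{date}"

def fetch_observations(station: str, date: str) -> dict:
    """Download and decode one day of observations (requires network access)."""
    with urlopen(observations_url(station, date)) as resp:
        return json.load(resp)

if __name__ == "__main__":
    print(observations_url("kauno-ams", "2024-06-16"))
```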

Acknowledgments

The authors acknowledge the Lithuanian Hydrometeorological Service (LHMT) for providing access to meteorological data through their API. We also thank the Smart Grids and Renewable Energy Laboratory at the Lithuanian Energy Institute for providing computational resources and technical support.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the study’s design; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
| Abbreviation | Full Form |
|---|---|
| ADF | Augmented Dickey–Fuller |
| AI | Artificial Intelligence |
| ANN | Artificial Neural Network |
| ANOVA | Analysis of Variance |
| API | Application Programming Interface |
| ARIMA | Autoregressive Integrated Moving Average |
| BiGRU | Bidirectional Gated Recurrent Unit |
| BiLSTM | Bidirectional Long Short-Term Memory |
| CEEMDAN | Complete Ensemble Empirical Mode Decomposition with Adaptive Noise |
| CNN | Convolutional Neural Network |
| CPV | Concentrated Photovoltaic |
| CSP | Concentrated Solar Power |
| DHI | Diffuse Horizontal Irradiance |
| DL | Deep Learning |
| DNN | Deep Neural Network |
| DNI | Direct Normal Irradiance |
| EMD | Empirical Mode Decomposition |
| EU | European Union |
| F&W | Fall and Winter |
| FL | Fuzzy Logic |
| FOU | Footprint of Uncertainty |
| GAN | Generative Adversarial Network |
| GHG | Greenhouse Gas |
| GHI | Global Horizontal Irradiance |
| GRU | Gated Recurrent Unit |
| GT2-FL | General Type-2 Fuzzy Logic |
| IMF | Intrinsic Mode Function |
| KPSS | Kwiatkowski–Phillips–Schmidt–Shin |
| LHMT | Lithuanian Hydrometeorological Service |
| LSTM | Long Short-Term Memory |
| MAE | Mean Absolute Error |
| MAPE | Mean Absolute Percentage Error |
| MRE | Mean Relative Error |
| NARX | Non-linear Autoregressive Exogenous |
| NMAE | Normalized Mean Absolute Error |
| nRMSE | Normalized Root Mean Square Error |
| PSO | Particle Swarm Optimization |
| PV | Photovoltaic |
| RE | Renewable Energy |
| RMSE | Root Mean Square Error |
| RNN | Recurrent Neural Network |
| S&S | Spring and Summer |
| SCINet | Sample Convolution and Interactive Neural Network |
| SVM | Support Vector Machine |
| T1-FL | Type-1 Fuzzy Logic |
| T1-FS | Type-1 Fuzzy Set |
| T2-FL | Type-2 Fuzzy Logic |
| T2-FS | Type-2 Fuzzy Set |
| TCN | Temporal Convolutional Network |
| Time2Vec | Time to Vector |
| TWh | Terawatt hours |
| UHD | Ultra High Definition |
| VMD | Variational Mode Decomposition |
| WMAE | Weighted Mean Absolute Error |
| WOA | Whale Optimization Algorithm |

References

  1. Paramati, S.R.; Shahzad, U.; Doğan, B. The role of environmental technology for energy demand and energy efficiency: Evidence from OECD countries. Renew. Sustain. Energy Rev. 2022, 153, 111735. [Google Scholar] [CrossRef]
  2. Paraschiv, L.S.; Paraschiv, S. Contribution of renewable energy (hydro, wind, solar and biomass) to decarbonization and transformation of the electricity generation sector for sustainable development. Energy Rep. 2023, 9, 535–544. [Google Scholar] [CrossRef]
  3. Foley, A.M.; McIlwaine, N.; Morrow, D.J.; Hayes, B.P.; Zehir, M.A.; Mehigan, L.; Papari, B.; Edrington, C.S.; Baran, M. A critical evaluation of grid stability and codes, energy storage and smart loads in power systems with wind generation. Energy 2020, 205, 117671. [Google Scholar] [CrossRef]
  4. Simsek, Y.; Santika, W.G.; Anisuzzaman, M.; Urmee, T.; Bahri, P.A.; Escobar, R. An analysis of additional energy requirement to meet the sustainable development goals. J. Clean. Prod. 2020, 272, 122646. [Google Scholar] [CrossRef]
  5. Eurostat. Renewable Energy Statistics. Available online: https://ec.europa.eu/eurostat/statistics-explained/index.php?title=Renewable_energy_statistics (accessed on 4 July 2025).
  6. Sharma, N.; Sharma, P.; Irwin, D.; Shenoy, P. Predicting solar generation from weather forecasts using machine learning. In Proceedings of the IEEE International Conference on Smart Grid Communications (SmartGridComm), Brussels, Belgium, 17–20 October 2011; pp. 528–533. [Google Scholar] [CrossRef]
  7. Nwagu, C.N.; Ujah, C.O.; Kallon, D.V.; Aigbodion, V.S. Integrating solar and wind energy into the electricity grid for improved power accessibility. Unconv. Resour. 2025, 5, 100129. [Google Scholar] [CrossRef]
  8. Cocco, D.; Migliari, L.; Petrollese, M. A hybrid CSP–CPV system for improving the dispatchability of solar power plants. Energy Convers. Manag. 2016, 114, 312–323. [Google Scholar] [CrossRef]
  9. Rodríguez, F.; Fleetwood, A.; Galarza, A.; Fontán, L. Predicting solar energy generation through artificial neural networks using weather forecasts for microgrid control. Renew. Energy 2018, 126, 855–864. [Google Scholar] [CrossRef]
  10. Moser Baer Solar. Solar PV Monitoring: Maximizing Performance Through Real-Time Analytics. Available online: https://www.moserbaersolar.com/maintenance-and-performance-optimization/solar-pv-monitoring-maximizing-performance-through-real-time-analytics/ (accessed on 4 July 2025).
  11. Rodriguez, J.; Pontt, J.; Correa, P.; Lezana, P.; Cortes, P. Predictive power control of an AC/DC/AC converter. In Proceedings of the 40th IAS Annual Meeting Conference Record of the Industry Applications Conference, Hong Kong, China, 2–6 October 2005; pp. 934–939. [Google Scholar]
  12. Narkutė, V. Forecast of Energy Produced by Solar Power Plants. Ph.D. Thesis, Vilniaus Universitetas, Vilnius, Lithuania, 2022. [Google Scholar]
  13. Mellit, A.; Massi Pavan, A.; Ogliari, E.; Leva, S.; Lughi, V. Advanced methods for photovoltaic output power forecasting: A review. Appl. Sci. 2020, 10, 487. [Google Scholar] [CrossRef]
  14. Sobri, S.; Koohi-Kamali, S.; Rahim, N.A. Solar photovoltaic generation forecasting methods: A review. Energy Convers. Manag. 2018, 156, 459–497. [Google Scholar] [CrossRef]
  15. Kudo, M.; Takeuchi, A.; Nozaki, Y.; Endo, H.; Sumita, J. Forecasting electric power generation in a photovoltaic power system for an energy network. Electr. Eng. Jpn. 2009, 167, 16–23. [Google Scholar] [CrossRef]
  16. Sen, Z. Fuzzy algorithm for estimation of solar irradiation from sunshine duration. Sol. Energy 1998, 63, 39–49. [Google Scholar] [CrossRef]
  17. Gautam, N.K.; Kaushika, N.D. A model for the estimation of global solar radiation using fuzzy random variables. J. Appl. Meteorol. Climatol. 2002, 41, 1267–1276. [Google Scholar] [CrossRef]
  18. Rizwan, M.; Jamil, M.; Kirmani, S.; Kothari, D.P. Fuzzy logic-based modeling and estimation of global solar energy using meteorological parameters. Energy 2014, 70, 685–691. [Google Scholar] [CrossRef]
  19. Nalbant, K.G.; Özdemir, Ş.; Özdemir, Y. Evaluating the campus climate factors using an interval type-2 fuzzy ANP. Sigma J. Eng. Nat. Sci. 2024, 42, 89–98. [Google Scholar] [CrossRef]
  20. Mendel, J.M.; John, R.B. Type-2 fuzzy sets made simple. IEEE Trans. Fuzzy Syst. 2002, 10, 117–127. [Google Scholar] [CrossRef]
  21. Hinton, G.E.; Osindero, S.; Teh, Y. A fast learning algorithm for deep belief nets. Neural Comput. 2006, 18, 1527–1554. [Google Scholar] [CrossRef]
  22. Jlidi, M.; Hamidi, F.; Barambones, O.; Abbassi, R.; Jerbi, H.; Aoun, M.; Karami-Mollaee, A. An artificial neural network for solar energy prediction and control using Jaya-SMC. Electronics 2023, 12, 592. [Google Scholar] [CrossRef]
  23. Zhidan, L.; Wei, G.; Qingfu, L.; Zhengjun, Z. A hybrid model for financial time-series forecasting based on mixed methodologies. Expert Syst. 2021, 38, e12633. [Google Scholar]
  24. Ni, Z.; Zhang, C.; Karlsson, M.; Gong, S. A study of deep learning-based multi-horizon building energy forecasting. Energy Build. 2024, 303, 113810. [Google Scholar] [CrossRef]
  25. Bai, S.; Kolter, J.Z.; Koltun, V. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv 2018, arXiv:1803.01271. [Google Scholar] [CrossRef]
  26. Li, Y.; Song, L.; Zhang, S.; Kraus, L.; Adcox, T.; Willardson, R.; Komandur, A.; Lu, N. A TCN-based hybrid forecasting framework for hours-ahead utility-scale PV forecasting. IEEE Trans. Smart Grid 2023, 14, 4073–4085. [Google Scholar] [CrossRef]
  27. Kong, X.; Du, X.; Xu, Z.; Xue, G. Predicting solar radiation for space heating with thermal storage system based on temporal convolutional network-attention model. Appl. Therm. Eng. 2023, 219, 119574. [Google Scholar] [CrossRef]
  28. Liu, M.; Qin, H.; Cao, R.; Deng, S. Short-term load forecasting based on improved TCN and DenseNet. IEEE Access 2022, 10, 115945–115957. [Google Scholar] [CrossRef]
  29. Limouni, T.; Yaagoubi, R.; Bouziane, K.; Guissi, K.; Baali, E.H. Accurate one step and multistep forecasting of very short-term PV power using LSTM-TCN model. Renew. Energy 2023, 205, 1010–1024. [Google Scholar]
  30. Zhao, Y.; Peng, X.; Tu, T.; Li, Z.; Yan, P.; Li, C. WOA-VMD-SCINet: Hybrid model for accurate prediction of ultra-short-term Photovoltaic generation power considering seasonal variations. Energy Rep. 2024, 12, 3470–3487. [Google Scholar] [CrossRef]
  31. Huang, X.; Li, Q.; Tai, Y.; Chen, Z.; Liu, J.; Shi, J.; Liu, W. Time series forecasting for hourly photovoltaic power using conditional generative adversarial network and Bi-LSTM. Energy 2022, 246, 123403. [Google Scholar] [CrossRef]
  32. Zhang, C.; Peng, T.; Nazir, M.S. A novel integrated photovoltaic power forecasting model based on variational mode decomposition and CNN-BiGRU considering meteorological variables. Electr. Power Syst. Res. 2022, 213, 108796. [Google Scholar] [CrossRef]
  33. Trong, T.N.; Son, H.V.X.; Dinh, H.D.; Takano, H.; Duc, T.N. Short-term PV power forecast using hybrid deep learning model and Variational Mode Decomposition. Energy Rep. 2023, 9, 712–717. [Google Scholar] [CrossRef]
  34. Wang, X.; Ma, W. A hybrid deep learning model with an optimal strategy based on improved VMD and transformer for short-term photovoltaic power forecasting. Energy 2024, 295, 131071. [Google Scholar] [CrossRef]
  35. Peng, S.; Zhu, J.; Wu, T.; Yuan, C.; Cang, J.; Zhang, K.; Pecht, M. Prediction of wind and PV power by fusing the multi-stage feature extraction and a PSO-BiLSTM model. Energy 2024, 298, 131345. [Google Scholar] [CrossRef]
  36. Voyant, C.; Notton, G.; Kalogirou, S.; Nivet, M.L.; Paoli, C.; Motte, F.; Fouilloy, A. Machine learning methods for solar radiation forecasting: A review. Renew. Energy 2017, 105, 569–582. [Google Scholar] [CrossRef]
  37. Chaturvedi, D.K.; Isha, I. Solar power forecasting: A review. Int. J. Comput. Appl. 2016, 140, 28–50. [Google Scholar] [CrossRef]
  38. Rana, M.; Rahman, A. Multiple steps ahead solar photovoltaic power forecasting based on univariate machine learning models and data re-sampling. Sustain. Energy Grids Netw. 2020, 21, 100286. [Google Scholar] [CrossRef]
  39. Wang, K.; Qi, X.; Liu, H. A comparison of day-ahead photovoltaic power forecasting models based on deep learning neural network. Appl. Energy 2019, 251, 113315. [Google Scholar] [CrossRef]
  40. Gao, M.; Li, J.; Hong, F.; Long, D. Day-ahead power forecasting in a large-scale photovoltaic plant based on weather classification using LSTM. Energy 2019, 187, 115838. [Google Scholar] [CrossRef]
  41. Qing, X.; Niu, Y. Hourly day-ahead solar irradiance prediction using weather forecasts by LSTM. Energy 2018, 148, 461–468. [Google Scholar] [CrossRef]
  42. Cabello-López, T.; Carranza-García, M.; Riquelme, J.C.; García-Gutiérrez, J. Forecasting solar energy production in Spain: A comparison of univariate and multivariate models at the national level. Appl. Energy 2023, 350, 121645. [Google Scholar] [CrossRef]
  43. Gao, B.; Huang, X.; Shi, J.; Tai, Y.; Zhang, J. Hourly forecasting of solar irradiance based on CEEMDAN and multi-strategy CNN-LSTM neural networks. Renew. Energy 2020, 162, 1665–1683. [Google Scholar] [CrossRef]
  44. Liu, M.; Zeng, A.; Chen, M.; Xu, Z.; Lai, Q.; Ma, L.; Xu, Q. Scinet: Time series modeling and forecasting with sample convolution and interaction. Adv. Neural Inf. Process. Syst. 2022, 35, 5816–5828. [Google Scholar]
  45. Li, F.F.; Wang, S.Y.; Wei, J.H. Long term rolling prediction model for solar radiation combining empirical mode decomposition (EMD) and artificial neural network (ANN) techniques. J. Renew. Sustain. Energy 2018, 10, 1. [Google Scholar] [CrossRef]
  46. Lanbaran, N.M.; Celik, E.; Yigider, M. Evaluation of investment opportunities with interval-valued fuzzy TOPSIS method. Appl. Math. Nonlinear Sci. 2020, 5, 461–474. [Google Scholar] [CrossRef]
  47. Melin, P.; Castillo, O. A review on type-2 fuzzy logic applications in clustering, classification and pattern recognition. Appl. Soft Comput. 2014, 21, 568–577. [Google Scholar] [CrossRef]
  48. Mittal, K.; Jain, A.; Vaisla, K.S.; Castillo, O.; Kacprzyk, J. A comprehensive review on type 2 fuzzy logic applications: Past, present and future. Eng. Appl. Artif. Intell. 2020, 95, 103916. [Google Scholar] [CrossRef]
  49. Juang, C.-F.; Huang, R.-B.; Lin, Y.-Y. A recurrent self-evolving interval type-2 fuzzy neural network for dynamic system processing. IEEE Trans. Fuzzy Syst. 2009, 17, 1092–1105. [Google Scholar] [CrossRef]
  50. Geng, D.; Wang, B.; Gao, Q. A hybrid photovoltaic/wind power prediction model based on Time2Vec, WDCNN and BiLSTM. Energy Convers. Manag. 2023, 291, 117342. [Google Scholar] [CrossRef]
  51. Zosso, D.; Dragomiretskiy, K.; Bertozzi, A.L.; Weiss, P.S. Two-dimensional compact variational mode decomposition: Spatially compact and spectrally sparse image decomposition and segmentation. J. Math. Imaging Vis. 2017, 58, 294–320. [Google Scholar] [CrossRef]
  52. Guermoui, M.; Fezzani, A.; Mohamed, Z.; Rabehi, A.; Ferkous, K.; Bailek, N.; Ghoneim, S.S.M. An analysis of case studies for advancing photovoltaic power forecasting through multi-scale fusion techniques. Sci. Rep. 2024, 14, 6653. [Google Scholar] [CrossRef] [PubMed]
  53. Efimov, V.M.; Reznik, A.L.; Bondarenko, Y.V. Increasing the sine-cosine transformation accuracy in signal approximation and interpolation. Optoelectron. Instrum. Data Process. 2008, 44, 218–227. [Google Scholar] [CrossRef]
  54. Niu, Z.; Zhong, G.; Yu, H. A review on the attention mechanism of deep learning. Neurocomputing 2021, 452, 48–62. [Google Scholar] [CrossRef]
  55. Lin, J.; Ma, J.; Zhu, J.; Cui, Y. Short-term load forecasting based on LSTM networks considering attention mechanism. Int. J. Electr. Power Energy Syst. 2022, 137, 107818. [Google Scholar] [CrossRef]
  56. Yang, M.; Wang, J. Adaptability of financial time series prediction based on BiLSTM. Procedia Comput. Sci. 2022, 199, 18–25. [Google Scholar] [CrossRef]
  57. Parri, S.; Teeparthi, K. VMD-SCINet: A hybrid model for improved wind speed forecasting. Earth Sci. Inform. 2024, 17, 329–350. [Google Scholar] [CrossRef]
  58. Dong, P.; Zuo, Y.; Chen, L.; Li, S.; Zhang, L. VSNT: Adaptive Network Traffic Forecasting via VMD and Deep Learning with SCI-Block and Attention Mechanism. In Proceedings of the 2023 IEEE International Conference on Parallel and Distributed Processing with Applications, Big Data and Cloud Computing, Sustainable Computing and Communications, Social Computing and Networks (ISPA/BDCloud/SocialCom/SustainCom), Haikou, China, 21–23 December 2023; pp. 104–109. [Google Scholar]
  59. Leva, S.; Dolara, A.; Grimaccia, F.; Mussetta, M.; Ogliari, E. Analysis and validation of 24 hours ahead neural network forecasting of photovoltaic output power. Math. Comput. Simul. 2017, 131, 88–100. [Google Scholar] [CrossRef]
  60. Arltová, M.; Fedorová, D. Selection of Unit Root Test on the Basis of Length of the Time Series and Value of AR (1) Parameter. Stat. Stat. Econ. J. 2016, 96, 4–17. [Google Scholar]
  61. Musbah, H.; Aly, H.H.; Little, T.A. A proposed novel adaptive DC technique for non-stationary data removal. Heliyon 2023, 9, 3. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Type-1 Fuzzy Gaussian membership function, developed by the authors.
Figure 2. General Type-2 Fuzzy membership function, developed by the authors.
Figure 3. Data preprocessing methodology flowchart, developed by the authors.
Figure 4. Correlation between meteorological variables and PV power output, developed by the authors.
Figure 5. Overview of the hybrid forecasting methodology showing data preprocessing, model architecture, and evaluation framework.
Figure 6. Actual versus predicted solar power generation for BiLSTM-Time2Vec and F-BiLSTM models during S&S and F&W periods.
Figure 7. Actual versus predicted solar power generation for SCINet-Time2Vec and F-SCINet-Time2Vec models during S&S and F&W periods.
Figure 8. Actual versus predicted solar power generation for BiLSTM-VMD and F-BiLSTM-VMD models during S&S and F&W periods.
Figure 9. Actual versus predicted solar power generation for SCINet-VMD and F-SCINet-VMD models during S&S and F&W periods.
Figure 10. Comparative performance analysis of all model variants across nRMSE, NMAE, and WMAE metrics for spring/summer, fall/winter, and overall periods.
Table 1. Comparative Overview of Solar Forecasting Approaches.

| Approach Category | Key Methods | Limitations | Best Performance | Reference |
|---|---|---|---|---|
| Statistical Models | ARIMA, SVM, Random Forest | Limited handling of non-linear patterns | 15% MRE (Random Forest), 17.70% MAPE (ARIMA) | [36,37,38] |
| Classical Neural Networks | ANN, CNN, DNN | Difficulty with temporal dependencies | R² = 0.93 (ANN monthly), R = 0.92 (NARX NN) | [22,39] |
| Recurrent Architectures | LSTM, BiLSTM, GRU | Computational complexity, gradient issues | 5% nRMSE (LSTM), 7% nMAE (LSTM), 8% nMAE (GRU) | [24,40,41] |
| Hybrid Deep Learning | CNN-LSTM, TCN-DenseNet | High computational requirements | 23.38% MAPE reduction (TCN-DN), 38.49% error rate (CEEM-CNN-LSTM) | [28,42,43] |
| Advanced Architectures | SCINet, Transformer | Complex hyperparameter tuning | 13.7% MAE (VMD-Transformer), 0.619–1.149 kW RMSE (WOA-VMD-SCINet) | [33,34,44] |
| Signal Processing | VMD, EMD, CEEMDAN | Preprocessing complexity | 93.0% correlation (monthly), 69.8% correlation (daily), 29.05% RMSE, 4.157 RMSE | [32,43,45] |

Note: Lower values indicate better performance for error metrics (MAPE, MRE, nRMSE, nMAE, RMSE, MAE, Error Rate). Higher values indicate better performance for accuracy metrics (R², correlation coefficients).
Table 2. Optimized hyperparameters used across all model variants.

| Parameter Category | Parameter | Value |
|---|---|---|
| Data Configuration | Look-Back Window | 24 |
| | Forecast Horizon | 1 |
| | Training Period | 1 January 2023 08:00–5 August 2023 18:00 |
| | Validation Period | 6 August 2023 03:00–15 June 2024 19:00 |
| | Testing Period | 16 June 2024 02:00–31 August 2024 17:00 |
| Training Parameters | Batch Size | 42 |
| | Learning Rate | 1.00 × 10⁻³ |
| | Weight Decay | 1.00 × 10⁻⁵ |
| | Loss Function | 0.4·MSE + 0.3·MAE + 0.3·Huber |
| | Epochs | 100 |
| Architecture | Main Units | [512, 256, 128, 64] |
| | Time2Vec Models | kernel_size = 64; Dropout: LSTM 0.2, Dense 0.1 |
| | VMD Models | K = 3 modes; Dropout: LSTM 0.2, Dense 0.1 |
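The composite training loss listed above (0.4·MSE + 0.3·MAE + 0.3·Huber) can be sketched in plain Python as follows; the Huber threshold delta = 1.0 is an assumption, since the table does not specify it.

```python
# Sketch of the composite loss 0.4*MSE + 0.3*MAE + 0.3*Huber from Table 2.
# The Huber threshold delta = 1.0 is an assumption; the table does not give it.

def huber(e: float, delta: float = 1.0) -> float:
    """Huber penalty: quadratic near zero, linear for large errors."""
    a = abs(e)
    return 0.5 * e * e if a <= delta else delta * (a - 0.5 * delta)

def composite_loss(actual, pred, delta: float = 1.0) -> float:
    """Weighted blend of MSE, MAE, and mean Huber loss over all errors."""
    errs = [a - p for a, p in zip(actual, pred)]
    n = len(errs)
    mse = sum(e * e for e in errs) / n
    mae = sum(abs(e) for e in errs) / n
    hub = sum(huber(e, delta) for e in errs) / n
    return 0.4 * mse + 0.3 * mae + 0.3 * hub

print(composite_loss([1.0, 2.0], [1.5, 0.0]))  # 0.4*2.125 + 0.3*1.25 + 0.3*0.8125 = 1.46875 (up to rounding)
```

Blending a quadratic term (MSE) with robust terms (MAE, Huber) balances sensitivity to large midday errors against robustness to occasional outliers.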
Table 3. Performance Metrics Comparison of BiLSTM Variants. Each cell lists nRMSE / NMAE / WMAE (%).

| Date | F-BiLSTM-Time2Vec | BiLSTM-Time2Vec | F-BiLSTM-VMD | BiLSTM-VMD |
|---|---|---|---|---|
| S&S | | | | |
| 2023-04 | 1.0995 / 0.7467 / 3.2800 | 1.2082 / 0.8457 / 3.7149 | 1.2012 / 0.8110 / 3.5623 | 1.9509 / 1.3322 / 5.8516 |
| 2023-05 | 1.3551 / 0.9453 / 2.7461 | 1.2352 / 0.8606 / 2.5002 | 1.1321 / 0.7785 / 2.2615 | 2.1012 / 1.5118 / 4.3918 |
| 2023-06 | 1.1948 / 0.8375 / 2.8634 | 1.3627 / 0.9362 / 3.2006 | 1.2164 / 0.8207 / 2.8061 | 1.9619 / 1.3692 / 4.6813 |
| 2023-07 | 1.1574 / 0.8170 / 2.8759 | 1.5974 / 1.1387 / 4.0082 | 1.2951 / 0.9215 / 3.2438 | 1.9152 / 1.3519 / 4.7586 |
| 2023-08 | 0.9719 / 0.6816 / 2.6613 | 1.6710 / 1.1600 / 4.5291 | 1.5871 / 1.0900 / 4.2557 | 2.1316 / 1.5195 / 5.9327 |
| 2023-09 | 1.0987 / 0.7432 / 2.4083 | 2.2094 / 1.6347 / 5.2975 | 1.6899 / 1.1565 / 3.7477 | 2.9258 / 2.1495 / 6.9659 |
| 2024-04 | 1.2697 / 0.8714 / 4.5141 | 1.6860 / 1.2171 / 6.3050 | 1.5477 / 1.0748 / 5.5677 | 2.3897 / 1.6987 / 8.7997 |
| 2024-05 | 1.1912 / 0.8233 / 2.5346 | 1.2658 / 0.8803 / 2.7101 | 1.1764 / 0.8029 / 2.4719 | 2.1061 / 1.5092 / 4.6463 |
| 2024-06 | 1.1514 / 0.7842 / 3.0696 | 1.4826 / 1.0134 / 3.9670 | 1.3484 / 0.9017 / 3.5297 | 1.9505 / 1.3389 / 5.2409 |
| 2024-07 | 1.6030 / 0.9706 / 3.7529 | 2.0055 / 1.2217 / 4.7240 | 1.5110 / 1.0423 / 4.0302 | 2.2931 / 1.5486 / 5.9878 |
| 2024-08 | 0.9720 / 0.7169 / 2.4385 | 1.5632 / 1.0999 / 3.7412 | 1.5014 / 1.0646 / 3.6210 | 2.1073 / 1.5156 / 5.1549 |
| F&W | | | | |
| 2023-01 | 0.9410 / 0.4999 / 28.4122 | 1.3034 / 0.7086 / 40.2726 | 1.2342 / 0.7069 / 40.1770 | 1.5645 / 0.8865 / 50.3838 |
| 2023-02 | 1.0867 / 0.7183 / 8.0396 | 1.7122 / 1.1475 / 12.8430 | 1.4119 / 0.9855 / 11.0305 | 2.0110 / 1.3836 / 15.4861 |
| 2023-03 | 0.8935 / 0.6154 / 3.1645 | 1.6153 / 1.2130 / 6.2379 | 1.1976 / 0.8437 / 4.3386 | 2.0256 / 1.4361 / 7.3851 |
| 2023-10 | 2.1331 / 1.4092 / 10.7094 | 2.7625 / 1.8522 / 14.0767 | 2.7239 / 1.7460 / 13.2694 | 3.7712 / 2.4648 / 18.7321 |
| 2023-11 | 2.7027 / 1.7211 / 36.1714 | 3.0977 / 1.9351 / 40.6689 | 3.8945 / 2.3198 / 48.7539 | 4.4511 / 2.6487 / 55.6667 |
| 2023-12 | 1.5820 / 0.6387 / 87.4059 | 2.3255 / 0.9315 / 127.4846 | 1.7999 / 0.7049 / 96.4637 | 1.8740 / 0.7226 / 98.8857 |
| 2024-01 | 0.6196 / 0.2542 / 12.3043 | 1.1701 / 0.4833 / 23.3938 | 0.9579 / 0.4090 / 19.7990 | 1.2848 / 0.5376 / 26.0215 |
| 2024-02 | 1.0537 / 0.7018 / 9.5734 | 1.8725 / 1.2583 / 17.1650 | 1.6827 / 1.1407 / 15.5608 | 2.2131 / 1.5262 / 20.8197 |
| 2024-03 | 1.0447 / 0.7087 / 3.5622 | 1.5149 / 1.1040 / 5.5493 | 1.3168 / 0.9199 / 4.6240 | 2.1507 / 1.5578 / 7.8308 |
| Averages | | | | |
| Ave of S&S | 1.1877 / 0.8125 / 3.0132 | 1.5715 / 1.0917 / 4.0634 | 1.3824 / 0.9513 / 3.5543 | 2.1667 / 1.5314 / 5.6738 |
| Ave of F&W | 1.3397 / 0.8075 / 22.1492 | 1.9305 / 1.1815 / 31.9658 | 1.8022 / 1.0863 / 28.2241 | 2.3718 / 1.4627 / 33.4679 |
| Entire Ave | 1.2561 / 0.8103 / 11.6244 | 1.7331 / 1.1321 / 16.6195 | 1.5713 / 1.0120 / 14.6557 | 2.2590 / 1.5005 / 18.1812 |
| Test Set | 1.5320 / 0.8340 / 3.0990 | 2.6973 / 1.3392 / 4.9780 | 1.724 / 1.0630 / 3.9520 | 2.9962 / 1.4470 / 3.625 |
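The Time2Vec encoding used by the better-performing variants maps a scalar timestamp to one linear feature plus several periodic features (Kazemi et al., 2019). A minimal sketch follows; the frequencies and phases shown are illustrative hand-picked daily harmonics, whereas in the actual models they are learned during training.

```python
# Minimal Time2Vec sketch: t2v(tau)[0] is linear, the remaining components
# pass through sin(). The omega/phi values below are illustrative only;
# in the paper's models they are trainable parameters.
import math

def time2vec(tau: float, omega: list, phi: list) -> list:
    """t2v(tau)[0] = omega[0]*tau + phi[0]; t2v(tau)[i] = sin(omega[i]*tau + phi[i])."""
    out = [omega[0] * tau + phi[0]]
    out += [math.sin(omega[i] * tau + phi[i]) for i in range(1, len(omega))]
    return out

# Hour-of-day 6 encoded with harmonics of period 24 h and 12 h:
print(time2vec(6.0, [1.0, math.pi / 12, math.pi / 6], [0.0, 0.0, 0.0]))
```

Because the periodic components can represent daily and seasonal cycles directly, the network does not have to infer them from the raw timestamp, which is one plausible reason the Time2Vec variants outperform the VMD variants in Table 3.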
Table 4. Performance Metrics Comparison of SCINet Variants. Each cell lists nRMSE / NMAE / WMAE (%).

| Date | F-SCINet-Time2Vec | SCINet-Time2Vec | F-SCINet-VMD | SCINet-VMD |
|---|---|---|---|---|
| S&S | | | | |
| 2023-04 | 4.1863 / 2.4873 / 10.9255 | 3.7030 / 2.2168 / 9.7375 | 4.5508 / 3.2229 / 14.1568 | 4.3533 / 3.1900 / 14.0123 |
| 2023-05 | 4.9930 / 3.2968 / 9.5777 | 4.1448 / 2.7157 / 7.8893 | 5.0733 / 3.5023 / 10.1745 | 5.0824 / 3.6188 / 10.5131 |
| 2023-06 | 4.4851 / 2.8912 / 9.8848 | 3.8973 / 2.4807 / 8.4814 | 4.2367 / 2.8235 / 9.6533 | 4.5913 / 3.0199 / 10.3247 |
| 2023-07 | 4.3423 / 2.8022 / 9.8638 | 3.6288 / 2.3615 / 8.3125 | 3.8229 / 2.6315 / 9.2630 | 4.6475 / 3.1405 / 11.0545 |
| 2023-08 | 4.0035 / 2.5730 / 10.0455 | 3.7268 / 2.4104 / 9.4109 | 4.8280 / 3.3020 / 12.8918 | 5.1443 / 3.4783 / 13.5801 |
| 2023-09 | 3.4453 / 2.2114 / 7.1663 | 3.1122 / 2.0243 / 6.5599 | 6.7737 / 4.4505 / 14.4223 | 6.4818 / 4.3321 / 14.0388 |
| 2024-04 | 3.5722 / 2.1741 / 11.2625 | 3.5196 / 2.2200 / 11.5001 | 5.7042 / 3.7965 / 19.6668 | 5.5777 / 3.6967 / 19.1496 |
| 2024-05 | 4.9638 / 3.1574 / 9.7207 | 4.5764 / 2.7809 / 8.5617 | 5.2325 / 3.5183 / 10.8317 | 5.0154 / 3.4181 / 10.5233 |
| 2024-06 | 4.1457 / 2.5733 / 10.0731 | 3.7812 / 2.3378 / 9.1512 | 4.6228 / 3.0239 / 11.8367 | 4.7279 / 3.0874 / 12.0855 |
| 2024-07 | 4.5398 / 2.7464 / 10.6194 | 4.3067 / 2.5460 / 9.8448 | 6.1678 / 3.9366 / 15.2219 | 5.8850 / 3.9495 / 15.2718 |
| 2024-08 | 3.8082 / 2.4527 / 8.3424 | 3.4340 / 2.1947 / 7.4648 | 5.2756 / 3.5145 / 11.9539 | 5.7736 / 3.9082 / 13.2930 |
| F&W | | | | |
| 2023-01 | 1.4482 / 1.1512 / 65.4267 | 1.5671 / 1.2267 / 69.7188 | 1.1467 / 0.8738 / 49.6618 | 1.7574 / 1.3434 / 76.3508 |
| 2023-02 | 2.4027 / 1.3478 / 15.0857 | 2.3943 / 1.4760 / 16.5199 | 2.3086 / 1.5976 / 17.8814 | 2.7894 / 2.0555 / 23.0055 |
| 2023-03 | 3.5334 / 2.0916 / 10.7560 | 3.1646 / 1.8918 / 9.7283 | 3.5082 / 2.4037 / 12.3609 | 3.8249 / 2.6719 / 13.7398 |
| 2023-10 | 14.6823 / 8.5658 / 65.0988 | 16.5972 / 9.3639 / 71.1646 | 10.6315 / 6.4674 / 49.1510 | 9.9314 / 6.1672 / 46.8697 |
| 2023-11 | 8.0100 / 4.1350 / 86.9027 | 8.9692 / 4.4608 / 93.7511 | 5.8113 / 3.6476 / 76.6612 | 5.5910 / 3.7098 / 77.9664 |
| 2023-12 | 1.9130 / 0.7307 / 100.0000 | 1.9130 / 0.7307 / 100.0000 | 4.6434 / 3.3954 / 464.6711 | 5.8990 / 4.4411 / 607.7771 |
| 2024-01 | 2.7605 / 2.0812 / 100.7387 | 2.9707 / 2.1891 / 105.9653 | 2.9437 / 1.9604 / 94.8939 | 3.5564 / 2.4776 / 119.9293 |
| 2024-02 | 2.6717 / 1.5802 / 21.5573 | 2.4559 / 1.4407 / 19.6538 | 3.9645 / 2.2521 / 30.7230 | 4.0916 / 2.6434 / 36.0611 |
| 2024-03 | 3.9556 / 2.5313 / 12.7243 | 3.6999 / 2.3810 / 11.9686 | 4.7269 / 3.1850 / 16.0101 | 4.5084 / 3.1120 / 15.6431 |
| Averages | | | | |
| Ave of S&S | 4.2259 / 2.6696 / 9.7711 | 3.8028 / 2.3899 / 8.8104 | 5.1171 / 3.4293 / 12.7339 | 5.2073 / 3.5309 / 13.0770 |
| Ave of F&W | 4.5975 / 2.6905 / 53.1434 | 4.8591 / 2.7956 / 55.3856 | 4.4094 / 2.8648 / 90.2238 | 4.6611 / 3.1802 / 113.0381 |
| Entire Ave | 4.3931 / 2.6790 / 29.2886 | 4.2781 / 2.5725 / 29.7692 | 4.7987 / 3.1753 / 47.6044 | 4.9615 / 3.3731 / 58.0595 |
| Test Set | 4.5310 / 2.6331 / 9.7862 | 4.4300 / 2.5432 / 9.4501 | 5.5860 / 3.6263 / 13.4751 | 5.8334 / 3.7504 / 13.9360 |
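As a consistency check, the seasonal averages in Tables 3 and 4 can be reproduced directly from the monthly values. The sketch below recomputes the spring/summer nRMSE average for F-SCINet-Time2Vec from the eleven monthly values transcribed from Table 4.

```python
# Consistency check: reproduce the "Ave of S&S" nRMSE for F-SCINet-Time2Vec
# from the eleven monthly spring/summer values transcribed from Table 4.
from statistics import mean

fscinet_t2v_ss_nrmse = [4.1863, 4.9930, 4.4851, 4.3423, 4.0035, 3.4453,
                        3.5722, 4.9638, 4.1457, 4.5398, 3.8082]

print(round(mean(fscinet_t2v_ss_nrmse), 4))  # 4.2259, matching the reported average
```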
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mohammadi Lanbaran, N.; Naujokaitis, D.; Kairaitis, G.; Radziukynas, V. Hybrid Hourly Solar Energy Forecasting Using BiLSTM Networks with Attention Mechanism, General Type-2 Fuzzy Logic Approach: A Comparative Study of Seasonal Variability in Lithuania. Appl. Sci. 2025, 15, 9672. https://doi.org/10.3390/app15179672
