Forecasting Solar Energy Production Through Modeling of Photovoltaic System Data for Sustainable Energy Planning

Sapundzhi, Fatima; Georgiev, Slavi; Georgiev, Ivan; Todorov, Venelin

doi:10.3390/app16105053

Open AccessArticle

Forecasting Solar Energy Production Through Modeling of Photovoltaic System Data for Sustainable Energy Planning

¹

Department of Communication and Computer Engineering, Faculty of Engineering, South-West University “Neofit Rilski”, 66 Ivan Mihailov Str., 2700 Blagoevgrad, Bulgaria

²

Department of Applied Mathematics and Statistics, Faculty of Natural Sciences and Education, University of Ruse “Angel Kanchev”, 8 Studentska Str., 7004 Ruse, Bulgaria

³

Department of Information Modeling, Institute of Mathematics and Informatics, Bulgarian Academy of Sciences, 8 Acad. Georgi Bonchev Str., 1113 Sofia, Bulgaria

⁴

Department of Parallel Algorithms and Machine Learning with a Laboratory in Neurotechnologies, Institute of Information and Communication Technologies, Bulgarian Academy of Sciences, 1113 Sofia, Bulgaria

⁵

Centre of Excellence in Informatics and Information and Communication Technologies, Institute of Information and Communication Technologies, Bulgarian Academy of Sciences, 1113 Sofia, Bulgaria

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2026, 16(10), 5053; https://doi.org/10.3390/app16105053

Submission received: 31 March 2026 / Revised: 14 May 2026 / Accepted: 15 May 2026 / Published: 19 May 2026

(This article belongs to the Special Issue Selected Papers from the 6th International Electronic Conference on Applied Sciences)

Download

Browse Figures

Versions Notes

Featured Application

This study provides a practical neural forecasting framework for monthly solar energy production in distributed photovoltaic systems. By combining data from several related PV installations in a global pooled model and comparing feed-forward and recurrent neural architectures, the proposed approach supports energy management, performance monitoring, uncertainty-aware planning, and decision-making for sustainable integration of rooftop solar power.

Abstract

This paper investigates solar energy production forecasting at a monthly temporal resolution using a pooled neural network framework applied to the Chikalov photovoltaic systems in southwestern Bulgaria. The study considers several related PV installations with unequal time-series lengths and formulates the forecasting task as one-step-ahead prediction of the next monthly total energy yield, measured in kWh, in a global pooled setting. Two complementary neural architectures are compared: a multilayer perceptron (MLP), which serves as a nonlinear feed-forward benchmark based on lagged observations and seasonal descriptors, and a gated recurrent unit (GRU), which explicitly models sequential temporal dependence. In both cases, seasonality is represented through cyclical calendar encodings, while model selection is performed by chronological hyperparameter search using a separate validation block. Forecast accuracy is assessed by RMSE, MAE, coefficient of determination (R²), MAPE, and sMAPE, and uncertainty is quantified through validation residual prediction intervals. The results show that the MLP achieves stronger validation performance, whereas the GRU provides better final out-of-sample generalization after refitting on the combined training and validation data. For both architectures, the best configurations are obtained with a 12-month input horizon, indicating that one full annual cycle contains the most informative memory for forecasting monthly aggregated photovoltaic energy yield in the considered dataset. After refitting on the combined training and validation data, the GRU achieved the best final out-of-sample performance, with RMSE = 296.38 kWh, MAE = 213.16 kWh, R² = 0.9231, MAPE = 7.52%, and sMAPE = 7.49%. Overall, the findings demonstrate that pooled neural modeling is an effective framework for monthly PV production forecasting and can provide practically useful support for sustainable energy planning, monitoring, and optimization.

Keywords:

photovoltaic energy forecasting; monthly solar energy yield; pooled neural networks; multilayer perceptron; gated recurrent unit; distributed photovoltaic systems; uncertainty quantification; sustainable energy planning

1. Introduction

Solar energy has become one of the central components of the transition toward sustainable and low-carbon energy systems. Among renewable technologies, photovoltaic (PV) generation has expanded particularly rapidly due to declining equipment costs, modular deployment, and the increasing economic attractiveness of distributed and rooftop installations. At the same time, the growing penetration of solar power in modern electricity systems has intensified the need for accurate forecasting tools that can support operational management, grid integration, and longer-term sustainable energy planning [1,2,3,4]. Solar PV recorded the largest absolute increase among renewable technologies in 2023, and official IEA analyses continue to identify PV as a leading driver of renewable electricity growth in the coming years.

Recent progress in photovoltaic forecasting reflects the broader role of numerical, statistical, and machine learning models in modern energy systems. In renewable-energy applications, accurate PV forecasts are required because solar generation is inherently variable and weather-dependent, while higher PV penetration increases the need for reliable planning, dispatch, self-consumption optimization, and grid-balancing tools [1,5,6,7,8]. At the same time, data-driven numerical models are increasingly used across the wider energy sector, including fossil fuel and reservoir engineering applications. Recent examples include pressure-transient testing for evaluating hydraulic fracturing effectiveness, analytic hierarchy and reliability analysis models for shale-drilling drag reduction, and machine learning frameworks for enhanced oil recovery screening [7,8,9]. These studies illustrate that model-based decision support is becoming important across both clean and conventional energy systems. In the present work, however, the focus is specifically on solar PV forecasting, where improved predictions directly support sustainable energy planning and the integration of distributed renewable generation.

Forecasting photovoltaic production remains a challenging task because PV output depends on seasonality, weather variability, local operating conditions, and temporal persistence in the historical generation series. These features often induce nonlinear behavior and changing dependence structures, which may limit the performance of purely linear forecasting models, especially when the available datasets are relatively short or heterogeneous. For this reason, recent literature has increasingly emphasized machine learning and deep learning methods as flexible alternatives capable of capturing more complex temporal patterns and improving predictive accuracy [2,3,4,5,10,11]. Recent review studies have also stressed the importance of selecting models according to forecasting horizon, data characteristics, and application context, particularly when renewable-energy forecasting is intended to support planning and control decisions.

Recent studies further confirm the rapid development of photovoltaic forecasting from isolated single-site models toward broader multi-site, pooled, and uncertainty-aware frameworks. Recent review papers have emphasized that the field is moving toward more systematic comparisons of machine learning and deep learning methods, with growing attention to forecasting horizon, model robustness, and transferability across sites and datasets [6,12,13,14]. At the same time, recent methodological contributions have shown that multi-site or global formulations can be beneficial when historical records are limited or uneven across installations, while recent probabilistic studies have highlighted the importance of reliable uncertainty quantification for operational use in energy management and grid integration [15,16,17,18]. Solar energy has become one of the central components of the transition toward sustainable and low-carbon energy systems, and accurate PV forecasting is increasingly required for grid integration, dispatch, self-consumption optimization, and planning [1,5,6,19]. These developments support the motivation of the present study, which combines a pooled multi-system setting with a comparative evaluation of feed-forward and recurrent neural architectures for monthly photovoltaic forecasting.

Contemporary PV forecasting literature can be grouped into statistical models, classical machine learning methods, deep neural architectures, hybrid models, and probabilistic or uncertainty-aware methods [5,6]. Recent reviews emphasize that deep learning has become increasingly important for PV time-series forecasting, but they also note that model performance depends strongly on forecast horizon, available input variables, preprocessing, benchmark design, and hyperparameter selection [5,6]. Commonly compared architectures include multilayer perceptrons (MLPs), recurrent neural networks, LSTM and GRU models, convolutional neural networks, graph neural networks, and Transformer-based models [5,6,20,21]. These studies usually evaluate forecasting quality by error and goodness-of-fit indicators such as MAE, RMSE, MAPE, sMAPE, and R², while recent comparative works also consider validation/test separation, residual diagnostics, and computational efficiency [6,19,20]. For example, recent solar power studies have compared several deep architectures under common evaluation protocols and have reported RMSE, MAE, MAPE, and R² as standard regression metrics; other work has emphasized that GRU-type recurrent models may provide competitive accuracy with reduced training time, which is relevant for operational forecasting environments [19,20]. Recent reviews also stress that there is still no universally accepted benchmark for PV forecasting, making transparent reporting of datasets, horizons, metrics, and validation protocols particularly important [6].

Several recent works have moved beyond isolated single-model forecasting toward multi-architecture, multi-site, or data-scarce PV forecasting frameworks. Kim et al. developed Transformer and recurrent-network variants for multi-step day-ahead PV forecasting using power, weather, and solar geometry inputs from two PV plants [21]. Jang et al. proposed a common deep learning model applicable to multiple solar generation sites and explicitly addressed the use of shared information across locations [22]. Depoortere et al. introduced SolNet, an open-source deep learning framework for PV forecasting across many sites, emphasizing that high-quality long observational histories are often unavailable in practice and that transfer or pooled learning can be valuable in data-scarce settings [23]. These studies support the motivation for the present pooled formulation. However, most recent multi-site studies rely on high-frequency data, meteorological variables, or large collections of PV systems. In contrast, the present work addresses a more restricted but practically common setting: monthly yield forecasting for several related rooftop PV installations with unequal record lengths and without rich exogenous meteorological predictors. This motivates the comparison of a pooled MLP and a pooled GRU using only lagged production values, cyclical calendar encodings, and plant-specific embeddings.

In addition to point forecasting, recent PV forecasting literature increasingly recognizes the importance of uncertainty quantification. Prediction intervals are useful because energy planning decisions depend not only on the central forecast but also on the range of plausible production outcomes [24,25]. Recent conformal-prediction studies for PV power forecasting use calibration or validation residuals to transform point forecasts into prediction intervals with improved reliability, and they show that such uncertainty-aware forecasts can support electricity-market and operational decision-making [24,25]. Although the present study does not implement a full conformal-prediction framework, its validation residual prediction intervals follow the same practical rationale: residuals from a chronologically subsequent validation block are used to estimate empirical uncertainty bounds around the final point forecasts.

Our earlier study [26] addressed monthly photovoltaic energy yield forecasting for the Chikalov PV installation using ARIMA-type models. That work showed that even compact monthly datasets can provide practically useful forecasts for both total yield and specific yield, thereby demonstrating the relevance of time-series methods for PV performance analysis. The Chikalov installations are rooftop photovoltaic systems in southwestern Bulgaria, monitored through Sunny Portal, and highlighted the practical role of forecasting for energy management, renewable resource optimization, and grid-related planning [26]. However, the underlying ARIMA framework is inherently linear and treats the forecasting problem from the perspective of a single installation.

In the present study, the forecasting setting is extended from a single-series formulation to a pooled multi-system framework based on the Chikalov family of photovoltaic datasets. The attached data include monthly production records for Chikalov 1, Chikalov 3, Chikalov 4, Chikalov 5, and Chikalov 6, associated with Simitli, Cherniche, and Poleto, and the series lengths are unequal across plants. This structure naturally motivates a global pooled model, since a shared learning framework can simultaneously exploit temporal dependence within each PV system and structural similarities across systems, thereby increasing the effective information available for model estimation.

Against this background, the present paper investigates forecasting solar energy production through a comparative pooled neural network framework using the Chikalov PV data. Two complementary architectures are considered. The first is a multilayer perceptron (MLP), which serves as a nonlinear feed-forward benchmark based on lagged observations and seasonal descriptors. The second is a gated recurrent unit (GRU), which is specifically designed for sequence modeling and can retain informative temporal structure through recurrent hidden-state updates [27,28,29,30]. This comparison is methodologically meaningful for monthly PV forecasting. On the one hand, an MLP may already capture substantial seasonal and nonlinear structure when appropriate lagged inputs are supplied. On the other hand, a GRU may provide additional gains by explicitly modeling chronological dependence and persistence effects in the monthly production sequence [5,10,27,28,29,30].

The practical importance of this work lies in its focus on small and medium-sized distributed photovoltaic systems, for which long, homogeneous, and meteorologically enriched datasets are often unavailable. In such cases, accurate forecasting must rely mainly on historical production records, seasonal information, and robust modeling strategies capable of extracting shared structure across related installations. By developing a pooled neural forecasting framework for several Chikalov PV systems, the present study provides a data-driven tool that can support monthly production planning, performance monitoring, maintenance scheduling, and uncertainty-aware decision-making in distributed solar energy management.

The main objective of this study is therefore to assess whether pooled neural modeling can provide an effective and practically relevant framework for forecasting monthly photovoltaic production in support of sustainable energy planning. More specifically, the paper compares MLP and GRU models trained on the same pooled Chikalov dataset and evaluates their predictive behavior through standard forecasting criteria. In this way, the study contributes in three directions: first, by extending the Chikalov forecasting setting from a single-system linear approach to a pooled multi-system learning framework. Second, by providing a direct comparison between feed-forward and recurrent neural architectures for monthly PV production forecasting. Third, by emphasizing the role of accurate PV forecasts in planning, monitoring, and optimizing distributed solar energy systems [1,2,3,4,5,10,11,26,27,28,29,30].

The remainder of the paper is structured as follows: Section 2 presents the Chikalov photovoltaic systems and dataset, and then describes the pooled MLP and GRU forecasting architectures together with the experimental design, hyperparameter tuning procedure, evaluation criteria, validation residual prediction intervals, computational workflow and replication details. Section 3 reports the numerical results of the comparative study. Section 4 discusses the main findings, with emphasis on the relative behavior of the feed-forward and recurrent models, the role of the 12-month input horizon, and the implications of pooled learning for short and unbalanced monthly PV series. Finally, Section 5 concludes the paper and outlines directions for further research.

2. Data and Methods

2.1. Chikalov PV Systems and Dataset

The analysis is based on monthly energy production records from the Chikalov family of photovoltaic systems. They are located in southwestern Bulgaria, where a 30 kW rooftop photovoltaic system is mounted on a residential building and monitored through the Sunny Portal platform [26]. The systems employ three-phase Sunny Tripower 5000TL inverters by SMA Solar Technology AG, Niestetal, Germany and BYD P6-30 Series-3BB photovoltaic modules manufactured by BYD Company Ltd., Shanghai, China.

Since the installations are located in southwestern Bulgaria, in the northern hemisphere and in a region influenced by both temperate continental and Mediterranean climatic conditions with a pronounced annual solar radiation cycle, the monthly energy yield is expected to exhibit clear seasonality, with higher production during late spring and summer and lower production during winter.

The data employed here contain monthly production records for five systems labeled Chikalov 1, Chikalov 3, Chikalov 4, Chikalov 5, and Chikalov 6. According to the dataset in Table 1, Table 2, Table 3, Table 4 and Table 5, these systems are associated with the locations Simitli, Cherniche, and Poleto, and the records are given as monthly energy values. The available histories are unbalanced: Chikalov 1 has the longest record, beginning in 2012 (Table 1), whereas the remaining systems begin later, mainly between 2020 and 2021, and all series continue into 2024 with partial final-year observations (Table 2, Table 3, Table 4 and Table 5). The dataset, therefore, has the structure of a short monthly panel rather than a collection of equally long individual time series.

Let

Y_{i, t}

denote the monthly total energy yield of plant

i

in month

t

. The forecasting target is the one-step-ahead value

Y_{i, t}

, predicted from historical observations of the same plant together with shared information learned across all plants. Because the monthly frequency implies a strong annual cycle, seasonality is represented through cyclical encodings of the calendar month, namely

s_{t} = s i n (2 π m_{t} / 12) and c_{t} = c o s (2 π m_{t} / 12),

where

m_{t}

is the month index. This representation preserves the circular nature of the calendar and avoids the artificial discontinuity between December and January. The monthly yield values are standardized using parameters estimated from the training data only, and the observations are divided chronologically into training, validation, and test subsets in order to preserve the forecasting interpretation of the experiment. The pooled formulation is adopted because it allows the models to exploit both temporal dependence within each plant and structural similarity across plants, which is particularly important when several series are short.

Table 1, Table 2, Table 3, Table 4 and Table 5 report the complete raw monthly total energy yield values, measured in kWh, for the five considered 30 kW Chikalov photovoltaic systems. The long-format panel used for model construction is obtained directly by stacking the monthly records by plant and calendar month. In each table, rows correspond to calendar years, and columns correspond to months. Blank cells at the beginning or end of a table indicate that the corresponding month lies outside the available observation period for the respective system, while explicitly marked “n. a.” entries denote missing records within the covered observation period. The raw tables, therefore, document not only the magnitude of the monthly yields but also the unequal availability of data across the five systems. Table 1, Table 2, Table 3, Table 4 and Table 5 show that Chikalov 1 has the longest history, whereas Chikalov 3, Chikalov 4, Chikalov 5, and Chikalov 6 have shorter records starting later in the observation window.

The descriptive statistics in Table 6 summarize this structure. The pooled dataset contains 296 observed monthly yield values over 300 potential months within the covered plant-specific observation periods. Only four values are missing, all belonging to Chikalov 1, and they correspond to the unrecorded values in November and December 2015 and 2016. The mean monthly yield ranges from 3061.89 kWh for Chikalov 6 to 3550.19 kWh for Chikalov 1, while the standard deviations are relatively large, ranging from 1331.07 kWh to 1432.57 kWh. This variability is expected for monthly photovoltaic production because the installations are located in a region with a pronounced annual solar radiation cycle and therefore produce substantially higher yields in late spring and summer than in winter. The minimum values, ranging from 590.00 kWh to 989.00 kWh, occur in low-production months, whereas the maximum values, ranging from 5208.50 kWh to 6273.00 kWh, correspond to high-production months. Overall, the table confirms that the datasets are comparable in scale but strongly unbalanced in length, which motivates the use of a pooled forecasting framework capable of borrowing information across related PV systems.

2.2. Multilayer Perceptron (MLP) Architecture

The first forecasting model is a pooled multilayer perceptron (MLP), which serves as a nonlinear feed-forward benchmark. In contrast to classical linear autoregressive models, the MLP can learn nonlinear interactions among lagged production values and seasonal descriptors. The MLP is used as a nonlinear feed-forward benchmark because MLP-type models are widely used in PV forecasting and are recognized as capable of learning nonlinear relationships between historical production values, meteorological or calendar descriptors, and future output [5,20]. In the present formulation, the input vector for plant

i

and month

t

is constructed from the previous

L

monthly yields together with the seasonal encodings and a plant-specific identifier. After standardization, the lag window is flattened into a single feature vector of the form

x_{i, t} = [{\tilde{Y}}_{i, t - 1}, {\tilde{Y}}_{i, t - 2}, \dots, {\tilde{Y}}_{i, t - L}, s_{t}, c_{t}, e_{i}],

where

{\tilde{Y}}_{i, t - k}

denotes the standardized yield, and

e_{i}

denotes the numerical representation of plant

i

. In the present implementation,

e_{i}

is not a one-hot vector but a learned plant embedding. Each photovoltaic system is first assigned an integer identifier, and this identifier is mapped by an embedding layer to a four-dimensional trainable vector. The embedding dimension was fixed at 4 and was not included in the hyperparameter search. For the MLP, this learned embedding is concatenated to the flattened lagged input vector, so that the network receives both the temporal production history and a compact trainable representation of the plant identity. The embedding parameters are estimated jointly with the remaining MLP weights during training by backpropagation. Feed-forward neural networks trained by backpropagation form one of the standard foundations of modern nonlinear prediction, and recent PV studies confirm that MLP-type models remain competitive baselines for solar power forecasting when suitable lagged and seasonal features are provided [27,28].

The proposed pooled MLP consists of an input layer followed by two fully connected hidden layers with nonlinear activation functions and a final scalar output layer. Denoting the input vector by

x_{i, t}

, the hidden transformations can be written schematically as

h^{(1)} = ϕ (W_{1} x_{i, t} + b_{1}), h^{(2)} = ϕ (W_{2} h^{(1)} + b_{2}),

and the one-step-ahead forecast is then obtained as

{\hat{Y}}_{i, t} = W_{3} h^{(2)} + b_{3} .

Here,

ϕ (\cdot)

is chosen as a rectified linear unit (ReLU), since it provides a simple and effective nonlinear transformation while keeping the network computationally light. Dropout regularization is included to reduce overfitting, which is particularly relevant for short PV datasets and neural models trained with limited observations [22,23,24]. Dropout regularization may be inserted after the hidden layers to reduce overfitting, which is a relevant consideration for short monthly datasets. The principal role of the MLP in the present comparison is to provide a strong nonlinear baseline that uses the same pooled data structure as the GRU but does not explicitly model sequential recurrence. Its predictive ability therefore depends on the informativeness of the lagged feature vector rather than on an internal hidden state updated over time.

In recent PV forecasting comparisons, feed-forward neural networks are commonly retained as baseline models against recurrent, convolutional, and Transformer-based architectures because they provide a simple reference for evaluating the additional value of explicit sequence modeling [5,19,20]. This architecture is well suited as a benchmark for three reasons. First, it is simple, transparent, and easy to optimize on relatively small datasets. Second, it allows a direct assessment of how much predictive information is already contained in a fixed lagged representation of monthly PV yield. Third, because the same lag horizon, seasonal variables, and plant identifiers can be used in both the MLP and GRU settings, the comparison between the two models becomes methodologically cleaner. In other words, differences in performance can be attributed primarily to the architectural treatment of temporal dependence rather than to differences in the underlying information set.

2.3. Gated Recurrent Unit (GRU) Architecture

The second forecasting model is a pooled gated recurrent unit (GRU), which extends the feed-forward baseline by explicitly modeling sequential dependence. GRU networks were introduced as a recurrent neural architecture capable of learning temporal structure through gating mechanisms that regulate the flow of past information. In contrast to a standard recurrent neural network, the GRU uses an update gate and a reset gate to control how much of the previous hidden state is retained and how strongly past information contributes to the candidate state. This design improves the ability of the model to capture medium- and long-range dependencies while remaining more compact than more elaborate recurrent alternatives. For monthly photovoltaic data, such a mechanism is attractive because the target variable exhibits persistence, seasonality, and possible nonlinear carry-over effects from one month to the next [29].

In the present pooled implementation, the input for month

t

is not a flattened lag vector but an ordered sequence of the previous

L

monthly observations. Each time step contains three numerical quantities: the standardized yield, the sine seasonal encoding, and the cosine seasonal encoding. As in the MLP model, plant identity is represented by a learned embedding rather than by one-hot encoding. Each plant is assigned an integer identifier, which is mapped to a four-dimensional trainable embedding vector. The embedding dimension was fixed at 4 and was not tuned. For the GRU, this plant embedding is repeated across the

L

time steps of the input window and concatenated to the time-step features

[{\tilde{Y}}_{i, τ}, s_{τ}, c_{τ}]

. Thus, at each time step, the GRU receives the standardized yield, the cyclical seasonal descriptors, and the same plant-specific latent vector. This allows the recurrent model to distinguish among the PV systems while still learning shared temporal dynamics from the pooled dataset. Thus, the effective input at time step

τ

can be written as

x_{i, τ} = [{\tilde{Y}}_{i, τ}, s_{τ}, c_{τ}, e_{i}] .

For a given target month

t

, the GRU processes the sequence

(x_{i, t - L}, x_{i, t - L + 1}, \dots, x_{i, t - 1}),

and produces a hidden state summarizing the relevant temporal information in the lag window. The final hidden representation is then passed to a dense output layer that returns the one-step-ahead forecast

{\hat{Y}}_{i, t}

.

At the cell level, the recurrent transitions are governed by the update gate

z_{t}

, the reset gate

r_{t}

, the candidate state

{\tilde{h}}_{t}

, and the hidden state

h_{t}

. In standard notation,

\begin{matrix} z_{t} = σ (W_{z} x_{t} + U_{z} h_{t - 1} + b_{z}), \\ r_{t} = σ (W_{r} x_{t} + U_{r} h_{t - 1} + b_{r}), \\ {\tilde{h}}_{t} = \tanh (W_{h} x_{t} + U_{h} (r_{t} ⊙ h_{t - 1}) + b_{h}), \\ h_{t} = (1 - z_{t}) ⊙ {\tilde{h}}_{t} + z_{t} ⊙ h_{t - 1} . \end{matrix}

The reset gate determines how strongly the previous hidden state contributes to the candidate representation, whereas the update gate controls the balance between newly computed information and previously stored memory. This allows the GRU to preserve useful temporal structure while suppressing irrelevant or outdated components of the sequence. In the monthly PV context, such behavior is especially relevant because the model must extract information from seasonal repetition, recent production persistence, and cross-plant regularities without becoming excessively parameterized.

The pooled GRU is expected to offer several advantages in the present application. First, recurrent architectures process the lagged observations as an ordered sequence and can therefore preserve temporal dependence more naturally than a feed-forward model based on a flattened static feature vector [5,19,20,21]. Second, because monthly PV production is strongly influenced by seasonal solar radiation patterns, a one-year or near-one-year sequence can provide the model with a complete annual production cycle; recent PV studies have similarly emphasized the importance of seasonal decomposition, temporal context, and seasonal patterns in improving forecast accuracy [23,31]. Third, the pooled formulation is consistent with recent multi-site and data-scarce PV forecasting studies, where shared models are used to exploit common structure across locations or systems while retaining site-specific information [22,23]. For these reasons, the GRU constitutes the main sequence model in the comparative study, while the MLP provides the reference nonlinear feed-forward alternative.

2.4. Experimental Design and Evaluation Metrics

To ensure a fair comparison between the multilayer perceptron (MLP) and the gated recurrent unit (GRU), both models are trained and evaluated under the same pooled-data setting. The dataset is organized as a monthly panel of Chikalov photovoltaic systems, and all observations are sorted chronologically within each plant. The forecasting task is one-step-ahead prediction of monthly total energy yield. Thus, for a target month

t

, the models use only information available up to month

t - 1

, which preserves the genuine forecasting interpretation of the experiment. The same training, validation, and test partition is applied to both architectures, with earlier observations used for model fitting, a subsequent block reserved for model selection, and the most recent block retained for final out-of-sample evaluation. This chronological design avoids information leakage and is more appropriate than random shuffling for time-series forecasting problems [26].

All numerical experiments were conducted in Python 3.9.7 using standard scientific computing and machine learning packages. The main libraries used in the implementation were NumPy 1.20.3 and Pandas 1.3.4 for data handling, scikit-learn 1.0.1. for preprocessing and evaluation metrics, PyTorch 1.10.0 for neural network implementation and training, Matplotlib 3.4.3 for visualization, and openpyxl 3.0.9 for exporting numerical results. The computations were performed on a workstation with an Intel^® Core^™ i9-12900 processor at 2.4 GHz, 128 GB RAM operating at 4000 MT/s, and Intel^® UHD Graphics 770. The same software environment, chronological data split, and evaluation protocol were used for both the MLP and GRU models.

In both models, the input information is built from lagged monthly yields and seasonal variables. The month of the year is represented through sine and cosine encodings in order to capture the annual cycle while respecting its circular structure. The yield values are standardized using statistics computed from the training subset only, and the same transformation is then applied to the validation and test subsets. The pooled formulation is identical for both architectures: all Chikalov systems are used jointly during training so that the models can learn not only the temporal dynamics of each installation but also the common structure shared across the PV systems. This is particularly important because the available monthly series are unbalanced in length, and some plants contain substantially fewer observations than others.

The MLP and GRU differ only in the way they process the same predictive information. For the MLP, the previous

L

monthly observations are flattened into a fixed feature vector, together with the seasonal encodings and plant-specific identifier. For the GRU, the same lagged observations are preserved in their natural chronological order and fed as a sequence to the recurrent layer. This design makes the comparison methodologically transparent: both models use the same forecasting target, the same pooled data, the same seasonal information, and the same chronological validation protocol while differing only in their internal architectural treatment of temporal dependence. The MLP therefore serves as a nonlinear feed-forward benchmark, whereas the GRU serves as the recurrent sequence model [27,28,29]. The plant embedding dimension was fixed at 4 in both architectures. The hyperparameter search therefore covered only the lag or sequence length, hidden dimension, dropout rate, and learning rate; the plant-embedding dimension was kept constant in order to maintain a symmetric comparison between the MLP and GRU and to limit the search space for the short monthly panel dataset.

Hyperparameter selection is carried out separately for the two architectures by chronological grid search. For the GRU, the tuning parameters include the input sequence length, hidden dimension, dropout rate, and learning rate. Following the exploratory setup already developed for the pooled recurrent model, the tested sequence lengths are

L \in {12, 18, 24}

, the hidden dimensions are chosen from

{16, 32, 48}

, dropout is varied in

{0, 0.1, 0.2}

, and the learning rate is selected from

{10^{- 3}, 5 \times 10^{- 4}}

. The GRU is trained with the Adam optimizer and a smooth

L_{1}

loss, while early stopping is applied on the validation subset. The final GRU specification is selected by minimizing validation RMSE and is then refitted on the combined training and validation data before final testing. This recurrent setup is appropriate for monthly PV forecasting because it allows the model to process one full annual cycle and to retain informative hidden-state dynamics over the lag window.

A parallel hyperparameter-tuning strategy is adopted for the MLP in order to maintain symmetry between the two models. The principal tuning parameters for the MLP are the lag length

L

, the number and width of hidden layers, the dropout rate, and the learning rate. A practical and balanced search space for the present dataset is to use

L \in {12, 18, 24}

, one or two hidden layers, hidden dimensions in

{32, 64, 128}

, dropout in

{0, 0.1, 0.2}

, and learning rate in

{10^{- 3}, 5 \times 10^{- 4}}

. As with the GRU, the MLP is trained with Adam and a smooth

L_{1}

loss, and early stopping is controlled by the validation subset. The final MLP is likewise selected according to validation RMSE and subsequently refitted on the combined training and validation data. This design ensures that the comparison between MLP and GRU is not biased by unequal optimization effort.

The use of validation-based model selection is especially important in the present study because the dataset is short and the number of candidate architectures is nontrivial. The validation subset is therefore used exclusively for hyperparameter tuning, early stopping, and model calibration, while the test subset remains untouched until the final evaluation stage. This separation is essential for preserving the objectivity of the reported test results. In practical terms, the validation phase determines which configuration is retained, whereas the test phase quantifies the true out-of-sample performance of the selected model. The final refitting step, in which the selected architecture is retrained on the union of the training and validation observations, is justified by the limited size of the dataset and by the need to exploit as much historical information as possible before generating the final test forecasts.

Forecast quality is assessed by several complementary evaluation criteria. The first is the root mean square error (RMSE), which emphasizes larger forecast errors and therefore provides a sensitive measure of overall predictive accuracy. The second is the mean absolute error (MAE), which offers a more direct average measure of absolute deviation. The coefficient of determination R² is also reported to quantify the proportion of variance explained by the model. In addition, two percentage-based criteria are used: the mean absolute percentage error (MAPE) and the symmetric mean absolute percentage error (sMAPE). MAPE is widely used in forecasting applications because of its interpretability in relative terms, while sMAPE is included because it is typically more stable when the target variable assumes smaller values. These metrics are standard in recent PV forecasting studies, where RMSE and MAE are typically used to quantify absolute forecast error, MAPE and sMAPE to assess relative error, and R² to measure explained variance [16,19,20]. Taken together, these five indicators provide a balanced assessment of absolute, relative, and variance-based predictive performance, and they allow a detailed comparison between the MLP and the GRU.

Formally, if

y_{t}

denotes the observed monthly yield and

{\hat{y}}_{t}

its forecast, then RMSE is defined as the square root of the mean squared prediction error, MAE as the mean absolute prediction error, and MAPE as the average of the absolute percentage deviations. The sMAPE criterion replaces the conventional denominator by the average magnitude of observed and predicted values, thereby reducing sensitivity to scale effects. In the present study, all metrics are computed on the original scale of the monthly yield after inverting the standardization transform, so that the reported results remain directly interpretable in physical units and in percentage terms.

Besides point forecasting, the study also considers uncertainty quantification through validation residual prediction intervals. This step is implemented separately for each model after hyperparameter selection. Let

e_{t} = y_{t} - {\hat{y}}_{t}

denote the residuals on the validation subset of the selected model. The empirical distribution of these residuals is used to estimate lower and upper quantiles, denoted by

q_{α / 2}

and

q_{1 - α / 2}

, for a given confidence level

1 - α

. Then, for a point forecast

{\hat{y}}_{t}

, the corresponding prediction interval is constructed as

[{\hat{y}}_{t} + q_{α / 2}, {\hat{y}}_{t} + q_{1 - α / 2}] .

For instance, with

α = 0.05

, the interval provides an empirical 95% uncertainty band around the forecast. This residual-based approach is especially suitable in the present setting because it does not impose a strong parametric assumption on forecast errors and remains easy to interpret and implement for both neural architectures. Residual-based interval calibration is related to recent uncertainty-aware and conformal-prediction approaches in PV forecasting, where residuals or nonconformity scores computed on a calibration set are used to construct prediction intervals around point forecasts [24,25].

The use of validation residuals for interval calibration has two methodological advantages. First, it aligns naturally with the chronological model-selection framework because the interval width is estimated from a data block that is temporally subsequent to training but still prior to the final test period. Second, it allows direct comparison between MLP and GRU not only in terms of point accuracy but also in terms of practical uncertainty quantification. In the final experimental comparison, each model therefore produces two outputs: a point forecast and an accompanying empirical prediction interval. This is valuable from an applied perspective since sustainable energy planning requires not only accurate central predictions but also a realistic representation of forecast uncertainty. The use of a validation block for interval calibration is consistent with recent probabilistic PV forecasting studies, where calibration data are kept separate from the final test period in order to estimate uncertainty bounds without contaminating the out-of-sample evaluation [24,25].

Finally, the evaluation procedure is supplemented by residual diagnostics. After the best hyperparameter configuration is selected, the residual autocorrelation and partial autocorrelation functions may be examined in order to detect any remaining temporal structure not captured by the model. Such diagnostics are especially useful in monthly photovoltaic forecasting, where unmodeled seasonality or persistence may remain visible even when the overall error indicators are satisfactory. In this way, the experimental design combines three complementary layers of assessment: validation-based hyperparameter tuning, test-based out-of-sample accuracy evaluation, and residual-based diagnostic and uncertainty analysis. Together, these elements provide a coherent framework for comparing the MLP and GRU models on equal methodological grounds.

2.5. Computational Workflow and Replication Details

For reproducibility, the complete forecasting workflow was organized into five consecutive stages: data preprocessing, supervised sample construction, hyperparameter tuning, final model refitting, and out-of-sample evaluation. The raw monthly production records were first transformed into a long-format panel dataset containing the plant identifier, calendar month, and monthly total energy yield. Missing observations were kept as missing values and were not interpreted as zero production. For each plant, the monthly records were ordered chronologically, and supervised samples were constructed only when the full lag window and the corresponding target value were available.

Let

Y_{i, t}

denote the monthly total energy yield of plant

i

in month

t

. For a selected lag length

L

, the forecasting problem is formulated as a one-step-ahead prediction:

\hat{Y_{i, t}} = f_{θ} (X_{i, t}^{(L)}),

where

f_{θ}

denotes either the MLP or GRU model with trainable parameters

θ

, and

X_{i, t}^{(L)}

containing the information available from months

t - L, \dots, t - 1

. The target value

Y_{i, t}

is not included among the predictors.

The monthly yield values were standardized using the mean and standard deviation estimated from the training subset only:

\tilde{Y_{i, t}} = \frac{Y_{i, t} - μ_{train}}{σ_{train}},

where

μ_{train}

and

σ_{train}

are the training-set mean and standard deviation, respectively. The same scaling parameters were then applied to the validation and test subsets. Forecasts were transformed back to the original kWh scale before computing all reported evaluation metrics.

For the MLP, the lagged observations were flattened into a fixed input vector containing the standardized yields and seasonal encodings from the previous

L

months, together with a plant-specific embedding. For the GRU, the same information was retained as an ordered sequence so that the recurrent layer could process the chronological structure of the monthly observations. In both cases, the models were trained by minimizing the smooth

L_{1}

loss on the training subset:

L (θ) = \frac{1}{N_{train}} \sum_{(i, t) \in D_{t r a i n}} SmoothL 1 (\hat{\tilde{Y_{i, t}}}, \tilde{Y_{i, t}}),

where

D_{t r a i n}

denotes the training subset,

N_{t r a i n}

is the number of training samples,

\tilde{Y_{i, t}}

is the standardized observed yield, and

\hat{\tilde{Y_{i, t}}}

is the corresponding standardized prediction. The Adam optimizer was used for parameter estimation, and early stopping was controlled by the validation error in order to reduce overfitting.

The hyperparameter search was performed chronologically and independently for the MLP and GRU architectures. For the MLP, the tuned parameters were the lag length, hidden-layer width, dropout rate, and learning rate. For the GRU, the tuned parameters were the input sequence length, hidden-state dimension, dropout rate, and learning rate. Each candidate configuration was trained on the training subset and evaluated on the validation subset. The configuration with the lowest validation RMSE was selected:

λ * = \underset{λ \in Λ}{argmin} RMS E_{val} (λ),

where

λ

denotes a candidate hyperparameter configuration and

Λ

is the corresponding hyperparameter grid.

After model selection, the selected architecture was refitted on the combined training and validation data and evaluated once on the held-out test subset. This protocol ensured that the test data were not used for hyperparameter tuning or early stopping. The final test evaluation therefore provides an out-of-sample assessment of the selected model.

Prediction intervals were obtained by validation residual calibration. For the selected model, validation residuals were computed as

e_{i, t} = Y_{i, t} - \hat{Y_{i, t}}, (i, t) \in D_{v a l},

where

D_{v a l}

denotes the validation subset.

The empirical lower and upper residual quantiles,

q_{α / 2}

and

q_{1 - α / 2}

, were then added to each point forecast to obtain the

(1 - α)

-level prediction interval:

{P I}_{i, t}^{(1 - α)} = [\hat{Y_{i, t}} + q_{α / 2}, \hat{Y_{i, t}} + q_{1 - α / 2}] .

In this study,

α = 0.05

, corresponding to empirical 95% prediction intervals. This procedure was applied separately to the MLP and GRU models, allowing direct comparison of both point forecasts and uncertainty bounds.

Pseudocodes for data preprocessing and supervised sample construction, MLP and GRU model selection and forecasting, and validation residual prediction intervals are given in Algorithm A1–A4 (Appendix A).

3. Results

The hyperparameter search produced clear patterns for both neural architectures. For the MLP, the best validation model was achieved for a lag length of 12 months, hidden size 128, dropout 0.2, learning rate 0.001, and 12 training epochs. Its validation performance was RMSE = 421.60, MAE = 290.17, R² = 0.8924, MAPE = 11.96%, and sMAPE = 10.59% (Figure A1, Figure A2, Figure A3, Figure A4 and Figure A5, Appendix B). For the GRU, the best validation model was obtained for a sequence length of 12 months, hidden size 48, dropout 0, learning rate 0.0005, and 31 training epochs. Its validation performance was RMSE = 457.53, MAE = 308.96, R² = 0.8733, MAPE = 12.73%, and sMAPE = 11.24% (Figure A6, Figure A7, Figure A8, Figure A9 and Figure A10, Appendix B).

Thus, on the validation subset, the MLP outperformed the GRU across all reported criteria. In particular, the MLP reduced RMSE by approximately 35.9 units and MAE by about 18.8 units relative to the GRU, while also improving the coefficient of determination and both percentage-based error measures. These results indicate that, at the model-selection stage, a feed-forward nonlinear architecture based on lagged inputs and seasonal descriptors was able to fit the available monthly validation block more effectively than the recurrent alternative.

The search tables also show that the leading configurations for both models are concentrated around a 12-month input horizon. For the GRU, the top-ranked validation models are dominated by sequence length 12, with a few competitive 18-month alternatives appearing slightly below the optimum. The 24-month recurrent specifications generally produce weaker validation scores. For the MLP, the same pattern is even more pronounced: the best validation results are also obtained mainly with 12-month lag windows, especially when combined with hidden size 64 or 128 and mild to moderate dropout. This indicates that one full annual cycle contains the most informative memory for one-step-ahead monthly photovoltaic forecasting in the Chikalov dataset.

To further clarify the influence of the input horizon, we additionally summarized the best validation RMSE obtained for each tested lag length after minimizing over the remaining hyperparameters. The results confirm that the 12-month horizon provides the most favorable validation performance for both architectures. For the MLP, the best validation RMSE was 421.60 kWh for L = 12, compared with 460.87 kWh for L = 18 and 465.86 kWh for L = 24. Thus, extending the input window beyond one annual cycle did not improve the validation accuracy of the feed-forward model. A similar pattern was observed for the GRU. The best validation RMSE was 457.53 kWh for L = 12, compared with 462.45 kWh for L = 18 and 500.31 kWh for L = 24. These results show that longer windows introduce additional estimation burden without improving validation performance.

After hyperparameter selection, both models were refitted on the combined training and validation data and then evaluated on the held-out test period. At this final stage, the ranking changed. The GRU achieved RMSE = 296.38, MAE = 213.16, R² = 0.9231, MAPE = 7.52%, and sMAPE = 7.49%, whereas the refitted MLP achieved RMSE = 332.01, MAE = 242.06, R² = 0.9035, MAPE = 7.57%, and sMAPE = 7.92%. Therefore, although the MLP was superior on validation, the GRU produced clearly better out-of-sample performance on the final test subset.

The magnitude of the GRU improvement on the test set is practically meaningful. Relative to the MLP, the GRU reduced RMSE by about 35.63 units and MAE by about 28.89 units, while increasing R² by roughly 0.0196. The MAPE values of the two models are very close, differing by only around 0.05 percentage points, but the GRU retains the advantage in sMAPE as well. These results suggest that the recurrent architecture generalizes better to unseen monthly observations, even though the feed-forward model appeared more favorable during validation.

Another useful observation concerns the training dynamics. The selected MLP converged very quickly, stopping after only 12 epochs, while the selected GRU required 31 epochs. This behavior is consistent with the simpler optimization structure of the feed-forward network. However, faster convergence did not translate into better final generalization. On the contrary, the recurrent model, despite a slightly weaker validation fit, proved more robust once refitted and tested on the most recent block of observations.

To place these values in context, it should be noted that direct numerical comparison with the literature is not straightforward because PV forecasting studies differ substantially in installed capacity, temporal aggregation, forecast horizon, input variables, and train–test protocols. In particular, RMSE and MAE are scale-dependent and therefore cannot be directly compared between a 30 kW monthly yield dataset and studies based on hourly or daily power measurements from larger PV plants. For this reason, the relative indicators MAPE, sMAPE, and R² are more informative for cross-study interpretation. On the final test subset, the proposed pooled GRU achieved R² = 0.9231, MAPE = 7.52%, and sMAPE = 7.49%, while the pooled MLP achieved R² = 0.9035, MAPE = 7.57%, and sMAPE = 7.92%. These values indicate a high level of explained variance and a moderate relative forecasting error for a monthly dataset constructed from short and unbalanced PV series.

Recent studies report comparable or a bit better relative accuracy when richer input information is available. For example, recent reviews of PV forecasting show that MLP, recurrent, convolutional, graph-based, and hybrid neural models are widely used, and that RMSE, MAE, MAPE, sMAPE, and R² are standard evaluation criteria in this field [5,32]. Transformer and recurrent neural models have also been used for day-ahead or multi-step PV forecasting with historical power, weather observations, weather forecasts, and solar-geometry variables; in such settings, hybrid Transformer-LSTM variants may substantially reduce MAE relative to simple recurrent baselines [22]. Other recent comparative studies of short- and medium-term PV forecasting evaluate LSTM, CNN, and GRU models using MAE, RMSE, MAPE, and R², confirming that recurrent architectures often provide strong performance when temporal dependence is important [29]. In this context, the present results are competitive, especially because the proposed models use only lagged monthly yield, cyclical calendar encodings, and plant identifiers, without direct irradiance, temperature, cloud-cover, or numerical-weather-prediction inputs.

4. Discussion

The comparison between the two architectures leads to an interesting and substantively relevant conclusion. The MLP appears to be highly competitive as a pooled nonlinear baseline and is capable of extracting substantial predictive information from lagged monthly yields and seasonal encodings. Its strong validation performance indicates that a large part of the forecastable variation in the Chikalov data can indeed be represented through a fixed lagged feature vector. This is an important finding because it shows that monthly photovoltaic forecasting does not necessarily require a complex recurrent architecture in order to achieve good predictive accuracy.

At the same time, the final test results show that the GRU is ultimately more effective when the goal is robust out-of-sample forecasting. This can be interpreted in several ways. First, the recurrent hidden state allows the GRU to preserve chronological information more naturally than the MLP, which only sees the lagged inputs as a flattened vector. Second, the GRU may be better able to encode persistence effects and subtle transitions across neighboring months, especially when the annual cycle is strong but not perfectly regular. Third, the pooled recurrent representation may offer a more stable way of borrowing information across systems of unequal length, which is important in the present monthly panel setting [15]. These considerations are consistent with the rationale of the GRU design itself, namely, the use of update and reset gates to manage temporal memory and filter irrelevant sequence components.

The hyperparameter search further reinforces the importance of the 12-month input horizon. For both models, the best or near-best configurations are concentrated around one-year windows. This supports the interpretation that a full annual seasonal cycle is the most informative lag structure for monthly PV energy yield. Six-month memory is insufficient to capture the complete seasonal cycle. On the other hand, extending the horizon to 18 months occasionally produces competitive models, particularly in the GRU case, but does not improve the best validation score. Extending the window to 24 months generally worsens the validation results of both architectures. For a short and unbalanced monthly panel, this additional history appears to be partly redundant and may increase estimation uncertainty. This likely reflects the trade-off between richer historical context and the loss of effective sample size in short monthly datasets. When the time window becomes too long, the models appear to inherit additional estimation burden without receiving sufficiently informative new structure in return.

The selected hyperparameters also reveal architectural differences. The best GRU is comparatively compact, with hidden size 48 and no dropout, whereas the best MLP requires a wider hidden representation of 128 units together with dropout 0.2. This suggests that the MLP needs greater feed-forward capacity and stronger regularization in order to compete effectively, while the GRU attains its best performance with a more parsimonious recurrent structure. From a modeling perspective, this is reasonable: recurrence itself provides an inductive bias toward temporal organization, so the GRU can rely more on architecture and less on width.

One further point deserves attention. The selected validation winner is not necessarily the model with the best preliminary test-at-selection score. For example, several GRU configurations with 18-month windows achieve very strong test-set values during the search stage, and some MLP variants also perform competitively on test-at-selection. However, model selection must remain validation-based in order to preserve the integrity of the final test comparison. From this perspective, the fact that the GRU outperforms the MLP on the fully held-out test block is particularly informative: it indicates that recurrent modeling provides greater robustness beyond the tuning phase.

Overall, the experimental evidence suggests a balanced conclusion. The pooled MLP is a strong and computationally efficient benchmark that performs very well on the validation block and should not be dismissed as a simplistic baseline. Nevertheless, the pooled GRU delivers the best final forecasting accuracy and the strongest generalization on the unseen test period. For the present Chikalov monthly photovoltaic dataset, this makes the GRU the preferred model when predictive robustness is the primary objective, while the MLP remains an attractive alternative when simplicity, speed, and ease of implementation are emphasized.

The comparison with recent literature should be interpreted carefully. Many recent PV forecasting studies report lower MAPE or sMAPE values than those obtained here, but they often use high-frequency data, meteorological variables, weather forecasts, solar geometry descriptors, or much larger training datasets. For instance, recent Transformer and recurrent network studies for day-ahead PV forecasting use historical power together with weather observations, weather forecasts, and solar geometry inputs, which provide substantially richer information than the monthly production-only setting considered here [22]. Similarly, recent short-term PV forecasting studies evaluate many machine learning and deep learning models on high-resolution datasets and often benefit from dense temporal information and exogenous predictors [19,33]. By contrast, the present study deliberately focuses on a constrained but practically common case: monthly energy-yield forecasting for several related rooftop PV systems with unequal record lengths and no direct meteorological covariates.

From this perspective, the obtained GRU test performance is encouraging. The final R² = 0.9231 indicates that the model explains more than 92% of the variance in the held-out monthly observations, while the MAPE of 7.52% and sMAPE of 7.49% indicate a practically acceptable relative error for monthly energy planning purposes. The MLP also performs competitively, with R² = 0.9035, MAPE = 7.57%, and sMAPE = 7.92%, confirming that much of the forecastable structure is already contained in the lagged annual pattern. However, the GRU provides lower RMSE and MAE on the final test period, which is consistent with the broader literature showing that recurrent architectures are well suited to forecasting tasks where ordered temporal dependence and seasonal persistence are important [5,26,34,35].

5. Conclusions

This study investigated the forecasting of monthly solar energy production for the Chikalov photovoltaic systems through a global pooled neural network framework. By combining data from several related PV installations, the proposed approach addressed the practical difficulty of short and unbalanced monthly series while preserving the possibility of plant-specific learning. Two neural architectures were compared under the same chronological training, validation, and test design: a multilayer perceptron (MLP) based on lagged inputs and seasonal descriptors, and a gated recurrent unit (GRU) designed to model sequential temporal dependence.

The results show that both architectures are capable of producing accurate and practically meaningful forecasts. The reported findings indicate that the feed-forward model is a strong benchmark, but the recurrent architecture provides better generalization on unseen monthly observations.

An additional important conclusion is that the best configurations for both models are based on a 12-month input horizon, which confirms that one full annual cycle contains the most informative memory for monthly photovoltaic forecasting in the considered dataset. The selected MLP used a wider hidden representation with dropout, whereas the selected GRU achieved its best results with a more compact recurrent structure and no dropout. This suggests that the recurrent inductive bias of the GRU allows it to capture temporal persistence and seasonality more efficiently in the pooled monthly setting.

Overall, the study demonstrates that pooled neural forecasting is an effective framework for modeling monthly photovoltaic production and can support sustainable energy planning through accurate point forecasts and uncertainty-aware prediction intervals [13,14]. From a practical perspective, the MLP may be preferred when simplicity and computational speed are primary considerations, whereas the GRU appears to be the more suitable option when the main objective is robust predictive performance. Future work may extend the present framework by incorporating exogenous meteorological variables, comparing additional sequence architectures, and testing the pooled methodology on larger collections of photovoltaic systems.

Author Contributions

Conceptualization, F.S. and S.G.; methodology, S.G. and I.G.; software, I.G.; validation, F.S. and S.G.; formal analysis, I.G.; investigation, S.G.; resources, F.S. and V.T.; data curation, S.G.; writing—original draft preparation, F.S. and S.G.; writing—review and editing, F.S., S.G., I.G. and V.T.; visualization, I.G.; supervision, F.S. and S.G.; project administration, S.G. and V.T.; funding acquisition, S.G. and I.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the European Union-NextGenerationEU, through the National Recovery and Resilience Plan of the Republic of Bulgaria, project BG-RRP-2.013-0001-C01.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

The first three authors were partially supported by project No. 2025–FNSE–02 “Research of Natural and Anthropogenic Phenomena”, financed by the “Scientific Research” Fund of Ruse University. The fourth author was partially supported by the Centre of Excellence in Informatics and ICT under the Grant No BG16RFPR002-1.014-0018, financed by the Research, Innovation and Digitalization for Smart Transformation Programme 2021–2027 and co-financed by the European Union.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Appendix A. Pseudocode of the Forecasting Procedure

In this appendix section, pseudocode is provided for all stages of the forecasting procedure.

Algorithm A1. Data preprocessing and supervised sample construction

Input:
    Monthly PV yield tables for all Chikalov systems
    Lag length L
    Training end date and validation end date
Output:
    Training, validation, and test supervised samples
1. Read the monthly production tables for all PV systems.
2. Convert the data into long-format panel form:
       plant_id, date, monthly_yield.
3. Mark unrecorded observations as missing values.
4. For each plant:
       4.1. Sort observations chronologically.
       4.2. Add month index m_t.
       4.3. Compute seasonal encodings:
              s_t = sin(2πm_t/12),
              c_t = cos(2πm_t/12).
5. Estimate the mean and standard deviation of monthly yield using training data only.
6. Standardize all available monthly yields using the training mean and standard deviation.
7. For each plant and each target month t:
       7.1. Select the previous L months as the input window.
       7.2. If the full input window and the target value are observed:
              create one supervised sample.
       7.3. Otherwise:
              skip the sample.
8. Split all supervised samples chronologically into:
       training subset,
       validation subset,
       test subset.

Algorithm A2. MLP model selection and forecasting

Input:
    Supervised samples from Algorithm A1
    Grid of MLP hyperparameters:
       lag length L,
       hidden size,
       dropout rate,
       learning rate
Output:
    Selected MLP model, test forecasts, and evaluation metrics
1. For each hyperparameter configuration in the MLP grid:
       1.1. Construct flattened lagged input vectors.
       1.2. Concatenate the plant embedding to each input vector.
       1.3. Initialize the MLP model.
       1.4. Train the model on the training subset using Adam and smooth L1 loss.
       1.5. Apply early stopping using the validation subset.
       1.6. Compute validation RMSE, MAE, R2, MAPE, and sMAPE.
2. Select the MLP configuration with the lowest validation RMSE.
3. Refit the selected MLP on the combined training and validation subsets.
4. Generate forecasts for the test subset.
5. Transform predictions back to the original kWh scale.
6. Compute final test metrics.
7. Compute validation residual quantiles.
8. Construct 95% prediction intervals for the forecasts.

Algorithm A3. GRU model selection and forecasting

Input:
    Supervised samples from Algorithm A1
    Grid of GRU hyperparameters:
       sequence length L,
       hidden size,
       dropout rate,
       learning rate
Output:
    Selected GRU model, test forecasts, and evaluation metrics
1. For each hyperparameter configuration in the GRU grid:
       1.1. Construct ordered input sequences of length L.
       1.2. For each time step, include:
              standardized yield,
              sine seasonal encoding,
              cosine seasonal encoding,
              plant embedding.
       1.3. Initialize the GRU model.
       1.4. Train the model on the training subset using Adam and smooth L1 loss.
       1.5. Apply early stopping using the validation subset.
       1.6. Compute validation RMSE, MAE, R2, MAPE, and sMAPE.
2. Select the GRU configuration with the lowest validation RMSE.
3. Refit the selected GRU on the combined training and validation subsets.
4. Generate forecasts for the test subset.
5. Transform predictions back to the original kWh scale.
6. Compute final test metrics.
7. Compute validation residual quantiles.
8. Construct 95% prediction intervals for the forecasts.

Algorithm A4. Validation residual prediction intervals

Input:
    Selected trained model
    Validation observations and validation forecasts
    Forecasts for the target subset
    Significance level alpha = 0.05

Output:
    Lower and upper prediction bounds

1. Compute validation residuals:
       e_t = y_t - yhat_t.
2. Estimate empirical residual quantiles:
       q_lower = quantile(e_t, alpha/2),
       q_upper = quantile(e_t, 1 - alpha/2).
3. For each point forecast yhat_t:
       lower_t = yhat_t + q_lower,
       upper_t = yhat_t + q_upper.
4. Return the prediction interval:
       [lower_t, upper_t].

Appendix B. Forecast Results

This appendix section contains the forecasted yields for all PVs obtained by both MLP and GRU, together with their prediction intervals.

Figure A1. Forecasted yield for PV “Chikalov 1”, obtained by MLP.

Figure A2. Forecasted yield for PV “Chikalov 3”, obtained by MLP.

Figure A3. Forecasted yield for PV “Chikalov 4”, obtained by MLP.

Figure A4. Forecasted yield for PV “Chikalov 5”, obtained by MLP.

Figure A5. Forecasted yield for PV “Chikalov 6”, obtained by MLP.

Figure A6. Forecasted yield for PV “Chikalov 1”, obtained by GRU.

Figure A7. Forecasted yield for PV “Chikalov 3”, obtained by GRU.

Figure A8. Forecasted yield for PV “Chikalov 4”, obtained by GRU.

Figure A9. Forecasted yield for PV “Chikalov 5”, obtained by GRU.

Figure A10. Forecasted yield for PV “Chikalov 6”, obtained by GRU.

References

International Energy Agency. Renewables 2024; IEA: Paris, France, 2024. [Google Scholar]
Ahmed, R.; Sreeram, V.; Mishra, Y.; Arif, M.D. A review and evaluation of the state-of-the-art in PV solar power forecasting: Techniques and optimization. Renew. Sustain. Energy Rev. 2020, 124, 109792. [Google Scholar] [CrossRef]
Iheanetu, K.J. Solar photovoltaic power forecasting: A review. Sustainability 2022, 14, 17005. [Google Scholar] [CrossRef]
Mohamad Radzi, P.N.L.; Akhter, M.N.; Mekhilef, S.; Mohamed Shah, N. Review on the application of photovoltaic forecasting using machine learning for very short- to long-term forecasting. Sustainability 2023, 15, 2942. [Google Scholar] [CrossRef]
Yu, J.; Li, X.; Yang, L.; Li, L.; Huang, Z.; Shen, K.; Yang, X.; Yang, X.; Xu, Z.; Zhang, D.; et al. Deep learning models for PV power forecasting: Review. Energies 2024, 17, 3973. [Google Scholar] [CrossRef]
Di Leo, P.; Ciocia, A.; Malgaroli, G.; Spertino, F. Advancements and challenges in photovoltaic power forecasting: A comprehensive review. Energies 2025, 18, 2108. [Google Scholar] [CrossRef]
Tsai, W.-C.; Tu, C.-S.; Hong, C.-M.; Lin, W.-M. A Review of state-of-the-art and short-term forecasting models for solar PV power generation. Energies 2023, 16, 5436. [Google Scholar] [CrossRef]
Verdone, A.; Panella, M.; De Santis, E.; Rizzi, A. A review of solar and wind energy forecasting: From single-site to multi-site paradigm. Appl. Energy 2025, 392, 126016. [Google Scholar] [CrossRef]
Benitez, I.B.; Singh, J.G. A comprehensive review of machine learning applications in forecasting solar PV and wind turbine power output. J. Electr. Syst. Inf. Technol. 2025, 12, 55. [Google Scholar] [CrossRef]
Kaneva, T.; Valov, N.; Valova, I. Deep learning for PV power forecasting: An LSTM approach for residential systems. In Proceedings of the 2025 IEEE 31st International Symposium for Design and Technology in Electronic Packaging (SIITME), Brasov, Romania, 2025; IEEE: New York, NY, USA, 2025; pp. 1–4. [Google Scholar]
Husein, M.; Gago, E.J.; Hasan, B.; Pegalajar, M.C. Towards Energy Efficiency: A comprehensive review of deep learning-based photovoltaic power forecasting strategies. Heliyon 2024, 10, e33419. [Google Scholar] [CrossRef]
Feng, S.; Chen, R.; Huang, M.; Wu, Y.; Liu, H. Multisite long-term photovoltaic forecasting model based on VACI. Electronics 2024, 13, 2806. [Google Scholar] [CrossRef]
Massidda, L.; Bettio, F.; Marrocu, M. Probabilistic day-ahead prediction of PV generation. A comparative analysis of forecasting methodologies and of the factors influencing accuracy. Sol. Energy 2024, 271, 112422. [Google Scholar] [CrossRef]
Wang, G.; Zhou, Y.; Yan, Y.; Zhou, Z.; Yang, Z.; Dai, L.; Huang, J. Probabilistic photovoltaic power forecasting with reliable uncertainty quantification via multi-scale temporal–spatial attention and conformalized quantile regression. Sustainability 2026, 18, 739. [Google Scholar] [CrossRef]
Li, L.; Li, Z. A hybrid interval prediction framework for photovoltaic power prediction using BiLSTM–transformer and adaptive kernel density estimation. Appl. Sci. 2026, 16, 3023. [Google Scholar] [CrossRef]
Sapundzhi, F.; Chikalov, A.; Georgiev, S.; Georgiev, I. Predictive modeling of photovoltaic energy yield using an ARIMA approach. Appl. Sci. 2024, 14, 11192. [Google Scholar] [CrossRef]
Aksan, F.; Suresh, V.; Janik, P. PV Generation prediction using multilayer perceptron and data clustering for energy management support. Energies 2025, 18, 1378. [Google Scholar] [CrossRef]
Aksan, F.; Pawlica, A.; Suresh, V.; Janik, P. A comparative study of machine learning models for PV energy prediction in an energy community. Energies 2025, 18, 5980. [Google Scholar] [CrossRef]
Cho, K.; van Merriënboer, B.; Gulcehre, C.; Bahdanau, D.; Bougares, F.; Schwenk, H.; Bengio, Y. Learning phrase representations using RNN encoder–decoder for statistical machine translation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 2014; Association for Computational Linguistics: Stroudsburg, PA, USA, 2014; pp. 1724–1734. [Google Scholar]
Li, P.; Zhou, K.; Lu, X.; Yang, S. A hybrid deep learning model for short-term PV power forecasting. Appl. Energy 2020, 259, 114216. [Google Scholar] [CrossRef]
Abdelsattar, M.; Azim, M.A.; AbdelMoety, A.; Emad-Eldeen, A. Comparative Analysis of Deep Learning Architectures in Solar Power Prediction. Sci. Rep. 2025, 15, 31729. [Google Scholar] [CrossRef]
Kim, J.; Obregon, J.; Park, H.; Jung, J. Multi-Step Photovoltaic Power Forecasting Using Transformer and Recurrent Neural Networks. Renew. Sustain. Energy Rev. 2024, 200, 114479. [Google Scholar] [CrossRef]
Jang, S.Y.; Oh, B.T.; Oh, E. A Deep Learning-Based Solar Power Generation Forecasting Method Applicable to Multiple Sites. Sustainability 2024, 16, 5240. [Google Scholar] [CrossRef]
Depoortere, J.; Driesen, J.; Suykens, J.; Kazmi, H.S. SolNet: Open-Source Deep Learning Models for Photovoltaic Power Forecasting across the Globe. Int. J. Forecast. 2025, 41, 1223–1236. [Google Scholar] [CrossRef]
Ouyang, J.; Zuo, Z.; Wang, Q.; Duan, Q.; Zhu, X.; Zhang, Y. Seasonal Distribution Analysis and Short-Term PV Power Prediction Method Based on Decomposition Optimization Deep-Autoformer. Renew. Energy 2025, 246, 122903. [Google Scholar] [CrossRef]
Aydın, E.Y.; Önal, K.; Haydaroğlu, C.; Kılıç, H.; Yıldırım, Ö.; Katar, O.; Erdoğan, H. A Novel Scenario-Based Comparative Framework for Short- and Medium-Term Solar PV Power Forecasting Using Deep Learning Models. Appl. Sci. 2025, 15, 12965. [Google Scholar] [CrossRef]
Renkema, Y.; Visser, L.; AlSkaif, T. Enhancing the Reliability of Probabilistic PV Power Forecasts Using Conformal Prediction. Sol. Energy Adv. 2024, 4, 100059. [Google Scholar] [CrossRef]
Renkema, Y.; Brinkel, N.; AlSkaif, T. Conformal Prediction for Stochastic Decision-Making of PV Power in Electricity Markets. Electr. Power Syst. Res. 2024, 234, 110750. [Google Scholar] [CrossRef]
Tian, L.; Zhang, Q.; Li, X.; Li, C. Fracturing Effectiveness Evaluation Based on Flowback Data Using Pressure Transient Testing. Reserv. Sci. 2026, 2, 97–110. [Google Scholar] [CrossRef]
Hu, Y.; Yang, Y. A Comparative Study on Drag Reduction Methods for Continental Shale Drilling in the Fuxing Block, Southeastern Sichuan Basin. Reserv. Sci. 2026, 2, 81–96. [Google Scholar] [CrossRef]
Ali, J.; Ansari, U.; Ali, F.; Javed, T.; Hullio, I.A. Application of Machine Learning for Effective Screening of Enhanced Oil Recovery Methods. Reserv. Sci. 2026, 2, 65–80. [Google Scholar] [CrossRef]
Asghar, R.; Fulginei, F.R.; Quercio, M.; Mahrouch, A. Artificial Neural Networks for Photovoltaic Power Forecasting: A Review of Five Promising Models. IEEE Access 2024, 12, 90461–90485. [Google Scholar] [CrossRef]
Singh, P.K.; Saraswat, A.; Gupta, Y. Deep Learning Prediction Models for Short-Term Solar Photovoltaic Power Generation. Next Energy 2026, 11, 100531. [Google Scholar] [CrossRef]
Sapundzhi, F. Study of the Effect of the Energy Produced from a Grid-Connected Rooftop Solar PV System for Small Households. Int. J. Online Biomed. Eng. 2022, 18, 147–154. [Google Scholar] [CrossRef]
Sapundzhi, F.; Chikalov, A.; Georgiev, S.; Georgiev, I.; Todorov, V. Robust time series analysis for forecasting photovoltaic energy yield. E3S Web Conf. 2025, 638, 02003. [Google Scholar] [CrossRef]

Table 1. Values of the total energy yield [kWh] generated by the 30 kW photovoltaic system—“Chikalov 1”, located in the town of Simitli, Blagoevgrad, Southwestern Bulgaria.

Year	January	February	March	April	May	June	July	August	September	October	November	December
2012						5286	5634	5201	4201	3460	1842	1574
2013	1500	1660	2921	4224	4892	4897	6273	5730	4554	3964	2105	2286
2014	1187	3035	4244	3601	4926	5440	5725	5539	3685	2804	1677	1299
2015	2269	3116	3058	4752	5389	5424	5861	5271	3360.25	2887.25	n. a.	n. a.
2016	1709.25	2458.25	3442.25	4289	4421.25	4917.75	5235.75	4862.25	3803	2627.75	n. a.	n. a.
2017	1612	1388.25	2824	3112.5	4264.75	4549.25	4590.25	4331.25	4065	3590.25	2418.75	1258.75
2018	2012.25	1626	2805	4573.5	4899	4317.5	4863	4880	4439	3201	1145.5	1290
2019	989	2633.5	3688	3996.25	4307.25	4806.5	5124.75	5172.25	4260.25	3708.75	1474	1286.75
2020	2154	2887.25	3322.5	4218	4450.75	4441.75	5202.25	4897	4162.75	2982.25	2320.50	1100.00
2021	1575.75	2698.50	3645.25	4068.25	4681.50	4932.25	5176.00	4624.00	3904.25	2800	1761.75	1336.25
2022	2375.25	2431.00	3823.25	3950.25	5033.00	5030.50	5426.25	4424.25	4088	3603.5	1992.25	1569
2023	1267.75	2477.00	2987.50	3187.50	3636.00	4616.00	5292.25	4939.50	3944.00	3420	1806.5	1423.25
2024	2036.00	2702.00	3395.00	4442.00	4341.00

In Table 1, the missing records for November and December 2015 and 2016 are marked as “n. a.” because these values were not recorded.

Table 2. Values of the total energy yield [kWh] generated by the 30 kW photovoltaic system—“Chikalov 3”, located in the village of Cherniche, Blagoevgrad, Southwestern Bulgaria.

Year	January	February	March	April	May	June	July	August	September	October	November	December
2020		814.5	3227.5	4265.25	4599.75	4032	5154.5	4685	3995	2775.5	1916.50	1023.25
2021	1320.50	2134.50	3490.75	4010	4717.75	4907.25	5101.00	4530.50	3230.75	2538.75	1487.25	1092.75
2022	1938.00	2118.50	3605.00	3945.00	4792.75	4952.00	5252.25	3518.00	3808	3149.5	1691.75	1364
2023	1145.50	2358.50	3127.75	3644.75	2131.50	4411.00	4921.00	4572.00	3523.50	2915	1389	1183.25
2024	1400.00	2308.50	3121.00	3650.25	3549.25

Table 3. Values of the total energy yield [kWh] generated by the 30 kW photovoltaic system—“Chikalov 4”, located in the village of Poleto, Blagoevgrad, Southwestern Bulgaria.

Year	January	February	March	April	May	June	July	August	September	October	November	December
2020										801	2081.75	1052.75
2021	1389.50	2023	3351.50	3970.50	4860.50	4883	5426	4671	3641.75	2575.25	1650	1095.75
2022	1826.25	1878.50	3285.75	4052.75	5019.25	5125.25	5411	4276.25	3911.5	3177	1714	1444.5
2023	1216.75	2408.00	3146.50	3711.00	3771.50	4655.25	5368.00	4687.25	3645.50	2976.25	1420.50	1312.50
2024	1376.50	2388.00	3153.00	4394.50	4437.00

Table 4. Values of the total energy yield [kWh] generated by the 30 kW photovoltaic system—“Chikalov 5”, located in the village of Poleto, Blagoevgrad, Southwestern Bulgaria.

Year	January	February	March	April	May	June	July	August	September	October	November	December
2021												597.50
2022	1796.50	1962.50	3229.25	3961.5	4889	4991	5269.75	4166	3809.5	3091.5	1671.75	1412.75
2023	1191.50	2351.25	3067.00	3617.50	3672.00	4518.25	5209.50	4559.00	3549.25	2903.75	1394.25	1285.75
2024	1346.25	2328.00	3072.50	4267.75	4304.75

Table 5. Values of the total energy yield [kWh] generated by the 30 kW photovoltaic system—“Chikalov 6”, located in the village of Poleto, Blagoevgrad, Southwestern Bulgaria.

Year	January	February	March	April	May	June	July	August	September	October	November	December
2021												590
2022	1796.50	1831	3191.50	3906	4834.75	4940.25	5208.5	4101.25	3750.75	3047	1643.5	1390
2023	1171.25	2318.25	3030.50	3570.75	3623.00	4465.25	4908.75	4502.25	3493.50	2851.25	1363.00	1260.25
2024	1304.00	2292.00	3026.00	4206.25	4239.50

Table 6. Descriptive statistics of the monthly total energy yield for the Chikalov PV datasets.

PV System	Location	Covered Period	Months in Covered Period	Missing Records	Mean [kWh]	Std. Dev. [kWh]	Min [kWh]	Max [kWh]
Chikalov 1	Simitli	June 2012–May 2024	144	4	3550.19	1365.71	989.00	6273.00
Chikalov 3	Cherniche	February 2020–May 2024	52	0	3164.17	1331.07	814.50	5252.25
Chikalov 4	Poleto	October 2020–May 2024	44	0	3151.44	1432.57	801.00	5426.00
Chikalov 5	Poleto	December 2021–May 2024	30	0	3116.22	1368.81	597.50	5269.75
Chikalov 6	Poleto	December 2021–May 2024	30	0	3061.89	1345.46	590.00	5208.50

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sapundzhi, F.; Georgiev, S.; Georgiev, I.; Todorov, V. Forecasting Solar Energy Production Through Modeling of Photovoltaic System Data for Sustainable Energy Planning. Appl. Sci. 2026, 16, 5053. https://doi.org/10.3390/app16105053

AMA Style

Sapundzhi F, Georgiev S, Georgiev I, Todorov V. Forecasting Solar Energy Production Through Modeling of Photovoltaic System Data for Sustainable Energy Planning. Applied Sciences. 2026; 16(10):5053. https://doi.org/10.3390/app16105053

Chicago/Turabian Style

Sapundzhi, Fatima, Slavi Georgiev, Ivan Georgiev, and Venelin Todorov. 2026. "Forecasting Solar Energy Production Through Modeling of Photovoltaic System Data for Sustainable Energy Planning" Applied Sciences 16, no. 10: 5053. https://doi.org/10.3390/app16105053

APA Style

Sapundzhi, F., Georgiev, S., Georgiev, I., & Todorov, V. (2026). Forecasting Solar Energy Production Through Modeling of Photovoltaic System Data for Sustainable Energy Planning. Applied Sciences, 16(10), 5053. https://doi.org/10.3390/app16105053

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Forecasting Solar Energy Production Through Modeling of Photovoltaic System Data for Sustainable Energy Planning

Featured Application

Abstract

1. Introduction

2. Data and Methods

2.1. Chikalov PV Systems and Dataset

2.2. Multilayer Perceptron (MLP) Architecture

2.3. Gated Recurrent Unit (GRU) Architecture

2.4. Experimental Design and Evaluation Metrics

2.5. Computational Workflow and Replication Details

3. Results

4. Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Pseudocode of the Forecasting Procedure

Appendix B. Forecast Results

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI