Next Article in Journal
Boiling and Condensing Two-Phase Frictional Pressure Drop Within Minichannel Tubes—Comparison and New Model Development Based on Experimental Measurements
Next Article in Special Issue
Adaptive Droop Control for Power Distribution of Hybrid Energy Storage Systems in PV-Fed DC Microgrids
Previous Article in Journal
Solar-Pumped Ce:Nd:YAG Laser Amplifier Design
Previous Article in Special Issue
Sensorless Compensation of DC-Link Current Pulsations in Energy Storage Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

A Review of Methodologies for Photovoltaic Energy Generation Forecasting in the Building Sector

1
Department of Electrical and Computer Engineering, University of Coimbra, 3030-290 Coimbra, Portugal
2
Institute for Systems Engineering and Computers at Coimbra, DEEC, University of Coimbra, 3030-290 Coimbra, Portugal
3
Institute of Systems and Robotics, DEEC, University of Coimbra, 3030-290 Coimbra, Portugal
*
Author to whom correspondence should be addressed.
Energies 2025, 18(18), 5007; https://doi.org/10.3390/en18185007
Submission received: 14 August 2025 / Revised: 10 September 2025 / Accepted: 17 September 2025 / Published: 20 September 2025

Abstract

Photovoltaic (PV) systems are swiftly expanding within the building sector, offering significant benefits such as renewable energy integration, yet introducing challenges due to mismatches between local generation and demand. With the increasing availability of data and advanced modeling tools, stakeholders are increasingly motivated to adopt energy management and optimization techniques, where accurate forecasting of PV generation is essential. While the existing literature provides valuable insights, a comprehensive review of methodologies specifically tailored for the forecast of PV generation in buildings remains scarce. This study aims to address this gap by analyzing the forecasting methods, data requirements, and performance metrics employed, with the primary objective of providing an in-depth review of previous research. The findings highlight the critical role of improving PV energy generation forecasting accuracy in enhancing energy management and optimization for individual buildings. Additionally, the study identifies key challenges and opportunities for future research, such as the limited exploration of localized environmental and operational factors (such as partial shading, dust, and dirt); insufficient data on building-specific PV output patterns; and the need to account for variability in PV generation. By clarifying the current state of PV energy forecasting methodologies, this research lays essential groundwork for future advancements in the field.

1. Introduction

1.1. Motivation

About 40% of primary energy [1] and 60% of the world’s electricity are consumed by residential and commercial buildings, contributing to about 33% of global greenhouse gas emissions [2]. Hence, the building sector has become a significant concern for policymakers and scholars, with the drive for green technologies focused on the minimization of environmental impacts [3] and energy costs [4]. Within this framework, photovoltaic (PV) systems stand out as the preferred choice to be integrated into buildings [5], primarily due to their capacity to produce decentralized electricity [6] and seamless integration into architectural structures [7], using the building-attached PV (BAPV) and building-integrated PV (BIPV) options. Furthermore, technological advancements and market implementation have driven down the cost of PV panels, rendering them increasingly appealing for integration into building infrastructure [8].
Building on this momentum, the adoption of PV systems in buildings is rapidly increasing. For instance, the building-integrated photovoltaics (BIPV) market is experiencing rapid growth, with a projected compound annual growth rate of 21% between 2024 and 2030 [9]. Also, global rooftop solar capacity has continued its rapid growth, increasing from 118 GW in 2022 [10] to 159 GW in 2023 [11], with projections reaching nearly 190 GW by the end of 2024 [12].
However, the integration of solar energy into buildings presents a challenge due to its inherent variability [13]. Solar power is available only during daylight hours and is susceptible to the impact of climate, geographical location, seasonal variations, and time of day. Additionally, the performance of PV modules may be impacted by the buildup of dust in the summer [14] and snowfall during the winter [15]. Hence, the growing integration of variable and intermittent renewable energy sources can lead to a misalignment between energy supply and demand [16], potentially disrupting power system stability, efficiency, quality, and reliability [17].
To address the mismatch between energy supply and demand, a range of solutions can be found in the existing body of literature. Incorporating buildings into communities [18] by facilitating the sharing of surplus renewable energy generation stands as a viable solution for addressing this challenge. Additionally, Energy Storage Systems (ESSs) and Demand Response (DR) programs assume vital roles in bestowing the requisite flexibility to ensure the matching between renewable generation and energy demand [19]. In this framework, flexible energy resources assume an essential role within DR initiatives, enabling the minimization of peak demand and maximization of the matching with renewable generation availability [20]. In addition, electric vehicles play a crucial role in promoting energy flexibility by serving as DR (with charging management) or ESS (with vehicle-to-building) resources [21].
In the context of the mismatch between renewable energy generation and demand, accurate forecasting in both cases emerges as a crucial tool for effectively managing energy flexibility resources [22] and consumption in buildings [23]. Moreover, it facilitates the efficient utilization of ESSs by providing insights into periods with potential renewable generation surplus or deficit [24].
The forecasting algorithms are typically integrated within Building Energy Management Systems (BEMSs) [25]. BEMSs gather and process data from diverse sources, including weather stations; historical energy consumption records; grid information [26]; and the internet of things devices, such as smart meters, occupancy sensors, and temperature monitors, which supply real-time data on energy consumption patterns and building conditions [27]. Subsequently, the algorithms analyze this data to forecast PV energy generation and energy demand [28]. These forecasts are typically presented as time series data, indicating energy production or consumption at specific intervals, such as hourly or daily [29].
BEMSs commonly use PV generation forecasts as inputs to optimization and scheduling routines that control flexible loads and battery storage. Forecasts are fed into receding horizon controllers or optimization engines to decide when to charge/discharge batteries, shift electric water heating or HVAC preconditioning, and manage EV charging, thereby enabling anticipatory, cost-, and emissions-aware operations.
For example, Amabile et al. [30] studied a residential home equipped with PV and an electric water heater. In this case, the EMS utilized a 30-min PV generation forecast to schedule hot water heating. The forecast-driven optimization boosted PV self-consumption by about 11% compared to a rule-based controller. Angenendt et al. [31] implement forecast-based operation strategies (tested with perfect and persistence forecasts) combined with variable PV feed-in limits in a residential PV–battery EMS. In simulation, this approach substantially increased the total energy throughput and lifetime of the battery and reduced the cost by approximately 12%. In a university microgrid, Duran et al. [32] incorporated LSTM-based PV power forecasts into a mixed-integer algorithm. By anticipating PV generation (to store energy and dispatch it by EMS when solar resources are scarce) and load forecasts, the EMS manages the state of charge of the battery system. It reduced peak loads, resulting in lower emissions. In their case, the forecast-based strategy reduced CO2 emissions by 3.58% and covered 59% of the demand.
These examples illustrate that integrating PV forecasts into BEMSs enables anticipatory, optimization-driven control of storage and flexible loads—shifting consumption into high-PV periods, reducing curtailment and peak demand, and moderating battery cycling—which yields higher self-consumption, lower operating costs and emissions, and reduced grid stress. Advanced management strategies, supported by forecasting models, optimize renewable integration, besides contributing to more efficient energy usage. Ultimately, these advancements have the potential to make buildings more sustainable and responsive to energy demands.
Therefore, given the challenges posed by energy demand and renewable energy’s variability within the building sector, accurate forecasting of energy production and demand is essential. However, to realize these benefits in practice, forecasts must explicitly account for building-specific factors (partial shading, soiling, inverter topology, and façade orientation) that increase uncertainty in on-site PV output.

1.2. Research Gap and Contributions

According to the review articles, numerous studies have explored the forecasting of demand in buildings [33,34,35,36] and the prediction of PV output [37,38,39,40,41]. The authors of [37,38] aimed to highlight the significance of ML and the importance of data preprocessing and parameter selection in predicting solar energy production. In [39], Erdener et al. concentrated on PV prediction applications for aggregate, regional behind-the-meter production effective to bulk systems and utility operations. Sobri et al. [40] examined a range of methods for solar production forecasting, including time series statistical methods, physical methods, and ensemble methods. In 2019, Viscondi et al. conducted a systematic review to explore the role of big data in solar energy production forecasting [41].
To our best understanding, there is currently no review article exclusively dedicated to forecasting PV generation, specifically within the building sector. The novelty of this paper lies in providing a systematic, in-depth review of PV generation forecasting methods specific to the building sector. The contributions of this paper are fourfold: (1) It identifies the diverse objectives and constraints associated with building-scale PV forecasting, (2) it analyzes the various techniques and computational methods applied in recent studies, (3) it examines the different metrics utilized in recent researches, and (4) it highlights areas where the current methodologies fall short, particularly in integrating emerging technologies and sustainable practices. Ultimately, this work provides a comprehensive perspective on the state-of-the-art methods in building-scale PV forecasting, offering critical insights that can inform future research and guide innovation in this crucial area.

1.3. Paper Organization

The subsequent sections of this paper are structured as follows. Section 2 outlines the literature review methodology, detailing the keywords, databases, and encountered limitations. Section 3 examines PV generation forecasting in buildings, emphasizing building-specific peculiarities affecting PV performance, data quality, key datasets, forecasting methods, real-world use cases, and emerging forecasting methods. It also explores PV adoption across building types, associated challenges, and evaluation approaches, covering metrics, post-processing, and uncertainty quantification. Additionally, a review of studies since 2019 highlights diverse forecasting approaches and algorithms. Section 4 presents key findings, discusses the literature limitations, and proposes a research framework. Finally, Section 5 summarizes the study’s main contributions and implications.

2. Literature Review Methodology

This section details the approach used to identify and analyze the relevant research on PV energy generation forecasting in the building sector. It outlines the search strategy, selection criteria, and limitations inherent to the review process.
This literature review used a manual qualitative content analysis approach adapted from Neuendorf [42]. Each included paper was read in full, and key information (forecasting methods, evaluation metrics, geographic region, etc.) was summarized in the body of the article and tables. An inductive thematic synthesis of these extractions was then performed to identify recurring methodological patterns, application gaps, and directions for future research. A comprehensive search was conducted across the Scopus database, using a set of targeted keywords: TITLE-ABS-KEY (photovoltaic AND generation OR production AND forecast* AND building) AND PUBYEAR > 2018 AND PUBYEAR < 2025 AND (LIMIT-TO (SUBJAREA, “ENGI”)) AND (LIMIT-TO (DOCTYPE, “ar”) OR LIMIT-TO (DOCTYPE, “cp”)) AND (LIMIT-TO (LANGUAGE, “English”)). Relevant articles were identified through a thorough review of article titles, abstracts, and contents. The selected articles were then subjected to a rigorous analysis to identify existing solutions, research gaps, and potential advancements to address the identified gaps.
Around 83.15% of the papers on this topic appeared in journals, and 75.28% of all journal and conference papers focused on BAPV.
This study’s literature selection was restricted to articles published between 2019 and 2024 to ensure a focus on the most recent advancements in the field. This time frame was chosen due to significant developments in AI techniques for energy forecasting, enabling a current and comprehensive overview of best practices. The choice of keywords has influenced the scope of the review, and thus, future studies could expand keyword selection to include emerging trends and topics. Additionally, this paper offers a narrative review of relevant literature, recognizing that this approach differs from a systematic review. The criteria for including or excluding articles retrieved through the search are outlined in Figure 1.
Furthermore, in this review, the main differences compared to forecasting for power plants are discussed, even though the methods and technologies of PV generation forecasting in the building sector and power plants have many similarities. Both types of forecasting rely on the same fundamental principles, such as the use of historical data [43], weather forecasts [44], and physical models [45] to estimate future energy generation. Additionally, both types of forecasting can benefit from the use of AI techniques, including Artificial Neural Networks (ANNs) and Machine Learning (ML) [46]. They also exhibit distinct characteristics due to their scale (size/capacity and distributed/centralized) [47], data availability, and optimization goals.
PV power forecasting can be categorized based on how far into the future it forecasts, with each horizon useful for different applications. Table 1 describes different forecasting horizons. Rooftop PV forecasting often focuses on a finer granularity, such as hourly or sub-hourly predictions, to align with the dynamic nature of residential energy consumption [48]. Utility-scale forecasting may employ daily or even longer-term predictions to manage grid stability and resource planning [49].
Rooftop PV forecasting may have access to more detailed data from individual installations, including shading patterns [51], panel orientation, and historical energy production data [52]. Additionally, power plants have more advanced monitoring systems and can collect more data than rooftop PV arrays, which allows for more accurate forecasting [53]. Finally, rooftop PV forecasting may prioritize optimizing energy generation for individual households or communities, considering aspects like self-consumption, self-sufficiency, and peak shaving [54].
It is important to acknowledge a limitation in the scope of this review, as it exclusively concentrates on the building sector. While this choice enables a detailed examination of rooftop PV forecasting methodologies tailored to residential and small-scale installations, it may not encompass the full spectrum of forecasting challenges and solutions in larger utility-scale power plants. Consequently, this review may not fully represent the unique considerations, methodologies, and technologies specific to utility-scale PV forecasting. Furthermore, the results of this study may not be generalizable to other sectors, which may have different forecasting challenges and solutions.

3. Results

Adequate energy management in buildings is imperative for energy acquisition cost reduction, minimizing energy consumption, and mitigating environmental impact, all while ensuring that user comfort remains uncompromised. As mentioned earlier, forecasting stands out as a powerful tool for achieving these goals. Exploring the field of PV generation forecasting in the building sector, Section 3 provides a comprehensive literature review to categorize and understand the various efforts in forecasting PV generation in buildings. Section 3.1 explains how physical, environmental, and technological factors influence photovoltaic system generation. Section 3.2 highlights the critical role of data quality in ensuring accurate PV generation prediction. Section 3.3 details the different prediction methods. In Section 3.4, there is an anticipation of the evaluation approach’s role in addressing challenges within the building sector. Section 3.5 introduces the various building types that utilize PV systems. Section 3.6 offers a thorough review of studies for PV generation forecasting in the building sector, with a specific focus on the diverse approaches and algorithms used in this area of research. Section 3.7 presents real-world applications of PV generation forecasting in buildings. Section 3.8 explores recent approaches to time series forecasting. Finally, the practical implications of forecasting accuracy are explained in Section 3.9.

3.1. Building-Specific Peculiarities/Environment Affecting PV Performance

Building-related characteristics/environment affecting PV performance are classified into two broad groups, as outlined in the sections below.

3.1.1. Physical and Environmental Factors

PV installations in the building sector have unique characteristics that affect generation forecasting. façade-mounted PV experiences different irradiance incidences (a wider range of incidence angles); much more frequent and complex partial shading in urban settings [55,56]; and different thermal boundary conditions (ventilation, wind exposure, and convective heat transfer) [57] that change module back sheet temperatures and therefore electrical output and reliability.
Vertical PV refers to PV modules installed with a near-vertical plane rather than the conventional tilted roof orientation. In general, vertically mounted mono-facial modules produce substantially less annual energy per unit area than optimally tilted arrays under the same location because of higher incidence angles and lower direct irradiance; documents report that optimally tilted arrays can produce 50–70% more annual energy than vertical façades in many cases [58].
However, two important caveats apply. First, the use of vertical bifacial PV can help recover the yield gap, because it harvests reflected and diffuse radiation [59]. In addition, vertical bifacial PV may outperform nearby vertical mono-facial systems under high-albedo, high-latitude, or snow-enhanced conditions [60]. Second, vertical configurations (especially east–west orientations) shift production towards morning and evening hours, which often improves temporal matching with building loads and increases self-consumption values despite lower midday output [60]. Because of these dependencies, accurate yield assessment for vertical PV requires explicit modeling of bifacial gains, ground albedo, view factors, shading, and angle-of-incidence losses.
Partial shading stands as one of the most important causes of performance loss in PV arrays, potentially leading to reduced power output and cell damage if undetected [61]. This issue is particularly critical in the building sector. As noted by IEA-PVPS (2023) [62], BIPVs often impose shading or suboptimal orientation (panels may receive non-ideal sun angles) on PV modules, exacerbating energy losses. Standard forecasting models often overlook these effects, but they can significantly bias PV output. For example, Kappler et al. [63] developed a Machine Learning shading correction for PV forecasts that uses internal inverter signals and previous day irradiance to detect shading intensity and correct predictions, reporting RMSE reductions of up to 40% on several 10 kWp systems, showing that explicitly modeling shading-related fluctuations substantially improves short-term forecast accuracy.
Similarly, soiling constitutes one of the most critical environmental factors that can be the largest sources of error for rooftop PV forecasts, and incorporating them yields more robust predictions [64]. Li et al. [65] found that, even in a rainy, subtropical city, residual soiling (especially bird droppings) caused up to 15.5% PV power loss. Such nonuniform soiling introduces extra uncertainty into forecasts.
Urban canyons are defined as street-scale spaces bounded by building façades [66], which materially alter the solar resource available to building envelopes and, hence, affect PV forecasting and EMS decisions. Urban canyons are commonly characterized by the height-to-width ratio (H/W) [67] and the sky view factor (SVF). H/W describes the canyon’s relative depth and strongly influences canyon shadowing, along with actual shading [67], while SVF (fraction of visible sky) quantifies the sky exposure (range 0–1 means completely obstructed sky to completely open sky) [68], so SVF often correlates with irradiance on façades. In addition, vegetation, canyon orientation, height variability, and building density further modulate both direct and diffuse irradiance [68]. Altogether, all the mentioned factors determine the availability, composition (direct vs. diffuse), and timing of solar irradiance reaching façades and rooftops and, thus, the electrical yield of façade-mounted PV. Considering SVF and vegetation, Rostami et al. [69] found that narrow, tree-lined canyons can reduce incident solar radiation by up to 75%, whereas broad canyons (H/W < 0.5) received roughly four times the radiation of narrow (H/W > 1.8), shaded canyons. The same study reported that 65% of total solar radiation was received by rooftops, highlighting the potential of rooftops to install PV systems.
As a result, PV generation is influenced by physical and environmental factors. These include the building façade, vertical PV installations, partial shading, soiling, and urban canyon effects, all of which can significantly alter the amount of solar energy captured and the overall system performance.

3.1.2. Technology Factors

Inverter topology describes the electronic architecture that converts PV DC to grid-compatible AC and implements system-level controls such as Maximum Power Point Tracking (MPPT) [70]. Topologies range from centralized/string inverters (many modules series-connected to a single DC–AC stage) to hybrid systems with per module DC optimizers to fully distributed Module-Level Power Electronics (MLPEs) such as microinverters that perform per module MPPT and DC–AC conversion [71]. Figure 2 shows a simplified schematic of the different types of inverters.
String inverters remain the dominant, cost-efficient choice. However, they require uniform module orientation/tilt within each string and are sensitive to partial shading or individual panel faults; when a panel is bypassed, the performance of the whole string can decline [71]. Therefore, Module-Level Power Electronics (MLPEs) devices were applied to address these challenges. Microinverters are commonly used MLPE devices that mitigate these issues by providing per module MPPT, reducing mismatch and partial shading losses, and improving yield under nonuniform irradiance. Microinverters also support module-level rapid shutdown for safety, greater design flexibility, and detailed monitoring and remote diagnostics at a higher cost [71].
Different inverter architectures provide different telemetry profiles; microinverters typically expose module-level signals, while centralized/string inverters usually report string- or inverter-level aggregates. Inverters offer rich module-level telemetry, which is highly useful for building energy management and reproducible forecasting studies. Modern systems typically expose instantaneous AC power and cumulative energy, device temperature, MPPT status, grid voltage/frequency, power factor, and fault/status codes [72]. These signals are aggregated by a local gateway and exported via PLC or IP to cloud APIs [72]. Module-level data enable fault localization, façade-level performance attribution, and tighter integration with BEMSs for control evaluation.
Consequently, the inverter topology class (central/string, optimizer, and microinverter) influences the model complexity, error metrics, and the interpretation of performance differences between studies.

3.2. The Role of High-Quality Data in PV Generation Forecasting in the Building Sector

Data play an important role in forecasting PV generation in buildings, where accurate forecasts are essential for effective energy management, increasing self-sufficiency and self-consumption, and reducing electricity bills.
Forecasting models rely on historical and real-time data to identify patterns, trends, and anomalies. Accurate input data, such as historical solar radiation and PV generation records, are crucial for capturing the dynamic and intermittent nature of solar energy and PV generation. Additionally, incorporating building-specific factors, such as shading effects, enhances the accuracy of the forecasts. Poor or incomplete data can result in unreliable predictions, adversely affect energy planning, and increase dependency on the grid. Given the critical role of data in the field, Table 2 introduces the relevant databases.

3.3. Types of Forecasting Methods

A comprehensive taxonomy of prediction models encompasses physical, statistical, hybrid, and ensemble methodologies [50]. A brief description of the methods is given below.

3.3.1. Physical Methods

Physical methods for PV forecasting encompass a variety of approaches, including Numerical Weather Prediction (NWP) models and PV module performance models. NWP models [94], which can also be applied to satellite imagery [95] to forecast the state of the weather conditions, operate independently from historical PV data [96] and primarily rely on the stability of prevailing weather conditions [97]. Hence, employing post-processing techniques proves to be an effective tool in this context [98]. NWP forecasting models work by solving mathematical formulations grounded in meteorological data. Consequently, their operational implementation requires substantial computational resources accompanied by elevated costs. Although NWP forecasting models have high computational demand, they exhibit enhanced reliability, particularly in the context of long-term weather forecasts [99].
As mentioned, NWP models are just one piece of the puzzle in physical methods for PV forecasting. PV module performance models account for the electrical behavior of the PV modules. They may consider factors like temperature, irradiance levels, and module characteristics to predict the power output, including inverter losses [100].

3.3.2. Statistical Methods

Statistical methodologies rely on establishing correlations between the explanatory variables employed and the model’s target outcomes to facilitate precise forecasting. Time series models are the foundation for predicting the electricity generated by solar panels on buildings. A time series constitutes a sequence of observations of a parameter recorded at consecutive time intervals. The statistical model can model the system’s behavior without needing to understand its underlying mechanisms [40].
With the advancement of science, Artificial Intelligence, which encompasses statistical models, is revolutionizing time series forecasting, offering powerful tools for forecasting PV generation, particularly because of the complex, nonlinear relationship between power output and weather conditions [101].
Artificial Neural Networks
ANNs are inspired by the structure and function of the human brain. The structure of an ANN typically includes an input layer, several hidden layers, and an output layer. It also incorporates connection weights, biases, an activation function, and a summation node. A schematic representation of Shallow Neural Networks [102] and Deep Neural Networks (DNNs) is provided in Figure 3 [103].
Fundamentally, an ANN is an information processing model that is intrinsically structured to autonomously identify and monitor the relationships among various data. ANNs stand out as a suitable choice for forecasting PV generation due to their self-training capability, which involves continuously refining their predictions by comparing forecasted values to actual outcomes. Furthermore, ANNs’ inherent self-learning ability to minimize future discrepancies between forecasted values and actual outcomes proves to be a crucial factor for accurate forecasting. This adaptability makes ANNs particularly well suited for forecasting PV generation [101]. While ANNs offer remarkable capabilities, they also have drawbacks, namely inaccurate predictions, high processing time, and no proven method for optimizing network structures [104].
Other Machine Learning Methods
The building sector applies a range of forecasting techniques, with ML being the most widely adopted approach. ML algorithms are inherently data-driven, meaning they rely on large datasets to extract patterns and insights [105]. Since PV generation often exhibits nonlinear dependencies on weather conditions [106], ML algorithms can efficiently process and make sense of complex and multi-dimensional data [107] generated by buildings, such as sensor data, meter data, and weather data. This allows ML models to capture the complex relationships between different factors that affect PV generation [108].
In addition, ML models can adapt to changing patterns. If there is a change in weather patterns, ML models can learn from the new data and adjust their forecasts accordingly [109]. This is important, because PV generation is constantly changing throughout the day and over time. ML models can analyze historical time series data, which consists of data points recorded over time at regular intervals [110]. This data format is particularly valuable, because it captures the temporal dependencies (the relationships among data points at different time intervals) and trends in PV generation [111]. By examining these patterns, ML models can provide accurate forecasts of future energy generation [112]. This capability allows building owners and operators to make more informed decisions regarding energy storage and demand side management strategies.
The process of developing a data-driven model for PV generation prediction involves four essential steps: data collection, data preprocessing, model training, and model testing [103]. Data collection entails acquiring historical PV generation and relevant data, such as outdoor weather conditions, which will be used in model training. Data preprocessing involves techniques such as data cleaning and data reduction, all of which aim to enhance the data quality and usability [113]. Figure 4 presents a step-by-step process for applying ML to solve real-world problems.
Sundararajan and Sarwat [114] conducted statistical analyses to understand missing data in PV systems within a grid. They assessed the efficacy of diverse imputation methods, such as K-Nearest Neighbors (KNNs), random imputation, multiple imputation, and Random Forests (RFs). The results suggested that KNNs and RFs emerged as the most effective techniques for imputation. Model training involves feeding the model with a training dataset, which enables the model to learn from the data and identify patterns [115]. Finally, model testing aims to assess the model’s performance using standard evaluation measures, which ensures the model’s reliability and accuracy [116].
Support Vector Machine (SVM) is one of the most important types of ML algorithms that draw upon statistical learning theory and the concept of structural risk minimization. SVMs have demonstrated superior accuracy and computational time compared to ANNs in tackling nonlinear problems [104].
Another important approach in ML to forecast PV energy generation is ensemble learning. Ensemble learning methods create multiple models and then combine them to produce improved results [117]. These methods are particularly effective in reducing overfitting, improving accuracy, and enhancing the stability and robustness of the model [118]. Ensemble methods can effectively handle the inherent variability of solar irradiance affecting PV generation [119]. Ensemble methods come in various forms, such as bagging, which trains models on different subsets of data (e.g., RFs), and boosting, which leverages the weaknesses of previous models to improve subsequent ones [120]. For instance, RFs, an example of a bagging technique, can be used to predict PV energy generation by creating a multitude of Decision Trees (DTs) at training time and outputting the class that is the mode of the classes of the individual trees [121].
In summary, the integration of ML in PV generation forecasting in buildings has demonstrated its potential to address the complexities of nonlinear relationships and temporal dependencies in energy data. By leveraging large and diverse datasets, ML models enhance forecast accuracy and adapt effectively to dynamic weather patterns. Additionally, advancements in methods such as ensemble learning further improve model performance. These strengths position ML as a powerful tool for PV generation forecasting, empowering stakeholders to make informed decisions that align with economic, technical, and environmental objectives.

3.3.3. Hybrid Approach

Hybrid methods combine two or more different types of models to leverage the strengths of each. In the case of accurate forecasting of renewable energy production and demand, the complexities associated with solar radiation and weather conditions pose a significant challenge for modeling using simple methods, particularly when high precision is crucial. Consequently, relying on a solitary model can result in significant errors. In recent years, hybrid models have been introduced to address the limitations of individual models and enhance forecasting precision [122].

3.3.4. Ensemble Approach

Ensemble methods combine multiple models of the same type to create a stronger overall model [123]. Although hybrid methods and ensemble methods share similarities, they have key differences. Their concept and differences are discussed in Table 3.

3.4. Evaluation Approaches in PV Generation Forecasting for the Building Sector

This section explores evaluation approaches for PV generation forecasting, which are critical for maximizing the effectiveness and reliability of predictions.

3.4.1. Evaluation Metrics for PV Generation Forecasting in the Building Sector

Evaluation is an essential step for forecasting models, since it ensures the reliability of the forecasts and ultimately leads to more informed decision-making [125]. Some standard evaluation metrics that can be used for PV generation forecasting in buildings include:
  • Mean Absolute Error (MAE) measures the average absolute difference between the predicted and actual values. It is primarily employed for model fitting, helping to select the optimal parameters for a given model. It is also utilized for model validation, model selection, comparing different models, and evaluating forecasts [126].
  • Root Mean Squared Error (RMSE) measures the square root of the average squared difference between the predicted and actual values. RMSE is more sensitive to larger errors than MAE due to the squaring effect. When dealing with normally distributed errors, RMSE is a more suitable choice for evaluating model performance compared to MAE [126].
  • Mean Absolute Percentage Error (MAPE) measures the average absolute percentage difference between the predicted and actual values. MAPE is a good metric due to its scale independency and interpretability. However, it can be problematic with zero actual values [127].
  • Normalized Root Mean Squared Error (NRMSE) normalizes the RMSE by the range of the actual values, making it a more useful metric for comparing models with different scales of prediction [128].
  • Coefficient of Determination (R2) measures the proportion of variance in the actual values that is explained by the model. R2 is a good measure of the model’s fit to the data [129], but it has limitations with nonlinear models [130]. It is often used to evaluate the effectiveness of energy efficiency measures, model stability, and reliability [131].
  • Mean Bias Error (MBE) measures the average difference between the predicted and actual values. This metric helps to identify systematic errors in the model, indicating whether the model tends to overpredict or underpredict [132].
The choice of evaluation metrics should be based on the specific needs of the application. In many research studies, common metrics such as MAE, RMSE, and MAPE [133] are used to evaluate quality in regression tasks. R2 is also widely employed to assess energy efficiency measures in various research studies [134,135]. No single evaluation metric is perfect, and it is generally recommended to apply a combination of metrics to assess the performance of a forecasting model.
Table 4 provides a comprehensive analysis of the key metrics utilized. The diverse array of evaluation metrics equips researchers and practitioners with a robust toolkit to assess and enhance the performance of forecasting models for PV generation, ensuring a nuanced understanding of model effectiveness.

3.4.2. Post-Processing and Quantifying Uncertainty for PV Generation Forecasting in the Building Sector

Post-processing and quantifying forecast uncertainty are key players in improving the performance of forecasting models. Post-processing involves applying various techniques to the model’s forecast to improve its accuracy and interpretability [136]. Post-processing helps correct errors or biases in initial PV generation forecasts [137]. Solar PV generation is highly dependent on cloud cover and solar irradiance. Thus, these errors can be caused by inaccuracies in the forecasting model, weather data, or other factors. Post-processing can incorporate cloud cover data and advanced irradiance modeling to improve forecasting accuracy. By assimilating real-time cloud cover information, forecasts can be adjusted to reflect changing weather conditions [138]. In addition, PV generation can vary significantly based on the specific location of the building. Post-processing allows for the integration of site-specific data, such as shadowing effects, panel orientation, and local weather conditions, to enhance the accuracy of forecasts for a particular building [96].
In a practical demonstration, Böök and V. Lindfors [96] proposed a post-processing method for improving the accuracy of PV output forecasts by utilizing actual PV output data to adjust a baseline model that was trained on NWP forecasts. This approach was validated by comparing the performance of the baseline and adjusted forecasts against actual PV output, demonstrating its ability to enhance the accuracy of forecasts for specific sites. Alvarenga et al. [139] analyzed three approaches (an ANN and a linear statistical model, along with Kalman filter and Kernel Conditional Density Estimation) aimed at enhancing the resolution of solar irradiance forecasts from NWP. Applying a combination of an ANN followed by Kalman filter post-processing resulted in a notable reduction of 45% in MAE and 91% in MBE.
In the context of PV generation forecasting in buildings, quantifying forecast uncertainty involves estimating the range of possible outcomes and their likelihood. Fluctuations in weather patterns make it challenging to accurately predict the amount of electricity produced by PV systems. Cloud cover, atmospheric conditions, and seasonal changes can impact the amount of sunlight reaching solar panels. Quantifying uncertainty involves considering variations in weather forecasts and their potential effects on PV generation [30]. In addition, inherent limitations of the models applied to forecast PV generation [140] and variability in sensor readings and calibration errors contribute to the overall uncertainty in solar energy output predictions [141]. The Monte Carlo method has become the favored approach for dealing with forecast uncertainty in PV generation and building energy analysis, outpacing other methods in terms of both popularity and effectiveness [142].
Post-processing and quantifying forecast uncertainty play crucial roles in data-driven forecasting models. These methods address prediction uncertainties in PV generation, enhancing accuracy, reliability, and strategic insights, ultimately supporting better decision-making in the building sector.

3.5. An Analysis of Building Attached and Building Integrated Photovoltaic

In this section, the distinct approaches of BAPV and BIPV methods for seamlessly incorporating solar arrays into building structures are explored. The discussion examines the impacts of the installation and integration nuances of BIPV and BAPV. In the following, the simulation challenges in the mentioned buildings are briefly explained. Finally, a review of relevant articles published in recent years is subsequently completed.

3.5.1. Introduction to BAPV and BIPV

Two methods for incorporating PV arrays into buildings are BAPV and BIPV. The primary distinction between BAPV and BIPV lies in their integration into respective buildings. In the BAPV approach, modules are installed onto existing surfaces by stacking them after the completion of the building. The integration of BAPV is less seamless with the architecture, as it often involves adding solar panels as an additional layer to the building. On the other hand, it is often more straightforward to implement in existing buildings. The BIPV method involves replacing conventional building components, such as roofing, windows, or façades, with materials containing solar modules. The main advantage of BIPV is its seamless integration into the building’s design, contributing to both energy generation and architectural aesthetics.

3.5.2. Challenges in Simulating BIPV and BAPV Systems

Simulating building-attached and specially integrated PV systems poses some challenges, such as heterogeneous irradiance patterns and uneven temperature distribution across the PV modules. Higher temperatures can negatively impact solar cell efficiency, leading to reduced power output [143]. BIPV and BAPV systems’ thermal boundary conditions diverge from those of conventional PV power plants. Conventional PV power plants are typically ground-mounted in open fields with good airflow, allowing for natural heat dissipation. In addition, conventional PV plants often have space between panels for ventilation. On the other hand, building materials like roofs can absorb and retain heat, further increasing the temperature of PV panels in buildings. BIPV and BAPV systems may have limited ventilation due to their close integration with the building, potentially compromising power prediction accuracy. BIPV and BAPV modules frequently encounter partial shading and masking conditions (dirt, dust, or other debris covering the solar panels), which affects the generated energy [144].

3.6. PV Generation Forecasting in the Building Sector

This subsection reviews the studies conducted in the fields of BIPV and BAPV.

3.6.1. Review of BIPV Studies and Modeling Approaches

This section reviews various methodologies for BIPV generation forecasting, including physical approaches, statistical methods, hybrid techniques, and ensemble models. The studies discussed explore different modeling strategies, data sources, and evaluation metrics to assess prediction accuracy under diverse conditions.
Li et al. [145] developed a multi-physics model predicting electrical, thermal, and structural performance in various conditions, considering shading and masking. The model included a simplified electrical representation for simulating PV system power output under the mentioned conditions. Based on another physical approach called the Sandia Array Performance Model (SAPM), Polo et al. [146] utilized data from satellite-derived solar irradiance and a digital model derived from light detection and ranging information to estimate PV generation. An observed RMSE of 8% was obtained for the monthly predicted power for the west-facing façades.
Hao et al. [147] investigated the practical use of ANNs to assess the energy production of façades incorporating these technologies. Under realistic conditions, Nguyen and Ishikawa [148] employed an ANN to forecast the annual output energy of BIPV, considering the two-terminal perovskite/silicon tandem cells. The outcomes indicated that the suggested model achieved a MAPE of 0.01884 for the training set. Paiva et al. [149] presented a methodology comprising a data processing strategy and ANN modeling specifically designed for intra-day PV generation prediction. ANN exhibited superior performance compared to the persistence model used as the benchmark. By applying ANN, Polo et al. [150] formulated a model for PV generation forecasting. The methodology introduced involved integrating shading computation with ML algorithms tailored for PV cells. The MRE varied within the range of 6–15%.
In the domain of forecasting PV generation in buildings, estimating the Maximum Power Point (MPP) plays a crucial role in fault detection and diagnosis. Utilizing an ANN, Lee et al. [151] developed a model aimed at estimating the MPP for BAPVs. The validation results of the model suggested satisfactory performance, as evidenced by a CvRMSE measuring less than 30%.
Dimd et al. [152] employed Long Short-Term Memory (LSTM) to assess the influence of diverse orientations, including different tilts and azimuth angles, on the precision of a PV prediction model. In this scenario, the incorporation of mixed orientations led to a 51% increase in RMSE. With a similar goal, Dimd et al. [153] achieved a 34% reduction in RMSE by developing a forecasting model that considers all mixed orientations, i.e., different azimuth angles and tilt, since the factors substantially impact the prediction accuracy.
Jeong et al. [154] employed the Nonlinear Autoregressive with exogenous input (NARX) network as the forecasting model and modified Sky Condition (SC*). The model exhibited good performance, especially on overcast days, when utilizing the SC* and the solar altitude.
When colored PV modules are in use, predicting PV generation becomes more challenging due to the variability in shading loss, which is influenced by the entrance altitude of the irradiance. Shin et al. [155] developed a Regression Neural Network model to tackle the aforementioned challenge. According to the outcomes, the suggested model exhibited superior performance compared to the Linear Regression (LR) model. Utilizing organic solar technology, Corrêa et al. [156] projected the output of PV systems through several LR models. The most successful models attained R2 values of 0.76 and 0.81, corresponding to simple and multiple regressions, respectively.
Multiple algorithms were utilized in various papers to generate predictions, and the outcomes of their simulations, along with other relevant metrics, were compared. Utilizing statistical methods and ML techniques, Fara et al. [157] created both an Autoregressive Integrated Moving Average (ARIMA) model and an ANN for PV generation forecasting. A comparison of the mentioned model’s outcomes revealed that the ARIMA model exhibited greater efficiency compared to the ANN model. Luo et al. [158] introduced three multi-objective prediction frameworks utilizing an ANN, SVM, and LSTM. These frameworks aim to simultaneously predict the electrical power output from BIPV, cooling, and lighting loads. The predictive model utilizing SVM required the shortest computation time, while the ANN-based model demonstrated the most favorable outcome with the smallest MAPE.
Kabilan et al. [159] estimated PV power generation through the application of ANN, Quadratic Support Vector Machine (QSVM), and DTs. Utilizing the LR coefficients enhanced the accuracy of the forecast. According to the results, the NN achieved the highest accuracy, with a RMSE of 4.42%. Dourhmi et al. [160] evaluated five methods to identify the most effective prediction approach: ANN, RF, Decision Tree Regression (DTR), Support Vector Regression (SVR), and LR. Evaluation metrics such as MAE, MSE, RMSE, and R2 were employed to assess the accuracy of these methods. The results indicated that the proposed ANN and RF models yielded the most favorable outcomes.
Abouelaziz et al. [161] introduced a methodology that integrates photogrammetry with Deep Learning (DL) techniques to generate data-driven PV output forecasts. Their findings revealed that the Convolutional Neural Network (CNN) model achieved superior performance in prediction accuracy compared to other ML models. Lee et al. [162] applied ML methods for predicting PV generation through the incorporation of feature engineering. Among Recurrent Neural Networks (RNNs), ANNs, SVM, RF, Classification And Regression Trees (CARTs), and CHi-square Automatic Interaction Detection (CHAID), RNNs exhibited superior performance in predicting power at both hourly and daily resolutions. Feature selection, utilizing variable importance determined with SVM, was implemented by applying weather forecast data.
Jouane et al. [163,164] formulated three models employing LSTM, CNN, and a hybrid model (CNN-LSTM) within a Positive Energy Winter House context (houses generating more energy than their consumption). The evaluation of these models was conducted using MAE, RMSE, and MSE metrics. The distinction between the two articles is that [163] included side parameters such as temperature, while [164] relied solely on historical data on PV generation. While the CNN-LSTM model demonstrated better accuracy, it showed a slower performance compared to the others. Sarkar et al. [165] created a hybrid model for forecasting PV generation by combining RF, LSTM, and Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN). The findings demonstrated that the hybrid strategy demonstrated superior performance compared to the reference methods.
Polo et al. [166] investigated two DT-based ML algorithms, namely RF and Extreme Gradient Boost (XGBoost), in their study. They observed that, in deterministic forecasting, RF generally exhibited a slightly superior RMSE compared to XGBoost across various cases.
As a summary of the section, Table 5 presents the methodologies employed in the reviewed articles on BIPV.
An analysis of Section 3.5.1 shows that only two studies [164,165] on BIPV utilized only historical PV data as input variables. In contrast, the majority of the other studies relied primarily on weather data.
From Table 5, it can be concluded that statistical methodologies dominate because they are highly adaptable and effective at capturing the nonlinear relationships inherent in PV generation data. Techniques like Neural Networks and traditional methods of ML are frequently used, because they can handle time series data and adjust to dynamic weather conditions. In contrast, physical models, while based on system physics, require detailed input parameters that may be difficult to obtain in real-world scenarios. Hybrid and ensemble models, although powerful, are more complex and computationally demanding, which may limit their widespread application. Overall, the large presence of statistical models may reflect their historical utility, simplicity, and ease of implementation, but the increasing inclusion of Machine Learning, hybrid, and ensemble methods points to an evolution in the field toward more accurate, albeit complex, forecasting methodologies.

3.6.2. Review of BAPV Studies and Modeling Approaches

This section reviews several forecasting methodologies for BAPV power generation, including physical models, statistical approaches, hybrid and ensemble models, and emerging decentralized methods such as federated learning. The reviewed studies examine various modeling approaches and evaluation metrics to evaluate forecasting accuracy across different conditions.
Zhi et al. [167] introduced a physical model incorporating an enhanced Maximum Power Point Tracking algorithm, achieving a MAE of 15.9% under diverse weather conditions. Massucco et al. [168] devised a hybrid model, utilizing either clear sky models or an ensemble of ANNs, selecting the optimal approach via DTs.
Reynolds et al. [169] utilized a back-propagation ANN to forecast the next 24 h, achieving R2 values of 0.9489 for training and 0.9412 for testing. Gheouany et al. [170] utilized a stacking-based ANN to forecast PV power output. They predicted weather variables, including global horizontal irradiance, temperature, and wind speed, as inputs for both long-term (30-day) and short-term (24-h) forecasting, significantly reducing the errors. Sabzehgar et al. [171] created a NN-based NWP model for a residential microgrid, which outperformed SVM and Multi-Variable Regression (MVR) in terms of MSE and MAPE accuracy.
Kaffash et al. [172] employed a Feed-Forward Neural Network (FFNN) for day-ahead PV generation forecasting, achieving a NRMSE of 11.89 without weather data. The same authors [173] introduced an ensemble method combining FFNN and SVM, using RF for feature selection. Despite the higher computational cost associated with SVR or ANN, the ensemble approach improved the forecasting accuracy.
Chung [174] introduced a multilayer FFNN to forecast next-day PV generation based on current weather conditions, showing that more input variables did not always improve accuracy. Costas et al. [175] utilized a Standard Neural Network (SNN) for PV generation and electricity demand forecasting, achieving a NRMSE of 7.15% (PV generation). Ghenai et al. [176] applied NARX to predict bifacial PV output, finding that increasing the surface albedo from 0.2 to 0.5 and 0.8 boosted the annual bifacial power generation by 7.75% and 14.96%, respectively.
Parvez et al. [177] developed a Multi-Layer Perceptron (MLP) model for rooftop PV forecasting, achieving MAE and NRMSE of 0.0809 and 0.0054, respectively. Sarmas et al. [178] suggested an online MLP model for PV output, improving MAE by 11.9% over offline models. Meteier et al. [179] utilized Extra-Trees (ETs) and MLP for energy production forecasting by incorporating weather- and time-based features. The MLP yielded the most precise prediction, achieving an R2 score of 84.81%. Piotrowski et al. [180] examined ten forecasting methods (both hybrid and team approaches) and introduced a hybrid approach using three independent MLP-type neural networks, minimizing RMSE and nMAPE.
Rooftop PV systems are commonly located behind the meter and unmonitored. In addition, traditional centralized prediction methods raise privacy concerns (due to the need to share data with a centralized entity), prompting the need for decentralized methods like federated learning (FL). Hosseini et al. [181] developed a MLP-based FL model for behind-the-meter PV forecasting, achieving an 18.7% reduction in RMSE. Kim et al. [182] forecasted behind-the-meter production by considering the unauthorized PV capacity, employing the SVR and upscaling method. In comparison to estimates that do not incorporate the unauthorized PV capacity, the methodology results in a reduction of 2.95% and 5.41% in NMAE and NRMSE, respectively. González et al. [183] compared ANN and SVM using sky camera images and found similar error rates. VanDeventer et al. [184] introduced a stochastic Genetic Algorithm-based Support Vector Machine model for short-term prediction, boasting accuracy, speed, and memory efficiency over standard SVM.
Using historical PV generation and weather data, Bird et al. [185] employed SVM, RF, and ANN for prediction objectives, with RF achieving the best performance (MRE of 2.7%). Scott et al. [186] compared ML algorithms for university campuses, finding that RF had the lowest RMSE compared to SVM, LR, and NN. Teixeira et al. [187] estimated PV generation by applying ANN, SVM, RF, and ARIMA, with the appropriate hyperparameters tailored for each method. The best outcomes were achieved using ANN and SVM, respectively.
Tavares et al. [188] compared two Neural Network architectures, an ANN and a DNN, for PV generation in buildings, with the ANN model exhibiting superior forecasting accuracy. Duhirwe et al. [189] developed a four-layer DNN, employing the PV generation dataset. The R2 values of the developed DL-based forecasting models exceeded 0.95. Shivam et al. [190] developed a dual forecasting model using the Residual Dilated Causal Convolutional Network to forecast both energy generation and electric load, outperforming both SVM and ANN models in terms of accuracy. Kazem et al. [191] introduced four predictive models employing DL techniques, specifically utilizing Time Lagged Recurrent Networks and RNNs by considering input data such as solar irradiance and temperature.
Park et al. [192] employed single-LSTM and multi-LSTM models for their forecasting purposes, noting no significant difference, though the multi-layer model exhibited a slightly lower error rate. Son and Jung [193] formulated 22 multivariate models by integrating six meteorological factors. To forecast power, the LSTM method was employed, and the results showed that the PV power forecast was influenced by six meteorological factors, ranked from most to least important, including solar radiation, sunlight period, wind speed, temperature, cloud cover, and humidity. Xu et al. [194] introduced a LSTM-based model for rooftop PV output forecasting, incorporating the predictions into a Q-learning framework to optimize pricing for PV and ESS utilization.
Alden et al. [195] utilized an Encoder–Decoder LSTM for day-ahead PV output prediction, achieving goodness of fit of 0.975 on an hourly basis and a daily CVRMSE of 11.1%. To enhance the balance between supply and demand, Wen et al. [196] devised a Deep Recurrent Neural Network with LSTM to forecast aggregated energy demand and PV power output within a building community. Using real-world data, the model outperformed both MLP and SVM. Lateko et al. [197] introduced a stacking ensemble model, incorporating DNN, ANN, LSTM, SVR, and CNN, with an RNN as the meta-learner. The MRE demonstrated improvements in the proposed ensemble learner model over individual models.
Iao et al. [198] applied CNN-LSTM, BiLSTM, and Gated Recurrent Unit (GRU) for PV generation and load forecasting, employing KNN interpolation to fill missing values in historical PV power data. Among the models, BiLSTM with KNN interpolation yielded the lowest RMSE but required the longest training time. Expanding on this work, the same authors [199] integrated KNN and Generative Adversarial Networks for data interpolation and augmentation, formulating LSTM and Transformer-based forecasting models. The model achieved an RMSE of 4.603 kW for PV power forecasting.
Hu et al. [200] proposed a LSTM model enhanced with self-attention mechanisms, achieving high accuracy and adaptability for both short-term and long-term forecasts. Hernandez-Robles et al. [201] investigated five Neural Network architectures, including the MLP, single-layer, dual-layer, and bidirectional LSTM models, as well as GRU. While the MLP model had the fastest computation times, the dual-layer LSTM model demonstrated the highest reliability and accuracy. Cordeiro-Costas et al. [202] illustrated the effectiveness of a hybrid LSTM-MLP model, emphasizing the advantages of NSGA-II for hyperparameter optimization and energy management in sustainable building practices, with a NRMSE of approximately 5.5%.
Li et al. [203] addressed the issue of incomplete historical and real-time measurement data by proposing a Recursive LSTM model for PV generation forecasting in buildings. The method outperformed the Gaussian Processes and ANN methods, indicating its capability to learn from time series PV data. Costa [204] evaluated LSTM, CNN, and Convolutional-LSTM (ConvLSTM) networks at half-hourly, daily, and monthly intervals, finding that CNN and ConvLSTM consistently achieved better performance than the Prophet algorithm and LSTM based on MAE, RMSE, and NRMSE metrics.
Sarmas et al. [205] applied meta-learning to four LSTM variants (Stacked LSTM, Bidirectional LSTM (BiLSTM), CNN-LSTM, and ConvLSTM), enhancing performance by up to 5% over the best base model. Guo et al. [206] introduced a hybrid model integrating Singular Spectrum Analysis and BiLSTM, optimized via the Bayesian Optimization algorithm. The 7.5 and 15 min-ahead predictions by the model reduced errors by up to 380.51% and 296.01%, respectively.
Pan et al. [207] introduced a PV generation prediction approach and decoupling PV load in a residential power distribution system with behind-the-meter PV utilizing the KNN algorithm. The RAE values exhibited reductions of 31.60%, 44.08%, 42.40%, and 60.51% during spring, summer, autumn, and winter. Sangrody et al. [208] developed three Similarity-Based Forecasting Models (SBFMs)—basic SBFM, categorical SBFM, and hierarchical SBFM (hSBFM)—leveraging the KNN approach for PV generation forecasting at a 5-min resolution. Evaluated against ANN and persistence models, the results demonstrated that hSBFM exhibited the best overall performance.
Tripathy et al. [209] developed a Multi-Time Instant Probabilistic PV Generation forecasting model for rooftop PV installations, utilizing Quantile Regression Forests (QRFs), which outperformed basic Quantile Regression (QR). They later introduced a Quantile K-Nearest Neighbors Regression Averaging model [210] that achieved lower forecast errors compared to methods such as regression bootstrapping, traditional QR averaging, QRF, Quantile KNN, and basic QR.
To estimate behind-the-meter PV generation, Kabir et al. [211] combined a statistical demand prediction model with a physical PV output forecasting model using Hidden Markov model regression, reducing MSE by 44% compared to the consumer mixture model. They further utilized a Mixed Hidden Markov Model [212] to estimate the technical parameters of solar PV systems.
Allouhi [213] proposed a multi-objective optimization approach using a Genetic Algorithm (GA) to balance economic and environmental factors, incorporating Multiple Linear Regression (MLR) to forecast PV generation, achieving a NRMSE of 0.943. Kallio and Siroux [214] employed MLR and ANN methods, improving the accuracy of predictions by incorporating micro-inverter technology into their modeling approach. Capotosto et al. [215] compared persistence, MLR, and ARIMA for day-ahead PV generation forecasting in residential and public buildings, finding that ARIMA and MLR performed similarly for residential users, while MLR outperformed ARIMA in public buildings, highlighting that those sophisticated methods may not always yield the optimal solution.
Jiranantacharoen and Benjapolakul [216] employed ARIMA with the Kalman filter algorithm for five-minute intervals and day-ahead forecasting. Gellert et al. [217] assessed the ARIMA and Trigonometric Seasonal, Box-Cox Transformation, ARMA residuals, Trend, and Seasonality (TBATS) methods to forecast PV energy output in smart homes, with TBATS significantly outperforming the Neural Network-based methods.
Lee et al. [218] introduced Automatic Machine Learning (AML) to generate multiple forecasting models using meteorological data and solar power generation, employing 24 different algorithms. Furusawa et al. [219] utilized PyCaret, an AML tool, incorporating diverse ML methods like ET regressor, Ridge regression, Bayesian Ridge, and RF Regressor. The ET regressor yielded the best performance among the methods.
Lee et al. [220] devised a PV generation forecasting framework based on the autoregressive model, incorporating temperature, relative humidity, cloud cover, and solar radiation obtained from the ASHRAE Clear-Sky model, achieving a MAPE of 0.227 for sunny days and 0.386 on cloudy days. Bottieau et al. [221] developed cross-learning forecasting for household PVs, combining Gradient-Boosted Regression Trees, several NNs, and the KNN algorithm. Results demonstrated the superior performance of cross-learning forecasting over a clear sky-based physical approach.
Zhang et al. [222] developed a data-driven PV generation forecasting model using Spatial-Temporal Correlation Analysis within a Bayesian Network framework. The proposed method outperformed various models, including ARIMA, KNN, LSTM, Persistent Model, the deterministic Autoregressive-based spatiotemporal forecasting algorithm, and an Autoregressive with exogenous input-based spatiotemporal solar forecast model. Bot et al. [223] implemented an ensemble of Radial Basis Function ANNs (RBF-ANNs) to predict variables relevant to HEMSs, particularly for PV power output forecasting. This study utilized a Multi-Objective Genetic Algorithm for both feature and topology selection, along with a stochastic convex-hull algorithm known as ApproxHull for data selection. Yang et al. [224] introduced a Competitive Swarm-Optimized (CSO) RBF-ANN, demonstrating superior accuracy in comparison to other benchmarks.
Sun et al. [225] introduced a WTEEMD-FCM-IGWO-LSTM approach for PV generation forecasting in buildings. The method integrated Wavelet Threshold Improvement Ensemble Empirical Mode Decomposition (WTEEMD) to reduce noise in weather data, followed by an adaptive weight-improved Fuzzy C-Means (FCM) clustering algorithm, was employed to categorize varying weather conditions. The Improved Gray Wolf Optimization (IGWO) algorithm was then employed for hyperparameter tuning, addressing suboptimal issues. Comparative analysis confirmed the approach’s superior performance in PV power forecasting. Peng et al. [226] introduced an approach that utilizes Variational Mode Decomposition (VMD), an improved version of Informer, and an Enhanced Chaos Game Optimization algorithm. Significantly, the Informer model has been enhanced through the incorporation of Locality-Sensitive Hashing attention. Results indicated that VMD effectively reduced noise in PV power data, improving model accuracy. Khan et al. [227] introduced an improved generally applicable stacked ensemble algorithm (DSE-XGB) that incorporates ANN and LSTM for predicting PV generation. Performance comparisons against individual bagging, ANN, and LSTM demonstrated that DSE-XGB improved the R2 value by 10–12% over competing approaches.
Etxegarai et al. [228] compared four distinct models. The first model was analytical, while the other three were ML-based, specifically SVR, NARX, and FFNN. SVR emerged as the model with the most precise forecasts. Kaffash et al. [229] investigated various forecasting models for scheduling a multi-energy system, including Persistence, Simple Moving Average, Least Absolute Shrinkage and Selection Operator (LASSO), FFNN, and SVR, to identify the most accurate PV generation forecasting methodology. LASSO exhibited the best performance across all metrics, whereas SVR performed the least satisfactorily. The study highlighted that, despite the complexity of models like FFNN and SVR, they were unable to surpass LASSO, primarily due to the limited dataset size, which constrained the training of intricate methods like FFNN. Hatamian et al. [230] explored PV power output prediction by using weather data and various ML models, including RF, XGBoost, KNN, MLP, and SVR. The analysis revealed similar forecasting patterns among MLP, KNN, and SVR, while RF and XGBoost exhibited comparable performance. Giovanni et al. [231] evaluated four ML algorithms—KNN, DTs, Support Vector Regression, and RFs—to predict daily power production. Among these, RFs emerged as the most effective model.
Costas et al. [232] evaluated ML (XGBoost, RF, and SVR) and DL (RNN, SNN, and CNN), identifying SVR, SNN, and CNN as the most fitting models for forecasting PV generation. Mohana et al. [233] employed several ML methods (RF, LASSO, SVM, LR, XGBoost, Polynomial Regression, and DL) and employed a backward feature elimination technique to identify the most relevant set of features. The mentioned method aided in revealing that similar outcomes can be achieved with a reduced set of features. Their results showed that the DL technique yielded the lowest error using the minimal set of selected features, while LR had the highest errors.
As a summary of the section, Table 6 presents the methodologies employed in the reviewed articles on BAPV.
By examining input variables for prediction models, it was observed that only 12 studies on BAPV incorporated historical PV data as input variables. By contrast, most other studies predominantly relied on weather data.
Statistical methods, especially Neural Networks such as ANN, MLP, and LSTM, are the dominant approaches in forecasting BAPV power generation. The literature favors these models for their strong capability to effectively capture and model complex, nonlinear patterns in the data. LSTM emerges as the most frequently DL-applied technique, attributed to its robustness in handling time-dependent data and generalization capability. Additionally, widely used ML methods like SVR and SVM have been employed as benchmark models. In contrast, physical models, while reliable in deterministic settings, are less prominent.
Hybrid models highlight the integration of strengths across methodologies, combining features like optimization (e.g., GA-SVM) and multi-model architectures (e.g., CNN-LSTM). These approaches address specific forecasting challenges, such as capturing long-term dependencies and improving prediction accuracy under variable conditions. Similarly, ensemble models have gained traction by leveraging diverse algorithms to reduce bias and variance, with RFs and XGBoost being especially notable.
Table 7 presents the metrics employed in the reviewed articles on BAPV, BIPV, and Total. Only metrics used at least five times are included in the table.
Table 7 highlights the prevalence of performance metrics in the forecasting of PV generation in buildings, with RMSE being the most commonly used metric. RMSE’s popularity stems from its ability to heavily penalize larger errors, which is crucial in PV forecasting, where significant deviations can impact energy management. It is easy to interpret, as it is expressed in the same units as the predicted values, making it practical for comparing forecast accuracy. Other metrics, such as MAE, R2, and MAPE, are also frequently used but serve slightly different purposes. MAE, for example, treats all errors equally and is often used to measure overall accuracy without prioritizing larger errors. R2 helps understand the model fit, while MAPE normalizes errors by expressing them as percentages. However, their usage is less widespread compared to RMSE, likely because they do not penalize large errors as strongly.
This trend may stem from the fact that weather data can significantly enhance model accuracy by capturing external factors. However, the challenge lies in obtaining reliable real-time weather data, which are not always easily accessible or consistent across regions. Therefore, some studies may have chosen to rely solely on historical PV data. Among predicted climate variables, the Normative Sky Index shows the lowest correlation with PV power output under most sky conditions, making it redundant for forecasting [162]. Solar power forecasting is influenced by six key weather factors, ranked by importance: solar radiation, sunlight, wind speed, temperature, cloud cover, and humidity [186,193].
Table 8 presents an overview of the reviewed studies on PV forecasting in the building sector. The table summarizes forecasting methodologies; evaluation metrics; deployment types (e.g., residential, office, and campus); and geographic distribution of the datasets. Additionally, the Tips column offers insights into partial shading, dust accumulation, building-specific PV patterns, or any other relevant information related to the article. While multiple prediction methods were used in some studies, the one with the lowest error is listed as a reference in the Method column.
A column-by-column analysis and discussion of the above table is presented below.
Although RMSE and MAE (Table 7 and Table 8) are widely used in the PV forecasting literature, these absolute error metrics are scale-dependent and therefore not directly comparable when studies involve systems with different installed capacities or generation patterns. The distribution of the data and the presence of outliers disproportionately affect RMSE (because errors are squared), and when the data range is large (e.g., from zero to very high irradiance values), RMSE can appear large even when relative errors are small. For these reasons, reporting RMSE/MAE alone can be misleading for cross-study or cross-system comparisons. Therefore, it is recommended to report capacity-normalized metrics such as NRMSE, NMAE, as well as CVRMSE, since these indicate the error per unit installed capacity or relative to mean production and substantially improve comparability across system sizes, climates, and aggregation levels. Finally, the chosen normalization method should be reported explicitly (for example, specifying that Prated is given in kWh).
Figure 5 displays the distribution of the reported NRMSE (%) values extracted from the surveyed studies (N = 15).
Individual markers represent the single NRMSE value reported in each study. The median of the collected NRMSE values is 6.5%, and the interquartile range is 6.4%, with reported values spanning from 1.9% to 16.9%. This illustrates both the variability across forecasting methods and the importance of explicitly stating the chosen normalization approach in each study. Since the reported values come from different systems, datasets, and, in some cases, different normalization conventions, the figure is not a direct head-to-head benchmark. Instead, it highlights the central tendency and spread of published normalized errors and helps identify studies that report comparatively low or high NRMSEs.
The primary purpose of the “Tips” column is to highlight shading issues reported in the articles. As observed, BIPV buildings with PV modules on their façades experience a higher degree of shading, which adversely affects energy production (the majority of shading-related articles concern BIPV systems). Moreover, varying shading conditions across different façades lead to differences in their respective outputs.
The other column of the table presents case studies focused on deployment. From the extracted set, residential deployments represent the largest share (23 out of 86 studies; 26.7%), followed by research/laboratory testbeds (17 out of 86; 19.8%) and university campuses (15 out of 86; 17.4%). Industrial and mixed-community deployments are scarce (four studies each; 4.7%). This distribution indicates a dominance of small-scale and academic testbeds in the literature; large commercial and industrial deployment-focused evaluations, which are critical for grid-integration, remain underrepresented.
The final column presents the geographical focus of the studies, which was carried out in countries. Their distribution is shown in Figure 6.
In order to conduct a comparative analysis considering forecasting methods, building types, and geographical contexts, it is essential that studies employ a consistent evaluation metric. Since previous research has used a variety of metrics (in some cases, even when metrics were normalized, the units or rates were not clearly reported), R2 emerges as the most comparable scale, ranging from 0 to 1, where 1 indicates perfect performance and 0 indicates the worst. Accordingly, studies reporting R2 values were re-extracted and summarized in Table 9, providing a clear basis for comparison.
Traditional statistical models, such as the SAPM and LASSO, demonstrate a variable level of performance. For instance, a study using SAPM on a university campus in Madrid reported R2 values ranging from 0.73 to 0.90 across different building façades. However, the LASSO model in an industrial setting in Germany yielded a significantly lower R2 of 0.57, suggesting a strong dependency on the operational environment and data characteristics.
ML models, particularly ANN, are widely used and show a large spectrum of reported accuracy. One study on residential buildings in Singapore reported R2 values from 0.63 to 0.88, while another ANN application in Korea achieved a much higher R2 of 0.9768. This wide performance range underscores a critical observation: the model type itself is only one of several determinants of accuracy. The performance of a model is not solely a function of its inherent sophistication but is profoundly influenced by the geographical condition, quality, and nature of the data on which it is trained and tested. Other ML methods like SVR and RF also demonstrated high performance, with RF achieving an exceptional R2 of 0.987 in Estonia and 0.99 in the USA.
The most compelling trend from the data is the superior performance of complex, DL, and hybrid ensemble models. This category includes architectures like CNN, LSTM, and advanced hybrid models. For example, a CNN-based model on a campus building in France reported an R2 of 0.96, while a LSTM model with self-attention in Japan achieved an R2 of 0.934. The highest reported R2 values are consistently associated with these more complex architectures. An AML model achieved a near-perfect R2 of 0.997 in an energy institute setting in South Korea, and a hybrid model combining WTEEMD, FCM, IGWO, and LSTM in a ZEB in Australia reached an R2 of 0.9915. The ability of these models to capture highly nonlinear, temporal, and chaotic patterns inherent in PV data is likely the reason for their superior performance.
The numerical superiority of DL and ensemble models, however, comes with a significant trade-off. While their performance can be exceptional, these models often function as “black boxes”, making it difficult for researchers and engineers to understand the causal relationships and underlying reasons for their predictions. This lack of interpretability is a key consideration for practical implementation, where understanding model failures or the influence of specific input variables can be as important as the accuracy itself.
The findings confirm that, while complex DL and hybrid models often report the highest R2 values, their success is not universal but is highly dependent on the operational environment. Controlled settings like laboratories and ZEBs inherently present less complex forecasting problems, allowing models to achieve near-perfect performance. In contrast, real-world scenarios such as industrial sites (owing to greater operational variability and pollution) or university campuses introduce significant variability that can limit the achievable accuracy, regardless of the model’s sophistication.
Furthermore, the analysis demonstrates that geographical location serves as a critical proxy for underlying climatic conditions, which introduce a unique set of challenges to forecasting. The unique case of BIPV systems highlights the importance of considering micro-environmental conditions, even across different façades of the same building.
Finally, it should be noted that, in addition to the forecasting method, building type, and geographical context, various other factors (such as the dataset characteristics, data intervals, forecasting horizon, season, PV panel cleanliness, temperature, ventilation, etc.) significantly affect forecasting accuracy.

3.7. Real-World Use Cases of PV Forecasting in Buildings

PV generation forecasting is not limited to labs or academic research, since it also has real-world applications. An empirical study [234] evaluated PV generation forecasting on PV rooftop systems at Hong Kong University of Science and Technology using a high-resolution inverter and meteorological dataset. Three models (SVM, Random Forest, and MLP) were trained to predict normalized PV output from selected weather features. Random Forest performed best (R2 0.96, MAE 3.6 W, RMSE 4.8 W, and MAPE 5.2%), followed by MLP and SVM. The measured annual degradation averaged 1.0%, primarily due to humidity-related potential-induced degradation, with the authors estimating that forecast-informed predictive maintenance could reduce maintenance costs by around 10–15%.
Kut et al. [235] proposed a short-term forecasting of energy production in residential PV systems, leveraging meteorological data collected in a real-world environment and applying ML techniques. Their approach could be integrated into EMS without relying on storage solutions, thereby promoting a more conscious and efficient use of local energy resources.
An important practical consequence for PV forecasting in buildings is that both system configuration (fixed vs. tracking) and the energy-balance timescale (yearly vs. monthly) materially affect forecasting needs and the types of forecast errors that are operationally relevant. D’Agostino et al. [236] showed that systems sized to achieve net zero on an annual basis can still exhibit large monthly deficits (fixed: 808.1 kWh; biaxial tracking system: 727.4 kWh when sized yearly) and that sizing to a monthly balance required substantial oversizing in their case study (fixed: 2.4 → 7.2 kW; tracking: 1.5 → 3.6 kW), with consequent increases in capital cost. Tracking PV systems can meet monthly electricity demands with fewer panels and typically outperform fixed PV systems regarding CO2 emissions. However, solar trackers increase capital and maintenance costs and require larger spacing to avoid self-shading.
Beyond algorithmic advances, PV forecasting in the building sector must be evaluated for its role in real decision-making. D’Agostino et al. [237] applied EnergyPlus coupled with a PV model to compare multiple deployment options (roof-mounted fixed, large parking-area array, and roof-mounted tracking) alongside envelope and HVAC measures. The alternatives were evaluated using technical and economic Key Performance Indicators (KPIs), including annual electric energy savings, Net Present Value (NPV), and discounted payback period, and were ranked through a robust multiple-criteria decision-making framework that explicitly models policymaker vs. tenant preferences. The study shows how PV sizing and cost assumptions lead to divergent stakeholder rankings (a large PV maximizes energy and CO2 savings, while a smaller tracking rooftop system yields the best NPV for tenants), illustrating the necessity of stakeholder-aware economic metrics and robustness/sensitivity analyses when assessing PV in retrofit decisions.
Therefore, forecasting frameworks should consider forecast horizon/accuracy and match them to the intended decision (e.g., seasonal/monthly forecasts for planning and sizing; day-ahead forecasts for storage scheduling; intra-day/hour-ahead forecasts for curtailment and load-following) and be evaluated with decision-relevant KPIs and robustness/sensitivity analyses.
These studies highlight how PV generation forecasting can move beyond theoretical research to deliver tangible benefits for real-world energy management.

3.8. Emerging Time Series Forecasting Technologies

The rapid development of AI-based technologies has led to the emergence of new methods aimed at enhancing forecasting accuracy. This section reviews these advances and underscores their potential application to PV generation forecasting in the building sector.
Given widespread concerns over data protection and privacy, Google introduced FL to permit model development on large, distributed datasets in a privacy-preserving manner, avoiding the transfer of raw user data to a central server [238]. There are only a small number of published applications of FL for PV generation forecasting. Although Ref. [181] (Section 3.5.2) highlighted FL’s potential to lower prediction RMSE, the limited research suggests a clear need for further evaluation, particularly for building-level deployments.
A Transformer [239] is a neural sequence transduction model that dispenses with recurrence and convolutions and instead computes representations using self-attention together with positional encodings. Attention-based Transformer models have rapidly become prominent in PV forecasting; a recent review [240] reported that Transformers often achieve the highest accuracy in PV power prediction. Refs. [199,200] (Section 3.5.2) highlighted the potential of transformer architectures and attention mechanisms to improve forecasting performance. Reflecting their growing popularity, the number of studies applying these methods to PV generation forecasting has increased in recent years.
Quantum Machine Learning (QML) is an emerging area that integrates principles of quantum computation (such as superposition, entanglement, and interference) into Machine Learning models and algorithms [241]. To the best of our knowledge, there are currently no published studies that explicitly apply QML techniques to PV generation forecasting in buildings. Nevertheless, recent works have explored QML or hybrid quantum–classical methods for closely related problems, including solar irradiance forecasting [241], building energy consumption forecasting [242], and solar power forecasting in solar farms [243]. These works provide preliminary evidence of QML’s potential for time series problems.
Spiking Neural Networks, regarded as third-generation neural models, emulate biological neurons by communicating with discrete spikes [244]. Two applications include wind-speed forecasting, where Thangaraj et al. [245] reported that a Spiking Neural Network-based predictor attains a lower MSE than earlier Neural Network approaches. For short-term PV power forecasting, Chen et al. [246] combined K-Means clustering with Spiking Neural Networks and demonstrated superior accuracy relative to CNNs and SVMs.
Graph Neural Networks (GNNs) employ a message-passing scheme in which each node’s representation is formed from its own attributes together with aggregated information received from adjacent nodes [247]. The graph is defined by an adjacency matrix (most studies use a fixed matrix based on statistical similarity/geographic location) [248]. Hu et al. [247] developed a data-driven model for urban building energy that integrates solar-based interdependencies into a spatiotemporal graph convolutional network. Their experiments showed robust performance under multiple scenarios, with the model yielding 5% MAPE. Yang et al. [249] adopted a combined graph-convolutional and LSTM architecture to capture spatial shading among rooftops for spatiotemporal PV potential estimation. Their results indicated notable gains over standard LSTM models: relative error reductions of approximately 12% (MAPE), 13% (RMSE), 21% (MAE), and 22% (MSE). Although these studies demonstrate the promise of graph-based spatiotemporal models for rooftop PV estimation, the number of works that explicitly address building sector PV generation forecasting remains limited, and further research is needed.
A digital twin is a dynamic virtual model of a physical asset that enables real-time monitoring [250]. Through connectivity and fast data processing, it mirrors the building’s state, maintains synchronized data, and can simulate system behavior to support operational decision-making [250]. This approach has been widely used for building energy consumption forecasting; Boukaf et al. recently published a review summarizing key advances in this area [250]. Digital-twin applications are also well established in wind energy systems [251], yet their deployment for PV systems remains limited to date [252].
Overall, these novel approaches provide exciting opportunities to enhance PV generation forecasting for buildings, though broader evaluation and practical implementation are still required.

3.9. Practical Implications of Forecasting Accuracy

The practical implications of PV generation forecasting accuracy include optimal system sizing and strategies for grid integration.
PV generation forecasting is a critical element of efficient EMS operation. El-Baz et al. [253] discussed that uncertainties in PV generation forecasts and variability lead to less efficient operation plans for household devices, thereby reducing building autonomy, self-sufficiency, and potential cost savings and self-consumption. Most studies recommend forecasting horizons of 6–48 h for effective load planning (shorter horizons may be acceptable only when power-to-heat devices such as heat pumps are absent) [253]. The typical temporal resolution for building-scale applications is 15 min–1 h [253].
Sorour et al. [254] developed an EMS for PV–battery systems and demonstrated that integrating PV forecasts into its optimization can deliver substantial economic benefits. Over six months, their proposed method reduced net energy exchange with the grid by up to 194% and total operating costs by 54% compared to the existing approaches while lowering energy bills by as much as 46%. They also showed that forecast-based scheduling is more efficient when the EMS objective explicitly minimizes the net grid exchange.
Accurate forecasting of PV generation enables the estimation of inverter capacity and the optimal sizing of the overall energy system. Despite its importance, there is a striking lack of studies addressing forecast-driven system sizing in the building sector. Reliable predictions could help designers balance investment costs with operational performance, avoiding undersized installations that fail to meet demand or oversized systems that increase unnecessary expenses. Moreover, precise forecasts support integration with building energy management systems, enabling efficient load scheduling and storage utilization to maximize self-consumption and reduce grid dependence. Given the growing adoption of rooftop PV systems in buildings, research on forecast-informed sizing strategies is essential to achieve cost-effective, resilient, and sustainable energy solutions.
Forecast accuracy directly influences how effectively renewable energy can be integrated into the grid [255]. In systems with high wind shares, integration is highly sensitive to forecast errors, while PV integration is less dependent on accuracy. As renewable penetration rises, forecast errors increase the demand for control reserves, though this effect stabilizes once the installed capacity reaches the annual peak demand. Furthermore, the length of the forecast horizon is critical, since better-aligned forecasts improve predictive dispatch and overall system stability.

4. Discussion

The discussion section seeks to present the principal findings related to this topic. Through a summary and analysis of the existing literature in Section 3, researchers can pinpoint the most effective methodologies and strategies for PV generation forecasting while also identifying significant challenges and opportunities for future exploration. This section presents the main outcomes of the study, and the identified research gaps are evaluated.

4.1. Outcomes

This study explores key aspects such as the significance of high-quality data in PV generation forecasting, various forecasting methodologies, evaluation approaches (metrics and post-processing), challenges in modeling PV systems in the building sector, and a comprehensive review of strategies related to BIPV and BAPV systems. Building on the previous sections and the conducted analysis, the following outcomes were verified.
Factors influencing forecasting accuracy include the number of variables, as an increase in variables does not necessarily lead to a relative improvement in prediction accuracy. While using larger sets of input variables resulted in the smallest forecast errors, the computational time increased as the number of input variables grew. Furthermore, improving prediction accuracy is not necessarily related to the complexity of the model, and in some cases, simpler models may yield better results. In this regard, the dataset plays a key role.
Factors influencing the impact of environmental and system characteristics include the azimuth angles and tilt in PV systems, which influence the energy yield and power profiles and play a crucial role in the design and accuracy of PV output forecasting models. In addition, the accuracy of solar radiation forecasting will directly influence the framework’s forecasting of PV output capability. PV generation is more consistent on sunny days, leading to better prediction performance compared to cloudy days. Despite the expectation of more sunshine in the summer, some locations experience their wettest season during this time, leading to reduced sunshine. Consequently, summer weather patterns in these areas may resemble those of cold months. Therefore, the location is crucial, as seasonal changes may not always correlate with weather conditions.
The selection of models is influenced by the balance between accuracy and computational time. While the hybrid model is acknowledged for its superior accuracy in most cases, it compromises speed compared to individual models. On the other hand, when forecasting methods are applied to building applications, some ML methods show superior performance, suggesting that more complex approaches do not always yield the best accuracy. Enhancing the accuracy of PV output prediction is achievable by exploiting the temporal correlations within the PV generation data and spatial similarities. Another challenge is that the accuracy of NWP-independent models deteriorates after a few hours, resulting in lower reliability compared to NWP-dependent models.
The prediction models for BAPV and BIPV systems include physical, statistical, hybrid, and ensemble models. Based on Table 5 and Table 6, it was possible to conclude that physical models are rarely used due to their reliance on detailed system parameters, which are often unavailable or difficult to measure. Statistical models, such as LSTM, are widely applied due to their ability to capture temporal dependencies and handle time series data effectively. Other Neural Networks models, like ANN and MLP, are popular for modeling nonlinear relationships, while SVR and SVM are frequently used as benchmarks for their robustness and reliable performance. Hybrid models, like CNN-LSTM, effectively integrate spatial and temporal features, addressing the complexity of PV forecasting tasks. Other hybrid approaches, such as LSTM-based combinations with optimization techniques, offer improvements in accuracy but increase the computational time. Ensemble methods, like RFs and XGBoost, are prominent for combining weak learners and enhancing accuracy.
In terms of performance evaluation, RMSE is the most widely used metric in PV generation forecasting due to its ability to penalize larger errors, making it critical for energy management. Its interpretability, being in the same units as the predicted values, also adds to its popularity.

4.2. Gaps and Future Work

Although extensive research has been conducted on the forecasting of PV generation in buildings, certain gaps persist, requiring further exploration. This section outlines key areas where additional investigation could yield valuable insights into improving the accuracy, efficiency, and applicability of PV generation forecasting methodologies in the building sector.
Priority roadmap. Among the identified gaps, immediate priority should be given to (1) translating PV forecasts into operational EMS decisions (to quantify peak shaving, cost, and sizing benefits); (2) addressing methodological weaknesses through standardized benchmarks and rigorous comparative studies (including missing data handling and spatial–temporal modeling); and (3) advancing algorithmic capabilities (online/federated learning and robust feature selection). Subsequent priorities include targeted studies on façade temperature mismatch, soiling and equipment aging, systematic investigations of localized environmental factors, and the development of BIPV/bifacial-specific forecasting datasets.
Although the majority of reviewed studies focused on developing and fine-tuning forecasting models for PV generation, there was a clear and pressing gap in translating those forecasts into operational decisions within BEMSs. Future research should treat PV forecasts as inputs (and not as final results) and rigorously evaluate how forecast-driven EMS strategies affect peak shaving, electricity bill reduction, operational costs, and technoeconomic outcomes such as energy storage deployment and PV/inverter sizing.
The methodological gaps include several critical issues, such as the limited comparison of forecasting methodologies, the assumption of constant PV generation despite real-world variations, the neglect of spatial and temporal correlations, the lack of missing value imputation strategies, and the underutilization of heuristic optimization techniques. Many studies have conducted comparisons among a restricted set of methodologies, indicating the need for researchers to undertake comprehensive assessments to identify the most reliable and efficient models, thereby advancing forecasting methodologies. Additionally, most forecasting models assume constant PV generation, disregarding real-world variations (ignoring panel degradation, inverter losses, and temperature-dependent performance variations) and leading to inaccurate forecasts. Furthermore, the focus of many PV output forecasting methods on specific PV systems neglects spatial and temporal correlations among numerous systems, which is essential for improving forecasting accuracy. Moreover, the scarcity of literature discussing missing value imputation in building PV output forecasting further obstructs the development of robust methods for handling missing data in real-world datasets. Addressing this gap is essential to improving the accuracy of PV generation forecasts. Lastly, heuristic optimization techniques, despite their proven potential to improve the accuracy of forecasting models, are rarely applied in the literature. These methods could offer significant benefits in optimizing model parameters and enhancing predictive performance, particularly for complex PV systems.
Advancing methods in PV forecasting involve addressing several key challenges, such as the reliance on traditional offline ML, the need for online learning techniques, the lack of backward feature elimination, and the limited availability of standardized data. Additionally, privacy concerns around data sharing could be addressed using federated learning algorithms to improve forecasting accuracy. The common use of traditional offline ML in PV generation forecasting limits the effectiveness of these models in real-world applications. Further research on online learning techniques for ML in PV generation forecasting can improve the accuracy and applicability of these models. Online models, which regularly update with both predicted and real values, could play a significant role in future literature. In addition, the lack of application of backward feature elimination techniques hinders the identification of the most relevant set of features, potentially limiting the accuracy and efficiency of the predictive model. Finally, the limited availability of comprehensive and standardized data on building PV output patterns restricts the development and deployment of reliable models for PV output forecasting. Despite building owners having the data, they are hesitant to share them due to privacy concerns. Future research could investigate federated learning algorithms and their potential to improve energy prediction accuracy while preserving privacy.
Due to varying ventilation conditions, significant temperature differences, and uneven temperature distribution across PV façade modules, substantial mismatch power losses occur, adversely impacting electrical performance. Future research should focus on analyzing the temperature distribution of PV façades to mitigate these losses. Additionally, PV system efficiency declines over time due to equipment aging and the accumulation of surface dust. Therefore, future evaluation methods should consider the system’s full life cycle and its interaction with the BEMS.
Environmental and operational factors affecting PV generation include factors such as partial shading, building orientation and tilt, building height, proximity to trees and other obstructions, dust and dirt, snow and ice, and pollution. However, a noticeable gap remains in comprehensive investigations into the impact of these localized factors. This lack of research limits our understanding of their effects on PV generation in building environments.
The insufficient research on BIPV, despite the growing importance of integrating solar technology into building structures, hampers the development of innovative BIPV solutions and the advancement of the intersection of PVs and building technology. Moreover, the significant gap in research specifically investigating the energy forecasting of bifacial solar PV systems in buildings obstructs the development of accurate forecasting models. However, these critical aspects have been overlooked in the current literature.

5. Conclusions

To tackle the growing challenges of energy consumption and GHG emissions, buildings play a crucial role in transforming the energy sector by facilitating the integration of PV systems. By accurately predicting PV generation alongside building energy demands, energy management strategies can be optimized. This paper highlights the significance of forecasting PV generation in buildings and underscores the critical need for reliable data. It also clarifies the key distinctions between PV forecasting in the building sector and other sectors. Moreover, various forecasting methodologies and evaluation metrics are thoroughly examined.
A comprehensive review of the current landscape of PV generation forecasting in buildings was conducted. The reviewed articles were categorized into BIPV and BAPV using a content analysis approach. This review examined a range of forecasting methods, including physical models and statistical approaches (such as Artificial Neural Networks (ANNs) and Machine Learning (ML)), as well as hybrid and ensemble techniques, within both BIPV and BAPV contexts. Statistical methodologies dominate due to their adaptability and ability to capture nonlinear relationships. While physical models offer deterministic accuracy, their reliance on detailed input parameters limits practical applications. Hybrid and ensemble models, despite their enhanced predictive capabilities, remain computationally demanding. Additionally, this study analyzed the most frequently used evaluation metrics. RMSE emerged as the most widely used evaluation metric, reflecting its effectiveness in penalizing large forecasting errors, though other metrics, like MAE, are also employed for complementary insights.
Recognizing the vast potential of PV generation forecasting, this article presents in-depth discussions on existing research gaps (such as the impact of localized environmental factors, the need for advanced forecasting methodologies, and data privacy challenges) and outlines key future research directions (including the development of online learning models, improved missing value imputation techniques, and applying heuristic optimization) for energy researchers to explore.

Author Contributions

O.P.: Writing—original draft, Visualization, Validation, Methodology, Investigation, Formal analysis, Data curation, and Conceptualization. A.S.: Writing—review and editing, Methodology, and Supervision. P.M.: Writing—review and editing, Methodology, Supervision, Conceptualization, and Project administration. All authors have read and agreed to the published version of the manuscript.

Funding

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Data Availability Statement

No data were used for the research described in the article.

Acknowledgments

This research was supported by the Portuguese Foundation for Science and Technology of Portugal (FCT) through the project grants UIDB/00308/2020 (DOI 10.54499/UIDB/00308/2020) and UIDB/00048/2020 (DOI 10.54499/UIDB/00048/2020) and by the ERDF and national funds through the projects flexREC (COMPETE2030-FEDER-00818200), EnergyFlex (COMPETE2030-FEDER-00736300), and the Alliance for the Energy Transition (56) co-financed by the Recovery and Resilience Plan (RRP) through the European Union. The authors acknowledge the use of ChatGPT (GPT-5, OpenAI, 2025) during the preparation of this manuscript for the purposes of improving clarity, grammar, and readability. The authors have reviewed and edited the output and take full responsibility for the content of this publication.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
AIArtificial Intelligence
ANNArtificial Neural Networks
ARIMAAutoregressive Integrated Moving Average
AMLAutomatic Machine Learning
BiLSTMBidirectional Long Short-Term Memory
BAPVBuilding Attached Photovoltaics
BEMSBuilding Energy Management Systems
BIPVBuilding Integrated Photovoltaics
CSOCompetitive Swarm Optimizer
CHAIDCHi-square Automatic Interaction Detection
CARTClassification And Regression Trees
CEEMDANComplete Ensemble Empirical Mode Decomposition with Adaptive Noise
CNNConvolutional Neural Network
ConvLSTMConvolutional Long Short-Term Memory
DTRDecision Tree Regression
DTsDecision Trees
DLDeep Learning
DNNDeep Neural Networks
DRDemand Response programs
DSE-XGBImproved generally applicable stacked ensemble algorithm
ESSEnergy Storage Systems
ETsExtra-Trees
FLFederated Learning
FFNNFeed-Forward Neural Network
FCMFuzzy C-Means
GAGenetic Algorithm
GNNsGraph Neural Networks
GRUGated Recurrent Unit
GWOGrey Wolf Optimizer
HVACHeating, Ventilation, and Air Conditioning
hSBFMhierarchical Similarity-Based Forecasting Models
IGWOImproved Gray Wolf Optimization
KNNsK-Nearest Neighbors
KPIsKey Performance Indicators
LASSOLeast Absolute Shrinkage and Selection Operator
LRLinear Regression
LSTMLong Short-Term Memory
MLMachine Learning
MLPEModule-Level Power Electronics
MPPMaximum Power Point
MPPTMaximum Power Point Tracking
MAEMean Absolute Error
MAPEMean Absolute Percentage Error
MBEMean Bias Error
MLPMulti-Layer Perceptron
MVRMulti-Variable Regression
MLRMultiple Linear Regression
NARXNonlinear Autoregressive with exogenous input
NPVNet Present Value
NRMSENormalized Root Mean Squared Error
NWPNumerical Weather Prediction
PVPhotovoltaic
QSVMQuadratic Support Vector Machine
QRQuantile Regression
QRFQuantile Regression Forests
QMLQuantum Machine Learning
RBF-ANNRadial Basis Function Artificial Neural Networks
RFsRandom Forests
RNNRecurrent Neural Networks
RMSERoot Mean Squared Error
SAPMSandia Array Performance Model
SBFMsSimilarity-Based Forecasting Models
SC*Sky Condition
SVFSky View Factor
SNNStandard Neural Network
SVMSupport Vector Machine
SVRSupport Vector Regression
R2The Coefficient of Determination
TBATSTrigonometric Seasonal, Box-Cox Transformation, Autoregressive Integrated Moving Average residuals, Trend, and Seasonality
VMDVariational Mode Decomposition
WTEEMDWavelet Threshold Improvement Ensemble Empirical Mode Decomposition
XGBoostExtreme Gradient Boost

References

  1. Alanne, K. Study on Reducing the Grid Dependency of Urban Housing in Nordic Climate by Hybrid Renewable Energy Systems. Renew. Energy Focus 2023, 46, 1–15. [Google Scholar] [CrossRef]
  2. Emil, F.; Diab, A. Energy Rationalization for an Educational Building in Egypt: Towards a Zero Energy Building. J. Build. Eng. 2021, 44, 103247. [Google Scholar] [CrossRef]
  3. Belloni, E.; Fioriti, D.; Poli, D. Optimal Design of Renewable Energy Communities (RECs) in Italy: Influence of Composition, Market Signals, Buildings, Location, and Incentives. Electr. Power Syst. Res. 2024, 235, 110895. [Google Scholar] [CrossRef]
  4. Ullah, Z.; Qazi, H.S.; Rehman, A.U.; Hasanien, H.M.; Wang, S.; Elkadeem, M.R.; Badshah, F. Efficient Energy Management of Domestic Loads with Electric Vehicles by Optimal Scheduling of Solar-Powered Battery Energy Storage System. Electr. Power Syst. Res. 2024, 234, 110570. [Google Scholar] [CrossRef]
  5. Huseinbegović, S.; Smajkić, A.; Ahmethodžić, L.; Smaka, S.; Gajip, S. Optimal Building Integrated Photovoltaic Sizing Approach According to Load Profile under Zero Export Restrictions with Real Data Validation. Renew. Energy Focus 2024, 50, 100605. [Google Scholar] [CrossRef]
  6. Lee, J.; Shepley, M.M. Benefits of Solar Photovoltaic Systems for Low-Income Families in Social Housing of Korea: Renewable Energy Applications as Solutions to Energy Poverty. J. Build. Eng. 2020, 28, 101016. [Google Scholar] [CrossRef]
  7. Kurian, J.; Karthi, L. Building Integrated Photovoltaics-an Overview. Sustain. Agri Food Environ. Res. 2022, 10. [Google Scholar] [CrossRef]
  8. Kavlak, G.; McNerney, J.; Trancik, J.E. Evaluating the Causes of Cost Reduction in Photovoltaic Modules. Energy Policy 2018, 123, 700–710. [Google Scholar] [CrossRef]
  9. Building-integrated Photovoltaics Market Size, Share & Trends Analysis Report By Technology (Crystalline Silicon, Thin Film), by Application, by End-use, by Region, and Segment Forecasts, 2024–2030. Available online: https://www.grandviewresearch.com/industry-analysis/building-integrated-photovoltaics-bipv-market#:~:text=The%20global%20building%2Dintegrated%20photovoltaics,21.2%25%20from%202024%20to%202030. (accessed on 16 September 2025).
  10. New Report: Solar for the Masses as Global Solar Rooftops Skyrocket by 50%—SolarPower Europe. Available online: https://www.solarpowereurope.org/press-releases/new-report-solar-for-the-masses-as-global-solar-rooftops-skyrocket-by-50-2 (accessed on 4 December 2024).
  11. Global Market Outlook For Solar Power 2023–2027—SolarPower Europe. Available online: https://www.solarpowereurope.org/insights/outlooks/global-market-outlook-for-solar-power-2023-2027/detail (accessed on 4 December 2024).
  12. Emerging Solar Era: The Global Expansion of Solar PV and Energy Storage. Available online: https://www.rystadenergy.com/insights/solar-pv-and-energy-storage-outlook-emerging-solar-era (accessed on 4 December 2024).
  13. Nwaigwe, K.N.; Mutabilwa, P.; Dintwa, E. An Overview of Solar Power (PV Systems) Integration into Electricity Grids. Mater. Sci. Energy Technol. 2019, 2, 629–633. [Google Scholar] [CrossRef]
  14. Yi, C.; Peng, Y.; Su, S.; Li, B.; Wang, X.; Zhou, W.; Guo, X.; Yang, H.; Meng, W. Anomaly Detection of Photovoltaic Power Generation Based on Quantile Regression Recurrent Neural Network. Electr. Power Syst. Res. 2025, 238, 111132. [Google Scholar] [CrossRef]
  15. Xie, J.; Zhao, B.; Zhang, H.; Fu, Z.; Yang, T.; Li, R. Experimental Study on the Effect of Dust Particle Deposition on Photovoltaic Performance of Urban Buildings. Renew. Energy 2023, 219, 119424. [Google Scholar] [CrossRef]
  16. Wang, Z.; Liu, Z.; Huo, Y. A Distributionally Robust Optimization Approach of Multi-Park Integrated Energy Systems Considering Shared Energy Storage and Uncertainty of Demand Response. Electr. Power Syst. Res. 2025, 238, 111116. [Google Scholar] [CrossRef]
  17. Ranjan, M.; Shankar, R. A Literature Survey on Load Frequency Control Considering Renewable Energy Integration in Power System: Recent Trends and Future Prospects. J. Energy Storage 2022, 45, 103717. [Google Scholar] [CrossRef]
  18. Pedram, O.; Asadi, E.; Chenari, B.; Moura, P.; Gameiro da Silva, M. A Review of Methodologies for Managing Energy Flexibility Resources in Buildings. Energies 2023, 16, 6111. [Google Scholar] [CrossRef]
  19. Barreto, R.; Gonçalves, C.; Gomes, L.; Faria, P.; Vale, Z. Evaluation Metrics to Assess the Most Suitable Energy Community End-Users to Participate in Demand Response. Energies 2022, 15, 2380. [Google Scholar] [CrossRef]
  20. Silva, W.N.; Henrique, L.F.; Silva, A.F.P.d.C.; Dias, B.H.; Soares, T.A. Market Models and Optimization Techniques to Support the Decision-Making on Demand Response for Prosumers. Electr. Power Syst. Res. 2022, 210, 108059. [Google Scholar] [CrossRef]
  21. Massat, S.; Franke, G.; Rinderknecht, S. Exploring the Potential of a Cross-Sector Integration of Electric Vehicles in Residential Buildings as a Possible Energy Supply Source. J. Build. Eng. 2023, 77, 107486. [Google Scholar] [CrossRef]
  22. Ahmad, T.; Zhang, H.; Yan, B. A Review on Renewable Energy and Electricity Requirement Forecasting Models for Smart Grid and Buildings. Sustain. Cities Soc. 2020, 55, 102052. [Google Scholar] [CrossRef]
  23. Seyedzadeh, S.; Rahimian, F.P.; Glesk, I.; Roper, M. Machine Learning for Estimation of Building Energy Consumption and Performance: A Review. Vis. Eng. 2018, 6, 5. [Google Scholar] [CrossRef]
  24. Sharma, V.; Cortes, A.; Cali, U. Use of Forecasting in Energy Storage Applications: A Review. IEEE Access 2021, 9, 114690–114704. [Google Scholar] [CrossRef]
  25. Aguilar, J.; Garces-Jimenez, A.; R-Moreno, M.D.; García, R. A Systematic Literature Review on the Use of Artificial Intelligence in Energy Self-Management in Smart Buildings. Renew. Sustain. Energy Rev. 2021, 151, 111530. [Google Scholar] [CrossRef]
  26. Immonen, A.; Kiljander, J. Flexibility Services for Household Consumers in Finland: Requirements and Provided Properties. Renew. Energy Focus 2024, 49, 100567. [Google Scholar] [CrossRef]
  27. Jia, M.; Komeily, A.; Wang, Y.; Srinivasan, R.S. Adopting Internet of Things for the Development of Smart Buildings: A Review of Enabling Technologies and Applications. Autom. Constr. 2019, 101, 111–126. [Google Scholar] [CrossRef]
  28. Lee, H.-C.; Liu, H.-Y.; Lin, T.-C.; Lee, C.-Y. A Customized Energy Management System for Distributed PV, Energy Storage Units, and Charging Stations on Kinmen Island of Taiwan. Sensors 2023, 23, 5286. [Google Scholar] [CrossRef]
  29. Sumorek, M.; Idzkowski, A. Time Series Forecasting for Energy Production in Stand-Alone and Tracking Photovoltaic Systems Based on Historical Measurement Data. Energies 2023, 16, 6367. [Google Scholar] [CrossRef]
  30. Amabile, L.; Bresch-Pietri, D.; El Hajje, G.; Labbé, S.; Petit, N. Optimizing the Self-Consumption of Residential Photovoltaic Energy and Quantification of the Impact of Production Forecast Uncertainties. Adv. Appl. Energy 2021, 2, 100020. [Google Scholar] [CrossRef]
  31. Angenendt, G.; Zurmühlen, S.; Axelsen, H.; Sauer, D.U. Comparison of Different Operation Strategies for PV Battery Home Storage Systems Including Forecast-Based Operation Strategies. Appl. Energy 2018, 229, 884–899. [Google Scholar] [CrossRef]
  32. Durán, F.; Pavón, W.; Minchala, L.I. Forecast-Based Energy Management for Optimal Energy Dispatch in a Microgrid. Energies 2024, 17, 486. [Google Scholar] [CrossRef]
  33. Amasyali, K.; El-Gohary, N.M. A Review of Data-Driven Building Energy Consumption Prediction Studies. Renew. Sustain. Energy Rev. 2018, 81, 1192–1205. [Google Scholar] [CrossRef]
  34. Wei, Y.; Zhang, X.; Shi, Y.; Xia, L.; Pan, S.; Wu, J.; Han, M.; Zhao, X. A Review of Data-Driven Approaches for Prediction and Classification of Building Energy Consumption. Renew. Sustain. Energy Rev. 2018, 82, 1027–1047. [Google Scholar] [CrossRef]
  35. Ahmad, T.; Chen, H.; Guo, Y.; Wang, J. A Comprehensive Overview on the Data Driven and Large Scale Based Approaches for Forecasting of Building Energy Demand: A Review. Energy Build. 2018, 165, 301–320. [Google Scholar] [CrossRef]
  36. Chen, Y.; Guo, M.; Chen, Z.; Chen, Z.; Ji, Y. Physical Energy and Data-Driven Models in Building Energy Prediction: A Review. Energy Rep. 2022, 8, 2656–2671. [Google Scholar] [CrossRef]
  37. Panda, S.; Dhaka, R.K.; Panda, B.; Pradhan, A.; Jena, C.; Nanda, L. A Review on Application of Machine Learning in Solar Energy & Photovoltaic Generation Prediction. In Proceedings of the 2022 International Conference on Electronics and Renewable Systems (ICEARS), Tuticorin, India, 16–18 March 2022; pp. 1180–1184. [Google Scholar]
  38. Akhter, M.N.; Mekhilef, S.; Mokhlis, H.; Mohamed Shah, N. Review on Forecasting of Photovoltaic Power Generation Based on Machine Learning and Metaheuristic Techniques. IET Renew. Power Gener. 2019, 13, 1009–1023. [Google Scholar] [CrossRef]
  39. Erdener, B.C.; Feng, C.; Doubleday, K.; Florita, A.; Hodge, B.-M. A Review of Behind-the-Meter Solar Forecasting. Renew. Sustain. Energy Rev. 2022, 160, 112224. [Google Scholar] [CrossRef]
  40. Sobri, S.; Koohi-Kamali, S.; Rahim, N.A. Solar Photovoltaic Generation Forecasting Methods: A Review. Energy Convers. Manag. 2018, 156, 459–497. [Google Scholar] [CrossRef]
  41. de Freitas Viscondi, G.; Alves-Souza, S.N. A Systematic Literature Review on Big Data for Solar Photovoltaic Electricity Generation Forecasting. Sustain. Energy Technol. Assess. 2019, 31, 54–63. [Google Scholar] [CrossRef]
  42. Neuendorf, K.A. The Content Analysis Guidebook; 9. [print.].; Sage Publ: Thousand Oaks, CA, USA, 2010; ISBN 978-0-7619-1977-3. [Google Scholar]
  43. Li, G.; Guo, S.; Li, X.; Cheng, C. Short-Term Forecasting Approach Based on Bidirectional Long Short-Term Memory and Convolutional Neural Network for Regional Photovoltaic Power Plants. Sustain. Energy Grids Netw. 2023, 34, 101019. [Google Scholar] [CrossRef]
  44. Brecl, K.; Topič, M. Photovoltaics (PV) System Energy Forecast on the Basis of the Local Weather Forecast: Problems, Uncertainties and Solutions. Energies 2018, 11, 1143. [Google Scholar] [CrossRef]
  45. Mayer, M.J.; Gróf, G. Extensive Comparison of Physical Models for Photovoltaic Power Forecasting. Appl. Energy 2021, 283, 116239. [Google Scholar] [CrossRef]
  46. Başaran, K.; Bozyiğit, F.; Siano, P.; Yıldırım Taşer, P.; Kılınç, D. Systematic Literature Review of Photovoltaic Output Power Forecasting. IET Renew. Power Gener. 2020, 14, 3961–3973. [Google Scholar] [CrossRef]
  47. Alshahrani, A.; Omer, S.; Su, Y.; Mohamed, E.; Alotaibi, S. The Technical Challenges Facing the Integration of Small-Scale and Large-Scale PV Systems into the Grid: A Critical Review. Electronics 2019, 8, 1443. [Google Scholar] [CrossRef]
  48. Jiang, H.; Zhang, X.; Yao, L.; Lu, N.; Qin, J.; Liu, T.; Zhou, C. High-Resolution Analysis of Rooftop Photovoltaic Potential Based on Hourly Generation Simulations and Load Profiles. Appl. Energy 2023, 348, 121553. [Google Scholar] [CrossRef]
  49. Ming, B.; Liu, P.; Guo, S.; Zhang, X.; Feng, M.; Wang, X. Optimizing Utility-Scale Photovoltaic Power Generation for Integration into a Hydropower Reservoir by Incorporating Long- and Short-Term Operational Decisions. Appl. Energy 2017, 204, 432–445. [Google Scholar] [CrossRef]
  50. Mellit, A.; Massi Pavan, A.; Ogliari, E.; Leva, S.; Lughi, V. Advanced Methods for Photovoltaic Output Power Forecasting: A Review. Appl. Sci. 2020, 10, 487. [Google Scholar] [CrossRef]
  51. Nassereddine, M. A Novel Lightning Mast Layout to Eliminate Shading Effect on PV Panels. Electr. Power Syst. Res. 2024, 236, 110972. [Google Scholar] [CrossRef]
  52. Li, H.X.; Zhang, Y.; Edwards, D.; Hosseini, M.R. Improving the Energy Production of Roof-Top Solar PV Systems through Roof Design. Build. Simul. 2020, 13, 475–487. [Google Scholar] [CrossRef]
  53. Høiaas, I.; Grujic, K.; Imenes, A.G.; Burud, I.; Olsen, E.; Belbachir, N. Inspection and Condition Monitoring of Large-Scale Photovoltaic Power Plants: A Review of Imaging Technologies. Renew. Sustain. Energy Rev. 2022, 161, 112353. [Google Scholar] [CrossRef]
  54. Luthander, R.; Widén, J.; Nilsson, D.; Palm, J. Photovoltaic Self-Consumption in Buildings: A Review. Appl. Energy 2015, 142, 80–94. [Google Scholar] [CrossRef]
  55. Köker, N.I.; Manni, M.; Giorio, M.; Jelle, B.P.; Di Sabatino, M.; Lobaccaro, G. Defining Challenges of Solar Irradiance Modeling on Façades in Urban Environments: A Systematic Review. Energy Build. 2025, 347, 116137. [Google Scholar] [CrossRef]
  56. Zomer, C.; Custódio, I.; Antoniolli, A.; Rüther, R. Performance Assessment of Partially Shaded Building-Integrated Photovoltaic (BIPV) Systems in a Positive-Energy Solar Energy Laboratory Building: Architecture Perspectives. Sol. Energy 2020, 211, 879–896. [Google Scholar] [CrossRef]
  57. Gonçalves, J.E.; Montazeri, H.; van Hooff, T.; Saelens, D. Performance of Building Integrated Photovoltaic Facades: Impact of Exterior Convective Heat Transfer. Appl. Energy 2021, 287, 116538. [Google Scholar] [CrossRef]
  58. Building Integrated Photovoltaics (BIPV)|WBDG—Whole Building Design Guide. Available online: https://www.wbdg.org/resources/building-integrated-photovoltaics-bipv (accessed on 2 September 2025).
  59. Badran, G.; Dhimish, M. Comprehensive Study on the Efficiency of Vertical Bifacial Photovoltaic Systems: A UK Case Study. Sci. Rep. 2024, 14, 18380. [Google Scholar] [CrossRef]
  60. Viriyaroj, B.; Jouttijärvi, S.; Jänkälä, M.; Miettunen, K. Performance of Vertically Mounted Bifacial Photovoltaics under the Physical Influence of Low-Rise Residential Environment in High-Latitude Locations. Front. Built Environ. 2024, 10, 1343036. [Google Scholar] [CrossRef]
  61. Zabihi, A.; Sadeghkhani, I.; Fani, B. A Partial Shading Detection Algorithm for Photovoltaic Generation Systems. J. Sol. Energy Res. 2021, 6, 678–687. [Google Scholar] [CrossRef]
  62. IEA-PVPS (2023).Pdf. Available online: https://iea-pvps.org/wp-content/uploads/2023/10/PVPS_Trends_Report_2023_WEB.pdf (accessed on 16 September 2025).
  63. Kappler, T.; Starosta, A.S.; Munzke, N.; Schwarz, B.; Hiller, M. Detection of Shading for Short-Term Power Forecasting of Photovoltaic Systems Using Machine Learning Techniques. EPJ Photovolt. 2024, 15, 17. [Google Scholar] [CrossRef]
  64. Kappler, T.; Starosta, A.S.; Schwarz, B.; Munzke, N.; Hiller, M. Inclusion of Shading and Soiling With Physical and Data-Driven Algorithms for Solar Power Forecasting. PV-Symp. Proc. 2024, 1, 39. [Google Scholar] [CrossRef]
  65. Li, F.; Yuan, Z.; Wu, W. Experimental Investigation of Soiling Losses on Photovoltaic in High-Density Urban Environments. Appl. Energy 2024, 369, 123572. [Google Scholar] [CrossRef]
  66. Muniz-Gäal, L.P.; Pezzuto, C.C.; de Carvalho, M.F.H.; Mota, L.T.M. Urban Geometry and the Microclimate of Street Canyons in Tropical Climate. Build. Environ. 2020, 169, 106547. [Google Scholar] [CrossRef]
  67. Khalvandi, R.; Karimimoshaver, M. The Optimal Ratio in the Street Canyons: Comparison of Two Methods of Satellite Images and Simulation. Build. Environ. 2023, 229, 109927. [Google Scholar] [CrossRef]
  68. Dirksen, M.; Ronda, R.J.; Theeuwes, N.E.; Pagani, G.A. Sky View Factor Calculations and Its Application in Urban Heat Island Studies. Urban Clim. 2019, 30, 100498. [Google Scholar] [CrossRef]
  69. Rostami, E.; Nasrollahi, N.; Khodakarami, J. A Comprehensive Study of How Urban Morphological Parameters Impact the Solar Potential, Energy Consumption and Daylight Autonomy in Canyons and Buildings. Energy Build. 2024, 305, 113904. [Google Scholar] [CrossRef]
  70. Ismail, A.A.; El-Deib, A.A. DC/AC Conversion Topologies for Photovoltaic Applications. In Proceedings of the 2023 IEEE Conference on Power Electronics and Renewable Energy (CPERE), Luxor, Egypt, 19–21 February 2023; pp. 1–6. [Google Scholar]
  71. Sarwar, S.; Javed, M.Y.; Jaffery, M.H.; Ashraf, M.S.; Naveed, M.T.; Hafeez, M.A. Modular Level Power Electronics (MLPE) Based Distributed PV System for Partial Shaded Conditions. Energies 2022, 15, 4797. [Google Scholar] [CrossRef]
  72. He, W.; Baig, M.J.A.; Iqbal, M.T. An Internet of Things—Supervisory Control and Data Acquisition (IoT-SCADA) Architecture for Photovoltaic System Monitoring, Control, and Inspection in Real Time. Electronics 2025, 14, 42. [Google Scholar] [CrossRef]
  73. Current Weather and Forecast—OpenWeatherMap. Available online: https://openweathermap.org/ (accessed on 17 September 2024).
  74. POWER|DAV. Available online: https://power.larc.nasa.gov/data-access-viewer/ (accessed on 18 September 2024).
  75. COMMpla|Trust-IT Forecasts. Available online: https://www.ecmwf.int/en/forecasts (accessed on 18 September 2024).
  76. Home|Copernicus. Available online: https://atmosphere.copernicus.eu/ (accessed on 18 September 2024).
  77. National Weather Service. Available online: https://www.weather.gov/ (accessed on 18 September 2024).
  78. MesoWest Data. Available online: https://mesowest.utah.edu/ (accessed on 18 September 2024).
  79. Korea Meteorological Administration Weather Data Service. Available online: https://data.kma.go.kr/cmmn/main.do (accessed on 18 September 2024).
  80. Ladda Ner Meteorologiska Observationer SMHI. Available online: https://www.smhi.se/data/meteorologi/ladda-ner-meteorologiska-observationer#param=airtemperatureInstant,stations=core (accessed on 18 September 2024).
  81. Home Page|DKA Solar Centre. Available online: https://dkasolarcentre.com.au/ (accessed on 18 September 2024).
  82. NSRDB. Available online: https://nsrdb.nrel.gov/ (accessed on 18 September 2024).
  83. Global Solar Atlas. Available online: https://globalsolaratlas.info/map (accessed on 18 September 2024).
  84. PD: 3D Sun-Path. Available online: https://andrewmarsh.com/apps/staging/sunpath3d.html (accessed on 18 September 2024).
  85. PVGIS. Available online: https://pvgis.com (accessed on 17 September 2024).
  86. PVOutput. Available online: https://www.pvoutput.org/ (accessed on 18 September 2024).
  87. NREL Data Catalog l NREL. Available online: https://data.nrel.gov/ (accessed on 18 September 2024).
  88. Renewables.Ninja. Available online: https://www.renewables.ninja/ (accessed on 18 September 2024).
  89. Open Data Sets—IEEE PES Intelligent Systems Subcommittee. Available online: https://site.ieee.org/pes-iss/data-sets/ (accessed on 16 September 2025).
  90. Welcome to ENERGYDATA.INFO—ENERGYDATA.INFO. Available online: https://energydata.info/ (accessed on 18 September 2024).
  91. Healy, W.M.; Fanney, A.H.; Dougherty, B.P.; Ng, L.; Payne, V.; Ullah, T.; Omar, F. Performance Data from the NIST Net-Zero Energy Residential Test Facility. J. Res. Natl. Inst. Stan. 2017, 122, 14. [Google Scholar] [CrossRef]
  92. Dehury, C. Chinmaya-Dehury/Loc_Green_Energy_Availability_Pred 2022. Available online: https://github.com/chinmaya-dehury/Loc_Green_Energy_Availability_Pred/ (accessed on 16 September 2025).
  93. Global Wind Atlas. Available online: https://globalwindatlas.info (accessed on 17 September 2024).
  94. Abisoye, B.O.; Sun, Y.; Zenghui, W. A Survey of Artificial Intelligence Methods for Renewable Energy Forecasting: Methodologies and Insights. Renew. Energy Focus 2024, 48, 100529. [Google Scholar] [CrossRef]
  95. Si, Z.; Yang, M.; Yu, Y.; Ding, T. Photovoltaic Power Forecast Based on Satellite Images Considering Effects of Solar Position. Appl. Energy 2021, 302, 117514. [Google Scholar] [CrossRef]
  96. Böök, H.; Lindfors, A.V. Site-Specific Adjustment of a NWP-Based Photovoltaic Production Forecast. Sol. Energy 2020, 211, 779–788. [Google Scholar] [CrossRef]
  97. Liu, B.; Liu, M.; Cui, Y.; Shao, D.; Mao, Z.; Zhang, L.; Khan, S.; Luo, Y. Assessing Forecasting Performance of Daily Reference Evapotranspiration Using Public Weather Forecast and Numerical Weather Prediction. J. Hydrol. 2020, 590, 125547. [Google Scholar] [CrossRef]
  98. Schulz, B.; El Ayari, M.; Lerch, S.; Baran, S. Post-Processing Numerical Weather Prediction Ensembles for Probabilistic Solar Irradiance Forecasting. Sol. Energy 2021, 220, 1016–1031. [Google Scholar] [CrossRef]
  99. Bazionis, I.K.; Georgilakis, P.S. Review of Deterministic and Probabilistic Wind Power Forecasting: Models, Methods, and Future Research. Electricity 2021, 2, 13–47. [Google Scholar] [CrossRef]
  100. Jaber, M.; Abd Hamid, A.S.; Sopian, K.; Fazlizan, A.; Ibrahim, A. Prediction Model for the Performance of Different PV Modules Using Artificial Neural Networks. Appl. Sci. 2022, 12, 3349. [Google Scholar] [CrossRef]
  101. Pazikadin, A.R.; Rifai, D.; Ali, K.; Malik, M.Z.; Abdalla, A.N.; Faraj, M.A. Solar Irradiance Measurement Instrumentation and Power Solar Generation Forecasting Based on Artificial Neural Networks (ANN): A Review of Five Years Research Trend. Sci. Total Environ. 2020, 715, 136848. [Google Scholar] [CrossRef]
  102. Rao K, D.V.S.K.; Premalatha, M.; Naveen, C. Analysis of Different Combinations of Meteorological Parameters in Predicting the Horizontal Global Solar Radiation with ANN Approach: A Case Study. Renew. Sustain. Energy Rev. 2018, 91, 248–258. [Google Scholar] [CrossRef]
  103. Taye, M.M. Understanding of Machine Learning with Deep Learning: Architectures, Workflow, Applications and Future Directions. Computers 2023, 12, 91. [Google Scholar] [CrossRef]
  104. Zendehboudi, A.; Baseer, M.A.; Saidur, R. Application of Support Vector Machine Models for Forecasting Solar and Wind Energy Resources: A Review. J. Clean. Prod. 2018, 199, 272–285. [Google Scholar] [CrossRef]
  105. De Oliveira, R.A.; Bollen, M.H.J. Deep Learning for Power Quality. Electr. Power Syst. Res. 2023, 214, 108887. [Google Scholar] [CrossRef]
  106. Wazirali, R.; Yaghoubi, E.; Abujazar, M.S.S.; Ahmad, R.; Vakili, A.H. State-of-the-Art Review on Energy and Load Forecasting in Microgrids Using Artificial Neural Networks, Machine Learning, and Deep Learning Techniques. Electr. Power Syst. Res. 2023, 225, 109792. [Google Scholar] [CrossRef]
  107. Khosravi, B.; Rouzrokh, P.; Faghani, S.; Moassefi, M.; Vahdati, S.; Mahmoudi, E.; Chalian, H.; Erickson, B.J. Machine Learning and Deep Learning in Cardiothoracic Imaging: A Scoping Review. Diagnostics 2022, 12, 2512. [Google Scholar] [CrossRef]
  108. Pombo, D.V.; Bindner, H.W.; Spataru, S.V.; Sørensen, P.E.; Bacher, P. Increasing the Accuracy of Hourly Multi-Output Solar Power Forecast with Physics-Informed Machine Learning. Sensors 2022, 22, 749. [Google Scholar] [CrossRef]
  109. Chen, L.; Han, B.; Wang, X.; Zhao, J.; Yang, W.; Yang, Z. Machine Learning Methods in Weather and Climate Applications: A Survey. Appl. Sci. 2023, 13, 12019. [Google Scholar] [CrossRef]
  110. Ciaburro, G.; Iannace, G. Machine Learning-Based Algorithms to Knowledge Extraction from Time Series Data: A Review. Data 2021, 6, 55. [Google Scholar] [CrossRef]
  111. Isah, A.; Shin, H.; Oh, S.; Oh, S.; Aliyu, I.; Um, T.; Kim, J. Digital Twins Temporal Dependencies-Based on Time Series Using Multivariate Long Short-Term Memory. Electronics 2023, 12, 4187. [Google Scholar] [CrossRef]
  112. Rajagukguk, R.A.; Ramadhan, R.A.A.; Lee, H.-J. A Review on Deep Learning Models for Forecasting Time Series Data of Solar Irradiance and Photovoltaic Power. Energies 2020, 13, 6623. [Google Scholar] [CrossRef]
  113. Fan, C.; Chen, M.; Wang, X.; Wang, J.; Huang, B. A Review on Data Preprocessing Techniques Toward Efficient and Reliable Knowledge Discovery From Building Operational Data. Front. Energy Res. 2021, 9, 652801. [Google Scholar] [CrossRef]
  114. Sundararajan, A.; Sarwat, A.I. Evaluation of Missing Data Imputation Methods for an Enhanced Distributed PV Generation Prediction. In Proceedings of the Future Technologies Conference (FTC) 2019; Arai, K., Bhatia, R., Kapoor, S., Eds.; Springer International Publishing: Cham, Switzerland, 2020; pp. 590–609. [Google Scholar]
  115. Garud, K.S.; Jayaraj, S.; Lee, M.-Y. A Review on Modeling of Solar Photovoltaic Systems Using Artificial Neural Networks, Fuzzy Logic, Genetic Algorithm and Hybrid Models. Int. J. Energy Res. 2021, 45, 6–35. [Google Scholar] [CrossRef]
  116. Zhang, J.M.; Harman, M.; Ma, L.; Liu, Y. Machine Learning Testing: Survey, Landscapes and Horizons. IEEE Trans. Softw. Eng. 2019, 48, 1–36. [Google Scholar] [CrossRef]
  117. Ren, Y.; Suganthan, P.N.; Srikanth, N. Ensemble Methods for Wind and Solar Power Forecasting—A State-of-the-Art Review. Renew. Sustain. Energy Rev. 2015, 50, 82–91. [Google Scholar] [CrossRef]
  118. Rahimi, N.; Park, S.; Choi, W.; Oh, B.; Kim, S.; Cho, Y.; Ahn, S.; Chong, C.; Kim, D.; Jin, C.; et al. A Comprehensive Review on Ensemble Solar Power Forecasting Algorithms. J. Electr. Eng. Technol. 2023, 18, 719–733. [Google Scholar] [CrossRef] [PubMed]
  119. Yang, D.; Wang, W.; Gueymard, C.A.; Hong, T.; Kleissl, J.; Huang, J.; Perez, M.J.; Perez, R.; Bright, J.M.; Xia, X.; et al. A Review of Solar Forecasting, Its Dependence on Atmospheric Sciences and Implications for Grid Integration: Towards Carbon Neutrality. Renew. Sustain. Energy Rev. 2022, 161, 112348. [Google Scholar] [CrossRef]
  120. Mienye, I.D.; Sun, Y. A Survey of Ensemble Learning: Concepts, Algorithms, Applications, and Prospects. IEEE Access 2022, 10, 99129–99149. [Google Scholar] [CrossRef]
  121. Liu, D.; Sun, K. Random Forest Solar Power Forecast Based on Classification Optimization. Energy 2019, 187, 115940. [Google Scholar] [CrossRef]
  122. Guermoui, M.; Melgani, F.; Gairaa, K.; Mekhalfi, M.L. A Comprehensive Review of Hybrid Models for Solar Radiation Forecasting. J. Clean. Prod. 2020, 258, 120357. [Google Scholar] [CrossRef]
  123. Wu, H.; Levinson, D. The Ensemble Approach to Forecasting: A Review and Synthesis. Transp. Res. Part C Emerg. Technol. 2021, 132, 103357. [Google Scholar] [CrossRef]
  124. Sina, L.B.; Secco, C.A.; Blazevic, M.; Nazemi, K. Hybrid Forecasting Methods—A Systematic Review. Electronics 2023, 12, 2019. [Google Scholar] [CrossRef]
  125. Zhou, J.; Gandomi, A.H.; Chen, F.; Holzinger, A. Evaluating the Quality of Machine Learning Explanations: A Survey on Methods and Metrics. Electronics 2021, 10, 593. [Google Scholar] [CrossRef]
  126. Karunasingha, D.S.K. Root Mean Square Error or Mean Absolute Error? Use Their Ratio as Well. Inf. Sci. 2022, 585, 609–629. [Google Scholar] [CrossRef]
  127. Kim, S.; Kim, H. A New Metric of Absolute Percentage Error for Intermittent Demand Forecasts. Int. J. Forecast. 2016, 32, 669–679. [Google Scholar] [CrossRef]
  128. Shcherbakov, M.V.; Brebels, A.; Shcherbakova, N.L.; Tyukov, A.P.; Janovsky, T.A.; Kamaev, V.A. A Survey of Forecast Error Measures. World Appl. Sci. J. 2013, 24, 171–176. [Google Scholar]
  129. Onyutha, C. From R-Squared to Coefficient of Model Accuracy for Assessing “Goodness-of-Fits”. Geosci. Model Dev. Discuss. 2020, 2020, 1–25. [Google Scholar] [CrossRef]
  130. Editor, M.B. Why Is There No R-Squared for Nonlinear Regression? Available online: https://blog.minitab.com/en/adventures-in-statistics-2/why-is-there-no-r-squared-for-nonlinear-regression (accessed on 17 May 2024).
  131. Panjapornpon, C.; Bardeeniz, S.; Hussain, M.A. Improving Energy Efficiency Prediction under Aberrant Measurement Using Deep Compensation Networks: A Case Study of Petrochemical Process. Energy 2023, 263, 125837. [Google Scholar] [CrossRef]
  132. Quej, V.H.; Almorox, J.; Ibrakhimov, M.; Saito, L. Empirical Models for Estimating Daily Global Solar Radiation in Yucatán Peninsula, Mexico. Energy Convers. Manag. 2016, 110, 448–456. [Google Scholar] [CrossRef]
  133. Samy, M.M.; Almamlook, R.E.; Elkhouly, H.I.; Barakat, S. Decision-Making and Optimal Design of Green Energy System Based on Statistical Methods and Artificial Neural Network Approaches. Sustain. Cities Soc. 2022, 84, 104015. [Google Scholar] [CrossRef]
  134. Zakari, A.; Khan, I.; Tan, D.; Alvarado, R.; Dagar, V. Energy Efficiency and Sustainable Development Goals (SDGs). Energy 2022, 239, 122365. [Google Scholar] [CrossRef]
  135. Touzani, S.; Granderson, J.; Fernandes, S. Gradient Boosting Machine for Modeling the Energy Consumption of Commercial Buildings. Energy Build. 2018, 158, 1533–1543. [Google Scholar] [CrossRef]
  136. Yang, D.; van der Meer, D. Post-Processing in Solar Forecasting: Ten Overarching Thinking Tools. Renew. Sustain. Energy Rev. 2021, 140, 110735. [Google Scholar] [CrossRef]
  137. Phan, Q.-T.; Wu, Y.-K.; Phan, Q.-D.; Lo, H.-Y. A Novel Forecasting Model for Solar Power Generation by a Deep Learning Framework With Data Preprocessing and Postprocessing. IEEE Trans. Ind. Appl. 2023, 59, 220–231. [Google Scholar] [CrossRef]
  138. Bakker, K.; Whan, K.; Knap, W.; Schmeits, M. Comparison of Statistical Post-Processing Methods for Probabilistic NWP Forecasts of Solar Radiation. Sol. Energy 2019, 191, 138–150. [Google Scholar] [CrossRef]
  139. Alvarenga, R.; Herbaux, H.; Linguet, L. Combination of Post-Processing Methods to Improve High-Resolution NWP Solar Irradiance Forecasts in French Guiana. Eng. Proc. 2022, 18, 27. [Google Scholar] [CrossRef]
  140. Das, U.K.; Tey, K.S.; Seyedmahmoudian, M.; Idna Idris, M.Y.; Mekhilef, S.; Horan, B.; Stojcevski, A. SVR-Based Model to Forecast PV Power Generation under Different Weather Conditions. Energies 2017, 10, 876. [Google Scholar] [CrossRef]
  141. Provan, G.; Buaudacháin, S.Ó. Sensor Calibration and Diagnostics Under Parameter Uncertainty: A Smart Building Application. IFAC Proc. Vol. 2012, 45, 409–414. [Google Scholar] [CrossRef]
  142. Hakami, A.M.; Hasan, K.N.; Alzubaidi, M.; Datta, M. A Review of Uncertainty Modelling Techniques for Probabilistic Stability Analysis of Renewable-Rich Power Systems. Energies 2023, 16, 112. [Google Scholar] [CrossRef]
  143. Siow, L.T.; Lee, J.R.; Ooi, E.H.; Lau, E.V. Application of Graphene and Graphene Derivatives in Cooling of Photovoltaic (PV) Solar Panels: A Review. Renew. Sustain. Energy Rev. 2024, 193, 114288. [Google Scholar] [CrossRef]
  144. Piccoli, E.; Dama, A.; Dolara, A.; Leva, S. Experimental Validation of a Model for PV Systems under Partial Shading for Building Integrated Applications. Sol. Energy 2019, 183, 356–370. [Google Scholar] [CrossRef]
  145. Li, Q.; Zhu, L.; Sun, Y.; Lu, L.; Yang, Y. Performance Prediction of Building Integrated Photovoltaics under No-Shading, Shading and Masking Conditions Using a Multi-Physics Model. Energy 2020, 213, 118795. [Google Scholar] [CrossRef]
  146. Polo, J.; Martín-Chivelet, N.; Alonso-Abella, M.; Alonso-García, C. Photovoltaic Generation on Vertical Façades in Urban Context from Open Satellite-Derived Solar Resource Data. Sol. Energy 2021, 224, 1396–1405. [Google Scholar] [CrossRef]
  147. Hao, W.; Tablada, A.; Shi, X.; Wang, L.; Meng, X. Efficiency Analysis of the Photovoltaic Shading and Vertical Farming System by Employing the Artificial Neural Network (ANN) Method. Buildings 2024, 14, 94. [Google Scholar] [CrossRef]
  148. Nguyen, D.C.; Ishikawa, Y. Artificial Neural Network for Predicting Annual Output Energy of Building-Integrated Photovoltaics Based on the 2-Terminal Perovskite/Silicon Tandem Cells under Realistic Conditions. Energy Rep. 2022, 8, 10819–10832. [Google Scholar] [CrossRef]
  149. De Paiva, G.M.; Pimentel, S.P.; Marra, E.G.; de Alvarenga, B.P.; Mussetta, M.; Leva, S. Intra-Day Forecasting of Building-Integrated PV Systems for Power Systems Operation Using ANN Ensemble. In Proceedings of the 2019 IEEE Milan PowerTech, Milan, Italy, 23–27 June 2019; pp. 1–5. [Google Scholar]
  150. Polo, J.; Martín-Chivelet, N.; Sanz-Saiz, C. BIPV Modeling with Artificial Neural Networks: Towards a BIPV Digital Twin. Energies 2022, 15, 4173. [Google Scholar] [CrossRef]
  151. Lee, H.; Kang, E.; Kim, D.; Yoon, J. Development of the Estimation Model for the Maximum Power Point of Building-Applied Photovoltaic Systems Based on Machine Learning. J. Build. Eng. 2023, 76, 107285. [Google Scholar] [CrossRef]
  152. Dimd, B.D.; Völler, S.; Midtgård, O.-M.; Cali, U.; Sevault, A. Quantification of the Impact of Azimuth and Tilt Angle on the Performance of a PV Output Power Forecasting Model for BIPVs. IEEE J. Photovolt. 2024, 14, 194–200. [Google Scholar] [CrossRef]
  153. Dimd, B.D.; Völler, S.; Midtgård, O.-M.; Sevault, A. The Effect of Mixed Orientation on the Accuracy of a Forecast Model for Building Integrated Photovoltaic Systems. Energy Rep. 2023, 9, 202–207. [Google Scholar] [CrossRef]
  154. Jeong, J.; Lee, D.; Chae, Y.T. A Novel Approach for Day-Ahead Hourly Building-Integrated Photovoltaic Power Prediction by Using Feature Engineering and Simple Weather Forecasting Service. Energies 2023, 16, 7477. [Google Scholar] [CrossRef]
  155. Shin, W.-G.; Shin, J.-Y.; Hwang, H.-M.; Park, C.-H.; Ko, S.-W. Power Generation Prediction of Building-Integrated Photovoltaic System with Colored Modules Using Machine Learning. Energies 2022, 15, 2589. [Google Scholar] [CrossRef]
  156. de Queiroz Corrêa, L.; Bagnis, D.; Rabelo Melo Franco, P.; Ferreira da Costa Junior, E.; Oliveira Souza da Costa, A. Evaluating Energy Generation of a Building-Integrated Organic Photovoltaic Vertical Façade: A Case Study of Latin America’s Pioneering Installation. Renew. Energy 2024, 220, 119662. [Google Scholar] [CrossRef]
  157. Fara, L.; Diaconu, A.; Craciunescu, D.; Fara, S. Forecasting of Energy Production for Photovoltaic Systems Based on ARIMA and ANN Advanced Models. Int. J. Photoenergy 2021, 2021, e6777488. [Google Scholar] [CrossRef]
  158. Luo, X.J.; Oyedele, L.O.; Ajayi, A.O.; Akinade, O.O. Comparative Study of Machine Learning-Based Multi-Objective Prediction Framework for Multiple Building Energy Loads. Sustain. Cities Soc. 2020, 61, 102283. [Google Scholar] [CrossRef]
  159. Kabilan, R.; Chandran, V.; Yogapriya, J.; Karthick, A.; Gandhi, P.P.; Mohanavel, V.; Rahim, R.; Manoharan, S. Short-Term Power Prediction of Building Integrated Photovoltaic (BIPV) System Based on Machine Learning Algorithms. Int. J. Photoenergy 2021, 2021, e5582418. [Google Scholar] [CrossRef]
  160. Dourhmi, M.; Benlamine, K.; Abouelaziz, I.; Zghal, M.; Masrour, T.; Jouane, Y. Improved Hourly Prediction of BIPV Photovoltaic Power Building Using Artificial Learning Machine: A Case Study. In Proceedings of the Emerging Trends in Intelligent Systems & Network Security; Ben Ahmed, M., Abdelhakim, B.A., Ane, B.K., Rosiyadi, D., Eds.; Springer International Publishing: Cham, Switzerland, 2023; pp. 270–280. [Google Scholar]
  161. Photogrammetry and Deep Learning for Energy Production Prediction and Building-Integrated Photovoltaics Decarbonization|Building Simulation. Available online: https://link.springer.com/article/10.1007/s12273-023-1089-y (accessed on 26 August 2024).
  162. Lee, D.; Jeong, J.; Yoon, S.H.; Chae, Y.T. Improvement of Short-Term BIPV Power Predictions Using Feature Engineering and a Recurrent Neural Network. Energies 2019, 12, 3247. [Google Scholar] [CrossRef]
  163. Jouane, Y.; Sow, M.C.; Oussous, O.; Vontobel, N.; Zghal, M. Forecasting Photovoltaic Energy for a Winter House Using a Hybrid Deep Learning Model. In Proceedings of the 2023 12th International Conference on Renewable Energy Research and Applications (ICRERA), Oshawa, ON, Canada, 29 August–1 September 2023; pp. 1–5. [Google Scholar]
  164. Sow, M.C.; Jouane, Y.; Oussous, O.; Vontobel, N.; Zghal, M. Very Short-Term Prediction of Photovoltaic Energy in the Winter Building for an Automatic Energy Management System. In Proceedings of the 2023 IEEE International Conference on Energy Technologies for Future Grids (ETFG), Wollongong, Australia, 3–6 December 2023; pp. 1–5. [Google Scholar]
  165. Sarkar, S.; Karthick, A.; Kumar Chinnaiyan, V.; Patil, P.P. Energy Forecasting of the Building-Integrated Photovoltaic Façade Using Hybrid LSTM. Environ. Sci. Pollut. Res. 2023, 30, 45977–45985. [Google Scholar] [CrossRef]
  166. Polo, J.; Martín-Chivelet, N.; Alonso-Abella, M.; Sanz-Saiz, C.; Cuenca, J.; de la Cruz, M. Exploring the PV Power Forecasting at Building Façades Using Gradient Boosting Methods. Energies 2023, 16, 1495. [Google Scholar] [CrossRef]
  167. Zhi, Y.; Sun, T.; Yang, X. A Physical Model with Meteorological Forecasting for Hourly Rooftop Photovoltaic Power Prediction. J. Build. Eng. 2023, 75, 106997. [Google Scholar] [CrossRef]
  168. Massucco, S.; Mosaico, G.; Saviozzi, M.; Silvestro, F. A Hybrid Technique for Day-Ahead PV Generation Forecasting Using Clear-Sky Models or Ensemble of Artificial Neural Networks According to a Decision Tree Approach. Energies 2019, 12, 1298. [Google Scholar] [CrossRef]
  169. Reynolds, J.; Ahmad, M.W.; Rezgui, Y.; Hippolyte, J.-L. Operational Supply and Demand Optimisation of a Multi-Vector District Energy System Using Artificial Neural Networks and a Genetic Algorithm. Appl. Energy 2019, 235, 699–713. [Google Scholar] [CrossRef]
  170. Gheouany, S.; Ouadi, H.; El Bakali, S. Optimal Active and Reactive Energy Management for a Smart Microgrid System under the Moroccan Grid Pricing Code. Energy 2024, 306, 132462. [Google Scholar] [CrossRef]
  171. Sabzehgar, R.; Amirhosseini, D.Z.; Rasouli, M. Solar Power Forecast for a Residential Smart Microgrid Based on Numerical Weather Predictions Using Artificial Intelligence Methods. J. Build. Eng. 2020, 32, 101629. [Google Scholar] [CrossRef]
  172. Kaffash, M.; Bruninx, K.; Deconinck, G. Data-Driven Forecasting of Local PV Generation for Stochastic PV-Battery System Management. Int. J. Energy Res. 2021, 45, 15962–15979. [Google Scholar] [CrossRef]
  173. Kaffash, M.; Deconinck, G. Ensemble Machine Learning Forecaster for Day Ahead PV System Generation. In Proceedings of the 2019 IEEE 7th International Conference on Smart Energy Grid Engineering (SEGE), Oshawa, ON, Canada, 12–14 August 2019; pp. 92–96. [Google Scholar]
  174. Chung, M.H. Estimating Solar Insolation and Power Generation of Photovoltaic Systems Using Previous Day Weather Data. Adv. Civ. Eng. 2020, 2020, e8701368. [Google Scholar] [CrossRef]
  175. Cordeiro-Costas, M.; Villanueva, D.; Eguía-Oller, P.; Granada-Álvarez, E. Intelligent Energy Storage Management Trade-off System Applied to Deep Learning Predictions. J. Energy Storage 2023, 61, 106784. [Google Scholar] [CrossRef]
  176. Ghenai, C.; Ahmad, F.F.; Rejeb, O.; Bettayeb, M. Artificial Neural Networks for Power Output Forecasting from Bifacial Solar PV System with Enhanced Building Roof Surface Albedo. J. Build. Eng. 2022, 56, 104799. [Google Scholar] [CrossRef]
  177. Parvez, I.; Sarwat, A.; Debnath, A.; Olowu, T.; Dastgir, M.G.; Riggs, H. Multi-Layer Perceptron Based Photovoltaic Forecasting for Rooftop PV Applications in Smart Grid. In Proceedings of the 2020 SoutheastCon, Raleigh, NC, USA, 28–29 March 2020; pp. 1–6. [Google Scholar]
  178. Sarmas, E.; Strompolas, S.; Marinakis, V.; Santori, F.; Bucarelli, M.A.; Doukas, H. An Incremental Learning Framework for Photovoltaic Production and Load Forecasting in Energy Microgrids. Electronics 2022, 11, 3962. [Google Scholar] [CrossRef]
  179. Meteier, Q.; El Kamali, M.; Angelini, L.; Abou Khaled, O. A Recommender System for Increasing Energy Efficiency of Solar-Powered Smart Homes. Sensors 2023, 23, 7974. [Google Scholar] [CrossRef]
  180. Piotrowski, P.; Parol, M.; Kapler, P.; Fetliński, B. Advanced Forecasting Methods of 5-Minute Power Generation in a PV System for Microgrid Operation Control. Energies 2022, 15, 2645. [Google Scholar] [CrossRef]
  181. Hosseini, P.; Taheri, S.; Akhavan, J.; Razban, A. Privacy-Preserving Federated Learning: Application to behind-the-Meter Solar Photovoltaic Generation Forecasting. Energy Convers. Manag. 2023, 283, 116900. [Google Scholar] [CrossRef]
  182. Kim, T.; Kim, J. A Regional Day-Ahead Rooftop Photovoltaic Generation Forecasting Model Considering Unauthorized Photovoltaic Installation. Energies 2021, 14, 4256. [Google Scholar] [CrossRef]
  183. Trigo-González, M.; Cortés-Carmona, M.; Marzo, A.; Alonso-Montesinos, J.; Martínez-Durbán, M.; López, G.; Portillo, C.; Batlles, F.J. Photovoltaic Power Electricity Generation Nowcasting Combining Sky Camera Images and Learning Supervised Algorithms in the Southern Spain. Renew. Energy 2023, 206, 251–262. [Google Scholar] [CrossRef]
  184. VanDeventer, W.; Jamei, E.; Thirunavukkarasu, G.S.; Seyedmahmoudian, M.; Soon, T.K.; Horan, B.; Mekhilef, S.; Stojcevski, A. Short-Term PV Power Forecasting Using Hybrid GASVM Technique. Renew. Energy 2019, 140, 367–379. [Google Scholar] [CrossRef]
  185. Bird, M.; Acha, S.; Brun, N.L.; Shah, N. Assessing the Modelling Approach and Datasets Required for Fault Detection in Photovoltaic Systems. In Proceedings of the 2019 IEEE Industry Applications Society Annual Meeting, Baltimore, MD, USA, 29 September–3 October 2019; pp. 1–6. [Google Scholar]
  186. Scott, C.; Ahsan, M.; Albarbar, A. Machine Learning for Forecasting a Photovoltaic (PV) Generation System. Energy 2023, 278, 127807. [Google Scholar] [CrossRef]
  187. Teixeira, B.; Pinto, T.; Faria, P.; Vale, Z. PV Generation Forecasting Model for Energy Management in Buildings. In Proceedings of the Progress in Artificial Intelligence; Marreiros, G., Melo, F.S., Lau, N., Lopes Cardoso, H., Reis, L.P., Eds.; Springer International Publishing: Cham, Switzerland, 2021; pp. 176–182. [Google Scholar]
  188. Tavares, I.; Manfredini, R.; Almeida, J.; Soares, J.; Ramos, S.; Foroozandeh, Z.; Vale, Z. Comparison of PV Power Generation Forecasting in a Residential Building Using ANN and DNN. IFAC-PapersOnLine 2022, 55, 291–296. [Google Scholar] [CrossRef]
  189. Duhirwe, P.N.; Hwang, J.K.; Ngarambe, J.; Kim, S.; Kim, K.; Song, K.; Yun, G.Y. A Novel Deep Learning-Based Integrated Photovoltaic, Energy Storage System and Electric Heat Pump System: Optimising Energy Usage and Costs. Int. J. Energy Res. 2021, 45, 9306–9325. [Google Scholar] [CrossRef]
  190. Shivam, K.; Tzou, J.-C.; Wu, S.-C. A Multi-Objective Predictive Energy Management Strategy for Residential Grid-Connected PV-Battery Hybrid Systems Based on Machine Learning Technique. Energy Convers. Manag. 2021, 237, 114103. [Google Scholar] [CrossRef]
  191. Kazem, H.A.; Yousif, J.; Chaichan, M.T.; Al-Waeli, A.H.A. Experimental and Deep Learning Artificial Neural Network Approach for Evaluating Grid-Connected Photovoltaic Systems. Int. J. Energy Res. 2019, 43, 8572–8591. [Google Scholar] [CrossRef]
  192. Park, M.K.; Lee, J.M.; Kang, W.H.; Choi, J.M.; Lee, K.H. Predictive Model for PV Power Generation Using RNN (LSTM). J. Mech. Sci. Technol. 2021, 35, 795–803. [Google Scholar] [CrossRef]
  193. Son, N.; Jung, M. Analysis of Meteorological Factor Multivariate Models for Medium- and Long-Term Photovoltaic Solar Power Forecasting Using Long Short-Term Memory. Appl. Sci. 2021, 11, 316. [Google Scholar] [CrossRef]
  194. Xu, X.; Xu, Y.; Wang, M.-H.; Li, J.; Xu, Z.; Chai, S.; He, Y. Data-Driven Game-Based Pricing for Sharing Rooftop Photovoltaic Generation and Energy Storage in the Residential Building Cluster Under Uncertainties. IEEE Trans. Ind. Inform. 2021, 17, 4480–4491. [Google Scholar] [CrossRef]
  195. Alden, R.E.; Gong, H.; Jones, E.S.; Ababei, C.; Ionel, D.M. Artificial Intelligence Method for the Forecast and Separation of Total and HVAC Loads With Application to Energy Management of Smart and NZE Homes. IEEE Access 2021, 9, 160497–160509. [Google Scholar] [CrossRef]
  196. Wen, L.; Zhou, K.; Yang, S.; Lu, X. Optimal Load Dispatch of Community Microgrid with Deep Learning Based Solar Power and Load Forecasting. Energy 2019, 171, 1053–1065. [Google Scholar] [CrossRef]
  197. Lateko, A.A.H.; Yang, H.-T.; Huang, C.-M.; Aprillia, H.; Hsu, C.-Y.; Zhong, J.-L.; Phương, N.H. Stacking Ensemble Method with the RNN Meta-Learner for Short-Term PV Power Forecasting. Energies 2021, 14, 4733. [Google Scholar] [CrossRef]
  198. Iao, H.-W.; Lao, K.-W.; Kang, J. Learning-Based Solar Power and Load Forecasting in DC Net-Zero Energy Building with Incomplete Data. In Proceedings of the 2022 7th International Conference on Power and Renewable Energy (ICPRE), Shanghai, China, 23–26 September 2022; pp. 583–589. [Google Scholar]
  199. Iao, H.-W.; Lao, K.-W. Integrated Load Consumption and PV Output Forecasting of Net-Zero Energy Buildings Considering KNN-GAN Data Augmentation. In Proceedings of the 2023 International Conference on Power Energy Systems and Applications (ICoPESA), Nanjing, China 24–26 February 2023; pp. 399–406. [Google Scholar]
  200. Hu, Z.; Gao, Y.; Ji, S.; Mae, M.; Imaizumi, T. Improved Multistep Ahead Photovoltaic Power Prediction Model Based on LSTM and Self-Attention with Weather Forecast Data. Appl. Energy 2024, 359, 122709. [Google Scholar] [CrossRef]
  201. Hernandez-Robles, I.A.; González-Ramírez, X.; Antonio Álvarez-Jaime, J. Effectiveness of Forecasters Based on Neural Networks for Energy Management in Zero Energy Buildings. Energy Build. 2024, 316, 114372. [Google Scholar] [CrossRef]
  202. Cordeiro-Costas, M.; Labandeira-Pérez, H.; Villanueva, D.; Pérez-Orozco, R.; Eguía-Oller, P. NSGA-II Based Short-Term Building Energy Management Using Optimal LSTM-MLP Forecasts. Int. J. Electr. Power Energy Syst. 2024, 159, 110070. [Google Scholar] [CrossRef]
  203. Li, Q.; Xu, Y.; Chew, B.S.H.; Ding, H.; Zhao, G. An Integrated Missing-Data Tolerant Model for Probabilistic PV Power Generation Forecasting. IEEE Trans. Power Syst. 2022, 37, 4447–4459. [Google Scholar] [CrossRef]
  204. de Carvalho Costa, R.L. Convolutional-LSTM Networks and Generalization in Forecasting of Household Photovoltaic Generation. Eng. Appl. Artif. Intell. 2022, 116, 105458. [Google Scholar] [CrossRef]
  205. Sarmas, E.; Spiliotis, E.; Stamatopoulos, E.; Marinakis, V.; Doukas, H. Short-Term Photovoltaic Power Forecasting Using Meta-Learning and Numerical Weather Prediction Independent Long Short-Term Memory Models. Renew. Energy 2023, 216, 118997. [Google Scholar] [CrossRef]
  206. Guo, X.; Mo, Y.; Yan, K. Short-Term Photovoltaic Power Forecasting Based on Historical Information and Deep Learning Methods. Sensors 2022, 22, 9630. [Google Scholar] [CrossRef]
  207. Pan, K.; Xie, C.; Lai, C.S.; Wang, D.; Lai, L.L. Photovoltaic Output Power Estimation and Baseline Prediction Approach for a Residential Distribution Network with Behind-the-Meter Systems. Forecasting 2020, 2, 470–487. [Google Scholar] [CrossRef]
  208. Sangrody, H.; Zhou, N.; Zhang, Z. Similarity-Based Models for Day-Ahead Solar PV Generation Forecasting. IEEE Access 2020, 8, 104469–104478. [Google Scholar] [CrossRef]
  209. Tripathy, D.S.; Rajanarayan Prusty, B.; Jena, D.; Sahu, M.K. Multi-Time Instant Probabilistic PV Generation Forecasting Using Quantile Regression Forests. In Proceedings of the 2020 IEEE 9th Power India International Conference (PIICON), Sonepat, India, 28 February–1 March 2020; pp. 1–6. [Google Scholar]
  210. Tripathy, D.S.; Prusty, B.R.; Bingi, K. A K-Nearest Neighbor-Based Averaging Model for Probabilistic PV Generation Forecasting. Int. J. Numer. Model. Electron. Netw. Devices Fields 2022, 35, e2983. [Google Scholar] [CrossRef]
  211. Kabir, F.; Yu, N.; Yao, W.; Yang, R.; Zhang, Y. Estimation of Behind-the-Meter Solar Generation by Integrating Physical with Statistical Models. In Proceedings of the 2019 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm), Beijing, China, 21–23 October 2019; pp. 1–6. [Google Scholar]
  212. Kabir, F.; Yu, N.; Yao, W.; Yang, R.; Zhang, Y. Joint Estimation of Behind-the-Meter Solar Generation in a Community. IEEE Trans. Sustain. Energy 2021, 12, 682–694. [Google Scholar] [CrossRef]
  213. Allouhi, A. Solar PV Integration in Commercial Buildings for Self-Consumption Based on Life-Cycle Economic/Environmental Multi-Objective Optimization. J. Clean. Prod. 2020, 270, 122375. [Google Scholar] [CrossRef]
  214. Kallio, S.; Siroux, M. Photovoltaic Power Prediction for Solar Micro-Grid Optimal Control. Energy Rep. 2023, 9, 594–601. [Google Scholar] [CrossRef]
  215. Capotosto, T.; Rita Di Fazio, A.; Perna, S.; Conte, F.; Iannello, G.; De Falco, P. Day-Ahead Forecast of PV Systems and End-Users in the Contest of Renewable Energy Communities. In Proceedings of the 2022 AEIT International Annual Conference (AEIT), Rome, Italy, 3–5 October 2022; pp. 1–6. [Google Scholar]
  216. Jiranantacharoen, P.; Benjapolakul, W. Photovoltaic Power Generation Forecast by Using Estimator Model and Kalman Filter. In Proceedings of the 2019 International Conference on Power, Energy and Innovations (ICPEI), Pattaya, Thailand, 16–18 October 2019; pp. 12–15. [Google Scholar]
  217. Gellert, A.; Fiore, U.; Florea, A.; Chis, R.; Palmieri, F. Forecasting Electricity Consumption and Production in Smart Homes through Statistical Methods. Sustain. Cities Soc. 2022, 76, 103426. [Google Scholar] [CrossRef]
  218. Lee, S.; Park, S.; Kang, B.; Choi, M.; Jang, H.; Shmilovitz, D.; Park, S. Enhancing Zero-Energy Building Operations for ESG: Accurate Solar Power Prediction through Automatic Machine Learning. Buildings 2023, 13, 2050. [Google Scholar] [CrossRef]
  219. Furusawa, H.; Horita, Y. Effect Verification of Training Period for Prediction of Photovoltaic Power Generation Using ML. In Proceedings of the 2023 IEEE 12th Global Conference on Consumer Electronics (GCCE), Nara, Japan, 10–13 October 2023; pp. 293–295. [Google Scholar]
  220. Lee, M.; Park, J.; Na, S.-I.; Choi, H.S.; Bu, B.-S.; Kim, J. An Analysis of Battery Degradation in the Integrated Energy Storage System with Solar Photovoltaic Generation. Electronics 2020, 9, 701. [Google Scholar] [CrossRef]
  221. Bottieau, J.; De Grève, Z.; Piraux, T.; Dubois, A.; Vallée, F.; Toubeau, J.-F. A Cross-Learning Approach for Cold-Start Forecasting of Residential Photovoltaic Generation. Electr. Power Syst. Res. 2022, 212, 108415. [Google Scholar] [CrossRef]
  222. Zhang, R.; Ma, H.; Hua, W.; Saha, T.K.; Zhou, X. Data-Driven Photovoltaic Generation Forecasting Based on a Bayesian Network With Spatial–Temporal Correlation Analysis. IEEE Trans. Ind. Inform. 2020, 16, 1635–1644. [Google Scholar] [CrossRef]
  223. Bot, K.; Santos, S.; Laouali, I.; Ruano, A.; da Graça Ruano, M. Design of Ensemble Forecasting Models for Home Energy Management Systems. Energies 2021, 14, 7664. [Google Scholar] [CrossRef]
  224. Yang, Z.; Mourshed, M.; Liu, K.; Xu, X.; Feng, S. A Novel Competitive Swarm Optimized RBF Neural Network Model for Short-Term Solar Power Generation Forecasting. Neurocomputing 2020, 397, 415–421. [Google Scholar] [CrossRef]
  225. Sun, F.; Li, L.; Bian, D.; Ji, H.; Li, N.; Wang, S. Short-Term PV Power Data Prediction Based on Improved FCM with WTEEMD and Adaptive Weather Weights. J. Build. Eng. 2024, 90, 109408. [Google Scholar] [CrossRef]
  226. Peng, T.; Fu, Y.; Wang, Y.; Xiong, J.; Suo, L.; Nazir, M.S.; Zhang, C. An Intelligent Hybrid Approach for Photovoltaic Power Forecasting Using Enhanced Chaos Game Optimization Algorithm and Locality Sensitive Hashing Based Informer Model. J. Build. Eng. 2023, 78, 107635. [Google Scholar] [CrossRef]
  227. Khan, W.; Walker, S.; Zeiler, W. Improved Solar Photovoltaic Energy Generation Forecast Using Deep Learning-Based Ensemble Stacking Approach. Energy 2022, 240, 122812. [Google Scholar] [CrossRef]
  228. Etxegarai, G.; Zapirain, I.; Camblong, H.; Ugartemendia, J.; Hernandez, J.; Curea, O. Photovoltaic Energy Production Forecasting in a Short Term Horizon: Comparison between Analytical and Machine Learning Models. Appl. Sci. 2022, 12, 12171. [Google Scholar] [CrossRef]
  229. Kaffash, M.; Ceusters, G.; Deconinck, G. Interval Optimization to Schedule a Multi-Energy System with Data-Driven PV Uncertainty Representation. Energies 2021, 14, 2739. [Google Scholar] [CrossRef]
  230. Hatamian, M.; Panigrahi, B.; Dehury, C.K. Location-Aware Green Energy Availability Forecasting for Multiple Time Frames in Smart Buildings: The Case of Estonia. Meas. Sens. 2023, 25, 100644. [Google Scholar] [CrossRef]
  231. Di Giovanni, G.; Rotilio, M.; Giusti, L.; Ehtsham, M. Exploiting Building Information Modeling and Machine Learning for Optimizing Rooftop Photovoltaic Systems. Energy Build. 2024, 313, 114250. [Google Scholar] [CrossRef]
  232. Cordeiro-Costas, M.; Villanueva, D.; Eguía-Oller, P.; Granada-Álvarez, E. Machine Learning and Deep Learning Models Applied to Photovoltaic Production Forecasting. Appl. Sci. 2022, 12, 8769. [Google Scholar] [CrossRef]
  233. Mohana, M.; Saidi, A.S.; Alelyani, S.; Alshayeb, M.J.; Basha, S.; Anqi, A.E. Small-Scale Solar Photovoltaic Power Prediction for Residential Load in Saudi Arabia Using Machine Learning. Energies 2021, 14, 6759. [Google Scholar] [CrossRef]
  234. Abdulrazzaq, A.J. Performance Assessment of Rooftop Photovoltaic Systems Using Real-World Weather Data and Machine Learning-Based Prediction. J. Eng. Comput. Sci. 2025, 4, 905–911. [Google Scholar]
  235. Kut, P.; Pietrucha-Urbanik, K. Forecasting Short-Term Photovoltaic Energy Production to Optimize Self-Consumption in Home Systems Based on Real-World Meteorological Data and Machine Learning. Energies 2025, 18, 4403. [Google Scholar] [CrossRef]
  236. D’Agostino, D.; Minelli, F.; D’Urso, M.; Minichiello, F. Fixed and Tracking PV Systems for Net Zero Energy Buildings: Comparison between Yearly and Monthly Energy Balance. Renew. Energy 2022, 195, 809–824. [Google Scholar] [CrossRef]
  237. D’Agostino, D.; Minelli, F.; Minichiello, F. An Innovative Multi-Stakeholder Decision Methodology for the Optimal Energy Retrofit of Shopping Mall Buildings. Energy Build. 2025, 344, 115958. [Google Scholar] [CrossRef]
  238. Michalakopoulos, V.; Sarantinopoulos, E.; Sarmas, E.; Marinakis, V. Empowering Federated Learning Techniques for Privacy-Preserving PV Forecasting. Energy Rep. 2024, 12, 2244–2256. [Google Scholar] [CrossRef]
  239. Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention Is All You Need. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; Curran Associates Inc.: Red Hook, NY, USA, 2017; pp. 6000–6010. [Google Scholar]
  240. Husein, M.; Gago, E.J.; Hasan, B.; Pegalajar, M.C. Towards Energy Efficiency: A Comprehensive Review of Deep Learning-Based Photovoltaic Power Forecasting Strategies. Heliyon 2024, 10, e33419. [Google Scholar] [CrossRef] [PubMed]
  241. Oliveira Santos, V.; Marinho, F.P.; Costa Rocha, P.A.; Thé, J.V.G.; Gharabaghi, B. Application of Quantum Neural Network for Solar Irradiance Forecasting: A Case Study Using the Folsom Dataset, California. Energies 2024, 17, 3580. [Google Scholar] [CrossRef]
  242. Kumar K, K.; Nutakki, M.; Koduru, S.; Mandava, S. Quantum Support Vector Machine for Forecasting House Energy Consumption: A Comparative Study with Deep Learning Models. J. Cloud Comput. 2024, 13, 105. [Google Scholar] [CrossRef]
  243. Khan, S.Z.; Muzammil, N.; Ghafoor, S.; Khan, H.; Zaidi, S.M.H.; Aljohani, A.J.; Aziz, I. Quantum Long Short-Term Memory (QLSTM) vs. Classical LSTM in Time Series Forecasting: A Comparative Study in Solar Power Forecasting. Front. Phys. 2024, 12, 1439180. [Google Scholar] [CrossRef]
  244. Ayasi, B.; Vázquez, I.X.; Saleh, M.; Garcia-Vico, A.M.; Carmona, C.J. Application of Spiking Neural Networks and Traditional Artificial Neural Networks for Solar Radiation Forecasting in Photovoltaic Systems in Arab Countries. Neural Comput. Applic. 2025, 37, 9095–9127. [Google Scholar] [CrossRef]
  245. Thangaraj, V.k.; Nachimuthu, D.S.; Francis, V.A.R. Wind Speed Forecasting at Wind Farm Locations with an Unique Hybrid PSO-ALO Based Modified Spiking Neural Network. Energy Syst. 2025, 16, 713–741. [Google Scholar] [CrossRef]
  246. Chen, B.; Li, H.; Han, K. Short-Term Photovoltaic Power Forecasting Method Based on K-Means Algorithm and Spiking Neural Networks. In Proceedings of the 8th Renewable Power Generation Conference (RPG 2019), Shanghai, China, 24–25 October 2019. [Google Scholar] [CrossRef]
  247. Hu, Y.; Cheng, X.; Wang, S.; Chen, J.; Zhao, T.; Dai, E. Times Series Forecasting for Urban Building Energy Consumption Based on Graph Convolutional Network. Appl. Energy 2022, 307, 118231. [Google Scholar] [CrossRef]
  248. Zhu, N.; Wang, Y.; Yuan, K.; Yan, J.; Li, Y.; Zhang, K. GGNet: A Novel Graph Structure for Power Forecasting in Renewable Power Plants Considering Temporal Lead-Lag Correlations. Appl. Energy 2024, 364, 123194. [Google Scholar] [CrossRef]
  249. Yang, C.; Li, S.; Gou, Z. Spatiotemporal Prediction of Urban Building Rooftop Photovoltaic Potential Based on GCN-LSTM. Energy Build. 2025, 334, 115522. [Google Scholar] [CrossRef]
  250. Boukaf, M.; Fadli, F.; Meskin, N. A Comprehensive Review of Digital Twin Technology in Building Energy Consumption Forecasting. IEEE Access 2024, 12, 187778–187799. [Google Scholar] [CrossRef]
  251. Kandemir, E.; Hasan, A.; Kvamsdal, T.; Abdel-Afou Alaliyat, S. Predictive Digital Twin for Wind Energy Systems: A Literature Review. Energy Inf. 2024, 7, 68. [Google Scholar] [CrossRef]
  252. Olayiwola, O.; Cali, U.; Elsden, M.; Yadav, P. Enhanced Solar Photovoltaic System Management and Integration: The Digital Twin Concept. Solar 2025, 5, 7. [Google Scholar] [CrossRef]
  253. El-Baz, W.; Tzscheutschler, P.; Wagner, U. Day-Ahead Probabilistic PV Generation Forecast for Buildings Energy Management Systems. Sol. Energy 2018, 171, 478–490. [Google Scholar] [CrossRef]
  254. MILP Optimized Management of Domestic PV-Battery Using Two Days-Ahead Forecasts. Available online: https://ieeexplore.ieee.org/abstract/document/9732432 (accessed on 7 September 2025).
  255. Impacts of Forecast Accuracy on Grid Integration of Renewable Energy Sources. Available online: https://ieeexplore.ieee.org/document/6652486 (accessed on 9 September 2025).
Figure 1. Method of selecting articles.
Figure 1. Method of selecting articles.
Energies 18 05007 g001
Figure 2. Different types of inverters schematic.
Figure 2. Different types of inverters schematic.
Energies 18 05007 g002
Figure 3. Shallow Neural Networks (A) and Deep Neural Networks (B).
Figure 3. Shallow Neural Networks (A) and Deep Neural Networks (B).
Energies 18 05007 g003
Figure 4. The step-by-step process for applying ML.
Figure 4. The step-by-step process for applying ML.
Energies 18 05007 g004
Figure 5. Distribution of the reported NRMSE (%) values.
Figure 5. Distribution of the reported NRMSE (%) values.
Energies 18 05007 g005
Figure 6. Geographical focus of the studies.
Figure 6. Geographical focus of the studies.
Energies 18 05007 g006
Table 1. Different forecasting horizons [50].
Table 1. Different forecasting horizons [50].
ClassificationRangeApplication
Very short-termSeconds to minutesControl and management of PV systems, electricity market operations, and microgrid control
Short-termUp to 72 hControl of power system operations, unit commitment, and economic dispatch
Medium-termA few days to a weekPlanning maintenance and operations of solar power plants
Long-termMonths to a yearPlanning maintenance and operations of solar power plants
Table 2. Databases.
Table 2. Databases.
Data Focusing onRegion/ExplanationReferences
Weather dataGlobal[73,74,75,76]
The USA[77,78]
Korea[79]
Swedish Meteorological and Hydrological Institute (SMHI)[80]
- Desert Knowledge Australia Solar Center (DKASC)
- Quality data and knowledge related to solar power
[81]
Solar dataGlobal[82,83,84]
PV generation dataGlobal[85,86]
Weather variables and PV generation dataGlobal[87,88]
Global level + Other data sets related to power systems[89]
Global level + Other data sets related to energy[90]
A case study + Other related data sets[91,92]
Wind dataGlobal[93]
Table 3. The concept and differences between ensemble and hybrid methods.
Table 3. The concept and differences between ensemble and hybrid methods.
AspectEnsemble MethodsHybrid Methods
DefinitionCombine multiple instances of the same or similar models [123] (e.g., RFs (ensemble of DTs))Combine different types of models (e.g., Physical model + ML) [124]
FocusCombine similar model outputs to create a stronger predictive model [123]Captures different aspects of the problem (e.g., physical + stochastic patterns) [124]
Use in PV
Forecasting
Effective for improving overall prediction accuracyModeling complex environmental and system interactions
Table 4. Key Metrics.
Table 4. Key Metrics.
MetricAbbreviationFormulaExplanation
Maximum ErrorMaxE m a x i = 1 n ( p r e d i c t i o n i a c t u a l i ) The maximum error between the predicted and actual values
Minimum ErrorMinE m i n i = 1 n ( p r e d i c t i o n i a c t u a l i ) The minimum error between predicted and actual values
Absolute Error A E i p r e d i c t i o n i a c t u a l i The absolute difference between predicted and actual values
Maximum Absolute ErrorMaxAE m a x i = 1 n p r e d i c t i o n i a c t u a l i The maximum of absolute errors
Minimum Absolute ErrorMinAE m i n i = 1 n p r e d i c t i o n i a c t u a l i The minimum absolute error
Maximum Absolute Percentage ErrorMaxAPE m a x i = 1 n p r e d i c t i o n i a c t u a l i a c t u a l i The maximum percentage error relative to actual values
Relative Error R E i p r e d i c t i o n i a c t u a l i a c t u a l i The error relative to actual values
Mean Relative ErrorMRE 1 n i = 1 n p r e d i c t i o n i a c t u a l i a c t u a l i The average of relative errors
Relative Absolute ErrorRAE i = 1 n p r e d i c t i o n i a c t u a l i i = 1 n a c t u a l i a c t u a l ¯ The error relative to the absolute deviation from the mean of actuals
Mean Absolute ErrorMAE 1 n i = 1 n p r e d i c t i o n i a c t u a l i The average of absolute errors
Normalized Mean Absolute ErrorNMAE M A E a c t u a l m a x a c t u a l m i n Normalizes MAE by the range of actual values
Mean Absolute Percentage ErrorMAPE 1 n i = 1 n p r e d i c t i o n i a c t u a l i a c t u a l i The average of absolute percentage errors
Mean Squared ErrorMSE 1 n i = 1 n ( p r e d i c t i o n i a c t u a l i ) 2 Average of squared errors
Root Mean Square ErrorRMSE 1 n i = 1 n ( p r e d i c t i o n i a c t u a l i ) 2 The square root of MSE
Root Mean Squared Percentage ErrorRMSPE 1 n i = 1 n ( p r e d i c t i o n i a c t u a l i a c t u a l i ) 2 The square root of the mean squared percentage error
Normalized Root Mean Square ErrorNRMSE R M S E a c t u a l m a x a c t u a l m i n Normalizes RMSE by the range of actual values
Relative Root Mean Squared ErrorRRMSE R M S E a c t u a l ¯ RMSE relative to the actual mean
Coefficient of VariationCV S t a n d a r d D e v i a t i o n o f a c t u a l a c t u a l ¯ Measures the variability of actual values relative to their mean
Coefficient of Variation of Root Mean Squared ErrorCVRMSE R M S E a c t u a l ¯ Normalizes RMSE relative to the mean of actuals
Mean Absolute Scaled ErrorMASE MAE 1 n 1 i = 2 n a c t u a l i a c t u a l i 1 Scales MAE using the mean absolute error from a naive prediction model
Coefficient of DeterminationR2 1 i = 1 n ( a c t u a l i p r e d i c t i o n i ) 2 i = 1 n ( a c t u a l i a c t u a l ¯ ) 2 The proportion of variance in actual values explained by predictions
Mean Bias ErrorMBE 1 n i = 1 n ( p r e d i c t i o n i a c t u l i ) The average bias between predictions and actuals
Normalized Mean Bias ErrorNMBE M B E a c t u a l m a x a c t u a l m i n or M B E a c t u a l ¯ Normalizes the mean bias error by either the range or mean of actuals
Mean Absolute DeviationMAD 1 n i = 1 n a c t u a l i a c t u a l ¯ The average deviation of actual values from their mean
* a c t u a l ¯ is the mean.
Table 5. Methodologies employed in the reviewed articles on BIPV.
Table 5. Methodologies employed in the reviewed articles on BIPV.
Models Methodologies/TechniquesRefs.
Physical Models-Simplified electrical mathematical model considering multi-physics[145]
SAPM[146]
Statistical ModelsNNANN[147,148,149,150,151,157,158,159,160,161,162]
LSTM[152,153,158,163,164]
NARX[154]
Regression Neural Network[155]
CNN[161,163,164]
RNN[162]
MLLR[156,160,161]
ARIMA[157]
SVM[158,162]
QSVM[159]
DT[159,161]
DTR[160]
SVR[160,161]
CART[162]
CHAID[162]
Hybrid Models-CNN-LSTM[163,164]
RF-LSTM-CEEMDAN[165]
Ensemble Models-RF[160,161,162,166]
Adaptive Boosting[161]
XGBoost[166]
Table 6. Methodologies employed in the reviewed articles on BAPV.
Table 6. Methodologies employed in the reviewed articles on BAPV.
Models Methodologies/TechniquesRefs.
Physical Models-An equivalent circuit model[167]
Clear-Sky Model[168]
Corrected Clear-Sky Model[168]
OpenModelica[228]
Statistical ModelsNNANN[169,170,171,183,185,186,187,188,214]
FFNN[172,174,228,229]
SNN[175,232]
NARX[176,228]
MLP[177,178,179,180,201,230]
DNN[188,189,233]
Residual Dilated Causal Convolutional Network[190]
RNN[191,232]
Time Lagged Recurrent Networks[191]
LSTM[192,193,194,195,199,201,204]
BiLSTM[198,201]
Deep Recurrent Neural Network-LSTM[196]
GRU[198,201]
Transformer models[199]
Recursive LSTM[203]
CNN[204,232]
MLPersistence model[180,215,229]
LR[180,186,233]
KNN[180,207,230,231]
SVM[180,183,185,186,187,233]
SVR[182,228,229,230,231,232]
ARIMA[187,215,217]
SBFMs based on KNN[208]
QRF[209]
Quantile K-Nearest Neighbors Regression Averaging[210]
Markov model regression[211,212]
MLR[213,214,215]
TBATS[217]
AML[218,219]
Autoregressive model[220]
cross-learning[221]
Bayesian Network With Spatial-Temporal Correlation Analysis[222]
Simple Moving Average[229]
DTs[231]
LASSO[229,233]
Polynomial Regression[233]
Hybrid Models-Connection of three MLP models[180]
MLP-based FL[181]
GA-SVM[184]
LSTM with self-attention mechanisms[200]
LSTM-MLP[202]
CNN-LSTM[198,204]
Singular Spectrum Analysis + BiLSTM + Bayesian Optimization[206]
ARIMA combined with the Kalman filter[216]
CSO-RBF[224]
WTEEMD-FCM-IGWO-LSTM[225]
VMD-Enhanced Chaos Game Optimization-Locality Sensitive Hashing Attention-Informer model[226]
Ensemble Models-Basic Ensemble Method of NNs[168]
Ensembles of FFNN and SVM[173]
ETs[179]
Gradient-Boosted Trees[180]
RFs[180,185,186,187,230,231,232,233]
Weighted Averaging Ensemble[180]
Ensemble Method with the RNN Meta-Learner[197]
Meta-learning used four models of LSTM[205]
RBF-ANN[223]
DSE-XGB[227]
XGBoost[230,232,233]
Table 7. Metrics employed in the reviewed articles on BAPV, BIPV, and Total.
Table 7. Metrics employed in the reviewed articles on BAPV, BIPV, and Total.
MetricBAPVBIPVTotal
RMSE
(37) + (17)
[168,170,174,177,178,179,180,181,184,185,186,187,191,192,193,196,198,199,200,201,204,205,206,209,210,214,216,219,222,223,224,225,226,227,228,230,231][145,146,147,148,149,150,152,153,155,157,159,160,162,163,164,165,166]54
MAE
(30) + (11)
[167,170,172,174,177,178,187,188,190,193,196,197,200,204,206,207,208,209,210,214,215,217,218,219,225,226,227,228,230,231][147,148,149,152,155,157,159,160,163,164,165]41
R2
(17) + (10)
[169,179,189,190,191,197,200,201,214,218,219,225,226,227,228,230,232][150,151,154,155,156,157,159,160,161,165]27
MAPE
(15) + (7)
[171,181,182,184,185,186,187,190,191,196,198,199,215,220,230][148,154,157,158,159,162,165]22
NRMSE
(21) + (0)
[168,170,172,175,177,178,182,183,188,189,190,197,202,203,204,208,213,216,221,229,232]-21
MSE
(11) + (9)
[171,176,191,198,199,211,212,213,214,226,233][162,165,167,171,178,179,181,182,183]20
NMAE
(5) + (1)
[170,177,182,215,229][166]6
MRE
(5) + (1)
[167,173,185,197,208][150]6
MBE
(3) + (2)
[168,174,180][150,166]5
NMBE
(5) + (0)
[175,183,189,202,232]-5
Table 8. Overview of the reviewed PV forecasting studies in the building sector.
Table 8. Overview of the reviewed PV forecasting studies in the building sector.
MethodsForecasting AccuracyTipsDeployment TypesGeographical LocationRef.
Simplified electrical mathematical model considering multi-physicsMSE & RMSE Illustrated on fig.- Shading
- Masking
University BIPV labTianjin/China[145]
SAPMR2 for each façade
East = 0.73
South = 0.89
West = 0.90
ShadingUniversity campusMadrid/Spain[146]
ANNR2 =
from 0.63 to 0.88
Vertical FarmingResidentialSingapore[147]
ANNMSE = 1.26_RooftopGifu/Japan[148]
ANNFor 15-min
MAE = 34.17 W/m2
RMSE = 60.08 W/m2
_Engineering SchoolGoiania/Brazil[149]
ANNBased on the different façades
MRE ranged approximately from 6% to 15%
Partial shadingUniversity areaMadrid/Spain[150]
ANNR2 = 0.9768
CvRMSE (%) = 35.22
Partial shading
_Republic of Korea[151]
LSTMRMSE (kW) = 2.24
MAE (kW) = 1.12
WAPE (%) = 4.66
_ZEB laboratoryTrondheim/Norway[152]
LSTMRMSE (kW) = 2.242
WAPE (%) = 4.664
_ZEB laboratoryTrondheim/Norway[153]
NARXFor Partially Cloudy days
MAPE = 11.55%
R2 = 0.95
_OfficeSeoul/Korea[154]
Regression Neural NetworkRMSE (kW) = 0.0754
MAE (kW) = 0.0372
- Colored PV
- Shading
_Daejeon/Korea[155]
Mathematical modeling with multiple regressionR2 = 0.81Organic PV_São Paulo/Brazil[156]
ARIMA + ANNMonthly:
rRMSE (%) = from 6.2 and 53
_University labBucharest/Romania[157]
ANNMPE (%) = 6.29ShadingOffice buildingThe UK[158]
ANNRMSE = 4.42%
R2 = 0.8833
_Residential buildingsKovilpatti/India[159]
CNNMSE (kW) = 0.046
R2 = 0.96
- Several models examinedCampus buildingStrasbourg/France[161]
RNNMAPE (%) = 23.79_Office buildingSouth Korea[162]
CNN + LSTMMSE & RMSE Illustrated on fig._Winter housePoschiavo/Switzerland[163]
CNN + LSTMMAE = 4.98
RMSE = 14.06
_Winter housePoschiavo/Switzerland[164]
CEEMDAN  +  RF + LSTMFor a Flat roof:
RMSE = 2.97
MAE = 2.475
_Institute of EngineeringIndia[165]
XGBoostRMSE = 0.89 and 0.21 kW for south and eastShadingOfficeMadrid/Spain[166]
Maximum Power Point Tracking algorithmMAE =15.9%
MRE = 18.1%
_HouseholdRuicheng/China[167]
Hybrid TreeRMSE (kW) = 1.5583
nRMSE = 0.3892
ShadingUniversity/Economics SchoolGenova/Italy[168]
Back-propagation ANNR2 (Train) = 0.9489
R2 (Test) = 0.9412
_Several typesCardiff/The UK[169]
Stacking-based ANNShort-term:
NRMSE (%) = 6.49
NMAE (%) = 3.68
_
ResidentialMorocco[170]
NN-based NWPMAPE = 45.30%_ResidentialSan Diego/The USA[171]
FFNNMRE = 9.15%_Research centerGenk/Belgium[172]
FFNN + SVM + RFNRMSE (%) = 11.89_Research centerGenk/Belgium[173]
Multilayer FFNNRMSE (kWh) = 1.421
MAE (kWh) = 1.133
__Seoul/Korea[174]
SNNnRMSE = 7.15%
nMBE = −0.21%
_University CampusVigo/Spain[175]
NARXFor Albedo α = 0.8
nMSE = 6.06634 × 10−1
Bifacial PV_Sharjah/UAE[176]
MLPMAE = 0.0809
nRMSE = 0.0054
_Smart home
(NREL Database)
USA[177]
MLPMAE = 6.697
RMSE = 13.260
nRMSE = 0.527
_HeadquartersTerni/Italy[178]
MLPR2 = 84.81
RMSE = 0.36
_Smart HomeSwiss[179]
MLPRMSE (w) = 61.633
nMAPE (%) = 0.805
_University PV LaboratoryWarsaw/Poland[180]
MLP-based FLCase 1 Average:
RMSE (%) = 9.72
MAPE (%) = 12.88
Behind-the-meterSeveral residencesNew Mexico/USA[181]
SVRnMAE (%) = 2.95%
nRMSE (%) = 5.41%
Behind-the-meterUtilitySydney/Australia[182]
ANNnRMSE (%) range from 7.71 to 21.43- Several conditions of the skySolar Energy Research CentreAlmería/Spain[183]
GA-based SVMRMSE (W) = 11.226
MAPE (%) = 1.7052
_Deakin University, Engineering Dep.Victoria/Australia[184]
RFMRE (%) = 2.7_Several commercial rooftopsThe UK[185]
RFRMSE = 32_A non-domestic buildingThe UK[186]
ANNMAPE from 0.1868 to 0.2073_GECAD research group buildingPorto/Portugal[187]
ANNMAE (kW) = 0.09223
SMAPE = 0.04947
WAPE = 0.09894
nRMSE = 0.06213
_A Residential Building
(15 apartments)
Due to restrictions, it’s not possible to tell the location[188]
4-layer DNNR2 = 0.95_Retail shopKorea[189]
- Residual Dilated Causal Convolutional NetworkR2 = 0.9308
MAPE = 3.4819
SMAPE = 1.2003
NRMSE = 0.0589
_ResidentialTainan/Taiwan[190]
- Time Lagged Recurrent Networks and RNNsMAPE (%) = 1.5032_Solar energy lab, Sohar UniversityOman[191]
Multi-layer-LSTMCv(RMSE) = 13.2 %__Jincheon/Korea[192]
- LSTM
- 22 multivariate models (combining solar radiation, sunlight, etc.)
Medium-term No.
RMSE = 5.42
MAE = 3.21
Long-term No.
RMSE = 9.23
MAE = 5.83
- 22 multivariate models (combining solar radiation, sunlight, etc.)An industrial buildingGyeonggi-do/Korea[193]
LSTM__- 6 apartments
Each has 60 households
Australia[194]
LSTMDaily:
CVRMSE (%) = 11.1
_Smart homeFlorida/USA[195]
DRNN-LSTMRMSE = 7.536
MAE = 4.369
MAPE (%) = 15.87
_ResidentialYulara/Australia[196]
Stacking ensemble models with RNNMRE (%) = 4.29%
nRMSE (%) = 6.16
MAE (kW) = 8.59
R2 = 0.86
_Industrial Co.Taiwan[197]
BiLSTM with KNNRMSE (kW) = 1.984_NZEB
Institute of Building Research
Shenzhen/China[198]
GAN + KNN + LSTM + TransformerAverage reduction of RMSE (kW) = 4.603 in each LSTM-based_NZEB
Institute of Building Research
Shenzhen/China[199]
LSTM + self-attentionsRMSE (kW) = 0.651
MAE (kW) = 0.306
R2 = 0.934
_HousesUrayasu/Japan[200]
Dual-layer LSTMRMSE = 0.0542- Under cloudiness
- Under solar intermittency
ZEBSalamanca/Mexico[201]
LSTM-MLP-NSGA IINRMSE (%) = 5.5_University CampusVigo/Spain[202]
Recursive LSTMSingapore:
nRMSE (%) =15.25
WMAPE (%) = 68.47
Australia:
nRMSE (%) =15.12
WMAPE (%) = 38.95
- Under the missing data condition- University Campus
- University building
Queensland/Australia
Nanyang/Singapore
[203]
Convolutional-LSTMFor Half an hour:
MAE = 2.9
RMSE = 5.2
NRMSE = 0.03
_HouseholdSydney/Australia[204]
Meta-learning + LSTM variantsJoao’s location:
RMSE = 2.273
Forecast skill index = 0.5
_- Institution of social solidarity
- Dental medicine unit
- Elementary school
Lisbon/Portugal[205]
Singular Spectrum Analysis + BiLSTM + Bayesian Optimization15 Min-Ahead, Dataset 1:
MAE = 9.44
RMSE = 12.29
R2 = 0.971
_Real-world rooftop stationsEastern China[206]
Utilizing the KNN algorithmSpring season:
RAE (%) = 30.61
MAE (kW) = 27.79
- Behind the Meter
- Temperature Correction
ResidentialAustralia[207]
hSBFM + KNNMAE (W) = 826.2
MRE (%) = 15.3
nRMSE (%) = 10.8
_University buildingNew York/The USA[208]
QRFFirst PV system: MAE = 0.666
RMSE = 0.924
_Three nearby locations, rooftop PVThe USA[209]
- Quantile KNN Regression AveragingFirst PV system: MAE = 0.587
RMSE = 0.884
_Three nearby locations, rooftop PVThe USA[210]
- Hidden Markov Model RegressionMSE = 0.23
MASE = 3.08
CV = 0.62
Behind the MeterStreetAustin/The USA[211]
- Mixed Hidden Markov Model RegressionMSE = 0.13
MASE = 2.13
CV = 0.47
Behind the MeterStreetAustin/The USA[212]
MLR + GANRMSE = 0.943
MSE = 0.0049
NMSE = 0.9967
Soiling losses- Commercial BuildingsMorocco[213]
ANNOverall:
MAE = 13.34
MSE = 1517
RMSE = 38.96
R2 = 0.935
Microinverter- INSA ICUBE LaboratoryStrasbourg/France[214]
MLRHour-Ahead:
MAE (kW) = 1.42
MAPE (%) = 21.30
NMAE (%) = 4.37
_- A residential
- A public building
Rome/Italy[215]
ARIMA + Kalman filterRMSE = 0.135
nRMSE = 0.1688
Skill Score = 0.788
_- Three PV rooftops located near each otherThailand[216]
TBATSMAE (W) = 73.62_Smart homesRomania[217]
AMLBayesian Ridge, PV1:
MAE = 0.207
R2 = 0.997
AML employs 5 algorithms- ZEB
- Energy Institute
South Korea[218]
AMLET regressor, Dec.
MAE = 1.48
RMSE = 2.197
R2 = 0.857
AML employs 18 algorithmsUniversity CampusToyama/Japan[219]
Autoregressive modelSunny days:
MAPE = 0.167
Cloudy days:
MAPE = 0.327
- Under sky conditions_Korea[220]
- Improved ensembled cross-learning forecastingSites Average (GBRT model):
NRMSE (%) = 10.67
- Several models examinedResidential- Western Belgium
- Northern France
[221]
- Spatial-Temporal Correlation Analysis + Bayesian Network15 min ahead:
RMSE = 0.04
- 5 models examined- More than 20 buildingsAustralia[222]
RBF-ANN + GAModel 4:
RMSE = 1.69
_ResidenceAlgarve, Portugal[223]
RBF-ANN + CSOSystem 1:
RMSE = 4.988 × 10−3
_- A community of buildingNetherlands[224]
WTEEMD + FCM + IGWO + LSTMAverage:
MAE = 1.7915
RMSE = 2.1542
R2 = 0.9915
- Several models examinedZEBAlice Springs/Australia[225]
VMD + Informer + Enhanced Chaos Game OptimizationApril:
MSE (kW) = 224.21
MAE (kW) = 10.246
RMSE (kW) = 14.97
R2 = 0.9732
- Several models examined- Steel Structure Engineering Co.Nantong/China[226]
DSE-XGBCase study (II):
MAE (kWh) = 0.59
RMSE (kWh) = 0.78
R2 = 0.96
_Commercial BuildingsNetherlands[227]
SVRJuly:
R2 = 0.934
RMSE = 252.46
MAE = 141.81
- Four distinct models comparisonUniversity buildingBidart/France[228]
LASSONRMSE (%) = 14.24
NMAE (%) = 10.68
R2 = 0.57
- 5 models examinedIndustrialGermany[229]
RF30 min:
MAPE = 5.27
MSE = 0.032
RMSE = 0.110
R2 = 0.987
- Several models examinedSmart buildingsEstonia[230]
RFMAPE (%) = 28.34
RMSE (kWh/d) = 139.10
- Shading
- 4 models examined
Office and warehouseAbruzzo/Italy[231]
RFnMBE (%) = 0.41
nRMSE (%) = 1.88
R2 = 0.99
- Several models examined- A single-family dwellingMaryland/USA[232]
PRMSE is illustrated in the figure.- Several models examinedResidentialSaudi Arabia[233]
Table 9. Comparison of forecasting performance (R2) across studies.
Table 9. Comparison of forecasting performance (R2) across studies.
MethodsForecasting AccuracyCase StudiesGeographical LocationRef.
SAPMR2 for each façade
East = 0.73
South = 0.89
West = 0.90
University campusMadrid/Spain[146]
ANNR2 =
from 0.63 to 0.88
ResidentialSingapore[147]
ANNR2 = 0.9768_Korea[151]
NARXR2 = 0.95OfficeSeoul/Korea[154]
Mathematical modeling with multiple regressionR2 = 0.81_São Paulo/Brazil[156]
ANNR2 = 0.8833Residential buildingsKovilpatti/India[159]
CNNR2 = 0.96Campus buildingStrasbourg/France[161]
Back-propagation ANNR2 = 0.9489Several typesCardiff/The UK[169]
MLPR2 = 0.8481Smart HomeSwiss[179]
4-layer DNNR2 = 0.95Retail shopKorea[189]
- Residual Dilated Causal Convolutional NetworkR2 = 0.9308ResidentialTainan/Taiwan[190]
Stacking ensemble models with RNNR2 = 0.86Industrial Co.Taiwan[197]
LSTM + self-attentionsR2 = 0.934HousesUrayasu/Japan[200]
Singular Spectrum Analysis + BiLSTM + Bayesian OptimizationR2 = 0.971Real-world rooftop stationsEastern China[206]
ANNR2 = 0.935LaboratoryStrasbourg/France[214]
AMLR2 = 0.997Energy InstituteSouth Korea[218]
AMLR2 = 0.857University CampusToyama/Japan[219]
WTEEMD + FCM + IGWO + LSTMR2 = 0.9915ZEBAlice Springs/Australia[225]
VMD + Informer + Enhanced Chaos Game OptimizationR2 = 0.9732- Steel Structure Engineering Co.Nantong/China[226]
DSE-XGBR2 = 0.96Commercial Netherlands[227]
SVRR2 = 0.934University buildingBidart/France[228]
LASSOR2 = 0.57IndustrialGermany[229]
RFR2 = 0.987Smart buildingsEstonia[230]
RFR2 = 0.99- A single-family dwellingMaryland/USA[232]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Pedram, O.; Soares, A.; Moura, P. A Review of Methodologies for Photovoltaic Energy Generation Forecasting in the Building Sector. Energies 2025, 18, 5007. https://doi.org/10.3390/en18185007

AMA Style

Pedram O, Soares A, Moura P. A Review of Methodologies for Photovoltaic Energy Generation Forecasting in the Building Sector. Energies. 2025; 18(18):5007. https://doi.org/10.3390/en18185007

Chicago/Turabian Style

Pedram, Omid, Ana Soares, and Pedro Moura. 2025. "A Review of Methodologies for Photovoltaic Energy Generation Forecasting in the Building Sector" Energies 18, no. 18: 5007. https://doi.org/10.3390/en18185007

APA Style

Pedram, O., Soares, A., & Moura, P. (2025). A Review of Methodologies for Photovoltaic Energy Generation Forecasting in the Building Sector. Energies, 18(18), 5007. https://doi.org/10.3390/en18185007

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop