1. Introduction
The Performance Ratio (PR) is a crucial metric for various stakeholders as the PR can capture plant performance [
1,
2]. PR is a unitless quantity that indicates the quality of the PV plant by assessing the produced energy with respect to the expected energy [
2] and is a critical performance assessment metric for a PV system [
3].
Stakeholders can use different temporal resolutions of performance metrics, such as hourly, monthly or annual PR estimations. The research indicates that the PR fluctuates throughout the year, which is hypothesised to be because of the spectrum. Thus, an hourly or monthly PR estimation does not indicate the annual PR performance. Ignoring this gives a false representation of the PV plant’s performance to stakeholders.
The calculation of PR adheres to the standard testing conditions (STCs) [
4]. STCs are the industry-wide standard conditions to determine the PV module’s performance. The STC are a cell temperature of 25 °C (
), an irradiance of 1000
(
), and a reference spectrum at 1.5 air mass (AM1.5) [
5]. These values correspond to the irradiance and spectrum of a clear sky upon a sun-facing 37°-tilted surface (
) with a 48.19° solar zenith angle (
). STC approximates solar noon near the spring and autumn equinoxes in the United States of America.
The PV modules generally do not respond to the full AM1.5 spectrum [
5]. Atmospheric conditions cause variations in spectral irradiance due to the scattering and absorption phenomenon, and the three main weather variables that have an effect are AM, AOD, and PW. All three of these variables change over a day and throughout the seasons [
5,
6], and for clear-sky conditions, the AM, AOD, and PW contribute more to the changes in the spectrum. Ignoring the changes in the spectrum’s distributions leads to unrealistic yield predictions [
7].
Most of the Sun’s incident radiation at the top of Earth’s atmosphere falls within the 290 to 3000 nm wavelength range. The spectral distribution of this solar radiation is influenced by local environmental factors and the Sun’s position [
8]. The Bird model, presented in [
9], simulates solar spectral irradiance based on various spectral inputs. Different PV technologies exhibit varying absorption characteristics across the solar spectrum due to differences in their material composition, bandgap energies, and structural design. For instance, crystalline silicon modules typically absorb light efficiently in the visible spectrum but have reduced effectiveness at longer wavelengths, while thin-film technologies, such as cadmium telluride (CdTe) and copper indium gallium selenide (CIGS), can absorb a broader range of wavelengths. These variations in absorption lead to distinct relative spectral responses among different PV modules, affecting their overall efficiency and performance under diverse environmental conditions [
7].
The bandgap energy of a PV cell decreases at higher operating temperature conditions, influencing the cell to absorb longer-wavelength photons. However, increasing the wind speed improves efficiency, as the cooling effect of the wind is more pronounced and blows away dust particles, which cleans the PV modules by removing dust deposits [
10,
11]. Farr and Stein assessed the spatial variations in temperature for a 16-module PV array, where the authors noted that the wind’s directionality caused trends in the spatial temperature variation [
12]; however, the authors could not fully explain the observed trends. Established corrections, such as the temperature and irradiance or weather-corrected PR (WC-PR), do not consider spectral influences. The industry has yet to adopt spectra-corrected PR (SC-PR) [
13].
Until now, there have been no adjustments to account for these spectral influences on the PR. There is also the problem that some form of spectral measurements is needed to correct for the spectrum. Ground-based solar spectrum measurements are scarce and costly and require high maintenance [
7]. Satellite-based data map the annual average spectral influences over large geographical areas; however, these simulation techniques require further experimental validation [
7].
The current international standard defines four temperature and four broadband irradiance power ratings but only one spectrum [
14], despite the many studies that indicate that the spectrum does influence the output of PV technology [
7,
15,
16,
17,
18,
19,
20]. Many studies have also noted seasonal variations in PR [
21,
22,
23,
24].
Even a small percentage error in yield estimations can have a significant impact when carried over multiple years [
25]. Spectral data are limited and scarce [
25], so the viable solution is spectral correction functions using various variables. Due to the extrapolation error, where the instantaneous PR is not equal to the annual PR in most scenarios, long-term data concerning PV power output and meteorological data have to be stored. Therefore, there remains a gap in the literature that illustrates how spectral corrections can be applied to the PR calculation and address these problems.
The manuscript is structured as follows: the introduction in
Section 1 provides a comprehensive background to the challenges associated with performance estimations, the spectrum’s influence, and long-term PV system performance predictions. The development of a novel spectral correction method for calculating the Performance Ratio of PV systems (
Section 2) follows the introduction. The methodology section,
Section 3 provides details on the data used, comparison metrics, the model development process, and the selection and description of the final model. The results section (
Section 4) presents the findings and validation of the spectrally corrected Performance Ratio. The manuscript concludes in
Section 5 with a summary of the key insights and implications of the study.
2. Developing a Spectral Corrected Performance Ratio Calculation
NREL developed a weather-corrected PR formula, which is more specific to a geographical location [
3,
26]. The methodology accounts for the effects of ambient temperature (
), wind speed (
) and irradiance on the performance of the PV module.
The normal PR calculation is as follows:
In Equation (
1),
is the measured PV energy generation,
is the sum of the nameplate ratings for all modules installed during the acceptance test, and
is the measured POA irradiance averaged over the time step
i [
4,
26].
The weather-corrected PR calculation is as follows:
is the cell temperature calculated from measured meteorological data, and
is the temperature coefficient for power (%/°C, negative in sign).
Δ
T denotes the conduction temperature drop, and
is the back-surface temperature of the module:
In Equation (
4),
is the conduction/convection heat transfer coefficient, and the coefficients
a,
b and Δ
T are coefficients for a glass/cell/polymer sheet on an open rack mounting structure (see
Table 1) [
27].
Finally,
is the average calculated cell temperature weighted by irradiance of one year of weather data and is calculated using:
and are the POA irradiance and the calculated cell operating temperature for each hour.
accounts for the presence of water vapour in the atmosphere and is calculated using [
28,
29,
30]:
is the apparent water vapour scale height (km) and
is the surface water vapour density (
).
is the saturation water vapour pressure, and T is the temperature in Kelvin.
is determined by
where
Kelvin.
is estimated using
with the coefficients values defined as
,
,
,
,
,
,
and
[
28,
29,
30].
The general expression of AM is:
The approximation in Equation (
10) is reasonably accurate for
of up to 75°, which led Kasten and Young to propose a new equation to model the AM using [
31]
The absolute AM (
) is the pressurised normalisation of AM, expressed as
where
P refers to the atmospheric pressure at the test site, and
is the atmospheric pressure at sea level.
As the SC-PR aims to be a universal application, the is used as a spectral variable instead of the to include the altitude.
The definition of spectral mismatch (
) is
and
are the lower and upper wavelength limits of spectral absorption of the PV module.
and
are the wavelength limits of the spectrum [
7].
is the reference spectral irradiance at STC, and
is the spectral response of the PV module.
is the measured solar spectral irradiance [
32].
The formula multiplies the irradiance with the SR and the spectral irradiance and integrates over the wavelengths. When the SR at a specific wavelength, is higher, and a larger amount of the spectral irradiance at wavelength can generate a current in the PV module, compared to a lower SR. indicates whether the PV performance will be more or less favourable to irradiance closer to reference conditions (STC).
accounts for the spectrum’s influence on PV performance. The interpretation of
is that the PV module undergoes a spectral loss when
is less than unity and undergoes a spectral gain if
is greater than unity [
17].
Figure 1 shows the default simulated SR of a poly-Si PV module used throughout the paper. The SR of a PV module indicates the spectral sensitivity and is a function of wavelength. The module can only absorb irradiance between certain wavelengths. In
Figure 1, the SR is higher for wavelengths between 0.75 and 1 μm, which means that irradiance at these wavelengths can generate a higher current by the solar cells than the irradiance at lower wavelengths. It can also have the effect that larger variations in 0.75–1 μm in the spectral irradiance will have a more pronounced impact than lower wavelengths with a lower SR. The SR, therefore, encapsulates the PV technology’s sensitivity to the solar spectral irradiance.
can be successfully substituted with Equation (
14) in spectral MM estimations, as studied in [
33]:
where
can be substituted using a constant value of 0.123, also studied in [
33].
The SC-PR must correct the WC-PR to account for spectral influences to estimate the annual PR:
Thus, the equation should hold that
where
is the spectral correction function:
For the WC-PR, the following corrections for temperature (
) and irradiance (
) still hold:
The goal is to fit to ensure .
3. Methodology
In continuation of the previous discussions of the importance of accurate long-term performance estimations derived from short-term measurements and the significant impact of spectral variations on PV system performance, this section outlines the methodological framework employed in this study. The methodology encompasses a detailed examination of the data sources, the criteria used for comparative analysis, and the systematic development of the spectral correction model for the Performance Ratio. The selection process for the final model was meticulously conducted, taking into account factors such as complexity, robustness, and temporal transferability, to ensure its broad applicability. Additionally, this section provides a comprehensive summary of the spectral correction Performance Ratio (SC-PR) model, setting the stage for the subsequent presentation of validation results in
Section 4 and the study’s key findings and conclusions in
Section 5.
3.1. Data
Previous research on the data analytics of the unique available dataset has been discussed regarding the spectral influences on performance [
34] and the wind and temperature effects of long-term degradation within a utility-scale PV plant [
35], situated in a semi-arid region in Northern Cape, South Africa.
Figure 2 describes the four-weather-station layout within the PV plant. The correlation analysis of the POA irradiance from the weather stations linked the different inverters, shown with different colour groupings, and the interested reader is referred to [
35]. The unique dataset has 84 inverters, each with their power output and associated weather station in a resolution of 5 min intervals. The quality control procedure excludes all data points where the POA is less than 20
and a calculated PR and WC-PR (Equations (
1) and (
2)) greater than 100%. Data from inverters that showed faulty data in the analysis of [
36] were subsequently excluded from the dataset. The data used to formulate the spectral corrections are 5-min data of generated PV power, POA irradiance,
,
,
,
, constant
, and estimated
.
3.2. Comparison Metrics
The baseline models are normal PR and WC-PR. The error is the difference between the models and the annual PR. Therefore, if the instantaneous PR calculated using Equation (
1) is 75% but the annual PR is calculated as 80%, there is a 5% difference between what the normal PR predicted and the annual PR. Note that 5% as a fraction of 80% results in a 6.25% error, expressed as a percentage of the annual PR.
The comparison metrics are the RMSE, MAE and MBE, all expressed as a percentage of the annual PR.
3.3. Spectral Variable Selection for Correction Model
The spectral influences consist of two components: the input spectral influence (spectral irradiance) and the technology’s spectral influence (the PV module’s SR).
Figure 3 assesses the influences of different spectral variables on the short circuit current. The figure calculates the PW using Equation (
6) ranges with varying
,
and
.
Figure 3 shows that the
significantly impacts the
and, in turn, the power output. The spectral correction methodology excludes PW,
and
due to their minimal impact on spectral irradiance.
The SR (
Figure 1) must be considered part of the technology’s influence on performance.
is used to consider the input spectral influence, as it significantly influences the input spectral irradiance, as shown in
Figure 3, compared to the other spectral variables.
The hypothesis is that this makes the contributions more universally applicable because is determined based on the location, and the PV technology determines the SR.
The PV plant’s region is semi-arid, and the clear-sky Bird and Riordan model [
9] is used to model the spectral irradiance, as most weathering profiles are sunny and clear-sky days for PV plants. Therefore, the novel spectral correction variable accounts for the change in spectral irradiance.
From Equation (
13),
and
:
can thus simplify the spectral mismatch to the following [
37]:
The newly defined spectral correction factor
results in
A similar analysis of
, as discussed in [
33] for spectral MM formulation, assesses the sensitivity of
and
on
simulations.
Figure 4 shows the distribution of how the
compares for the measured versus estimated
and the RMSE of estimating
.
Figure 4a results indicate that a constant
and estimated
using Equation (
14) correspond well with the measured
.
Figure 4b shows the RMSE of estimating
with the estimated
and constant
. The results indicate a reasonable error when estimating
using estimated constant
and estimated
instead of measured variables.
The results indicate that the is less sensitive to the and used for modelling. Therefore, the is a viable spectral correction variable.
Table 2 shows the correlation matrix between the goal correction value
and spectral variables.
is inversely correlated, and
has a moderately positive correlation with
.
The following variables are chosen as possible correction variables as they indicate the greatest influence on the relationship to the fitted correction value:
represents the spectral irradiance and location, and
represents the technology.
Figure 5 and
Figure 6 show the relationship between the correction factor
(from Equation (
21)) and the
and
SC.
Figure 5a and
Figure 6a show binned
and
, the mean correction factor in these bins and the total generated PV power. In
Figure 5a, higher
levels have less generated PV power, but the correction factor drops to less than 1.
Figure 6a indicates a more varied distribution of the generated PV power for different bins, and the
is less varied, though it decreases as
increases.
Figure 5b and
Figure 6b show the
,
and
relationships. As expected, there are considerably more data points at lower
levels with a negative correlation.
Figure 6b shows a positively correlated relationship, capped close to 1 as
increases.
Figure 5b shows the amount of generated power for different AM bins and the mean correction factor (a value greater than 1 represents an overestimated PR, and a value less than 1 represents an underestimated PR).
The relationship shown in
Figure 5b shows an inverse relationship between the ideal correction and AM, also supported by the correlation matrix in
Table 2.
has a moderate positive correlation with the desired correction factor (from
Table 2).
3.4. Model Development
In the previous section, it was determined that
and
were the two variables that had the greatest influence on the fitted correction value. The next step is determining a function that encapsulates the spectral changes in the SC-PR formula in Equation (
21).
The model is fitted using statistical methods such as regression. Regression techniques can take several forms, including linear, multi-linear, and non-linear approaches.
A linear relationship can be expressed as
where
y represents the dependent variable,
x is the independent variable,
is the intercept, and
is the slope of the line. Regression analysis helps quantify the strength of the relationship between
y and
x [
38].
To estimate
and
, the least squares method is used, which minimises the sum of the squared differences between the observed and predicted values. This sum, known as the residual sum of squares (denoted
), is given by
where
is the predicted value for each observation. Minimising the
ensures that the regression line best fits the data points.
Polynomial regression extends this concept by modelling the dependent variable
y as a polynomial of degree
n with respect to
x:
This paper combines data science techniques and modelling, primarily using Python as the programming language [
39], through the Anaconda interface [
40], along with various supporting libraries [
41,
42,
43,
44,
45]. The pandas library [
42] is employed for efficient data manipulation and analysis. The pvlib library [
41] provides essential functions and classes related to PV systems. Additionally, solar spectral irradiance is modelled using the pvlib spectrum module [
43]. The SciPy library [
44] is used for optimisation algorithms, particularly for fitting coefficients in the regression models outlined in this section. The SC-PR model was also calculated, and its performance metrics were evaluated.
The model development for SC-PR consists of considering many factors, such as the complexity and robustness of the empirical formula. An analysis of the model complexity compares different non-polynomials (or rational functions) of and polynomials of , i.e., how complex the model has to be and how much the model improves the error. A model with less complexity and more robustness is a more ideal solution.
Order functions with positive and negative exponents are selected due to the non-linear relationships in
Figure 5b and
Figure 6b.
The spectral correction function takes the form of
where the AM function
can be in the inverse form due to its negative correlation with
from
Table 2:
and the
function
can take the form of its moderately positive correlation with
from
Table 2:
The section further assesses a piece-wise model using
versus having one continuous empirical formula. Thus,
The training–testing ratio of the data is a 60:40 split.
The model notation in the section from Equation (
31) is denoted as
The subscript t refers to the model type, where continuous is denoted as -subscript and piece-wise as -subscript.
, as in Equations (
32) and (
35), refers to
, as in Equations (
33) and (
35), refers to
Thus,
is the spectral correction function with the form
3.5. Model Selection
Figure 7 summarises the model selection methodology, leading to the two viable models discussed in this section. Two spectral variables representing the location and technology are used to developan SC-PR. The new model’s complexity, robustness and temporal transferability are important considerations in the model selection criteria.
3.5.1. Model Complexity, Temporal Sensitivity and Transferability
Figure 8 shows the MAE errors compared to the model’s complexity. The x-markers indicate continuous, and the dots indicate piece-wise models. The continuous models have higher errors than their piece-wise counterparts. Further, the error slowly improves as the model complexity increases, indicating there will be a trade-off between increasing performance by a minuscule amount for a much more complex model.
Figure 9 shows the MAE improvement of the different iterations combining Equations (
32) and (
33).
In
Figure 9, the dashed lines indicate that the model is continuous, as described in Equation (
34), and the solid lines indicate a piece-wise model. The piece-wise model outperforms the continuous model in all scenarios, being 5-min, hourly, and monthly aggregated. The results indicate that AM-only and
-only functions show the worst performance. The conclusion is that the model needs to combine
and
instead of only
or
.
Further, a more complex model can introduce uncertainties and additional computation, which may not be necessary to achieve the final result. A complicated model is not necessarily an ideal or practical model.
There has to be a trade-off between choosing a simpler model with a slightly higher error versus an extremely complex model with a slightly lower error. Therefore, the improvement of the
function from
to
and
is small enough that the conclusion is to limit the largest exponent of the
function (Equation (
33)) to a maximum of 3. Similarly, with AM, the improvement from
to
(in Equation (
32)) shows that both
and
can still be considered a viable option.
A further consideration is the temporal sensitivity and transferability. The model must transfer relatively well to hourly and monthly aggregated data, as shown in
Figure 10 and
Figure 11. The conclusions made from the 5-min datasets hold to these, indicating that
- or
-only models and
should be excluded as a possible spectral correction method for PR because significant improvement is not noticeable over a simpler model.
Table 3 shows the 5-min results for different
and
intervals. The normal and WC-PR show the worst performance in estimating the annual PR. The continuous models show poorer performance for lower
levels.
Table 4 shows the sensitivity of transferability of different temporal resolutions. The normal and WC-PR have the worst performance. All the continuous models overestimate the monthly aggregated data. The piece-wise models perform marginally better than the continuous models for the different metrics.
The analysis showed that
was unsuitable. Additionally,
and
are unsuitable as they need to show significant improvement over the
function to be a considered option, confirming the conclusions made from
Figure 8 and
Figure 9. The piece-wise model shows better performance than the continuous model. However,
and
show promising results.
The results in
Figure 8 and
Figure 9 show that piece-wise models showed higher error improvements over WC-PR than continuous models, and the AM-only or
-only models were not viable options. The results indicate that a combination of the two for a correction model would have higher accuracy in correcting PR for the spectrum.
3.5.2. Model Robustness
The ideal model should be robust while being as accurate as possible, which requires assessing how the coefficients vary throughout the PV plant. Due to the nature of the dataset, there are 84 inverters with their annual PR. Therefore, individual models are developed for each inverter and then compared to the entire PV plant to indicate if there is a significant change in coefficient variables. The ideal model would be one where very little variability is noticed and does not overfit on a certain variable, i.e., not large coefficients for and small coefficients for or vice versa, which can result in a universal application of the formula.
From the initial model complexity assessment, viable models were selected, and their robustness was further scrutinised. The analysis considered each model’s coefficient distribution to assess the robustness and evaluated the error improvement over WC-PR and temporal transferability across the entire plant. Some models showed excellent improvements, whereas others indicated overfitting or poor robustness, leading to the selection of two viable models, as summarised in
Table 5, highlighted in green.
Thus, two possible models that could be viable asan SC-PR are
:
:
Figure 12 shows how coefficients change across the PV plant for each inverter for the function
. The coefficient distribution across the three bins is relatively stable.
Figure 13 shows the MAE improvement over WC-PR for each inverter block.
Figure 13a,b indicates that the model significantly improves over the WC-PR for the entire plant. The monthly aggregated data in
Figure 13c show marginal improvements in most inverters.
Figure 14 shows how coefficients change across the PV plant for each inverter for
.
The
coefficients in
Figure 14a are close to zero, and the
b coefficients vary significantly across the plant.
Figure 14b,c shows a more stable coefficient distribution across the different inverters. The results in
Table 3 and
Table 4 indicate that the model does better than the piece-wise counterpart. In
Table 4, the piece-wise function significantly outperforms the Normal and WC-PR for hourly and monthly aggregated data.
Figure 13 shows the distribution of the PV plant for improvements over the WC-PR of each inverter. The 5-min resolution (
Figure 13a) shows ranges of improvement over WC-PR between 15 and 28%, with the hourly aggregated resolution (
Figure 13b) ranging between 24 and 41% improvement and the monthly aggregated resolution (
Figure 13c) between a 6.9% decrease to 30% increase (−6.9% to 30%).
Figure 15 shows the distribution of the PV plant for improvements over the WC-PR of each inverter. The 5-min resolution (
Figure 15a) shows ranges of improvement over WC-PR between 18 and 31%, with the hourly aggregated resolution (
Figure 15b) ranging between 26 and 43% improvement and the monthly aggregated resolution (
Figure 15c) between a 16% decrease to a 9% increase (−16% to 9%).
A final comparison of the two models assesses the most suitable spectral correction function for PR estimations.
3.5.3. Final Model Selection
Table 6 summarises
Table 4 and indicates that the two models significantly outperform the normal and WC-PR. There is a significant reduction in error in the RMSE and MAE. The models were originally fitted for 5-min resolution data to have as many data as possible; however, the hourly metrics are still an important consideration for the final model selection. There is a marginal difference between the two new models regarding performance in the 5-min resolution.
Table 7 summarises from
Table 3 and shows the RMSE and MAE metrics for the 5-min data for different AM intervals. The lower AM intervals (0.5 AM to 1.5 AM) generate the most PV power, as shown in
Figure 5a, showing improvement in reducing the WC-PR and normal PR for both models.
After 6 AM, the amount of generated PV power compared to power generated for lower
levels becomes incomparable. It is important to focus on improving the periods where the most PV power is generated rather than focusing on higher AM, which is associated with lower PV power generation.
Figure 5b shows that most data are below 10 AM. The mid-range
levels (1.5 AM to 6 AM) show that the
function performs marginally better than
with the
function. For higher
(greater than 6), there is a significant improvement in reducing the error over WC-PR.
Table 8 summarises
Table 3, showing the MAE for different
-intervals. The results indicate that the two models considerably improve over WC-PR in estimating the annual PR compared to WC-PR.
Table 9 summarises
Table 4 and shows the hourly metrics, an important criterion the model has adhered to viable spectral correction method. The results indicate considerable improvements over the WC-PR, reducing the MBE to almost zero. The normal and WC-PR all show slightly under estimations for annual PR from the instantaneous measurements.
Table 10 summarises
Table 4 and shows the monthly aggregated data. The monthly model is outside this manuscript’s scope; however, a discussion showing its temporal transferability indicates potential future improvements.
The normal PR has the worst performance in estimating the annual PR. The
function is comparative with WC-PR, whereas the
function marginally reduces the RMSE and MAE better. The metrics in
Table 10 are all mean-based. As the number of monthly aggregated data is considerably less than the hourly or 5-min data, the magnitude of the errors is influenced.
The coefficient distribution across the entire PV plant indicates how robust the model is.
Figure 12 shows the
with the
function, where the coefficients were quite stable for all the inverters.
Figure 12a shows that the
has more influence in the lower
ranges, whereas the
has a greater influence in the mid and higher
ranges.
Figure 14 shows
with the
function, where the coefficients were quite stable for all the inverters, except in the lower
range (
Figure 14a), where there was also overfitting on the
b coefficient.
Overall, the final verdict is to use the simpler model, , over the more complex model.
3.6. Novel Spectra-Corrected Performance Ratio Estimation
Step 1: Data
Subhourly data points (ideally 5–10 min intervals) of the data required that we determine the weather-corrected Performance Ratio. At least one full year is required for weather-corrected PR to determine the weighted cell temperature:
- (a)
Generated PV Power;
- (b)
GHI for ;
- (c)
POA irradiance (or use GHI with the correct combination of decomposition and transposition model);
- (d)
Wind speed, ambient temperature, and relative humidity.
Spectral variables:
- (a)
Estimate , PW, and AOD500;
- (b)
Determine and ;
- (c)
SR of PV technology;
- (d)
Simulate the spectral correction factor using , AM, PW, and AOD500.
Quality control of the dataset.
- (a)
All calculated PR must be less than 1 (WC-PR and Normal-PR);
- (b)
Remove all faulty data/inverters/periods where non-spectrum-related issues occurred (such as breakdowns, curtailments);
- (c)
Remove all POA less than 20 .
Step 2: Spectra-Corrected Performance Ratio
where
where
Table 11 shows the coefficients.
4. Results
The previous section discussed the methodology of selecting a model to correct the spectrum for PR estimates. Two spectral variables,
and
, representing the location and technology, were selected to make the formula universally applicable.
Section 3.6 discusses incorporating and applying the SC-PR formula.
In this section, the novel SC-PR formulation’s results are presented, and the three temporal resolutions, hourly, 5-min, and monthly, are discussed. However, the 5-min and hourly resolutions precede the monthly resolution, as this is the most common application. The spectrum changes rapidly, and the model better reflects higher-resolution data.
4.1. Annual Results
4.1.1. Year 1
Table 12 shows the normal, WC-PR, and SC-PR results of Year 1. The SC-PR estimates the annual PR from hourly and 5-min aggregated data with a lower error than normal and WC-PR. The MBE is also significantly reduced for higher-temporal-resolution data. The monthly aggregated data show slightly worse performance than WC-PR.
Figure 16 visualises the normal, WC-PR and SC-PR of Year 1 when
and POA
. The SC-PR shows less variability than normal and WC-PR.
Figure 16a,b shows overestimations (SC-PR > 1) and underestimations, as do the normal and WC-PR.
Figure 17 shows the distribution of estimating PR compared to normal and WC-PR, which indicates that the SC-PR estimates more data points closer to the annual PR, compared to WC-PR and normal PR.
Figure 18 demonstrates how using SC-PR improves the estimate of annual PR from instant data by removing timestamps with low PV power generation (
and POA
).
4.1.2. Year 2
Table 13 shows the normal, WC-PR and SC-PR results of Year 2. The results are similar to Year 1. The SC-PR estimates the annual PR from hourly and 5-min aggregated data with a lower error than normal and WC-PR. The MBE is also significantly reduced for data with higher temporal resolution. The monthly aggregated data performs better than WC-PR and normal PR, in contrast to Year 1.
Figure 19 visualises the normal, WC-PR, and SC-PR of Year 2 for data points where
and POA
. The SC-PR shows less variability than normal and WC-PR.
Figure 19a,b shows overestimations (SC-PR > 1) and underestimations, as do the normal and WC-PR.
Figure 20 shows the distribution of SC-PR compared to normal and WC-PR, which shows that both the WC-PR and normal PR for more ideal scenarios underestimate the PR.
In
Figure 21, eliminating timestamps featuring low PV power generation (
and POA
) highlights the improved accuracy in estimating the annual PR using SC-PR with instantaneous data.
4.1.3. Year 3
Table 14 shows the normal, WC-PR, and SC-PR results of Year 3. The results are similar to Years 1 and 2. The SC-PR estimates the annual PR from hourly and 5-min aggregated data with a lower error than normal and WC-PR. The MBE is also significantly reduced for higher-temporal-resolution data. The monthly aggregated data perform similarly to WC-PR but considerably improve on normal PR.
Figure 22 visualises the normal, WC-PR, and SC-PR of Year 2 when
and POA
. The SC-PR shows less variability than normal and WC-PR.
Figure 22a,b show overestimations (SC-PR > 1) and underestimations, as do the normal and WC-PR.
Figure 23 shows the distribution of SC-PR compared to normal and WC-PR with similar results to Year 1 and Year 2. The SC-PR estimates more of the data points closer to the annual PR compared to WC-PR and Normal PR.
Figure 24 demonstrates the removal of timestamps with lower PV power generation (
and POA
), highlighting how using SC-PR improves the estimation of annual PR based on instantaneous data.
4.1.4. Year 4
Table 15 shows the normal, WC-PR, and SC-PR results of Year 4. The results are similar to Years 1–3. The SC-PR estimates the annual PR from hourly and 5-min aggregated data with a lower error than normal and WC-PR. The MBE is also significantly reduced for higher-temporal-resolution data. The monthly aggregated data show worse performance to WC-PR but considerably improve on normal PR.
Figure 25 visualises the normal, WC-PR, and SC-PR of Year 2 when
and POA
. The SC-PR shows less variability than Normal and WC-PR.
Figure 25a,b show overestimations (SC-PR > 1) and underestimations, as do the normal and WC-PR.
Figure 26 shows the distribution of SC-PR compared to normal and WC-PR with similar results to Year 1 to Year 3. The SC-PR estimates more of the data points closer to the annual PR compared to WC-PR and normal PR for the hourly aggregated data.
Figure 27 removes lower PV power generation timestamps,
and POA
, which indicates the improvement the SC-PR makes in estimating the annual PR from instantaneous data.
4.2. Discussion
The newly defined spectral correction methodology for Performance Ratio successfully uses instantaneous measurements to estimate annual PR, with reasonable accuracy. The model shows significantly improved accuracy for higher AM values, where the spectral influence is non-linear. The model has a universal application due to its incorporation of the location and technology, but further testing and validation will be required to obtain additional sets of a and b coefficients. The method is a piece-wise empirical model including two spectral variables: , representing the location and irradiance, and a newly defined spectral correction factor, , representing the technology.
The new spectral corrections show promising results with reduced RMSE, MAE, and MBE for different and bins and the overall dataset. To the author’s knowledge, there have been no attempts in the literature to correct the Performance Ratio with spectral corrections. The novel methodology addresses and successfully estimates PR from aggregated hourly values with reasonable accuracy to estimate a PV plant’s annual PR. The model does show slight overestimations and underestimations for higher levels, but the results could also indicate faults with the plant.
The data used in the model have several limitations, such as the decomposition and transposition models: when using GHI with the decomposition and transposition models to approximate POA irradiance, these models may introduce biases, especially in non-standard or extreme weather conditions. Errors in irradiance, temperature, and wind speed sensors could propagate through the model. Miscalibrations or sensor degradation could skew the Performance Ratio, leading to false results. Removing faulty data and periods with curtailments or breakdowns may inadvertently remove valid periods of low production due to spectral or operational conditions. The aggregation of spectral-corrected data to hourly intervals could lead to the loss of finer details in PR performance variations, especially during rapid changes in irradiance or atmospheric conditions. Non-linear factors like inverter efficiency, partial shading, or temperature-induced degradation might not be fully captured in the weather-corrected PR, leading to deviations from actual performance.
Different PV technologies have varying spectral responses, and the correction factor might not generalise well across all types of panels, leading to inaccuracies in the calculated spectral-corrected PR. The lack of more available data for various technologies, geographical locations, and climates hinders the validation process. Errors of faulty measurement or plant equipment can be masked in the statistics closer to STC or ideal operational conditions. The SC-PR model compensates for the spectrum in the non-ideal spectral scenarios, and high could exacerbate these errors, indicating faults in the plant worth addressing. These limitations could further refine and improve the spectral correction methodology and increase the model’s robustness.
The model shows an adequate temporal transference from 5-min to hourly data. The recommendation for the model is to use higher temporal resolution, as the spectrum is highly variable. The model corrects the PR for the spectrum, especially for higher levels with lower generation of PV power. Therefore, the recommendation is to use periods of the day when the PV module will be in higher production and the spectral irradiance is greater.
5. Conclusions
The newly developed spectral correction methodology for estimating the PR demonstrates effective accuracy when extrapolating instantaneous measurements to the annual PR. Notably, the model exhibits a substantial enhancement in performance as the AM increases, particularly in scenarios where the solar spectrum deviates from STC. The new model boasts universal applicability due to its capacity for geographical location and technology-specific factors.
The methodology comprises a piece-wise empirical model that integrates two key spectral variables: , which reflects the location and irradiance, and a newly defined spectral correction factor denoted as , signifying the technology in use. These new spectral corrections yield promising outcomes, showcasing reduced RMSE, MAE, and MBE across various AMa and intervals and different temporal resolutions.
Minor overestimations and underestimations are discernible for higher AM levels, indicating potential maintenance issues within the PV plant that require attention. Additionally, the model demonstrates satisfactory temporal transferability when transitioning from 5-min to hourly data to monthly data. Given these findings, it is advisable to consider the model’s application at higher temporal resolutions, leveraging the inherent adaptability of the solar spectrum. The model is particularly effective in correcting the PR when dealing with higher AM levels and reduced PV power generation compared to conventional PR estimations. It is advisable to operate the model during times of the day when the PV modules operate at peak production and spectral irradiance is more predictable.
Validation of the novel spectral correction methodology is essential for various technology types and climates. Its application must be updated in software tools as the new model can indicate the financial implications to stakeholders.
The proposed novel approach marks the first documented effort in the literature to rectify PR using spectral corrections for large utility-scale PV plants. The groundbreaking methodology effectively addresses and accomplishes estimating PR based on hourly aggregated values, yielding satisfactory accuracy in projecting a large PV plant’s annual performance.