1. Introduction
Understanding and managing extreme events is crucial in the energy sector, where fluctuations in supply and demand can lead to severe disruptions. In renewable energy systems, extreme weather events—such as prolonged cloudy periods, intense storms, or heatwaves—can cause significant deviations in power generation and demand and lead to instability in grid management, energy pricing, and infrastructure resilience. Accurate prediction of these extreme events is essential for energy security, economic stability, and optimizing renewable energy integration.
Extreme Value Analysis (EVA) has proven to be an effective statistical tool for modeling rare but high-impact events [1,2]. For instance, EVA has been applied to analyze the impact of infrequent extreme weather events on annual peak electricity demand, revealing their effects on grid stability and consumer response [3]. Similarly, in the evolving landscape of electric mobility, EVA has been employed to extract long-term trends in electric vehicle (EV) charging demand, helping stakeholders anticipate infrastructure needs and policy adjustments [4,5,6,7,8,9,10,11]. The unpredictability of solar energy generation further underscores the importance of robust extreme event forecasting models, as variations in photovoltaic (PV) output due to rare extreme weather conditions can significantly impact grid reliability and energy trading strategies [12,13,14,15].
The present study was conceived as an EVA-based optimization and benchmarking approach, focusing on enhancing the interpretability and adaptive calibration of extreme value models. By enhancing EVA with explainable AI (XAI) and optimization techniques, more reliable forecasting and decision-support systems can be developed to anticipate and mitigate the impact of extreme energy fluctuations, thus improving decision making [16]. This paper introduces the Extreme Value Dynamic Benchmarking Method (EVDBM), a novel EVA-based methodology tailored to renewable energy applications. EVDBM is shown to ensure more precise identification of rare but high-impact extreme low-production events and, thus, enables better proactive energy management.
While prior EVA-related research has incorporated machine learning or XAI elements primarily to interpret black-box predictors of extreme phenomena, these approaches have not embedded explainability within the EVA process itself. Existing frameworks typically apply post hoc interpretation or feature attribution after model fitting. In contrast, EVDBM integrates explainability [17,18,19,20,21,22,23] at the core of the extreme value modeling process through the Dynamic Identification of Significant Correlation (DISC)-thresholding mechanism. This internal integration allows EVDBM to dynamically quantify and interpret the evolving relationships between variables under extreme conditions, providing not only improved predictive performance but also interpretable causal structure discovery.
More specifically, EVDBM integrates extreme value theory with the innovative Dynamic Identification of Significant Correlation (DISC)-thresholding algorithm. This integration also acts as an explainable artificial intelligence (XAI) layer by dynamically identifying and quantifying the importance of correlations between key variables under extreme conditions. This added interpretability allows decision-makers to better understand the relationships between critical factors during extreme scenarios and to project how these relationships are likely to adjust in the future.
A key feature of EVDBM lies in its use of an optimization mechanism (grid search, Bayesian optimization [24,25,26,27,28,29,30]) to fine-tune the weights assigned to related variables, aligning them with the EVA-dependent variable under analysis. This process ensures that EVDBM adapts dynamically to the specific circumstances of each case, maximizing its explanatory power. The result is a significant improvement in model performance compared to standard EVA: in our experimental evaluation with real photovoltaic energy production data, a 13.21% increase in R² is achieved, which clearly demonstrates EVDBM's enhanced ability to capture variance in extreme scenarios.
Beyond improving predictive accuracy, EVDBM introduces the concept of dynamic benchmarking—a continuously adaptive framework that recalibrates the benchmark itself based on evolving extreme conditions. Unlike static EVA implementations that evaluate models using fixed thresholds or historical baselines, EVDBM dynamically updates its reference structure through the DISC mechanism and weight optimization process. This enables fairer and more context-aware comparisons between different sites, time periods, or operational conditions, particularly in systems affected by non-stationary climatic or operational dynamics. As a result, EVDBM transforms benchmarking from a static evaluation into a learning process that evolves with environmental variability.
Additionally, the EVDBM methodology provides a robust quantitative mechanism for comparing different use cases by generating a final benchmarking score based on weighted performance during extreme events. This scoring system incorporates both the frequency of past extreme events and the predicted severity and likelihood of future occurrences, projecting how related conditions tied to the EVA-dependent variable behave under projected stress. By integrating historical data with probabilistic projections, EVDBM offers a forward-looking performance evaluation and, thus, enables a meaningful comparison of cases under extreme conditions.
Moreover, this scoring framework is highly adaptable and valuable in energy systems, particularly for benchmarking photovoltaic (PV) performance under extreme weather conditions, assessing grid stability during peak demand events, and evaluating renewable energy resilience against climate-induced fluctuations. By extracting and quantifying correlations between critical variables—such as solar irradiance, temperature variations, and energy consumption patterns—and dynamically optimizing their contributions, EVDBM provides actionable insights into the drivers of extreme energy fluctuations. These insights support more informed decision-making in risk management, infrastructure planning, and resource allocation, ensuring greater stability and efficiency in renewable energy integration.
In more detail, the paper is structured as follows: In Section 2, previous works are presented in relation to Extreme Value Analysis, with additional context focusing on the components of EVDBM. In Section 3, the EVDBM methodology is described and explained in detail. In Section 4, the methodology is applied to PV production data. Lastly, in Section 5, the results are discussed and compared against relevant metrics, limitations are acknowledged, and future work is outlined.
2. Related Work
2.1. Applications of Extreme Value Analysis
EVA has been used in several diverse data processing and data analytics applications with useful results. For example, in [31], the authors used extreme value theory for the estimation of risk in finite-time systems, especially for cases where data collection is expensive and/or impossible. On another occasion, for the monitoring of rare and damaging consequences of high blood glucose, EVA has been deployed using the block maxima approach [32]. Many more examples of applications of EVT can be found in the recent literature, but here we report only those considered most relevant to our research. As we use photovoltaic production data as a use case for our proposed EVDBM, we review some of the literature on applications of EVA to energy production/consumption data.
Extreme Value Analysis (EVA) is vital in renewable energy for balancing demand and supply. Studies [33,34] have employed the Peaks-Over-Threshold (POT) method to model solar and wind power production, estimating peak frequencies and their size distributions. Clustering methods are used to optimize the fit, and extended timeframes are essential to capture seasonal effects. This analysis aids in managing the inherent variability and unpredictability of renewable energy sources.
In [35], on the other hand, estimators for the extreme value index and extreme quantiles in a semi-supervised setting were developed, leveraging tail dependence between a target variable and covariates, with applications to rainfall data in France. In [36], the authors review available software for statistical modeling of extreme events related to climate change. In [37], a novel method is proposed for estimating the probability of extreme events from independent observations, with improved accuracy achieved by minimizing the variance of order-ranked observations and, thus, eliminating the need for subjective user decisions. EVA has also been used in partial coverage inspection (PCI) to estimate the largest expected defect in incomplete datasets, though uncertainties in return level estimations are often underreported [38].
2.2. Theory of Extreme Value Analysis
In this section, the key notions of extreme value theory are highlighted. EVA can be approached from two different angles. The first refers to the block maxima (minima) series: the annual maximum (minimum) of the time series data is extracted, generating an annual maxima (minima) series, referred to as the AMS. The analysis of AMS datasets is most frequently based on the results of the Fisher–Tippett–Gnedenko theorem, which leads to the fitting of the generalized extreme value (GEV) distribution, although a wide range of distributions can also be applied. The limiting distribution of the maximum (minimum) of a collection of random variables from the same distribution is the basis of this theorem [39].
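As an illustration of the block maxima approach described above, the following sketch extracts annual maxima from a daily series and fits a GEV distribution with scipy; the synthetic series stands in for real data and all names are illustrative assumptions.

```python
# Block maxima sketch: extract annual maxima from a daily series and fit a GEV.
# The synthetic series below stands in for real data; names are illustrative.
import numpy as np
import pandas as pd
from scipy import stats

rng = np.random.default_rng(0)
idx = pd.date_range("2000-01-01", "2019-12-31", freq="D")
daily = pd.Series(rng.gumbel(loc=20.0, scale=5.0, size=len(idx)), index=idx)

ams = daily.groupby(daily.index.year).max()      # annual maxima series (AMS)
c, loc, scale = stats.genextreme.fit(ams)        # GEV fit (scipy uses c = -xi)

# 100-year return level: the level exceeded on average once every 100 years.
rl_100 = stats.genextreme.ppf(1 - 1 / 100, c, loc=loc, scale=scale)
print(f"GEV c={c:.3f}, loc={loc:.2f}, scale={scale:.2f}, 100-year level={rl_100:.2f}")
```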
The second approach to EVA makes use of the Peaks-Over-Threshold (POT) methodology. This is a key approach in EVA that involves identifying and analyzing peak values that surpass a set threshold in a data series. The analysis typically fits two distributions: one for event frequency and another for peak sizes. According to the Pickands–Balkema–De Haan theorem, the excesses over the threshold converge to the Generalized Pareto Distribution, while a Poisson distribution models the event counts. In this approach, the return level (return value, R.V.) represents the value expected to exceed the threshold once per time/space interval T, i.e., with probability 1/T in any given interval [40].
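A minimal POT sketch, assuming a synthetic series and an illustrative 95th-percentile threshold, fits the GPD to the excesses and converts the fitted parameters into return levels:

```python
# Peaks-Over-Threshold sketch: fit a GPD to threshold excesses and turn the fit
# into return levels. The data and the 95th-percentile threshold are illustrative.
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
x = rng.gamma(shape=2.0, scale=10.0, size=20 * 365)   # ~20 "years" of daily values

u = np.quantile(x, 0.95)                  # threshold
excess = x[x > u] - u                     # excesses over the threshold
xi, _, sigma = stats.genpareto.fit(excess, floc=0.0)
lam = len(excess) / 20.0                  # mean exceedances per year (Poisson rate)

def return_level(T, u, xi, sigma, lam):
    """Level expected to be exceeded once every T years on average."""
    if abs(xi) < 1e-8:                    # exponential limit of the GPD
        return u + sigma * np.log(lam * T)
    return u + (sigma / xi) * ((lam * T) ** xi - 1.0)

for T in (1, 2, 5, 10, 25, 50, 100):
    print(T, round(return_level(T, u, xi, sigma, lam), 2))
```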
At any given point in the sample space, the probability density function of a continuous random variable provides the relative likelihood that the random variable takes a value near that point [39]. The shape of the probability distribution is characterized via the L-moments, which represent linear combinations of order statistics (L-statistics) similar to conventional moments. They are used to calculate quantities analogous to standard deviation, skewness and kurtosis, and can, thus, be termed L-scale, L-skewness and L-kurtosis. Therefore, they summarize the shape of the probability distribution and are defined as below:
L-kurtosis and L-skewness, analogous to the conventional kurtosis $\frac{1}{n}\sum_{i=1}^{n}\left(\frac{x_i-\mu}{\sigma}\right)^{4}$ and skewness $\frac{1}{N}\sum_{i=1}^{N}\left(\frac{x_i-\mu}{\sigma}\right)^{3}$, where $x_i$ denotes the $i$th variable of the distribution, $\mu$ the mean of the distribution, $n$ ($N$) the number of variables in the distribution, and $\sigma$ the standard deviation of the distribution.
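For completeness, the sample L-moments themselves can be estimated from order statistics through probability-weighted moments; the following sketch shows the standard sample estimators (an illustration, not code taken from this study).

```python
# Sample L-moments via probability-weighted moments (illustration only).
import numpy as np

def l_moment_ratios(x):
    x = np.sort(np.asarray(x, dtype=float))
    n = len(x)
    i = np.arange(1, n + 1)
    b0 = x.mean()
    b1 = np.sum((i - 1) / (n - 1) * x) / n
    b2 = np.sum((i - 1) * (i - 2) / ((n - 1) * (n - 2)) * x) / n
    b3 = np.sum((i - 1) * (i - 2) * (i - 3) / ((n - 1) * (n - 2) * (n - 3)) * x) / n
    l1, l2 = b0, 2 * b1 - b0                       # L-location, L-scale
    l3 = 6 * b2 - 6 * b1 + b0
    l4 = 20 * b3 - 30 * b2 + 12 * b1 - b0
    return l1, l2, l3 / l2, l4 / l2                # mean, L-scale, L-skewness, L-kurtosis

print(l_moment_ratios(np.random.default_rng(2).normal(size=5000)))
```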
2.3. Pearson Correlation
The Pearson correlation coefficient [41,42,43,44,45], often denoted as r, is a measure of the linear relationship between two variables. It quantifies the degree to which two variables, X and Y, are linearly related, and ranges from −1 to 1, where
r = 1 indicates a perfect positive linear relationship (as X increases, Y increases proportionally);
r = −1 indicates a perfect negative linear relationship (as X increases, Y decreases proportionally);
r = 0 indicates no linear relationship between X and Y.
The Pearson correlation coefficient between two variables, X and Y, is calculated as $r = \frac{\mathrm{cov}(X,Y)}{\sigma_X \sigma_Y}$, where $\mathrm{cov}(X,Y)$ is the covariance between X and Y, and $\sigma_X$ and $\sigma_Y$ are the standard deviations of X and Y, respectively.
The covariance measures how two variables move together and is defined as $\mathrm{cov}(X,Y) = \frac{1}{n}\sum_{i=1}^{n}(x_i - \bar{x})(y_i - \bar{y})$, where n is the number of data points, $x_i$ and $y_i$ are the individual values of the variables X and Y, and $\bar{x}$ and $\bar{y}$ are the means (averages) of X and Y, respectively.
Lastly, the standard deviation of a variable X measures how spread out the values of X are and is given by $\sigma_X = \sqrt{\frac{1}{n}\sum_{i=1}^{n}(x_i - \bar{x})^2}$.
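A minimal numeric sketch of these definitions (the data values are illustrative):

```python
# Pearson correlation computed directly from the definitions above.
import numpy as np

def pearson_r(x, y):
    x, y = np.asarray(x, float), np.asarray(y, float)
    cov_xy = np.mean((x - x.mean()) * (y - y.mean()))    # covariance
    return cov_xy / (x.std() * y.std())                  # r = cov / (sigma_X * sigma_Y)

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])
print(pearson_r(x, y), np.corrcoef(x, y)[0, 1])          # the two values coincide
```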
2.4. Normalization
Normalization refers to the process of scaling variables so that they fit within a common range (e.g., [0, 1] or mean 0 and standard deviation 1). In many applications, normalization preserves the relative differences between variables but still retains their units in some form (although scaled). It is often used in data science, statistics, and machine learning, where the goal is to make variables comparable by bringing them onto the same scale [
46,
47,
48,
49]. Common approaches in normalization aiming at transforming the variables into a comparable, dimensionless format are the following:
Min–Max Normalization (Feature Scaling): Min–max normalization is a rescaling technique where variables are linearly scaled to a specific range, often [0, 1]. The formula is $x' = \frac{x - x_{\min}}{x_{\max} - x_{\min}}$, where
- x is the original value of the variable;
- $x_{\min}$ and $x_{\max}$ are the minimum and maximum values of the variable X.
Z-Score Normalization (Standardization): Z-score normalization transforms each variable by subtracting the mean and dividing by the standard deviation. The formula is $z = \frac{x - \mu}{\sigma}$, where
- x is the original value;
- $\mu$ is the mean of the variable X;
- $\sigma$ is the standard deviation of X.
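Both normalization schemes in a short sketch (values are illustrative):

```python
# Min-max and z-score normalization of a variable (illustrative values).
import numpy as np

x = np.array([5.0, 8.0, 12.0, 20.0, 3.0])

x_minmax = (x - x.min()) / (x.max() - x.min())   # rescaled to [0, 1]
x_zscore = (x - x.mean()) / x.std()              # mean 0, standard deviation 1

print(x_minmax)
print(x_zscore)
```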
4. Application and Evaluation of EVDBM
In this section we apply the EVDBM methodology to photovoltaic (PV) data taken from two different PV plants [50,51]. We follow the three-step process outlined, without strictly prescribing an optimization strategy but rather outlining a few as examples of applicability. We also first introduce some "dummy" weights in order to conclude with a final benchmarking score.
In our analysis we considered the total production in kWh of the PV farms. As the time range of interest, we considered the peak hours, which occur during midday; thus, a 3 h window from 13:00 to 16:00 was examined. The data were taken from [52,53] and acquired from two PV plants situated in different regions of Portugal, as provided by the non-profit organization Coopérnico. More descriptive statistics per plant are provided in the following sections. We analyze and apply EVA to production below the 25th percentile. The variables related to the production to be analyzed are shown in Table A1; for context, the Produzida/Production row, which refers to the dependent variable for the EVA, is provided. We also list the main manually pre-set analytical constants, namely the time range and percentile, applied in both use cases.
4.1. EVDBM Application Use Case 1: “Zarco” Data
4.1.1. Data Synthesis
Report: From a total of 21,932 data points, 3656 were related to peak-time production, with a mean production of 23.5 kWh. As suggested, the peak-time hours are analyzed; thus, the timeframe ranges between 13:00 and 16:00, when the sun is highest, below the 25th percentile of the production level. The key parameters (variables to be examined and constants) are shown in Table A1. The analysis of the examined data is described in detail in Table 1 and in Figure 2. The 25th percentile corresponds to a production level below 18.25 kWh.
4.1.2. EVA
We filtered the data to include only these timeframes. EVA was then applied using the Peaks-Over-Threshold method, focusing on data below our defined threshold. This threshold was set to capture the lowest 25% of production values, determined after statistical analysis of the data. To analyze the extreme value distribution, we fitted the data to a Generalized Pareto Distribution (GPD) model.
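A sketch of this procedure is given below; the CSV path, column names, and exact filtering convention are hypothetical placeholders rather than the code used in the study.

```python
# Sketch of the described procedure: keep the 13:00-16:00 peak window, take the
# production values below the 25th percentile, and fit a GPD to the shortfalls.
# The CSV path and column names are hypothetical placeholders.
import pandas as pd
from scipy import stats

df = pd.read_csv("zarco_production.csv", parse_dates=["timestamp"])
peak = df[df["timestamp"].dt.hour.between(13, 15)]           # hours 13, 14, 15

threshold = peak["production_kwh"].quantile(0.25)            # 25th percentile
low = peak.loc[peak["production_kwh"] < threshold, "production_kwh"]

deficits = threshold - low                                   # shortfall below threshold
xi, _, sigma = stats.genpareto.fit(deficits, floc=0.0)
print(f"threshold={threshold:.2f} kWh, GPD shape={xi:.3f}, scale={sigma:.3f}")
```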
In the fitted distribution plot, the histogram bars are barely visible above the x-axis, suggesting that the observed values closely match the fitted values and are hidden behind the PDF line. Provided the theoretical PDF accurately represents the observed histogram, this indicates an excellent fit. Return periods (in years) can be seen in Table 2.
The numbers (1, 2, 5, 10, 25, …) represent how often an event of a certain magnitude is expected to occur. For example, a 100-year-return-period event is something that, on average, we would expect to happen once every 100 years.
The return values represent the estimated magnitude of the event (e.g., production level) for each return period. The fact that these values are mostly negative suggests that we are dealing with a scenario where the focus is on low production levels or deficits. As the return period increases, the return values tend to become more negative, indicating that more extreme deficits are less frequent.
Lower and upper confidence intervals (CI) provide a range around the return value, within which the true value is expected to lie with a certain level of confidence. The confidence intervals become wider as the return period increases, indicating more uncertainty in predicting more extreme events.
One-year return period: The return value is 0.35 with a CI of [0.11, 1.66]. This means that in any given year, we can expect an extremely low production level of around 0.35 during peak hours, with a reasonable range of uncertainty between 0.11 and 1.66, and, based on our analysis, approaching 0 after 5 years.
As can be seen in the return value plot in Figure 3, the data are well fitted within the distribution; thus, the model is more reliable in predicting low production levels, allowing for more confidence in the model's predictive capabilities.
The theoretical value of 0.997 and p-value of 0.000 confirm the model's reliability in predicting the cumulative probabilities of low-production events, as shown in the plots in Figure 3, suggesting a good fit.
4.1.3. Circumstance Analysis
Applying the DISC thresholding, we extract the significant differences in correlations between the extremely low production and the normal production in the analyzed sample of peak time ranges (Table 3).
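A simplified illustration of this step is sketched below: correlation matrices are computed for the extreme and normal peak-hour subsets and only the largest shifts are retained. The column names, threshold value, and helper function are hypothetical, and this is not the exact DISC-thresholding implementation.

```python
# Illustrative correlation-shift extraction in the spirit of DISC-thresholding:
# compare correlation matrices for the extreme and normal peak-hour subsets and
# keep only shifts above a chosen threshold. Not the exact DISC implementation.
import pandas as pd

def correlation_shifts(peak: pd.DataFrame, target: str, pct: float = 0.25,
                       min_shift: float = 0.3) -> pd.Series:
    cut = peak[target].quantile(pct)
    extreme = peak[peak[target] < cut]
    normal = peak[peak[target] >= cut]

    diff = extreme.corr(numeric_only=True) - normal.corr(numeric_only=True)
    pairs = diff.where(diff.abs() >= min_shift).stack()       # keep large shifts
    keep = pairs.index.get_level_values(0) < pairs.index.get_level_values(1)
    return pairs[keep].sort_values(key=abs, ascending=False).rename("corr_shift")

# Example call (columns are hypothetical):
# correlation_shifts(peak_df, target="production_kwh")
```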
The relevant information on the extremes is then extracted to be used for benchmarking and comparison with the second use case, analyzed in the next section.
4.2. EVDBM Application Use Case 2: “João” Data
4.2.1. Data Synthesis
Report: From a total of 21,908 data points, 3656 were related to peak-time production. During the peak hours, there are 3652 data points to be analyzed, with a mean production of 23.5 kWh (Table 4). As suggested, the peak-time hours are analyzed; thus, the timeframe ranges between 13:00 and 16:00, when the sun is highest, below the 25th percentile of the production level. The key parameters (variables to be examined and constants) are shown in Table A1. The analysis of the examined data is described in detail in Table 1. The 25th percentile corresponds to a production level below 13.43 kWh (Figure 4).
4.2.2. EVA
Following the previous approach, we filter the data as per the time range under the chosen percentile and apply EVA. As can be seen in the return value plot in Figure 5, the data are well fitted within the distribution; thus, the model is more reliable in predicting low production levels, allowing for more confidence in the model's predictive capabilities. Return periods (in years) can be seen in Table 5. One-year return period: The return value is 1.05 with a CI of [0.49, 1.96]. This means that in any given year, we can expect an extremely low production level of around 1.05 during peak hours, with a reasonable range of uncertainty, significantly higher than the observed production of use case 1.
4.2.3. Circumstance Analysis
The statistical summary of weather and production data of the "João" use case can be seen in Table 6.
Applying the DISC thresholding, we extract the significant differences in correlations between the extremely low production and the normal production in the analyzed sample of peak time ranges (Table A1). As can be seen in Figure 6, significant differences are identified, for example between humidity and diffuse solar w/m2 (HNN), and between temperature and diffuse solar (HPC), for the applied significance threshold. For the Zarco data, a significant positive correlation was observed between solar w/m2 and diffuse solar w/m2. Additionally, there were significant negative correlations of diffuse solar w/m2 with humidity and cloud cover.
In the case of the João data, a positive correlation was found between solar w/m2 and diffuse solar w/m2. Furthermore, diffuse solar w/m2 showed negative correlations with humidity and WindspeedKmph. The differences in the magnitude of correlations between Zarco and João are evident: for example, "Solar w/m2" and "Diffuse Solar w/m2" have a stronger correlation in Zarco than in João. Despite these differences, the DISC-thresholding algorithm isolates significant changes relative to their respective normal scenarios. This ensures that the correlation structure is normalized within each farm's dataset, making it comparable across farms during the weight-shifting optimization process.
4.3. Benchmarking
In this section we extract the benchmarking score applying the methodology described in the previous section and plot the results to highlight the differences between the two examined scenarios.
Figure 7 shows that, for relatively similar cases, the per-year distribution is about the same using the EVA-Driven Weighted Benchmarking Algorithm.
We applied grid-search weight shifting to define the optimal weights, presented in Table 7. Solar radiation is given the highest weight because of its dominant influence on PV performance. Cloud cover and diffuse solar radiation are also heavily weighted, reflecting their significant roles, especially during low-production periods. Temperature, windspeed, and humidity have lower weights, as their impacts are secondary but still important to consider. The normalized, weighted, and scaled benchmarking scores, following the methods described, are plotted in Figure 8 and Figure 9.
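A simplified grid-search weight-shifting sketch is given below; the alignment objective (absolute correlation of the weighted circumstance score with the EVA-dependent variable) and all names are simplified stand-ins for the EVDBM formulation, not the exact implementation.

```python
# Simplified grid-search weight shifting: pick the weight vector (summing to 1)
# whose weighted, min-max-normalized circumstance score aligns best with the
# EVA-dependent variable. The objective and names are simplified stand-ins.
import itertools
import numpy as np
import pandas as pd

def grid_search_weights(X: pd.DataFrame, y: pd.Series, step: float = 0.1):
    cols = list(X.columns)
    Xn = (X - X.min()) / (X.max() - X.min())
    yn = (y - y.min()) / (y.max() - y.min())

    grid = np.arange(0.0, 1.0 + 1e-9, step)
    best_w, best_score = None, -np.inf
    # Note: the grid grows as len(grid) ** len(cols); fine for a handful of variables.
    for w in itertools.product(grid, repeat=len(cols)):
        if not np.isclose(sum(w), 1.0):
            continue
        score = abs(np.corrcoef(Xn @ np.array(w), yn)[0, 1])
        if score > best_score:
            best_w, best_score = dict(zip(cols, np.round(w, 2))), score
    return best_w, best_score
```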
A high benchmarking score suggests that the combination of environmental conditions has a greater impact on driving the system toward extremely low production, i.e., that the PV plant is more sensitive or vulnerable to these environmental factors under extreme conditions. A higher score indicates that extremely low production is likely to occur when these specific conditions are met, showing a greater dependency on, or sensitivity to, adverse conditions and, thus, less resilience to environmental variations.
Since B2 has the highest EVDBM score, it is clear that B2 (the João plant) is more sensitive to adverse conditions and less resilient against fluctuations in the related circumstances.
4.4. Statistical Validation and Significance Testing
To assess whether the performance gains achieved by the proposed Extreme Value Dynamic Benchmarking Method (EVDBM) were statistically significant relative to the baseline Extreme Value Analysis (EVA), a comprehensive statistical validation was conducted.
Three complementary tests were applied: (i) Fisher’s r-to-z transformation to examine correlation structure differences; (ii) paired t-tests on absolute residuals to evaluate average error reductions; and (iii) Kolmogorov–Smirnov (KS) tests to compare the distributional characteristics of model residuals.
Fisher’s r-to-z transformation compares the correlation coefficients between observed and predicted values to test whether EVDBM modifies the dependency structure captured by EVA.
Paired t-test on absolute residuals assesses whether EVDBM achieves a statistically significant reduction in average prediction error magnitude compared to EVA.
Kolmogorov–Smirnov (KS) test evaluates whether the distributional form of residuals differs between models, indicating potential systematic bias or heteroscedasticity.
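These three tests can be reproduced with standard scipy routines, as sketched below; the residual arrays are synthetic placeholders, and the Fisher comparison treats the two correlations as independent, which is a simplification.

```python
# The three validation tests with standard scipy routines. The residual arrays
# are synthetic placeholders, and the Fisher comparison below treats the two
# correlations as independent, which is a simplification.
import numpy as np
from scipy import stats

def fisher_r_to_z(r1, n1, r2, n2):
    """Two-sided z-test for the difference between two Pearson correlations."""
    z1, z2 = np.arctanh(r1), np.arctanh(r2)
    z = (z1 - z2) / np.sqrt(1 / (n1 - 3) + 1 / (n2 - 3))
    return z, 2 * stats.norm.sf(abs(z))

rng = np.random.default_rng(3)
obs = rng.normal(size=200)
pred_eva = obs + rng.normal(scale=0.5, size=200)
pred_evdbm = obs + rng.normal(scale=0.4, size=200)

# (i) correlation-structure difference
r_eva = np.corrcoef(obs, pred_eva)[0, 1]
r_evdbm = np.corrcoef(obs, pred_evdbm)[0, 1]
print(fisher_r_to_z(r_evdbm, len(obs), r_eva, len(obs)))

# (ii) paired t-test on absolute residuals
print(stats.ttest_rel(np.abs(obs - pred_evdbm), np.abs(obs - pred_eva)))

# (iii) KS test on the residual distributions
print(stats.ks_2samp(obs - pred_evdbm, obs - pred_eva))
```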
Each model variant (EVDBM, EVDBM with grid-search weight optimization, and EVDBM with Bayesian weighting) was compared against the EVA baseline using these tests across two photovoltaic datasets: Zarco and João. In addition to classical error metrics (MAE, MSE), a correlation-based R² metric was computed as the squared Pearson correlation between standardized observed and predicted extreme values, capturing shape similarity rather than variance alone (Table 8).
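As described, this correlation-based R² is simply the squared Pearson correlation between standardized observed and predicted extremes; a minimal sketch:

```python
# Correlation-based R^2: squared Pearson correlation between standardized
# observed and predicted extreme values (minimal sketch).
import numpy as np

def corr_r2(observed, predicted):
    zo = (observed - np.mean(observed)) / np.std(observed)
    zp = (predicted - np.mean(predicted)) / np.std(predicted)
    return np.corrcoef(zo, zp)[0, 1] ** 2
```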
For the Zarco dataset, the results show that EVDBM achieved marginal improvements in mean absolute error (−2.6%) compared with EVA, although none of the differences were statistically significant across all tests. The correlation structure between observed and modeled extremes remained nearly identical, suggesting that the dataset's limited tail variability constrains detectable performance differences.
In contrast, for the João dataset, all EVDBM variants consistently reduced prediction errors (MAE and MSE) and achieved higher correlation-based R² values relative to EVA. Paired t-tests indicated statistically significant reductions in absolute error for EVDBM, EVDBMweights, and EVDBMbayes, confirming that the improvements are not due to random variation. Fisher's r-to-z and KS tests yielded non-significant results, implying that the improvements stem from lower error magnitudes rather than a change in the correlation or residual distributional structure. These findings support the robustness of EVDBM under datasets exhibiting stronger tail behavior, such as João, where the optimization and weighting mechanisms lead to meaningful predictive gains. The results also emphasize that, while the performance gains may appear modest in absolute terms, their statistical significance indicates enhanced reliability of the model's predictions under extreme conditions. Consequently, EVDBM can be viewed as a more stable and generalizable framework for extreme photovoltaic event modeling, with performance that scales with the degree of nonlinearity and tail complexity in the data.
To complement the statistical tests of significance, a distributional analysis was performed by comparing the quantile-to-quantile alignment between observed and modeled extremes (Figure 10). This visualization provides an interpretable view of how well each model reproduces the empirical distribution of photovoltaic extremes across quantile levels. The corresponding correlation-based R² values for both datasets are summarized in Table 9, confirming the consistency of EVDBM performance across quantiles.
Figure 10 illustrates the empirical–model quantile correspondence for the João and Zarco datasets. The black curve denotes the observed empirical quantiles, while the colored lines correspond to modeled quantiles from EVA and the proposed EVDBM variants. For both datasets, EVDBM and its optimized variants (weighting and Bayesian optimization) demonstrate visibly improved tail alignment relative to EVA, with R² increasing from 0.90 to 0.93 for Zarco and from 0.95 to 0.98 for João (Table 9). This improvement indicates that EVDBM more accurately reproduces the empirical shape of the extreme value distribution, particularly in the upper 5% quantile region.
4.5. Ablation and Computational Complexity Analysis
To contextualize the performance evolution of the proposed framework, a structured ablation comparison was performed. The baseline Extreme Value Analysis (EVA) captures statistical extremes from univariate distributions without contextual weighting or multi-factor consideration. The proposed Extreme Value Dynamic Benchmarking Method (EVDBM) introduces multi-variable benchmarking via weighted normalization of related circumstances (e.g., solar irradiance, humidity, windspeed), while the EVDBM + XAI variant extends the model with explainable scoring metrics and correlation heatmaps to guide model-driven decision support.
The ablation sequence (Table 10) shows a steady increase in both distributional alignment and interpretability. While EVA provides only statistical tail estimates, EVDBM incorporates contextual dependencies through multivariate weighting, yielding consistent performance improvement in both datasets.
EVDBM + XAI further introduces an interpretable layer that integrates benchmarking scores with correlation-based diagnostics, enabling domain experts to understand the drivers of extreme deviations. The incremental increase in complexity is minimal compared to the added decision-making capacity.
From a computational perspective (Table 11), the core EVDBM process scales linearly with the number of contextual variables (m) and optimization steps (k) in grid or Bayesian search. Even in the full XAI configuration, runtime remains within approximately 2.3× the baseline EVA, maintaining suitability for real-time or near-real-time applications. The additional interpretability layer introduces only a marginal increase in memory footprint due to correlation mapping and feature attribution visualization.
5. Discussion of Results
Compared to baseline EVA, EVDBM introduces a dynamic weighting and benchmarking mechanism that captures multivariate dependencies among related variables (e.g., irradiance, humidity, temperature, and wind). Through grid search and Bayesian optimization, EVDBM adaptively adjusts variable weights to optimize the alignment between observed and modeled extremes. Across both test sites (Zarco and João), the correlation-based R² rose from 0.90 to 0.93 for Zarco and from 0.95 to 0.98 for João (Table 9).
The quantile-to-quantile comparison plots (Figure 10) further highlight this improvement, illustrating how EVDBM and its optimized variants more accurately reproduce empirical tail distributions, particularly in the upper 5% quantile region. These enhancements demonstrate that EVDBM is not merely statistically sound but also distributionally consistent, yielding smoother and more stable projections across extreme-value quantiles.
The EVDBM framework was statistically validated using multiple significance tests (Table 8). For both datasets, Fisher's r-to-z, paired t-tests, and Kolmogorov–Smirnov (KS) tests were applied to compare EVDBM variants against EVA. In the João dataset, the EVDBM variants achieved statistically significant reductions in residual error, confirming the robustness of the improvement. The Zarco dataset, characterized by smoother and less volatile extremes, showed stable but not statistically significant gains, indicating that EVDBM preserves accuracy under more stationary conditions. KS tests confirmed that EVDBM did not distort the underlying distributional structure, maintaining statistical consistency with empirical residuals across all variants. These findings reinforce EVDBM's robustness and generalizability, demonstrating that the model enhances tail fidelity and interpretability without compromising distributional integrity.
Although bootstrapping was initially implemented as part of the statistical validation process, it was later removed due to sampling imbalance between EVA-generated return values and EVDBM benchmarking points. The unequal sampling density introduced bias in the resampling procedure, particularly when aligning extreme quantiles across the two formulations. Given this limitation, the presented statistical tests (Fisher's r-to-z, paired t-test, and KS test) provide a more stable and interpretable basis for assessing model significance. Future work will explore bootstrapped cross-validation under equalized sampling conditions to further quantify uncertainty and robustness across multiple extreme value regimes.
The ablation analysis (Table 10) shows that the progression from EVA to EVDBM to EVDBM + XAI yields cumulative gains in interpretability, predictive stability, and decision value. While EVA efficiently estimates univariate extremes, it lacks explanatory capability. EVDBM extends this by introducing weighted benchmarking across correlated drivers, enhancing both accuracy and contextual understanding. The XAI-enhanced variant (EVDBM + XAI) further integrates DISC-based correlation maps and ABCDE heatmap scoring, offering visual interpretability of the factors driving extreme behavior. Together, these layers allow EVDBM not only to predict extreme events but also to explain and justify their occurrence, an essential capability for informed decision-making in energy systems.
5.1. Conclusions
The DISC-thresholding algorithm identifies and isolates significant shifts in variable correlations between normal and extreme scenarios. This provides interpretability, as it allows domain experts to understand which relationships between variables (e.g., “Solar w/m2” and “Diffuse Solar w/m2”) become more critical under extreme conditions. Also, it provides context-specific insights; by highlighting variable dependencies within each farm, DISC enables the identification of drivers of extreme behavior, even when datasets differ structurally.
Identical weights result from shared structural dependencies (e.g., “Solar w/m2” and “Diffuse Solar w/m2”) highlighted by DISC. This demonstrates that while the magnitude of correlation shifts differs, the underlying variables driving extreme behavior remain consistent. DISC helps explain why the same variables dominate the optimization process across datasets: they exhibit the most significant and meaningful changes under extreme conditions.
Global and Local XAI in EVDBM
Global Explainability: DISC and grid search explain which variables (e.g., “Solar w/m2” and “Diffuse Solar w/m2”) are critical across all farms, offering insights into general drivers of extreme (low) production levels.
Local Explainability: Heatmaps from DISC provide farm-specific insights, explaining unique correlation patterns (e.g., the stronger impact of “Humidity” in Zarco compared to João).
DISC ensures transparency by producing intuitive and interpretable outputs, such as heatmaps and correlation shifts, alongside the associated benchmarking scores with optimized weights. These outputs provide clear and actionable explanations for extreme production behaviors, enhancing the overall explainability of the framework. The ability to interpret and trust extreme event predictions is critical for energy grid operators, policymakers, and infrastructure planners. EVDBM ensures that predictions are not only accurate but also interpretable, enabling better real-time responses to energy fluctuations and long-term strategic planning.
The outputs of EVDBM directly align with ongoing EU and national policy frameworks that emphasize resilience assessment, risk-aware planning, and explainable energy analytics. In particular, the framework supports policy objectives outlined in the European Green Deal and the EU Strategy on Adaptation to Climate Change, which call for transparent, data-driven tools capable of quantifying vulnerability and improving system resilience under extreme environmental conditions. By providing interpretable benchmarking indicators—such as DISC-based correlation maps, resilience indices, and dynamic return-level evaluations—EVDBM delivers actionable intelligence for grid operators, regulators, and policymakers. These outputs facilitate compliance with energy resilience directives and assist in defining evidence-based investment strategies, infrastructure priorities, and early-warning thresholds for critical energy assets. In doing so, EVDBM bridges the gap between technical analysis and operational policy needs, promoting a measurable, explainable foundation for risk-informed decision-making in renewable energy systems.
5.2. Limitations and Future Work
The main limitations of this work to be addressed are the following:
Sensitivity to Data Quality and Availability: The accuracy of EVDBM depends on the quality, frequency, and representativeness of historical data, especially under extreme conditions. Sparse or incomplete recordings of rare events can lead to biased parameter estimation and less reliable benchmarking scores. This limitation is particularly relevant for small-scale or newly commissioned systems, where the empirical tail behavior is still underdeveloped.
Partial Assumption of Stationarity: While the model accounts for dynamic relationships through the DISC-thresholding and adaptive weighting mechanisms, it still assumes partial stationarity in the long-term behavior of the underlying variables. Structural shifts—such as climate trends, technological upgrades, or policy interventions—can alter the probability and intensity of future extremes, potentially reducing the validity of historical benchmarks.
Dependence on Model Calibration and Optimization: The performance of EVDBM is influenced by the configuration of its optimization routines (grid or Bayesian search). Inadequate tuning ranges or limited computational budgets may lead to suboptimal weight identification, especially in high-dimensional correlation spaces. Although this is mitigated by the explainable design, calibration sensitivity remains a factor in achieving consistent performance.
Interpretability–Performance Trade-off: The inclusion of the DISC and XAI layers enhances transparency but increases model complexity and the number of intermediary parameters. In certain scenarios with low variance or weak correlations, these layers may add interpretative noise rather than clear explanatory value.
Computational Overhead for Real-Time Deployment: Although EVDBM maintains near-linear computational scaling, the additional explainability modules (DISC computation, correlation heatmaps, and ABCDE scoring) introduce extra processing steps. This can be limiting in real-time or embedded energy forecasting systems with strict latency requirements.
Scope of Validation: The present evaluation focused on two photovoltaic datasets. Broader validation across domains (e.g., hydrology, finance, or climate risk) and larger, higher-resolution datasets is required to confirm generalizability and to refine the sensitivity of the statistical significance tests.
In our future work, we intend to expand the application of EVDBM within the energy sector, particularly in areas where highly specific decision-making can optimize energy management and resilience planning. This includes forecasting extreme fluctuations in renewable energy generation, optimizing grid balancing strategies, and improving predictive maintenance for photovoltaic (PV) systems. Additionally, we are working on integrating the EVDBM algorithm for scenario-based analysis, enabling what-if simulations and advanced risk-management applications in energy trading, storage optimization, and climate impact assessments. By incorporating a more dynamic approach to extreme event prediction, we aim to further enhance adaptive energy strategies and long-term sustainability planning. Future research will extend this framework through comparative evaluations against other predictive and analytical paradigms, including quantile regression, Generalized Pareto and Bayesian hierarchical models, and machine-learning-based extreme value predictors. Additional work will explore the integration of non-stationary EVA formulations, uncertainty propagation through Bayesian updating, and hybrid deep learning–EVA architectures for dynamic forecasting. Moreover, expanding the validation of EVDBM across multi-domain datasets—such as climate risk, hydrology, and financial extremes—will help assess its generalizability and refine its explainability under diverse operational conditions.