3.4. EMY Validation
The EMY model validation is carried out comparing V
oc results against the V
oc levels obtained from the real data of each project presented in
Table 4, as well as against the V
oc levels provided by the TMY model, comparing the performance of both models to predict maximum voltage levels in PV systems.
As the first step, maximum V
oc levels are evaluated, comparing values obtained through EMY100 and TMY models against SCADA real data for each project, which results are represented in
Figure 22:
Figure 22 provides monthly V
oc maximum values, obtained through the SCADA real data of each project, TMY, and EMY100 datasets, respectively. The analysis shows that the TMY-based estimations generally tend to underestimate the real V
oc levels. This underestimation is particularly pronounced during high-irradiance periods, which are critical from a system design perspective. In some cases, the deviation exceeded 5%, potentially leading to conservative sizing of inverters and transformers that might not reflect real peak conditions. In contrast, the EMY100 model demonstrates a more conservative estimation, with voltage values consistently exceeding or closely matching SCADA measurements. While this might appear as an overestimation under typical conditions, such a tendency is advantageous in the context of safety designs. EMY100 reduces the likelihood of underrated designs by capturing upper-bound operational scenarios and ensuring more robust system performance under extreme environmental conditions.
The comparative results across all seven projects included in
Figure 22 show that EMY100 offers better alignment with the upper voltage ranges observed in the real datasets, while maintaining manageable deviation margins. This confirms EMY100 as a reliable tool for risk-averse system planning, especially when accurate modeling of worst-case scenarios is required. In summary, although TMY remains a valid tool for long-term energy yield assessment, its application in electrical design must be approached with caution. The EMY100 model, with its emphasis on extremes, offers a safer alternative for voltage prediction, minimizing the risk of undervaluing critical parameters that influence the stability and durability of photovoltaic installations.
After the monthly evaluation, the analysis is focused on the maximum V
oc levels for each project; EMY is implemented through different percentiles between EMY100 and EMY10 to validate the complete behavior of the new model.
Table 5 includes the maximum V
oc levels obtained through SCADA, TMY, and different EMY percentiles, providing a powerful view of EMY model performance.
The results provided in
Table 5 highlight not only the relative accuracy of each model, but also the configurability and flexibility of the EMY approach. In five out of seven projects, the SCADA-based V
oc level exceeds the values predicted by the TMY model. This consistent underestimation by TMY emphasizes its limitations when assessing voltage extremes. By contrast, the EMY100 model provided values that were higher than the SCADA maximums in six of the seven projects, closely aligning with real operational peaks. The unique case where SCADA value slightly exceeds EMY100 (Project 1), the difference is minimal (less than 4 V, or under 0.3% deviation), suggesting a strong capacity of EMY100 to capture critical voltage scenarios without excessively overestimating.
A particularly valuable feature of the EMY model is its percentile-based tunability, allowing users to select different confidence levels depending on design priorities. As the percentile decreases from EMY100 to EMY10, a gradual reduction in maximum voltage can be observed across all projects. This trend demonstrates that the EMY model is not only reliable for extreme-case scenarios but also adaptable for probabilistic or risk-based design criteria. For example, a system designer prioritizing extreme safety margins might opt for EMY100 or EMY90, while a designer focused on cost-efficiency with moderate risk tolerance may use EMY50 or EMY40. This configurability provides a strategic advantage over static models like TMY, enabling a better balance between robustness and optimization. The EMY model effectively acts as a modular tool, aligning with international best practices that increasingly favor flexibility in probabilistic system modeling.
The evaluation of maximum V
oc levels per day is carried out, and results are provided through
Table 6 and
Table 7. The first one provides number of days in which V
oc levels obtained from the SCADA system are higher than each model, from TMY to EMY10, whereas the second one includes the maximum deviation between SCADA V
oc values and each model. These indicators reflect both the frequency and severity of model performance, which are crucial when assessing risk margins in string sizing.
The TMY model consistently shows the highest frequency of underestimation across all projects, with the real voltage surpassing TMY predictions on an average of over 180 days per year. For ins·tance, Projects 4 and 5 experienced exceedances on 308 and 288 days respectively, representing 84% and 79% of the year. Simultaneously, the maximum deviations for TMY were the largest in nearly every project, reaching up to 150.24 V in Project 6 and 96.11 V in Project 5. These results highlight the systematic underestimation risk when relying solely on TMY datasets, potentially leading to unsafe system designs. However, the EMY100 model demonstrates significantly improved performance, with both lower exceedance frequency and reduced voltage gaps. The number of days when SCADA readings exceeded EMY100 predictions dropped significantly, to as few as 1–3 days in Projects 6 and 7, staying below 50 days in all but one project. Moreover, the maximum deviations were consistently modest, typically under 30 V, with most projects falling in the 15–25 V range. This indicates that EMY100 is capable of reliably capturing near-extreme voltage values while preserving realistic margins, making it a conservative yet accurate design reference.
The EMY model offers a tunable structure, where lower percentiles (e.g., EMY90, EMY80, etc.) provide a spectrum of design options. As the percentile decreases, both the frequency of exceedance and the maximum deviation naturally increase, offering a configurable trade-off between system robustness and cost-efficiency. For instance, moving from EMY100 to EMY50 increases exceedance days (e.g., from 21 to 105 in Project 1) and deviation (from 20.06 V to 58.60 V), yet still remains substantially more reliable than TMY in most scenarios.
While the EMY model is primarily conceived as a design tool to ensure PV system safety margins, particularly under maximum voltage scenarios covered by EMY100, it is also relevant to assess its consistency in different operational conditions, adjusting the desired percentile. Thus, a RMSE analysis was performed, comparing each EMY percentile model and TMY against V
oc levels obtained from the SCADA real data (
Table 8). RMSE provides a quantitative measure of how closely the predicted daily voltages match observed values over the evaluation year.
The analysis reveals that no single EMY percentile consistently minimizes the RMSE across all projects, underscoring the inherent variability of this type of assessment. For instance, while EMY40 and EMY30 provide the lowest RMSE in Projects 1 and 7, EMY80 and EMY90 perform better in Projects 3 and 4, and EMY70–60 range proves most accurate for Projects 5 and 6. This lack of uniformity reflects a fundamental characteristic of the comparison: the EMY and TMY datasets are constructed from multi-year historical climatology, whereas the SCADA data correspond to a specific year. As such, perfect alignment is neither expected nor indicative of model quality in this context.
To further interpret the RMSE behavior of the EMY model across different percentiles, we examined the results in conjunction with the climatic and geographic characteristics of each project (
Table 4). This approach explores whether certain EMY percentiles align better with specific environmental profiles, such as temperature range or irradiance, thus providing insight into the contextual suitability of each percentile level.
Projects 1, 2, 3, and 5 share relatively similar climatic patterns: wide annual temperature ranges (minimums below −5 °C and maximums above 40 °C) and high global tilted irradiation (GTI > 530 W/m2). Project 1 (best RMSE: EMY40–30) and Project 3 (best RMSE: EMY30) show lower RMSE values at mid-to-low percentiles, suggesting that under highly variable conditions, these percentiles better track the distribution of daily voltages. Project 2, however, has relatively high RMSE across all EMYs, with no clear advantage over TMY. This may indicate local anomalies or operational behaviors not well captured by climatological models. Project 5, with the lowest average temperature (15.68 °C) and moderate irradiance, sees lowest RMSE at EMY70 and EMY80, suggesting that in colder locations with slightly lower GTI, more conservative (higher percentile) EMYs better reflect measured voltages.
Project 4 exhibits a narrower temperature range (8.4–29.8 °C) and very high GTI (622.5 W/m2). Interestingly, this project shows its lowest RMSE at EMY80 and EMY90 (8.04 and 8.27 V), outperforming both TMY and EMY100. This indicates that in stable climates with minimal seasonal variation, higher EMY percentiles may better match real operating behavior due to limited extreme events, allowing more accurate forecasting with lower safety margins.
Project 6 is characterized by the lowest irradiance (GTI = 308.5 W/m2) and coolest conditions (average ambient temperature of 11.69 °C). Notably, none of the EMY percentiles achieves a particularly low RMSE, with values ranging from 32.63 to 46.89 V. TMY performs best in this case (RMSE = 42.34 V), though still with a large error. This suggests that in low-irradiance, temperate climates, both EMY and TMY models may struggle to represent daily variability effectively, potentially due to frequent cloud cover and stochastic irradiance patterns not well captured in averaged models.
Project 7, with the highest average temperature (27.32 °C) and narrow thermal range, demonstrates its lowest RMSE at EMY10 (7.14 V) and EMY30 (7.60 V), even outperforming TMY (7.98 V). In tropical environments with high baseline irradiance and limited fluctuation, lower EMY percentiles may suffice, as daily maximum voltages are more predictable and rarely extreme. This aligns with the observation that conservative EMYs (e.g., EMY100) may overestimate voltage in stable tropical zones, leading to unnecessarily high RMSE.
This contextual analysis reveals that the EMY percentile providing the lowest RMSE is not uniform across projects, and appears to depend on local climatic conditions as follows:
High variability climates tend to favor mid-percentile EMYs (30–70).
Stable tropical climates perform best with lower percentiles (EMY80–10).
Low irradiance and temperate zones show no strong alignment, indicating limited model resolution under high atmospheric variability.
These findings reaffirm that the EMY model’s strength lies not in minimizing RMSE, but in offering a configurable framework that adapts to project-specific design risk profiles. While RMSE helps assess operational alignment, percentile selection should ultimately be guided by engineering conservatism and environmental predictability, not solely by statistical error.
Finally, to quantify the performance of the EMY100 approach relative to the conventional TMY method, an accuracy analysis was conducted using the maximum V
oc values from SCADA measurements of the seven utility-scale PV projects considered in this study. The evaluation focused on two metrics: the Absolute Percentage Error (APE) and the Absolute Error (AE, in volts) for both datasets, and the results are presented in
Table 9. This comparison directly assesses the capability of each approach to replicate real maximum V
oc values, which are critical for string sizing in large-scale PV systems.
Table 9 shows that EMY100 consistently delivers lower APE and AE values than the conventional TMY method. On average, the APE decreases from 1.00% with TMY to 0.61% with EMY100, corresponding to a relative reduction of 38.6% in percentage error. In absolute terms, the AE drops from 14.70 V to 8.63 V, representing a 41.3% reduction. These reductions are highly relevant in engineering practice, as even small voltage deviations can cause overvoltage events that exceed design or regulatory limits in high-power PV plants. The mean reduction of more than 6 V achieved by EMY100 translates into a significantly lower likelihood of surpassing the maximum allowable DC voltage, supporting both operational safety and compliance.