Super-Resolution for Renewable Energy Resource Data with Wind from Reanalysis Data and Application to Ukraine

Benton, Brandon N.; Buster, Grant; Pinchuk, Pavlo; Glaws, Andrew; King, Ryan N.; Maclaurin, Galen; Chernyakhovskiy, Ilya

doi:10.3390/en18143769

Open AccessArticle

Super-Resolution for Renewable Energy Resource Data with Wind from Reanalysis Data and Application to Ukraine

by

Brandon N. Benton

^*

,

Grant Buster

,

Pavlo Pinchuk

,

Andrew Glaws

,

Ryan N. King

,

Galen Maclaurin

and

Ilya Chernyakhovskiy

National Renewable Energy Laboratory, Golden, CO 80401, USA

^*

Author to whom correspondence should be addressed.

Energies 2025, 18(14), 3769; https://doi.org/10.3390/en18143769

Submission received: 9 June 2025 / Revised: 7 July 2025 / Accepted: 15 July 2025 / Published: 16 July 2025

Download

Browse Figures

Review Reports Versions Notes

Abstract

With a potentially increasing share of the electricity grid relying on wind to provide generating capacity and energy, there is an expanding global need for historically accurate, spatiotemporally continuous, high-resolution wind data. Conventional downscaling methods for generating these data based on numerical weather prediction have a high computational burden and require extensive tuning for historical accuracy. In this work, we present a novel deep learning-based spatiotemporal downscaling method using generative adversarial networks (GANs) for generating historically accurate high-resolution wind resource data from the European Centre for Medium-Range Weather Forecasting Reanalysis version 5 data (ERA5). In contrast to previous approaches, which used coarsened high-resolution data as low-resolution training data, we use true low-resolution simulation outputs. We show that by training a GAN model with ERA5 as the low-resolution input and Wind Integration National Dataset Toolkit (WTK) data as the high-resolution target, we achieved results comparable in historical accuracy and spatiotemporal variability to conventional dynamical downscaling. This GAN-based downscaling method additionally reduces computational costs over dynamical downscaling by two orders of magnitude. We applied this approach to downscale 30 km, hourly ERA5 data to 2 km, 5 min wind data for January 2000 through December 2023 at multiple hub heights over Ukraine, Moldova, and part of Romania. With WTK coverage limited to North America from 2007–2013, this is a significant spatiotemporal generalization. The geographic extent centered on Ukraine was motivated by stakeholders and energy-planning needs to rebuild the Ukrainian power grid in a decentralized manner. This 24-year data record is the first member of the super-resolution for renewable energy resource data with wind from the reanalysis data dataset (Sup3rWind).

Keywords:

machine learning; downscaling; wind energy; ERA5; wind toolkit

1. Introduction

With the potential increase in wind energy in the power system, high-resolution spatiotemporal wind resource datasets are becoming increasingly important [1,2]. These historically accurate meteorological data are invaluable for ensuring resource adequacy [3], reliable system operations [3], well-functioning electricity markets [3], and more. These applications require wind resource data that capture detailed meteorological processes (i.e., processes occurring at ≤3 km and sub-hourly resolutions [4]). Although these data are vital to the success of future investment in wind energy, said data are difficult to produce and rarely available [2]. Purchasing high-resolution time series wind resource data can be costly for large geographic extents covering a long-term historical record, and generating regional or national high-resolution datasets can be expensive in terms of both labor hours and computational costs. Furthermore, the uncertainty of existing high-resolution wind resource data is often not quantified, nor is the data extensively validated against observations [1,2].

The most common approach for generating high-resolution historical meteorological datasets is downscaling global reanalysis data. The downscaling techniques can be roughly separated into two groups: dynamical or statistical downscaling. Dynamical downscaling uses regional climate models or numerical weather prediction models, with lower-resolution meteorological data as lateral boundary conditions, to perform direct numerical simulations of high-resolution fields. Statistical downscaling generates high-resolution fields by applying previously identified statistical relationships between large-scale and small-scale content. Statistical downscaling is computationally efficient but fails to resolve important small-scale features [5]. Dynamical downscaling provides more realistic dynamics, especially over complex terrain, but can be prohibitively expensive to perform over large regions and time periods [6]. Additionally, producing high-fidelity output can require meticulously tailoring dynamical downscaling simulation configurations to the specific application [7,8,9].

1.1. Previous Work

Historically, most spatiotemporally continuous high-resolution wind data have been generated with dynamical downscaling [4,10,11,12] or statistical downscaling [13,14,15]. As mentioned previously, dynamical downscaling leverages non-linear numerical weather prediction, like the Weather Research and Forecasting System [16], to generate highly accurate outputs but can require resources that make large-scale data generation infeasible [17]. Dynamical downscaling can also require extensive tuning to select the best physics schemes and constant reinitializations of the simulations to ensure limited drift from the boundary conditions [10,11,18]. There are numerous statistical downscaling methods such as localized constructed analogs (LOCA) [19], combined bias correction with spatial disaggregation [20], and Bartlett–Lewis rectangular pulse models [21]. These methods can be significantly faster than dynamical downscaling but can also fail to simultaneously capture short-time and fine-scale spatial dynamics essential for accurate downstream modeling for energy systems [5].

The intersection of deep learning and meteorological modeling is an active area of research, with promising developments specifically regarding weather forecasting [22,23,24,25]. Machine learning methods are being adopted by established forecasting centers and will soon play an integral role in operational predictions [26]. However, research in deep learning-based downscaling, also called super-resolution, is less active, especially when it comes to fully gridded spatiotemporal wind speed downscaling. Existing research on machine learning applications to downscaling is mostly focused on pointwise spatial enhancement [27,28] with regression methods [29,30], and often for less dynamic fields like precipitation and temperature rather than wind speed [28,30,31,32,33]. When wind fields are downscaled, they typically provide a coarse sampling of over a kilometer along the vertical dimension or a single near-surface field [27,28,34]. However, wind energy modeling applications such as the Renewable Energy Potential model [35] require a finer sampling of near-surface wind fields over typical wind turbine height (s).

Super-resolution leverages deep convolutional networks with various model architectures, such as UNets, CNNs, and GANs [36,37,38,39]. GANs, in particular, have demonstrated superior performance over these standard regression models in generating more realistic spatial structures [40,41]. This previous work on super-resolution has relied on the assumption that coarsened or averaged, high-resolution data are a good approximation for low-resolution data [37,40,42,43,44]. We previously used GANs trained in this way to downscale wind data over Southeast Asia [45]. While this assumption can lead to excellent results, our approach instead uses low-resolution European Centre for Medium-Range Weather Forecasting Reanalysis v5 (ERA5) input data paired with high-resolution dynamical downscaling outputs as target data. We also include multiple low-resolution variables in training, which are not super-resolved, solely to better inform the enhancement of the high-resolution outputs and improve model generalization. This aligns with the process of dynamical downscaling, which uses multiple variables from low-resolution simulation output as boundary conditions for high-resolution simulations. We show that by training with separate low-resolution and high-resolution simulation data in this way, we achieved performance comparable to dynamical downscaling.

1.2. Overview

In this work, we present a novel deep learning-based spatiotemporal downscaling approach using generative adversarial networks (GANs). These networks were trained with pairs of low-resolution simulation data from ERA5 and high-resolution simulation data from the Wind Integration National Dataset Toolkit (WTK) [10]. This differs from previous approaches [45], which use coarsened high-resolution data as the low-resolution training data. With this paired training approach, models learn a transformation closer to dynamical downscaling instead of an un-coarsening operation. Since true low-resolution simulations (ERA5) can differ significantly from coarsened dynamically downscaled data, this approach leads to more accurate and physically realistic outputs. Additionally, when training on coarsened high-resolution data, low-resolution training features must be available in the high-resolution data. By pairing ERA5 and WTK, additional low-resolution training features can be included, which enables models to learn a more robust relationship between the low-resolution climate representation and high-resolution outputs. Fully trained models can then generate accurate high-resolution data from low-resolution input [40] orders of magnitude faster than conventional dynamical downscaling. These models can be deployed for new regions without additional tuning. This deployment is faster and simpler in practice than dynamical downscaling, without many of the logistical difficulties involved, like consistent reinitializations.

We demonstrate the performance of our approach across out-of-sample regions in North America and over Ukraine, Moldova, and parts of Romania. While the primary focus was Ukraine, surrounding areas were included to preserve a rectangular grid. Results from a broad suite of performance measures show excellent fidelity with observations across diverse regions with complex terrain and with underlying physics of dynamically downscaled data. We downscale and make publicly available 24 years (2000–2023) of 30 km, hourly ERA5 to 2 km, 5 min resolution data over this region. With WTK coverage limited to North America from 2007–2013, this is a significant geographic generalization. This 24-year data record is the first member of the super-resolution for renewable energy resource data with wind from the reanalysis data dataset (Sup3rWind). The focus on Ukraine was motivated by stakeholders and energy-planning needs to rebuild the Ukrainian power grid in a decentralized manner after the conflict with Russia. At the end of 2024, the Ukrainian power grid had lost more than 50% of pre-war capacity [46], with nearly 90% of wind power capacity out of operation [47]. However, the high resilience of decentralized generation from wind, strong policy support, and international investment continue to drive more construction [48].

This paper is organized as follows. In Section 2.1 and Section 2.2, we describe the general problem of downscaling, define the notation used throughout the paper, and discuss the numerous data sources used in this work. In Section 2.3, Section 2.4, Section 2.5 and Section 2.6, we cover our GAN model setup, model training, bias correction, and use of the model in inference. In Section 3.1 and Section 3.2, we look at the physical performance of our downscaling results across various performance measures and compare the results against observations across different regions in North America and Ukraine. In Section 4, we discuss possible directions for future work. We conclude with final remarks in Section 5.

2. Materials and Methods

2.1. Problem Statement and Notation

The problem of downscaling low-resolution data is as follows. Given a low-resolution state

x

, a target spatial enhancement

s

for each spatial dimension (

s^{2}

overall), and a target temporal enhancement

t

, we want to find a function

G_{s, t}

that will take

x

, enhance the spatial dimensions by a factor of

s

and the temporal dimension by a factor of

t

, and give us a spatiotemporally enhanced high-resolution state

x^{'}

. Under some simplifying assumptions, we can decompose

G_{s, t} (x)

into separate functions for spatial and temporal enhancement,

G_{1, t} (G_{s, 1} (x))

. We can further decompose these functions into intermediate enhancement functions if the products of intermediate enhancement factors are equal to

s

or

t

. The terms introduced here, along with other frequently used terms, are summarized in Table 1.

2.2. Data Description

ERA5: We downloaded ERA5 [49] for 2007–2013 to train the first enhancement step model. ERA5 is an atmospheric reanalysis dataset that is an optimal combination of observations from various measurement sources and the output of a numerical model using a Bayesian estimation process called data assimilation [50]. ERA5 consists of hourly estimates of several atmospheric variables at a latitude and longitude resolution of 0.25° (~30 km at the equator) from the surface of the earth to roughly 100 km altitude from 1979 to the present day.

As our focus is to generate high-resolution wind resource data, we selected variables from ERA5 close to the surface. We also selected variables that would encourage accuracy during extreme events and over different types of complex terrain. Good model generalization also requires learning the relationships between the low-resolution climate representation and the high-resolution outputs. Prior to training, ERA5 data were regridded to match the 15-times spatially coarsened WTK grid, and ERA5 wind components were bias-corrected to the WTK so that the 2007–2013 monthly means and standard deviations matched those of WTK. This ensured that training was not influenced by bias between low- and high-resolution data, and we applied separate bias correction prior to inference. The ERA5 configuration is summarized in Table 2. The complete set of training features used is listed in Table 3.

WTK: WTK is high-resolution (2 km, 5 min) wind data that covers Canada, the United States, and Mexico from 2007 through 2013. We can, in theory, use any ERA-based downscaled data product as high-resolution target data. We selected the National Renewable Energy Laboratory (NREL)’s WTK [10] because of its extensive use by U.S. stakeholders for wind resource and energy production analysis and because WTK has demonstrated good performance across various performance measures. In particular, WTK shows good agreement with observations for diurnal and seasonal correlation coefficients, mean absolute error (MAE), mean wind speeds, and absolute bias [51]. The WTK was produced with Weather Research and Forecasting (WRF) version 3.4.1 using ERA-Interim, the predecessor to ERA5, for initialization and boundary conditions. WTK data include wind speed and wind direction at 10, 40, 80, 100, 120, 160, and 200 m above ground level. The wind speed and wind direction data served as the high-resolution targets for our downscaling framework. Coarsened WTK data are also used as the low-resolution data for

G_{5, 1}

and

G_{1, 12}

(ERA5 data are only needed for input to the first enhancement step). The WTK configuration is summarized in Table 2.

Vortex Wind Data from the International Renewable Energy Agency Global Atlas: We download long-term monthly wind speed means from the International Renewable Energy Agency Global Atlas (data provided by Vortex [12]) over Ukraine and the contiguous United States (CONUS) to use for bias correction prior to inference. Vortex via the International Renewable Energy Agency Global Atlas provides high-resolution wind speed data globally [52] and easily downloadable 20-year climatological monthly means. We bias-corrected ERA5 data over Ukraine by matching the corrected ERA5 monthly means over 2000–2020 with the Vortex monthly means. We bias-corrected ERA5 data over CONUS prior to inference used for validation against observational data. Bias correction is described in Section 2.5.

Meteorological Assimilation Data Ingest System (MADIS): MADIS is a comprehensive collection of meteorological observations covering the entire globe [53]. It is maintained by the National Oceanic and Atmospheric Administration and is primarily used for weather forecasting, research, and various atmospheric studies. MADIS integrates data from various sources, including federal agencies, research institutions, and commercial entities, ensuring broad coverage and diversity of observations. The dataset undergoes quality control procedures to identify and correct errors, ensuring high-quality data for analysis and modeling purposes.

We used the MADIS API to download a full year of surface observations of wind speed and direction for 40 locations within the Ukraine downscaling domain (Figure 1). The observations for each location were mapped onto an hourly temporal grid using a simple average for time steps containing multiple observations. We removed any locations missing observational data for more than half of the time steps. The resulting validation data consists of 8784 hourly observations of wind speed for 2020 at 10 m height for 37 locations across the modeling domain.

Second Wind Forecast Improvement Project (WFIP2): We used WFIP2 observation data to assess model performance over CONUS. WFIP2 is a U.S. Department of Energy and National Oceanic and Atmospheric Administration-funded effort to improve weather prediction forecast skills for turbine-height winds in regions with complex terrain. A core component of WFIP2 was an 18-month field campaign that took place in the U.S. Pacific Northwest between October 2015 and March 2017 [54].

Ukraine Wind Farm Observations: We obtained wind measurement data performed by Deutsche WindGuard Consulting GmbH (Varel, Germany), GEIO-NET Umweltconsulting GmbH (Hannover, Germany), and ENERPARK Inżyniera Wiatrowa Sp. z o. o (Warsaw, Poland) for planned wind farm sites throughout Ukraine. Due to security concerns, we refer to the five wind farm sites as Wind Farm A–E rather than their actual locations. The wind speed measurements for Wind Farm A were conducted using a 100 m high met mast for wind speeds at approximately 100 m and 80 m. Sets of measurements for Wind Farms B and C were performed using a 120 m high met mast, yielding wind speed measurements at approximately 120 m, 100 m, 75 m, and 50 m. The wind speed measurements for Wind Farm D were conducted using a 120 m high met mast for wind speeds at approximately 120 m, 116 m, 100 m, 80 m, and 60 m. Measurements for Wind Farm E were collected using an 82 m high met mast and extrapolated to the turbine hub height of 94 m using wind shear exponents calculated from mast data. This collection of observational wind speeds was used to validate Sup3rWind data across Ukraine (Section 3.2). The Wind Farm observation heights are listed in Table 4.

Table 2. Summary of ERA5 and wind toolkit configurations.

	ERA5	Wind Toolkit
Output Variables	Numerous meteorological variables at the surface and 137 pressure levels up to around 80 km. Includes wind speed, wind direction, temperature, pressure, relative humidity, heat fluxes, precipitation, cape, etc. For a complete list, see [55].	Wind speed, wind direction, air temperature, and pressure at 15 m, 47 m, 80 m, 112 m, 145 m, and 177 m. Interpolated to 10 m, 40 m, 80 m, 100 m, 120 m, 160 m, and 200 m. Surface pressure and relative humidity at 2 m.
Resolution	30 km, hourly.	2 km, 5 min.
Boundary Conditions/Inputs	4D-Variational Data Assimilation from satellites, surface observations, and other sources. Atmospheric state that best fits model forecast and observations [56]. Assimilation performed with 12 h windows.	6-hourly scale-selective grid nudging towards ERA-Interim. GTOPO30 terrain data.

2.3. Model Description

For this work, we trained a total of three super-resolution models, described in Table 3. The first step,

G_{3, 1}

, performed 3-times spatial enhancement; the second step,

G_{5, 1},

performed 5-times spatial enhancement; and the third step,

G_{1, 12},

performed 12-times temporal enhancement. When these steps were applied successively to a low-resolution state

x

,

G_{1, 12} (G_{5, 1} (G_{3, 1} (x)))

, they performed a total of 15-times spatial enhancement and 12-times temporal enhancement,

G_{15, 12}

. Training and inference flow are diagrammed in Figure 2. These models mostly follow the approach in [40], with a few important distinctions: (1) we used a modified content loss function to encourage model accuracy across extreme values. This loss, shown in Equation (1), includes mean absolute error terms for the minimums and maximums across both space and time; (2) we incorporated mid-network high-resolution topography injection for a more accurate representation of wind flow over fine-scale complex terrain; and (3) we trained on distinct low-resolution and high-resolution datasets, as opposed to using coarsened high-resolution data as the low-resolution GAN input. The topography injection differs from standard model input in that all standard model inputs are low-resolution. As this low-resolution data goes through the model, it is eventually enhanced by up-sampling layers in the middle of the model network. Right after this up-sampling, high-resolution topography data can be combined with the up-sampled data. This high-resolution topography is elevation above sea level data sourced from GTOPO30 [57].

\begin{matrix} L (x, y_{t r u e}) = & m a e (y_{t r u e}, y_{s y n t h}) + m a e ({{m a x}_{t} (y}_{t r u e}), m a x_{t} (y_{s y n t h})) \\ + m a e ({{m i n}_{t} (y}_{t r u e}), {m i n}_{t} (y_{s y n t h})) \\ + m a e ({m a x_{s} (y}_{t r u e}), m a x_{s} (y_{s y n t h})) \\ + m a e ({{m i n}_{s} (y}_{t r u e}), {m i n}_{s} (y_{s y n t h})) \end{matrix}

(1)

The loss function used to encourage accuracy across extreme values,

m a e

, is mean absolute value,

y_{t r u e}

is the true high-resolution data,

y_{s y n t h}

is the high-resolution model output,

m a x_{t}

is the maximum across all time, and

m a x_{s}

is the maximum across all space.

An extensive codebase has been developed to implement easily customizable GAN architectures and handle data extraction, batching, and model training to distribute the forward passes of input data through the GAN across multiple nodes. This codebase is released as the super-resolution for renewable resource data (sup3r) package and is installable through the python package index [58]. Sup3r version 0.1.2 was used for this work.

Table 3. 30 km, hourly to 2 km, 5 min model steps.

Model Step	Enhancement	Training Features	Input Data Source	Output Target Data Source	Training Time
1. $G_{3, 1}$	Three-times spatial	U/V wind vector components at 10, 100, and 200 m, topography, cape, k index, surface pressure, instantaneous moisture flux, surface temperature, surface latent heat flux, 2 m dewpoint temperature, friction velocity	ERA5 (30 km, hourly)	Coarsened WTK (10 km, hourly)	240 compute node hours, 2500 epochs
2. $G_{5, 1}$	Five-times spatial	U/V wind vector components at 10, 40, 80, 100, 120, 160, and 200 m + topography	Coarsened WTK (10 km, hourly)	Subsampled WTK (2 km, hourly)	50 compute node hours, 7000 epochs
3. $G_{1, 12}$	Twelve-times temporal	U/V wind vector components at 10, 40, 80, 100, 120, 160, and 200 m + topography	Subsampled WTK (2 km, hourly)	Original WTK (2 km, 5 min)	200 compute node hours, 10,000 epochs

2.4. Model Training

The first step generator,

G_{3, 1}

, was trained with ERA5 data as low-resolution input and WTK coarsened to 10 km hourly as the high-resolution target for 2007–2009 and 2011–2013. We kept 2010 as a holdout year for validation. The WTK data had a nominal resolution of 2 km, 5 min, so high-resolution targets sampled from these data were coarsened five times spatially and subsampled twelve times temporally for the first model step. Both the second- and third-step models were trained on coarsened WTK data, as in [40]. The input for the second step,

G_{5, 1,}

is 10 km, hourly WTK (five times spatially coarsened and twelve times subsampled in time), and the high-resolution target for

G_{5, 1}

was 2 km, hourly WTK (subsampled 12 times temporally). The input for the third step,

G_{1, 12},

is WTK subsampled 12 times temporally, and the high-resolution target is the original WTK. These steps are summarized in Table 3.

For each model step, training observations were sampled from the domains shown in Figure 3. Training was performed on the Eagle high-performance computing system at NREL using two NVIDIA V100 GPUs (Taiwan Semiconductor Manufacturing Company, Hsinchu Science Park, Taiwan). Each training epoch consisted of 100 batches, with 64 observations per batch. Batches were built by randomly sampling spatiotemporal chunks from the six training years and two different training domains. Each spatiotemporal chunk was 15 × 15 × 5 low-resolution pixels. For the third step, generator

G_{1, 12}

, random sampling along the time dimension was weighted by the time-specific loss. For instance, if the model was performing worst on summer observations during a given training epoch, more observations were selected from the summer for the next epoch. This data-centric training approach ensures that the model performs well over a wide range of season-specific weather conditions.

2.5. Bias Correction

We performed bias correction on the ERA5 wind speed input data prior to training

G_{3, 1}

and prior to inference. It is well known that ERA5 frequently underestimates wind speeds, especially in complex terrain [59,60,61]. While the GAN models could be trained on biased data to learn bias correction, we were concerned about this not generalizing well to new geographic regions. Thus, we opted for region-specific bias correction on low-resolution input as a preprocessing step. Prior to training, we computed bias correction factors that shifted the 2007–2013 means and standard deviations of the ERA5 to match those of coarsened WTK data. For each ERA5 grid point (

i, j

) and wind speed hub height (10 m, 100 m, or 200 m), monthly (

m

) means (µ) and standard deviations (σ) were computed for 2007–2013 for both ERA5 and coarsened WTK. ERA5 was then bias-corrected for each grid point, hub height, and month as follows:

{E R A 5}_{i j m} \to [{E R A 5}_{i j m} - µ_{i j m}] \frac{{\hat{σ}}_{i j m}}{σ_{i j m}} + {\hat{µ}}_{i j m}

(2)

where

μ

,

σ

,

\hat{µ}

, and

\hat{σ}

are the means and standard deviations for ERA5 and the coarsened WTK, respectively.

To perform bias correction prior to inference, we used monthly mean wind speeds provided by Vortex, described in Section 2.2. The global availability of the Vortex data allowed us to use it for both CONUS validation and the Ukraine data production. These means were for 2001–2020 and available only as high as 160 m. Standard deviations were not available. We linearly extrapolated to 200 m, then computed multiplicative correction factors using the means:

{E R A 5}_{i j m} \to {E R A 5}_{i j m} \frac{{\hat{µ}}_{i j m}}{µ_{i j m}}

(3)

where

\hat{µ}

and µ are the 2001–2020 means for Vortex and ERA5, respectively.

2.6. Inference

We downscaled ERA5 over Ukraine, Moldova, and some of Romania (Figure 1) for 2000–2023, from 30 km hourly to 2 km, 5 min resolution. With models trained only on the CONUS regions shown in Figure 3, this is a significant geographic generalization. Prior to inference, the ERA5 input data were bias-corrected using long-term monthly means from Vortex, described in Section 2.2. Inference is a memory-bound process, so we split the input data into chunks and parallelized the forward pass on these chunks independently. The full low-resolution domain was first chunked across the time dimension, and each chunk,

x,

passed through

G_{5, 1} (G_{3, 1} (x))

to perform 15-times spatial enhancement. Chunks were made to overlap in time to enable stitching without seams. Spatially enhanced output was chunked across both space and time, with chunks (

x^{'}

) overlapping across all dimensions and then passed through

G_{1, 12} (x^{'})

to perform the final 12-times temporal enhancement. A year of input for the first two models consisted of 300 chunks. The spatially enhanced input to the final model then consisted of 65,000 spatiotemporal chunks. Forward passes were distributed over 30 compute nodes on the NREL Eagle high-performance computer, and full spatiotemporal enhancement for a year was completed in 40 node hours using 36 CPUs per compute node for inference. This is more than 85 times faster than the dynamical downscaling of ERA5 with WRF to the same 2 km, 5 min resolution based on internal testing with WRF on the same hardware. When using GPUs for inference, the speedup can be as much as 500 times.

Table 4. Wind farm data details.

Location	Time Period	Heights
Wind Farm A	January 2012–December 2015	100 m, 80 m
Wind Farm B	September 2019–September 2020	120 m, 100 m, 75 m, 50 m
Wind Farm C	November 2020–January 2022	120 m, 100 m, 75 m, 50 m
Wind Farm D	November 2021–September 2023	120 m, 116 m, 100 m, 80 m, 60 m
Wind Farm E	January 2022–December 2022	94 m

3. Results

Time and resource limitations prevented extensive hyperparameter search and cross-validation. We trained a few models, with different adversarial weights and selected the one that performed the best on the 2010 WFIP2 observations within the validation regions shown in Figure 3. Model performance was also assessed on 2010 WTK data within these regions. The year 2010 was not included in the training data, and these validation regions were outside of the training domain, so these three regions and time periods enabled spatiotemporal cross-validation. This validation was followed by the generation of a high-resolution 24-year wind data product over Ukraine, Moldova, and eastern Romania. We assessed the performance of these data over Ukraine by comparing them against wind farm and MADIS observational data.

Performance against observations was evaluated with coefficients of determination (

R^{2}

), Pearson correlation coefficients, mean bias error (MBE), MAE, KS-test statistic, diurnal cycle, wind speed variability distribution, bias distribution, and mean relative quantile error (MRQE).

R^{2}

is defined as the square of the Pearson correlation coefficient, with a value of one indicating that the dependent variable is completely determined by the independent variable and a value of zero indicating the opposite. The KS-test statistic measures the maximum difference between the predicted and empirical CDFs, with a value of zero indicating perfect agreement. The diurnal cycle is the average pattern that occurs over the course of an entire day. The wind speed variability distribution is the probability distribution of the change in wind speed over time. The bias distribution measures the probability of under- or overestimation of wind speed. The MRQE is defined as follows:

MRQE = \frac{1}{D} \sum_{i = 1}^{D} \frac{{\hat{Q}}_{i} - Q_{i}}{Q_{i}}

(4)

where

{\hat{Q}}_{i}

is the

i

-th quantile of the model output, and

Q_{i}

is the

i

-th quantile of the observation data. We used the MRQE to quantify model performance in resolving extreme events. Negative values indicate underestimation of extremes, and positive values represent overestimation. We evaluated MRQE with 20 logarithmically spaced quantile bins (0.8, 0.999). The MRQE is a particularly important performance measure because accurately capturing long tails is essential for downstream applications of renewable resource data and extreme event estimation. This is also why we compared wind speed variability distributions and KS-test statistics. The wind speed variability distribution is the probability distribution for the wind speed time derivative. The KS-test statistic quantifies the maximum disagreement between cumulative probability distributions of wind speeds.

We estimate the p-values for performance measure differences between Sup3rWind and baselines (ERA5 and/or WTK) by bootstrapping distributions for these differences over 1000 samples. For each observation site, we compute the original performance measure difference and the distribution of this performance measure difference by resampling the time series data 1000 times. The proportion of values in this distribution that exceeds the original performance measure difference gives the p-value estimate. We additionally compute the p-value for time series differences between Sup3rWind and baselines using the Wilcoxon signed-rank test.

3.1. CONUS Validation

Figure 4 contains calculated statistical and physical quantities for the various validation regions. We see strong agreement between Sup3rWind and WTK; specifically, the long tails of the wind speed gradient and the wind speed variability distribution for the WTK data are well captured by Sup3rWind. Further, the inertial range (i.e., high

k

) region in the turbulent kinetic energy is also recovered by Sup3rWind. In Figure 5, Table 5, and Table 6, we compare Sup3rWind with WFIP2 observations across the three CONUS validation regions. The WFIP2 measurement heights vary by location but are between 20 m and 50 m above ground. Coefficients of determination (

R^{2}

), MAE, and MBE are shown above each scatterplot. For each region, we see excellent agreement between Sup3rWind and WTK and a significant improvement over ERA5. Because we used WTK data for training, we were ultimately limited to the accuracy of this ground truth. There is still room for improvement against observations. We discuss this more in the section Future Research Directions.

3.2. Ukraine, Moldova, and Eastern Romania Performance

We generated 24 total years of wind data over Ukraine, Moldova, and eastern Romania. Using these data for power system modeling requires high resolution, extensive validation, a long-term data record, and physical consistency across a wide range of conditions [2]. We performed extensive validation and demonstration of the accuracy of Sup3rWind with comparisons against data from five wind farm sites and 37 MADIS sites. Some details for the wind farm sites are shown in Table 4. MADIS sites are all 10 m above ground level, and the wind farm data are distributed between 50 m to 130 m above ground level. Performance across MADIS and wind farm sites is comparable to performance across CONUS validation regions.

3.2.1. Wind Farm Site Comparisons

In Figure 6 and Figure 7, we show performance against wind farm observations, with each location averaged over all available hub heights. In Figure 6, we see improved MBEs and MRQEs, as well as improvement in KS-test statistics, over ERA5. MBE is within ±1 m per second for each wind farm location. Figure 7 shows good agreement with observation for wind speed variability and correlations. We see improvement in MAE for diurnal cycles over ERA5 at some sites, although there is some noise introduced in these cycles, and one site is significantly overestimated. Values of performance measures averaged across all wind farm observations are shown in Table 7. p-Values for these performance measures are shown in Table 8.

While CONUS validation showed substantial improvement over ERA5 for Sup3rWind, we do not see the same relative performance across Ukraine. Statistics for Sup3rWind in Ukraine fall in a similar range as for CONUS, while ERA5 performs significantly better. Sup3rWind provides the most improvement here on spatiotemporal variability, relative quantile errors, and KS-test statistics. The increased performance of ERA5 is likely due to the less complex terrain. In the CONUS validation, we saw the best performance of ERA5 in the Midwest, the flattest region. We also saw the best correlations between Sup3rWind and Wind Farm E, the site closest to the Carpathian Mountains.

3.2.2. MADIS Site Comparisons

We additionally look at the performance of Sup3rWind across multiple MADIS sites. MADIS measurements are near-surface, approximately 10 m above ground level. It is important to note that near-surface performance can differ significantly from performance at typical wind turbine height. To summarize performance across many MADIS sites, we computed statistics on regional averages. Each of the four quadrants of the spatial domain was used to compute northeast, southeast, southwest, and northwest regional averages. Performance relative to ERA5 for these regions is shown in Figure 8 and Figure 9. In Figure 8, we see excellent agreement with observations, with high correlations and MBE within ±1 m per second for all regions. We also see better performance in capturing extreme values, as measured with MRQE. Values averaged over all MADIS sites are shown in Table 9. Statistics averaged across all MADIS sites. The associated p-values are shown in Table 10. In Figure 9, we see improved wind speed variability distributions and diurnal cycles. We again see good performance for ERA5 across the region. The most favorable comparison between Sup3rWind and ERA5 for correlations is seen in the southwest, where the terrain is most complex.

4. Discussion

The results shown in this paper support the use of the wind data created by GAN-based downscaling. Downscaling ERA5 data with GANs is shown to produce physically realistic wind across space and time (Figure 4) and historically accurate profiles when compared to ground measurements (Figure 5, Figure 7 and Figure 9) in nearly all out-of-sample validation conditions. This approach is shown to generalize well to different geographic regions, with training data selected only from CONUS and inference performed over Eastern Europe. Conditioning model output on high-resolution terrain data, a broad set of low-resolution features, and region-specific bias correction should enable the model to generalize to arbitrary regions. While a year-long 2 km, 5 min WRF simulation for CONUS is estimated to cost 50,000 compute node hours on the NREL high-performance computing hardware, our GAN framework can create a year of equivalent high-resolution data in 585 compute node hours using CPUs for inference. This shows a more than 85-times speedup for GAN-based downscaling over dynamical downscaling with WRF. The speedup can be as much as 500 times when using GPUs for inference.

We see good agreement between Sup3rWind, WTK, and observations across a broad suite of performance measures. Through the probability distributions for the temporal derivative and spatial gradient of wind speed and the turbulent kinetic energy spectrum, we see that Sup3rWind achieves excellent fidelity for the underlying physics of the high-resolution target data. Through site-specific coefficients of determination, absolute errors, and bias errors, we see high fidelity between Sup3rWind and observations across diverse regions with complex terrain.

Our efforts culminated in the production of a 24-year wind data record, with 2 km, 5 min spatiotemporal resolution, over Ukraine, Moldova, and eastern Romania. These data were extensively validated using observational data from over 40 different locations, spanning over 9 years, covering heights from 10 m to 120 m above the ground. The performance for Sup3rWind over Ukraine was comparable to that over the CONUS validation regions, showing low mean errors and high correlations. Sup3rWind agreed well with ERA5 while significantly improving the representation of wind speed variability and accuracy of extremes. Diurnal cycles were mostly improved over ERA5, while some noise was introduced in these cycles at wind farm locations. MBE also improved on average, although there is room for improvement on a site-wise basis. All data, models, and software produced through this work are publicly released at no cost, described more in Data Availability Statement.

Future Research Directions

This work poses a variety of additional research directions to pursue. While the model performance shown here is impressive, especially considering the limited training data and training time, we would like to improve accuracy even further. In the future, we would like to conduct a thorough architecture optimization to reduce network complexity and further speed up inference. Additionally, the models presented in this work were trained using only two GPUs and 6 years of training data. This is extremely limited by industry standards, where weather forecasting models are frequently trained on 30+ years of data and on 100+ GPUs [7,40]. Increasing the amount of training data and computational resources could further improve accuracy. Within the confines of the established framework, we are definitionally limited to the accuracy of the high-resolution target dataset. To combat this, we would like to perform a more extensive feature importance analysis on the broad set of ERA5 variables available for training and to explore physics-based loss terms derived from the Navier–Stokes equations. We can leverage some previous work on ERA5 feature importance [62]. Another exciting path for future work would focus on incorporating available observational data as part of either training or post-training data assimilation.

5. Conclusions

In this work, we have shown that by training a GAN model using ERA5 input data and WTK target data, we achieved results comparable in historical accuracy and spatiotemporal variability to conventional dynamical downscaling. Additionally, we extended the spatial enhancement GAN framework described in [40] to include temporal enhancement, incorporate a modified content loss function to encourage the accuracy of extreme values, and include a mid-network high-resolution topography injection that improved the high-resolution resource assessment in complex terrain. We demonstrated the use and performance of this method through comparisons with high-resolution target data and observational data for CONUS regions in the Pacific Northwest, Midwest, and Northeast. We downscaled ERA5 with this approach to produce a 24-year, high-resolution, high-accuracy, extensively validated wind dataset over Ukraine, Moldova, and eastern Romania. The ERA5 data were enhanced by 15 times along each spatial dimension and 12 times along the temporal dimension, going from 30 km hourly to 2 km, 5 min resolution. These data are comparable to state-of-the-art wind resource datasets developed with physics-based models and are publicly available through multiple easy-access options. We saw strong fidelity across performance measures and observation comparisons while reducing computation expense by two orders of magnitude. Python code for feature engineering, data handling, model training, and inference is also publicly available [58].

Author Contributions

B.N.B. developed software, developed methods, trained models, produced data, and wrote the paper. G.B. developed software, advised on methods, and wrote the paper. P.P. developed software and wrote the paper. A.G. and R.N.K. advised on methods and wrote the paper. G.M. and I.C. advised on and wrote the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This work was authored in part by the National Renewable Energy Laboratory (NREL), operated by Alliance for Sustainable Energy, LLC, for the U.S. Department of Energy (DOE) under Contract No. DE-AC36-08GO28308. Funding provided by the United States Agency for International Development (USAID) under Contract No. IAG-17-2050. The views expressed in this report do not necessarily represent the views of the DOE or the U.S. Government, or any agency thereof, including USAID. The publisher, by accepting the article for publication, acknowledges that the U.S. Government retains a nonexclusive, paid-up, irrevocable, worldwide license to publish or reproduce the published form of this work or allow others to do so for U.S. Government purposes.

Data Availability Statement

The software developed for feature engineering, data handling, training, and inference is available on GitHub at https://github.com/NREL/sup3r (accessed on 14 July 2025). Sup3r version 0.1.2 was specifically used for this work. The full environment yaml file and the configuration files used to run inference are available at https://github.com/NREL/sup3r/tree/main/examples/sup3rwind (accessed on 14 July 2025). Training data for this work was obtained through the NREL WIND Toolkit, which is available for download from https://www.nrel.gov/grid/wind-toolkit.html (accessed on 14 October 2022), and ERA5, which is available from https://www.ecmwf.int/en/forecasts/dataset/ecmwf-reanalysis-v5 (accessed on 10 June 2024). The sup3r software also provides utilities for downloading ERA5 data and performing pre-processing. The final data over Ukraine, Moldova, and Romania are easily accessible through NREL’s Renewable Energy Data Explorer (www.re-explorer.org (accessed on 14 July 2025)). Additionally, NREL provides several API options where users can download the data with Python or other programming languages (more information can be found at https://developer.nrel.gov/docs/wind/wind-toolkit/sup3rwind-ukraine-download (accessed on 14 July 2025)). The full dataset is available for download directly via the Open Energy Data Initiative on Amazon Web Services Public Datasets at Directly via OEDI on AWS Public Datasets: nrel-pds-wtk/sup3rwind/ukraine/v1.0.0/5 min and nrel-pds-wtk/sup3rwind/ukraine/v1.0.0/60 min.

Acknowledgments

The authors would like to thank Caroline Draxl, Evan Rosenlieb, Guilherme Pimenta Castelao, and Jaemo Yang for their thoughtful reviews. The authors would also like to thank Reid Olson and Nicole Taverna for making the super-resolution for renewable energy resource data with wind from reanalysis data (Sup3rWind) and models available via the Open Energy Data Initiative.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Holttinen, H.; Kiviluoma, J.; Levy, T.; Jun, L.; Eriksen, P.B.; Orths, A.; Cutululis, N.; Silva, V.; Neau, E.; Dobschinski, J.; et al. Design and Operation of Power Systems with Large Amounts of Wind Power: Final Summary Report; IEA WIND Task 25, Phase four 2015–2017, in VTT Technology; VTT Technical Research Centre of Finland: Espoo, Finland, 2019. [Google Scholar] [CrossRef]
Sharp, J.; Milligan, M.; Bloomfield, H.C. Weather Dataset Needs for Planning and Analyzing Modern Power Systems. October 2023. Available online: https://www.esig.energy/wp-content/uploads/2023/10/ESIG-Weather-Datasets-full-report-2023b.pdf (accessed on 14 July 2025).
Dong, Z.; Wong, K.P.; Meng, K.; Luo, F.; Yao, F.; Zhao, J. Wind power impact on system operations and planning. In Proceedings of the IEEE PES General Meeting, Minneapolis, MN, USA, 25–29 July 2010; IEEE: New York, NY, USA, 2010; pp. 1–5. Available online: https://ieeexplore.ieee.org/abstract/document/5590222/ (accessed on 27 April 2024).
Clifton, A.; Hodge, B.; Draxl, C.; Badger, J.; Habte, A. Wind and solar resource data sets. WIREs Energy Environ. 2018, 7, e276. [Google Scholar] [CrossRef]
Murphy, J. An Evaluation of Statistical and Dynamical Techniques for Downscaling Local Climate. J. Clim. 1999, 12, 2256–2284. [Google Scholar] [CrossRef]
Martinez-García, F.P.; Contreras-de-Villar, A.; Muñoz-Perez, J.J. Review of Wind Models at a Local Scale: Advantages and Disadvantages. J. Mar. Sci. Eng. 2021, 9, 318. [Google Scholar] [CrossRef]
Benton, B.N.; Alessi, M.J.; Herrera, D.A.; Li, X.; Carrillo, C.M.; Ault, T.R. Minor impacts of major volcanic eruptions on hurricanes in dynamically-downscaled last millennium simulations. Clim. Dyn. 2022, 59, 1597–1615. [Google Scholar] [CrossRef]
Knutson, T.R.; Sirutis, J.J.; Bender, M.A.; Tuleya, R.E. Dynamical Downscaling Projections of Late 21st Century US Landfalling Hurricane Activity. Clim. Change 2021, in press. [Google Scholar]
Rockel, B.; Castro, C.L.; Pielke Sr, R.A.; von Storch, H.; Leoncini, G. Dynamical downscaling: Assessment of model system dependent retained and added variability for two different regional climate models. J. Geophys. Res. Atmos. 2008, 113, D21107. [Google Scholar] [CrossRef]
Draxl, C.; Clifton, A.; Hodge, B.M.; McCaa, J. The Wind Integration National Dataset (WIND) Toolkit. Appl. Energy 2015, 151, 355–366. [Google Scholar] [CrossRef]
Draxl, C.; Wang, J.; Sheridan, L.; Jung, C.; Bodini, N.; Buckhold, S.; Aghili, C.; Peco, K.; Kotamarthi, R.; Kumler, A.; et al. WTK-LED: The WIND Toolkit Long-Term Ensemble Dataset; National Renewable Energy Laboratory: Golden, CO, USA, 2024. [Google Scholar] [CrossRef]
VORTEX FdC, S.L. Vortex ERA5 Downscaling: Validation Results. Available online: https://www.vortexfdc.com/assets/docs/validation_ERA5.pdf (accessed on 6 April 2024).
Winstral, A.; Jonas, T.; Helbig, N. Statistical Downscaling of Gridded Wind Speed Data Using Local Topography. J. Hydrometeorol. 2017, 18, 335–348. [Google Scholar] [CrossRef]
González-Aparicio, I.; Monforti, F.; Volker, P.; Zucker, A.; Careri, F.; Huld, T.; Badger, J. Simulating European wind power generation applying statistical downscaling to reanalysis data. Appl. Energy 2017, 199, 155–168. [Google Scholar] [CrossRef]
Salameh, T.; Drobinski, P.; Vrac, M.; Naveau, P. Statistical downscaling of near-surface wind over complex terrain in southern France. Meteorol. Atmos. Phys. 2009, 103, 253–265. [Google Scholar] [CrossRef]
Onwukwe, C.; Jackson, P.L. Meteorological Downscaling with WRF Model, Version 4.0, and Comparative Evaluation of Planetary Boundary Layer Schemes over a Complex Coastal Airshed. J. Appl. Meteorol. Climatol. 2020, 59, 1295–1319. [Google Scholar] [CrossRef]
Zhou, E.; Mai, T. Electrification Futures Study: Operational Analysis of U.S. Power Systems with Increased Electrification and Demand-Side Flexibility; National Renewable Energy Laboratory (NREL): Golden, CO, USA, 2021. [Google Scholar] [CrossRef]
Michalakes, J.; Hacker, J.; Loft, R.; McCracken, M.O.; Snavely, A.; Wright, N.J.; Spelce, T.; Gorda, B.; Walkup, R. WRF nature run. J. Phys. Conf. Ser. 2008, 125, 012022. [Google Scholar] [CrossRef]
Pierce, D.W.; Cayan, D.R.; Thrasher, B.L. Statistical Downscaling Using Localized Constructed Analogs (LOCA). J. Hydrometeorol. 2014, 15, 2558–2585. [Google Scholar] [CrossRef]
Wood, A.W.; Leung, L.R.; Sridhar, V.; Lettenmaier, D.P. Hydrologic Implications of Dynamical and Statistical Approaches to Downscaling Climate Model Outputs. Clim. Change 2004, 62, 189–216. [Google Scholar] [CrossRef]
Kaczmarska, J.; Isham, V.; Onof, C. Point process models for fine-resolution rainfall. Hydrol. Sci. J. 2014, 59, 1972–1991. [Google Scholar] [CrossRef]
Bi, K.; Xie, L.; Zhang, H.; Chen, X.; Gu, X.; Tian, Q. Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast. arXiv 2022, arXiv:2211.02556. [Google Scholar] [CrossRef]
Lam, R.; Sanchez-Gonzalez, A.; Willson, M.; Wirnsberger, P.; Fortunato, M.; Alet, F.; Ravuri, S.; Ewalds, T.; Eaton-Rosen, Z.; Hu, W.; et al. Learning skillful medium-range global weather forecasting. Science 2023, 382, 1416–1421. [Google Scholar] [CrossRef] [PubMed]
Nguyen, T.; Brandstetter, J.; Kapoor, A.; Gupta, J.K.; Grover, A. ClimaX: A foundation model for weather and climate. arXiv 2023. [Google Scholar] [CrossRef]
Pathak, J.; Subramanian, S.; Harrington, P.; Raja, S.; Chattopadhyay, A.; Mardani, M.; Kurth, T.; Hall, D.; Li, Z.; Azizzadenesheli, K.; et al. FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators. arXiv 2022, arXiv:2202.11214. [Google Scholar] [CrossRef]
Morrissey, M. ECMWF Unveils Alpha Version of New ML Model, ECMWF. Available online: https://www.ecmwf.int/en/about/media-centre/aifs-blog/2023/ECMWF-unveils-alpha-version-of-new-ML-model (accessed on 27 March 2024).
Gerges, F.; Boufadel, M.C.; Bou-Zeid, E.; Nassif, H.; Wang, J.T.L. Downscaling daily wind speed with Bayesian deep learning for climate monitoring. Int. J. Data Sci. Anal. 2023, 17, 411–424. [Google Scholar] [CrossRef]
Hu, W.; Scholz, Y.; Yeligeti, M.; von Bremen, L.; Deng, Y. Downscaling ERA5 wind speed data: A machine learning approach considering topographic influences. Environ. Res. Lett. 2023, 18, 094007. [Google Scholar] [CrossRef]
Chen, S.-T.; Yu, P.-S.; Tang, Y.-H. Statistical downscaling of daily precipitation using support vector machines and multivariate analysis. J. Hydrol. 2010, 385, 13–22. [Google Scholar] [CrossRef]
Pang, B.; Yue, J.; Zhao, G.; Xu, Z. Statistical Downscaling of Temperature with the Random Forest Model. Adv. Meteorol. 2017, 2017, e7265178. [Google Scholar] [CrossRef]
Sachindra, D.A.; Ahmed, K.; Rashid, M.M.; Shahid, S.; Perera, B.J.C. Statistical downscaling of precipitation using machine learning techniques. Atmos. Res. 2018, 212, 240–258. [Google Scholar] [CrossRef]
Sekiyama, T.T.; Hayashi, S.; Kaneko, R.; Fukui, K. Surrogate Downscaling of Mesoscale Wind Fields Using Ensemble Superresolution Convolutional Neural Networks. Artif. Intell. Earth Syst. 2023, 2, 230007. [Google Scholar] [CrossRef]
Xu, R.; Chen, N.; Chen, Y.; Chen, Z. Downscaling and Projection of Multi-CMIP5 Precipitation Using Machine Learning Methods in the Upper Han River Basin. Adv. Meteorol. 2020, 2020, e8680436. [Google Scholar] [CrossRef]
Hobeichi, S.; Nishant, N.; Shao, Y.; Abramowitz, G.; Pitman, A.; Sherwood, S.; Bishop, C.; Green, S. Using Machine Learning to Cut the Cost of Dynamical Downscaling. Earth’s Future 2023, 11, e2022EF003291. [Google Scholar] [CrossRef]
Maclaurin, G.; Grue, N.; Lopez, A.; Heimiller, D.; Rossol, M.; Buster, G.; Williams, T. The Renewable Energy Potential (reV) Model: A Geospatial Platform for Technical Potential and Supply Curve Modeling; NREL: Golden, CO, USA, 2021. [Google Scholar]
Kim, J.; Lee, J.K.; Lee, K.M. Deeply-Recursive Convolutional Network for Image Super-Resolution. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 1637–1645. [Google Scholar] [CrossRef]
Tran, D.T.; Robinson, H.; Rasheed, A.; San, O.; Tabib, M.; Kvamsdal, T. GANs enabled super-resolution reconstruction of wind field. J. Phys. Conf. Ser. 2020, 1669, 012029. [Google Scholar] [CrossRef]
Passarella, L.S.; Mahajan, S.; Pal, A.; Norman, M.R. Reconstructing High Resolution ESM Data Through a Novel Fast Super Resolution Convolutional Neural Network (FSRCNN). Geophys. Res. Lett. 2022, 49, e2021GL097571. [Google Scholar] [CrossRef]
Hu, X.; Naiel, M.A.; Wong, A.; Lamm, M.; Fieguth, P. RUNet: A Robust UNet Architecture for Image Super-Resolution. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Long Beach, CA, USA, 16–17 June 2019. [Google Scholar] [CrossRef]
Stengel, K.; Glaws, A.; Hettinger, D.; King, R.N. Adversarial super-resolution of climatological wind and solar data. Proc. Natl. Acad. Sci. USA 2020, 117, 16805–16815. [Google Scholar] [CrossRef] [PubMed]
Chen, H.; Zhang, X.; Liu, Y.; Zeng, Q. Generative Adversarial Networks Capabilities for Super-Resolution Reconstruction of Weather Radar Echo Images. Atmosphere 2019, 10, 555. [Google Scholar] [CrossRef]
Jiang, Y.; Yang, K.; Shao, C.; Zhou, X.; Zhao, L.; Chen, Y.; Wu, H. A downscaling approach for constructing high-resolution precipitation dataset over the Tibetan Plateau from ERA5 reanalysis. Atmos. Res. 2021, 256, 105574. [Google Scholar] [CrossRef]
Ledig, C.; Theis, L.; Huszar, F.; Caballero, J.; Cunningham, A.; Acosta, A.; Aitken, A.; Tejani, A.; Totz, J.; Wang, Z.; et al. Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 105–114. [Google Scholar] [CrossRef]
Yasuda, Y.; Onishi, R.; Matsuda, K. Super-resolution of three-dimensional temperature and velocity for building-resolving urban micrometeorology using physics-guided convolutional neural networks with image inpainting techniques. Build. Environ. 2023, 243, 110613. [Google Scholar] [CrossRef]
Rosencrans, D.; Benton, B.; Buster, G.; Glaws, A.; King, R.; Lundquist, J.; Gu, J.; Maclaurin, G. Wind Resource Data for Southeast Asia Using a Hybrid Numerical Weather Prediction with Machine Learning Super Resolution Approach, NREL/TP-5000-85481, 1984839, MainId:86254; National Renewable Energy Laboratory (NREL): Golden, CO, USA, 2023. [Google Scholar]
Bandura, R.; Romanishyn, A. Striving for Access, Security, and Sustainability: Ukraine’s Transition to a Modern and Decentralized Energy System. 2025. Available online: https://www.csis.org/analysis/striving-access-security-and-sustainability (accessed on 3 July 2025).
UNECE Renewable Energy Status Report 2022, Rana Adib, Executive Director, REN21|UNECE. Available online: https://unece.org/sed/documents/2022/11/presentations/unece-renewable-energy-status-report-2022-rana-adib-executive?utm_source=chatgpt.com (accessed on 3 July 2025).
Prengaman, P. Ukraine Has Seen Success in Building Clean Energy, Which Is Harder for Russia to Destroy, AP News. Available online: https://apnews.com/article/ukraine-clean-renewable-energy-russian-bombing-distributed-1f226213742cc057f9f65208167e6f38 (accessed on 3 July 2025).
Hersbach, H.; Bell, B.; Berrisford, P.; Hirahara, S.; Horányi, A.; Muñoz-Sabater, J.; Nicolas, J.; Peubey, C.; Radu, R.; Schepers, D.; et al. The ERA5 global reanalysis. Quart J. R. Meteoro. Soc. 2020, 146, 1999–2049. [Google Scholar] [CrossRef]
Kalnay, E. Atmospheric Modeling, Data Assimilation and Predictability; Cambridge University Press: Cambridge, UK, 2003. [Google Scholar]
Sheridan, L.M.; Phillips, C.; Orrell, A.C.; Berg, L.K.; Tinnesand, H.; Rai, R.K.; Zisman, S.; Duplyakin, D.; Flaherty, J.E. Validation of wind resource and energy production simulations for small wind turbines in the United States. Wind Energy Sci. 2022, 7, 659–676. [Google Scholar] [CrossRef]
Estima, J.; Fichaux, N.; Menard, L.; Ghedira, H. The global solar and wind atlas: A unique global spatial data infrastructure for all renewable energy. In Proceedings of the 1st ACM SIGSPATIAL International Workshop on MapInteraction, in MapInteract ’13, New York, NY, USA, 5 November 2013; Association for Computing Machinery: New York, NY, USA, 2013; pp. 36–39. [Google Scholar] [CrossRef]
NOAA NCEP Meteorological Assimilation Data Ingest System (MADIS). Available online: https://madis.ncep.noaa.gov/ (accessed on 6 December 2023).
Wilczak, J.M.; Stoelinga, M.; Berg, L.K.; Sharp, J.; Draxl, C.; McCaffrey, K.; Banta, R.M.; Bianco, L.; Djalalova, I.; Lundquist, J.K.; et al. The Second Wind Forecast Improvement Project (WFIP2): Observational Field Campaign. Bull. Am. Meteorol. Soc. 2019, 100, 1701–1723. [Google Scholar] [CrossRef]
Complete ERA5 Global Atmospheric Reanalysis. Available online: https://cds.climate.copernicus.eu/datasets/reanalysis-era5-complete?tab=overview (accessed on 3 July 2025).
Dee, D.P.; Uppala, S.M.; Simmons, A.J.; Berrisford, P.; Poli, P.; Kobayashi, S.; Andrae, U.; Balmaseda, M.A.; Balsamo, G.; Bauer, P.; et al. The ERA-Interim reanalysis: Configuration and performance of the data assimilation system. Q. J. R. Meteorol. Soc. 2011, 137, 553–597. [Google Scholar] [CrossRef]
USGS EROS Archive-Digital Elevation-Global 30 Arc-Second Elevation (GTOPO30)|U.S. Geological Survey. Available online: https://www.usgs.gov/centers/eros/science/usgs-eros-archive-digital-elevation-global-30-arc-second-elevation-gtopo30 (accessed on 7 July 2025).
Benton, B.; Buster, G.; Glaws, A.; King, R. sup3r (Super Resolution for Renewable Resource Data); National Renewable Energy Lab. (NREL): Golden, CO, USA, 2022; Available online: https://zenodo.org/records/10402581 (accessed on 14 July 2025).
Wilczak, J.M.; Akish, E.; Capotondi, A.; Compo, G.P. Evaluation and Bias Correction of the ERA5 Reanalysis over the United States for Wind and Solar Energy Applications. Energies 2024, 17, 1667. [Google Scholar] [CrossRef]
Millstein, D.; Jeong, S.; Ancell, A.; Wiser, R. A database of hourly wind speed and modeled generation for US wind plants based on three meteorological models. Sci. Data 2023, 10, 883. [Google Scholar] [CrossRef] [PubMed]
Potisomporn, P.; Adcock, T.A.A.; Vogel, C.R. Evaluating ERA5 reanalysis predictions of low wind speed events around the UK. Energy Rep. 2023, 10, 4781–4790. [Google Scholar] [CrossRef]
Bouallègue, Z.B.; Cooper, F.; Chantry, M.; Düben, P.; Bechtold, P.; Sandu, I. Statistical Modeling of 2-m Temperature and 10-m Wind Speed Forecast Errors. Mon. Weather. Rev. 2023, 151, 897–911. [Google Scholar] [CrossRef]

Figure 1. Ukraine, Moldova, and Romania downscaling domain. MADIS observation sites are shown in dark red. Wind farm locations are not shown due to security concerns.

Figure 2. GAN training and inference flow. Inference is performed with only the generator.

Figure 3. GAN training and validation domains. Observation locations outside of training domain shown in red.

Figure 4. Wind speed (100 m AGL) distribution comparisons between ERA5, Sup3rWind, and original WTK across all validation regions. Columns from left to right: probability distribution of longitudinal wind speed gradient, probability distribution of wind speed time derivative, and normalized turbulent kinetic energy spectrum. The dashed line in the kinetic energy plots follows the

k^{- 5 / 3}

Kolmogorov scaling law.

Figure 4. Wind speed (100 m AGL) distribution comparisons between ERA5, Sup3rWind, and original WTK across all validation regions. Columns from left to right: probability distribution of longitudinal wind speed gradient, probability distribution of wind speed time derivative, and normalized turbulent kinetic energy spectrum. The dashed line in the kinetic energy plots follows the

k^{- 5 / 3}

Kolmogorov scaling law.

Figure 5. Region-wide comparisons against 2010 observations. Columns from left to right: Sup3rWind vs. observation point cloud, WTK vs. observation point cloud, ERA5 vs. observation point cloud, probability distribution of the wind speed variability, diurnal cycle, and bias distribution. Coefficient of determination (

R^{2}

), MAE, and MBE are shown above each scatterplot. The color scheme in the scatter plots is used to show density. The dashed vertical line in the bias distribution plots is positioned at zero bias.

Figure 5. Region-wide comparisons against 2010 observations. Columns from left to right: Sup3rWind vs. observation point cloud, WTK vs. observation point cloud, ERA5 vs. observation point cloud, probability distribution of the wind speed variability, diurnal cycle, and bias distribution. Coefficient of determination (

R^{2}

), MAE, and MBE are shown above each scatterplot. The color scheme in the scatter plots is used to show density. The dashed vertical line in the bias distribution plots is positioned at zero bias.

Figure 6. Summary of Sup3rWind performance against Ukraine vertically averaged wind farm observations. (Top), (left) to (right): MAE, MBE, and Pearson correlation coefficients. (Bottom), (left) to (right): coefficient of determination, MRQE, and KS-test statistic. A–E labels refer to the wind farms listed in Table 4.

Figure 7. Summary of Sup3rWind performance against Ukraine vertically averaged wind farm observations. Columns from left to right: Sup3rWind vs. observation point cloud, ERA5 vs. observation point cloud, probability distribution of the wind speed variability, diurnal cycle, and bias distribution. Wind Farms A–E from top to bottom row. Coefficient of determination (

R^{2}

), MAE, and MBE are shown above each scatterplot. MAE of the diurnal cycle is shown above each diurnal cycle plot. The color scheme in the scatter plots is used to show density. The dashed vertical line in the bias distribution plots is positioned at zero bias.

Figure 7. Summary of Sup3rWind performance against Ukraine vertically averaged wind farm observations. Columns from left to right: Sup3rWind vs. observation point cloud, ERA5 vs. observation point cloud, probability distribution of the wind speed variability, diurnal cycle, and bias distribution. Wind Farms A–E from top to bottom row. Coefficient of determination (

R^{2}

), MAE, and MBE are shown above each scatterplot. MAE of the diurnal cycle is shown above each diurnal cycle plot. The color scheme in the scatter plots is used to show density. The dashed vertical line in the bias distribution plots is positioned at zero bias.

Figure 8. Summary of performance against Ukraine MADIS observations. (Top), (left) to (right): MAE, MBE, and Pearson correlation coefficients. (Bottom), (left) to (right): coefficient of determination, MRQE, and KS-test statistic.

Figure 9. Summary of Sup3rWind performance against Ukraine MADIS observations. Columns from left to right: Sup3rWind vs. observation point cloud, ERA5 vs. observation point cloud, probability distribution of the wind speed variability, diurnal cycle, and bias distribution. Coefficient of determination (

R^{2}

), MAE, and MBE are shown above each scatterplot. MAE of the diurnal cycle is shown above each diurnal cycle plot. The color scheme in the scatter plots is used to show density. The dashed vertical line in the bias distribution plots is positioned at zero bias.

Figure 9. Summary of Sup3rWind performance against Ukraine MADIS observations. Columns from left to right: Sup3rWind vs. observation point cloud, ERA5 vs. observation point cloud, probability distribution of the wind speed variability, diurnal cycle, and bias distribution. Coefficient of determination (

R^{2}

), MAE, and MBE are shown above each scatterplot. MAE of the diurnal cycle is shown above each diurnal cycle plot. The color scheme in the scatter plots is used to show density. The dashed vertical line in the bias distribution plots is positioned at zero bias.

Table 1. A summary of terms used in this paper.

Terms	Meaning
True low-resolution data	Output of a low-resolution simulation. In contrast to artificial low-resolution data obtained through coarsening high-resolution simulation output. The primary example used is ERA5.
High-resolution target data $(y_{t r u e}$ )	Output of a high-resolution dynamical downscaling simulation. In contrast to synthetic high-resolution data obtained through GAN-based downscaling. The primary example used is WTK.
$G_{3, 1}$	Generator, trained with ERA5 input and coarsened WTK (10 km, hourly) as target data with modified content loss function, which enhances low-resolution data by spatial factor 3 (first enhancement step).
$G_{5, 1}$	Generator, trained with coarsened WTK (10 km, hourly) as input data and subsampled WTK (2 km, hourly) as target data, which enhances low-resolution data by spatial factor 5 (second enhancement step).
$G_{1, 12}$	Generator, trained with subsampled WTK (2 km, hourly) as input data and original WTK (2 km, 5 min) as target data, which enhances low-resolution data by temporal factor 12 (third enhancement step).
$G_{15, 12}$	Composite generator that performs all three enhancement steps to go from ERA5 (30 km, hourly) to 2 km, 5 min resolution.

Table 5. Statistics averaged across all CONUS validation regions.

Performance Measure	Sup3rWind	WTK	ERA5
MAE	1.901 m/s	1.769 m/s	2.428 m/s
MBE	−0.434 m/s	0.079 m/s	−1.908 m/s
Pearson Correlation Coefficient	0.721	0.741	0.692
Coefficient of Determination	0.524	0.555	0.492
Mean Relative Quantile Error	−0.075	−0.036	−0.345
KS-Test Statistic	0.115	0.109	0.292

Table 6. p-Values for performance measure differences averaged across all CONUS validation regions.

Performance Measure	Sup3rWind vs. WTK	Sup3rWind vs. ERA5
MAE	0.0379	0.0679
MBE	0.0526	0.0
Pearson Correlation Coefficient	0.241	0.00843
Coefficient of Determination	0.241	0.00914
Mean Relative Quantile Error	0.0846	0.0
Wilcoxon Signed-Rank Test	0.0373	0.0

Table 7. Statistics averaged across all wind farm observations.

Performance Measure	Sup3rWind	ERA5
MAE	1.7186 m/s	1.6202 m/s
MBE	−0.4879 m/s	−0.7407 m/s
Pearson Correlation Coefficient	0.7598	0.8016
Coefficient of Determination	0.5772	0.6426
MRQE	−0.105	−0.1321
KS-Test Statistic	0.0671	0.1124

Table 8. p-Values for performance measure differences averaged across all wind farm observations.

Performance Measure	Sup3rWind vs. ERA5
MAE	0.0304
MBE	0.041
Pearson Correlation Coefficient	0.0378
Coefficient of Determination	0.0378
MRQE	0.0
Wilcoxon Signed-Rank Test	0.00158

Table 9. Statistics averaged across all MADIS sites.

Performance Measure	Sup3rWind	ERA5
MAE	0.4209 m/s	0.4743 m/s
MBE	−0.1453 m/s	−0.2389 m/s
Pearson Correlation Coefficient	0.9088	0.8999
Coefficient of Determination	0.8259	0.8098
MRQE	−0.0543	−0.1287
KS-Test Statistic	0.0598	0.1011

Table 10. p-Values for performance measure differences averaged across all MADIS sites.

Performance Measure	Sup3rWind vs. ERA5
MAE	0.00651
MBE	0.0212
Pearson Correlation Coefficient	0.0167
Coefficient of Determination	0.0165
MRQE	0.0298
Wilcoxon Signed-Rank Test	0.00211

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Benton, B.N.; Buster, G.; Pinchuk, P.; Glaws, A.; King, R.N.; Maclaurin, G.; Chernyakhovskiy, I. Super-Resolution for Renewable Energy Resource Data with Wind from Reanalysis Data and Application to Ukraine. Energies 2025, 18, 3769. https://doi.org/10.3390/en18143769

AMA Style

Benton BN, Buster G, Pinchuk P, Glaws A, King RN, Maclaurin G, Chernyakhovskiy I. Super-Resolution for Renewable Energy Resource Data with Wind from Reanalysis Data and Application to Ukraine. Energies. 2025; 18(14):3769. https://doi.org/10.3390/en18143769

Chicago/Turabian Style

Benton, Brandon N., Grant Buster, Pavlo Pinchuk, Andrew Glaws, Ryan N. King, Galen Maclaurin, and Ilya Chernyakhovskiy. 2025. "Super-Resolution for Renewable Energy Resource Data with Wind from Reanalysis Data and Application to Ukraine" Energies 18, no. 14: 3769. https://doi.org/10.3390/en18143769

APA Style

Benton, B. N., Buster, G., Pinchuk, P., Glaws, A., King, R. N., Maclaurin, G., & Chernyakhovskiy, I. (2025). Super-Resolution for Renewable Energy Resource Data with Wind from Reanalysis Data and Application to Ukraine. Energies, 18(14), 3769. https://doi.org/10.3390/en18143769

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Super-Resolution for Renewable Energy Resource Data with Wind from Reanalysis Data and Application to Ukraine

Abstract

1. Introduction

1.1. Previous Work

1.2. Overview

2. Materials and Methods

2.1. Problem Statement and Notation

2.2. Data Description

2.3. Model Description

2.4. Model Training

2.5. Bias Correction

2.6. Inference

3. Results

3.1. CONUS Validation

3.2. Ukraine, Moldova, and Eastern Romania Performance

3.2.1. Wind Farm Site Comparisons

3.2.2. MADIS Site Comparisons

4. Discussion

Future Research Directions

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI