1. Introduction
The transition toward sustainable energy systems has placed renewable resources at the forefront of global strategies to mitigate climate change and reduce dependence on fossil fuels. Among them, wind energy has emerged as one of the most mature and scalable technologies, with installed capacity expanding rapidly worldwide [1]. Wind turbines, which convert the kinetic energy of moving air into electrical power, represent the cornerstone of this transition. Accurately modeling their performance is crucial for optimizing energy production, facilitating grid integration, and ensuring the reliable operation of wind farms.
Nevertheless, predicting wind turbine power output remains a challenging task. The power curve is inherently non-linear and influenced by multiple meteorological, geographical, and mechanical factors. Classical physics-based formulations, such as Betz’s law, provide a theoretical upper bound but fail to capture the variability observed in real-world conditions. This complexity has motivated the adoption of data-driven approaches. In particular, machine learning (ML) methods have attracted increasing attention due to their ability to model non-linear relationships and learn directly from empirical data [2,3].
Among ML techniques, artificial neural networks (ANNs) [4,5,6] stand out as universal function approximators [7,8,9], capable of capturing intricate dependencies between input and output variables. However, standard multilayer perceptrons (MLPs), i.e., feedforward fully connected ANNs, often suffer from spectral bias, which limits their ability to approximate rapidly varying or oscillatory functions [10,11]. This limitation is particularly relevant in wind energy modeling, where abrupt transitions occur near the cut-in, rated, and cut-out wind speeds of a turbine.
To address this challenge, we propose using Fourier feature encoding (FFE) as a means to enrich the input representation of wind speed measurements. By projecting scalar inputs into a high-dimensional sinusoidal space, FFE alleviates spectral bias and enables MLPs to better approximate complex, non-linear power curves. Our approach builds upon the influential work of Tancik et al. [12], who demonstrated that Fourier feature mappings substantially improve the ability of MLPs to learn high-frequency functions in low-dimensional domains. While their study focused mainly on computer vision and graphics applications, we extend this concept to the energy domain, where, to the best of our knowledge, the use of input-level random Fourier feature encodings with simple MLPs has received little to no prior attention.
Specifically, we present a systematic comparison between a conventional plain-input MLP and an FFE-enhanced MLP, both trained on the real-world, open-source dataset of a wind turbine, provided by the Research Group on Intelligent Engineering and Computing for Advanced Innovation and Development (GECAD) [13]. Our experiments show that Fourier-based encoding yields a substantial improvement in predictive accuracy, reducing the mean absolute error (MAE) by more than
, and achieving an $R^2$ score of
on the
validation dataset. The obtained results highlight the effectiveness of FFE in overcoming the spectral bias of standard MLPs, and establish it as a novel methodology for accurate wind turbine power prediction.
The remainder of this article is organized as follows. Section 2 reviews prior work on the application of ANNs to wind-turbine power-curve modeling, as well as recent advances in integrating FFE within neural network architectures. Section 3 provides an overview of wind-turbine operation. Section 4 details the research methodology, including the dataset characteristics and preprocessing steps. Section 5 presents the architecture of the predictive models developed in this study, namely a baseline MLP with plain wind speed input and an FFE-enhanced MLP designed to mitigate spectral bias. Section 6 reports and interprets the experimental outcomes, comparing the predictive performance of the two approaches. Finally, Section 7 summarizes the main findings and outlines potential directions for future research and model generalization.
2. Related Work
This section situates our work within two interconnected strands of research, namely, the extensive use of ANNs for wind-turbine power-curve modeling, and the recent advances in FFE as a means to mitigate spectral bias in neural networks.
A substantial body of wind energy research has established ANNs as strong models for turbine power-curve estimation. Compared with manufacturer S-curves and other parametric fits, ANN-based power curves capture site-specific non-linearities and operational effects more accurately, as summarized by comprehensive reviews of power-curve methodologies and applications [14]. Pelletier, Masson, and Tahan, for example, trained ANN power-curve models with multiple meteorological/operational inputs (wind speed, air density, turbulence intensity, shear, etc.) and reported markedly lower absolute and random errors than parametric baselines [15]. Manobel et al. combined Gaussian process (GP) prefiltering with ANN regression for power-curve modeling and achieved further accuracy gains over GP-only or ANN-only approaches [16]. Extending beyond one-dimensional curves, Yan et al. proposed two-dimensional (speed-direction) ANN power curves at the wind-farm level, reducing bias and improving accuracy relative to one-dimensional mappings [17]. More recently, explainable-AI frameworks tailored to data-driven power-curve models have been introduced to evaluate physical plausibility and robustness, addressing transparency concerns in operational settings [18].
Despite these advances, most ANN-based power-curve models rely on raw wind speed inputs, which, as will be discussed in detail, can limit their ability to represent rapid, high-frequency, or sharply varying patterns present in real measurement data. Such behaviors are precisely where standard MLPs tend to struggle, due to their inherent spectral bias, i.e., their tendency to learn smooth, low-frequency relationships more easily than high-frequency or rapidly changing behaviors [10,11]. This challenge motivates the incorporation of Fourier-based input encodings.
A first, influential use of fixed sinusoidal bases in deep learning appears in the Transformer models. In particular, Vaswani et al. introduced sinusoidal positional encodings to inject token order directly into the representation space [19]. Although not framed as “Fourier features”, these deterministic sine–cosine embeddings effectively instantiate a Fourier basis and demonstrate how frequency-structured priors can improve learning and generalization. Motivated by such observations, Rahaman et al. provided empirical and theoretical evidence that standard MLPs exhibit spectral bias, fitting low-frequency components far more readily than high-frequency content [10]. This clarified why naive coordinate-based networks blur fine detail and why explicit frequency encodings might help.
Against this backdrop, Tancik et al. crystallized Fourier feature mappings for coordinate-based MLPs, showing that passing inputs through multifrequency sinusoidal embeddings expands a network’s effective bandwidth and counters spectral bias [12]. With these encodings, plain MLPs reliably capture high-frequency structure in low-dimensional domains that they otherwise fail to learn.
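As a rough illustration of the mapping just described, the sketch below encodes a scalar input with random Fourier features in the style of Tancik et al., i.e., as the concatenation of cos(2πb_j x) and sin(2πb_j x) over a set of random frequencies b_j. The number of frequencies, the Gaussian scale `sigma`, the seed, the rescaling of the input to [0, 1], and the function names are illustrative assumptions rather than settings taken from any of the cited works or from this study.

```python
import math
import random


def sample_frequencies(num_features, sigma, seed=0):
    """Draw frequencies b_j ~ N(0, sigma^2); Gaussian sampling is an assumption here."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, sigma) for _ in range(num_features)]


def fourier_features(x, frequencies):
    """Map a scalar x to [cos(2*pi*b_j*x) for all j] + [sin(2*pi*b_j*x) for all j]."""
    angles = [2.0 * math.pi * b * x for b in frequencies]
    return [math.cos(a) for a in angles] + [math.sin(a) for a in angles]


# Hypothetical settings: 8 frequencies, sigma = 1.0, input already rescaled to [0, 1].
freqs = sample_frequencies(num_features=8, sigma=1.0)
encoded = fourier_features(0.35, freqs)
```

The encoded vector has twice as many entries as there are frequencies and would replace the raw scalar as the input of the MLP. Larger values of `sigma` expose the network to higher frequencies, which is the knob that trades smoothness against the ability to fit sharp transitions.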
A prominent application is neural radiance fields (NeRF), where an MLP represents a continuous radiance field for novel-view synthesis [20]. NeRF’s positional (Fourier-style) encodings of 3D coordinates and viewing directions are crucial, as they enable the network to capture high-frequency variations in appearance and geometry. Without these encodings, the model produces blurred structures and textures, whereas their inclusion allows it to reconstruct sharp edges and fine scene details with photorealistic accuracy.
Beyond vision, Fourier features have proven valuable in modeling oscillatory physics. Jandrell and Cox proposed a data-efficient framework for perturbation-dependent optical transmission in multimode fibers, encoding physical perturbations with a Fourier feature basis and training a compact MLP to learn highly oscillatory mappings [21]. On experimental fiber data, their model attained a complex correlation of
to ground truth, while using
fewer parameters than conventional baselines, underscoring how explicit frequency bases stabilize optimization and improve fidelity in real systems.
In the domain of physics-informed neural networks (PINNs), designed for problems governed by partial differential equations [22], incorporating Fourier features has proven effective in capturing multiscale solutions [23]. For instance, Wang et al. showed that Fourier feature MLPs alleviate the spectral bias in PINNs, enabling them to solve high-frequency or multiscale problems much more accurately [24].
Taken together, while ANNs are well established for power-curve modeling, Fourier feature encoding has, to the best of our knowledge, not yet been explored in this domain, leaving an open opportunity for further improvement. This is precisely where our contribution lies, as outlined in the sections that follow.
3. Wind Turbines
Before presenting the core of the proposed methodology, we briefly outline how a modern horizontal-axis wind turbine converts wind energy into electricity and why the resulting power curve exhibits a highly non-linear, piecewise structure. A wind turbine transforms the kinetic energy of moving air into electrical power through a sequence of aerodynamic, mechanical, and electrical processes. The rotor blades extract energy from the wind and transfer it through the drivetrain to the generator, while control systems regulate this conversion to maintain efficient and safe operation [25].
Structurally, a wind turbine comprises several interdependent components, each affecting the overall power output. The tower elevates the rotor to higher altitudes where wind speeds are stronger and steadier, thereby increasing the kinetic energy available for conversion. Mounted atop the tower, the nacelle houses the gearbox, generator, and control systems that together transform mechanical rotation into electricity. The rotor blades, typically made of lightweight composite materials, are responsible for capturing the wind’s kinetic energy; their aerodynamic design determines how effectively this energy is converted into torque. The gearbox adjusts the low-speed, high-torque rotation of the rotor into the high-speed rotation required by the generator, which converts mechanical energy into electrical energy through electromagnetic induction. Meanwhile, control systems, including pitch and yaw mechanisms, continuously optimize the orientation and rotational speed of the blades to maximize energy capture under varying wind conditions, while preventing mechanical overload. Finally, the anemometer and wind vane provide real-time measurements of wind speed and direction, enabling the control systems to adaptively adjust turbine operation for efficiency and safety [26].
The electrical power output $P$ of a wind turbine depends on the air density, rotor swept area, and wind speed, as well as on the turbine’s aerodynamic, mechanical, and electrical efficiencies. This relationship is expressed by

$$P = \frac{1}{2}\,\rho A v^{3}\, C_p\, \eta_m\, \eta_e, \tag{1}$$

where $\rho$ is the air density ($\mathrm{kg/m^3}$), $A$ is the swept area of the rotor blades ($\mathrm{m^2}$), and $v$ is the wind speed ($\mathrm{m/s}$). The term $C_p$ denotes the power coefficient, a dimensionless measure of how efficiently the rotor converts the wind’s kinetic power ($\frac{1}{2}\rho A v^{3}$) into mechanical power. According to Betz’s law, the theoretical upper limit of $C_p$ is 0.593, meaning that no turbine can capture more than 59.3% of the wind’s kinetic energy. In practice, however, aerodynamic imperfections, frictional losses, and wake effects reduce this value to typically between 0.3 and 0.4, depending on the rotor geometry, airfoil design, and operating conditions. The parameters $\eta_m$ and $\eta_e$ denote the mechanical and electrical efficiencies, representing drivetrain losses (bearings, shafts, gearbox) and conversion losses in the generator and power electronics, respectively.
Equation (1) is valid only over a limited range of wind speeds, because the electrical power that can ultimately be generated is at most equal to the rated electrical power of the generator. Thus, when the wind speed is high enough that the electrical power calculated from Equation (1) exceeds the rated electrical power, the power actually produced equals that rated value. Three characteristic wind speeds govern the power generation of the turbine. First, the cut-in speed is the minimum wind speed (typically 3–4 m/s) at which the wind turbine starts to produce mechanical power; below it, friction and other mechanical losses prevent generation. Second, the rated speed is the wind speed at which the generator starts to produce the rated (i.e., maximum) electrical power. Third, between the rated speed and the so-called cut-out speed, the generator limits its output to the rated electrical power. For speeds greater than the cut-out speed, power generation is interrupted to protect the installation. Against this background, and for constant values of $\rho$, $C_p$, $\eta_m$, $\eta_e$, and $A$, the power curve of a wind turbine, in view of Equation (1), is depicted in Figure 1.
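The piecewise behavior described above can be sketched in a few lines. The example evaluates the cubic law of Equation (1), caps it at the rated power, and returns zero below the cut-in and above the cut-out speed. All numerical values (air density, rotor radius, power coefficient, efficiencies, rated power, and the cut-in and cut-out speeds) are hypothetical and chosen only for illustration; they do not describe the turbine studied in this work.

```python
import math


def available_power(v, rho, area, cp, eta_m, eta_e):
    """Cubic law of Equation (1): P = 0.5 * rho * A * v^3 * C_p * eta_m * eta_e (in W)."""
    return 0.5 * rho * area * v ** 3 * cp * eta_m * eta_e


def power_curve(v, rated_power, cut_in, cut_out, **params):
    """Idealized power curve: zero outside [cut_in, cut_out], capped at rated_power."""
    if v < cut_in or v > cut_out:
        return 0.0
    return min(available_power(v, **params), rated_power)


# Hypothetical turbine: 40 m rotor radius, C_p = 0.4, 2 MW rated power.
params = dict(rho=1.225, area=math.pi * 40.0 ** 2, cp=0.4, eta_m=0.95, eta_e=0.95)
curve = [
    power_curve(float(v), rated_power=2.0e6, cut_in=3.0, cut_out=25.0, **params)
    for v in range(0, 31)
]
```

In this idealization the rated speed is not set explicitly; it emerges as the wind speed at which the cubic law first reaches the rated power. Measured power curves are noisier and smoother around the transitions than this sketch suggests.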
As indicated by Equation (1) and illustrated in Figure 1, between the cut-in speed and the rated speed, the generated electrical power $P$ grows as a cubic function of the wind speed $v$, before saturating at the rated power and dropping abruptly at the cut-out speed. This combination of smooth cubic growth and abrupt transitions in the power curve leads to a highly complex power production pattern that is difficult to approximate with conventional linear models or plain-input neural networks. Such non-linearities provide the main motivation for employing advanced input encodings, such as FFE, which, as we will demonstrate, enrich the input space with sinusoidal components and allow neural networks to better capture the sharp, piecewise behavior of real turbine power curves.
4. Methodology
The methodology adopted in this work is presented in this section, covering the research design, data profile, and preprocessing procedures.
4.1. Research Design
The research design of this study follows a structured, stepwise methodology aimed at developing, training, and evaluating neural network models for wind turbine power prediction. The workflow begins with the selection and preparation of a real-world dataset containing synchronized wind speed and power measurements from a turbine. Following data acquisition, the dataset undergoes a standardized preprocessing pipeline that includes cleaning and chronological splitting. Particular attention is given to preventing temporal leakage by ensuring that all preprocessing operations are fitted exclusively on the training period.
The study then proceeds with the development of two predictive models, namely, a baseline MLP using raw wind speed inputs, and an enhanced MLP that incorporates FFE. Model training, hyperparameter selection, and early-stopping strategies are applied consistently across both architectures, enabling a controlled comparison. The evaluation phase uses a strictly held-out, chronologically later portion of the dataset to assess generalization performance under realistic operational conditions.
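To make the early-stopping step concrete, the following sketch shows one common patience-based rule applied to a sequence of per-epoch validation errors. The patience value, the function name, and the error values are hypothetical; the specific stopping criterion used in this study is not restated here, so this should be read as a generic illustration rather than the exact procedure.

```python
def early_stopping_epoch(val_errors, patience=3):
    """Return (best_epoch, stop_epoch) for a list of per-epoch validation errors.

    Training stops once `patience` epochs have passed without a new best error.
    """
    best_epoch, best_error = 0, float("inf")
    for epoch, err in enumerate(val_errors):
        if err < best_error:
            best_epoch, best_error = epoch, err
        elif epoch - best_epoch >= patience:
            return best_epoch, epoch
    return best_epoch, len(val_errors) - 1


# Hypothetical validation errors: the minimum is reached at epoch 2.
best, stopped = early_stopping_epoch([0.50, 0.40, 0.30, 0.31, 0.32, 0.33, 0.20])
```

Applying the same rule with the same patience to both models keeps the comparison controlled, since neither architecture is given a larger training budget than the other.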
4.2. Data Profile
For the present study, we employ an open-source, real-world dataset provided by the Research Group on Intelligent Engineering and Computing for Advanced Innovation and Development (GECAD) [13]. The dataset contains measurements of wind speed and corresponding electrical power output from a
rated wind turbine, recorded throughout the year 2011. Each entry consists of a pair $(v, P)$, where $v$ is the wind speed (in $\mathrm{m/s}$) and
P is the electrical power generated (in
), sampled at 10 min intervals. An excerpt of the dataset is shown in Table 1, while Figure 2 illustrates the time series of both the turbine’s electrical power output and the corresponding wind speed over a representative portion of the dataset.
The dataset covers the entire year 2011 at 10 min intervals, yielding a total of 50,364 samples. Of these, the earliest 35,254 samples (approximately 70%) were used for training, while the most recent 15,110 samples (approximately 30%) formed the held-out validation set. This chronological separation ensures that no information from the future leaks into the training process and provides a realistic assessment of predictive performance.
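The chronological split can be reproduced with a simple index cut. The sketch below uses placeholder integers in place of the actual time-ordered records, and the function name is hypothetical; only the sample counts and the 70/30 proportion come from the text above.

```python
def chronological_split(samples, train_fraction=0.7):
    """Split time-ordered samples: earliest part for training, the rest for validation."""
    n_train = int(len(samples) * train_fraction)  # rounds down
    return samples[:n_train], samples[n_train:]


# Placeholder indices standing in for the 50,364 time-ordered (v, P) records.
records = list(range(50364))
train, val = chronological_split(records)
```

Because the cut is made on position rather than by random sampling, every validation record is later in time than every training record, which is what rules out temporal leakage.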
6. Results and Discussion
Having introduced the architecture and parameters of the developed ANNs, this section presents and interprets the experimental results. Accordingly, Table 2 summarizes the error metrics obtained for both models on the validation dataset. Recall that the dataset was split chronologically, with the first 70% of samples used for training and the remaining 30% reserved as a held-out validation set, ensuring a leakage-free and operationally realistic evaluation. On that basis, the plain-input MLP achieves an $R^2$ of
and a mean absolute error (MAE) of
, indicating that while it captures the overall trend of the power curve, residual deviations persist (as shall be illustrated subsequently). By contrast, the FFE-MLP delivers substantially better performance, reducing the MAE to only
and lowering the mean squared error (MSE) by nearly an order of magnitude. Its $R^2$ score of
confirms an almost perfect fit to the true power values.
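For reference, the three metrics discussed here can be computed as in the sketch below, which uses the standard definitions of MAE, MSE, and the coefficient of determination. The toy values are invented for illustration and are unrelated to the results in Table 2.

```python
def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def mse(y_true, y_pred):
    """Mean squared error."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Toy example (invented values).
y_true = [0.0, 1.0, 2.0, 3.0]
y_pred = [0.0, 1.0, 2.0, 4.0]
scores = (mae(y_true, y_pred), mse(y_true, y_pred), r2_score(y_true, y_pred))
```

Since MSE squares the residuals, a model that removes a few large errors near the transition regions can lower the MSE much more than the MAE, which is consistent with the pattern reported above.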
The training dynamics of the developed models are illustrated in Figure 5, which shows the evolution of the validation MAE (computed on normalized outputs) across epochs for both models. It is evident that the plain-input MLP converges slowly and stabilizes at a higher validation error, while the Fourier feature MLP exhibits both faster convergence and consistently lower validation error. These observations are fully consistent with the error metrics of Table 2, where the Fourier-based model clearly outperforms the plain-input baseline.
Thereafter, Figure 6a illustrates the temporal behavior of the predicted power compared to the actual (ground truth) output in a representative segment of the validation dataset. The plain-input MLP (orange) succeeds in capturing the general trend of the power curve, but noticeable deviations arise, especially at the transition points between low- and high-power regions. In contrast, the MLP enriched with FFE (green) aligns almost perfectly with the true signal, exhibiting significantly reduced error and enhanced stability.
A complementary perspective is provided in Figure 6b, which presents the scatter plot of estimated power against wind speed. The plain MLP (orange) approximates the power curve reasonably well, but introduces discrepancies around the cut-in, rated, and cut-out regions. These regions are characterized by abrupt non-linearities that plain-input networks tend to smooth out. On the other hand, the FFE-MLP (green) almost exactly overlaps with the ground truth, showing only minor deviations at extreme wind speeds. In particular, a few scattered predictions appear above the cut-out region, which are inconsistent with turbine physics, as power generation must be strictly zero beyond the safety threshold. This can be remedied by applying a simple corrective rule, according to which the predicted power values are clamped to zero slightly beyond the nominal cut-out speed, acknowledging that this threshold is not precisely defined and may vary across turbine models and operating conditions. With this adjustment, the Fourier-based model remains both highly accurate and physically consistent. This confirms that Fourier features effectively mitigate spectral bias, enabling the model to learn both the cubic relationship of Equation (1) and the sharp discontinuities that define the turbine’s power curve.
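The corrective rule mentioned above amounts to a one-line post-processing step. The sketch below sets predictions to zero once the wind speed exceeds the cut-out speed by a small margin. The cut-out speed of 25 m/s, the margin of 0.5 m/s, the function name, and the sample values are hypothetical, since the text does not fix them.

```python
def clamp_beyond_cut_out(wind_speeds, predictions, cut_out, margin=0.5):
    """Set predicted power to zero where wind speed exceeds cut_out + margin."""
    threshold = cut_out + margin
    return [0.0 if v > threshold else p for v, p in zip(wind_speeds, predictions)]


# Invented example: the last prediction lies beyond the clamping threshold.
speeds = [10.0, 24.9, 25.3, 26.0]
raw_predictions = [1500.0, 2000.0, 1990.0, 40.0]
clamped = clamp_beyond_cut_out(speeds, raw_predictions, cut_out=25.0)
```

The margin reflects the fact that the shutdown threshold is not sharp in measured data; a margin of zero would also discard legitimate samples recorded just as the turbine is shutting down.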
The demonstrated improvement achieved by FFE is not only of methodological interest, but also of practical importance for renewable energy applications. Accurate prediction of wind turbine power output directly affects energy scheduling, reserve allocation, and grid stability. By substantially reducing prediction error, the FFE-MLP model enables operators to anticipate output fluctuations more reliably, particularly around the cut-in, rated, and cut-out speeds where abrupt changes occur. In addition, enhanced modeling accuracy facilitates more efficient turbine control strategies, which rely on precise characterization of turbine behavior. Equally importantly, the improved representation of sharp non-linear transitions enables a more reliable design of protection curves, alarm thresholds, and shut-down logic. As protection relays and power-electronic controllers depend on timely detection of overspeed and overproduction events, a model that captures these high-frequency variations reduces nuisance trips and improves the selectivity of alarms. This leads to more stable operation during gusts, turbulence bursts, or sudden grid disturbances, and supports better event logging in supervisory control systems. The approach also holds promise for predictive maintenance, as deviations between predicted and actual power could serve as early indicators of mechanical or aerodynamic faults.
Beyond these operational benefits, the proposed approach also highlights a broader innovation pathway for wind energy systems. By improving the fidelity with which high-frequency events and abrupt transitions are captured, Fourier-enhanced models support more resilient plant operation, enabling turbines to maintain stable production even under rapidly changing atmospheric or grid conditions. This improved robustness reduces the likelihood of unnecessary curtailments, mitigates cascading effects during disturbances, and contributes to maintaining continuous power delivery from converter-dominated renewable fleets. As wind penetration increases and system operators place greater emphasis on reliability, inertia emulation, and fault-ride-through compliance, accurate modeling of fast dynamic behavior becomes a key enabler for secure integration. In this context, the FFE-MLP framework represents not only a modeling refinement, but also a step toward next-generation forecasting and control strategies capable of enhancing production efficiency, reducing operational risk, and strengthening overall grid stability.
7. Conclusions
This work presented a comparative study of artificial neural networks for modeling wind turbine power generation, with a particular focus on the effect of Fourier feature encoding (FFE) on predictive performance. Using a real-world dataset of a wind turbine, we evaluated two multilayer perceptron (MLP) architectures, that is, one trained directly on raw wind speed inputs, and one trained on Fourier-encoded inputs.
The obtained results clearly indicate that Fourier features substantially enhance the representational power of MLPs. While the plain-input MLP was able to capture general trends, it struggled with the highly non-linear regions of the wind speed–power curve, especially near the cut-in, rated, and cut-out speeds. By contrast, the Fourier-enhanced MLP achieved nearly perfect alignment with the ground truth, as demonstrated by both visual comparisons and quantitative metrics (mean absolute error reduced by more than , $R^2$ close to unity).
While our experiments focus specifically on wind-turbine power modeling, the underlying mechanism (Fourier feature inputs mitigating spectral bias) suggests broader applicability to regression tasks in the energy domain that involve oscillatory or sharply non-linear input–output relationships. This includes other renewable energy applications such as photovoltaic generation modeling, where non-linearities and threshold effects similarly challenge standard machine learning models. Future work will extend the present single-turbine framework to a multiturbine collaborative model, explicitly incorporating farm-level interactions such as turbine spacing, wake effects, relative positioning, and intra-farm turbulence patterns. We also aim to integrate additional meteorological variables relevant at the wind farm scale (such as turbulence intensity, air density variations, and directional shear), and to systematically investigate the impact of hyperparameter choices (e.g., number of Fourier features, frequency scaling, and network depth) on model performance. In addition, we plan to explore hybrid architectures that combine Fourier features with symbolic regression for enhanced interpretability [29].