1. Introduction
Countries around the world are facing the dual challenges of the depletion of natural resources and the continuous deterioration of the ecological environment. As a renewable energy source, wind energy features low overall development and utilization costs and minimal environmental pollution. As the world’s largest wind energy market, China’s new wind power installed capacity accounted for 70% of the world’s new wind power installed capacity in 2024, and wind energy applications are increasingly popular. [
1,
2]. As the country’s exploitable wind resources in plains and offshore regions approach saturation [
3], attention has increasingly shifted toward forested regions [
4]. According to national statistics on the overlap between forested areas and wind resource coverage, forested areas account for 23.46% of China’s total land area [
5]. Among these, the northeastern forest regions represent nearly one-third of China’s total forested area [
6], indicating considerable untapped potential for wind energy development [
7]. A comprehensive understanding and accurate assessment of the wind potential in candidate regions are essential prerequisites for rational wind farm planning and efficient utilization of wind energy resources [
8].
The development of wind energy in forest areas is not unique to China, but in the general trend of the global transition to renewable energy, forest areas have gradually become an important location direction for onshore wind power development [
9]. Nordic countries with high forest cover are increasingly locating wind farms in such areas [
10]. However, the development of wind energy resources in forest areas faces a double challenge. On the one hand, the wind field in forest areas has a significantly high complexity [
11]. Due to the joint influence of rolling terrain and dense vegetation, the near-earth atmospheric boundary layer air flow undergoes deflection, separation and abrupt change, wind direction disturbance is frequent, and the vertical and horizontal exchange of momentum, heat and water vapor between atmospheric turbulence and the boundary layer is more complex [
12,
13,
14]. The complex tree crown structure (characterized by tree height and leaf area index) will cause high intensity turbulence and strong vertical wind shear [
15], which brings great uncertainty to the analysis of wind field characteristics and wind energy assessment in forest areas [
16]. On the other hand, seasonal changes in vegetation (such as defoliation and foliation) will cause dynamic changes in surface roughness, further increasing the complexity of wind resource assessment [
17]. Some researchers have found [
18,
19] that the prediction error of near-surface wind speed of wind farms in China is about 25% to 40%, especially in mountainous areas with complex terrain and significantly affected by ground fluctuations and vegetation cover changes. Wang et al. [
20] used the WRF model to simulate the near-surface wind field of complex terrain. The results showed that although different PBL schemes could accurately predict the dominant wind direction and wind speed trend, there were still systematic deviations. Therefore, it is very important to fully understand and accurately assess the wind energy potential of forest areas [
21], and it is necessary to explore an accurate method for better prediction and analysis of wind resources [
22].
Assessing wind energy potential over complex terrain requires detailed statistical analysis of wind resource data. Currently, three primary methods are used to obtain wind resource data [
23]: field measurements [
24,
25], wind tunnel experiments [
26,
27], and numerical simulations [
28,
29]. Field measurements can provide spatiotemporal wind data under real atmospheric conditions, which is suitable for studying large-scale wind field characteristics and their ecological effects [
30]. However, it is limited by the high cost and the limited number of wind measuring towers [
31,
32,
33]. Wind tunnel experiment, on the other hand, can provide controllable experimental conditions and high-resolution local wind field data [
34], yet it does not have obvious advantages due to the consideration of cost, cycle and test accuracy [
35,
36]. The numerical simulation method can efficiently construct regional wind field models under real terrain conditions and has significant advantages when combined with reanalysis meteorological data [
35]. By configuring various parameter schemes, these methods can adapt to different simulation processes [
37,
38]. With the rapid advancement of computer technology, their applications have become increasingly widespread and mature. Therefore, this study employs numerical simulation to acquire wind field data and conduct corresponding wind energy assessments.
In the field of wind energy, the Weather Research and Forecasting (WRF) model is the most widely used numerical simulation tool [
35,
39,
40]. The WRF model overcomes the dependence on in situ meteorological measurements such as mast data and features a well-developed dynamic framework with diverse physical parameterization schemes. Its open-source structure allows for effective simulation and assessment of wind fields [
41]. However, as a mesoscale model with a spatial resolution typically on the order of kilometers, WRF’s predictive accuracy remains limited. What is more, the vertical and horizontal exchange of momentum, heat, water vapor and so on in atmospheric turbulence and the atmospheric boundary layer is more complex [
12,
13,
14], which makes it difficult to fully meet the actual needs of wind power development. To improve simulation precision, researchers have commonly enhanced terrain resolution [
42], refined land-use categories [
43], or tested various combinations of physical parameter schemes [
21]. Updating the default WRF sea surface temperature (SST) and roughness length (Z
0) significantly improved wind simulation accuracy in coastal regions, with greater enhancement near the coastline than offshore [
44]. In forested ecological regions, wind fields are further influenced by heterogeneous vegetation cover and complex canopy structures [
45]. The leaf area index (LAI), a key parameter describing canopy morphology, directly affects turbulent transport rates in the near-surface layer and consequently modulates momentum, mass, and energy exchanges between the surface and the atmosphere [
46]. It is therefore considered a primary factor governing vertical wind profiles over vegetated areas [
47]. The default Noah land surface scheme of WRF uses typical climatological values to set LAI, which cannot fully reflect the local canopy characteristics. Taking the study of wheat dry-hot wind as an example, Wang Shu et al. found that PKULM was more accurate in expressing vegetation physiological processes due to the introduction of higher time resolution LAI data. Finally, the latent heat flux simulated by PKULM is more consistent with the physiological process law and observation law of vegetation than Noah LSM [
48]. Hence, examining the impact of modified LAI values on wind field simulations is of significant importance for improving model performance in forested terrains.
The accuracy of the atmospheric initial state is another critical factor influencing WRF simulation performance [
49]. Fu et al. [
50] compared simulations of high-wind typhoon events driven by two reanalysis datasets, NCEP-FNL and ECMWF-ERA5, and found that higher-resolution forcing data improved both the representation of typhoon structure and the accuracy of peripheral wind fields. Yang et al. [
51] found that the WRF model could reproduce the mean climate, interannual variations, precipitation cycles, and surface temperature (T2 m), though with some deviations. Shi et al. [
52] reported that assimilating observational data into the initial atmospheric fields markedly improved model quality and simulation accuracy. Commonly used data assimilation techniques include three-dimensional variational method (3DVAR), four-dimensional variational method (4DVAR), ensemble Kalman filter method (EnKF), and variational assimilation method (Hybrid) [
53]. Wang et al. [
54] compared precipitation simulations with and without four-dimensional nudging and observed significant improvements in precipitation distribution and magnitude after assimilation. Among these approaches, the 3DVAR method remains the most widely applied. In addition, Yang et al. [
55] investigated the benefits of improving surface information quality to enhance rainfall simulation, using the enhanced land use and leaf area index (LAI) dataset from the WRF-3 DVAR assimilation system as a case study.
It is worth noting that the aforementioned improvements mainly focused on a single factor and lacked collaborative optimization studies on multi-source information such as parametric schemes, surface parameters, and initial fields. Especially in forest areas, studies that comprehensively consider the seasonal applicability of PBL schemes, local adjustments of canopy parameters, and data assimilation are still rarely reported. To address these research gaps, this study proposes the following core hypotheses: By integrating the multi-source collaborative optimization framework of PBL scheme seasonal selection, LAI local adjustment, and 3DVAR data assimilation, the wind field simulation accuracy in forest areas can be significantly improved, and there are certain physical laws. To verify this hypothesis, this study designs four research questions: (i) Are there significant differences in the performance of different PBL schemes in different seasons and at different heights in the northeastern forest area? What are the patterns? (ii) Can the LAI correction based on local data effectively reduce the systematic deviation in the simulation of near-ground wind speed? What are the differences in the improvement effect at different heights? (iii) Can the introduction of 3DVAR data assimilation further improve the wind field simulation accuracy? (iv) How does the wind field simulation result after multi-source collaborative optimization affect the assessment of wind energy resources? What is the optimization effect? To answer these questions, the main research activities include: (i) constructing simulation experiments of PBL schemes at different heights in spring and winter; (ii) integrating local LAI values into WRF and designing comparative experiments; (iii) introducing the 3DVAR data assimilation method and designing comparative experiments with or without assimilation; (iv) based on the simulation results before and after optimization, calculating the wind power density and evaluating the impact of the multi-source optimization framework on the assessment of wind energy resources.
The scientific contribution of this study lies in constructing a multi-source collaborative optimization framework that integrates the seasonal selection of PBL parameterization schemes, local adjustment of LAI, and 3DVAR data assimilation based on the WRF model. It systematically evaluates the synergy effect of the three. It reveals the advantages of the YSU scheme, with unstable stratification in the spring the high layer in the winter, as well as the applicability of the local scheme near the ground in winter. It quantifies the cumulative improvement effect of local LAI adjustment and the introduction of data assimilation on the simulation of the near-ground wind field in forest areas and verifies the practical value of the collaborative framework. On this basis, this study further analyzes the impact of the multi-source optimization scheme on the assessment of wind energy resources and verifies the superiority of the collaborative optimization framework in wind field simulation in forest areas. The research results can provide scientific basis for wind farm site selection, turbine selection, and forest engineering design in the northeastern forest area, and also provide methodological references for wind field simulation optimization in other regions.
3. Results and Discussion
The flow chart of the multi-source optimization methods involved in this chapter is shown in
Figure 3. In the
Figure 3, the upward arrow indicates the degree of improvement in the MRE error indicator value. In order to verify the core assumptions and answer the research questions raised at the beginning of this paper, this chapter carries out the following four aspects of research content. Specifically, a variety of planetary boundary layer schemes PBL were selected for comparative analysis of simulation effects, and then the canopy characteristic parameter LAI was dynamically modified locally, and then 3DVAR assimilation data was introduced. Finally, the wind field simulation effect improvement analysis of multi-source optimization method was carried out from the perspective of wind energy assessment.
To accurately evaluate the simulation accuracy of wind speed and wind direction, six statistical indicators were employed: mean relative error (MRE), mean error (ME), root mean square error (RMSE), mean absolute error (MAE), Normalized Mean Bias (NMB) and correlation coefficient (R). Their mathematical formulations are expressed as follows.
where
i represents the time index,
n denotes the total number of samples,
indicates the simulated value at time
i, and
denotes the corresponding observed value. The
and
represent the mean values of the simulated and observed datasets, respectively.
When calculating the error indicators for wind direction, both the simulated wind direction values and the measured wind direction values are considered. At this time, since wind direction is a cyclic variable that varies within the range of 0° to 360°, the traditional linear calculation will produce incorrect jumps at the 0°/360° boundaries. Therefore, the values of
in Formulas (9)–(12) need to be cyclically corrected to ensure that the error is within the range of [−180°, 180°]. The specific details are as shown in Formula (14).
3.1. Results of Parameterization Scheme Configuration
The simulation results are analyzed.
Table 2 shows the error index and significance test results of simulated wind speed and measured wind speed at different heights in May and December under different planetary boundary layer (PBL) parameterization schemes. To objectively assess the statistical significance of the differences among the various schemes, this study employed the paired sample Wilcoxon signed-rank test and conducted a Fisher Z transformation comparison test on the correlation coefficients (R) among the schemes to examine the potential systematic deviation between the simulated values and the observed values at a 0.05 significance level.
As shown in
Table 2, the root mean square error of all experiments reached its peak at a height of 20 m (RMSE > 2.1 m/s), mainly due to the strong aerodynamic resistance within the forest canopy–atmosphere interface and the sub-grid scale terrain turbulence. As the altitude increased, the correlation coefficient of Case 1 showed a gradually increasing trend, indicating that the WRF model has an outstanding ability to capture the mesoscale weather forcing and wind speed dynamics in the upper part of the planetary boundary layer (PBL). At 100 m height, the absolute error increased, which was due to the increase in wind speed with altitude, and the R increased, which was due to the fact that the turbulence in the wind field is more stable than that near the ground. Overall, Case 1 consistently demonstrated the most robust prediction performance in all vertical layers. In May, all the indicators of Case 1 remained at the leading position. At the 20 m and 50 m heights, the correlation coefficient of the YSU scheme was significantly higher than that of the MYJ and MYNN schemes, indicating that YSU has superior ability in simulating the wind speed trend under the spring unstable stratification. In December, Case 1 showed slightly higher RMSE and NMB values at the 20 m near-ground height. This was attributed to the advantages of Case 2 and Case 3 in using the local TKE closure scheme to simulate the near-ground winter stable stratification, while YSU was limited by the physical constraints of excessive mixing. However, at an altitude of 100 m, the R value of YSU (0.55) was still higher than that of MYJ and MYNN, but there was no significant difference.
Overall, in spring, YSU demonstrated significant advantages at all heights. In winter, although YSU could also simulate wind speed relatively well near the ground, its performance was slightly inferior to the local closed-loop scheme. However, as the height increased, the simulation ability of YSU narrowed the gap with other schemes even more. Moreover, this indicates to some extent that YSU has excellent simulation performance in both spring and winter seasons, and maintains stable performance in simulating mountain forest wind fields with strong dynamic shear.
It is worth noting that all three schemes showed positive mean error (ME) values (ranging from 0.58 to 1.05 m/s), indicating a systematic overestimation of wind speed. This suggests that even with the YSU and MM5 schemes, WRF still exhibits systematic overestimation when dealing with high roughness surface layers. Part of the reason is that the current schemes cannot capture the dynamic influence of the near-ground vegetation roughness on the wind field, and the model’s portrayal of near-ground momentum decay and energy dissipation is insufficient. Therefore, it is necessary to introduce vegetation feature descriptions in the WRF model to improve the accuracy of wind field simulation in the forest areas of Northeast China.
The wind direction simulation errors at different heights are summarized in
Table 3.
As shown in
Table 3, under weak wind conditions, wind direction in forested areas exhibits strong randomness and localized characteristics. The Northeast Forest Region demonstrates significant thermal inhomogeneity in May, with complex canopy structures frequently causing weak instability or neutral stratification in the near-surface layer. The YSU scheme achieves optimal RMSE and MAE values across three altitudes, with most ME values approaching optimal levels, demonstrating strong capability in capturing wind direction variations under spring instability stratification. This stems from YSU’s non-local approach, which fully accounts for full-layer mixing effects induced by large vortex thermal driving, while MYNN and MYJ schemes as local (Local) approaches primarily focus on gradient diffusion between adjacent layers. The MM5 scheme, based on the Moinin–Obhoff similarity theory and incorporating stability correction with segmented function optimization for stability parameters, exhibits greater robustness in handling friction velocity. Comparative results from December reveal MYNN’s slight advantage in lower layers and YSU’s minimum RMSE in upper layers. This reflects that local closure schemes better maintain near-surface structures under winter stability stratification, while non-local approaches can also simulate fundamental wind direction characteristics at higher altitudes. The December comparative analysis of various schemes demonstrates MYNN’s marginal advantage in lower layers, while YSU achieves the lowest RMSE in upper layers. This indicates that local closure schemes under stable winter stratification better preserve near-surface structures, though non-local schemes can also accurately simulate fundamental wind direction characteristics in upper layers.
In summary, the integrated wind speed and direction simulation results for both summer and winter demonstrate that the YSU scheme achieves optimal overall performance in spring. Although its near-surface wind direction in winter is slightly inferior to MYJ, its overall error level remains comparable to MYJ and MYNN, indicating low sensitivity. Therefore, this section answers the research question Q1 by comparing the simulation errors of different PBL schemes in May and December and at different heights: The YSU scheme has the best performance in unstable stratification in May and in high layers in winter, and its RMSE is reduced by up to 25.7% compared with other schemes. The MYJ scheme has a slight advantage in the surface layer only in December (the RMSE difference is within 6%, and the R value is not much different), which reveals that PBL scheme shows inter-seasonal applicability in the simulation of a mountain forest wind field. The selection of the PBL parameter scheme is fully justified. However, the single mesoscale parameterization scheme still exhibits significant limitations in fine-scale forest canopy simulation, such as overestimating wind speeds, which requires further optimization.
Based on the above analysis, the main conclusions of this study can provide the following guidance for other forest areas. In the unstable stratification of spring, priority should be given to non-local solutions (such as YSU); in the stable stratification of winter, if the research focus is on the near-surface wind field (such as in forest ecological studies), local solutions (such as MYJ) can be considered; if the focus is on the upper-level wind field (such as at the height of the wind turbine hub), YSU remains the more reliable choice.
3.2. Results of Forest Canopy Characteristic Parameters LAI
The dominant tree species in the Maoershan region include Xing ‘an deciduous pine, Korean pine, red pine, birch, Mongolian oak, poplar and water elm. According to the MODIS land surface classification, these correspond to Type 4 (deciduous broadleaf forest) and Type 5 (mixed forest). Considering the local forest composition, simulations were conducted during May–June, the active growing season characterized by rapid canopy development. Based on field LAI measurements obtained using the LAI-2200 system [
60], localized LAI corrections were applied to the study area. The crown height was 16 m, and the minimum and maximum values of roughness length were adjusted to 1.3 m and 1.9 m for deciduous broadleaf forest, and 1.4 m and 1.9 m for mixed forest. The default LAI values were adjusted every ten days to capture the dynamic canopy development during the leaf-expansion period, as summarized in
Table 4.
Figure 4 presents a comparative illustration of the discrepancies between simulated wind speed profiles before and after LAI correction and corresponding observational data.
Table 5 quantifies the results, providing an error analysis of simulated versus observed wind speeds and wind directions at various heights prior to and following the LAI localization correction.
Figure 4 results indicate that the default LAI (Case 1) significantly overestimates simulated wind speeds at 20 m height compared to observations, reflecting an underestimation of canopy density in northeastern forest regions. Following localized correction (Case 4), the wind profile gradient markedly diminishes, demonstrating that enhanced canopy resistance produces a more realistic deceleration effect on the lower atmosphere.
Furthermore, the quantitative analysis in
Table 5 shows that the model has reduced the average relative error (MRE) of wind speeds at 20 m, 50 m, and 100 m by 0.141, 0.072, and 0.013, respectively. The accuracy at 20 m and 50 m has increased by approximately 35%. This indicates that the model has a clear and consistent physical response to changes in the LAI parameter. Additionally, the corrected LAI parameter can more accurately reflect the true canopy structure characteristics of the vegetation, thereby realistically reproducing the influence of canopy resistance on turbulence during the simulation process. The increase in LAI directly enhances the physical obstructive area of the canopy to the airflow, resulting in a significant increase in the dynamic roughness. As the localized LAI depicts a more dense forest canopy structure, the zero plane moves upward. According to the logarithmic wind profile theory, after the localized LAI corrects the over-sparse deviation that is commonly present in the default dataset in the northeastern forest area, the increase in roughness and the displacement of the zero plane jointly leads to an enhancement of the momentum sink at a height of 20 m, so the improvement value of MRE at 20 m is as high as 0.141. This correction enables the simulation process to more realistically reproduce the reduction effect of the canopy interior and edge on turbulence. In addition, the influence of LAI correction on wind speed simulation is most significant below the 20 m height. This is because LAI is closely related to the density of the vegetation and the configuration of the canopy; above the canopy height, LAI cannot directly exert its effect but indirectly affects the high-altitude wind field situation through the vertical transfer of turbulent shear stress, and the energy gradually dissipates as the height increases.
All three simulation cases confirmed the effectiveness of local LAI correction, demonstrating that incorporating site-specific canopy data significantly enhances wind speed simulation accuracy. In contrast, the improvement in wind direction accuracy was relatively modest: the Mean Root Square Error (MRE) at 20 m decreased by only 0.023, representing a 28% accuracy gain; however, the accuracy changes at 50 m and 100 m were negligible. The reason for this is that the turbulence intensity in the forest area is relatively high, and the turbulence structure is complex. The thermal effects in the forest area (such as surface heating and cooling) influence the formation of the local wind field, and there are systematic deviations in the large-scale background field.
In summary, this section answers the research question Q2 by comparing the simulation results before and after LAI correction: The correction of localLAIparameters effectively made up for the deficiency of the default data set in describing the characteristics of forest areas, reducing the wind speed error by 35% and improving the wind direction accuracy by 28%. The optimization effect was attenuated with the height and enhanced with the increase in LAI, which was due to the fact that LAI was a characteristic parameter describing the influence of the near-surface canopy on the wind field. After the local dynamic correction, the improvement effect is particularly prominent near the ground.
This indicates that the local adjustment of LAI should be included as a standard configuration in the WRF simulation of forest areas, especially for the simulations in the summer (when LAI is high). In the next stage, based on the corrected vegetation canopy characteristic parameters in this section, 3DVAR three-dimensional variational assimilation technology will be introduced. By integrating multi-source conventional meteorological observation data, the system deviation will be improved, and the deep optimization of the temporal and spatial evolution process of the forest area wind field will be achieved.
3.3. Results of 3DVAR Data Assimilation
The trial period was selected from 00:00 UTC on 1 May 2021 to 00:00 UTC on 1 June 2021, and from 00:00 UTC on 1 December 2020 to 00:00 UTC on 1 January 2021. Cold-start 24 h forecasts were conducted daily based on initial fields at 00:00 and 12:00. Using two sets of forecast fields simulated by the WRF model at the same time points, with forecast lead times of 12 h and 24 h, respectively, as sample data, the background error covariance B was subsequently calculated. A periodic assimilation strategy with six-hour intervals was employed, with the assimilation time window set to t ± 3 h. Observation data were read at 00:00, 06:00, 12, and 18 h (UTC). Two experimental configurations were designed: Case 4, a control experiment without data assimilation, and Case 5, an experiment with assimilation. Simulations were conducted for the winter period (17–19 December 2020) and the summer period (16–18 May 2021).
The results were compared to evaluate the effect of data assimilation on improving the accuracy of the simulated wind field.
Figure 5 and
Figure 6 illustrate the simulated wind speed results for the two seasons.
As shown in
Figure 5, both Case 4 and Case 5 reproduced the temporal evolution of the observed near-surface wind speed at three height levels. As height increased to 50 m, Case 5 exhibited significantly improved agreement with the measured data compared with Case 4, capturing both the magnitude and fluctuation characteristics of the observed wind speed. At 100 m height, the simulated curve nearly coincided with the observed curve, and the bias between Case 4 and the measurements was substantially reduced. The wind speed in the forested region generally increased between 00:00 and 12:00 (local time), reaching its daily maximum around noon.
By comparing assimilation with non-assimilation schemes in December versus May, it is evident that data assimilation significantly improves WRF’s temporal and amplitude corrections for wind speed simulations during the December period. The hourly comparison in
Figure 6 demonstrates that Assimilation Scenario 5 accurately captures extreme events’ timing and intensity at all three observation heights during the trough at 07:00 on 18 December and the peak at 13:00. In contrast, Non-Assimilation Scenario 4 exhibits significant temporal lag and amplitude deviation at 50 m, showing persistently low wind speeds starting at 12:00 on 18 December, maximum errors by 17:00, and peak errors exceeding 3 h. Combined with field observation records of sudden rainfall events, this indicates that short-term intense convective processes triggered by precipitation caused abrupt changes in near-surface wind fields. These effects were inadequately captured in the non-assimilated model, leading to increased deviations in Case 4. Therefore, meteorological observation assimilation data plays a corrective role in WRF model calculations.
Figure 7 shows the time evolution process of wind direction at 20 m, 50 m, and 100 meters during the observation and simulation data in May. The observation data indicates that the wind direction fluctuations throughout the simulation period exhibit significant non-stationary characteristics. Weather analysis suggests that these oscillations are mainly driven by convective activities accompanying precipitation events. Overall, the observation agreement of Case 5 is significantly better than that of Case 4. From 15:00 on May 16th to 00:00 on May 17th, a transition in the key wind direction from north wind to east wind was observed. However, Case 4 failed to timely capture this transition and exhibited significant phase lag and systematic undemrestimation at other times. In contrast, Case 5 accurately captured the physical characteristics and intensity changes in this directional shear throughout the time series and at various heights. This improvement fully demonstrates that by optimizing the 3DVAR assimilation method of initial conditions, the model can significantly improve its accuracy in representing large-scale circulation dynamics.
Furthermore, influenced by the underlying heterogeneous forest, the wind fields at 20 m and 50 m exhibit high-frequency fluctuation characteristics. Case 5 shows stronger sensitivity to these small-scale disturbances. Although both schemes tend to underestimate the intensity of the wind direction transition, the residuals of Case 5 are always lower than those of Case 4. For example, a sudden drop in directional wind speed occurred suddenly in the early morning of 17 May. In conclusion, these results verify that the 3DVAR assimilation method not only constrains the background field but also constrains the stable boundary layer wind field structure through dynamic balance constraints.
Figure 8 further demonstrates that at 20 m, 50 m, and 100 m, the assimilated simulations (Case 5) yielded wind direction results consistently closer to the observations, with enhanced temporal stability. The results show that during this period, the wind direction in the forest area mainly concentrated between 250° and 350°, exhibiting a stable west–northwest prevailing wind characteristic. Compared with Case 4 (without assimilation scheme), Case 5 (3DVAR assimilation) has a more advantageous phase control in the reporting stage and the period of intense fluctuations. For example, around 15:00 on 17 December, the measured wind direction showed a significant deflection towards the northwest. Case 5 was able to more quickly fit this evolution trend and reduce the accumulation of initial field deviations.
The quantitative error analysis of wind speed and direction simulations at various altitudes in May 2021 and December 2020, after incorporating the WRF-3DVAR assimilation module, is presented in
Table 6.
Table 6 presents a quantitative comparison of the simulation errors of wind speed and wind direction at different heights between May 2021 and December 2020. In May, the correlation coefficients of wind speed at all heights exceeded 0.7. The assimilation significantly improved the accuracy of wind direction simulation within the boundary layer, especially in high-altitude areas, where the effect was more pronounced. This improvement stemmed from the fact that the upper-level flow in the boundary layer was mainly dominated by pressure gradients and Coriolis forces, while surface friction and thermal disturbances were relatively weak. Therefore, the WRF model’s wind direction calculation based on statics was more accurate. In December, the assimilation raised the correlation coefficient of wind speed at 20 m to 0.8 and the correlation coefficient of wind direction to 0.75. The improvement in winter was weaker than that in summer, and this seasonal difference reflected the influence of the cold climate characteristics of the region: the strong temperature inversion and stable boundary layer in winter suppressed turbulence and enhanced local terrain and thermal forcing effects. Overall, the assimilation effects in both May and December were favorable. The reason is that the boundary layer is more active in the growth season of May, and the vertical mixing caused by the forest canopy is significantly enhanced. At this time, the B matrix shows a wider vertical correlation scale, and the observation increment can effectively propagate upward from the canopy top to the upper part of the boundary layer. In the leaf-fall period of December, affected by strong radiation cooling, stable inversion layers frequently occur in the near-surface layer. The increase in stratification stability leads to a contraction of the vertical characteristic scale of the B matrix, significantly improving the physical adaptability of the assimilation system on the complex forest surface. Additionally, snow cover would change the surface albedo and roughness, and the current assimilation configuration did not explicitly consider the land surface with snow cover.
Compared with the studies by Cao et al. [
3] which only focused on the localization of LAI, the various evaluation indicators for each height in this study have shown significant improvements. This verifies the effectiveness of the multi-source optimization framework that combines PBL optimization, LAI localization, and data assimilation, and can effectively enhance the simulation accuracy. The simulation results of this model provide a more reliable basis for high-resolution wind resource assessment in forest areas. Moreover, uncertainties exist within the analysis domain itself and the model configuration. For instance, snow cover alters surface albedo and roughness, yet the current assimilation setup does not explicitly account for snow-covered terrain. Consequently, these uncertainties propagate into subsequent wind speed and direction forecasts. Future work should seek methods to improve this process, thereby better quantifying and constraining such uncertainty propagation.
Overall, this section answers the research question Q3 by comparing the simulation results of assimilated and non-assimilated schemes: The 3DVAR assimilation significantly reduces the uncertainty of the low-level wind field, and shows better performance in the inertial subregion with more uniform physical conditions, which makes the correlation coefficient of wind speed up to 0.869 and the wind direction speed up to 0.813, confirming the effectiveness of constructing different B matrices for different seasons.
It is worth noting that the operational steps of the assimilation scheme do not depend on a specific location. As long as the study area has basic meteorological observation data, it can be replicated and implemented. This provides a direct applicable technical route for other forest areas. Moreover, for regions with obvious seasonal changes, using a single B matrix will introduce additional errors. Therefore, when applying 3DVAR assimilation in other forest areas, other forest areas should independently construct different-season B matrices based on the historical simulation field to more accurately reflect the error characteristics of flow-dependent effects. Furthermore, the physical constraint observation operator constructed in this study significantly improves the assimilation accuracy of near-surface wind speed by explicitly expressing the momentum sink effect of the canopy. This improvement idea is not only applicable to forest areas but can also be extended to other underlying surfaces with obvious canopy effects (such as urban canopies, high-stature crop areas), providing a method reference for data assimilation of complex underlying surfaces.
3.4. Results of Wind Energy Assessment
Based on the optimized parameterization scheme (YSU), the locally corrected canopy parameter (LAI), and the 3DVAR-assimilated data, wind power density was calculated for different simulation periods. The conventional WRF configuration was denoted as TWRF, while the optimized WRF configuration incorporating parameterization optimization, LAI correction, and data assimilation was denoted as OWRF, and both were used for comparative wind energy evaluations.
Figure 9 presents the wind power density at different heights for May 2021. The results exhibit pronounced vertical spatial heterogeneity, providing essential data for analyzing the vertical gradient of wind energy resources in the forested region. At the 20 m height, the overall trend of wind power density simulated by the optimized OWRF scheme aligns more closely with the measured values compared with the traditional TWRF scheme. Between 20:00 on 17 May and 03:00 on the following day, the wind power density at the meteorological tower site was relatively high, reaching two distinct peaks at approximately 60 W/m
2.
The underlying reason is that at lower altitudes, the forest canopy coverage is high, and the terrain exhibits significant undulations, resulting in enhanced surface friction and stronger influence of local circulation, which in turn lead to lower wind power density near the surface. The OWRF scheme effectively accounts for the influence of near-surface canopy characteristics on the wind field, enabling the model to accurately reproduce the timing and magnitude of the observed wind power peaks. In contrast, the traditional WRF scheme fails to capture the physical features of wind field variability near the surface, resulting in weaker performance in reproducing the temporal evolution and peak characteristics of wind power density.
Similarly, analysis of the wind power density at 50 m and 100 m (
Figure 9) indicates that the maximum values reach 79 W/m
2 and 290 W/m
2, respectively. Aside from these peaks, wind power density generally increases with height, indicating that the influence of terrain roughness and vegetation friction diminishes at higher elevations, leading to higher overall wind power density and richer wind resources. The modulating effect of terrain on the wind field exhibits a clear vertical attenuation, while atmospheric processes gradually dominate the wind field structure.
At both 50 m and 100 m heights, the two experiments reproduce the temporal variations in observed wind power density with good agreement in the overall trend; however, OWRF successfully captured two power density peaks appearing in the afternoon of 17 May, whereas TWRF produced only one peak and exhibited larger simulation errors. This improvement is attributed to the combined optimization of the YSU planetary boundary layer scheme and data assimilation, confirming that OWRF demonstrates superior overall performance.
Figure 10 shows that in December 2020, the spatial heterogeneity of wind power density with height remains significant. At 20 m height, wind power density ranges from 30 to 130 W/m
2, slightly higher than in May. For the extreme wind power values, the simulation error of OWRF is 10 W/m
2, compared with 72 W/m
2 for TWRF, while the overall RMSE of OWRF is 9 W/m
2 lower than that of TWRF. At 50–100 m heights, wind power density increases substantially, expanding across 300–450 W/m
2, indicating a much greater availability of wind energy than in May.
Overall, the OWRF simulation results agree more closely with the observations and exhibit larger wind power density maxima than in spring. This is primarily due to the strong northwesterly winds generated by the pressure gradient between the Siberian High and the North Pacific Low during winter. The sparser winter canopy also reduces surface roughness, weakening the terrain-induced dynamic forcing and thus increasing overall wind speeds and power density. As altitude increases, the resistance effects of vegetation and topography are further reduced, resulting in higher wind power density and more abundant wind resources.
To further explore the reasons for the differences in wind power generation density, the Weibull distribution function was adopted for fitting and parameter analysis. The results are shown in
Table 7.
In spring, the c values of OWRF at various heights were closer to the observed values than those of TWRF. Taking 50 m as an example, the c value of TWRF was too high, resulting in a severe overestimation of its peak wind power density; while OWRF, through the YSU scheme to optimize the local turbulent mixing process and combined with LAI to accurately depict the drag effect of the forest canopy, reduced the c value error to 24%, and the corresponding wind power density was more in line with the observation. This indicates that the accurate simulation of the mean wind speed by the OWRF scheme is the basis for improving the accuracy of wind power density estimation. In winter, at a height of 20 m, the k value of OWRF was slightly higher than the observed value, indicating that the simulated wind speed distribution was more concentrated than the measured one. This may be related to the fact that the YSU scheme still has certain deviations in depicting turbulent mixing in the near-ground layer under stable stratification; at a height of 100 m, the c value of OWRF was almost the same as the observation, while TWRF was 11% higher. This shows that the optimized OWRF effectively constrained the large-scale circulation background and accurately reproduced the high-altitude wind speed characteristics under the strong northwest wind in winter, explaining the high agreement between OWRF and the observation at the high-altitude wind power density in
Figure 10. Overall, the OWRF scheme enhances low-level simulation accuracy by optimizing near-surface physical process descriptions, while effectively suppressing error propagation to higher layers, thereby systematically improving the reliability of upper-level wind energy assessment.
In conclusion, the Weibull parameter analysis confirmed that the OWRF scheme improved the simulation ability of the mean wind speed and distribution characteristics through the YSU scheme to optimize the turbulent process, LAI to correct the canopy drag effect, and data assimilation to constrain the large-scale background field, effectively improving the accuracy of wind power density estimation. The accurate characterization of the scale parameter c is the core advantage of the optimization scheme.
The calculations for TWRF, OWRF and the AEP and CF of the observed values at a height of 100 m yield an annual actual power generation total of 5668.2 MWH, with a capacity factor of 25.88%. The AEP and CF of TWR are 11,660.8 MWH and 53.25% respectively, while those of OWRF are 6365.7 MWH and 29.07% respectively. It can be seen that the default TWRF model leads to a significant distortion in the assessment of wind energy potential, and the annual power generation assessment value is overestimated by 105.7% (confidence level = 53.25%). This deviation mainly stems from the neglect of canopy resistance in forest areas, especially being more pronounced during the growing season. In contrast, the OWRF model that integrates dynamic leaf area index (LAI) and three-dimensional variational (3DVAR) data can significantly improve the consistency between the assessment results and the observed data. The annual AEP error is significantly reduced to 12.3% (confidence level = 29.07%). Research on the Romanian Carpathians [
17] also indicates that vegetation significantly alters the temporal and spatial distribution of wind energy potential. The dynamic changes in vegetation driven by land use changes may have a drastic impact on the wind energy potential of complex terrain. This also suggests that the assessment of wind energy in forest areas must take into account the seasonal dynamics and long-term changes in canopy characteristics. The significant improvement in December data (error of only 4.2%) indicates that OWRF can accurately capture the stable wind energy potential during the dormant season, providing a scientific basis for seasonal energy scheduling in high-latitude forest areas.
By conducting error propagation analysis to quantify the uncertainty of the WPD assessment, it can be calculated that the relative uncertainty of the optimized OWRF scheme at a height of 20 m in spring decreased from 46.80% to 14.06%, and as the height increased, the uncertainty decreased. This indicates that with the weakening of the influence of complex canopy turbulence, the model stability was enhanced. In winter, it decreased from 30.61% to 9.06%, effectively verifying that the OWRF scheme can effectively improve the reliability of wind energy assessment.
Based on the wind field simulation results after multi-source optimization, this section calculates the wind power density, Weibull parameters, etc., and fully answers the research question Q4: The multi-source optimization can comprehensively improve the key indicators of wind energy assessment, especially in May, the c value at the height of 50 m is optimized by 78.6% compared with the traditional WRF model, and the MRE at the height of 20 m is reduced by 35%, which is relatively limited. This may be because the wind field at this height is affected by multiple factors such as terrain turbulence, thermal drive and local circulation in addition to canopy drag. In general, the results successfully verify that the optimization method can improve the accuracy of wind energy resource assessment and has engineering value.
4. Conclusions
To address the accuracy limitations of traditional WRF models in simulating wind fields in forested areas, this study proposes a multi-source optimization method that integrates localized canopy parameters (LAI) with 3DVAR data assimilation. This approach significantly improves wind field reproduction capabilities in complex terrain forested areas. Furthermore, it reveals the spatiotemporal distribution patterns of wind energy resources, offering practical reference value. The main research conclusions are summarized in three key points.
(1) Through comparative analysis of various planetary boundary layer parameterization schemes, it was found that the YSU scheme significantly improves the accuracy of forest wind field simulation results compared to MYNN and MYJ schemes. Specifically, the non-local YSU scheme performs best in the spring (with unstable stratification), while the local MYJ scheme has a slight advantage near the ground in the winter (with stable stratification). This pattern stems from the physical nature of different closed schemes—the non-local scheme takes into account the full-layer mixing of thermal large-scale vortices through the anti-gradient term, making it more suitable for convective active unstable stratification; the local scheme assumes that turbulence is determined solely by local gradients and can better maintain the inversion structure under stable stratification.
(2) The forest canopy structure plays a crucial role in numerical simulations of near-surface wind fields in forested areas. Based on measured LAI-2200 data, after applying localized corrections to the leaf area index (LAI) data representing local forest canopy characteristics on a decadal scale, the average relative error of near-surface wind speed decreased by 35%, while wind direction simulation accuracy improved by approximately 28%, significantly enhancing the precision of simulation results. This indicates that the local adjustment of LAI can effectively reduce the systematic deviation in the simulation of near-surface wind speed, and the improvement effect is more obvious in summer (with high LAI). This finding validates the momentum absorption effect of the forest canopy on the wind field and its seasonal variation characteristics. The physical mechanism—the canopy affects the near-surface wind profile by increasing surface roughness and momentum sink—is universal. Therefore, any WRF simulation of forest areas should consider the seasonal dynamics of LAI instead of using fixed default values of the climatic state.
(3) By further integrating the WRF-3DVAR data assimilation method into the WRF model and comparing it with the traditional WRF model without assimilation data, the results showed that the WRF-3DVAR assimilation method demonstrated better fitting performance in wind energy assessment during both winter and summer. Specifically, the improvement in wind speed simulation accuracy is relatively limited in winter, while the enhancement is more significant in summer. The correlation coefficient R value for wind direction can be increased to the highest of 0.813, and the R value for wind speed can be increased to the highest of 0.869. The improvement is most significant in the surface layer, which is most directly affected by the canopy, and naturally decreases with the increase in height. In summer, when the canopy closure is high, the improvement is better than in winter, when the canopy is sparse. This vertical distribution feature just verifies that the multi-source optimization method in this study is successful in characterizing the forest canopy momentum absorption effect, and the model accurately captures the physical mechanism of the canopy as a momentum sink, so that the improvement effect is concentrated in the near stratum where the physical process is the most complex. In addition, the strategy of constructing the background error covariance matrix B by season is confirmed to be effective, and the simulation results in May and December are highly consistent with the observations, which provides a more scientific reference for the assessment of wind energy in forest areas. It provides a more scientific reference basis for actual wind energy assessment and development, offering insights for further optimization of forest area wind field numerical simulation research.
(4) The wind power density at 100 m altitude in the Maroershan can reach 475 W/m2, 38% higher than that in summer, indicating that the forest area of Maoershan has stable and exploitable wind energy resources. The vertical wind energy stability also shows marked improvement, with resource value increasing with altitude. The area proves particularly valuable for wind energy development at elevations above 100 m.
Research findings demonstrate that the WRF optimization approach integrating local LAI correction with data assimilation significantly enhances the accuracy of near-surface wind field simulations in forested areas. This advancement provides scientific support for wind energy development, wind turbine deployment, and windbreak engineering in forest regions. It also improves the applicability of the WRF model under complex vegetation cover and non-flat terrain, offering valuable references for forestry meteorological research. This method enhances the applicability of the WRF model in complex vegetation coverage and non-flat terrain conditions, successfully forming a clear, replicable optimization method based on WRF simulations for detailed simulation of wind fields in forest areas. This method itself is not dependent on specific locations. For instance, the non-local YSU scheme performs best under unstable spring stratification, the local MYJ scheme has a slight advantage near the ground in stable winter stratification, but the simulation of the upper-level wind field still favors YSU; the local LAI correction should be included as a standard configuration for WRF simulations in forest areas, especially for simulations of high LAI in summer; and when conducting assimilation experiments, different seasons should be separately constructed B rather than using a single B matrix. The method of constructing an observation operator based on LAI and canopy parameters can be extended to other canopy substrates. These findings are not accidental results specific to a certain location. The research framework and physical laws can be generalized to wind field simulations and wind energy assessment studies in other forest areas. Future studies could further integrate seasonal variations and complex weather patterns to achieve more precise simulations of turbulent structures.
This study constructed a multi-source collaborative optimization framework integrating PBL scheme optimization, LAI local dynamic correction, and 3DVAR data assimilation. It systematically verified the core hypothesis: the multi-source optimization method can significantly improve the simulation accuracy of the wind field in forest areas, and the optimization effect follows clear physical laws. Regarding four research questions, the results show that (i) the YSU scheme performs best in spring unstable stratification and winter upper layers, while the MYJ scheme has a slight advantage in the near-ground layer but is not significantly different from the YSU scheme, revealing the seasonal and height patterns of the PBL scheme; (ii) local LAI correction reduces the error of near-ground layer wind speed by 35% and improves the wind direction accuracy by 28%, and the optimization effect decays with height and strengthens with increasing LAI. Local LAI correction should be regarded as a standard configuration for forest WRF simulation, especially suitable for summer simulations with high LAI values; (iii) 3DVAR assimilation achieves the highest wind speed correlation coefficient of 0.869 and wind direction of 0.813, confirming the necessity of the season-specific B matrix, and it should be constructed separately for different seasons instead of using a single B matrix; (iv) multi-source optimization significantly improves the wind energy evaluation indicators. The error of the c value at 50 m in spring is reduced by 78.6% compared to the traditional WRF model, verifying the engineering value of the multi-source optimization framework of this study for wind energy resource assessment. The core innovation of this study lies in clarifying the synergy effect of PBL optimization selection, LAI local dynamic correction, and 3DVAR assimilation, revealing the vertical and seasonal physical laws of wind field simulation in forest areas, enhancing the applicability of the WRF model in complex vegetation coverage and non-flat terrain conditions, providing scientific basis for forest wind power development, and offering a methodology that can be extended to other complex vegetation areas. Future research can further integrate seasonal variations and complex weather patterns to achieve more accurate simulation of turbulent structures.