Departure from Flux-Gradient Relation in the Planetary Boundary Layer

: It is well known that when eddies are small, the eddy ﬂuxes can be directly related to the mean vertical gradients, the so-called ﬂux-gradient relation, but such a relation becomes weaker the larger the coherent structures. Here, we show that this relation does not hold at heights relevant for wind energy applications. The ﬂux–gradient relation assumes that the angle ( β ) between the vector of vertical ﬂux of horizontal momentum and the vector of the mean vertical gradient of horizontal velocity is zero, i.e., these vectors are aligned. Our observations do not support this assumption, either onshore or offshore. Here, we present analyses of a misalignment between these vectors from a Doppler wind lidar observations and large-eddy simulations. We also use a real-time mesoscale model output for inter-comparison with the lidar-observed vertical proﬁles of wind speed, wind direction, momentum ﬂuxes, and the angle between the horizontal velocity vector and the momentum ﬂux vector up to 500 m, both offshore and onshore. The observations show this within the height range 100–500 m, β = − 18 ◦ offshore and β = − 12 ◦ onshore, on average. However, the large-eddy simulations show β ≈ 0 ◦ both offshore and onshore. We show that observed and mesoscale-simulated vertical proﬁles of mean wind speed and momentum ﬂuxes agree well; however, the mesoscale results signiﬁcantly deviate from the wind-turning observations. Our results show a bias between the (cid:104) u (cid:48) w (cid:48) (cid:105) vertical proﬁles derived from radial velocity measurements of a pulsed proﬁling lidar with those derived from the NEWA-WRF outputs. We assess the ﬁltering effect inherent to the lidar’s probe volume averaging by combining the Mann model and estimations of the length scale for both sites. The angle between the stress vector relative to the mean wind vector (i.e., α ) derived from the NEWA-WRF outputs agrees well with the lidar-derived values, both offshore and onshore.


Introduction
Numerical weather prediction models, such as the Weather Research and Forecast (WRF) mesoscale model [1], are widely used to simulate the atmosphere's dynamics. WRF outputs are continuously applied for different purposes and in wind energy, in particular, are the common source of wind climatologies at present. A number of atmospheric processes cannot be resolved at the mesoscale resolution, so we have to rely on model parameterizations to characterize turbulence within the planetary boundary layer (PBL). Hence, it is important to evaluate the ability of PBL parameterizations to reproduce the atmospheric processes they are intended for.
Some limitations in mesoscale models are inherent from parameterizations of the PBL. These models use a turbulence closure of the vertical transfer of momentum, which is responsible for well-known and long-standing biases related to enhanced diffusion and underestimation of the turning of the wind, especially under stably stratified conditions [2], as well as strongly baroclinic boundary layers [3]. Therefore, accurate observations of the wind and turbulence measures across the PBL, e.g., using remote sensing technologies, are valuable for the evaluation of PBL parameterizations commonly used in mesoscale models.
Ground-based Doppler wind lidars measure the wind speed with an accuracy comparable to traditional anemometry [4,5] and are able to reach heights of up to 2 km, depending on the aerosol concentration in the atmosphere. Measuring all heights simultaneously from a single instrument, e.g., with a pulsed lidar, removes uncertainties in wind-turning effects analyses when compared to the use of multiple instruments mounted on a meteorological mast. Such long-range profiling lidars can also be used to estimate second-order moments. Mann et al. [4] presented results with regards to the degree of turbulence attenuation originated from spatial averaging within the lidar measurement volume, while Sathe and Mann [6] provided an overview of methods used to derive turbulence from lidar measurements. Lidar measurements, therefore, provide a unique opportunity to evaluate the mesoscale models' ability to reproduce turbulence characteristics and to improve the suite of parametrizations that such models offer.
A number of PBL parameterizations in the WRF model are based on a flux-gradient relation, in which the eddy fluxes are related to the mean vertical gradients of velocity where U and V, aligned with the geographical coordinates, represent the two horizontal mean wind components, u and v are fluctuations around their means, and represent an ensemble average. The eddy diffusivity or momentum exchange coefficient K m can be computed differently in PBL parameterizations. As the influence of the Coriolis force tends to increase in the atmosphere, a misalignment between the stress vector (− u w , − v w ) and the mean wind vector (U, V) is expected. Further, the flux-gradient relation might break down as a result of a misalignment between the stress vector and the vertical gradient of the mean wind vector (∂U/∂z, ∂V/∂z). Figure 1 illustrates these angles, hereafter referred to as α and β, respectively, and defined as (3) where atan2(x, y) is defined such that it provides the angle of the vector relative to the x-axis. Consequently, β = 0°is inherently assumed in numerical models when applying a flux-gradient relation to express the vertical transport of horizontal momentum. Berg et al. [7] observed vertical profiles of β, which were non-zero, using measurements from a profiling lidar up to 200 m at Høvsøre, a close-to-flat site in Denmark. However, as mentioned by the authors, the observed misalignment could have been a product of the surface heterogeneity at the site.
In a study focused on low-level jets over open-water fetch, Svensson et al. [8] showed a mismatch between lidar-observed and WRF-derived momentum flux projected onto the direction of the mean wind, where lidar-based second-order statistics were filtered due to the effect of the probe volume when compared to sonic anemometer measurements. However, they did not address the limitations of the WRF model with regards to the flux-gradient relation validity.
Using large-eddy simulation (LES) of the neutral, unstable, and stable over flat terrain, Berg et al. [7] found β values close to zero throughout the PBL. Kosović and Curry [9] performed an LES of the stable PBL and found a maximum of β ≈ −10 • , but they did not attempt to further investigate the misalignment (personal communication).
This work presents mean wind and momentum flux vertical profiles computed both at an offshore and an onshore location using observations from a long-range profiling Doppler wind lidar and the WRF model output from the New European Wind Atlas (NEWA) project (hereafter NEWA-WRF), which uses a PBL scheme where the momentum transport is parameterized with a flux-gradient relation [10]. The objectives herein are threefold: 1.
The comparison of onshore and offshore vertical profiles of α and β between lidar observations and the NEWA-WRF output; 2.
Analysis of the behavior of β as well as the wind direction bias between lidar observations and the NEWA-WRF output as a function of atmospheric stability; 3.
The evaluation of α and β profiles using idealized LESs under offshore-and onshorelike surface conditions.
In the preliminary study of Santos et al. [11], negative values of β were observed by analyzing offshore lidar measurements. The present work extends the work in Santos et al. [11] by assessing the validity of the flux-gradient relation, i.e., whether or not β approaches zero, using not only offshore, but also onshore, lidar measurements. Further, we include the analysis of the vertical profile of β using idealized LES with surface boundary conditions resembling onshore and offshore conditions. We also discuss the consequences of the departure from flux-gradient relation for, e.g., mesoscale models.

Site and Instrumentation
We present results from two measurement campaigns, one offshore and another onshore, where a long-range pulsed profiling Doppler lidar (WindCube WLS70 from Leosphere) was deployed next to meteorological masts for one-year measurement campaigns. Figure 2 shows the location of both sites: one at the FINO3 research platform (55°11.7 N, 7°9.5 E) and the other next to the Hamburg meteorological mast (53°31.2 N, 10°6.31 E).
The WLS70 measured the radial or line-of-sight velocity v r at four azimuthal positions separated 90 • , namely v r,N , v r,S , v r,E , v r,W . This conical scanning pattern is performed with an angle of φ = 14.67 • from the vertical. Pulses were sent to the atmosphere at each azimuth for more than 2 s and the wind vector was derived every ≈10 s every 50 m from 100 up to 2000 m above the lidar level. The lidar probe volume length, defined by the full width at half of the expected maximum (FWHM) of the lidar's along-beam sensitivity function, is ≈75 m [12]. Due to a persistent negative bias at 300 m, already noticed in previous work [11], this measurement level was removed from this analysis.
At FINO3, the lidar was placed at 24.5 m above mean sea level (amsl) and measured for a full year next to a 100-m tall meteorological mast, instrumented with cup anemometers at 30, 40, 50, 60, 70, 80 and 106 m. Hereafter, all heights are referred to amsl. Peña et al. [13] combined the first nine months of cup anemometers and lidar measurements to characterize the mean wind profile from 30 m up to 1000 m at the site.
At the onshore site, the lidar was measuring for a one-year campaign 170-m away from a 280-m high television tower located southeast of downtown Hamburg. The tower is instrumented with METEK USA-1 sonic anemometers from Metek GmbH, at 50, 110, 175 and 250 m. Brümmer et al. [14] showed a comprehensive introduction of the site and its wind climatology. Floors et al. [12] used these onshore lidar measurements, among others, to assess the influence of baroclinicity on the measured wind profile.

Data Selection and Filtering
The FINO3 dataset was selected based on the entire lidar measurement campaign, from September 2013 to October 2014. For the Hamburg site, we selected the first two months of measurements from 6 April 2011 to 2 June 2011 where the lidar was operating without breaks and properly aligned with the north.
For both sites, the data were filtered according to the carrier-to-noise ratio (CNR), which can be used as a quality measure of the v r estimation. Values with CNR higher than −29 dB are considered valid in this work. Only wind profiles with valid data at all heights up to 500 m were selected. In addition to the CNR threshold, data were excluded when the radial velocity greatly exceeds the median value over a 30 min period, i.e., |v r −ṽ r | > 6 m s −1 .
From the observations, we computed 30-min statistics throughout this work. From a sample of 10,926 30-min periods at FINO3, we applied the above filters to obtain 76.2% of valid data (8323) up to 500 m. At the Hamburg site, 2719 30-min periods were filtered so we obtained 40% of valid full profiles (1089).
At FINO3, high-frequency measurements from sonic anemometers were not available and so we could not directly compute the Obukhov length (L). Hence, we used the potential temperature difference as a proxy for atmospheric stability. The potential temperature difference is defined as ∆Θ = Θ 30m − Θ buoy , where Θ 30m is the potential temperature using concurrent air temperature measurements at 30 m and Θ buoy is the water temperature measurement at 6-m depth from a nearby buoy. The criterion for stability classification reads simply as ∆Θ < 0 K for unstable conditions and ∆Θ > 0 K for stable conditions. At Hamburg, sonic anemometer measurements at 50 m represent surface-layer conditions [12]. Therefore, we computed L using these measurements and defined stability regimes as |L| > 500 m representing neutral, 0 m ≤ L ≤ 500 m stable, and −500 m ≤ L < 0 m unstable conditions. For FINO3, the wind direction reconstructed from the lidar and a wind vane at 100 m had an offset of -11.7° [13]. Thus, we apply this offset in our computations. At Hamburg, the lidar was aligned with the north for the selected period, so no offset was applied.

Observed and Modeled Momentum Fluxes
Using observations from a profiling lidar, the momentum fluxes can be computed from the difference between the radial velocity variance σ(v r ) of lidar beams from two opposing azimuthal measurements. This was originally proposed by Eberhard et al. [15] and more recently applied by Mann et al. [4], where δ is the lidar offset mentioned above, defined by the angle between the v r,N laser beam and the north. For a detailed derivation, see Appendix B.
Here, we assume horizontal homogeneity of the wind field, as we combine radial wind speeds from a number of points which are separated by up to 261.8 m, at the 500-m height level. Therefore, we expect that the vertical profiles of momentum fluxes up to 500 m are better estimated using Equation (5) at the offshore than at the onshore site given the possible contamination from surface inhomogeneities at the latter site.
We choose to evaluate the NEWA-WRF outputs in this work, which were described in detail by Hahmann et al. [10]. These outputs offer a time series of a number of meteorological variables covering the period of both lidar campaigns, with a temporal and horizontal resolution of 30 min and 3 km, respectively, and seven vertical levels at 50, 75, 100, 150, 200, 250, and 500 m. A linear interpolation over the spatial domain was performed using the nearest neighbor grid cells to extract the time series at FINO3 and Hamburg locations. Hahmann et al. [10] compared the mean wind speed and wind direction profiles using several PBL schemes in the WRF model against observations from several meteorological masts, including FINO3. A further comparison of the NEWA-WRF outputs using more meteorological masts was performed by Dörenkämper et al. [16]. The results of these comparisons demonstrated that a modified version of the Mellor-Yamada Nakanishi-Niino (MYNN) scheme [17] resulted in the lowest wind speed bias at most sites.
The MYNN level 2.5 (MYNN2) is a local scheme, i.e., K m is derived from local quantities. We can thus estimate the local momentum fluxes using Equations (1) and (2) from the NEWA-WRF output by computing first K m as where l is a master length scale, q = √ 2e with e being the turbulent kinetic energy (TKE), which in MYNN2 is based on the prognostic TKE from Mellor-Yamada [18], and S is a stability function. Since l is not stored in the NEWA-WRF output, we derive it following the work of Nakanishi and Niino [17].

Idealized WRF-LES
Previous LESs under neutral, stable, and neutral PBLs performed with a surface condition typical of an onshore site showed β = 0 • [7]. We therefore performed two idealized LESs of a neutral PBL over a flat surface; one over water, i.e., a roughness length of 0.0002 m, and the other over land, i.e., a roughness length of 0.65 m, which is the value found over the urban sector at the Hamburg tower [19]. The LESs were performed using the WRF model (version 4.1.2) with a single domain of 7500 m × 7500 m × 2000 m in the two horizontal and vertical directions, respectively, with a horizontal resolution of 15 m and a Coriolis parameter correspondent to the latitude of the FINO3 site and the Hamburg tower for the offshore and onshore simulations, respectively. The vertical grid spacing of ≈5 m was kept constant up to 250 m and stretched out thereafter. We used a time-step of 0.2 s and the subgrid-scale model of Deardorff [20].
The simulations were performed for a dry atmosphere. The initial potential temperature was kept constant (289.5 K) up to 700 m, where we imposed an inversion of 10 K km −1 upwards. At all vertical levels, the initial u and v velocities were set to 11 and 0 m s −1 , respectively, at FINO3 and to 10 and 0 m s −1 , respectively, at Hamburg to try to match the highest observed values at both sites. MOST was applied at the surface via the WRF-in-built surface-layer scheme. The LES was run for 12 h, a heat flux of 0 K m −1 was imposed at the surface, and periodic boundary conditions were applied.
The relevant parameters within the whole domain were output every 10 s within the period 10.5-11.5 h because we found the highest velocity for both simulations within this hour. The statistics were computed over this one-hour period (such as means, variances, and covariances) and then spatially averaged over the whole domain.
The total momentum fluxes were computed by adding the subgrid to the resolved parts. Note that the coordinate system, in this case, is relative to the wind direction at the PBL top.

Results
The FINO3 and Hamburg sites are only separated by 267 km, hence they are influenced by the same large-scale weather patterns. At both sites, westerly winds are prevalent [13,14]; therefore, we select wind direction sectors of 270°± 45°and 275°± 45°f or FINO3 and Hamburg, respectively, using lidar measurements at 100 m as reference for the wind direction. At FINO3, this sector represents one with a several-hundred-kilometer fetch, whereas at Hamburg, this is the 'urban' sector, which is characterized by industrial buildings and warehouses [14]. The selected sectors represent ≈50% of the amount of winds at FINO3 and ≈45% for Hamburg. A sensitivity analysis on the size of the wind sector was performed (not shown) and we did not find a significant difference in the characteristics of vertical profiles of wind speed, wind direction and momentum fluxes.

Bias on Wind Turning
When comparing the vertical profiles of horizontal wind components at FINO3 (Figure 3a), the NEWA-WRF wind veers (turns clockwise) with height in a lesser degree than what is observed. This underestimation of the turning of the wind is familiar in numerical weather models [3] and was previously observed using the instrument from this work [21]. The wind turning is also not properly simulated at the Hamburg site ( Figure 4a); although the simulated zonal wind component of NEWA-WRF agrees well with the measurements, there is a difference in the vertical profiles of the meridional wind component. As previously mentioned, the lidar's ability to measure all vertical levels at once makes it an ideal instrument to quantify such biases. Figure 5 shows the vertical profiles of the wind direction bias between the lidar and NEWA-WRF for stable and unstable conditions at both sites. NEWA-WRF consistently underestimates the observed wind veer, with the highest bias under stable conditions, as expected [3]. The difference between onshore ( Figure 5a) and offshore (Figure 5b) conditions is that, over water, the wind direction bias is small (≈1 • ) at 100 m and increases with height. Contrarily, over the urban fetch the bias is quite significant (>5 • ) at 100 m and decreases towards the PBL top, where it reaches similar values to those at offshore conditions. The wind turning bias behavior under stable conditions agrees with previous results over offshore conditions; Simpson et al. [22] argued that the PBL schemes generally enhance turbulence mixing under stable conditions, which results in an underestimation of both the wind shear and wind turning. One possible reason for the wind direction bias at both sites is systematic mismatch between the simulated and actual stability condition. NEWA-WRF also outputs L, so we can compare the concurrent simulated and observed stability. At FINO3, unstable conditions represent 85.5%, based on the observed potential temperature gradient for all wind sectors. By comparing the stability measures of simulated and observed wind profiles, we find that stable conditions are correctly simulated 76% of the time, whereas 90% of the observed unstable profiles are also unstable based on the model output.
Under the urban sector at Hamburg, neutral conditions are predominant, as discussed by Gryning et al. [19]. For all stability conditions at Hamburg, the estimated PBL height from NEWA-WRF is above the measured profiles. When L computed by the sonic anemometer is compared to the output from NEWA-WRF, we find that 60% of the observed stable profiles are correctly simulated as such, whereas the number increases to 94% for the observed unstable profiles.

Momentum Fluxes and Lidar's Turbulence Attenuation
For the analyzed westerly winds, both the observed and NEWA-WRF vertical profiles of momentum flux (Figures 3b and 4b) show v w values close to zero and u w values decreasing in magnitude almost linearly with height. However, at both sites, there is an important mismatch between the lidar observations and NEWA-WRF for u w . This could be caused either by the computations involving Equation (6) or turbulence attenuation due to the lidar's spatial averaging (or both).
For the analyzed period, the observed and NEWA-WRF vertical profiles of velocity gradients are rather similar (not shown). Therefore, if the bias comes mainly from the NEWA-WRF momentum fluxes, then our estimate of K m using Equation (6) could be flawed. However, we know that the lidar-derived momentum fluxes have an inherent bias due to spatial averaging along the lidar beam [4].
To gain insights into the origin of this bias, we estimate the lidar's turbulence attenuation under neutral conditions using the turbulence spectral model of Mann [23] (hereafter, the Mann model), with a turbulent eddy-lifetime parameter Γ = 3.9, as recommended by the IEC standard [24]. Γ describes the degree of anisotropy of turbulence; Γ = 0 corresponds to fully isotropic turbulence. Figure 6 shows the ratio of the filtered to the unfiltered momentum flux u w f / u w u as a function of the ratio of the lidar's probe volume (i.e., the FWHM) to the turbulence length scale (L MM ). L MM is another parameter of the Mann model.
The black full line in Figure 6 shows the prediction of the ratio u w f / u w u using the Mann model. For a detailed description of the calculations, refer to Mann et al. [4]. For the WLS70 used in this study, FWHM = 75 m, so we only need to know the value of L MM to compute the degree of filtering. We can estimate the turbulence length scale using the approximation by Kelly [25], where σ U h is the standard deviation of the horizontal velocity and ∂U h /∂z is the horizontal wind shear, which is computed here using the polynomial of U h suggested by Högström [26]. We account for the observations at all vertical levels of the masts. The length scale at 100 m is computed as a linear interpolation between the closest measurements at each mast. The dashed lines in Figure 6 show that the turbulence length scale estimated at FINO3 is larger than that at Hamburg, which results in a larger attenuation of the momentum flux at Hamburg (74.4%) compared to that at FINO3 (83.4%). As turbulence length scales depend on atmospheric stability [27], we expect that the dominant unstable conditions at FINO3 result in larger turbulence length scales compared to those at Hamburg where neutral conditions are more often observed.
By compensating the lidar-observed u w values at 100 m with the results in Figure 6, we find a better agreement between the lidar-and NEWA-WRF derived values. However, its worth noticing that the NEWA-WRF momentum fluxes are parameterized from Equation (6) so the simulated momentum fluxes do not necessarily have the same magnitude as the observations. Furthermore, the attenuation estimated in Figure 6 is subject to uncertainties from the spectral turbulence model, the FWHM value, and the L MM estimation (refer to Appendix A for the vertical profiles of estimated L MM at each site). Figures 3c and 4c show the vertical profiles of α and β for offshore and onshore conditions, respectively. As expected from the LESs of Berg et al. [7], α is close to zero near the surface and increases with height. Furthermore, the results using the NEWA-WRF output agree well with those of the lidar at both sites. However, note that a good agreement on α could occur due to a combination of biases in both wind direction and momentum fluxes.

The Angles of the Stress Vector with Mean Wind Vector (α) and with Mean Wind Shear Vector (β)
The most noticeable discrepancy between the lidar measurements and the NEWA-WRF output is on β , since we use flux-gradient relation to derive the eddy fluxes with the NEWA-WRF output. This theory assumes this misalignment is zero. The misalignment β for all heights is on average close to −18 • offshore and −12 • onshore. If we reduce the wind sector to ±30 • , the mean β for all heights is −17 • offshore and −9 • onshore, i.e., the result β < 0 is not sensitive to the wind sector at either of the sites. Here, the non-linear behavior of β with height, with a kink at 250 m, is attributed to faulty lidar measurements during both campaigns, as pointed out in Section 2.1.
The β profiles and, hence, the validity of the flux-gradient relation are sensitive to the vertical gradients of the mean wind. Therefore, we expect that PBL schemes using the assumption of a flux-gradient relation (such as MYNN2 and other local PBL schemes) show a better performance when the vertical mean wind gradients are more significant, i.e., under stable conditions [28]. Figure 7 shows the vertical profiles of β as a function of atmospheric stability for both sites. To avoid the spikes seen in Figures 3 and 4, the value at the third measurement height is the result of a linear interpolation. The vertical profiles of β , derived from the lidar observations, show that the misalignment can be as small as ≈−7°on average for offshore conditions and close to 0 • onshore, under stable conditions. Additionally, under stable conditions β approaches zero close to the PBL top. Although fluctuating, the β values under unstable conditions are always larger (in module) than those under stable conditions at both sites, as expected. Figure 8 shows the WRF-LES results with regard to the turning angles for a neutral PBL over a water-like and a land surface roughness. Over water and below 250 m (Figure 8a), α is negative, partly due to the higher impact of the subgrid-scale model on the u w term compared to the v w term. Within the range of 250-550 m, α is comparable to the observed values at FINO3. β is nearly 0 • within a large portion of the PBL and increases (in magnitude) with height at the vertical level where both velocity gradients and eddy fluxes are near zero. Within the range 40-200 m, β becomes slightly positive because of the combination of the impact of the subgrid-scale model on the eddy fluxes and the excessive vertical wind shear within the surface layer that the subgrid-scale model produces [29,30]. Over land (Figure 8b), α increases with height, is always positive, and is slightly higher than the observed values at the Hamburg tower. β is, as in the offshore case, nearly 0 • within the first 550 m. Due to the high roughness of the onshore case, the resolved terms are less impacted by the subgrid-scale model and so β does not show a strong departure from zero within the surface layer, although there is also excessive vertical wind shear.

Discussion and Conclusions
We extend the work from Santos et al. [11] and present novel lidar observations that show a clear misalignment between the vertical gradient of the mean wind vector and the stress vector, β, up to 500 m for both offshore and onshore conditions. The measurements were performed with a long-range Doppler wind lidar on the FINO3 offshore platform and close to the Hamburg weather mast. Such a misalignment is assumed to be zero by the PBL schemes that apply a flux-gradient relation and that are normally utilized within current numerical weather models and for wind resource assessment, e.g., the WRF model.
The observed β -values increase from about −5 • at 100 m to about −20 • at 500 m at both offshore and onshore sites. During stably stratified conditions, where larger vertical wind gradients are present, β is still negative but above −10 • and approaches zero at the PBL top. The β value has a larger magnitude for unstable compared to stable conditions at all heights for both sites, and this magnitude increases with height onshore. Hence, the basic assumption of the flux-gradient relation, which we utilize to derive the eddy fluxes from the NEWA-WRF outputs holds better under stable compared to unstable atmospheric conditions, as expected, due to the larger coherent structures in the latter.
Our results show a bias between the u w vertical profiles derived from radial velocity measurements of a pulsed profiling lidar with those derived from the NEWA-WRF outputs. We assess the filtering effect inherent to the lidar's probe volume averaging by combining the Mann model and estimations of the length scale for both sites. The angle between the stress vector relative to the mean wind vector (i.e., α) derived from the NEWA-WRF outputs agrees well with the lidar-derived values, both offshore and onshore.
This agreement is likely caused by a balance between the wind direction and momentum fluxes' biases.
We observe an underestimation of the wind-turning by the NEWA-WRF output at the offshore site and an opposite turning at the onshore site. We speculate that this is mainly due to a wrong surface roughness assignment (or wrong land-use description) at around Hamburg tower in NEWA-WRF. It is worth noting that the heterogeneous condition at the onshore site is far from the PBL scheme assumptions used to derive the flux-gradient relation. This challenge can be partly overcome by the use of three-dimensional PBL schemes [31].
The wind direction bias is larger (in magnitude) under stable atmospheric conditions for both sites. At the offshore site, the wind direction bias is small (≈1.5 • ) at 100 m and increases with height, whereas the bias decreases (in magnitude) with height at the onshore site. Sandu et al. [2] argued that such wind turning biases in mesoscale models are the product of artificial enhancement of the mixing by turbulence closures, which, in turn, modifies the Ekman pumping and has a direct impact on, e.g., the ability to simulate cloud formation mechanisms.
The idealized WRF-LESs, both onshore and offshore, also show β values close to 0 • within the bulk of the PBL, further demonstrating first, the need to investigate the conditions at which this misalignment occurs and, second, the need to better represent the PBL with numerical models. Note that the simulations of Berg et al. [7] and Kosović and Curry [9] used a different LES framework than our WRF-LES. A real-time WRF-LES, like that performed by Schalkwijk et al. [32], might be useful for further investigating, e.g., whether or not forcing conditions affect the behavior of β. Finally, an analysis of the impact of baroclinicity can also be performed with the lidar observations, since both barotropic and baroclinic conditions were included in the mean vertical profiles.   Figure A1 shows the vertical profiles of the estimated length scale L MM according to Equation (7) from observations at both sites. At the FINO3 site, where unstable conditions are predominant, the length scales decrease with increasing wind speed, partly due to the dominant neutral stability under high wind conditions. Contrarily, within the urban sector from the Hamburg tower, the length scale increases with increasing wind speeds, similarly to previous observations over flat and homogeneous land [33]. where a indicates the fluctuations around the mean. Subtracting these equations and using the trigonometric relation 2 sinφ cosφ = sin 2φ one gets Similarly, using the North and South beams the equation for v w can be obtained. If the beams are not perfectly aligned with the principal geographical directions but deviates with an angle δ then the wind in the East-North coordinate system can be expressed as Taking the fluctuations on both sides, multiplying with w and averaging we get a relation between the fluxes in the two systems. Combining that equation with Equation (A8) and the corrosponding equation for v w we finally get Equation (5).