Parametric, Semiparametric, and Semi-Nonparametric Estimates of the Kinetic Energy of Ordered Air Motion and Wind Outliers in the Atmospheric Boundary Layer from Minisodar Measurements

Simakhin, Valerii Anan’evich; Potekaev, Alexander Ivanovich; Cherepanov, Oleg Sergeevich; Shamanaeva, Liudmila Grigor’evna

doi:10.3390/app13106116


by Valerii Anan’evich Simakhin ¹, Alexander Ivanovich Potekaev ²,³,*, Oleg Sergeevich Cherepanov ¹ and Liudmila Grigor’evna Shamanaeva ²,⁴

¹ Department of Automated Systems Software, Kurgan State University, Kurgan 640000, Russia
² Faculty of Physics, Tomsk State University, Tomsk 634050, Russia
³ Tomsk Scientific Center of the Siberian Branch of the Russian Academy of Sciences, Tomsk 634055, Russia
⁴ V.E. Zuev Institute of Atmospheric Optics of the Siberian Branch of the Russian Academy of Sciences, Tomsk 634055, Russia
* Author to whom correspondence should be addressed.

Appl. Sci. 2023, 13(10), 6116; https://doi.org/10.3390/app13106116

Submission received: 12 April 2023 / Revised: 10 May 2023 / Accepted: 15 May 2023 / Published: 16 May 2023

(This article belongs to the Special Issue Advanced Observation for Geophysics, Climatology and Astronomy)



Featured Application

In the present work, the spatiotemporal dynamics of kinetic wind energy is analyzed with and without allowance for the kinetic energy of the outliers retrieved by postprocessing minisodar measurements of three wind velocity components and their variances. By wind outliers, we mean wind velocities, including wind gusts, whose distribution deviates from the prior distribution of the majority of observations. The minisodar data were processed using robust parametric, semiparametric, and semi-nonparametric algorithms developed by the authors. Allowing for the contribution of the wind outliers in the parametric estimates of the kinetic wind energy revealed its fine structure. This makes it possible to estimate the effect of the kinetic wind energy on the landing and takeoff of airplanes, light flying objects, high-rise buildings, and bridges, and to evaluate the energy potential of wind turbines.

Abstract

In the present work, we analyze the spatiotemporal dynamics of the kinetic wind energy with and without allowance for the kinetic energy of outliers. We first separated the contributions of the mean kinetic energy and the kinetic energy of the outliers and estimated the latter using robust parametric, semiparametric, and semi-nonparametric algorithms developed by the authors. The kinetic wind energy was estimated by the postprocessing of minisodar measurements of three wind velocity components and their variances in the lower 200 m layer of the atmosphere. By the outliers, we mean wind velocities, including wind gusts, the distribution of which deviates from the prior distribution of the majority of observations. A nonmonotonic increase in the kinetic energy of the outliers with sounding altitude was established. Physically, this can be explained by a nonmonotonic increase in the turbulent kinetic energy of local air vortices in the atmospheric boundary layer (ABL). The vertical extension of the outlier layers was of the order of 10–20 m.

Keywords:

atmospheric boundary layer; parametric; semiparametric; nonparametric estimates of the spatiotemporal dynamics of the kinetic energy of ordered air motion and outliers; minisodar measurements

1. Introduction

Knowledge of the flow disturbances in the boundary layer of the atmosphere is very important for the design of wind turbines, the landing and takeoff of airplanes, the design of bridges, etc. The kinetic energy of air mass motion in the ABL, $E_{Σ}$, is the sum of two components: the kinetic energy of the ordered motion $E_{M}$, associated with the mean wind velocity $\bar{V}$, and the turbulence kinetic energy $E_{T}$, associated with the wind velocity variance $σ^{2}$. The vector of the wind velocity components $V (x, t, z) = (V_{x}, V_{y}, V_{z})$ forms a non-stationary random process defined by n-dimensional distributions that depend on a great number of factors. Constructing a mathematical model of the wind velocity in the ABL based on these n-dimensional distributions is extremely difficult, if it is possible at all. Usually, simpler problems with fixed stable factors (locality, underlying surface, time, altitude) are considered, for which particular mathematical models are developed and solutions are obtained. For a fixed altitude, consider the quasi-stationary period T during which the wind velocity $V$ with distribution $F (ζ, \vec{θ})$ is represented as a superposition of the stationary process $V_{0}$, characterizing the stationary motion of air masses with the prior distribution $G (ζ, \vec{θ})$, and a small fraction ε of outliers with velocity $V_{out}$ and distribution $H (ζ)$. By the outliers, we mean any wind velocities, including wind gusts, whose distribution $H (ζ)$ deviates from the main wind velocity distribution $G (ζ, \vec{θ})$ during the period T [1,2,3]. For this model, the kinetic wind energy E in the ABL is the sum of two components, $E = E_{0} + E_{out}$, where $E_{0}$ is the kinetic energy of the reference stationary process, and $E_{out}$ is the kinetic energy of the outliers. The probabilistic problem formulation is defined by the prior information on the basic types of the three distributions $F (ζ, \vec{θ})$, $G (ζ, \vec{θ})$, and $H (ζ, \vec{θ})$.

The kinetic energy of the motion of air masses includes two contributions. The first is governed by the prior distribution $G (ζ, \vec{θ})$, which characterizes the stationary motion and is used to predict the wind energy potential of wind turbines. The second is governed by the distribution $H (ζ, \vec{θ})$, which characterizes the kinetic energy of wind outliers and is used to estimate the effect of wind outliers on objects in the ABL, for example, light drones, bridges, and high-rise buildings.

The appearance of Doppler acoustic radars (sodars) makes it possible to obtain information on the altitude profiles of wind velocity vector components with high spatial (up to several meters) and temporal resolution [4]. However, usually the outliers are detected and then eliminated from raw wind field data [5,6]. Thus, in [7], various outlier detection and elimination methods were analyzed, and the conclusion was drawn that in the previous works, outliers were simply removed and considered as missing data. In [8], a visual inspection method was used to eliminate the outliers. In this case, the information on the outliers and their contribution to the kinetic wind energy in the ABL was completely lost.

In the present work, we succeeded in analyzing the spatiotemporal dynamics of kinetic wind energy with and without allowance for the kinetic energy of outliers. We first represented the distribution $F (ζ, \vec{θ})$ as a mixture of the distributions $G (ζ, \vec{θ})$ and $H (ζ)$; we then separated the contributions of the mean kinetic energy and the kinetic energy of the outliers, and estimated the latter using robust parametric, semiparametric, and semi-nonparametric algorithms developed by us.

We studied the diurnal hourly dynamics of kinetic wind energy in the lower 200 m layer of the atmosphere based on minisodar measurements of the wind velocity vector that obeys the distribution $F (ζ, \vec{θ})$. Like all big data arrays, the minisodar measurements of altitude profiles of the wind velocity vectors have outliers and measurement errors. The integral estimate of the kinetic wind energy in the ABL, $E_{Σ}$, can be obtained from sample means and variances of the vertical profiles of the wind velocity vectors. To estimate the energy of the mean wind velocity, unbiased, consistent, and effective estimates of the parameters of the distribution $G (ζ, \vec{θ})$ should be obtained from the measured vertical profiles of the wind velocity vector with the distribution $F (ζ, \vec{θ})$. This can be achieved by using robust estimates of the parameters of the distribution $F (ζ, \vec{θ})$.

Robust statistics is an actively developing branch of mathematics [8,9]. At present, different methods (including subjective ones) of obtaining robust estimates are proposed based on the minimization of various robustness criteria. Thus, in works [9,10,11], the criteria of minimizing the maximum asymptotic bias and the maximum variance were proposed. The advantages and disadvantages of the criterion of minimizing the maximum entropy were analyzed in [6,9]. The criteria of the maximum robustness and of the maximum distance were used for the minimum distance (MD) estimates in works [11,12]. The adaptive estimators were studied in works [6,11]. For the shift (location) parameter alone, about 50 different robust estimates have been used, for example, those based on robust M-estimators [11,13], regression credibility [14], or robust linear regression models [15]. This variety is evidently caused by the fact that in robust statistics, there is no established robustness criterion, unlike the efficiency criterion in classical mathematical statistics [16].

Several problems remain open. First, the robustness and efficiency criteria turned out to be contradictory, which stimulated the application of robust estimates with intermediate characteristics (for example, the MD and Hellinger distance estimates [10,11] and the adaptive estimates [6,11]). Second, for the asymmetric outlier characteristics typical of the ABL [6,12,17,18], the robust estimates are biased and inconsistent [6,19]. Third, methods for obtaining robust estimates are aimed at the removal of external outliers distant from the bulk of observations, whereas the presence of internal outliers can lead to substantial errors in decision-making. All these problems stimulate the development of new robust, efficient algorithms for a wide class of outlier distributions, including asymmetric distributions and distributions with internal outliers. Such algorithms should adapt to the prior distributions. In the present work, robust algorithms are synthesized by the maximum likelihood method; they converge to the effective algorithms for the inhomogeneous minisodar measurement data.

Unfortunately, real measurement data usually do not fit the mathematical models on which mathematical statistics heavily relies. Their processing requires solutions of a number of mathematical problems by robust nonparametric statistical methods [6]. Indeed, the problems of studying the spatiotemporal dynamics of kinetic wind energy in the ABL depend significantly on prior information on the distributions $F (ζ, \vec{θ})$, $G (ζ, \vec{θ})$, and $H (ζ)$. They can be divided into different classes of problems of robust statistics [6]: parametric (when $F (ζ, \vec{θ})$, $G (ζ, \vec{θ})$, and $H (ζ)$ are parameterized to within unknown parameters $\vec{θ}$), semiparametric (when $F (ζ, \vec{θ})$ and $G (ζ, \vec{θ})$ are parameterized, and the form of $H (ζ)$ is unknown), and semi-nonparametric (when the forms of $F (ζ, \vec{θ})$, $G (ζ, \vec{θ})$, and $H (ζ)$ are unknown).

As to the prior distribution $G (ζ, \vec{θ})$, from physical considerations, most researchers think that it belongs to the class of symmetric normal-type distributions with light or medium tails [1,2,3]. The dynamics of the atmosphere in the ABL depends on many significant and not always measurable parameters. In this regard, the distribution $F (ζ, \vec{θ})$, representing a mixture of the distributions $G (ζ, \vec{θ})$ and $H (ζ)$, should be referred to the nonparametric class. Traditionally, this class of problems is of great interest to scientists and has a century-old solution history [20,21]. Typical robust problems belong to the class of semiparametric statistics, provided that the fraction ε and the distribution of the outliers $H (ζ)$ are unknown. In this case, the problem arises: how many sample observations, and from which side, should be trimmed to ensure the stability of the solutions obtained [12,17,20,21]. For example, symmetric trimming provides unbiased estimates for the symmetric distributions $G (ζ, \vec{θ})$ and eliminates remote outliers [12,17], but the application of these procedures for asymmetric internal outliers leads to biased and ineffective estimates [6].

The semi-nonparametric problems for the unknown forms of $F (ζ, \vec{θ})$, $G (ζ, \vec{θ})$, and $H (ζ)$ were not considered within the framework of robust statistics [6,12]. This is due to the fact that, for the unknown forms of the distributions $G (ζ, \vec{θ})$ and $H (ζ)$, the formal problem arises in the assessment of their differences. On the other hand, it should be noted that although the forms of the distributions of the majority of observations $G (ζ, \vec{θ})$ and of the outliers $H (ζ)$ are unknown, researchers usually implicitly use some additional prior information concerning these distributions (their semi-nonparametric forms) to distinguish them. For example, the information on the class of distributions $H (ζ)$ with external (internal) symmetric (asymmetric) outliers, and on the normal symmetric distributions $G (ζ, \vec{θ})$ with light or medium tails, is often implicitly implied and allows symmetric trimming to be performed at a level of $2 σ$. The wind outliers, including wind gusts, are investigated using the skewness and kurtosis of their distributions [17,18]. This additional information on the distributions should be normalized and taken into account [22,23] to synthesize robust semi-nonparametric algorithms [12]. Hence, to estimate the kinetic wind energy components in the ABL from inhomogeneous data samples with the distribution $F (ζ, \vec{θ})$, effective robust algorithms for the processing of experimental data with different levels of prior uncertainty should be developed.

In the present work, new robust parametric, semiparametric, and semi-nonparametric algorithms of the weighted maximum likelihood method are used to process experimental data with different levels of a priori statistical uncertainty to estimate the kinetic wind energy components in the ABL. The robust estimates of the total kinetic wind energy with and without allowance for the contribution of the outliers and of their difference equal to the kinetic wind energy of the outliers are obtained by postprocessing the measurements with an AV4000 minisodar [4].

The problem solved in the present work is formulated in Section 2 below. Section 3 describes the robust parametric algorithms and their application for estimating kinetic wind energy with and without allowance for the contribution of the outliers. In Section 4, the robust semiparametric algorithms are described and the results of their application are given. The kinetic wind energy with allowance for the contribution of the outliers and its mean and the turbulence components are presented together with the kinetic energy of the outliers retrieved by postprocessing of minisodar measurements. In the Conclusion, the main obtained results are given.

2. Problem Formulation

The kinetic wind energy in the ABL, $E_{Σ} = m V^{2} / 2$, is determined by the energy of the motion of air masses, that is, the wind energy. Below, we present the kinetic energy in the ABL reduced to unit air mass, $E = E_{Σ} / m$, measured in m²/s² (m²/s² = J/kg) [24]. The regularities in the spatiotemporal behavior of the reduced kinetic energy carry over fully to the total kinetic energy. For this reason, we use the term kinetic energy for the kinetic energy per unit air mass. It is equal to the sum of two components: the mean kinetic energy E_M, associated with the mean wind velocity $\bar{V}$, and the turbulence kinetic energy E_T, associated with the wind velocity variance $σ^{2}$. Following [24], we can write:

\begin{matrix} E = E_{M} + E_{T} = \frac{1}{2} ({\bar{V}}^{2} + σ^{2}), \\ E_{M} = \frac{1}{2} ({\bar{V}}_{x}^{2} + {\bar{V}}_{y}^{2} + {\bar{V}}_{z}^{2}), \\ E_{T} = \frac{1}{2} (σ_{x}^{2} + σ_{y}^{2} + σ_{z}^{2}), \end{matrix}

(1)

where ${\bar{V}}_{x}$, ${\bar{V}}_{y}$, and ${\bar{V}}_{z}$ are the mean values of the x, y, and z components of the wind velocity, and $σ_{x}^{2}$, $σ_{y}^{2}$, and $σ_{z}^{2}$ are their variances. The wind vector $V (x, t, z) = (V_{x}, V_{y}, V_{z})$ is a non-stationary random process. At a fixed altitude z, we consider the quasi-stationary interval $T$, and for $t \in T$, represent the process $V (x, t, z)$ as a superposition of the process $V_{0}$, stationary in a broad sense, with the prior distribution $G (ζ, \vec{θ})$ characterizing the ordered movement of air masses, and the fraction ε of the outliers $V_{out}$ with the distribution $H (ζ)$ characterizing the inhomogeneous movement of air masses. As a convenient probabilistic model of the real distributions $F (ζ, \vec{θ})$ used for studying the ABL, we take the mixture of the distributions $G (ζ, \vec{θ})$ and $H (ζ)$ that satisfies the regularity conditions. We consider that these models approximately coincide with the prior distribution $G (ζ, \vec{θ})$ characterizing the ordered movement of air masses for $t \in T$ [1]. For this model, the energy of movement of air masses in the ABL, E, is the sum of two components, $E = E_{0} + E_{out}$, where $E_{0}$ is the energy of the stationary process, and $E_{out}$ is the energy of the outliers. With an allowance for Formula (1), we can write $E_{M 0} = \frac{1}{2} {({\bar{V}}_{0})}^{2}$, $E_{T 0} = \frac{1}{2} {(σ_{0})}^{2}$, $E_{M out} = \frac{1}{2} {({\bar{V}}_{out})}^{2}$, and $E_{T out} = \frac{1}{2} {(σ_{out})}^{2}$.
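As an illustration of Formula (1), the following minimal sketch computes the mean, turbulence, and total kinetic energy per unit mass from a set of wind velocity samples at one altitude. The function name, the array layout (N rows of three components), and the synthetic data are assumptions made for this example only; they are not taken from the minisodar processing chain.

```python
import numpy as np

def kinetic_energy_per_unit_mass(v):
    """Return (E_M, E_T, E) in m^2/s^2 for samples v of shape (N, 3).

    Each row of v holds one sample of (V_x, V_y, V_z) in m/s for a single
    range gate and a single averaging interval (hypothetical layout).
    """
    v = np.asarray(v, dtype=float)
    mean = v.mean(axis=0)            # mean components of the wind velocity
    var = v.var(axis=0)              # component variances (ddof=0)
    e_m = 0.5 * np.sum(mean ** 2)    # mean kinetic energy E_M
    e_t = 0.5 * np.sum(var)          # turbulence kinetic energy E_T
    return e_m, e_t, e_m + e_t

# Synthetic samples (an assumption): 150 profiles at one gate.
rng = np.random.default_rng(0)
samples = rng.normal(loc=[3.0, 1.0, 0.1], scale=[1.0, 0.8, 0.3], size=(150, 3))
e_m, e_t, e = kinetic_energy_per_unit_mass(samples)
print(f"E_M = {e_m:.3f}, E_T = {e_t:.3f}, E = {e:.3f} m^2/s^2")
```

With the population variance (ddof=0), the sum E_M + E_T equals one half of the sample mean of the squared wind speed, which is a convenient consistency check.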

We studied the kinetic wind energy characteristics in the ABL by postprocessing the minisodar measurements of the x, y, and z components of the wind velocity $V_{x} (z_{j}, t_{k})$, $V_{y} (z_{j}, t_{k})$, and $V_{z} (z_{j}, t_{k})$ in the jth strobe at altitude $z_{j}$ in the kth measurement series started at time $t_{k}$ during observation time T. We considered N observations (samples) $V_{x i} (z_{j}, t_{k})$, $V_{y i} (z_{j}, t_{k})$, and $V_{z i} (z_{j}, t_{k})$, $i = 1, \dots, N$, during observation time T. Based on these samples, it is required to obtain unbiased, consistent estimates $({\hat{E}}_{M}, {\hat{E}}_{T})$, $({\hat{E}}_{M 0}, {\hat{E}}_{T 0})$, and $({\hat{E}}_{M out}, {\hat{E}}_{T out})$ of the processes $V$, $V_{0}$, and $V_{out}$.

The unbiased consistent estimate $\hat{E} = ({\hat{E}}_{M}, {\hat{E}}_{T})$ averaged over the observation period T started at time $t_{k} \in T$ is computed from the sample means and variances of $V_{x i} (z_{j}, t_{k})$, $V_{y i} (z_{j}, t_{k})$, and $V_{z i} (z_{j}, t_{k})$, $i = 1, \dots, N$. The unbiased and consistent estimates ${\hat{E}}_{0} = ({\hat{E}}_{M 0}, {\hat{E}}_{T 0})$ may be calculated from the unbiased and consistent estimates of the parameters of the distribution $G (ζ, \vec{θ})$, which may be obtained using the robust algorithms for the distribution $F (ζ, \vec{θ})$ based on the samples $V_{x i} (z_{j}, t_{k})$, $V_{y i} (z_{j}, t_{k})$, $V_{z i} (z_{j}, t_{k})$, $i = 1, \dots, N$, $t_{k} \in T$, from the distribution $F (ζ, \vec{θ})$. Finally, the unbiased and consistent estimate of the outlier energy $({\hat{E}}_{M out}, {\hat{E}}_{T out})$ is equal to the difference ${\hat{E}}_{out} = \hat{E} - {\hat{E}}_{0}$. Hence, the main problem reduces to obtaining asymptotically unbiased, consistent, and effective estimates of the parameters of the distribution $G (ζ, \vec{θ})$ based on the samples from the distribution $F (ζ, \vec{θ})$; that is, robust estimates for the distribution $F (ζ, \vec{θ})$ based on prior information on the forms of the distributions $F (ζ, \vec{θ})$, $G (ζ, \vec{θ})$, and $H (ζ)$ and their superpositions.

As already indicated above, robust statistics based on different robustness criteria cannot guarantee unbiased, consistent, and effective estimates of parameters, especially for asymmetric distributions $G (ζ, \vec{θ})$ [6,10,11,19]. Considering the algorithms for robust estimation at different levels of prior information based on the weighted maximum likelihood method, we can formalize the problem. To simplify the derivation of the robust algorithms, we denote the samples $V_{x i} (z_{j}, t_{k})$, $V_{y i} (z_{j}, t_{k})$, and $V_{z i} (z_{j}, t_{k})$, $i = 1, \dots, N$, by ${\vec{Z}}_{N} = (ζ_{1}, \dots, ζ_{N})$. Let ${\vec{Z}}_{N} = (ζ_{1}, \dots, ζ_{N})$ be an inhomogeneous sample of independent random variables with the distribution function $F (ζ, \vec{θ}) \in P$, where P is the class of distributions in the form of a mixture of distributions satisfying the regularity conditions of the maximum likelihood method (MLM) [6,10,11]:

F (ζ, \vec{θ}) = (1 - ε) G (ζ, \vec{θ}) + ε H (ζ),

(2)

where $G (ζ, \vec{θ}) \in P_{0}$ is the a priori model of the distribution function, $H (ζ) \in P$ is the outlier distribution, $ε \geq 0$ is the outlier fraction, and $\vec{θ} = {(θ_{1}, \dots, θ_{k})}^{T}$ is the unknown parameter vector of the distribution. We denote by $f (ζ, \vec{θ})$, $g (ζ, \vec{θ})$, and $h (ζ)$ the corresponding distribution densities. We consider the problem of constructing the robust effective estimate ${\vec{θ}}_{N}^{*} = {(θ_{1 N}^{*}, \dots, θ_{k N}^{*})}^{T}$ for the unknown parameter $\vec{θ} = {(θ_{1}, \dots, θ_{k})}^{T}$ of the prior distribution $G (ζ, \vec{θ})$ from the inhomogeneous sample ${\vec{Z}}_{N} = (ζ_{1}, \dots, ζ_{N})$ with the distribution $F (ζ, θ) \in P_{θ}$. The mathematical model (2) can be based on both parametric and nonparametric models of the distribution function and on their superpositions, namely, semiparametric and semi-nonparametric models.
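To make the mixture model (2) concrete, the sketch below draws an inhomogeneous sample from a two-component mixture and evaluates its density. Both the prior density g and the outlier density h are taken to be normal here, and the parameter values and function names are hypothetical choices for illustration; in the semiparametric and semi-nonparametric settings the form of h is not assumed known.

```python
import numpy as np

def normal_pdf(z, mu, s):
    """Normal density with mean mu and standard deviation s."""
    return np.exp(-0.5 * ((z - mu) / s) ** 2) / (s * np.sqrt(2.0 * np.pi))

def mixture_pdf(z, eps, mu, s, nu, lam):
    """Density f = (1 - eps) * g + eps * h of model (2), with normal g and h."""
    return (1.0 - eps) * normal_pdf(z, mu, s) + eps * normal_pdf(z, nu, lam)

def sample_mixture(n, eps, mu, s, nu, lam, rng):
    """Draw n values: each is an outlier with probability eps."""
    is_outlier = rng.random(n) < eps
    z = rng.normal(mu, s, size=n)                             # bulk, from g
    z[is_outlier] = rng.normal(nu, lam, size=is_outlier.sum())  # outliers, from h
    return z, is_outlier

# Hypothetical parameters: 10% outliers centred at 4 with scale 0.5.
rng = np.random.default_rng(1)
z, is_outlier = sample_mixture(10_000, eps=0.1, mu=0.0, s=1.0,
                               nu=4.0, lam=0.5, rng=rng)
print("outlier fraction:", is_outlier.mean())
print("sample mean (biased by outliers):", z.mean())
```

In this example the sample mean is pulled toward the outlier component, which is the bias that the robust estimates of the parameters of G are meant to avoid.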

3. Robust Parametric Algorithms and Their Application for Estimating Kinetic Wind Energy

Let us consider the problem at the parametric level of prior uncertainty. For the sample ${\vec{Z}}_{N} = (ζ_{1}, \dots, ζ_{N})$ with the distribution $F (ζ, \vec{θ})$, we first derived the maximum likelihood estimate (MLE) ${\vec{θ}}_{N}^{*}$ in $F (ζ, \vec{θ})$. By analogy with [5,8], it can be shown that the consistent asymptotically unbiased effective estimates ${\vec{θ}}_{N}^{*} = {(θ_{1 N}^{*}, \dots, θ_{k N}^{*})}^{T}$ for the prior distribution $G (ζ, \vec{θ})$ are determined from the system of estimation equations:

\frac{1}{N} \sum_{i = 1}^{N} U_{j} (ζ_{i}, {\vec{θ}}_{N}^{*}) W (ζ_{i}, {\vec{θ}}_{N}^{*}) = 0, \quad j = \overline{1, k},

(3)

U_{j} (ζ, {\vec{θ}}_{N}^{*}) = \frac{\partial}{\partial θ_{j}} \ln g (ζ, \vec{θ}) |_{\vec{θ} = {\vec{θ}}_{N}^{*}},

(4)

W (ζ, {\vec{θ}}_{N}^{*}) = \frac{(1 - ε) g (ζ, {\vec{θ}}_{N}^{*})}{f (ζ, {\vec{θ}}_{N}^{*})} .

(5)

The estimates ${\vec{θ}}_{N}^{*}$ are determined from the system of estimation Equations (3) by iterations and are estimates of the weighted maximum likelihood method (WMLM). The efficiency of the estimates follows from the MLM, the unbiasedness of the estimates for $G (ζ, \vec{θ})$ was shown in [12], and their robustness follows from the bounded variances provided by the weight functions $W (ζ, {\vec{θ}}_{N}^{*})$ given by Equation (5).

As an example, we considered the algorithms for robust estimates of the parameters $\vec{θ} = (μ, s)$ of the prior normal distribution $G (ζ, \vec{θ})$, where μ is the mean and s is the scale (the standard deviation, so that the variance is $s^{2}$):

g (ζ, μ, s) = \frac{1}{s \sqrt{2 π}} \exp (- \frac{u^{2}}{2}), \quad u = \frac{ζ - μ}{s}, \quad f (ζ, μ, s, ν, λ) = (1 - ε) g (ζ, μ, s) + ε g (ζ, ν, λ),

U_{1} (ζ, u) = u, \quad U_{2} (ζ, u) = u^{2} - 1, \quad W (ζ, ν, λ) = \frac{(1 - ε) g (ζ, μ, s)}{f (ζ, μ, s, ν, λ)} .

(6)

The plots of the corresponding distribution densities, weight functions, and estimate functions for the normal distribution are shown in Figure 1 for the indicated outlier fractions.
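The text states only that the estimation equations are solved by iterations. One possible fixed-point scheme for the normal case of Equation (6) is sketched below, under the assumptions that the outlier fraction ε and the outlier parameters (ν, λ) are known and that λ is a standard deviation. With the weights held fixed, the equations with U₁ and U₂ reduce to a weighted mean and a weighted standard deviation, which are then used to recompute the weights. The function names, starting values, and test data are hypothetical.

```python
import numpy as np

def normal_pdf(z, mu, s):
    """Normal density with mean mu and standard deviation s."""
    return np.exp(-0.5 * ((z - mu) / s) ** 2) / (s * np.sqrt(2.0 * np.pi))

def wmlm_normal(z, eps, nu, lam, n_iter=500, tol=1e-10):
    """Fixed-point iteration for (mu, s) of the prior normal distribution.

    Weights follow Equation (5): W = (1 - eps) * g / f, where
    f = (1 - eps) * g(mu, s) + eps * g(nu, lam) with known eps, nu, lam.
    """
    z = np.asarray(z, dtype=float)
    mu, s = np.median(z), z.std()          # starting values (an assumption)
    for _ in range(n_iter):
        g = (1.0 - eps) * normal_pdf(z, mu, s)
        w = g / (g + eps * normal_pdf(z, nu, lam))
        mu_new = np.sum(w * z) / np.sum(w)                 # sum U_1 * W = 0
        s_new = np.sqrt(np.sum(w * (z - mu_new) ** 2) / np.sum(w))  # sum U_2 * W = 0
        converged = abs(mu_new - mu) + abs(s_new - s) < tol
        mu, s = mu_new, s_new
        if converged:
            break
    return mu, s

# Synthetic test: bulk N(0, 1) with 10% outliers from N(4, 0.5).
rng = np.random.default_rng(2)
n = 5000
is_out = rng.random(n) < 0.1
z = np.where(is_out, rng.normal(4.0, 0.5, n), rng.normal(0.0, 1.0, n))
mu_hat, s_hat = wmlm_normal(z, eps=0.1, nu=4.0, lam=0.5)
print("plain mean/std:", z.mean(), z.std())
print("weighted estimates:", mu_hat, s_hat)
```

Under these assumptions the iteration coincides with an expectation-maximization update for a two-component mixture in which only the first component is free, so the plain sample mean and standard deviation serve as a biased baseline for comparison.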

Figure 2a shows an example of the parametric estimates of the vertical profiles of the total kinetic wind energy without (E₀, the orange curve) and with an allowance for the contribution of the kinetic outlier energy (E, the blue curve) retrieved from measurements with the commercial triaxial Doppler monostatic minisodar AV4000 (Atmospheric Systems Corporation, Santa Clara, CA, USA) [4]; its sounding range was 5–200 m with vertical resolution Δz = 5 m. The acoustic antenna was an array of 50 loudspeakers used to both transmit and receive acoustic signals at a frequency of 4900 Hz. This loudspeaker array was electrically steered to generate three independent beams: one vertical and two others at elevation angles of 76° in two mutually orthogonal planes. The minisodar had a pulse repetition period of 4 s and a pulse duration of 60 ms. The minisodar provided one vertical signal profile in all three channels every 4 s, which was used to calculate the wind vector components V_x(z, t), V_y(z, t), and V_z(z, t), and their variances, $σ_{x}^{2} (z, t)$, $σ_{y}^{2} (z, t)$, and $σ_{z}^{2} (z, t)$, from the well-known formulas for the Doppler frequency shifts. To investigate their dynamics, we sampled and processed 150 vertical profiles recorded from the beginning of each hour from 00:00 till 23:00 to obtain 10 min averages and to estimate the total kinetic wind energy E(z, t) and its components caused by the stationary air movement E₀(z, t) and the wind outliers E_out(z, t).

Continuous minisodar measurements were taken in the vicinity of Santa Clarita, CA, USA (34°23′29.9904″ N, 118°32′33.3096″ W) over a flat underlying surface without tall vegetation [25] from 10 to 17 September 2003. During the period of the measurements, the weather was dry, warm and sunny. Here, we postprocessed 10-min averaged minisodar measurements that started on 10 September at 12:00 LT, and at every hour from 00:00 till 23:00 on 16 September. The temperature during the period of measurements on 10 September was 23 °C, and the mean wind speed was 3.57 m/s. The maximum daytime temperature on 16 September was 25.7 °C, and the minimum temperature at night was 16 °C. The average wind velocity was 9.1 m/s.

From Figure 2, it can be seen that without the contribution of the kinetic outlier energy, the curve is quite smooth, whereas with an allowance made for this contribution, its layered structure can clearly be seen. It should also be noted that the atmospheric layers with strong turbulence were also pointed out by Shikhovtsev et al. in [26] and Bolbasova et al. in [27], where the maximum changes of the turbulence layer strengths were observed in the lower layer of the atmosphere at altitudes up to 70 m, and the diurnal variations of their altitudes were also indicated. The blue curve in Figure 2a also shows that local outlier layers were clearly manifested in the vertical profile of the total kinetic energy starting from an altitude of 50 m; moreover, the thickness of these layers remained practically unchanged to altitudes of 150 m. Above 150 m, the curves synchronously changed with altitude and had kinetic energy maxima at z = 175 m. This suggests that new robust algorithms [14] for the detection and selection of outliers of various origins in the observation samples at different levels of a priori statistical uncertainty are especially important for altitude ranges in which the kinetic outlier energy is significant. For this reason, of considerable interest are the physical reasons for the appearance of these features in the kinetic outlier energy. Figure 2b shows an example of the altitude profiles of the turbulence kinetic energy component E_T without (the orange curve) and with allowance for the contribution of the kinetic outlier energy (the blue curve). The comparison of Figure 2a,b demonstrates that, in this situation, the altitude behavior of the orange and blue curves differs quantitatively only in the layer below 25 m; above this altitude, they qualitatively agree. This suggests that at these altitudes, the predominant contribution to the total kinetic wind energy comes from the turbulence kinetic energy component.

In Figure 2a,b, the fairly smooth (orange) curves of the parametric estimates of the vertical profiles of the kinetic energy of stationary air motion, disregarding the contribution of the kinetic outlier energy, can be seen. Taking into account the contribution of the kinetic energy of outliers, the blue curves depict the fine structure of the altitude dependence of the total kinetic energy. This testifies that, in this case, attention should be focused on the kinetic energy of the outliers.

4. Robust Semiparametric Algorithms and Their Application for Estimating Kinetic Wind Energy

Let us consider the problem at the semiparametric level of prior uncertainty. In this case, it is assumed that $G (ζ, \vec{θ})$ is known to within a finite number of unknown parameters $\vec{θ}$, whereas the information on the outliers $\{ε, H (ζ)\}$, that is, the form of the distribution function $H (ζ)$ and, hence, of $F (ζ, \vec{θ})$, is unknown. For the sample ${\vec{Z}}_{N} = (ζ_{1}, \dots, ζ_{N})$ from $F (ζ, \vec{θ})$, we obtained the estimate ${\vec{θ}}_{N}$ by the maximum likelihood method. By analogy with [12], it is possible to show that the asymptotically effective (unbiased and consistent) estimates ${\vec{θ}}_{N}^{*} = {(θ_{1 N}^{*}, \dots, θ_{k N}^{*})}^{T}$ of the WMLM for the prior distribution $G (ζ, \vec{θ})$ are determined from the system of estimation equations:

\frac{1}{N} \sum_{i = 1}^{N} U_{j} (ζ_{i}, {\vec{θ}}_{N}^{*}) W_{N} (ζ_{i}, {\vec{θ}}_{N}^{*}) = 0, \quad j = \overline{1, k},

(7)

U_{j} (ζ, {\vec{θ}}_{N}^{*}) = \frac{\partial}{\partial θ_{j}} \ln g (ζ, \vec{θ}) |_{\vec{θ} = {\vec{θ}}_{N}^{*}},

(8)

W_{N} (ζ_{i}, {\vec{θ}}_{N}^{*}) = \frac{(1 - ε) g (ζ_{i}, {\vec{θ}}_{N}^{*})}{f_{N} (ζ_{i})},

(9)

f_{N} (ζ) = \frac{1}{N h_{N}} \sum_{i = 1}^{N} k (\frac{ζ - ζ_{i}}{h_{N}}),

(10)

where $f_{N} (ζ)$ is the Parzen–Rosenblatt nonparametric density estimate, $k (u)$ is the kernel function, and $h_{N}$ is the bandwidth [6,8]. For $f_{N} (ζ)$ to be consistent and asymptotically unbiased, the following conditions must be satisfied: $k (u) = k (- u)$, $\int_{- \infty}^{\infty} k (u) d u = 1$, $h_{N} \to 0$, and $N h_{N} \to \infty$ as $N \to \infty$.

The estimates ${\vec{θ}}_{N}^{*}$ are determined from the system of estimation Equations (7) by iterations. The efficiency of the estimates follows from the WMLM, and the unbiasedness and bounded variances of the estimates are provided by the weight functions $W_{N} (ζ_{i}, {\vec{θ}}_{N}^{*})$ given by Equation (9). For example, for the normal distribution $G (ζ, θ)$ with the shift parameter θ,

g (ζ, θ) = \frac{1}{\sqrt{2 π} σ} \exp (- \frac{{(ζ - θ)}^{2}}{2 σ^{2}}) .

Estimation Equation (7) assumes the form

\sum_{i = 1}^{N} (ζ_{i} - θ_{N}) W_{N} (ζ_{i}, θ_{N}) = 0,

W_{N} (ζ_{i}, θ_{N}) = \frac{\exp (- \frac{{(ζ_{i} - θ_{N})}^{2}}{2 σ^{2}})}{f_{N} (ζ_{i})}, \quad f_{N} (ζ) = \frac{1}{N \sqrt{2 π} h_{N}} \sum_{j = 1}^{N} \exp (- \frac{1}{2} {(\frac{ζ - ζ_{j}}{h_{N}})}^{2}) .

(11)

In Figure 3, the results of modeling are shown for observations from a superposition of normal distributions with asymmetric internal outliers for $θ = 0$, $ε = 0.1$, $μ = 1$, and $λ = 0.2$:

f (ζ, θ) = \frac{0.9}{\sqrt{2 π}} \exp [- \frac{ζ^{2}}{2}] + \frac{0.1}{\sqrt{2 π} λ} \exp [- \frac{{(ζ - μ)}^{2}}{2 λ^{2}}] .
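The ingredients of Equations (10) and (11) can be sketched for this simulated mixture as follows. The code builds the Gaussian-kernel Parzen–Rosenblatt estimate and evaluates the weights W_N at a fixed value of θ; it does not iterate the estimation equation. The bandwidth rule, the treatment of λ as a standard deviation, the sample size, and all function names are assumptions of this example.

```python
import numpy as np

def parzen_rosenblatt(x_eval, data, h):
    """Gaussian-kernel Parzen-Rosenblatt density estimate f_N, Equation (10)."""
    x_eval = np.asarray(x_eval, dtype=float)
    data = np.asarray(data, dtype=float)
    u = (x_eval[:, None] - data[None, :]) / h
    return np.exp(-0.5 * u ** 2).sum(axis=1) / (len(data) * h * np.sqrt(2.0 * np.pi))

def semiparametric_weights(data, theta, sigma, h):
    """Weights W_N of Equation (11) at a fixed shift parameter theta."""
    g = np.exp(-0.5 * ((data - theta) / sigma) ** 2)
    return g / parzen_rosenblatt(data, data, h)

# Simulated sample: 90% from N(0, 1), 10% internal outliers from N(1, 0.2).
rng = np.random.default_rng(3)
n = 2000
is_out = rng.random(n) < 0.1
zeta = np.where(is_out, rng.normal(1.0, 0.2, n), rng.normal(0.0, 1.0, n))

h = 1.06 * zeta.std() * n ** (-0.2)    # rule-of-thumb bandwidth (an assumption)
w = semiparametric_weights(zeta, theta=0.0, sigma=1.0, h=h)

# Compare weights near the outlier cluster with those at the mirror point.
w_near_outliers = w[np.abs(zeta - 1.0) < 0.1].mean()
w_mirror = w[np.abs(zeta + 1.0) < 0.1].mean()
print("mean weight near  1:", w_near_outliers)
print("mean weight near -1:", w_mirror)
```

Observations near the outlier cluster at ζ ≈ 1 receive smaller weights than observations at the mirror point ζ ≈ −1, because the density estimate in the denominator is inflated where the outliers concentrate.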

To determine the kinetic energy $E_{0}$ of the stationary process, we needed to determine the mean $\bar{X} (\vec{θ})$ and variance $σ^{2} (\vec{θ})$ of the distribution $G (ζ, \vec{θ})$:

\bar{X} (\vec{θ}) = \int_{- \infty}^{\infty} ζ \cdot g (ζ, \vec{θ}) d ζ, \quad σ^{2} (\vec{θ}) = \int_{- \infty}^{\infty} {(ζ - \bar{X} (\vec{θ}))}^{2} \cdot g (ζ, \vec{θ}) d ζ .

(12)

Defining from Equations (8)–(10) the asymptotically effective robust estimates of the parameters

{\vec{θ}}_{N}^{*} = {(θ_{1 N}^{*}, \dots, θ_{k N}^{*})}^{T}

and substituting them into Equation (8), we obtained for

\bar{X} (\vec{θ})

and

σ^{2} (\vec{θ})

the asymptotically efficient robust estimates of the mean

{\bar{X}}_{N} ({\vec{θ}}_{N}^{*})

and variance

σ_{N}^{2} ({\vec{θ}}_{N}^{*})

and determined on their basis the kinetic energy

E_{0}

of the stationary process.
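The step from Equation (12) to the kinetic energy can be sketched numerically. The integrals are evaluated with a simple midpoint rule for a normal density whose parameters are hypothetical. The energy convention used here, E = E_M + E_T per unit mass with E_M equal to half the sum of squared means and E_T equal to half the sum of variances over the three wind components, is an assumption of this example; it matches the units m²/s² used in Table 1 but is not restated in this section.

```python
import math

def moments_from_density(g, lo, hi, steps=20000):
    """Midpoint-rule evaluation of the mean and variance integrals in Eq. (12)."""
    dz = (hi - lo) / steps
    grid = [lo + (k + 0.5) * dz for k in range(steps)]
    mean = sum(z * g(z) for z in grid) * dz
    var = sum((z - mean) ** 2 * g(z) for z in grid) * dz
    return mean, var

def kinetic_energy(means, variances):
    """Assumed convention: total, mean and turbulence energy per unit mass (m^2/s^2)."""
    e_mean = 0.5 * sum(m * m for m in means)
    e_turb = 0.5 * sum(variances)
    return e_mean + e_turb, e_mean, e_turb

def normal_pdf(z, theta=2.0, sigma=1.5):
    """Hypothetical fitted density g(zeta, theta) for one wind component."""
    return math.exp(-0.5 * ((z - theta) / sigma) ** 2) / (math.sqrt(2.0 * math.pi) * sigma)

mean, var = moments_from_density(normal_pdf, -13.0, 17.0)
total, e_mean, e_turb = kinetic_energy([3.0, 4.0, 0.0], [1.0, 1.0, 2.0])
```

For a normal density the integrals simply return θ and σ²; the numerical route is useful when the fitted density has no closed-form moments.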

5. Robust Semi-Nonparametric Algorithms and Their Application for Estimation of Kinetic Wind Energy

Let us consider the class of semi-nonparametric problems of robust statistics in which the information on the prior distribution

G (ζ, \vec{θ})

and the outliers

\{H (ζ), ε\}

; that is, on the forms of the distribution functions

H (ζ)

,

G (ζ, \vec{θ})

and, hence,

F (ζ, \vec{θ})

is unknown. As the parameters

\vec{θ}

of this nonparametric class of distributions

G (ζ, \vec{θ})

, the shift and scale parameters can be considered and, in general, some functionals of

G (ζ, \vec{θ})

. Note that although the forms of the distribution of the majority of observations

G (ζ, \vec{θ})

and of the outliers

H (ζ)

are unknown, researchers usually have some additional prior information on these distributions that allows them to be distinguished. We formalize this additional prior information as follows. It is known a priori that

G (ζ, θ) \in {\tilde{P}}_{0 θ}

, where

{\tilde{P}}_{0 θ}

is a nonparametric class of continuous functions satisfying the conditions

S_{i} = \int ψ_{i} (x) d G (x, θ) = 0, i = 1, \dots, r,

(13)

where the functions

ψ_{1}, \dots, ψ_{r}

are known. The assignment of the prior information in the form of Equation (13) allows one to consider a wide spectrum of information on

G (ζ, θ)

of both quantitative and qualitative character [22,23]. Taking into account prior information (13), the modified nonparametric estimate of the Parzen–Rosenblatt density

{\tilde{g}}_{N} (ζ, θ)

can be constructed [22,23]:

{\tilde{g}}_{N} (ζ) = \frac{1}{N} \sum_{i = 1}^{N} \frac{1}{h_{N}} k (\frac{ζ - ζ_{i}}{h_{N}}) [1 - ‖\frac{1}{N} \sum_{i = 1}^{N} ψ_{l} (ζ_{i})‖ Λ_{N}^{- 1} {‖\frac{1}{N} \sum_{i = 1}^{N} ψ_{j} (ζ_{i})‖}^{T}] .

(14)

The estimate

{\tilde{g}}_{N} (ζ)

is asymptotically unbiased, consistent, and

\sqrt{N h_{N}} [{\tilde{g}}_{N} (ζ) - E {\tilde{g}}_{N} (ζ)]

has an asymptotically normal distribution [22,23].

Let us define the WMLM estimation equation for

G (ζ, θ) \in {\tilde{P}}_{0 θ}

in the form

\frac{1}{N} \sum_{i = 1}^{N} {\tilde{φ}}_{N} (ζ_{i}, θ_{N}) = 0,

(15)

where the estimation function

{\tilde{φ}}_{N} (ζ, θ)

is

{\tilde{φ}}_{N} (ζ, θ) = \frac{\partial}{\partial θ} \ln {\tilde{g}}_{N} (ζ, θ) \frac{{\tilde{g}}_{N} (ζ, θ)}{f_{N} (ζ, θ)} = \tilde{U} (ζ, θ) {\tilde{W}}_{N} (ζ, θ),

(16)

\tilde{U} (ζ, θ)

is the function of the MLE contribution for

\tilde{G} (ζ, θ)

,

{\tilde{W}}_{N} (ζ, θ)

is the weight function:

{\tilde{W}}_{N} (ζ, θ) = \frac{(1 - ε) {\tilde{g}}_{N} (ζ, θ)}{f_{N} (ζ, θ)},

(17)

and

f_{N} (ζ)

is the nonparametric estimate of the Parzen–Rosenblatt density given by Equation (10). Formulas (15)–(17) define the adaptive nonparametric estimate (ANE)

{\tilde{θ}}_{N}^{*}

of the WMLM.

For example, let

G (ζ, θ) \in {\tilde{P}}_{0 θ}

be symmetric with respect to the shift parameter

θ

. The modified nonparametric estimate of the Parzen–Rosenblatt density takes the form [22,23]

{\tilde{g}}_{N} (ζ, θ) = \frac{1}{2} [\frac{1}{N h_{N}} \sum_{j = 1}^{N} k (\frac{ζ - ζ_{j}}{h_{N}}) + \frac{1}{N h_{N}} \sum_{i = 1}^{N} k (\frac{2 θ - ζ - ζ_{i}}{h_{N}})] .

(18)
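A minimal sketch of a symmetrized kernel estimate of this kind is given below. It assumes that the direct and mirrored kernel sums are averaged, which is what makes the estimate symmetric about θ and keeps its integral equal to one; the normal kernel and the function names are choices made for this example.

```python
import math

def kde(zeta, sample, h):
    """Ordinary Parzen-Rosenblatt estimate f_N(zeta) with a normal kernel."""
    c = 1.0 / (len(sample) * h * math.sqrt(2.0 * math.pi))
    return c * sum(math.exp(-0.5 * ((zeta - z) / h) ** 2) for z in sample)

def symmetrized_kde(zeta, theta, sample, h):
    """Average of the kernel estimate at zeta and at its mirror image 2*theta - zeta."""
    return 0.5 * (kde(zeta, sample, h) + kde(2.0 * theta - zeta, sample, h))
```

By construction `symmetrized_kde(theta + d, ...)` equals `symmetrized_kde(theta - d, ...)` for any offset d, even when the sample itself is asymmetric.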

The estimate

{\tilde{g}}_{N} (ζ, θ)

is asymptotically unbiased, consistent, and has the minimum root-mean-square error on the nonparametric class of distributions

{\tilde{P}}_{0 θ}

.

Let

β

be the unknown parameter of

G (ζ, θ) \in {\tilde{P}}_{0 θ}

in the form:

β = \int ρ (x, θ) d G (x, θ) or \int ρ (x, θ) d G (x, θ) - β = 0

(19)

To derive the robust estimate

β_{N}

of the parameter β, consider the estimation equation in the form:

\frac{1}{N} \sum_{i = 1}^{N} {\tilde{ω}}_{N} (ζ_{i}, θ_{N}) = 0,

(20)

where the estimation function

{\tilde{ω}}_{N} (ζ, θ)

has the form:

{\tilde{ω}}_{N} (ζ, θ) = (ρ (ζ, θ) - β) {\tilde{W}}_{N} (ζ, θ)

(21)

{\tilde{W}}_{N} (ζ, θ) = \frac{{\tilde{g}}_{N} (ζ, θ)}{f_{N} (ζ, θ)}

is the weight function,

f_{N} (ζ)

is the nonparametric estimate of the Parzen–Rosenblatt density given by Equation (10), and

{\tilde{g}}_{N} (ζ, θ)

is the modified nonparametric estimate of the Parzen–Rosenblatt density. Formulas (20) and (21) define the robust semi-nonparametric estimate β_N of the parameter β. A mathematical analysis of these estimates is beyond the scope of the present work. Note only that they are asymptotically unbiased and efficient for the distribution

G (ζ, θ) \in {\tilde{P}}_{0 θ}

, provided that the sample

{\vec{Ζ}}_{N} = (ζ_{1}, \dots, ζ_{N})

is from

F (ζ, \vec{θ})

.
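For a fixed θ, Equations (20) and (21) are linear in β, so the root can be written as a weighted average of ρ with the weights g̃_N / f_N. The sketch below illustrates this for the symmetric class, assuming a symmetrized kernel estimate that averages the direct and mirrored kernel sums. The value θ = 0, the bandwidth, the choice ρ(ζ, θ) = ζ, and the test data are all hypothetical.

```python
import math
from statistics import NormalDist

def kde(zeta, sample, h):
    """Parzen-Rosenblatt estimate f_N(zeta) with a normal kernel."""
    c = 1.0 / (len(sample) * h * math.sqrt(2.0 * math.pi))
    return c * sum(math.exp(-0.5 * ((zeta - z) / h) ** 2) for z in sample)

def symmetrized_kde(zeta, theta, sample, h):
    """Assumed symmetrized estimate: mean of f_N at zeta and at 2*theta - zeta."""
    return 0.5 * (kde(zeta, sample, h) + kde(2.0 * theta - zeta, sample, h))

def robust_functional(sample, theta, rho, h=0.4):
    """Root of Eq. (20): beta_N = sum(rho_i * W_i) / sum(W_i), with W_i = g~_N / f_N."""
    w = [symmetrized_kde(z, theta, sample, h) / kde(z, sample, h) for z in sample]
    return sum(wi * rho(z, theta) for wi, z in zip(w, sample)) / sum(w)

# Hypothetical data: 90 standard-normal quantiles plus 10 external outliers near 6.
bulk = [NormalDist().inv_cdf((i + 0.5) / 90) for i in range(90)]
outliers = [5.5 + i / 9 for i in range(10)]
sample = bulk + outliers

beta_plain = sum(sample) / len(sample)
beta_robust = robust_functional(sample, theta=0.0, rho=lambda z, theta: z)
```

In this toy case the mirrored term vanishes at the outliers, so their weights drop to about 1/2 while the weights of the symmetric bulk stay near 1, and the estimate of the mean is pulled toward the outliers roughly half as much as the plain average.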

For example, let

G (ζ, θ) \in {\tilde{P}}_{0 θ}

be the class of distributions symmetric with respect to the shift parameter θ. Let us denote by

(μ, s)

the mean and variance of

G (ζ, θ)

. In this case, the estimation equations for the robust nonparametric estimates

(μ_{N}, s_{N})

and normal kernel function (7) take the following forms:

\sum_{i = 1}^{N} \sum_{j = 1}^{N} (μ_{N} - t_{i j}) \frac{{\tilde{g}}_{N} (- \frac{{(μ_{N} - t_{i j})}^{2}}{2 σ_{N}^{2}})}{f_{N} (ζ_{i})} = 0,

\sum_{i = 1}^{N} \sum_{j = 1}^{N} {(σ_{N} - t_{i j})}^{2} \frac{{\tilde{g}}_{N} (- \frac{{(μ_{N} - t_{i j})}^{2}}{2 σ_{N}^{2}})}{f_{N} (ζ_{i})} = 0

where

t_{i, j} = \frac{ζ_{i} + ζ_{j}}{2}

are the Walsh half-sums and

{\tilde{g}}_{N} (ζ, θ)

is defined by Formula (18). The estimates are determined by iteration. The estimates of the kinetic energies of the stationary process without outliers,

E_{0}

, and with the outliers,

E_{out}

, were obtained using these formulas. Table 1 presents the semi-nonparametric estimates of the altitude dependence of the kinetic wind energy without outliers,

E_{0}

, the kinetic wind energy with an allowance for the contribution of the outliers, E, and their difference

E - E_{0} = E_{out}

equal to the kinetic energy of the outliers obtained by the postprocessing of the minisodar measurements.
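The Walsh half-sums t_ij that enter the estimation equations for (μ_N, s_N) above can be formed as in the following sketch. The second function is included only for orientation: the median of the Walsh half-sums is the classical Hodges–Lehmann location estimate, which is a different estimator from the weighted equations of this section. Both function names are hypothetical.

```python
from statistics import median

def walsh_half_sums(sample):
    """Full N x N table of half-sums t_ij = (zeta_i + zeta_j) / 2."""
    return [[0.5 * (a + b) for b in sample] for a in sample]

def hodges_lehmann(sample):
    """Median of the half-sums with i <= j (classical estimator, shown for comparison)."""
    n = len(sample)
    pairs = [0.5 * (sample[i] + sample[j]) for i in range(n) for j in range(i, n)]
    return median(pairs)
```

For N observations the full table has N² entries, so the double sums over i and j scale quadratically with the length of the measurement series.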

Consider first the semi-nonparametric estimates

E_{0}

of the kinetic energy without outliers. Their nearly monotonic increase with altitude can be easily traced from 20 to 200 m. The large E₀ value at z = 10 m is apparently caused by the effect of the underlying surface. The rate of E₀ growth increased with altitude starting from z = 150 m. Our statistical analysis of the altitude behavior of the total kinetic energy E with allowance for the contribution of the kinetic energy of the outliers revealed the following. The E values differed only slightly from the corresponding E₀ values at altitudes up to 50 m; that is, in this altitude range, the contribution of the kinetic energy of the outliers to the total kinetic energy was low. The situation changed radically above 50 m. With a further increase in altitude, a substantial nonmonotonic increase in E was observed, with considerable deviations from a monotonic dependence.

As to the altitude dependence of the kinetic energy of the outliers, E_out, it was low at altitudes z ≤ 50 m. Its enhanced value at z = 10 m was caused by the effect of the underlying surface. From 50 to 170 m, a nonmonotonic increase in E_out was observed, with enhanced E_out values between 100 and 110 m. Above 170 m, a certain decrease in E_out was observed.
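The altitude behavior described above can be checked directly against the values of Table 1. The short sketch below recomputes E_out = E − E₀ and the share of the outlier energy in the total energy; the variable names are chosen for this example, and the numbers are copied from the table.

```python
# Values of Table 1 (m^2/s^2) at altitudes from 10 to 200 m in 10 m steps.
altitude = list(range(10, 201, 10))
e0 = [8.03, 3.29, 4.40, 4.13, 3.76, 5.82, 6.47, 5.60, 6.75, 8.03,
      8.93, 6.61, 8.36, 11.0, 11.7, 16.6, 34.76, 40.13, 53.94, 85.83]
e = [9.62, 3.34, 4.62, 4.33, 4.17, 9.25, 11.50, 10.60, 12.68, 23.68,
     20.73, 15.46, 16.8, 28.83, 28.09, 35.39, 80.35, 76.71, 92.58, 121.80]

# Kinetic energy of the outliers and its share in the total kinetic energy.
e_out = [round(total - base, 2) for total, base in zip(e, e0)]
share = [out / total for out, total in zip(e_out, e)]

# Altitude at which the outlier energy is largest.
z_max_out = altitude[e_out.index(max(e_out))]
```

With these numbers the outlier share stays below 17% up to 50 m and lies between roughly 30% and 66% from 60 to 200 m, and the largest E_out occurs at 170 m, in line with the description in the text.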

Figure 4 shows the diurnal hourly dynamics of the robust semi-nonparametric estimates of the total kinetic wind energy with allowance for the outliers, E (a); its mean, E_M (b), and turbulence, E_T (c), components; and the kinetic energy of the outliers

E_{out} = E - E_{0}

(d) retrieved by the postprocessing of the minisodar measurements. From Figure 4a, it can be seen that the maximum values (red area) E_max ~ 500 m²/s² were observed at night, from 00:00 till 02:00 local time, and in the evening, from 19:00 till 24:00. However, whereas at night the lower boundary of the layer with enhanced kinetic energy was about 125 m, in the evening it descended to 75 m. From 02:00 till 19:00, E did not exceed 100 m²/s² in the lower layer. The upper boundary of this layer first ascended from about 50 m to 200 m at noon and then descended to 100 m.

The robust semi-nonparametric estimates of the diurnal hourly behavior of the mean kinetic wind energy component E_M (Figure 4b) demonstrated that from 04:00 till 19:00, its values were about 50 m²/s² and were practically independent of time of day and sensing altitude. From midnight till 02:30, E_M underwent local variations, and reached 225 m²/s² (red area) at altitudes above 125 m. From 02:30 till 04:00, it increased to 100 m²/s² at altitudes from 25 to 75 m (light blue area). From 19:00 till 24:00, areas with enhanced values of E_M ~ 225 m²/s² (red areas) were observed. As a whole, the contribution of E_M to the total kinetic wind energy E was low.

Figure 4c shows the robust semi-nonparametric estimates of the diurnal hourly dynamics of the turbulence kinetic energy component E_T obtained by the postprocessing of the minisodar measurements. From the figure, it can be seen that E_T increased with altitude z. In the lower layer, the height of which first increased from 75 m at 00:00 to 200 m at 01:00 and then decreased to 75 m at 24:00 with significant hourly variations, E_T did not exceed 80 m²/s². Above 75 m, an area of enhanced E_T was clearly visible.

Figure 4d shows the diurnal hourly dynamics of the kinetic wind energy of the outliers. It can be seen that the contribution of the outliers was especially pronounced at night, from 00:00 till 02:00, at altitudes above 50 m. From 05:00 till 19:00, the kinetic energy of the outliers reached 60 m²/s² at altitudes above 100 m. In the evening and at night, the kinetic energy of the outliers (red areas) reached 120 m²/s², thus making a significant contribution to the total kinetic energy E ~ 200 m²/s² in this altitude range (see Figure 4a), and its effect was more pronounced at altitudes between 50 and 100 m.

6. Conclusions

Using the parametric, semiparametric, and semi-nonparametric algorithms developed by the authors, we analyzed the spatiotemporal dynamics of the total kinetic wind energy without (E₀) and with (E) allowance for the kinetic energy of the outliers, E_out, as well as of its mean, E_M, and turbulence, E_T, components, by postprocessing the minisodar measurements of the three wind vector components and their variances in the lower 200 m layer of the atmosphere. The parametric, semiparametric, and semi-nonparametric robust algorithms for obtaining efficient

\hat{E}

,

{\hat{E}}_{0}

, and

{\hat{E}}_{out}

estimates with the model distributions

F (ζ, \vec{θ}), G (ζ, \vec{θ})

, and

H (ζ, \vec{θ})

used to analyze their spatiotemporal dynamics allowed us to reveal some important physical features. Without allowance for the contribution of the kinetic energy of the outliers, the altitude profiles of the kinetic wind energy components in the ABL were quite smooth curves. Allowance for the contribution of the kinetic energy of the outliers revealed their thin-layered structure, thereby demonstrating the efficiency of the proposed robust algorithms. A nonmonotonic increase in the kinetic energy of the wind outliers with sounding altitude was established. Physically, this can be explained by the nonmonotonic increase in the turbulence kinetic energy of local air vortices in the ABL. The vertical extent of the outlier layers was of the order of 10–20 m.

Author Contributions

Conceptualization, V.A.S., A.I.P., O.S.C. and L.G.S.; methodology, V.A.S., A.I.P., O.S.C. and L.G.S.; validation, V.A.S., A.I.P., O.S.C. and L.G.S.; formal analysis, V.A.S., A.I.P., O.S.C. and L.G.S.; investigation, V.A.S., A.I.P., O.S.C. and L.G.S.; resources, V.A.S., A.I.P., O.S.C. and L.G.S.; data curation, V.A.S., A.I.P., O.S.C. and L.G.S.; writing—original draft preparation, V.A.S., A.I.P., O.S.C. and L.G.S.; writing—review and editing, V.A.S., A.I.P., O.S.C. and L.G.S.; visualization, V.A.S., A.I.P., O.S.C. and L.G.S.; supervision, V.A.S., A.I.P., O.S.C. and L.G.S.; project administration, V.A.S., A.I.P., O.S.C. and L.G.S.; funding acquisition, V.A.S., A.I.P., O.S.C. and L.G.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Ministry of Science and Higher Education of the Russian Federation (V.E. Zuev Institute of Atmospheric Optics of the Siberian Branch of the Russian Academy of Sciences).

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Foken, T. Micrometeorology; Springer: Berlin/Heidelberg, Germany, 2008; 306p.
2. Byzova, N.L.; Ivanov, V.N.; Garger, E.K. Turbulence in the Atmospheric Boundary Layer; Gidrometeoizdat: Leningrad, Russia, 1989; 264p.
3. Monin, A.S. Structure of the atmospheric turbulence. Theory Probab. Its Appl. 1958, 3, 285–317.
4. Doppler MiniSoDAR System: Operation and Maintenance Manual. Available online: https://home.chpc.utah.edu/~u0035056/5910_2010/Minisodarmanual.pdf (accessed on 30 December 2020).
5. Bradley, S. Atmospheric Acoustic Remote Sensing: Principles and Applications; CRC Press Taylor & Francis Group: Boca Raton, FL, USA, 2007; 296p.
6. Simakhin, V.A.; Cherepanov, O.S.; Shamanaeva, L.G. Experimental Data Processing under Conditions of A Priori Statistical Uncertainty; Scientific Technology Publishing House: Tomsk, Russia, 2021; 340p.
7. Zou, M.; Diokic, S.Z. A review of approaches for the detection and treatment of outliers in processing wind turbine and wind farm measurements. Energies 2020, 13, 4228.
8. Hampel, F.R.; Ronchetti, E.M.; Rousseeuw, P.J.; Stahel, W.A. The Approach Based on Influence Functions; John Wiley & Sons, Inc.: New York, NY, USA, 1986; 502p.
9. Gill, S.; Stephen, B.; Galloway, S. Wind Turbine Condition Assessment through Power Curve Copula Modeling. IEEE Trans. Sustain. Energy 2012, 3, 94–101.
10. Shurygin, A.M. Applied Statistics. Robustness. Estimation. Prediction; Finansy i Statistika: Moscow, Russia, 2000; 223p.
11. Shulenin, V.P. Robust Methods of Mathematical Statistics; Scientific Technology Publishing House: Tomsk, Russia, 2016; 260p.
12. Simakhin, V.A.; Shamanaeva, L.G.; Avdjushina, A.E. Robust semiparametric and semi-nonparametric estimates of inhomogeneous experimental data. Russ. Phys. J. 2021, 64, 355–366.
13. De Menezes, D.Q.F.; Prata, D.M.; Secchi, A.R.; Pinto, J.C. A review on robust M-estimators for regression analysis. Comp. Chem. Eng. 2021, 147, 107254.
14. Pitselis, G. A review on robust estimators applied to regression credibility. J. Comput. Appl. Math. 2013, 239, 231–249.
15. Yu, C.; Yao, W. Robust linear regression: A review and comparison. Commun. Stat. Simul. Comput. 2016, 46, 6261–6282.
16. Baeldung. Robust Estimators in Robust Statistics. Math and Logic. Probability and Statistics. 2023. Available online: http://www.baeldung.com/cs/robust-estimators-in-robust-statistics (accessed on 11 April 2023).
17. Fedorov, V.A. Measuring the parameters of the radial components of the wind velocity vector with the “Volna-3” sodar. Opt. Atm. Okeana 2003, 16, 151–155.
18. Gladkikh, V.A.; Nevzorova, I.V.; Mamyshev, V.P.; Odintsov, S.L. Skewness and kurtosis of the distribution of the outer scales of turbulence in the near-surface layer of the atmosphere. In Proceedings of the XXV International Symposium on Atmospheric and Ocean Optics: Atmospheric Physics, Novosibirsk, Russia, 1–5 July 2019.
19. Jaeckel, L.A. Robust estimates of location: Symmetry and asymmetric contamination. Ann. Math. Stat. 1971, 42, 1020–1034.
20. Muthukrishnan, R.; Poonkuzhali, G. A comprehensive survey on outlier detection methods. Am. Eurasian J. Sci. Res. 2017, 12, 161–171.
21. Chandola, V.; Banerjee, A.; Kumar, V. Anomaly detection: A survey. ACM Comput. Surv. 2009, 41, 58.
22. Dmitriev, Y.G.; Koshkin, G.M. Nonparametric estimators of probability characteristics using unbiased prior conditions. Stat. Pap. 2018, 59, 1559–1575.
23. Dmitriev, Y.G.; Koshkin, G.M. Using additional data in nonparametric estimation of density functionals. Avtomat. Telemekh. 1987, 10, 47–59.
24. Potekaev, A.; Shamanaeva, L.; Kulagina, V. Spatiotemporal dynamics of the kinetic energy in the atmospheric boundary layer from minisodar measurements. Atmosphere 2021, 12, 421.
25. Underwood, K.H.; Shamanaeva, L.G. Temporal dynamics of longitudinal and transverse velocity structure functions retrieved from the data of acoustic sounding. Russ. Phys. J. 2011, 54, 113–120.
26. Shikhovtsev, A.Y.; Kiselev, A.V.; Kovadlo, P.G.; Kolobov, D.Y.; Lukin, V.P.; Tomin, V.E. Method for estimating the atmospheric layers with strong turbulence. Atmos. Ocean. Opt. 2020, 33, 295–301.
27. Bolbasova, L.A.; Shikhovtsev, A.Y.; Kopylov, E.A.; Selin, A.A.; Lukin, V.P.; Kovadlo, P.G. Daytime optical turbulence and wind speed distributions at the Baikal Astrophysical Observatory. Mon. Not. R. Astron. Soc. 2019, 482, 2619–2626.

Figure 1. Prior normal distribution: (a) distribution densities

g (ζ, \vec{θ}), ε = 0

(curve 1);

f (ζ, 1, 1), ε = 0.1

–internal outlier (curve 2);

f (ζ, 5, 1), ε = 0.1

–external outlier (curve 3); (b) weight functions

W (ζ, 0, 1), ε = 0

(curve 1); MLE

W (ζ, 1, 1), ε = 0.1

(curve 2); weight function

W (ζ, 0, 1), ε = 0.1

for the robust highly efficient MD Hellinger distance estimate [6,7] (curve 3); (c) estimation functions

U_{1} (ζ, 0, 1), ε = 0

(curve 1); MLE

U_{1} (ζ, 1, 1), ε = 0.1

(curve 2); estimation function

U_{1} (ζ, 0, 1), ε = 0.1

for the robust highly efficient MD Hellinger distance estimate (curve 3); (d) estimation functions

U_{2} (ζ, 0, 1), ε = 0

(curve 1); MLE

U_{2} (ζ, 1, 1), ε = 0.1

(curve 2); estimation function

U_{2} (ζ, 0, 1), ε = 0.1

for the robust highly efficient MD Hellinger distance estimates (curve 3).

Figure 2. Vertical profiles of the parametric estimates of the kinetic wind energy in the atmosphere obtained by postprocessing of the 10 min AV4000 minisodar measurement series on 10 September 2003, started at 12:00, local time: (a) the total kinetic energy without (E₀, the orange curve) and with allowance for the contribution of the kinetic outlier energy (E, blue curve), and (b) turbulence kinetic energy component without (

E_{0 T}

, the orange curve) and with allowance for the contribution of the kinetic outlier energy (E_T, the blue curve).

Figure 3. Model of the asymmetric internal outliers with

θ = 0, ε = 0.1, μ = 1, and λ = 0.2

: (a) distribution density, where curve 1 shows the estimate

f_{N} (x)

with normal kernel (7), and curve 2 shows the distribution density

f (x)

; (b) weight function

W (x)

, where curve 1 shows the estimate

W_{N} (x)

with normal kernel (7), and curve 2 shows the parametric estimate

W (x)

.

Figure 4. Diurnal hourly dynamics of the robust semi-nonparametric estimates of the total kinetic wind energy with allowance for the outliers, E (a); its mean, E_M (b), and turbulence, E_T (c), components; and the kinetic energy of the outliers

E_{out} = E - E_{0}

(d), retrieved by postprocessing of minisodar measurements.

An area of enhanced E_T was observed from midnight till 04:00, when E_T changed from 400 to 200 m²/s². One more area with enhanced E_T values was observed from 14:00 till 24:00; here, E_T also changed from 400 to 200 m²/s², and its lower boundary descended from 175 m at 15:00 to 125 m at 24:00. A comparison of Figure 4a–c shows that the turbulence kinetic energy component determines the local features of the diurnal hourly dynamics of the total kinetic energy, whereas the diurnal hourly dynamics of the mean kinetic energy component forms the background for the turbulence kinetic energy.

Table 1. Semi-nonparametric estimates of the kinetic wind energy components.

Altitude z, m	10	20	30	40	50	60	70	80	90	100
E₀, m²/s²	8.03	3.29	4.40	4.13	3.76	5.82	6.47	5.60	6.75	8.03
E, m²/s²	9.62	3.34	4.62	4.33	4.17	9.25	11.50	10.60	12.68	23.68
E_out, m²/s²	1.59	0.05	0.22	0.20	0.41	3.43	5.03	5.0	5.93	15.65
Altitude z, m	110	120	130	140	150	160	170	180	190	200
E₀, m²/s²	8.93	6.61	8.36	11.0	11.7	16.6	34.76	40.13	53.94	85.83
E, m²/s²	20.73	15.46	16.8	28.83	28.09	35.39	80.35	76.71	92.58	121.80
E_out, m²/s²	11.8	8.85	8.44	17.83	16.39	18.79	45.59	36.58	38.64	35.97

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

