Kernel Estimation of the Extropy Function under α-Mixing Dependent Data

Radhakumari Maya; Muhammed Rasheed Irshad; Hassan Bakouch; Archana Krishnakumar; Najla Qarmalah

doi:10.3390/sym15040796

,

and

¹

Department of Statistics, University College, Trivandrum 695 034, Kerala, India

²

Department of Statistics, Cochin University of Science and Technology, Cochin 682 022, Kerala, India

³

Department of Mathematics, College of Science, Qassim University, Buraydah 51452, Saudi Arabia

⁴

Department of Mathematics, Faculty of Science, Tanta University, Tanta 31111, Egypt

Symmetry2023, 15(4), 796;https://doi.org/10.3390/sym15040796

Version Notes

Order Reprints

Abstract

Shannon developed the idea of entropy in 1948, which relates to the measure of uncertainty associated with a random variable X. The contribution of the extropy function as a dual complement of entropy is one of the key modern results based on Shannon’s work. In order to develop the inferential aspects of the extropy function, this paper proposes a non-parametric kernel type estimator as a new method of measuring uncertainty. Here, the observations are exhibiting

α

-mixing dependence. Asymptotic properties of the estimator are proved under appropriate regularity conditions. For comparison’s sake, a simple non-parametric estimator is proposed, and in this respect, the performance of the estimator is investigated using a Monte Carlo simulation study based on mean-squared error and using two real-life data.

Keywords:

entropy; extropy; kernel estimator; α-mixing; simulation

1. Introduction

Reference [1] made a significant contribution to statistics by coining the term “entropy”, which refers to the measurement of uncertainty in a probability distribution. If X is a non-negative random variable (rv) that admits an absolutely continuous cumulative distribution function (cdf)

F (x)

with the corresponding probability density function (pdf)

f (x)

, then the Shannon entropy concept associated with X can be defined as follows:

R (X) = - \int_{0}^{+ \infty} f (x) log f (x) d x .

(1)

Previous research explores numerous extended and generalized versions of Shannon’s entropy function. Indeed, an excellent review of various developments on Shannon’s entropy function and its inferential aspects is covered by [2].

One of the main contemporary findings based on Shannon’s work is the contribution of a study by [3], which suggests the extropy function as a dual complement of entropy. It is defined for an absolutely continuous and non-negative rv X with pdf

f (x)

as the following:

J (X) = - \frac{1}{2} \int_{0}^{+ \infty} f^{2} (x) d x .

(2)

It is evident from Equation (2) that J(X) < 0, and hence extropy, in contrast to entropy, is always negative. The research by [3] notes that a binary distribution’s entropy and extropy are equal and as in the case of entropy, the maximum extropy distribution is the uniform distribution. Following on from the work of [3] the study of extropy has increased considerably both from a theoretical and applied point of view. Reference [4] made a comparison with extropy and some existing measures of uncertainty available in the literature and showed that there are situations where extropy can be utilized to deliver more information than those measures of uncertainty. One of the main statistical application of extropy is the total log scoring rule, which is used to score the forecasting distributions. In the application point of view, the use of extropy in automatic speech recognition was given by [5]. One can refer to [6] for the application of extropy in thermodynamics and statistical mechanics. Reference [7] explored some properties of it, involving some characterization results using order statistics and record values. Several works are available in the literature related to the inferential aspect of extropy function based on independent observations. Reference [8] proposed the development of extropy estimators in applications for testing uniformity and also used extropy to compare the uncertainties of two rvs. A study by [9] provided kernel estimation of extropy function under length-biased sampling. More recently, Reference [10] developed non-parametric log kernel estimator of extropy function.

In practice, it seems more realistic to drop the idea of independence, and replace it with some mode of dependence. However, in the case of extropy function, no inferential aspects have been proposed in previous research based on dependent data. With this in mind, the goal of this current research is to propose a non-parametric estimator of extropy function that relies on recursive kernel type estimation based on dependent data. Even various mixing conditions are available in the literature;

α

-mixing is the better mixing condition and has many applications (see, Reference [11]), which motivates us to estimate extropy function under

α

-mixing dependence condition. Unlike other research relating to recursive estimation, this paper develops a detailed analysis on recursive kernel estimation using simulation and time series data.

Let

(Ω, A, P)

be a probability space and

A_{i}^{m}

be the

σ

-algebra of events obtained by the rvs

{X_{j}; i \leq j \leq m}

. The stationary process

{X_{j}}

is said to satisfy the

α

-mixing (strong mixing) condition if

\underset{F \in A_{i + m}^{+ \infty}}{sup_{E \in A_{- \infty}^{m}}} | P (E F) - P (E) P (F) | = α (m) ↓ 0

(3)

as

m \to + \infty

. This implies that as m approaches infinity, then the rvs

X_{i}

and

X_{i + m}

become asymptotically independent. Here, the coefficient

α (m)

is known as the mixing coefficient.

Previous researchers in the field have studied non-parametric estimation for dependent data. These investigations include the non-parametric kernel type estimation of past extropy, and residual extropy, under

α

-mixing dependence condition (see, References [12,13]). Reference [14] developed non-parametric kernel type estimators for Mathai-Haubold entropy and its residual version based on

α

-mixing dependent data. Furthermore, a study by Reference [15] explores the recursive and non-recursive kernel estimation of negative cumulative residual extropy under

α

-mixing dependence condition.

This current paper adopts the following structure: Section 2 explores non-parametric recursive kernel estimator for the extropy function, while Section 3 presents the asymptotic properties of the proposed estimator. Section 4 outlines a simulation study in order to illustrate the performance of the proposed estimator. Section 5 discusses how the application of the estimator to real-life data is implemented. Finally, Section 6 provides a conclusion with some future aspects.

2. Estimation of Extropy Function

In this section, the idea for the development of a non-parametric recursive kernel estimator for the extropy function will be proposed. The main feature of recursive density estimators in comparison with non-recursive estimators is that they can be updated with each additional observation. In the case of non-recursive estimators, they must be entirely recomputed.

Let

{X_{i}; 1 \leq i \leq n}

be a sequence of identically distributed rvs representing the life-times for n components. Here, the life-times are assumed to be

α

-mixing. The assumptions are made in the study by [16]. Reference [17] is used for deriving the asymptotic properties of the estimator and for the purpose of comparison, the definition of a simple non-parametric estimator of

J (X)

can be given as follows:

J_{n}^{*} (X) = \frac{- 1}{2} \{\frac{1}{n} \sum_{i = 1}^{n} f_{n}^{2} (X_{i})\},

(4)

where

f_{n} (X_{i}) = \frac{1}{(n - 1)} \sum_{j \neq i = 1}^{n} \frac{1}{ψ_{j}} κ (\frac{X_{i} - X_{j}}{ψ_{j}}) .

(5)

This represents the kernel estimator obtained from the sample without

X_{i}

(see, Reference [18]).

Following on from this, the non-parametric recursive kernel estimator for

J (X)

is:

{\hat{J}}_{n} (X) = \frac{- 1}{2} \{\int_{0}^{+ \infty} {\hat{f}}_{n}^{2} (x) d x\},

(6)

where

{\hat{f}}_{n} (x)

is a non-parametric estimator of density function.

However, the mostly used non-parametric recursive estimator of

f (x)

is the kernel estimator (see, Reference [19]), which is given as:

{\hat{f}}_{n} (x) = \frac{1}{n} \sum_{j = 1}^{n} \frac{1}{ψ_{j}} κ (\frac{x - X_{j}}{ψ_{j}}),

(7)

where

κ (x)

satisfies the conditions:

κ (x)

is bounded, non-negative, symmetric,

κ_{i} (x) = (1 / ψ_{i}) κ (x / ψ_{i})

,

\int_{- \infty}^{+ \infty} κ (x) d x = 1

,

\int_{- \infty}^{+ \infty} x κ (x) d x = 0

and

{ψ_{n}}

is a sequence of positive bandwidths such that

ψ_{n} \to 0

and

n ψ_{n} \to + \infty

as

n \to + \infty

,

lim_{n \to \infty} (1 / n) \sum_{j = 1}^{n} {(ψ_{j} / ψ_{n})}^{l} = β_{l} < + \infty, l = 1, 2, \dots, s + 1

and

lim_{n \to + \infty} (1 / n) \sum_{j = 1}^{n} {(ψ_{n} / ψ_{j})}^{l} = θ_{l} < + \infty, 1 \leq l < 2

.

Under

α

-mixing dependence condition, the expressions for the bias and variance of

{\hat{f}}_{n} (x)

are outlined by (see, Reference [16]):

Bias \overset{}{} ({\hat{f}}_{n} (x)) ⋍ \frac{ψ_{n}^{s} ζ_{s}}{s!} f^{(s)} (x) β_{s}

(8)

and

Var \overset{}{} ({\hat{f}}_{n} (x)) ⋍ \frac{θ_{1} f (x)}{n ψ_{n}} ζ_{κ},

(9)

where

ζ_{s} = \int_{- \infty}^{+ \infty} u^{s} κ (u) d u

,

f^{(s)} (x)

is the

s^{t h}

derivative of the pdf and

ζ_{κ} = \int_{- \infty}^{+ \infty} κ^{2} (u) d u

.

3. Recursive Property and Asymptotic Results

This section supports that the proposed estimator

{\hat{J}}_{n} (X)

exhibits a recursive property and the asymptotic results of the corresponding estimator are shown here.

Theorem 1.

Let

{\hat{J}}_{n} (X)

be a non-parametric estimator of

J (X)

as defined in Equation (6). Then, it meets the recursive property:

\begin{matrix} {\hat{J}}_{n} (X) = \frac{{(n - 1)}^{2}}{n^{2}} {\hat{J}}_{n - 1} (X) - \frac{1}{n^{2} ψ_{n}} \{\int_{0}^{+ \infty} κ (\frac{x - X_{n}}{ψ_{n}}) ((n - 1) {\hat{f}}_{n - 1} (x) \\ + \frac{1}{2 ψ_{n}} κ (\frac{x - X_{n}}{ψ_{n}})) d x\} . \end{matrix}

(10)

Proof of Theorem 1.

We have,

{\hat{J}}_{n} (X) = \frac{- 1}{2} \int_{0}^{+ \infty} {\hat{f}}_{n}^{2} (x) d x,

(11)

{\hat{J}}_{n - 1} (X) = \frac{- 1}{2} \int_{0}^{+ \infty} {\hat{f}}_{n - 1}^{2} (x) d x,

(12)

and

f_{n} (x) = \frac{n - 1}{n} {\hat{f}}_{n - 1} (x) + \frac{1}{n ψ_{n}} κ (\frac{x - X_{n}}{ψ_{n}}) .

(13)

here,

\begin{matrix} - 2 {\hat{J}}_{n} (X) & = \int_{0}^{+ \infty} {\hat{f}}_{n}^{2} (x) d x, \\ = \int_{0}^{+ \infty} {[\frac{n - 1}{n} {\hat{f}}_{n - 1} (x) + \frac{1}{n ψ_{n}} κ (\frac{x - X_{n}}{ψ_{n}})]}^{2} d x . \end{matrix}

(14)

then, we get

\begin{matrix} - 2 {\hat{J}}_{n} (X) & = \frac{{(n - 1)}^{2}}{n^{2}} \int_{0}^{+ \infty} {\hat{f}}_{n - 1}^{2} (x) d x + \frac{1}{n^{2} ψ_{n}^{2}} \int_{0}^{+ \infty} κ^{2} (\frac{x - X_{n}}{ψ_{n}}) d x \\ + \frac{2 (n - 1)}{n^{2} ψ_{n}} \int_{0}^{+ \infty} {\hat{f}}_{n - 1} (x) κ (\frac{x - X_{n}}{ψ_{n}}) d x \\ = \frac{{(n - 1)}^{2}}{n^{2}} (- 2 {\hat{J}}_{n - 1} (X)) + \frac{1}{{(n ψ_{n})}^{2}} \int_{0}^{+ \infty} κ^{2} (\frac{x - X_{n}}{ψ_{n}}) d x \\ + \frac{2 (n - 1)}{n^{2} ψ_{n}} \int_{0}^{+ \infty} {\hat{f}}_{n - 1} (x) κ (\frac{x - X_{n}}{ψ_{n}}) d x . \end{matrix}

(15)

The result of rearranging the equation above is Equation (10). Hence, the theorem is proved. □

Theorem 2.

Let

κ (x)

be a kernel of order s and let

\{ψ_{n}\}

be a sequence of numbers that satisfies the conditions given in Section 2. Then,

{\hat{J}}_{n} (X)

is a consistent estimator of

J (X)

, that is

{\hat{J}}_{n} (X) \overset{p}{\to} J (X) .

Proof of Theorem 2.

By using Taylor’s series expansion, we have

\int_{0}^{+ \infty} {\hat{f}}_{n}^{2} (x) d x ⋍ \int_{0}^{+ \infty} f^{2} (x) d x + 2 \int_{0}^{+ \infty} ({\hat{f}}_{n} (x) - f (x)) f (x) d x .

(16)

Hence, the expressions for the bias, variance and mean-squared error (

M S E

) of

\int_{0}^{+ \infty} {\hat{f}}_{n}^{2} (x) d x

are, respectively,

\begin{matrix} Bias (\int_{0}^{+ \infty} {\hat{f}}_{n}^{2} (x) d x) & ⋍ 2 \int_{0}^{+ \infty} Bias ({\hat{f}}_{n} (x)) f (x) d x, \\ = \frac{2 ψ_{n}^{s} β_{s} ζ_{s}}{s!} \int_{0}^{+ \infty} f^{(s)} (x) f (x) d x \end{matrix}

(17)

and

\begin{matrix} Var (\int_{0}^{+ \infty} {\hat{f}}_{n}^{2} (x) d x) & ⋍ 4 \int_{0}^{+ \infty} Var ({\hat{f}}_{n} (x)) f^{2} (x) d x \\ = \frac{4 θ_{1}}{n ψ_{n}} ζ_{κ} \int_{0}^{+ \infty} f^{3} (x) d x . \end{matrix}

(18)

\begin{matrix} M S E (\int_{0}^{+ \infty} {\hat{f}}_{n}^{2} (x) d x) & ⋍ {(\frac{2 ψ_{n}^{s} β_{s} ζ_{s}}{s!} \int_{0}^{+ \infty} f^{(s)} (x) f (x) d x)}^{2} + \frac{4 θ_{1}}{n ψ_{n}} ζ_{κ} \int_{0}^{+ \infty} f^{3} (x) d x . \end{matrix}

(19)

From Equation (19), as

n \to + \infty

,

M S E (\int_{0}^{+ \infty} {\hat{f}}_{n}^{2} (x) d x) \to 0

.

Therefore,

{\hat{J}}_{n} (X) \overset{p}{\to} J (X) .

Thus, the theorem is proved. □

Remark 1.

The bias and variance of the estimator

{\hat{J}}_{n} (X)

is obtained as

B i a s ({\hat{J}}_{n} (X)) ⋍ \frac{ψ_{n}^{s} β_{s} ζ_{s}}{s!} \int_{0}^{+ \infty} f^{(s)} (x) f (x) d x

(20)

and

V a r ({\hat{J}}_{n} (X)) ⋍ \frac{θ_{1}}{n ψ_{n}} ζ_{κ} \int_{0}^{+ \infty} f^{3} (x) d x .

(21)

Theorem 3.

Suppose

{\hat{J}}_{n} (X)

is a non-parametric estimator of

J (X)

as defined in Equation (6). Then,

{\hat{J}}_{n} (X)

is integratedly consistent in quadratic mean estimator of

J (X)

.

Proof of Theorem 3.

(Proof of Theorem 3). Consider the mean integrated square error of

{\hat{J}}_{n} (X)

denoted as

Δ ({\hat{J}}_{n} (X))

. Then,

\begin{matrix} Δ ({\hat{J}}_{n} (X)) & = E [\int_{- \infty}^{+ \infty} {({\hat{J}}_{n} (X) - J (X))}^{2}] d x \\ = \int_{- \infty}^{+ \infty} [Var ({\hat{J}}_{n} (X)) + Bias {({\hat{J}}_{n} (X))}^{2}] d x \\ = \int_{- \infty}^{+ \infty} M S E ({\hat{J}}_{n} (X)) d x . \end{matrix}

(22)

By using Equations (20) and (21), we get

M S E ({\hat{J}}_{n} (X)) ⋍ {(\frac{ψ_{n}^{s} β_{s} ζ_{s}}{s!} \int_{0}^{+ \infty} f^{(s)} (x) f (x) d x)}^{2} + \frac{θ_{1}}{n ψ_{n}} ζ_{κ} \int_{0}^{+ \infty} f^{3} (x) d x .

(23)

From Equation (23), as

n \to + \infty

M S E ({\hat{J}}_{n} (X)) \to 0,

and from Equation (22), we get

Δ ({\hat{J}}_{n} (X)) \to 0, n \to + \infty .

(24)

Hence, the theorem is proved using Equation (24), (see Reference [20]) . □

Theorem 4.

Suppose

{\hat{J}}_{n} (X)

is a non-parametric estimator of

J (X)

as defined in Equation (6), satisfying the assumptions given in Section 2, then

{(n ψ_{n})}^{\frac{1}{2}} \{\frac{{\hat{J}}_{n} (X)}{ϕ_{J}} - \frac{J (X)}{ϕ_{J}}\}

(25)

has a standard normal distribution (N(0,1)) as

n \to + \infty

with

ϕ_{J}^{2} ⋍ θ_{1} ζ_{κ} \int_{0}^{+ \infty} f^{3} (x) d x .

(26)

Proof of Theorem 4.

We have

{(n ψ_{n})}^{\frac{1}{2}} ({\hat{J}}_{n} (X) - J (X)) ⋍ - {(n ψ_{n})}^{\frac{1}{2}} \int_{0}^{+ \infty} ({\hat{f}}_{n} (x) - f (x)) f (x) d x .

(27)

By using the asymptotic normality of

{\hat{f}}_{n} (x)

given in Reference [16], we can conclude that

{(n ψ_{n})}^{\frac{1}{2}} \{\frac{{\hat{J}}_{n} (X) - J (X)}{ϕ_{J}}\}

has N(0,1) as

n \to + \infty

with

ϕ_{J}^{2}

given in Equation (26). Hence, the theorem results. □

4. Monte Carlo Simulation

A simulation study is carried out to compare the kernel estimator

{\hat{J}}_{n} (X)

and

J_{n}^{*} (X)

in terms of the

M S E

. In the first case,

{X_{i}}

is generated from the exponential AR(1) process with correlation coefficient

ρ

= 0.2 and parameter

λ = 1.5

. The Gaussian kernel is used as the kernel function for the estimation. The estimated value, bias and

M S E

of

{\hat{J}}_{n} (X)

and

J_{n}^{*} (X)

for various sample sizes (n) 10, 20, 50, 100, 150, 200, 250, 300, 350, 400, 450, 500 and these values are calculated in the case of the exponential AR(1) process, as shown in Table 1 and Table 2.

Table 1. Exponential AR(1), Estimated value (E), Bias and

M S E

of

{\hat{J}}_{n} (X)

with

J (X)

= 0.375.

Table 2. Exponential AR(1), E, Bias and MSE of

J_{n}^{*} (X)

with

J (X)

= 0.375.

From Table 1 and Table 2, it can be seen that the proposed estimate

{\hat{J}}_{n} (X)

for the extropy function performs better than the estimate

J_{n}^{*} (X)

based on the obtained MSE.

Furthermore, Gaussian distribution with parameters

μ

= 5 and

σ

= 3 is considered, from which

{X_{i}}

is generated for constructing the Gaussian AR(1) process with correlation coefficient

ρ

= 0.5. The estimation is conducted using the Gaussian kernel function. The bias and MSE of

{\hat{J}}_{n} (X)

and

J_{n}^{*} (X)

for various sample sizes 10, 20, 50, 100, 150, 200, 250, 300, 350, 400, 450 and 500 are calculated for the Gaussian AR(1) process, as shown in Table 3 and Table 4.

Table 3. Gaussian AR(1), E, Bias and MSE of

{\hat{J}}_{n} (X)

with

J (X)

= 0.0399.

Table 4. Gaussian AR(1), E, Bias and MSE of

J_{n}^{*} (X)

with

J (X)

= 0.0399.

Table 3 and Table 4 show that the proposed estimate

{\hat{J}}_{n} (X)

for the extropy function is better than the estimate

J_{n}^{*} (X)

based on the MSE. All simulations were executed using R programming language with standard computation time. Moreover, R-code to find

{\hat{J}}_{n} (X)

by simulation for size n = 50 is given in the Appendix A.

5. Data Analysis

5.1. Application 1

For the first application, time series data relating to International Airline Passengers: Monthly Totals from January 1949 to December 1960 (comprising thousands of passengers) are considered for analysis. This time series data are also used in a study by [21]. The time series plot, sample auto correlation function (ACF) and partial autocorrelation function (PACF) plot of the data are shown in Figure 1, Figure 2 and Figure 3.

Figure 1. Time series plot of monthly totals of International Airline Passengers data.

Figure 2. ACF plot of monthly totals of International Airline Passengers data.

Figure 3. PACF plot of the data monthly totals of International Airline Passengers.

From Figure 1, Figure 2 and Figure 3, the data show random components, seasonality, and trends. The time series data are decomposed in order to arrive at estimates of trends, seasonal and random components, using the moving average method. Figure 4 shows the decomposition of the time series data into trends, seasonal and random components.

Figure 4. Decomposition plot of the data.

After removing the seasonality and trend components, a time series plot of the data, with random components is produced, as shown in Figure 5.

Figure 5. Time series plot of random component.

The AR(1) model is fitted to the data with a correlation coefficient of

ϕ

= 0.4069 and an intercept = 0.9981. The ACF plot of residuals of the fitted AR(1) model is shown in Figure 6.

Figure 6. Sample ACF of residuals of fitted AR(1) model.

Gaussian distribution with the parameters

μ

and

σ

is fitted to the data. And the acquired Kolmogorov-Smirnov (KS) statistic value is 0.083416, with corresponding p-value = 0.3173. This shows that Gaussian distribution is an appropriate fit for the data. The maximum likelihood estimates of the parameter are

\hat{μ}

= 0.9982 and

\hat{σ}

= 0.0332.

Using the maximum likelihood estimates, the estimate of extropy is

- 3.873615

. The extropy values are estimated using the proposed estimates given in Equations (6) and (4). The corresponding estimates are

{\hat{J}}_{n} (X) = - 3.62753

and

J_{n}^{*} (X) = - 42.45683

. Thus, it becomes clear that the proposed estimate

{\hat{J}}_{n} (X)

sits closer to the value

- 3.873615

and the estimate

J_{n}^{*} (X)

is far removed from the same value. As a result, the estimate

{\hat{J}}_{n} (X)

is considered better than

J_{n}^{*} (X)

as a fit for the extropy data.

5.2. Application 2

For this application, the time series data ‘The Failure of Computer Patterns’ used in [22] is considered. These data comprise 257 observations, where the quantities relate to successive times-to-failures. The time series plot of the given data for this application is shown in Figure 7.

Figure 7. Time series plot of the data “Failure of computer patterns”.

From an analysis of this time series plot, it becomes clear that data are non-stationary. Hence, the data are made stationary by calculating the difference. The time series plot of the stationary data is shown in Figure 8.

Figure 8. Time series plot of stationary data.

For the given observations, eight outliers are present, and these are removed from the data. The time series plot, ACF and PACF of the remaining observations are shown in Figure 9, Figure 10 and Figure 11.

Figure 9. Time series plot of computer failure data patterns without outliers.

Figure 10. ACF plot of computer failure data patterns without outliers.

Figure 11. PACF plot of computer failure data patterns without outliers.

The AR(1) model is fitted to the data and the correlation coefficient

ϕ = 0.5854

is obtained. Also, exponential distribution with the rate parameter

λ

is fitted to the data and we obtain KS =

0.07501

and p-value = 0.1121. Hence, it is apparent that the exponential distribution is a satisfactory fit to the data. The maximum likelihood estimate for the data is

\hat{λ} = 0.00327

and the corresponding estimate for extropy is −0.0008186.

The estimates of extropy using kernel estimation are

{\hat{J}}_{n} (X) = - 0.0008317

and

J_{n}^{*} (X) = - 0.000002574

. Thus, it becomes clear that the proposed estimate is closer to the estimate of extropy using the maximum likelihood estimation method, rather than the estimate of

J_{n}^{*} (X)

. Thus,

{\hat{J}}_{n} (X)

is seen to be a better estimator than

J_{n}^{*} (X)

when fitting this data.

From the two applications discussed above, it is clear that the proposed estimator performed well in real-life scenario applications.

6. Conclusions and Future Works

This paper has explored the non-parametric estimators for extropy function using kernel type estimation, where observations under consideration are

α

-mixing dependent. Certain asymptotic properties of the proposed estimator are investigated and proved. Furthermore, a simulation study has been conducted to compare the performance of the estimates, and the suitability of the estimator is shown using real-life data applications. It can be concluded that the proposed estimator

{\hat{J}}_{n} (X)

is superior to

J_{n}^{*} (X)

.

Several generalizations of extropy function, such as weighted extropy and Tsallis extropy are proposed in the literature. Estimation of those generalizations based on dependent observations can be considered as one of the future works. In addition to the

α

-mixing dependence condition one can develop inferential aspects of extropy and its various generalizations based on

ϕ

-mixing and

ρ

-mixing dependence condition.

Author Contributions

Conceptualization, R.M., M.R.I. and H.B.; methodology, R.M., M.R.I. and A.K.; writing—original draft preparation, A.K.; review and editing, M.R.I., H. B. and N.Q.; validation, N.Q.; software, A.K. and N.Q.; visualization, A.K. and N.Q.; funding acquisition, N.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2023R376), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

The authors gratefully acknowledge Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2023R376), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia on the financial support for this project.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

R-code to find

{\hat{J}}_{n} (X)

by simulation for size n = 50

rm(list=ls())

library(MASS)

intt=function(x){

fn=h=rep()

for(i in 1:n)

{

h[i]=1/i^(0.5)

fn[i]=exp((-0.5/h[i]^2)*(x-X[i])^2)/(sqrt(2*pi)*h[i])

}

return((sum(fn)/n)^2)

}

n=50

for(s in 1:1000){

Jn[s]=(-1/2)*(integrate(Vectorize(intt),0,Inf)$value)

}

Jncap=mean(Jn)

References

Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
Cover, T.M.; Thomas, J.A. Elements of Information Theory, 2nd ed.; Wiley Series in Telecommunications and Signal Processing; John Wiley & Sons: Hoboken, NJ, USA, 2006. [Google Scholar]
Frank, L.; Sanfilippo, G.; Agro, G. Extropy complementary dual of entropy. Stat. Sci. 2015, 30, 40–58. [Google Scholar]
Jose, J.; Abdul Sathar, E.I. Extropy for past life based on classical records. J. Indian Soc. Probab. Stat. 2021, 22, 27–46. [Google Scholar] [CrossRef]
Becerra, A.; de la Rosa, J.I.; Gonzàlez, E.; Pedroza, A.D.; Escalante, N.I. Training deep neural networks with non-uniform frame-level cost function for automatic speech recognition. Multimed. Tools Appl. 2018, 77, 27231–27267. [Google Scholar] [CrossRef]
Martinas, K.; Frankowicz, M. Extropy-reformulation of the Entropy Principle. Period. Polytech. Chem. Eng. 2000, 44, 29–38. [Google Scholar]
Qiu, G. The extropy of order statistics and record values. Stat. Probab. Lett. 2017, 120, 52–60. [Google Scholar] [CrossRef]
Qiu, G.; Jia, K. Extropy estimators with applications in testing uniformity. J. Nonparametric Stat. 2018, 30, 182–196. [Google Scholar] [CrossRef]
Rajesh, R.; Rajesh, G.; Sunoj, S. Kernel estimation of extropy function under length-biased sampling. Stat. Probab. Lett. 2022, 181, 109290. [Google Scholar] [CrossRef]
Irshad, M.R.; Maya, R. Non-parametric log kernel estimation of extropy function. Chil. J. Stat. 2022, 13, 155–163. [Google Scholar]
Rosenblatt, M. A central limit theorem and a strong mixing condition. Proc. Natl. Acad. Sci. USA 1956, 42, 43–47. [Google Scholar] [CrossRef] [PubMed]
Irshad, M.R.; Maya, R. Non-parametric estimation of the past extropy under α-mixing dependence condition. Ric. Mat. 2022, 71, 723–734. [Google Scholar] [CrossRef]
Maya, R.; Irshad, M.R. Kernel estimation of the residual extropy under α-mixing dependence condition. S. Afr. Stat. J. 2019, 53, 65–72. [Google Scholar] [CrossRef]
Maya, R.; Irshad, M.R. Kernel estimation of Mathai-Haubold entropy and residual Mathai-Haubold entropy functions under α-mixing dependence condition. Am. J. Math. Manag. Sci. 2022, 41, 148–159. [Google Scholar] [CrossRef]
Maya, R.; Irshad, M.R.; Archana, K. Recursive and non-recursive kernel estimation of negative cumulative residual extropy under α-mixing dependence condition. Ric. Mat. 2021, 55, 1–21. [Google Scholar] [CrossRef]
Masry, E. Recursive probability density estimation for weakly dependent stationary processes. IEEE Trans. Inf. Theory 1986, 32, 254–267. [Google Scholar] [CrossRef]
Härdle, W.K. Smoothing Techniques: With Implementation in S; Springer Science and Business Media: Berlin/Heidelberg, Germany, 1991. [Google Scholar]
Hall, P.; Morton, S.C. On estimation of entropy. Ann. Inst. Stat. Math. 1993, 45, 69–88. [Google Scholar] [CrossRef]
Wolverton, C.; Wagner, T. Asymptotically optimal discriminant functions for pattern classification. IEEE Trans. Inf. Theory 1969, 15, 258–265. [Google Scholar] [CrossRef]
Wegman, E.J. Nonparametric probability density estimation: I. A summary of available methods. Technometrics 1972, 14, 533–546. [Google Scholar] [CrossRef]
Box, G.E.; Jenkins, G.M.; Reinsel, G.C.; Ljung, G.M. Time Series Analysis: Forecasting and Control; John Wiley and Sons: Hoboken, NJ, USA, 2015. [Google Scholar]
Lewis, P.A.W. A branching poisson process model for the analysis of computer failure patterns. J. R. Stat. Soc. Ser. (Methodol.) 1964, 26, 398–456. [Google Scholar] [CrossRef]

Figure 1. Time series plot of monthly totals of International Airline Passengers data.

Figure 2. ACF plot of monthly totals of International Airline Passengers data.

Figure 3. PACF plot of the data monthly totals of International Airline Passengers.

Figure 4. Decomposition plot of the data.

Figure 5. Time series plot of random component.

Figure 6. Sample ACF of residuals of fitted AR(1) model.

Figure 7. Time series plot of the data “Failure of computer patterns”.

Figure 8. Time series plot of stationary data.

Figure 9. Time series plot of computer failure data patterns without outliers.

Figure 10. ACF plot of computer failure data patterns without outliers.

Figure 11. PACF plot of computer failure data patterns without outliers.

Table 1. Exponential AR(1), Estimated value (E), Bias and

M S E

of

{\hat{J}}_{n} (X)

with

J (X)

= 0.375.

Table 1. Exponential AR(1), Estimated value (E), Bias and

M S E

of

{\hat{J}}_{n} (X)

with

J (X)

= 0.375.

n	E	Bias	MSE
10	−0.1756	0.1994	0.0406
20	−0.2089	0.1661	0.0284
50	−0.2476	0.1274	0.0169
100	−0.2731	0.1019	0.011
150	−0.2870	0.0880	0.0083
200	−0.2971	0.0779	0.0065
250	−0.3037	0.0713	0.0054
300	−0.3081	0.0669	0.0048
350	−0.3119	0.0631	0.0043
400	−0.3155	0.0595	0.0038
450	−0.3189	0.0561	0.0034
500	−0.3211	0.0539	0.0031

Table 2. Exponential AR(1), E, Bias and MSE of

J_{n}^{*} (X)

with

J (X)

= 0.375.

Table 2. Exponential AR(1), E, Bias and MSE of

J_{n}^{*} (X)

with

J (X)

= 0.375.

n	E	Bias	MSE
10	−0.1254	0.2496	0.0657
20	−0.1573	0.2177	0.0509
50	−0.2026	0.1724	0.0331
100	−0.2361	0.1389	0.022
150	−0.2518	0.1232	0.0177
200	−0.2643	0.1107	0.0141
250	−0.2746	0.1004	0.0117
300	−0.2797	0.0953	0.0107
350	−0.2855	0.0895	0.0096
400	−0.2897	0.0853	0.0087
450	−0.2946	0.0804	0.0077
500	−0.2979	0.0771	0.0071

Table 3. Gaussian AR(1), E, Bias and MSE of

{\hat{J}}_{n} (X)

with

J (X)

= 0.0399.

Table 3. Gaussian AR(1), E, Bias and MSE of

{\hat{J}}_{n} (X)

with

J (X)

= 0.0399.

n	E	Bias	MSE
10	−0.0534	−0.0135	0.0015
20	−0.0516	−0.0117	0.0010
50	−0.0461	−0.0063	0.0005
100	−0.0435	−0.0036	0.0003
150	−0.0431	−0.0032	0.0002
200	−0.0427	−0.0028	0.0001
250	−0.0424	−0.0025	0.0001
300	−0.0422	−0.0023	0.0001
350	−0.0413	−0.0014	0.0001
400	−0.0413	−0.0014	0.0001
450	−0.0414	−0.0015	0.0001
500	−0.0416	−0.0017	0.0001

Table 4. Gaussian AR(1), E, Bias and MSE of

J_{n}^{*} (X)

with

J (X)

= 0.0399.

Table 4. Gaussian AR(1), E, Bias and MSE of

J_{n}^{*} (X)

with

J (X)

= 0.0399.

n	E	Bias	MSE
10	−0.0337	0.0062	0.0020
20	−0.0279	0.0120	0.0012
50	−0.0233	0.0166	0.0006
100	−0.0219	0.0180	0.0005
150	−0.0212	0.0187	0.0005
200	−0.0210	0.0189	0.0005
250	−0.0213	0.0186	0.0004
300	−0.0213	0.0186	0.0004
350	−0.0202	0.0196	0.0004
400	−0.0207	0.0192	0.0004
450	−0.0207	0.0192	0.0004
500	−0.0208	0.0191	0.0004

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Kernel Estimation of the Extropy Function under α-Mixing Dependent Data

Abstract

1. Introduction

2. Estimation of Extropy Function

3. Recursive Property and Asymptotic Results

4. Monte Carlo Simulation

5. Data Analysis

5.1. Application 1

5.2. Application 2

6. Conclusions and Future Works

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A

References

Article Metrics

Citations

Article Access Statistics